Patents by Inventor Ankur Datta

Ankur Datta has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

DETERMINING CAMERA HEIGHT USING DISTRIBUTIONS OF OBJECT HEIGHTS AND OBJECT IMAGE HEIGHTS

Publication number: 20160203611

Abstract: A camera is positioned at a fixed vertical height above a reference plane, with an axis of a camera lens at an acute angle with respect to a perpendicular of the reference plane. One or more processors receive camera images of a multiplicity of people of unknown height, and the vertical axes of the images are transformed into pixel counts. The known heights of people from a known statistical distribution of heights of people are received by one or more processors and transformed to a normalized measurement of pixel counts, based in part on a focal length of the camera lens, the angle of the camera, and an objective function summing differences between pixel counts of the known heights of people and the unknown heights of people. The fixed vertical height of the camera is determined by adjusting the estimated camera height to minimize the objective function.

Type: Application

Filed: March 22, 2016

Publication date: July 14, 2016

Inventors: Ankur Datta, Rogerio S. Feris, Sharathchandra U. Pankanti, Yun Zhai
Recognizing text from frames of image data using contextual information

Patent number: 9355336

Abstract: Disclosed are techniques for recognizing text from one or more frames of image data using contextual information. In some implementations, image data including a captured textual item is processed to identify an entity in the image data. A context can be selected using the entity, where the context corresponds to a dictionary. Text in the captured textual item can be identified using the dictionary. The identified text can be output to a display device.

Type: Grant

Filed: April 23, 2014

Date of Patent: May 31, 2016

Assignee: Amazon Technologies, Inc.

Inventors: Sonjeev Jahagirdar, Matthew Joseph Cole, David Paul Ramos, Utkarsh Prateek, Emilie Noelle McConville, Ankur Datta, Laura Varnum Finney, Yue Liu, Bhavesh Anil Doshi, Avnish Sikka, Michael Vanne
TOPOLOGY DETERMINATION FOR NON-OVERLAPPING CAMERA NETWORK

Publication number: 20160140732

Abstract: Image-matching tracks the movements of the objects from initial camera scenes to ending camera scenes in non-overlapping cameras. Paths are defined through scenes for pairings of initial and ending cameras by different respective scene entry and exit points. For each of said camera pairings a combination path having a highest total number of tracked movements relative to all other combinations of one path through the initial and ending camera scene is chosen, and the scene exit point of the selected path through the initial camera and the scene entry point of the selected path into the ending camera define a path connection of the initial camera scene to the ending camera scene.

Type: Application

Filed: January 22, 2016

Publication date: May 19, 2016

Inventors: Lisa M. Brown, Ankur Datta, Rogerio S. Feris, Sharathchandra Pankanti
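
The selection step in the abstract above can be illustrated with a minimal sketch. It assumes the image-matching stage has already produced one (exit point, entry point) pair per tracked movement between a given pair of cameras; the function name and the point labels are hypothetical and do not come from the patent.

```python
from collections import Counter


def best_path_connection(tracked_moves):
    """Pick the (exit_point, entry_point) combination seen most often.

    tracked_moves: iterable of (exit_point, entry_point) pairs, one per
    object matched from the initial camera scene to the ending camera scene.
    Returns None when no movements were tracked.
    """
    counts = Counter(tracked_moves)
    if not counts:
        return None
    # The combination with the highest number of tracked movements wins.
    (exit_point, entry_point), _ = max(counts.items(), key=lambda kv: kv[1])
    return exit_point, entry_point


# Hypothetical movements between one pair of non-overlapping cameras.
moves = [
    ("east_door", "west_gate"),
    ("east_door", "west_gate"),
    ("north_stairs", "west_gate"),
]
connection = best_path_connection(moves)
```

Here `connection` names the exit point of the initial camera and the entry point of the ending camera that would be linked as the path connection.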
IMAGE RANKING BASED ON ATTRIBUTE CORRELATION

Publication number: 20160124996

Abstract: Images are retrieved and ranked according to relevance to attributes of a multi-attribute query through training image attribute detectors for different attributes annotated in a training dataset. Pair-wise correlations are learned between pairs of the annotated attributes from the training dataset of images. Image datasets are searched via the trained attribute detectors for images comprising attributes in a multi-attribute query. The retrieved images are ranked as a function of comprising attributes that are not within the query subset plurality of attributes but are paired to one of the query subset plurality of attributes by the pair-wise correlations, wherein the ranking is an order of likelihood that the different ones of the attributes will appear in an image with the paired one of the query subset plurality of attributes.

Type: Application

Filed: January 13, 2016

Publication date: May 5, 2016

Inventors: Ankur Datta, Rogerio S. Feris, Sharathchandra U. Pankanti, Behjat Siddiquie
Hierarchical ranking of facial attributes

Patent number: 9330111

Abstract: In response to a query of discernable facial attributes, the locations of distinct and different facial regions are estimated from face image data, each relevant to different attributes. Different features are extracted from the estimated facial regions from database facial images, which are ranked in base layer rankings as a function of relevance of extracted features to attributes relevant to the estimated regions, and in second-layer rankings as a function of combinations of the base layer rankings and relevance of the extracted features to common ones of the attributes relevant to the estimated regions. The images are ranked in relevance to the query as a function of the second-layer rankings.

Type: Grant

Filed: July 20, 2015

Date of Patent: May 3, 2016

Assignee: International Business Machines Corporation

Inventors: Ankur Datta, Rogerio S. Feris, Sharathchandra U. Pankanti, Daniel A. Vaquero
Determining camera height using distributions of object heights and object image heights

Patent number: 9322647

Abstract: A camera is positioned at a fixed vertical height above a reference plane, with an axis of a camera lens at an acute angle with respect to a perpendicular of the reference plane. One or more processors receive images of different people. The vertical measurement values of the images of different people are determined. The one or more processors determine a first statistical measure associated with a statistical distribution of the vertical measurement values. The known heights of people from a known statistical distribution of heights of people are transformed to normalized measurements, based in part on a focal length of the camera lens, the angle of the camera, and a division operator in an objective function of differences between the normalized measurements and the vertical measurement values. The fixed vertical height of the camera is determined, based at least on minimizing the objective function.

Type: Grant

Filed: March 28, 2013

Date of Patent: April 26, 2016

Assignee: International Business Machines Corporation

Inventors: Ankur Datta, Rogerio S. Feris, Sharathchandra U. Pankanti, Yun Zhai
Topology determination for non-overlapping camera network

Patent number: 9280833

Abstract: Image-matching tracks the movements of the objects from initial camera scenes to ending camera scenes in non-overlapping cameras. Paths are defined through scenes for pairings of initial and ending cameras by different respective scene entry and exit points. For each of said camera pairings a combination path having a highest total number of tracked movements relative to all other combinations of one path through the initial and ending camera scene is chosen, and the scene exit point of the selected path through the initial camera and the scene entry point of the selected path into the ending camera define a path connection of the initial camera scene to the ending camera scene.

Type: Grant

Filed: March 5, 2013

Date of Patent: March 8, 2016

Assignee: International Business Machines Corporation

Inventors: Lisa M. Brown, Ankur Datta, Rogerio S. Feris, Sharathchandra U. Pankanti
Image ranking based on attribute correlation

Patent number: 9262445

Abstract: Images are retrieved and ranked according to relevance to attributes of a multi-attribute query through training image attribute detectors for different attributes annotated in a training dataset. Pair-wise correlations are learned between pairs of the annotated attributes from the training dataset of images. Image datasets are searched via the trained attribute detectors for images comprising attributes in a multi-attribute query. The retrieved images are ranked as a function of comprising attributes that are not within the query subset plurality of attributes but are paired to one of the query subset plurality of attributes by the pair-wise correlations, wherein the ranking is an order of likelihood that the different ones of the attributes will appear in an image with the paired one of the query subset plurality of attributes.

Type: Grant

Filed: October 17, 2014

Date of Patent: February 16, 2016

Assignee: International Business Machines Corporation

Inventors: Ankur Datta, Rogerio S. Feris, Sharathchandra U. Pankanti, Behjat Siddiquie
Object retrieval in video data using complementary detectors

Patent number: 9251425

Abstract: Automatic object retrieval from input video is based on learned, complementary detectors created for each of a plurality of different motionlet clusters. The motionlet clusters are partitioned from a dataset of training vehicle images as a function of determining that vehicles within each of the scenes of the images in each cluster share similar two-dimensional motion direction attributes within their scenes. To train the complementary detectors, a first detector is trained on motion blobs of vehicle objects detected and collected within each of the training dataset vehicle images within the motionlet cluster via a background modeling process; a second detector is trained on each of the training dataset vehicle images within the motionlet cluster that have motion blobs of the vehicle objects but are misclassified by the first detector; and the training repeats until all of the training dataset vehicle images have been eliminated as false positives or correctly classified.

Type: Grant

Filed: February 12, 2015

Date of Patent: February 2, 2016

Assignee: International Business Machines Corporation

Inventors: Ankur Datta, Rogerio S. Feris, Sharathchandra U. Pankanti, Yun Zhai
MULTI-CUE OBJECT DETECTION AND ANALYSIS

Publication number: 20160012606

Abstract: Foreground objects of interest are distinguished from a background model by dividing a region of interest of a video data image into a grid array of individual cells. Each of the cells is labeled as foreground if accumulated edge energy within the cell meets an edge energy threshold, or if color intensities for different colors within each cell differ by a color intensity differential threshold, or as a function of combinations of said determinations.

Type: Application

Filed: September 22, 2015

Publication date: January 14, 2016

Inventors: Ankur Datta, Rogerio S. Feris, Sharathchandra U. Pankanti, Xiaoyu Wang
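
A minimal sketch of the per-cell labeling described in the abstract above, assuming each grid cell has already been reduced to an accumulated edge energy and a mean intensity per color channel. The dictionary keys, the thresholds, and the simple OR combination of the two cues are illustrative assumptions, not details taken from the patent.

```python
def label_cells(cells, edge_threshold, color_threshold):
    """Return one boolean per cell: True means the cell is labeled foreground.

    cells: list of dicts with hypothetical keys
        "edge_energy" - accumulated edge energy inside the cell
        "mean_rgb"    - (r, g, b) mean color intensities inside the cell
    """
    labels = []
    for cell in cells:
        # Cue 1: accumulated edge energy meets the edge energy threshold.
        edge_cue = cell["edge_energy"] >= edge_threshold
        # Cue 2: intensities of different colors differ by at least the
        # color intensity differential threshold.
        r, g, b = cell["mean_rgb"]
        color_cue = max(r, g, b) - min(r, g, b) >= color_threshold
        # Assumed combination rule: either cue is enough.
        labels.append(edge_cue or color_cue)
    return labels


# Hypothetical three-cell grid.
cells = [
    {"edge_energy": 12.0, "mean_rgb": (100, 100, 100)},
    {"edge_energy": 1.0, "mean_rgb": (200, 90, 80)},
    {"edge_energy": 0.5, "mean_rgb": (90, 92, 91)},
]
labels = label_cells(cells, edge_threshold=10.0, color_threshold=50)
```

The first cell passes on edge energy, the second on color differential, and the third on neither, so only the first two would be treated as foreground.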
AUTOMATICALLY DETERMINING FIELD OF VIEW OVERLAP AMONG MULTIPLE CAMERAS

Publication number: 20150379729

Abstract: Field of view overlap among multiple cameras is automatically determined as a function of the temporal overlap of object tracks determined within their fields-of-view. Object tracks with the highest similarity value are assigned into pairs, and portions of the assigned object track pairs having a temporally overlapping period of time are determined. Scene entry points are determined from object locations on the tracks at a beginning of the temporally overlapping period of time, and scene exit points from object locations at an ending of the temporally overlapping period of time. Boundary lines for the overlapping fields-of-view portions within the corresponding camera fields-of-view are defined as a function of the determined entry and exit points in their respective fields-of-view.

Type: Application

Filed: September 14, 2015

Publication date: December 31, 2015

Inventors: Ankur Datta, Rogerio S. Feris, Sharathchandra U. Pankanti, Yun Zhai
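
The entry and exit point step in the abstract above can be sketched as follows, assuming two tracks have already been paired by similarity and that each track is a mapping from a shared timestamp to an image location. The data layout and function name are hypothetical.

```python
def overlap_entry_exit(track_a, track_b):
    """Find entry and exit points for the temporally overlapping period.

    track_a, track_b: dicts mapping timestamp -> (x, y) location in each
    camera's own field of view. Returns None if the tracks never overlap
    in time.
    """
    common = sorted(set(track_a) & set(track_b))
    if not common:
        return None
    start, end = common[0], common[-1]
    return {
        # Object locations at the beginning of the overlapping period.
        "entry": (track_a[start], track_b[start]),
        # Object locations at the end of the overlapping period.
        "exit": (track_a[end], track_b[end]),
    }


# Hypothetical paired tracks from two cameras.
track_a = {0: (5, 5), 1: (6, 5), 2: (7, 5), 3: (8, 5)}
track_b = {2: (0, 1), 3: (1, 1), 4: (2, 1)}
points = overlap_entry_exit(track_a, track_b)
```

Collecting these points over many track pairs would give the samples from which boundary lines could be fitted in each field of view; the fitting itself is not shown.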
ESTIMATION OF OBJECT PROPERTIES IN 3D WORLD

Publication number: 20150379768

Abstract: Objects within two-dimensional video data are modeled by three-dimensional models as a function of object type and motion through manually calibrating a two-dimensional image to the three spatial dimensions of a three-dimensional modeling cube. Calibrated three-dimensional locations of an object in motion in the two-dimensional image field of view of a video data input are determined and used to determine a heading direction of the object as a function of the camera calibration and determined movement between the determined three-dimensional locations. The two-dimensional object image is replaced in the video data input with an object-type three-dimensional polygonal model having a projected bounding box that best matches a bounding box of an image blob, the model oriented in the determined heading direction. The bounding box of the replacing model is then scaled to fit the object image blob bounding box, and rendered with extracted image features.

Type: Application

Filed: September 9, 2015

Publication date: December 31, 2015

Inventors: Ankur Datta, Rogerio S. Feris, Yun Zhai
EFFICIENT RETRIEVAL OF ANOMALOUS EVENTS WITH PRIORITY LEARNING

Publication number: 20150379357

Abstract: Local models learned from anomaly detection are used to rank detected anomalies. The local models include image feature values extracted from an image field of video image data with respect to different predefined spatial and temporal local units, wherein anomaly results are determined by failures to fit to applied anomaly detection module local models. Image feature values extracted from the image field local units associated with anomaly results are normalized, and image feature values extracted from the image field local units are clustered. Weights for anomaly results are learned as a function of the relations of the normalized extracted image feature values to the clustered image feature values. The normalized values are multiplied by the learned weights to generate ranking values to rank the anomalies.

Type: Application

Filed: September 4, 2015

Publication date: December 31, 2015

Inventors: Ankur Datta, Balamanohar Paluri, Sharathchandra U. Pankanti, Yun Zhai
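
A minimal sketch of the final ranking step in the abstract above: normalized feature values are multiplied by weights and the products are used to order the anomalies. It assumes the weights have already been learned elsewhere and uses min-max normalization as a stand-in, since the abstract does not say which normalization is applied. All names are hypothetical.

```python
def rank_anomalies(feature_values, weights):
    """Order anomaly ids from highest to lowest ranking value.

    feature_values: dict mapping anomaly id -> raw feature value
    weights:        dict mapping anomaly id -> learned weight (assumed given)
    """
    lo = min(feature_values.values())
    hi = max(feature_values.values())
    span = (hi - lo) or 1.0  # avoid dividing by zero when all values match
    scores = {
        key: ((value - lo) / span) * weights[key]
        for key, value in feature_values.items()
    }
    return sorted(scores, key=scores.get, reverse=True)


# Hypothetical anomalies with raw feature values and learned weights.
values = {"a1": 2.0, "a2": 10.0, "a3": 6.0}
learned = {"a1": 1.0, "a2": 0.5, "a3": 2.0}
ranking = rank_anomalies(values, learned)
```

In this example the weight on `a3` lifts it above `a2` even though `a2` has the larger raw feature value.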
HIERARCHICAL RANKING OF FACIAL ATTRIBUTES

Publication number: 20150324368

Abstract: In response to a query of discernable facial attributes, the locations of distinct and different facial regions are estimated from face image data, each relevant to different attributes. Different features are extracted from the estimated facial regions from database facial images, which are ranked in base layer rankings as a function of relevance of extracted features to attributes relevant to the estimated regions, and in second-layer rankings as a function of combinations of the base layer rankings and relevance of the extracted features to common ones of the attributes relevant to the estimated regions. The images are ranked in relevance to the query as a function of the second-layer rankings.

Type: Application

Filed: July 20, 2015

Publication date: November 12, 2015

Inventors: Ankur Datta, Rogerio S. Feris, Sharathchandra U. Pankanti, Daniel A. Vaquero
Multi-cue object detection and analysis

Patent number: 9171375

Abstract: Foreground objects of interest are distinguished from a background model by dividing a region of interest of a video data image into a grid array of individual cells. Each of the cells is labeled as foreground if accumulated edge energy within the cell meets an edge energy threshold, or if color intensities for different colors within each cell differ by a color intensity differential threshold, or as a function of combinations of said determinations.

Type: Grant

Filed: November 11, 2014

Date of Patent: October 27, 2015

Assignee: International Business Machines Corporation

Inventors: Ankur Datta, Rogerio S. Feris, Sharathchandra U. Pankanti, Xiaoyu Wang
Automatically determining field of view overlap among multiple cameras

Patent number: 9165375

Abstract: Field of view overlap among multiple cameras is automatically determined as a function of the temporal overlap of object tracks determined within their fields-of-view. Object tracks with the highest similarity value are assigned into pairs, and portions of the assigned object track pairs having a temporally overlapping period of time are determined. Scene entry points are determined from object locations on the tracks at a beginning of the temporally overlapping period of time, and scene exit points from object locations at an ending of the temporally overlapping period of time. Boundary lines for the overlapping fields-of-view portions within the corresponding camera fields-of-view are defined as a function of the determined entry and exit points in their respective fields-of-view.

Type: Grant

Filed: November 4, 2014

Date of Patent: October 20, 2015

Assignee: International Business Machines Corporation

Inventors: Ankur Datta, Rogerio S. Feris, Sharathchandra U. Pankanti, Yun Zhai
Estimation of object properties in 3D world

Patent number: 9158972

Abstract: Objects within two-dimensional video data are modeled by three-dimensional models as a function of object type and motion through manually calibrating a two-dimensional image to the three spatial dimensions of a three-dimensional modeling cube. Calibrated three-dimensional locations of an object in motion in the two-dimensional image field of view of a video data input are determined and used to determine a heading direction of the object as a function of the camera calibration and determined movement between the determined three-dimensional locations. The two-dimensional object image is replaced in the video data input with an object-type three-dimensional polygonal model having a projected bounding box that best matches a bounding box of an image blob, the model oriented in the determined heading direction. The bounding box of the replacing model is then scaled to fit the object image blob bounding box, and rendered with extracted image features.

Type: Grant

Filed: September 5, 2014

Date of Patent: October 13, 2015

Assignee: International Business Machines Corporation

Inventors: Ankur Datta, Rogerio S. Feris, Yun Zhai
Efficient retrieval of anomalous events with priority learning

Patent number: 9158976

Abstract: Local models learned from anomaly detection are used to rank detected anomalies. The local models include image feature values extracted from an image field of video image data with respect to different predefined spatial and temporal local units, wherein anomaly results are determined by failures to fit to applied anomaly detection module local models. Image feature values extracted from the image field local units associated with anomaly results are normalized, and image feature values extracted from the image field local units are clustered. Weights for anomaly results are learned as a function of the relations of the normalized extracted image feature values to the clustered image feature values. The normalized values are multiplied by the learned weights to generate ranking values to rank the anomalies.

Type: Grant

Filed: May 18, 2011

Date of Patent: October 13, 2015

Assignee: International Business Machines Corporation

Inventors: Ankur Datta, Balamanohar Paluri, Sharathchandra U. Pankanti, Yun Zhai
Hierarchical ranking of facial attributes

Patent number: 9116925

Abstract: In response to a query of discernable facial attributes, the locations of distinct and different facial regions are estimated from face image data, each relevant to different attributes. Different features are extracted from the estimated facial regions from database facial images, which are ranked in base layer rankings as a function of relevance of extracted features to attributes relevant to the estimated regions, and in second-layer rankings as a function of combinations of the base layer rankings and relevance of the extracted features to common ones of the attributes relevant to the estimated regions. The images are ranked in relevance to the query as a function of the second-layer rankings.

Type: Grant

Filed: January 6, 2014

Date of Patent: August 25, 2015

Assignee: International Business Machines Corporation

Inventors: Ankur Datta, Rogerio S. Feris, Sharathchandra U. Pankanti, Daniel A. Vaquero
Multi-cue object association

Patent number: 9104919

Abstract: Multiple discrete objects within a scene image captured by a single camera track are distinguished as un-labeled from a background model within a first frame of a video data input. Object position, object appearance and/or object size attributes are determined for each of the blobs, and costs determined to assign to existing blobs of existing object tracks as a function of the determined attributes. The un-labeled object blob that has a lowest cost of association with any of the existing object tracks is labeled with the label of that track having the lowest cost, said track is removed from consideration for labeling remaining un-labeled object blobs, and the process iteratively repeated until each of the track labels has been used to label one of the un-labeled blobs.

Type: Grant

Filed: October 6, 2014

Date of Patent: August 11, 2015

Assignee: International Business Machines Corporation

Inventors: Ankur Datta, Rogerio S. Feris, Sharathchandra U. Pankanti, Yun Zhai
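
The iterative labeling in the abstract above reads as a greedy lowest-cost assignment, which can be sketched as follows. The cost function here (position distance plus size difference) and the attribute names are assumptions for illustration; the abstract only says costs are a function of position, appearance and/or size.

```python
import math


def association_cost(blob, track):
    """Hypothetical cost: position distance plus absolute size difference."""
    distance = math.hypot(blob["x"] - track["x"], blob["y"] - track["y"])
    return distance + abs(blob["size"] - track["size"])


def associate(blobs, tracks, cost=association_cost):
    """Greedily label blobs with track labels, lowest cost first.

    blobs:  dict mapping blob id -> attribute dict
    tracks: dict mapping track label -> attribute dict
    Returns a dict mapping blob id -> track label.
    """
    unlabeled = dict(blobs)
    free_tracks = dict(tracks)
    labels = {}
    while unlabeled and free_tracks:
        # Find the blob/track pair with the lowest cost of association.
        blob_id, track_id = min(
            ((b, t) for b in unlabeled for t in free_tracks),
            key=lambda pair: cost(unlabeled[pair[0]], free_tracks[pair[1]]),
        )
        labels[blob_id] = track_id
        # Remove both from further consideration and repeat.
        del unlabeled[blob_id]
        del free_tracks[track_id]
    return labels


# Hypothetical blobs in the current frame and existing tracks.
blobs = {"b1": {"x": 10, "y": 10, "size": 5}, "b2": {"x": 50, "y": 50, "size": 8}}
tracks = {"car": {"x": 49, "y": 50, "size": 8}, "person": {"x": 12, "y": 10, "size": 5}}
assignment = associate(blobs, tracks)
```

A greedy pass like this is simple but not globally optimal; it is shown because it mirrors the remove-and-repeat wording of the abstract.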
