Patents by Inventor Ishani Chakraborty
Ishani Chakraborty has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Publication number: 20240078682
Abstract: Training a multi-object tracking model includes: generating a plurality of training images based at least on scene generation information, each training image comprising a plurality of objects to be tracked; generating, for each training image, original simulated data based at least on the scene generation information, the original simulated data comprising tag data for a first object; locating, within the original simulated data, tag data for the first object, based on at least an anomaly alert (e.g., occlusion alert, proximity alert, motion alert) associated with the first object in the first training image; based at least on locating the tag data for the first object, modifying at least a portion of the tag data for the first object from the original simulated data, thereby generating preprocessed training data from the original simulated data; and training a multi-object tracking model with the preprocessed training data to produce a trained multi-object tracker.
Type: Application
Filed: November 13, 2023
Publication date: March 7, 2024
Inventors: Ishani CHAKRABORTY, Jonathan C. HANZELKA, Lu YUAN, Pedro Urbina ESCOS, Thomas M. SOEMO
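The preprocessing step this abstract describes, locating tag data flagged by an anomaly alert and modifying it before training, can be sketched roughly as follows. This is an illustrative guess at one plausible realization, not the patented implementation; every name here (`preprocess_simulated_data`, the `reliable` flag, the alert dictionaries) is hypothetical.

```python
def preprocess_simulated_data(simulated_data, alerts):
    """Copy the original simulated tag data, marking tags whose object
    appears in an anomaly alert (e.g. occlusion, proximity, motion) so
    the tracker is not trained on unreliable labels."""
    flagged_ids = {alert["object_id"] for alert in alerts}
    preprocessed = []
    for tag in simulated_data:
        tag = dict(tag)  # leave the original simulated data unmodified
        tag["reliable"] = tag["object_id"] not in flagged_ids
        preprocessed.append(tag)
    return preprocessed
```

A training loop could then down-weight or drop tags marked unreliable, which is one way to read "modifying at least a portion of the tag data" before training.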
-
Patent number: 11880985
Abstract: The disclosure herein enables tracking of multiple objects in a real-time video stream. For each individual frame received from the video stream, a frame type of the frame is determined. Based on the individual frame being an object detection frame type, a set of object proposals is detected in the individual frame, associations between the set of object proposals and a set of object tracks are assigned, and statuses of the set of object tracks are updated based on the assigned associations. Based on the individual frame being an object tracking frame type, single-object tracking is performed on the frame based on each object track of the set of object tracks and the set of object tracks is updated based on the performed single-object tracking. For each frame received, a real-time object location data stream is provided based on the set of object tracks.
Type: Grant
Filed: May 28, 2022
Date of Patent: January 23, 2024
Assignee: Microsoft Technology Licensing, LLC
Inventors: Ishani Chakraborty, Yi-Ling Chen, Lu Yuan
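The per-frame dispatch the abstract describes, heavier detection frames interleaved with lighter tracking frames, can be sketched as below. The fixed cadence, the naive proposal-to-track assignment, and all names are assumptions for illustration; the patent does not prescribe any of them.

```python
DETECT_EVERY = 5  # assumed cadence; the abstract fixes no value


def frame_type(frame_index):
    """Classify a frame as a detection frame or a tracking frame."""
    return "detection" if frame_index % DETECT_EVERY == 0 else "tracking"


def process_stream(frames, detect, track_one, tracks):
    """Yield one location snapshot per frame (the 'real-time object
    location data stream'). `detect` and `track_one` are stand-ins for
    an object detector and a single-object tracker."""
    for i, frame in enumerate(frames):
        if frame_type(i) == "detection":
            proposals = detect(frame)
            # assign proposals to tracks (here: naive index pairing;
            # a real system would match by overlap or appearance)
            for track, proposal in zip(tracks, proposals):
                track["bbox"] = proposal
        else:
            for track in tracks:
                track["bbox"] = track_one(frame, track["bbox"])
        yield [dict(track) for track in tracks]
```

Running detection only every few frames while single-object trackers fill the gaps is the standard trade-off this structure encodes: detection is accurate but slow, per-track updates are cheap.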
-
Patent number: 11854211
Abstract: Training a multi-object tracking model includes: generating a plurality of training images based at least on scene generation information, each training image comprising a plurality of objects to be tracked; generating, for each training image, original simulated data based at least on the scene generation information, the original simulated data comprising tag data for a first object; locating, within the original simulated data, tag data for the first object, based on at least an anomaly alert (e.g., occlusion alert, proximity alert, motion alert) associated with the first object in the first training image; based at least on locating the tag data for the first object, modifying at least a portion of the tag data for the first object from the original simulated data, thereby generating preprocessed training data from the original simulated data; and training a multi-object tracking model with the preprocessed training data to produce a trained multi-object tracker.
Type: Grant
Filed: January 26, 2022
Date of Patent: December 26, 2023
Assignee: Microsoft Technology Licensing, LLC
Inventors: Ishani Chakraborty, Jonathan C. Hanzelka, Lu Yuan, Pedro Urbina Escos, Thomas M. Soemo
-
Patent number: 11450019
Abstract: A computing system is configured to train an object classifier. Monocular image data and ground-truth data are received for a scene. Geometric context is determined including a three-dimensional camera position relative to a fixed plane. Regions of interest (RoI) and a set of potential occluders are identified within the image data. For each potential occluder, an occlusion zone is projected onto the fixed plane in three dimensions. A set of occluded RoIs on the fixed plane is generated for each occlusion zone. Each occluded RoI is projected back to the image data in two dimensions. The classifier is trained by minimizing a loss function generated by inputting information regarding the RoIs and the occluded RoIs into the classifier, and by minimizing location errors of each RoI and each occluded RoI of the set on the fixed plane based on the ground-truth data. The trained classifier is then output for object detection.
Type: Grant
Filed: October 22, 2020
Date of Patent: September 20, 2022
Assignee: Microsoft Technology Licensing, LLC
Inventors: Ishani Chakraborty, Gang Hua
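One way to picture the occlusion-zone projection is as a similar-triangles "shadow" on the ground plane: the region behind an occluder that the camera cannot see. The sketch below collapses the geometry to one dimension (distance along the ground from the camera's foot) for clarity; the patented method works with full 3-D camera geometry, and the function names and the 1-D simplification are purely illustrative.

```python
def ground_shadow(occluder_dist, camera_height, occluder_height):
    """Length of the ground-plane strip hidden behind an occluder.
    By similar triangles, the sight line from the camera top over the
    occluder's top meets the ground at distance
    camera_height * occluder_dist / (camera_height - occluder_height),
    so the shadow beyond the occluder has this length."""
    return occluder_dist * occluder_height / (camera_height - occluder_height)


def occluded_rois(roi_positions, occluders, camera_height):
    """Ground positions of RoIs that fall inside some occluder's shadow;
    in the abstract's terms these would be re-projected to the image as
    extra occluded training RoIs. `occluders` is a list of
    (distance, height) pairs."""
    out = []
    for pos in roi_positions:
        for dist, height in occluders:
            if dist < pos <= dist + ground_shadow(dist, camera_height, height):
                out.append(pos)
                break
    return out
```

For a camera 3 m up and a 1 m occluder 2 m away, the shadow is 2 * 1 / (3 - 1) = 1 m, so only RoIs between 2 m and 3 m from the camera count as occluded.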
-
Publication number: 20220292828Abstract: The disclosure herein enables tracking of multiple objects in a real-time video stream. For each individual frame received from the video stream, a frame type of the frame is determined. Based on the individual frame being an object detection frame type, a set of object proposals is detected in the individual frame, associations between the set of object proposals and a set of object tracks are assigned, and statuses of the set of object tracks are updated based on the assigned associations. Based on the individual frame being an object tracking frame type, single-object tracking is performed on the frame based on each object track of the set of object tracks and the set of object tracks is updated based on the performed single-object tracking. For each frame received, a real-time object location data stream is provided based on the set of object tracks.Type: ApplicationFiled: May 28, 2022Publication date: September 15, 2022Inventors: Ishani CHAKRABORTY, Yi-Ling CHEN, Lu YUAN
-
Patent number: 11386662
Abstract: The disclosure herein enables tracking of multiple objects in a real-time video stream. For each individual frame received from the video stream, a frame type of the frame is determined. Based on the individual frame being an object detection frame type, a set of object proposals is detected in the individual frame, associations between the set of object proposals and a set of object tracks are assigned, and statuses of the set of object tracks are updated based on the assigned associations. Based on the individual frame being an object tracking frame type, single-object tracking is performed on the frame based on each object track of the set of object tracks and the set of object tracks is updated based on the performed single-object tracking. For each frame received, a real-time object location data stream is provided based on the set of object tracks.
Type: Grant
Filed: May 28, 2020
Date of Patent: July 12, 2022
Assignee: Microsoft Technology Licensing, LLC
Inventors: Ishani Chakraborty, Yi-Ling Chen, Lu Yuan
-
Patent number: 11335008
Abstract: Training a multi-object tracking model includes: generating a plurality of training images based at least on scene generation information, each training image comprising a plurality of objects to be tracked; generating, for each training image, original simulated data based at least on the scene generation information, the original simulated data comprising tag data for a first object; locating, within the original simulated data, tag data for the first object, based on at least an anomaly alert (e.g., occlusion alert, proximity alert, motion alert) associated with the first object in the first training image; based at least on locating the tag data for the first object, modifying at least a portion of the tag data for the first object from the original simulated data, thereby generating preprocessed training data from the original simulated data; and training a multi-object tracking model with the preprocessed training data to produce a trained multi-object tracker.
Type: Grant
Filed: September 18, 2020
Date of Patent: May 17, 2022
Assignee: Microsoft Technology Licensing, LLC
Inventors: Ishani Chakraborty, Jonathan C. Hanzelka, Lu Yuan, Pedro Urbina Escos, Thomas M. Soemo
-
Publication number: 20220148197Abstract: Training a multi-object tracking model includes: generating a plurality of training images based at least on scene generation information, each training image comprising a plurality of objects to be tracked; generating, for each training image, original simulated data based at least on the scene generation information, the original simulated data comprising tag data for a first object; locating, within the original simulated data, tag data for the first object, based on at least an anomaly alert (e.g., occlusion alert, proximity alert, motion alert) associated with the first object in the first training image; based at least on locating the tag data for the first object, modifying at least a portion of the tag data for the first object from the original simulated data, thereby generating preprocessed training data from the original simulated data; and training a multi-object tracking model with the preprocessed training data to produce a trained multi-object tracker.Type: ApplicationFiled: January 26, 2022Publication date: May 12, 2022Inventors: Ishani CHAKRABORTY, Jonathan C. HANZELKA, Lu YUAN, Pedro Urbina ESCOS, Thomas M. SOEMO
-
Publication number: 20220092792Abstract: Training a multi-object tracking model includes: generating a plurality of training images based at least on scene generation information, each training image comprising a plurality of objects to be tracked; generating, for each training image, original simulated data based at least on the scene generation information, the original simulated data comprising tag data for a first object; locating, within the original simulated data, tag data for the first object, based on at least an anomaly alert (e.g., occlusion alert, proximity alert, motion alert) associated with the first object in the first training image; based at least on locating the tag data for the first object, modifying at least a portion of the tag data for the first object from the original simulated data, thereby generating preprocessed training data from the original simulated data; and training a multi-object tracking model with the preprocessed training data to produce a trained multi-object tracker.Type: ApplicationFiled: September 18, 2020Publication date: March 24, 2022Inventors: Ishani CHAKRABORTY, Jonathan C. HANZELKA, Lu YUAN, Pedro Urbina ESCOS, Thomas M. SOEMO
-
Publication number: 20210374421Abstract: The disclosure herein enables tracking of multiple objects in a real-time video stream. For each individual frame received from the video stream, a frame type of the frame is determined. Based on the individual frame being an object detection frame type, a set of object proposals is detected in the individual frame, associations between the set of object proposals and a set of object tracks are assigned, and statuses of the set of object tracks are updated based on the assigned associations. Based on the individual frame being an object tracking frame type, single-object tracking is performed on the frame based on each object track of the set of object tracks and the set of object tracks is updated based on the performed single-object tracking. For each frame received, a real-time object location data stream is provided based on the set of object tracks.Type: ApplicationFiled: May 28, 2020Publication date: December 2, 2021Inventors: Ishani CHAKRABORTY, Yi-Ling CHEN, Lu YUAN
-
Publication number: 20210035322Abstract: A computing system is configured to train an object classifier. Monocular image data and ground-truth data are received for a scene. Geometric context is determined including a three-dimensional camera position relative to a fixed plane. Regions of interest (RoI) and a set of potential occluders are identified within the image data. For each potential occluder, an occlusion zone is projected onto the fixed plane in three-dimensions. A set of occluded RoIs on the fixed plane are generated for each occlusion zone. Each occluded RoI is projected back to the image data in two-dimensions. The classifier is trained by minimizing a loss function generated by inputting information regarding the RoIs and the occluded RoIs into the classifier, and by minimizing location errors of each RoI and each occluded RoI of the set on the fixed plane based on the ground-truth data. The trained classifier is then output for object detection.Type: ApplicationFiled: October 22, 2020Publication date: February 4, 2021Applicant: Microsoft Technology Licensing, LLCInventors: Ishani CHAKRABORTY, Gang HUA
-
Patent number: 10818028
Abstract: A computing system is configured to train an object classifier. Monocular image data and ground-truth data are received for a scene. Geometric context is determined including a three-dimensional camera position relative to a fixed plane. Regions of interest (RoI) and a set of potential occluders are identified within the image data. For each potential occluder, an occlusion zone is projected onto the fixed plane in three dimensions. A set of occluded RoIs on the fixed plane is generated for each occlusion zone. Each occluded RoI is projected back to the image data in two dimensions. The classifier is trained by minimizing a loss function generated by inputting information regarding the RoIs and the occluded RoIs into the classifier, and by minimizing location errors of each RoI and each occluded RoI of the set on the fixed plane based on the ground-truth data. The trained classifier is then output for object detection.
Type: Grant
Filed: December 17, 2018
Date of Patent: October 27, 2020
Assignee: Microsoft Technology Licensing, LLC
Inventors: Ishani Chakraborty, Gang Hua
-
Publication number: 20200193628Abstract: A computing system is configured to train an object classifier. Monocular image data and ground-truth data are received for a scene. Geometric context is determined including a three-dimensional camera position relative to a fixed plane. Regions of interest (RoI) and a set of potential occluders are identified within the image data. For each potential occluder, an occlusion zone is projected onto the fixed plane in three-dimensions. A set of occluded RoIs on the fixed plane are generated for each occlusion zone. Each occluded Rd is projected back to the image data in two-dimensions. The classifier is trained by minimizing a loss function generated by inputting information regarding the RoIs and the occluded RoIs into the classifier, and by minimizing location errors of each Rd and each occluded Rd of the set on the fixed plane based on the ground-truth data. The trained classifier is then output for object detection.Type: ApplicationFiled: December 17, 2018Publication date: June 18, 2020Applicant: Microsoft Technology Licensing, LLCInventors: Ishani CHAKRABORTY, Gang HUA
-
Patent number: 10679063
Abstract: A computing system for recognizing salient events depicted in a video utilizes learning algorithms to detect audio and visual features of the video. The computing system identifies one or more salient events depicted in the video based on the audio and visual features.
Type: Grant
Filed: September 4, 2015
Date of Patent: June 9, 2020
Assignee: SRI International
Inventors: Hui Cheng, Ajay Divakaran, Elizabeth Shriberg, Harpreet Singh Sawhney, Jingen Liu, Ishani Chakraborty, Omar Javed, David Chisolm, Behjat Siddiquie, Steven S. Weiner
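The combination of audio and visual features the abstract mentions can be illustrated with a toy late-fusion rule, assuming per-segment saliency scores already produced by the learned models; the additive fusion and the threshold are stand-ins chosen for illustration, not the patented method.

```python
def salient_events(audio_scores, visual_scores, threshold=1.0):
    """Return indices of video segments whose combined audio + visual
    saliency score exceeds the threshold (toy late fusion)."""
    return [i for i, (a, v) in enumerate(zip(audio_scores, visual_scores))
            if a + v > threshold]
```

For example, segment scores of (0.2, 0.3), (0.9, 0.8), (0.1, 0.4) would flag only the middle segment as a salient event.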
-
Patent number: 10121076
Abstract: An entity interaction recognition system algorithmically recognizes a variety of different types of entity interactions that may be captured in two-dimensional images. In some embodiments, the system estimates the three-dimensional spatial configuration or arrangement of entities depicted in the image. In some embodiments, the system applies a proxemics-based analysis to determine an interaction type. In some embodiments, the system infers, from a characteristic of an entity detected in an image, an area or entity of interest in the image.
Type: Grant
Filed: May 2, 2016
Date of Patent: November 6, 2018
Assignee: SRI International
Inventors: Ishani Chakraborty, Hui Cheng, Omar Javed
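A proxemics-based analysis, in its simplest reading, maps an estimated 3-D interpersonal distance to Edward Hall's proxemic zones. The thresholds below are Hall's commonly cited values (in meters); treating a single distance cutoff as the whole analysis is a deliberate simplification of the patented system.

```python
def interaction_type(distance_m):
    """Classify an interaction from the estimated 3-D distance between
    two people, using Hall's proxemic zones (thresholds in meters)."""
    if distance_m < 0.45:
        return "intimate"
    if distance_m < 1.2:
        return "personal"
    if distance_m < 3.6:
        return "social"
    return "public"
```

Two people estimated to stand 0.9 m apart would thus be labeled a "personal" interaction, e.g. a conversation.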
-
Publication number: 20180204108Abstract: Automatically training an actor upon the occurrence of a physical condition with respect to that actor. Upon detecting that the actor has the physical condition (e.g., is engaging in or is about to engage in a physical activity), the system determines that training is to be provided for that activity. Upon determining that training is to be provided, the system automatically dispatches training. For instance, the system might cause a human or robot to be dispatched to the actor to show the actor how to perform the activity. Alternatively or instead, a representation of a signal segment may be dispatched to the actor. The representation providing the training to the actor may include a similar target of work to what the actor is presently targeting by the activity. The representation may also include a representation of a person that engaged in the activity properly previously.Type: ApplicationFiled: February 17, 2017Publication date: July 19, 2018Inventors: Vijay Mital, Robin Abraham, Victor Zhu, Liang Du, Ning Zhou, Pramod Kumar Sharma, Ishani Chakraborty
-
Publication number: 20160247023Abstract: An entity interaction recognition system algorithmically recognizes a variety of different types of entity interactions that may be captured in two-dimensional images. In some embodiments, the system estimates the three-dimensional spatial configuration or arrangement of entities depicted in the image. In some embodiments, the system applies a proxemics-based analysis to determine an interaction type. In some embodiments, the system infers, from a characteristic of an entity detected in an image, an area or entity of interest in the image.Type: ApplicationFiled: May 2, 2016Publication date: August 25, 2016Inventors: Ishani CHAKRABORTY, Hui CHENG, Omar JAVED
-
Patent number: 9330296
Abstract: An entity interaction recognition system algorithmically recognizes a variety of different types of entity interactions that may be captured in two-dimensional images. In some embodiments, the system estimates the three-dimensional spatial configuration or arrangement of entities depicted in the image. In some embodiments, the system applies a proxemics-based analysis to determine an interaction type. In some embodiments, the system infers, from a characteristic of an entity detected in an image, an area or entity of interest in the image.
Type: Grant
Filed: September 9, 2013
Date of Patent: May 3, 2016
Assignee: SRI INTERNATIONAL
Inventors: Ishani Chakraborty, Hui Cheng, Omar Javed
-
Patent number: 9268994
Abstract: A unified framework detects and classifies people interactions in unconstrained user generated images. Previous approaches directly map people/face locations in two-dimensional image space into features for classification. Among other things, the disclosed framework estimates a camera viewpoint and people positions in 3D space and then extracts spatial configuration features from explicit three-dimensional people positions.
Type: Grant
Filed: August 15, 2013
Date of Patent: February 23, 2016
Assignee: SRI INTERNATIONAL
Inventors: Ishani Chakraborty, Hui Cheng, Omar Javed
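The contrast this abstract draws, features computed from explicit 3-D positions rather than raw 2-D image locations, can be sketched as pairwise geometric features extracted after people have been placed in 3-D space. The specific features chosen here (distance and horizontal bearing between each pair) are illustrative stand-ins, not the patent's feature set.

```python
import math


def pairwise_features(positions_3d):
    """For each pair of people given as (x, y, z) positions in a common
    3-D frame, return (distance, horizontal bearing) as simple spatial
    configuration features."""
    feats = []
    for i in range(len(positions_3d)):
        for j in range(i + 1, len(positions_3d)):
            p, q = positions_3d[i], positions_3d[j]
            dist = math.dist(p, q)                     # 3-D Euclidean distance
            bearing = math.atan2(q[2] - p[2], q[0] - p[0])  # angle on ground plane
            feats.append((dist, bearing))
    return feats
```

Computed in 3-D, these features are invariant to where the camera stood, which is the advantage the abstract claims over mapping 2-D image locations directly.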
-
Publication number: 20160004911
Abstract: A computing system for recognizing salient events depicted in a video utilizes learning algorithms to detect audio and visual features of the video. The computing system identifies one or more salient events depicted in the video based on the audio and visual features.
Type: Application
Filed: September 4, 2015
Publication date: January 7, 2016
Inventors: Hui Cheng, Ajay Divakaran, Elizabeth Shriberg, Harpreet Singh Sawhney, Jingen Liu, Ishani Chakraborty, Omar Javed, David Chisolm, Behjat Siddiquie, Steven S. Weiner