Patents by Inventor Deva Ramanan
Deva Ramanan has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Publication number: 20260134652Abstract: A fine-tuned model for few-shot object detection is output. A dataset of K-shot classes is created for fine-tuning a pretrained vision language model (VLM). Concept alignment is performed between the dataset of K-shot classes and the VLM. Fine-tuning is performed on the VLM using the dataset of K-shot classes with pseudo-negative federated loss to generate a few-shot object detection (FSOD) model. The FSOD model is output for use in object detection of the K-shot classes in image data received from one or more sensors.Type: ApplicationFiled: November 12, 2024Publication date: May 14, 2026Inventors: Anish Madan, Neehar Peri, Shu Kong, Deva Ramanan, Chaithanya Kumar Mummadi, Filipe Condessa
-
Publication number: 20260070587Abstract: Methods for training a series of neural networks to output driving behavior control parameters is disclosed. The training dataset for the neural networks includes sensor-based vehicle driving recordings that may be categorized by geographical area, by qualitative driving behaviors, or by some combination, such that various training data subsets are used to train the series of neural networks. By learning either city-specific driving behavior control parameters, qualitative driving behavior specific driving behavior control parameters, or both, the resulting parameters may then be provided to a motion planning model for use in modeling predictive control for an autonomous vehicle. Rather than relying on XYZ trajectories of agent vehicles when planning future trajectories of the ego vehicle, the motion planning model is adaptive, due to the use of the learned driving behavior control parameters.Type: ApplicationFiled: September 11, 2024Publication date: March 12, 2026Inventors: Arun Balajee VASUDEVAN, Neehar PERI, Deva RAMANAN, Chaithanya KUMAR MUMMADI, Filipe J. CABRITA CONDESSA
-
Publication number: 20260038205Abstract: Systems and methods for simultaneous map dynamic object reconstruction using LIDAR are disclosed. A method includes generating point cloud data of an environment using a LIDAR system, and generating annotated frames based thereon, the first and second frames corresponding to first and second time points at a particular direction of the LIDAR. Intermediate frames between the first and second annotated frames are generated, and coordinate frame transformations are conducted for objects within the frames to determine respective positions and orientations. First and second optimizations are performed for a mesh of a three-dimensional space and positions/orientations within the space. The dynamic scene is reconstructed based on the optimizations.Type: ApplicationFiled: August 1, 2024Publication date: February 5, 2026Inventors: Nathaniel CHODOSH, Simon LUCEY, Deva RAMANAN, Chaithanya Kumar MUMMADI, Filipe J. CABRITA CONDESSA
-
Patent number: 12491868Abstract: Systems and methods of determining trajectories of an actor in an environment in which a vehicle is operating are provided. The method includes detecting an actor that may move within a scene in the environment by an object detection system of a vehicle in the environment, determining a kinematic history of the actor, and using context of the scene and the kinematic history of the actor to determine a plurality of reference polylines for the actor. The method further includes generating a contextual embedding of the kinematic history of the actor to generate a plurality of predicted trajectories of the actor, in which the generating conditions each of the predicted trajectories to correspond to one of the reference polylines. The method additionally includes using, by the vehicle, the plurality of predicted trajectories to plan movement of the vehicle.Type: GrantFiled: May 5, 2021Date of Patent: December 9, 2025Assignee: Ford Global Technologies, LLCInventors: Siddhesh Shyam Khandelwal, William Junbo Qi, Jagjeet Singh, Andrew T. Hartnett, Deva Ramanan
-
Publication number: 20250285425Abstract: A method of generating data for machine learning (ML) models includes receiving, from one or more sensors, a sequence of samples that includes time-stamp information, extracting from the sequence of samples a snippet of a pre-defined length to generate a training dataset that includes a target sample derived from the sequence of samples, fine-tuning a pre-trained diffusion model to condition based on a context sample associated with the sequence of samples and corresponding time-stamp information, wherein the context sample associated with the sequence of samples is less than all samples of the sequence of samples, and in response to the fine-tuning the pre-trained diffusion model to reach convergence, outputting a final-predicted sample associated with the target sample, wherein the final-predicted sample was not in the sequence of samples.Type: ApplicationFiled: March 6, 2024Publication date: September 11, 2025Inventors: Tarasha KHURANA, Deva RAMANAN, Chaithanya Kumar MUMMADI, Filipe J. CABRITA CONDESSA
-
Patent number: 11794731Abstract: Systems and methods of determining trajectories of an actor in an environment in which a vehicle is operating are provided. The method includes, by an object detection system of a vehicle in an environment, detecting an actor that may move within a scene in the environment. The method further includes using context of the scene to determine a reference polyline for the actor and determining a kinematic history of the actor. The method additionally includes using the kinematic history to predict a waypoint, which is a predicted position of the actor at a conclusion of a waypoint time period, and identifying a segment of the reference polyline, the segment extending from a current location to a point along the reference polyline that is closest to the waypoint and determining a trajectory for the actor conditioned by the segment of the reference polyline.Type: GrantFiled: May 5, 2021Date of Patent: October 24, 2023Assignee: Ford Global Technologies, LLCInventors: Siddhesh Shyam Khandelwal, William Junbo Qi, Jagjeet Singh, Andrew T. Hartnett, Deva Ramanan
-
Patent number: 11704912Abstract: A method is disclosed for evaluating a classifier used to determine a traffic light signal state in images. The method includes, by a computer vision system of a vehicle, receiving at least one image of a traffic signal device of an imminent intersection. The traffic signal device includes a traffic signal face including one or more traffic signal elements. The method includes classifying, by a traffic light classifier (TLC), a classification state of the traffic signal face using labeled images correlated to the received at least one image. The classification state controls an operation of the vehicle at the intersection. The method includes evaluating a performance of the classifying of the classification state generated by the TLC. The evaluation is a label-free performance evaluation based on unlabeled images. The method includes training the TLC based on the evaluated performance.Type: GrantFiled: June 16, 2020Date of Patent: July 18, 2023Assignee: Ford Global Technologies, LLCInventors: Guy Hotson, Richard L. Kwant, Brett Browning, Deva Ramanan
-
Patent number: 11568545Abstract: Various embodiments of a framework which allow, as an alternative to resource-taxing decompression, efficient computation of feature maps using a compressed content data subset, such as video, by exploiting the motion information, such as a motion vector, present in the compressed video. This framework allows frame-specific object recognition and action detection algorithms to be applied to compressed video and other media files by executing only on I-frames in a Group of Pictures and linearly interpolating the results. Training and machine learning increases recognition accuracy. Yielding significant computational gains, this approach accelerates frame-wise feature extraction I-frame/P-frame/P-frame videos as well as I-frame/P-frame/B-frame videos. The present techniques may also be used for segmentation to identify and label respective regions for objects in a video.Type: GrantFiled: December 27, 2019Date of Patent: January 31, 2023Assignee: A9.com, Inc.Inventors: R. Manmatha, Hexiang Hu, Deva Ramanan
-
Publication number: 20220122620Abstract: Systems and methods for siren detection in a vehicle are provided. A method includes recording an audio segment, using a first audio recording device coupled to an autonomous vehicle, separating, using a computing device coupled to the autonomous vehicle, the audio segment into one or more audio clips, generating a spectrogram of the one or more audio clips, and inputting each spectrogram into a Convolutional Neural Network (CNN) run on the computing device. The CNN may be pretrained to detect one or more sirens present in spectrographic data. The method further includes determining, using the CNN, whether a siren is present in the audio segment, and if the siren is determined to be present in the audio segment, determining a course of action of the autonomous vehicle.Type: ApplicationFiled: October 19, 2020Publication date: April 21, 2022Inventors: Olivia Watkins, Nathan Pendleton, Guy Hotson, Chao Fang, Richard L. Kwant, Weihua Gao, Deva Ramanan, Nicolas Cebron, Brett Browning
-
Publication number: 20220048498Abstract: Systems and methods of determining trajectories of an actor in an environment in which a vehicle is operating are provided. The method includes, by an object detection system of a vehicle in an environment, detecting an actor that may move within a scene in the environment. The method further includes using context of the scene to determine a reference polyline for the actor and determining a kinematic history of the actor. The method additionally includes using the kinematic history to predict a waypoint, which is a predicted position of the actor at a conclusion of a waypoint time period, and identifying a segment of the reference polyline, the segment extending from a current location to a point along the reference polyline that is closest to the waypoint and determining a trajectory for the actor conditioned by the segment of the reference polyline.Type: ApplicationFiled: May 5, 2021Publication date: February 17, 2022Inventors: Siddhesh Shyam Khandelwal, William Junbo Qi, Jagjeet Singh, Andrew T. Hartnett, Deva Ramanan
-
Publication number: 20220048503Abstract: Systems and methods of determining trajectories of an actor in an environment in which a vehicle is operating are provided. The method includes detecting an actor that may move within a scene in the environment by an object detection system of a vehicle in the environment, determining a kinematic history of the actor, and using context of the scene and the kinematic history of the actor to determine a plurality of reference polylines for the actor. The method further includes generating a contextual embedding of the kinematic history of the actor to generate a plurality of predicted trajectories of the actor, in which the generating conditions each of the predicted trajectories to correspond to one of the reference polylines. The method additionally includes using, by the vehicle, the plurality of predicted trajectories to plan movement of the vehicle.Type: ApplicationFiled: May 5, 2021Publication date: February 17, 2022Inventors: Siddhesh Shyam Khandelwal, William Junbo Qi, Jagjeet Singh, Andrew T. Hartnett, Deva Ramanan
-
Publication number: 20210390349Abstract: A method is disclosed for evaluating a classifier used to determine a traffic light signal state in images. The method includes, by a computer vision system of a vehicle, receiving at least one image of a traffic signal device of an imminent intersection. The traffic signal device includes a traffic signal face including one or more traffic signal elements. The method includes classifying, by a traffic light classifier (TLC), a classification state of the traffic signal face using labeled images correlated to the received at least one image. The classification state controls an operation of the vehicle at the intersection. The method includes evaluating a performance of the classifying of the classification state generated by the TLC. The evaluation is a label-free performance evaluation based on unlabeled images. The method includes training the TLC based on the evaluated performance.Type: ApplicationFiled: June 16, 2020Publication date: December 16, 2021Inventors: Guy Hotson, Richard L. Kwant, Brett Browning, Deva Ramanan
-
Publication number: 20210342924Abstract: Various embodiments of a framework which allow, as an alternative to resource-taxing decompression, efficient computation of feature maps using a compressed content data subset, such as video, by exploiting the motion information, such as a motion vector, present in the compressed video. This framework allows frame-specific object recognition and action detection algorithms to be applied to compressed video and other media files by executing only on I-frames in a Group of Pictures and linearly interpolating the results. Training and machine learning increases recognition accuracy. Yielding significant computational gains, this approach accelerates frame-wise feature extraction I-frame/P-frame/P-frame videos as well as I-frame/P-frame/B-frame videos. The present techniques may also be used for segmentation to identify and label respective regions for objects in a video.Type: ApplicationFiled: December 27, 2019Publication date: November 4, 2021Inventors: R. Manmatha, Hexiang Hu, Deva Ramanan
-
Publication number: 20200143457Abstract: Various embodiments of a framework which allow, as an alternative to resource-taxing decompression, efficient computation of feature maps using a compressed content data subset, such as video, by exploiting the motion information, such as a motion vector, present in the compressed video. This framework allows frame-specific object recognition and action detection algorithms to be applied to compressed video and other media files by executing only on I-frames in a Group of Pictures and linearly interpolating the results. Training and machine learning increases recognition accuracy. Yielding significant computational gains, this approach accelerates frame-wise feature extraction I-frame/P-frame/P-frame videos as well as I-frame/P-frame/B-frame videos. The present techniques may also be used for segmentation to identify and label respective regions for objects in a video.Type: ApplicationFiled: December 27, 2019Publication date: May 7, 2020Inventors: R. Manmatha, Hexiang Hu, Deva Ramanan
-
Patent number: 10528819Abstract: Various embodiments of a framework which allow, as an alternative to resource-taxing decompression, efficient computation of feature maps using a compressed content data subset, such as video, by exploiting the motion information, such as a motion vector, present in the compressed video. This framework allows frame-specific object recognition and action detection algorithms to be applied to compressed video and other media files by executing only on I-frames in a Group of Pictures and linearly interpolating the results. Training and machine learning increases recognition accuracy. Yielding significant computational gains, this approach accelerates frame-wise feature extraction I-frame/P-frame/P-frame videos as well as I-frame/P-frame/B-frame videos. The present techniques may also be used for segmentation to identify and label respective regions for objects in a video.Type: GrantFiled: November 20, 2017Date of Patent: January 7, 2020Assignee: A9.COM, INC.Inventors: R. Manmatha, Hexiang Hu, Deva Ramanan