Patents by Inventor Sudeep Pillai

Sudeep Pillai has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

SYSTEM AND METHOD FOR DETERMINING IF A VEHICLE IS PARKED

Publication number: 20220414388

Abstract: Described herein are systems and methods for determining if a vehicle is parked. In one example, a system includes a processor, a sensor system, and a memory. Both the sensor system and the memory are in communication with the processor. The memory includes a parking determination module having instructions that, when executed by the processor, cause the processor to determine, using a random forest model, when the vehicle is parked based on vehicle estimated features, vehicle learned features, and vehicle taillight features of the vehicle that are based on sensor data from the sensor system.

Type: Application

Filed: June 29, 2021

Publication date: December 29, 2022

Inventors: Chao Fang, Kuan-Hui Lee, Logan Michael Ellis, Jia-En Pan, Kun-Hsin Chen, Sudeep Pillai, Daniele Molinari, Constantin Franziskus Dominik Hubmann, T. Wolfram Burgard
MONOCULAR 3D VEHICLE MODELING AND AUTO-LABELING USING SEMANTIC KEYPOINTS

Publication number: 20220414981

Abstract: A method for 3D object modeling includes linking 2D semantic keypoints of an object within a video stream into a 2D structured object geometry. The method includes inputting, to a neural network, the object to generate a 2D NOCS image and a shape vector, the shape vector being mapped to a continuously traversable coordinate shape. The method includes applying a differentiable shape renderer to the SDF shape and the 2D NOCS image to render a shape of the object corresponding to a 3D object model in the continuously traversable coordinate shape space. The method includes lifting the linked, 2D semantic keypoints of the 2D structured object geometry to a 3D structured object geometry. The method includes geometrically and projectively aligning the 3D object model, the 3D structured object geometry, and the rendered shape to form a rendered object. The method includes generating 3D bounding boxes from the rendered object.

Type: Application

Filed: August 25, 2022

Publication date: December 29, 2022

Applicant: TOYOTA RESEARCH INSTITUTE, INC.

Inventors: Arjun BHARGAVA, Sudeep PILLAI, Kuan-Hui LEE, Kun-Hsin CHEN
Systems and methods for detecting and matching keypoints between different views of a scene

Patent number: 11531892

Abstract: Systems and methods for detecting and matching keypoints between different views of a scene are disclosed herein. One embodiment acquires first and second images; subdivides the first and second images into first and second pluralities of cells, respectively; processes both pluralities of cells using a neural keypoint detection network to identify a first keypoint for a particular cell in the first plurality of cells and a second keypoint for a particular cell in the second plurality of cells, at least one of the first and second keypoints lying in a cell other than the particular cell in the first or second plurality of cells for which it was identified; and classifies the first keypoint and the second keypoint as a matching keypoint pair based, at least in part, on a comparison between a first descriptor associated with the first keypoint and a second descriptor associated with the second keypoint.

Type: Grant

Filed: March 31, 2020

Date of Patent: December 20, 2022

Assignee: Toyota Research Institute, Inc.

Inventors: Jiexiong Tang, Rares A. Ambrus, Vitor Guizilini, Sudeep Pillai, Hanme Kim
End-to-end signalized intersection transition state estimator with scene graphs over semantic keypoints

Patent number: 11514685

Abstract: Systems, methods, computer-readable media, techniques, and methodologies are disclosed for performing end-to-end, learning-based keypoint detection and association. A scene graph of a signalized intersection is constructed from an input image of the intersection. The scene graph includes detected keypoints and linkages identified between the keypoints. The scene graph can be used along with a vehicle's localization information to identify which keypoint that represents a traffic signal is associated with the vehicle's current travel lane. An appropriate vehicle action may then be determined based on a transition state of the traffic signal keypoint and trajectory information for the vehicle. A control signal indicative of this vehicle action may then be output to cause an autonomous vehicle, for example, to implement the appropriate vehicle action.

Type: Grant

Filed: February 17, 2021

Date of Patent: November 29, 2022

Assignee: TOYOTA RESEARCH INSTITUTE, INC.

Inventors: Kun-Hsin Chen, Peiyan Gong, Sudeep Pillai, Arjun Bhargava, Shunsho Kaku, Hai Jin, Kuan-Hui Lee
Systems and methods for generic visual odometry using learned features via neural camera models

Patent number: 11508080

Abstract: Systems and methods for self-supervised learning for visual odometry using camera images captured on a camera, may include: using a key point network to learn a keypoint matrix for a target image and a context image captured by the camera; using the learned descriptors to estimate correspondences between the target image and the context image; based on the keypoint correspondences, lifting a set of 2D keypoints to 3D, using a learned neural camera model; estimating a transformation between the target image and the context image using 3D-2D keypoint correspondences; and projecting the 3D keypoints into the context image using the learned neural camera model.

Type: Grant

Filed: September 15, 2020

Date of Patent: November 22, 2022

Assignee: TOYOTA RESEARCH INSTITUTE, INC.

Inventors: Vitor Guizilini, Igor Vasiljevic, Rares A. Ambrus, Sudeep Pillai, Adrien Gaidon
SYSTEMS AND METHODS FOR SOFT MODEL ASSERTIONS

Publication number: 20220358401

Abstract: Systems and methods are provided for implementing soft model assertions (SMA) system and techniques designed to monitor and improve Machine Learning (ML) model quality by to detecting errors within the one or more ML models. SMA techniques and systems are distinctly designed to leverage: 1) a user's ability to specify features over data; and 2) large, existing datasets of organizations, in a manner that can improve the accuracy and quality of predicting potential errors in Machine Learning (ML) models. A SMA system can include a controller device receiving predictions generated based on the ML models and output from the SMA system. The controller performs autonomous operations of the system in response to determining that the one or more detected errors within the one or more ML models yield a high certainty of errors in the predictions. The SMA system also includes a domain specific language and a severity score module.

Type: Application

Filed: May 10, 2021

Publication date: November 10, 2022

Inventors: Daniel D. KANG, Nikos ARECHIGA GONZALEZ, Sudeep PILLAI
LEARNING MONOCULAR 3D OBJECT DETECTION FROM 2D SEMANTIC KEYPOINT DETECTION

Publication number: 20220343096

Abstract: A method for 3D object detection is described. The method includes detecting semantic keypoints from monocular images of a video stream capturing a 3D object. The method also includes inferring a 3D bounding box of the 3D object corresponding to the detected semantic vehicle keypoints. The method further includes scoring the inferred 3D bounding box of the 3D object. The method also includes detecting the 3D object according to a final 3D bounding box generated based on the scoring of the inferred 3D bounding box.

Type: Application

Filed: April 27, 2021

Publication date: October 27, 2022

Applicant: TOYOTA RESEARCH INSTITUTE, INC.

Inventors: Arjun BHARGAVA, Haofeng CHEN, Adrien David GAIDON, Rares A. AMBRUS, Sudeep PILLAI
SYSTEMS AND METHODS FOR DATASET AND MODEL MANAGEMENT FOR MULTI-MODAL AUTO-LABELING AND ACTIVE LEARNING

Publication number: 20220335258

Abstract: Datasets for autonomous driving systems and multi-modal scenes may be automatically labeled using previously trained models as priors to mitigate the limitations of conventional manual data labeling. Properly versioned models, including model weights and knowledge of the dataset on which the model was previously trained, may be used to run an inference operation on unlabeled data, thus automatically labeling the dataset. The newly labeled dataset may then be used to train new models, including sparse data sets, in a semi-supervised or weakly-supervised fashion.

Type: Application

Filed: April 16, 2021

Publication date: October 20, 2022

Applicant: TOYOTA RESEARCH INSTITUTE, INC.

Inventors: Allan RAVENTOS, Arjun BHARGAVA, Kun-Hsin CHEN, Sudeep PILLAI, Adrien David GAIDON
Auto-labeling of driving logs using analysis-by-synthesis and unsupervised domain adaptation

Patent number: 11475248

Abstract: Acquiring labeled data can be a significant bottleneck in the development of machine learning models that are accurate and efficient enough to enable safety-critical applications, such as automated driving. The process of labeling of driving logs can be automated. Unlabeled real-world driving logs, which include data captured by one or more vehicle sensors, can be automatically labeled to generate one or more labeled real-world driving logs. The automatic labeling can include analysis-by-synthesis on the unlabeled real-world driving logs to generate simulated driving logs, which can include reconstructed driving scenes or portions thereof. The automatic labeling can further include simulation-to-real automatic labeling on the simulated driving logs and the unlabeled real-world driving logs to generate one or more labeled real-world driving logs. The automatically labeled real-world driving logs can be stored in one or more data stores for subsequent training, validation, evaluation, and/or model management.

Type: Grant

Filed: October 30, 2018

Date of Patent: October 18, 2022

Assignee: Toyota Research Institute, Inc.

Inventors: Adrien David Gaidon, James J. Kuffner, Jr., Sudeep Pillai
Monocular 3D vehicle modeling and auto-labeling using semantic keypoints

Patent number: 11475628

Abstract: A method for 3D object modeling includes linking 2D semantic keypoints of an object within a video stream into a 2D structured object geometry. The method includes inputting, to a neural network, the object to generate a 2D NOCS image and a shape vector, the shape vector being mapped to a continuously traversable coordinate shape. The method includes applying a differentiable shape renderer to the SDF shape and the 2D NOCS image to render a shape of the object corresponding to a 3D object model in the continuously traversable coordinate shape space. The method includes lifting the linked, 2D semantic keypoints of the 2D structured object geometry to a 3D structured object geometry. The method includes geometrically and projectively aligning the 3D object model, the 3D structured object geometry, and the rendered shape to form a rendered object. The method includes generating 3D bounding boxes from the rendered object.

Type: Grant

Filed: January 12, 2021

Date of Patent: October 18, 2022

Assignee: TOYOTA RESEARCH INSTITUTE, INC.

Inventors: Arjun Bhargava, Sudeep Pillai, Kuan-Hui Lee, Kun-Hsin Chen
SEMI-SUPERVISED 3D OBJECT TRACKING IN VIDEOS VIA 2D SEMANTIC KEYPOINTS

Publication number: 20220284598

Abstract: A method for 3D object tracking is described. The method includes inferring first 2D semantic keypoints of a 3D object within a sparsely annotated video stream. The method also includes matching the first 2D semantic keypoints of a current frame with second 2D semantic keypoints in a next frame of the sparsely annotated video stream using embedded descriptors within the current frame and the next frame. The method further includes warping the first 2D semantic keypoints to the second 2D semantic keypoints to form warped 2D semantic keypoints in the next frame. The method also includes labeling a 3D bounding box in the next frame according to the warped 2D semantic keypoints in the next frame.

Type: Application

Filed: March 8, 2021

Publication date: September 8, 2022

Applicant: TOYOTA RESEARCH INSTITUTE, INC.

Inventors: Arjun BHARGAVA, Sudeep PILLAI, Kuan-Hui LEE
SYSTEMS AND METHODS FOR VEHICLE LIGHT SIGNAL CLASSIFICATION

Publication number: 20220284222

Abstract: In one embodiment, a vehicle light classification system captures a sequence of images of a scene that includes a front/rear view of a vehicle with front/rear-side lights, determines semantic keypoints, in the images and associated with the front/rear-side lights, based on inputting the images into a first neural network, obtains multiple difference images that are each a difference between successive images from among the sequence of images, the successive images being aligned based on their respective semantic keypoints, and determines a classification of the front/rear-side lights based at least in part on the difference images by inputting the difference images into a second neural network.

Type: Application

Filed: March 4, 2021

Publication date: September 8, 2022

Inventors: Jia-En Pan, Kuan-Hui Lee, Chao Fang, Kun-Hsin Chen, Arjun Bhargava, Sudeep Pillai
Systems and methods for semi-supervised depth estimation according to an arbitrary camera

Patent number: 11436743

Abstract: System, methods, and other embodiments described herein relate to semi-supervised training of a depth model using a neural camera model that is independent of a camera type. In one embodiment, a method includes acquiring training data including at least a pair of training images and depth data associated with the training images. The method includes training the depth model using the training data to generate a self-supervised loss from the pair of training images and a supervised loss from the depth data. Training the depth model includes learning the camera type by generating, using a ray surface model, a ray surface that approximates an image character of the training images as produced by a camera having the camera type. The method includes providing the depth model to infer depths from monocular images in a device.

Type: Grant

Filed: June 19, 2020

Date of Patent: September 6, 2022

Assignee: Toyota Research Institute, Inc.

Inventors: Vitor Guizilini, Igor Vasiljevic, Rares A. Ambrus, Sudeep Pillai, Adrien David Gaidon
END-TO-END SIGNALIZED INTERSECTION TRANSITION STATE ESTIMATOR WITH SCENE GRAPHS OVER SEMANTIC KEYPOINTS

Publication number: 20220261583

Abstract: Systems, methods, computer-readable media, techniques, and methodologies are disclosed for performing end-to-end, learning-based keypoint detection and association. A scene graph of a signalized intersection is constructed from an input image of the intersection. The scene graph includes detected keypoints and linkages identified between the keypoints. The scene graph can be used along with a vehicle's localization information to identify which keypoint that represents a traffic signal is associated with the vehicle's current travel lane. An appropriate vehicle action may then be determined based on a transition state of the traffic signal keypoint and trajectory information for the vehicle. A control signal indicative of this vehicle action may then be output to cause an autonomous vehicle, for example, to implement the appropriate vehicle action.

Type: Application

Filed: February 17, 2021

Publication date: August 18, 2022

Inventors: Kun-Hsin Chen, Peiyan Gong, Sudeep Pillai, Arjun Bhargava, Shunsho Kaku, Hai Jin, Kuan-Hui Lee
END-TO-END MONOCULAR 2D SEMANTIC KEYPOINT DETECTOR AND TRACKER LEARNING

Publication number: 20220245387

Abstract: A method for semantic keypoint detection is described. The method includes linking, using a keypoint graph neural network (KGNN), semantic keypoints of an object within a first image of a video stream into a 2D graph structure corresponding to a category of the object. The method also includes embedding descriptors within the semantic keypoints of the 2D graph structure corresponding to the category of the object. The method further includes tracking the object within subsequent images of the video stream using the embedded descriptors within the semantic keypoints of the 2D graph structure corresponding to the category of the object.

Type: Application

Filed: February 4, 2021

Publication date: August 4, 2022

Applicant: TOYOTA RESEARCH INSTITUTE, INC.

Inventors: Kuan-Hui LEE, Kun-Hsin CHEN, Haofeng CHEN, Arjun BHARGAVA, Sudeep PILLAI
SYSTEMS AND METHODS FOR GENERIC VISUAL ODOMETRY USING LEARNED FEATURES VIA NEURAL CAMERA MODELS

Publication number: 20220245843

Abstract: Systems and methods for self-supervised learning for visual odometry using camera images, may include: estimating correspondences between keypoints of a target camera image and keypoints of a context camera image; based on the keypoint correspondences, lifting a set of 2D keypoints to 3D, using a neural camera model; and projecting the 3D keypoints into the context camera image using the neural camera model. Some embodiments may use the neural camera model to achieve the lifting and projecting of keypoints without a known or calibrated camera model.

Type: Application

Filed: April 17, 2022

Publication date: August 4, 2022

Inventors: VITOR GUIZILINI, Igor Vasiljevic, Rares A. Ambrus, Sudeep Pillai, Adrien Gaidon
MONOCULAR 3D VEHICLE MODELING AND AUTO-LABELING USING SEMANTIC KEYPOINTS

Publication number: 20220222889

Abstract: A method for 3D object modeling includes linking 2D semantic keypoints of an object within a video stream into a 2D structured object geometry. The method includes inputting, to a neural network, the object to generate a 2D NOCS image and a shape vector, the shape vector being mapped to a continuously traversable coordinate shape. The method includes applying a differentiable shape renderer to the SDF shape and the 2D NOCS image to render a shape of the object corresponding to a 3D object model in the continuously traversable coordinate shape space. The method includes lifting the linked, 2D semantic keypoints of the 2D structured object geometry to a 3D structured object geometry. The method includes geometrically and projectively aligning the 3D object model, the 3D structured object geometry, and the rendered shape to form a rendered object. The method includes generating 3D bounding boxes from the rendered object.

Type: Application

Filed: January 12, 2021

Publication date: July 14, 2022

Applicant: TOYOTA RESEARCH INSTITUTE, INC.

Inventors: Arjun BHARGAVA, Sudeep PILLAI, Kuan-Hui LEE, Kun-Hsin CHEN
Systems and methods for weakly supervised training of a model for monocular depth estimation

Patent number: 11386567

Abstract: System, methods, and other embodiments described herein relate to semi-supervised training of a depth model for monocular depth estimation. In one embodiment, a method includes training the depth model according to a first stage that is self-supervised and that includes using first training data that comprises pairs of training images. Respective ones of the pairs including separate frames depicting a scene of a monocular video. The method includes training the depth model according to a second stage that is weakly supervised and that includes using second training data to produce depth maps according to the depth model. The second training data comprising individual images with corresponding sparse depth data. The second training data providing for updating the depth model according to second stage loss values that are based, at least in part, on the depth maps and the depth data.

Type: Grant

Filed: December 3, 2019

Date of Patent: July 12, 2022

Assignee: Toyota Research Institute, Inc.

Inventors: Vitor Guizilini, Sudeep Pillai, Rares A. Ambrus, Jie Li, Adrien David Gaidon
SYSTEMS AND METHODS FOR TRAFFIC LIGHT DETECTION AND CLASSIFICATION

Publication number: 20220198204

Abstract: In one embodiment, a traffic light classification system for a vehicle, includes an image capture device to capture an image of a scene that includes a traffic light with multiple light signals, a processor, and a memory communicably coupled to the processor and storing a first neural network module including instructions that when executed by the processor cause the processor to determine, based on inputting the image into a neural network, a semantic keypoint for each light signal in the traffic light, and determine, based on each semantic keypoint, a classification state of each light signal.

Type: Application

Filed: December 22, 2020

Publication date: June 23, 2022

Inventors: Kun-Hsin Chen, Kuan-Hui Lee, Jia-En Pan, Sudeep Pillai
CAMERA AGNOSTIC DEPTH NETWORK

Publication number: 20220148206

Abstract: A method for monocular depth/pose estimation in a camera agnostic network is described. The method includes projecting lifted 3D points onto an image plane according to a predicted ray vector based on a monocular depth model, a monocular pose model, and a camera center of a camera agnostic network. The method also includes predicting a warped target image from a predicted depth map of the monocular depth model, a ray surface of the predicted ray vector, and a projection of the lifted 3D points according to the camera agnostic network.

Type: Application

Filed: January 21, 2022

Publication date: May 12, 2022

Applicant: TOYOTA RESEARCH INSTITUTE, INC.

Inventors: Vitor GUIZILINI, Sudeep PILLAI, Adrien David GAIDON, Rares A. AMBRUS, Igor VASILJEVIC

prev 1 2 3 4 next