Patents by Inventor Sudeep Pillai
Sudeep Pillai has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Publication number: 20220414388Abstract: Described herein are systems and methods for determining if a vehicle is parked. In one example, a system includes a processor, a sensor system, and a memory. Both the sensor system and the memory are in communication with the processor. The memory includes a parking determination module having instructions that, when executed by the processor, cause the processor to determine, using a random forest model, when the vehicle is parked based on vehicle estimated features, vehicle learned features, and vehicle taillight features of the vehicle that are based on sensor data from the sensor system.Type: ApplicationFiled: June 29, 2021Publication date: December 29, 2022Inventors: Chao Fang, Kuan-Hui Lee, Logan Michael Ellis, Jia-En Pan, Kun-Hsin Chen, Sudeep Pillai, Daniele Molinari, Constantin Franziskus Dominik Hubmann, T. Wolfram Burgard
-
Publication number: 20220414981Abstract: A method for 3D object modeling includes linking 2D semantic keypoints of an object within a video stream into a 2D structured object geometry. The method includes inputting, to a neural network, the object to generate a 2D NOCS image and a shape vector, the shape vector being mapped to a continuously traversable coordinate shape. The method includes applying a differentiable shape renderer to the SDF shape and the 2D NOCS image to render a shape of the object corresponding to a 3D object model in the continuously traversable coordinate shape space. The method includes lifting the linked, 2D semantic keypoints of the 2D structured object geometry to a 3D structured object geometry. The method includes geometrically and projectively aligning the 3D object model, the 3D structured object geometry, and the rendered shape to form a rendered object. The method includes generating 3D bounding boxes from the rendered object.Type: ApplicationFiled: August 25, 2022Publication date: December 29, 2022Applicant: TOYOTA RESEARCH INSTITUTE, INC.Inventors: Arjun BHARGAVA, Sudeep PILLAI, Kuan-Hui LEE, Kun-Hsin CHEN
-
Patent number: 11531892Abstract: Systems and methods for detecting and matching keypoints between different views of a scene are disclosed herein. One embodiment acquires first and second images; subdivides the first and second images into first and second pluralities of cells, respectively; processes both pluralities of cells using a neural keypoint detection network to identify a first keypoint for a particular cell in the first plurality of cells and a second keypoint for a particular cell in the second plurality of cells, at least one of the first and second keypoints lying in a cell other than the particular cell in the first or second plurality of cells for which it was identified; and classifies the first keypoint and the second keypoint as a matching keypoint pair based, at least in part, on a comparison between a first descriptor associated with the first keypoint and a second descriptor associated with the second keypoint.Type: GrantFiled: March 31, 2020Date of Patent: December 20, 2022Assignee: Toyota Research Institute, Inc.Inventors: Jiexiong Tang, Rares A. Ambrus, Vitor Guizilini, Sudeep Pillai, Hanme Kim
-
Patent number: 11514685Abstract: Systems, methods, computer-readable media, techniques, and methodologies are disclosed for performing end-to-end, learning-based keypoint detection and association. A scene graph of a signalized intersection is constructed from an input image of the intersection. The scene graph includes detected keypoints and linkages identified between the keypoints. The scene graph can be used along with a vehicle's localization information to identify which keypoint that represents a traffic signal is associated with the vehicle's current travel lane. An appropriate vehicle action may then be determined based on a transition state of the traffic signal keypoint and trajectory information for the vehicle. A control signal indicative of this vehicle action may then be output to cause an autonomous vehicle, for example, to implement the appropriate vehicle action.Type: GrantFiled: February 17, 2021Date of Patent: November 29, 2022Assignee: TOYOTA RESEARCH INSTITUTE, INC.Inventors: Kun-Hsin Chen, Peiyan Gong, Sudeep Pillai, Arjun Bhargava, Shunsho Kaku, Hai Jin, Kuan-Hui Lee
-
Patent number: 11508080Abstract: Systems and methods for self-supervised learning for visual odometry using camera images captured on a camera, may include: using a key point network to learn a keypoint matrix for a target image and a context image captured by the camera; using the learned descriptors to estimate correspondences between the target image and the context image; based on the keypoint correspondences, lifting a set of 2D keypoints to 3D, using a learned neural camera model; estimating a transformation between the target image and the context image using 3D-2D keypoint correspondences; and projecting the 3D keypoints into the context image using the learned neural camera model.Type: GrantFiled: September 15, 2020Date of Patent: November 22, 2022Assignee: TOYOTA RESEARCH INSTITUTE, INC.Inventors: Vitor Guizilini, Igor Vasiljevic, Rares A. Ambrus, Sudeep Pillai, Adrien Gaidon
-
Publication number: 20220358401Abstract: Systems and methods are provided for implementing soft model assertions (SMA) system and techniques designed to monitor and improve Machine Learning (ML) model quality by to detecting errors within the one or more ML models. SMA techniques and systems are distinctly designed to leverage: 1) a user's ability to specify features over data; and 2) large, existing datasets of organizations, in a manner that can improve the accuracy and quality of predicting potential errors in Machine Learning (ML) models. A SMA system can include a controller device receiving predictions generated based on the ML models and output from the SMA system. The controller performs autonomous operations of the system in response to determining that the one or more detected errors within the one or more ML models yield a high certainty of errors in the predictions. The SMA system also includes a domain specific language and a severity score module.Type: ApplicationFiled: May 10, 2021Publication date: November 10, 2022Inventors: Daniel D. KANG, Nikos ARECHIGA GONZALEZ, Sudeep PILLAI
-
Publication number: 20220343096Abstract: A method for 3D object detection is described. The method includes detecting semantic keypoints from monocular images of a video stream capturing a 3D object. The method also includes inferring a 3D bounding box of the 3D object corresponding to the detected semantic vehicle keypoints. The method further includes scoring the inferred 3D bounding box of the 3D object. The method also includes detecting the 3D object according to a final 3D bounding box generated based on the scoring of the inferred 3D bounding box.Type: ApplicationFiled: April 27, 2021Publication date: October 27, 2022Applicant: TOYOTA RESEARCH INSTITUTE, INC.Inventors: Arjun BHARGAVA, Haofeng CHEN, Adrien David GAIDON, Rares A. AMBRUS, Sudeep PILLAI
-
Publication number: 20220335258Abstract: Datasets for autonomous driving systems and multi-modal scenes may be automatically labeled using previously trained models as priors to mitigate the limitations of conventional manual data labeling. Properly versioned models, including model weights and knowledge of the dataset on which the model was previously trained, may be used to run an inference operation on unlabeled data, thus automatically labeling the dataset. The newly labeled dataset may then be used to train new models, including sparse data sets, in a semi-supervised or weakly-supervised fashion.Type: ApplicationFiled: April 16, 2021Publication date: October 20, 2022Applicant: TOYOTA RESEARCH INSTITUTE, INC.Inventors: Allan RAVENTOS, Arjun BHARGAVA, Kun-Hsin CHEN, Sudeep PILLAI, Adrien David GAIDON
-
Patent number: 11475248Abstract: Acquiring labeled data can be a significant bottleneck in the development of machine learning models that are accurate and efficient enough to enable safety-critical applications, such as automated driving. The process of labeling of driving logs can be automated. Unlabeled real-world driving logs, which include data captured by one or more vehicle sensors, can be automatically labeled to generate one or more labeled real-world driving logs. The automatic labeling can include analysis-by-synthesis on the unlabeled real-world driving logs to generate simulated driving logs, which can include reconstructed driving scenes or portions thereof. The automatic labeling can further include simulation-to-real automatic labeling on the simulated driving logs and the unlabeled real-world driving logs to generate one or more labeled real-world driving logs. The automatically labeled real-world driving logs can be stored in one or more data stores for subsequent training, validation, evaluation, and/or model management.Type: GrantFiled: October 30, 2018Date of Patent: October 18, 2022Assignee: Toyota Research Institute, Inc.Inventors: Adrien David Gaidon, James J. Kuffner, Jr., Sudeep Pillai
-
Patent number: 11475628Abstract: A method for 3D object modeling includes linking 2D semantic keypoints of an object within a video stream into a 2D structured object geometry. The method includes inputting, to a neural network, the object to generate a 2D NOCS image and a shape vector, the shape vector being mapped to a continuously traversable coordinate shape. The method includes applying a differentiable shape renderer to the SDF shape and the 2D NOCS image to render a shape of the object corresponding to a 3D object model in the continuously traversable coordinate shape space. The method includes lifting the linked, 2D semantic keypoints of the 2D structured object geometry to a 3D structured object geometry. The method includes geometrically and projectively aligning the 3D object model, the 3D structured object geometry, and the rendered shape to form a rendered object. The method includes generating 3D bounding boxes from the rendered object.Type: GrantFiled: January 12, 2021Date of Patent: October 18, 2022Assignee: TOYOTA RESEARCH INSTITUTE, INC.Inventors: Arjun Bhargava, Sudeep Pillai, Kuan-Hui Lee, Kun-Hsin Chen
-
Publication number: 20220284598Abstract: A method for 3D object tracking is described. The method includes inferring first 2D semantic keypoints of a 3D object within a sparsely annotated video stream. The method also includes matching the first 2D semantic keypoints of a current frame with second 2D semantic keypoints in a next frame of the sparsely annotated video stream using embedded descriptors within the current frame and the next frame. The method further includes warping the first 2D semantic keypoints to the second 2D semantic keypoints to form warped 2D semantic keypoints in the next frame. The method also includes labeling a 3D bounding box in the next frame according to the warped 2D semantic keypoints in the next frame.Type: ApplicationFiled: March 8, 2021Publication date: September 8, 2022Applicant: TOYOTA RESEARCH INSTITUTE, INC.Inventors: Arjun BHARGAVA, Sudeep PILLAI, Kuan-Hui LEE
-
Publication number: 20220284222Abstract: In one embodiment, a vehicle light classification system captures a sequence of images of a scene that includes a front/rear view of a vehicle with front/rear-side lights, determines semantic keypoints, in the images and associated with the front/rear-side lights, based on inputting the images into a first neural network, obtains multiple difference images that are each a difference between successive images from among the sequence of images, the successive images being aligned based on their respective semantic keypoints, and determines a classification of the front/rear-side lights based at least in part on the difference images by inputting the difference images into a second neural network.Type: ApplicationFiled: March 4, 2021Publication date: September 8, 2022Inventors: Jia-En Pan, Kuan-Hui Lee, Chao Fang, Kun-Hsin Chen, Arjun Bhargava, Sudeep Pillai
-
Patent number: 11436743Abstract: System, methods, and other embodiments described herein relate to semi-supervised training of a depth model using a neural camera model that is independent of a camera type. In one embodiment, a method includes acquiring training data including at least a pair of training images and depth data associated with the training images. The method includes training the depth model using the training data to generate a self-supervised loss from the pair of training images and a supervised loss from the depth data. Training the depth model includes learning the camera type by generating, using a ray surface model, a ray surface that approximates an image character of the training images as produced by a camera having the camera type. The method includes providing the depth model to infer depths from monocular images in a device.Type: GrantFiled: June 19, 2020Date of Patent: September 6, 2022Assignee: Toyota Research Institute, Inc.Inventors: Vitor Guizilini, Igor Vasiljevic, Rares A. Ambrus, Sudeep Pillai, Adrien David Gaidon
-
Publication number: 20220261583Abstract: Systems, methods, computer-readable media, techniques, and methodologies are disclosed for performing end-to-end, learning-based keypoint detection and association. A scene graph of a signalized intersection is constructed from an input image of the intersection. The scene graph includes detected keypoints and linkages identified between the keypoints. The scene graph can be used along with a vehicle's localization information to identify which keypoint that represents a traffic signal is associated with the vehicle's current travel lane. An appropriate vehicle action may then be determined based on a transition state of the traffic signal keypoint and trajectory information for the vehicle. A control signal indicative of this vehicle action may then be output to cause an autonomous vehicle, for example, to implement the appropriate vehicle action.Type: ApplicationFiled: February 17, 2021Publication date: August 18, 2022Inventors: Kun-Hsin Chen, Peiyan Gong, Sudeep Pillai, Arjun Bhargava, Shunsho Kaku, Hai Jin, Kuan-Hui Lee
-
Publication number: 20220245387Abstract: A method for semantic keypoint detection is described. The method includes linking, using a keypoint graph neural network (KGNN), semantic keypoints of an object within a first image of a video stream into a 2D graph structure corresponding to a category of the object. The method also includes embedding descriptors within the semantic keypoints of the 2D graph structure corresponding to the category of the object. The method further includes tracking the object within subsequent images of the video stream using the embedded descriptors within the semantic keypoints of the 2D graph structure corresponding to the category of the object.Type: ApplicationFiled: February 4, 2021Publication date: August 4, 2022Applicant: TOYOTA RESEARCH INSTITUTE, INC.Inventors: Kuan-Hui LEE, Kun-Hsin CHEN, Haofeng CHEN, Arjun BHARGAVA, Sudeep PILLAI
-
Publication number: 20220245843Abstract: Systems and methods for self-supervised learning for visual odometry using camera images, may include: estimating correspondences between keypoints of a target camera image and keypoints of a context camera image; based on the keypoint correspondences, lifting a set of 2D keypoints to 3D, using a neural camera model; and projecting the 3D keypoints into the context camera image using the neural camera model. Some embodiments may use the neural camera model to achieve the lifting and projecting of keypoints without a known or calibrated camera model.Type: ApplicationFiled: April 17, 2022Publication date: August 4, 2022Inventors: VITOR GUIZILINI, Igor Vasiljevic, Rares A. Ambrus, Sudeep Pillai, Adrien Gaidon
-
Publication number: 20220222889Abstract: A method for 3D object modeling includes linking 2D semantic keypoints of an object within a video stream into a 2D structured object geometry. The method includes inputting, to a neural network, the object to generate a 2D NOCS image and a shape vector, the shape vector being mapped to a continuously traversable coordinate shape. The method includes applying a differentiable shape renderer to the SDF shape and the 2D NOCS image to render a shape of the object corresponding to a 3D object model in the continuously traversable coordinate shape space. The method includes lifting the linked, 2D semantic keypoints of the 2D structured object geometry to a 3D structured object geometry. The method includes geometrically and projectively aligning the 3D object model, the 3D structured object geometry, and the rendered shape to form a rendered object. The method includes generating 3D bounding boxes from the rendered object.Type: ApplicationFiled: January 12, 2021Publication date: July 14, 2022Applicant: TOYOTA RESEARCH INSTITUTE, INC.Inventors: Arjun BHARGAVA, Sudeep PILLAI, Kuan-Hui LEE, Kun-Hsin CHEN
-
Patent number: 11386567Abstract: System, methods, and other embodiments described herein relate to semi-supervised training of a depth model for monocular depth estimation. In one embodiment, a method includes training the depth model according to a first stage that is self-supervised and that includes using first training data that comprises pairs of training images. Respective ones of the pairs including separate frames depicting a scene of a monocular video. The method includes training the depth model according to a second stage that is weakly supervised and that includes using second training data to produce depth maps according to the depth model. The second training data comprising individual images with corresponding sparse depth data. The second training data providing for updating the depth model according to second stage loss values that are based, at least in part, on the depth maps and the depth data.Type: GrantFiled: December 3, 2019Date of Patent: July 12, 2022Assignee: Toyota Research Institute, Inc.Inventors: Vitor Guizilini, Sudeep Pillai, Rares A. Ambrus, Jie Li, Adrien David Gaidon
-
Publication number: 20220198204Abstract: In one embodiment, a traffic light classification system for a vehicle, includes an image capture device to capture an image of a scene that includes a traffic light with multiple light signals, a processor, and a memory communicably coupled to the processor and storing a first neural network module including instructions that when executed by the processor cause the processor to determine, based on inputting the image into a neural network, a semantic keypoint for each light signal in the traffic light, and determine, based on each semantic keypoint, a classification state of each light signal.Type: ApplicationFiled: December 22, 2020Publication date: June 23, 2022Inventors: Kun-Hsin Chen, Kuan-Hui Lee, Jia-En Pan, Sudeep Pillai
-
Publication number: 20220148206Abstract: A method for monocular depth/pose estimation in a camera agnostic network is described. The method includes projecting lifted 3D points onto an image plane according to a predicted ray vector based on a monocular depth model, a monocular pose model, and a camera center of a camera agnostic network. The method also includes predicting a warped target image from a predicted depth map of the monocular depth model, a ray surface of the predicted ray vector, and a projection of the lifted 3D points according to the camera agnostic network.Type: ApplicationFiled: January 21, 2022Publication date: May 12, 2022Applicant: TOYOTA RESEARCH INSTITUTE, INC.Inventors: Vitor GUIZILINI, Sudeep PILLAI, Adrien David GAIDON, Rares A. AMBRUS, Igor VASILJEVIC