Patents by Inventor Sudeep Pillai

Sudeep Pillai has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11966234
    Abstract: A method for controlling an ego agent includes capturing a two-dimensional (2D) image of an environment adjacent to the ego agent. The method also includes generating a semantically segmented image of the environment based on the 2D image. The method further includes generating a depth map of the environment based on the semantically segmented image. The method additionally includes generating a three-dimensional (3D) estimate of the environment based on the depth map. The method also includes controlling an action of the ego agent based on the identified location.
    Type: Grant
    Filed: July 23, 2020
    Date of Patent: April 23, 2024
    Assignee: TOYOTA RESEARCH INSTITUTE, INC.
    Inventors: Vitor Guizilini, Jie Li, Rares A. Ambrus, Sudeep Pillai, Adrien Gaidon
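The depth-map-to-3D step this abstract describes can be illustrated with a minimal unprojection sketch, assuming a pinhole camera model with known intrinsics `K` (the abstract does not specify a camera model, so this is an illustrative assumption):

```python
import numpy as np

def unproject_depth(depth, K):
    """Lift a dense depth map to 3D points using pinhole intrinsics K.

    depth: (H, W) array of per-pixel depths.
    K: (3, 3) camera intrinsic matrix.
    Returns: (H, W, 3) array of 3D points in the camera frame.
    """
    H, W = depth.shape
    u, v = np.meshgrid(np.arange(W), np.arange(H))    # pixel coordinate grid
    pix = np.stack([u, v, np.ones_like(u)], axis=-1)  # homogeneous pixel coords
    rays = pix @ np.linalg.inv(K).T                   # back-projected rays
    return rays * depth[..., None]                    # scale each ray by its depth

# Toy example: flat depth of 2 m, unit focal length, centered principal point
K = np.array([[1.0, 0.0, 2.0],
              [0.0, 1.0, 2.0],
              [0.0, 0.0, 1.0]])
points = unproject_depth(np.full((4, 4), 2.0), K)
```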
  • Patent number: 11954919
    Abstract: Systems and methods are provided for developing/updating training datasets for traffic light detection/perception models. V2I-based information may indicate a particular traffic light state/state of transition. This information can be compared to a traffic light perception prediction. When the prediction is inconsistent with the V2I-based information, data regarding the condition(s)/traffic light(s)/etc. can be saved and uploaded to a training database to update/refine the training dataset(s) maintained therein. In this way, an existing traffic light perception model can be updated/improved and/or a better traffic light perception model can be developed.
    Type: Grant
    Filed: October 8, 2020
    Date of Patent: April 9, 2024
    Assignee: TOYOTA RESEARCH INSTITUTE, INC.
    Inventors: Kun-Hsin Chen, Peiyan Gong, Shunsho Kaku, Sudeep Pillai, Hai Jin, Sarah Yoo, David L. Garber, Ryan W. Wolcott
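The compare-and-save loop described above can be sketched in a few lines; the frame record fields (`predicted`, `v2i`) and the state names are illustrative, not from the patent:

```python
# Disagreement mining: when the perception model's traffic light prediction
# contradicts the V2I-based state, queue that frame for the training dataset.
def mine_disagreements(frames):
    """frames: iterable of dicts with 'predicted' and 'v2i' light states."""
    to_upload = []
    for frame in frames:
        if frame["predicted"] != frame["v2i"]:  # model disagrees with infrastructure
            to_upload.append(frame)             # save for dataset refinement
    return to_upload

frames = [
    {"id": 0, "predicted": "red",    "v2i": "red"},
    {"id": 1, "predicted": "green",  "v2i": "red"},     # mismatch -> keep
    {"id": 2, "predicted": "yellow", "v2i": "yellow"},
]
hard_cases = mine_disagreements(frames)
```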
  • Patent number: 11922640
    Abstract: A method for 3D object tracking is described. The method includes inferring first 2D semantic keypoints of a 3D object within a sparsely annotated video stream. The method also includes matching the first 2D semantic keypoints of a current frame with second 2D semantic keypoints in a next frame of the sparsely annotated video stream using embedded descriptors within the current frame and the next frame. The method further includes warping the first 2D semantic keypoints to the second 2D semantic keypoints to form warped 2D semantic keypoints in the next frame. The method also includes labeling a 3D bounding box in the next frame according to the warped 2D semantic keypoints in the next frame.
    Type: Grant
    Filed: March 8, 2021
    Date of Patent: March 5, 2024
    Assignee: TOYOTA RESEARCH INSTITUTE, INC.
    Inventors: Arjun Bhargava, Sudeep Pillai, Kuan-Hui Lee
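The descriptor-based matching step can be illustrated with a brute-force nearest-neighbour sketch over toy descriptors; the patent's embedded descriptors come from a learned network, and Euclidean distance here is an illustrative assumption:

```python
import numpy as np

def match_keypoints(desc_curr, desc_next):
    """Match keypoints by nearest-neighbour descriptor distance.

    desc_curr: (N, D) descriptors for the current frame.
    desc_next: (M, D) descriptors for the next frame.
    Returns: list of (i, j) index pairs, one per current-frame keypoint.
    """
    # Pairwise Euclidean distances between all descriptor pairs
    d = np.linalg.norm(desc_curr[:, None, :] - desc_next[None, :, :], axis=-1)
    return [(i, int(j)) for i, j in enumerate(d.argmin(axis=1))]

# Toy descriptors: keypoint 0 in the current frame matches keypoint 1 in the next
curr = np.array([[1.0, 0.0], [0.0, 1.0]])
nxt = np.array([[0.0, 0.9], [0.9, 0.1]])
pairs = match_keypoints(curr, nxt)
```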
  • Patent number: 11915487
    Abstract: Systems and methods to improve machine learning by explicitly over-fitting environmental data obtained by an imaging system, such as a monocular camera, are disclosed. The system includes training self-supervised depth and pose networks on monocular visual data collected from a certain area over multiple passes. Pose and depth networks may be trained by extracting data from multiple images of a single environment or trajectory, allowing the system to overfit the image data.
    Type: Grant
    Filed: May 5, 2020
    Date of Patent: February 27, 2024
    Assignee: TOYOTA RESEARCH INSTITUTE, INC.
    Inventors: Rares A. Ambrus, Vitor Guizilini, Sudeep Pillai, Adrien David Gaidon
  • Patent number: 11900626
    Abstract: A method for learning depth-aware keypoints and associated descriptors from monocular video for ego-motion estimation is described. The method includes training a keypoint network and a depth network to learn depth-aware keypoints and the associated descriptors. The training is based on a target image and a context image from successive images of the monocular video. The method also includes lifting 2D keypoints from the target image to learn 3D keypoints based on a learned depth map from the depth network. The method further includes estimating ego-motion from the target image to the context image based on the learned 3D keypoints.
    Type: Grant
    Filed: November 9, 2020
    Date of Patent: February 13, 2024
    Assignee: TOYOTA RESEARCH INSTITUTE, INC.
    Inventors: Jiexiong Tang, Rares A. Ambrus, Vitor Guizilini, Sudeep Pillai, Hanme Kim, Adrien David Gaidon
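The final step, estimating ego-motion from matched 3D keypoints, reduces to a rigid alignment problem. A minimal sketch using a least-squares (Kabsch) solve, assuming the matched 3D keypoints are already given (in the patent, the keypoint and depth networks supply them):

```python
import numpy as np

def estimate_motion(p_target, p_context):
    """Rigid transform (R, t) mapping 3D keypoints p_target onto p_context."""
    ct, cc = p_target.mean(0), p_context.mean(0)        # centroids
    H = (p_target - ct).T @ (p_context - cc)            # cross-covariance
    U, _, Vt = np.linalg.svd(H)
    S = np.diag([1.0, 1.0, np.linalg.det(Vt.T @ U.T)])  # guard against reflections
    R = Vt.T @ S @ U.T
    t = cc - R @ ct
    return R, t

# Toy check: rotate points 90 degrees about z and recover the motion
R_true = np.array([[0.0, -1.0, 0.0],
                   [1.0,  0.0, 0.0],
                   [0.0,  0.0, 1.0]])
pts = np.array([[1.0, 0.0, 0.0], [0.0, 1.0, 0.0],
                [0.0, 0.0, 1.0], [1.0, 1.0, 0.0]])
R_est, t_est = estimate_motion(pts, pts @ R_true.T)
```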
  • Patent number: 11891094
    Abstract: Information that identifies a location can be received. In response to a receipt of the information that identifies the location, a file can be retrieved. The file can be for the location. The file can include image data and a set of node data. The set of node data can include information that identifies nodes in a neural network, information that identifies inputs of the nodes, and values of weights to be applied to the inputs. In response to a retrieval of the file, the weights can be applied to the inputs of the nodes and the image data can be received for the neural network. In response to an application of the weights and a receipt of the image data, the neural network can be executed to produce a digital map for the location. The digital map for the location can be transmitted to an automotive navigation system.
    Type: Grant
    Filed: June 10, 2020
    Date of Patent: February 6, 2024
    Assignee: Toyota Research Institute, Inc.
    Inventors: Vitor Guizilini, Rares A. Ambrus, Sudeep Pillai, Adrien David Gaidon
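The retrieve-apply-execute flow can be sketched with a toy node list; the file layout, field names, and ReLU activation are all illustrative assumptions, not details from the patent:

```python
# A per-location "file" bundles node data (node ids, their inputs, and the
# weights to apply to those inputs) plus image data; applying the weights and
# executing the nodes yields the network's output for that location.
def run_location_network(location_file):
    values = dict(location_file["image_data"])  # input activations
    for node in location_file["nodes"]:         # assumes topological order
        total = sum(values[src] * w
                    for src, w in zip(node["inputs"], node["weights"]))
        values[node["id"]] = max(0.0, total)    # ReLU activation (assumed)
    return values[location_file["nodes"][-1]["id"]]

location_file = {
    "image_data": {"px0": 1.0, "px1": 2.0},
    "nodes": [
        {"id": "h",   "inputs": ["px0", "px1"], "weights": [0.5, 0.25]},
        {"id": "out", "inputs": ["h"],          "weights": [2.0]},
    ],
}
out = run_location_network(location_file)
```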
  • Patent number: 11854280
    Abstract: A method for 3D object detection is described. The method includes detecting semantic keypoints from monocular images of a video stream capturing a 3D object. The method also includes inferring a 3D bounding box of the 3D object corresponding to the detected semantic vehicle keypoints. The method further includes scoring the inferred 3D bounding box of the 3D object. The method also includes detecting the 3D object according to a final 3D bounding box generated based on the scoring of the inferred 3D bounding box.
    Type: Grant
    Filed: April 27, 2021
    Date of Patent: December 26, 2023
    Assignee: TOYOTA RESEARCH INSTITUTE, INC.
    Inventors: Arjun Bhargava, Haofeng Chen, Adrien David Gaidon, Rares A. Ambrus, Sudeep Pillai
  • Patent number: 11810367
    Abstract: Described herein are systems and methods for determining if a vehicle is parked. In one example, a system includes a processor, a sensor system, and a memory. Both the sensor system and the memory are in communication with the processor. The memory includes a parking determination module having instructions that, when executed by the processor, cause the processor to determine, using a random forest model, when the vehicle is parked based on vehicle estimated features, vehicle learned features, and vehicle taillight features of the vehicle that are based on sensor data from the sensor system.
    Type: Grant
    Filed: June 29, 2021
    Date of Patent: November 7, 2023
    Assignee: Toyota Research Institute, Inc.
    Inventors: Chao Fang, Kuan-Hui Lee, Logan Michael Ellis, Jia-En Pan, Kun-Hsin Chen, Sudeep Pillai, Daniele Molinari, Constantin Franziskus Dominik Hubmann, T. Wolfram Burgard
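The random forest vote over the three feature groups can be illustrated with a hand-rolled stand-in in which each "tree" is reduced to a single rule; the feature names, thresholds, and rules are invented for illustration:

```python
# Majority vote over one rule per feature group (estimated, learned,
# taillight), standing in for the patent's random forest model.
def is_parked(features):
    votes = [
        features["speed_mps"] < 0.1,           # estimated-feature rule
        features["learned_park_score"] > 0.5,  # learned-feature rule
        not features["brake_lights_on"],       # taillight-feature rule
    ]
    return sum(votes) >= 2                     # majority vote, as a forest would

sample = {"speed_mps": 0.0, "learned_park_score": 0.8, "brake_lights_on": False}
parked = is_parked(sample)
```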
  • Patent number: 11783591
    Abstract: A representation of a spatial structure of objects in an image can be determined. A mode of a neural network can be set, in response to a receipt of the image and a receipt of a facing direction of a camera that produced the image. The mode can account for the facing direction. The facing direction can include one or more of a first facing direction of a first camera disposed on a vehicle or a second facing direction of a second camera disposed on the vehicle. The neural network can be executed, in response to the mode having been set, to determine the representation of the spatial structure of the objects in the image. The representation of the spatial structure of the objects in the image can be transmitted to an automotive navigation system to determine a distance between the vehicle and a specific object in the image.
    Type: Grant
    Filed: June 11, 2020
    Date of Patent: October 10, 2023
    Assignee: Toyota Research Institute, Inc.
    Inventors: Sudeep Pillai, Vitor Guizilini, Rares A. Ambrus, Adrien David Gaidon
  • Patent number: 11776281
    Abstract: A traffic light classification system for a vehicle includes an image capture device to capture an image of a scene that includes a traffic light with multiple light signals, a processor, and a memory communicably coupled to the processor and storing a first neural network module including instructions that when executed by the processor cause the processor to determine, based on inputting the image into a neural network, a semantic keypoint for each light signal in the traffic light, and determine, based on each semantic keypoint, a classification state of each light signal.
    Type: Grant
    Filed: December 22, 2020
    Date of Patent: October 3, 2023
    Assignee: Toyota Research Institute, Inc.
    Inventors: Kun-Hsin Chen, Kuan-Hui Lee, Jia-En Pan, Sudeep Pillai
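The per-signal classification step can be sketched by thresholding one keypoint activation map per light signal; the heatmap representation, threshold, and state labels are illustrative assumptions:

```python
import numpy as np

# One activation map per light signal; the peak activation decides whether
# that signal is classified as lit.
def classify_signals(heatmaps, threshold=0.5):
    """heatmaps: dict mapping signal name -> 2D keypoint activation map."""
    return {name: ("on" if hm.max() > threshold else "off")
            for name, hm in heatmaps.items()}

maps = {
    "red":    np.array([[0.1, 0.9], [0.0, 0.2]]),  # strong peak -> lit
    "yellow": np.array([[0.1, 0.2], [0.1, 0.0]]),
    "green":  np.array([[0.0, 0.1], [0.3, 0.2]]),
}
states = classify_signals(maps)
```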
  • Patent number: 11741728
    Abstract: A method for keypoint matching includes determining a first set of keypoints corresponding to a current environment of the agent. The method further includes determining a second set of keypoints from a pre-built map of the current environment. The method still further includes identifying matching pairs of keypoints from the first set of keypoints and the second set of keypoints based on geometrical similarities between respective keypoints of the first set of keypoints and the second set of keypoints. The method also includes determining a current location of the agent based on the identified matching pairs of keypoints. The method further includes controlling an action of the agent based on the current location.
    Type: Grant
    Filed: April 15, 2021
    Date of Patent: August 29, 2023
    Assignee: TOYOTA RESEARCH INSTITUTE, INC.
    Inventors: Jiexiong Tang, Rares Andrei Ambrus, Jie Li, Vitor Guizilini, Sudeep Pillai, Adrien David Gaidon
  • Patent number: 11721065
    Abstract: A method for 3D object modeling includes linking 2D semantic keypoints of an object within a video stream into a 2D structured object geometry. The method includes inputting, to a neural network, the object to generate a 2D NOCS image and a shape vector, the shape vector being mapped to a continuously traversable coordinate shape space. The method includes applying a differentiable shape renderer to the signed distance function (SDF) shape and the 2D NOCS image to render a shape of the object corresponding to a 3D object model in the continuously traversable coordinate shape space. The method includes lifting the linked 2D semantic keypoints of the 2D structured object geometry to a 3D structured object geometry. The method includes geometrically and projectively aligning the 3D object model, the 3D structured object geometry, and the rendered shape to form a rendered object. The method includes generating 3D bounding boxes from the rendered object.
    Type: Grant
    Filed: August 25, 2022
    Date of Patent: August 8, 2023
    Inventors: Arjun Bhargava, Sudeep Pillai, Kuan-Hui Lee, Kun-Hsin Chen
  • Patent number: 11704385
    Abstract: A method for traffic light auto-labeling includes aggregating vehicle-to-infrastructure (V2I) traffic light signals at an intersection to determine transition states of each driving lane at the intersection during operation of an ego vehicle. The method also includes automatically labeling image training data to form auto-labeled image training data for a traffic light recognition model within the ego vehicle according to the determined transition states of each driving lane at the intersection. The method further includes planning a trajectory of the ego vehicle to comply with a right-of-way according to the transition states of each driving lane at the intersection, as determined by a trained traffic light detection model. A federated learning module may train the traffic light recognition model using the auto-labeled image training data during the operation of the ego vehicle.
    Type: Grant
    Filed: September 25, 2020
    Date of Patent: July 18, 2023
    Assignee: TOYOTA RESEARCH INSTITUTE, INC.
    Inventors: Kun-Hsin Chen, Sudeep Pillai, Shunsho Kaku, Hai Jin, Peiyan Gong, Wolfram Burgard
  • Patent number: 11704821
    Abstract: A method for monocular depth/pose estimation in a camera agnostic network is described. The method includes projecting lifted 3D points onto an image plane according to a predicted ray vector based on a monocular depth model, a monocular pose model, and a camera center of a camera agnostic network. The method also includes predicting a warped target image from a predicted depth map of the monocular depth model, a ray surface of the predicted ray vector, and a projection of the lifted 3D points according to the camera agnostic network.
    Type: Grant
    Filed: January 21, 2022
    Date of Patent: July 18, 2023
    Assignee: TOYOTA RESEARCH INSTITUTE, INC.
    Inventors: Vitor Guizilini, Sudeep Pillai, Adrien David Gaidon, Rares A. Ambrus, Igor Vasiljevic
  • Patent number: 11652972
    Abstract: System, methods, and other embodiments described herein relate to improving depth estimates for monocular images using a neural camera model that is independent of a camera type. In one embodiment, a method includes receiving a monocular image from a pair of training images derived from a monocular video. The method includes generating, using a ray surface network, a ray surface that approximates an image character of the monocular image as produced by a camera having the camera type. The method includes creating a synthesized image according to at least the ray surface and a depth map associated with the monocular image.
    Type: Grant
    Filed: June 12, 2020
    Date of Patent: May 16, 2023
    Assignee: Toyota Research Institute, Inc.
    Inventors: Vitor Guizilini, Igor Vasiljevic, Rares A. Ambrus, Sudeep Pillai, Adrien David Gaidon
  • Publication number: 20230137673
    Abstract: Systems and methods for dynamically selecting idle or underutilized resources to complete tasks in a queue are disclosed. The systems and methods include maintaining a plurality of processing resources operable to process one or more tasks. Each resource is scalable to increase or decrease a number of nodes available to perform the one or more tasks. The systems and methods include maintaining a queue for tasks to be processed, receiving a first task requiring a processing resource, and accessing at least a portion of the plurality of processing resources. A first processing resource of the plurality of processing resources is identified that is operating below a predetermined processing threshold. The first processing resource is assigned to the first task and scaled up according to a processing requirement of the first task.
    Type: Application
    Filed: October 28, 2021
    Publication date: May 4, 2023
    Inventors: David Rowan, Sudeep Pillai
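The selection-and-scale-up step can be sketched as a linear scan for a resource operating below the utilization threshold; the field names, threshold value, and scaling rule are illustrative assumptions:

```python
# Scan resources, pick the first one below the utilization threshold,
# assign the task, and scale the node count up to the task's requirement.
def assign_task(resources, task, threshold=0.5):
    for res in resources:
        if res["utilization"] < threshold:  # idle or underutilized
            res["tasks"].append(task["name"])
            res["nodes"] = max(res["nodes"], task["nodes_needed"])  # scale up
            return res["name"]
    return None                             # no capacity: leave task queued

resources = [
    {"name": "gpu-pool", "utilization": 0.9, "nodes": 4, "tasks": []},
    {"name": "cpu-pool", "utilization": 0.2, "nodes": 2, "tasks": []},
]
assigned = assign_task(resources, {"name": "ingest", "nodes_needed": 6})
```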
  • Patent number: 11625839
    Abstract: Systems and methods for determining the velocity of an object associated with a three-dimensional (3D) scene may include: a LIDAR system generating two sets of 3D point cloud data of the scene from two consecutive point cloud sweeps; a pillar feature network encoding data of the point cloud data to extract two-dimensional (2D) bird's-eye-view embeddings for each of the point cloud data sets in the form of pseudo images, wherein the 2D bird's-eye-view embeddings for a first of the two point cloud data sets comprises pillar features for the first point cloud data set and the 2D bird's-eye-view embeddings for a second of the two point cloud data sets comprises pillar features for the second point cloud data set; and a feature pyramid network encoding the pillar features and performing a 2D optical flow estimation to estimate the velocity of the object.
    Type: Grant
    Filed: May 18, 2020
    Date of Patent: April 11, 2023
    Assignee: TOYOTA RESEARCH INSTITUTE, INC.
    Inventors: Kuan-Hui Lee, Sudeep Pillai, Adrien David Gaidon
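The pillar encoding can be illustrated, in much simplified form, by scattering points into a bird's-eye-view grid; the patent's pillar features are learned, so per-cell point counts stand in for them here, and the grid size and extent are illustrative:

```python
import numpy as np

def bev_pseudo_image(points, grid=(4, 4), extent=4.0):
    """Scatter LIDAR points into a bird's-eye-view occupancy grid.

    points: (N, 3) array of x, y, z coordinates in [0, extent).
    Returns a (grid[0], grid[1]) pseudo image of point counts per pillar.
    """
    img = np.zeros(grid)
    ix = (points[:, 0] / extent * grid[0]).astype(int)  # x -> row index
    iy = (points[:, 1] / extent * grid[1]).astype(int)  # y -> column index
    np.add.at(img, (ix, iy), 1.0)                       # accumulate counts per pillar
    return img

pts = np.array([[0.5, 0.5, 0.0], [0.5, 0.7, 1.0], [3.5, 3.5, 0.2]])
bev = bev_pseudo_image(pts)
```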
  • Patent number: 11615544
    Abstract: Systems and methods for map construction using a video sequence captured on a camera of a vehicle in an environment, comprising: receiving a video sequence from the camera, the video sequence including a plurality of image frames capturing a scene of the environment of the vehicle; using a neural camera model to predict a depth map and a ray surface for the plurality of image frames in the received video sequence; and constructing a map of the scene of the environment based on image data captured in the plurality of frames and depth information in the predicted depth maps.
    Type: Grant
    Filed: September 15, 2020
    Date of Patent: March 28, 2023
    Assignee: TOYOTA RESEARCH INSTITUTE, INC.
    Inventors: Vitor Guizilini, Igor Vasiljevic, Rares A. Ambrus, Sudeep Pillai, Adrien Gaidon
  • Publication number: 20230047160
    Abstract: Systems, methods, computer-readable media, techniques, and methodologies are disclosed for performing end-to-end, learning-based keypoint detection and association. A scene graph of a signalized intersection is constructed from an input image of the intersection. The scene graph includes detected keypoints and linkages identified between the keypoints. The scene graph can be used along with a vehicle's localization information to identify which keypoint that represents a traffic signal is associated with the vehicle's current travel lane. An appropriate vehicle action may then be determined based on a transition state of the traffic signal keypoint and trajectory information for the vehicle. A control signal indicative of this vehicle action may then be output to cause an autonomous vehicle, for example, to implement the appropriate vehicle action.
    Type: Application
    Filed: October 29, 2022
    Publication date: February 16, 2023
    Inventors: Kun-Hsin Chen, Peiyan Gong, Sudeep Pillai, Arjun Bhargava, Shunsho Kaku, Hai Jin, Kuan-Hui Lee
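The scene-graph lookup can be sketched with a toy graph mapping lanes to traffic signal keypoints; the dictionary layout, ids, and states are illustrative assumptions:

```python
# A scene graph holds the detected keypoints and the lane-to-signal linkages;
# the vehicle's current lane selects the governing signal keypoint.
scene_graph = {
    "keypoints": {"sig_a": {"state": "red"}, "sig_b": {"state": "green"}},
    "linkages": {"lane_1": "sig_a", "lane_2": "sig_b"},  # lane -> signal keypoint
}

def signal_for_lane(graph, current_lane):
    signal_id = graph["linkages"][current_lane]
    return signal_id, graph["keypoints"][signal_id]["state"]

sig, state = signal_for_lane(scene_graph, "lane_2")
```

The vehicle action (proceed, slow, stop) would then be chosen from `state` together with the vehicle's trajectory information.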
  • Publication number: 20230031289
    Abstract: A method for 2D semantic keypoint detection and tracking is described. The method includes learning embedded descriptors of salient object keypoints detected in previous images according to a descriptor embedding space model. The method also includes predicting, using a shared image encoder backbone, salient object keypoints within a current image of a video stream. The method further includes inferring an object represented by the predicted, salient object keypoints within the current image of the video stream. The method also includes tracking the inferred object by matching embedded descriptors of the predicted, salient object keypoints representing the inferred object within the previous images of the video stream based on the descriptor embedding space model.
    Type: Application
    Filed: July 30, 2021
    Publication date: February 2, 2023
    Applicant: TOYOTA RESEARCH INSTITUTE, INC.
    Inventors: Haofeng Chen, Arjun Bhargava, Rares Andrei Ambrus, Sudeep Pillai