Patents by Inventor Nishant Puri
Nishant Puri has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 12260017
Abstract: Machine learning systems and methods that determine gaze direction by using face orientation information, such as facial landmarks, to modify eye direction information determined from images of the subject's eyes. System inputs include eye crops of the eyes of the subject, as well as face orientation information such as facial landmarks of the subject's face in the input image. Facial orientation information, or facial landmark information, is used to determine a coarse prediction of gaze direction as well as to learn a context vector of features describing subject face pose. The context vector is then used to adaptively re-weight the eye direction features determined from the eye crops. The re-weighted features are then combined with the coarse gaze prediction to determine gaze direction.
Type: Grant
Filed: March 10, 2023
Date of Patent: March 25, 2025
Assignee: NVIDIA Corporation
Inventor: Nishant Puri
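The adaptive re-weighting described in the abstract above can be pictured as a learned gate: face landmarks yield both a coarse gaze estimate and a context vector that scales the eye-crop features before they refine that estimate. The sketch below is purely illustrative — the dimensions, random stand-in weights, and the `predict_gaze` helper are invented, not taken from the patent:

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

rng = np.random.default_rng(0)

# Hypothetical stand-ins for trained weights.
D_LM, D_EYE = 136, 64                            # 68 (x, y) landmarks; eye-crop feature size
W_ctx = rng.normal(size=(D_LM, D_EYE)) * 0.01    # landmarks -> context vector (gate logits)
W_coarse = rng.normal(size=(D_LM, 2)) * 0.01     # landmarks -> coarse gaze (yaw, pitch)
W_fine = rng.normal(size=(D_EYE, 2)) * 0.01      # re-weighted eye features -> gaze residual

def predict_gaze(landmarks, eye_features):
    """Coarse gaze from face pose, refined by context-gated eye features."""
    coarse = landmarks @ W_coarse             # coarse gaze prediction from face orientation
    gate = sigmoid(landmarks @ W_ctx)         # context vector -> per-feature weights in (0, 1)
    reweighted = gate * eye_features          # adaptively re-weight eye direction features
    return coarse + reweighted @ W_fine       # combine with coarse prediction

landmarks = rng.normal(size=D_LM)
eye_features = rng.normal(size=D_EYE)
gaze = predict_gaze(landmarks, eye_features)  # final (yaw, pitch) estimate
```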
-
Patent number: 12211308
Abstract: Interactions with virtual systems may be difficult when users inadvertently fail to provide sufficient information to proceed with their requests. Certain types of inputs, such as auditory inputs, may lack sufficient information to properly provide a response to the user. Additional information, such as image data, may enable user gestures or poses to supplement the auditory inputs to enable response generation without requesting additional information from users.
Type: Grant
Filed: August 31, 2021
Date of Patent: January 28, 2025
Assignee: Nvidia Corporation
Inventors: Sakthivel Sivaraman, Nishant Puri, Yuzhuo Ren, Atousa Torabi, Shubhadeep Das, Niranjan Avadhanam, Sumit Kumar Bhattacharya, Jason Roche
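As a toy illustration of the supplementing idea above: a spoken request like "turn that one up" carries an action but no referent, and a detected pointing gesture can fill the gap without a follow-up question. The slot names and `fuse` helper here are hypothetical:

```python
def fuse(voice_slots, gesture_target):
    """Fill missing slots in a parsed voice request using a visually detected gesture."""
    filled = dict(voice_slots)
    if filled.get("object") is None and gesture_target is not None:
        filled["object"] = gesture_target   # resolve "that one" via the pointing target
    return filled

# "Turn that one up" -> action present, object missing; the gesture supplies it.
request = {"action": "increase_volume", "object": None}
result = fuse(request, gesture_target="rear_left_speaker")
```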
-
Publication number: 20240412491
Abstract: Apparatuses, systems, and techniques that use one or more first neural networks to generate synthetic data to train one or more second neural networks based, at least in part, on one or more performance metrics of the one or more second neural networks.
Type: Application
Filed: June 9, 2023
Publication date: December 12, 2024
Inventors: Shagan Sah, Nishant Puri, Yuzhuo Ren, Rajath Bellipady Shetty, Weili Nie, Arash Vahdat, Animashree Anandkumar
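One way to read the abstract above: performance metrics of the trained model steer how much synthetic data to generate, and for what. The allocation heuristic below (more samples for classes further below a threshold) is an invented illustration, not the publication's method:

```python
def plan_synthesis(per_class_accuracy, threshold=0.8, batch=100):
    """Decide how many synthetic samples to request per class, guided by metrics."""
    plan = {}
    for cls, acc in per_class_accuracy.items():
        if acc < threshold:
            # Request more samples the further below threshold the class falls.
            plan[cls] = batch + int(batch * (threshold - acc) / threshold)
    return plan

metrics = {"cat": 0.92, "dog": 0.55, "truck": 0.70}
plan = plan_synthesis(metrics)   # only under-performing classes get synthetic data
```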
-
Publication number: 20240290112
Abstract: In various examples, systems and methods are provided for generation of ground truth gaze data for training in-cabin monitoring systems and applications. A gaze target projector mounted to a known position inside a cabin may be used to project a gaze target onto an interior surface of the cabin. Because a beam of light may be used to produce the projected gaze target, the projected gaze target may be displayed at a projection point on the surface of the cabin interior, even if the surface at the projection point is curved, small, or an irregular shape. Three-dimensional coordinates of a projected gaze target in the cabin coordinate system may be determined and used to label image data that is captured as a projected gaze target is selectively projected onto an interior surface of the cabin and a test occupant's gaze is directed at the projected gaze target.
Type: Application
Filed: February 28, 2023
Publication date: August 29, 2024
Inventors: Martin HEMPEL, Nishant PURI, Anshul JAIN, Chun-Wei CHEN, Dae Jin KIM, Frederic VATNSDAL
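Determining the 3D cabin coordinates of the projected target reduces, in the simplest case, to intersecting the projector's beam with the struck surface. The sketch below assumes a locally planar surface patch (real cabin surfaces may be curved, as the abstract notes) and is an illustrative geometry exercise, not the publication's procedure:

```python
import numpy as np

def beam_hit_point(origin, direction, plane_point, plane_normal):
    """3D point (cabin coordinates) where a projector beam hits a locally planar surface."""
    o = np.asarray(origin, float)
    d = np.asarray(direction, float)
    d = d / np.linalg.norm(d)
    n = np.asarray(plane_normal, float)
    denom = d @ n
    if abs(denom) < 1e-9:
        return None                          # beam parallel to the surface
    t = ((np.asarray(plane_point, float) - o) @ n) / denom
    if t < 0:
        return None                          # surface is behind the projector
    return o + t * d

# Projector at a known cabin position, beam aimed at a dashboard patch in the plane z = 0.5.
target = beam_hit_point(origin=[0, 0, 0], direction=[0, 0, 1],
                        plane_point=[0, 0, 0.5], plane_normal=[0, 0, 1])
```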
-
Publication number: 20240265254
Abstract: Systems and methods for more accurate and robust determination of subject characteristics from an image of the subject. One or more machine learning models receive as input an image of a subject, and output both facial landmarks and associated confidence values. Confidence values represent the degrees to which portions of the subject's face corresponding to those landmarks are occluded, i.e., the amount of uncertainty in the position of each landmark. These landmark points and their associated confidence values, and/or associated information, may then be input to another set of one or more machine learning models which may output any facial analysis quantity or quantities, such as the subject's gaze direction, head pose, drowsiness state, cognitive load, or distraction state.
Type: Application
Filed: March 14, 2024
Publication date: August 8, 2024
Inventors: Nuri Murat Arar, Niranjan Avadhanam, Nishant Puri, Shagan Sah, Rajath Shetty, Sujay Yadawadkar, Pavlo Molchanov
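Concretely, feeding both landmark positions and their confidence values to a second model can be as simple as stacking the confidence alongside each coordinate, so the downstream model can discount occluded points. The feature layout below is an invented minimal example of that interface, not the publication's actual encoding:

```python
import numpy as np

def landmark_features(points, confidences):
    """Stack landmark coordinates with per-landmark confidence for a downstream model.

    Low confidence flags occlusion, letting the second model discount those points.
    """
    pts = np.asarray(points, float)           # (N, 2) landmark positions in pixels
    conf = np.asarray(confidences, float)     # (N,) values in [0, 1]; low = occluded
    return np.concatenate([pts, conf[:, None]], axis=1)   # (N, 3) rows of (x, y, conf)

points = [[100.0, 80.0], [140.0, 82.0], [120.0, 130.0]]
conf = [0.95, 0.20, 0.90]                     # second landmark is likely occluded
feats = landmark_features(points, conf)
```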
-
Publication number: 20240143072
Abstract: In various examples, systems and methods are disclosed that provide highly accurate gaze predictions that are specific to a particular user by generating and applying, in deployment, personalized calibration functions to outputs and/or layers of a machine learning model. The calibration functions corresponding to a specific user may operate on outputs (e.g., gaze predictions from a machine learning model) to provide updated values and gaze predictions. The calibration functions may also be applied to one or more last layers of the machine learning model to operate on features identified by the model and provide values that are more accurate. The calibration functions may be generated using explicit calibration methods by instructing users to gaze at a number of identified ground truth locations within the interior of the vehicle. Once generated, the calibration functions may be modified or refined through implicit gaze calibration points and/or regions based on gaze saliency maps.
Type: Application
Filed: January 11, 2024
Publication date: May 2, 2024
Inventors: Nuri Murat Arar, Sujay Yadawadkar, Hairong Jiang, Nishant Puri, Niranjan Avadhanam
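A simple instance of a personalized calibration function operating on model outputs is an affine correction fitted from raw gaze predictions to the known ground-truth targets the user was asked to look at. The least-squares fit below is a generic illustration of that idea, with made-up numbers; the publication's calibration functions need not be affine:

```python
import numpy as np

def fit_affine_calibration(predicted, ground_truth):
    """Least-squares affine map correcting raw gaze predictions for one user."""
    P = np.asarray(predicted, float)              # (N, 2) raw (yaw, pitch) predictions
    G = np.asarray(ground_truth, float)           # (N, 2) true (yaw, pitch) at targets
    X = np.hstack([P, np.ones((len(P), 1))])      # append bias column
    A, *_ = np.linalg.lstsq(X, G, rcond=None)     # (3, 2) affine parameters
    return lambda p: np.hstack([np.atleast_2d(p), np.ones((1, 1))]) @ A

# Explicit calibration: the user gazes at known in-cabin target locations.
raw  = [[0.1, 0.0], [0.5, 0.2], [0.9, 0.4], [0.3, 0.6]]
true = [[0.15, 0.05], [0.55, 0.25], [0.95, 0.45], [0.35, 0.65]]
calibrate = fit_affine_calibration(raw, true)
corrected = calibrate([0.5, 0.2])                 # apply in deployment
```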
-
Patent number: 11934955
Abstract: Systems and methods for more accurate and robust determination of subject characteristics from an image of the subject. One or more machine learning models receive as input an image of a subject, and output both facial landmarks and associated confidence values. Confidence values represent the degrees to which portions of the subject's face corresponding to those landmarks are occluded, i.e., the amount of uncertainty in the position of each landmark. These landmark points and their associated confidence values, and/or associated information, may then be input to another set of one or more machine learning models which may output any facial analysis quantity or quantities, such as the subject's gaze direction, head pose, drowsiness state, cognitive load, or distraction state.
Type: Grant
Filed: October 31, 2022
Date of Patent: March 19, 2024
Assignee: NVIDIA Corporation
Inventors: Nuri Murat Arar, Niranjan Avadhanam, Nishant Puri, Shagan Sah, Rajath Shetty, Sujay Yadawadkar, Pavlo Molchanov
-
Patent number: 11886634
Abstract: In various examples, systems and methods are disclosed that provide highly accurate gaze predictions that are specific to a particular user by generating and applying, in deployment, personalized calibration functions to outputs and/or layers of a machine learning model. The calibration functions corresponding to a specific user may operate on outputs (e.g., gaze predictions from a machine learning model) to provide updated values and gaze predictions. The calibration functions may also be applied to one or more last layers of the machine learning model to operate on features identified by the model and provide values that are more accurate. The calibration functions may be generated using explicit calibration methods by instructing users to gaze at a number of identified ground truth locations within the interior of the vehicle. Once generated, the calibration functions may be modified or refined through implicit gaze calibration points and/or regions based on gaze saliency maps.
Type: Grant
Filed: March 19, 2021
Date of Patent: January 30, 2024
Assignee: NVIDIA Corporation
Inventors: Nuri Murat Arar, Sujay Yadawadkar, Hairong Jiang, Nishant Puri, Niranjan Avadhanam
-
Patent number: 11841987
Abstract: Machine learning systems and methods that learn glare, and thus determine gaze direction in a manner more resilient to the effects of glare on input images. The machine learning systems have an isolated representation of glare, e.g., information on the locations of glare points in an image, as an explicit input, in addition to the image itself. In this manner, the machine learning systems explicitly consider glare while making a determination of gaze direction, thus producing more accurate results for images containing glare.
Type: Grant
Filed: May 23, 2022
Date of Patent: December 12, 2023
Assignee: NVIDIA Corporation
Inventors: Hairong Jiang, Nishant Puri, Niranjan Avadhanam, Nuri Murat Arar
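One straightforward way to give a model an isolated glare representation "in addition to the image itself" is to encode detected glare locations as an extra input channel. The channel layout below is a hedged illustration of that interface, not the patent's specific representation:

```python
import numpy as np

def add_glare_channel(image, glare_points):
    """Append an explicit glare-location channel to an eye image.

    The model then sees glare directly instead of having to infer it from pixels.
    """
    h, w = image.shape[:2]
    glare = np.zeros((h, w), dtype=image.dtype)
    for (r, c) in glare_points:
        glare[r, c] = 1.0                     # mark detected glare pixels
    return np.dstack([image, glare])          # (H, W, C + 1) network input

img = np.zeros((4, 4, 1), dtype=np.float32)   # toy single-channel eye crop
out = add_glare_channel(img, [(1, 2), (3, 0)])
```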
-
Publication number: 20230351807
Abstract: A machine learning model (MLM) may be trained and evaluated. Attribute-based performance metrics may be analyzed to identify attributes for which the MLM is performing below a threshold when each is present in a sample. A generative neural network (GNN) may be used to generate samples including compositions of the attributes, and the samples may be used to augment the data used to train the MLM. This may be repeated until one or more criteria are satisfied. In various examples, a temporal sequence of data items, such as frames of a video, may be generated which may form samples of the data set. Sets of attribute values may be determined based on one or more temporal scenarios to be represented in the data set, and one or more GNNs may be used to generate the sequence to depict information corresponding to the attribute values.
Type: Application
Filed: May 2, 2022
Publication date: November 2, 2023
Inventors: Yuzhuo Ren, Weili Nie, Arash Vahdat, Animashree Anandkumar, Nishant Puri, Niranjan Avadhanam
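The selection step above — finding attribute compositions where the model underperforms, to hand to a generative model — can be sketched with a crude heuristic. The product-of-accuracies proxy for joint performance below is invented purely for illustration; the publication analyzes measured attribute-based metrics, not this proxy:

```python
from itertools import combinations

def weak_compositions(attr_metrics, threshold=0.75):
    """Flag attribute pairs whose estimated joint accuracy falls below a threshold.

    Flagged compositions would be targets for generative augmentation.
    """
    weak = []
    for a, b in combinations(sorted(attr_metrics), 2):
        joint = attr_metrics[a] * attr_metrics[b]   # crude stand-in for a measured joint metric
        if joint < threshold:
            weak.append((a, b))
    return weak

metrics = {"night": 0.95, "rain": 0.9, "sunglasses": 0.6}
targets = weak_compositions(metrics)   # compositions to synthesize more samples for
```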
-
Publication number: 20230244941
Abstract: Systems and methods for determining the gaze direction of a subject and projecting this gaze direction onto specific regions of an arbitrary three-dimensional geometry. In an exemplary embodiment, gaze direction may be determined by a regression-based machine learning model. The determined gaze direction is then projected onto a three-dimensional map or set of surfaces that may represent any desired object or system. Maps may represent any three-dimensional layout or geometry, whether actual or virtual. Gaze vectors can thus be used to determine the object of gaze within any environment. Systems can also readily and efficiently adapt for use in different environments by retrieving a different set of surfaces or regions for each environment.
Type: Application
Filed: April 10, 2023
Publication date: August 3, 2023
Inventors: Nuri Murat Arar, Hairong Jiang, Nishant Puri, Rajath Shetty, Niranjan Avadhanam
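Projecting a gaze vector onto a set of named surfaces amounts to casting a ray and reporting the first region it hits. The sketch below models each region as a plane (point plus normal) and picks the nearest forward intersection; the region names and the flat-plane simplification are illustrative assumptions, since the abstract allows arbitrary 3D geometry:

```python
import numpy as np

def gaze_region(origin, gaze, regions):
    """Name of the region the gaze ray hits first (nearest forward plane intersection)."""
    o = np.asarray(origin, float)
    d = np.asarray(gaze, float)
    d = d / np.linalg.norm(d)
    best, best_t = None, np.inf
    for name, (point, normal) in regions.items():
        n = np.asarray(normal, float)
        denom = d @ n
        if abs(denom) < 1e-9:
            continue                              # ray parallel to this surface
        t = ((np.asarray(point, float) - o) @ n) / denom
        if 0 < t < best_t:                        # keep the nearest hit in front of the eye
            best, best_t = name, t
    return best

# Toy environment map: two planar regions at different depths along +z.
regions = {
    "mirror":     ([0, 0, 1.0], [0, 0, 1]),
    "windshield": ([0, 0, 2.0], [0, 0, 1]),
}
hit = gaze_region([0, 0, 0], [0, 0, 1], regions)
```

Swapping in a different `regions` map is what lets the same gaze pipeline adapt to a new environment.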
-
Patent number: 11704814
Abstract: In various examples, an adaptive eye tracking machine learning model engine (“adaptive-model engine”) for an eye tracking system is described. The adaptive-model engine may include an eye tracking or gaze tracking development pipeline (“adaptive-model training pipeline”) that supports collecting data, training, optimizing, and deploying an adaptive eye tracking model that is a customized eye tracking model based on a set of features of an identified deployment environment. The adaptive-model engine supports ensembling the adaptive eye tracking model, which may be trained on gaze vector estimation in surround environments and ensembled based on a plurality of eye tracking variant models and a plurality of facial landmark neural network metrics.
Type: Grant
Filed: May 13, 2021
Date of Patent: July 18, 2023
Assignee: NVIDIA Corporation
Inventors: Nuri Murat Arar, Niranjan Avadhanam, Hairong Jiang, Nishant Puri, Rajath Shetty, Shagan Sah
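Ensembling several eye-tracking variant models can be as simple as a weighted average of their gaze estimates, with weights derived from per-model quality metrics. The uniform-looking weighted mean below is a generic sketch; how the patent actually combines variant models and landmark metrics is not specified here:

```python
import numpy as np

def ensemble_gaze(predictions, weights):
    """Weighted ensemble of (yaw, pitch) gaze estimates from several variant models."""
    P = np.asarray(predictions, float)    # (M, 2): one row per variant model
    w = np.asarray(weights, float)
    w = w / w.sum()                       # normalize, e.g. landmark-quality scores
    return w @ P                          # weighted mean gaze estimate

# Two variant models; the second is trusted more (higher metric).
preds = [[0.2, 0.1], [0.4, 0.3]]
gaze = ensemble_gaze(preds, weights=[1.0, 3.0])
```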
-
Publication number: 20230206488
Abstract: Machine learning systems and methods that determine gaze direction by using face orientation information, such as facial landmarks, to modify eye direction information determined from images of the subject's eyes. System inputs include eye crops of the eyes of the subject, as well as face orientation information such as facial landmarks of the subject's face in the input image. Facial orientation information, or facial landmark information, is used to determine a coarse prediction of gaze direction as well as to learn a context vector of features describing subject face pose. The context vector is then used to adaptively re-weight the eye direction features determined from the eye crops. The re-weighted features are then combined with the coarse gaze prediction to determine gaze direction.
Type: Application
Filed: March 10, 2023
Publication date: June 29, 2023
Inventor: Nishant Puri
-
Patent number: 11688074
Abstract: In various examples, a background of an object may be modified to generate a training image. A segmentation mask may be generated and used to generate an object image that includes image data representing the object. The object image may be integrated into a different background and used for data augmentation in training a neural network. Data augmentation may also be performed using hue adjustment (e.g., of the object image) and/or rendering three-dimensional capture data that corresponds to the object from selected views. Inference scores may be analyzed to select a background for an image to be included in a training dataset. Backgrounds may be selected and training images may be added to a training dataset iteratively during training (e.g., between epochs). Additionally, early or late fusion may be employed that uses object mask data to improve inferencing performed by a neural network trained using object mask data.
Type: Grant
Filed: September 30, 2020
Date of Patent: June 27, 2023
Assignee: NVIDIA Corporation
Inventors: Nishant Puri, Sakthivel Sivaraman, Rajath Shetty, Niranjan Avadhanam
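The core compositing step — use the segmentation mask to cut the object out and paste it over a new background — is a standard alpha-style blend. The minimal sketch below assumes a binary mask and matching image shapes; it illustrates the operation, not the patent's full augmentation pipeline:

```python
import numpy as np

def composite(object_img, mask, background):
    """Paste a mask-selected object onto a new background for data augmentation."""
    m = mask[..., None].astype(object_img.dtype)   # (H, W, 1), 1 = object, 0 = background
    return m * object_img + (1 - m) * background   # keep object pixels, swap the rest

obj = np.full((2, 2, 3), 200, dtype=np.float32)    # toy "object" image
bg = np.full((2, 2, 3), 10, dtype=np.float32)      # toy replacement background
mask = np.array([[1, 0], [0, 1]])                  # binary segmentation mask
out = composite(obj, mask, bg)
```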
-
Patent number: 11657263
Abstract: Systems and methods for determining the gaze direction of a subject and projecting this gaze direction onto specific regions of an arbitrary three-dimensional geometry. In an exemplary embodiment, gaze direction may be determined by a regression-based machine learning model. The determined gaze direction is then projected onto a three-dimensional map or set of surfaces that may represent any desired object or system. Maps may represent any three-dimensional layout or geometry, whether actual or virtual. Gaze vectors can thus be used to determine the object of gaze within any environment. Systems can also readily and efficiently adapt for use in different environments by retrieving a different set of surfaces or regions for each environment.
Type: Grant
Filed: August 28, 2020
Date of Patent: May 23, 2023
Assignee: NVIDIA Corporation
Inventors: Nuri Murat Arar, Hairong Jiang, Nishant Puri, Rajath Shetty, Niranjan Avadhanam
-
Patent number: 11636609
Abstract: Machine learning systems and methods that determine gaze direction by using face orientation information, such as facial landmarks, to modify eye direction information determined from images of the subject's eyes. System inputs include eye crops of the eyes of the subject, as well as face orientation information such as facial landmarks of the subject's face in the input image. Facial orientation information, or facial landmark information, is used to determine a coarse prediction of gaze direction as well as to learn a context vector of features describing subject face pose. The context vector is then used to adaptively re-weight the eye direction features determined from the eye crops. The re-weighted features are then combined with the coarse gaze prediction to determine gaze direction.
Type: Grant
Filed: September 2, 2020
Date of Patent: April 25, 2023
Assignee: NVIDIA Corporation
Inventor: Nishant Puri
-
Publication number: 20230078171
Abstract: Systems and methods for more accurate and robust determination of subject characteristics from an image of the subject. One or more machine learning models receive as input an image of a subject, and output both facial landmarks and associated confidence values. Confidence values represent the degrees to which portions of the subject's face corresponding to those landmarks are occluded, i.e., the amount of uncertainty in the position of each landmark. These landmark points and their associated confidence values, and/or associated information, may then be input to another set of one or more machine learning models which may output any facial analysis quantity or quantities, such as the subject's gaze direction, head pose, drowsiness state, cognitive load, or distraction state.
Type: Application
Filed: October 31, 2022
Publication date: March 16, 2023
Inventors: Nuri Murat Arar, Niranjan Avadhanam, Nishant Puri, Shagan Sah, Rajath Shetty, Sujay Yadawadkar, Pavlo Molchanov
-
Publication number: 20230064049
Abstract: Interactions with virtual systems may be difficult when users inadvertently fail to provide sufficient information to proceed with their requests. Certain types of inputs, such as auditory inputs, may lack sufficient information to properly provide a response to the user. Additional information, such as image data, may enable user gestures or poses to supplement the auditory inputs to enable response generation without requesting additional information from users.
Type: Application
Filed: August 31, 2021
Publication date: March 2, 2023
Inventors: Sakthivel Sivaraman, Nishant Puri, Yuzhuo Ren, Atousa Torabi, Shubhadeep Das, Niranjan Avadhanam, Sumit Kumar Bhattacharya, Jason Roche
-
Publication number: 20220366568
Abstract: In various examples, an adaptive eye tracking machine learning model engine (“adaptive-model engine”) for an eye tracking system is described. The adaptive-model engine may include an eye tracking or gaze tracking development pipeline (“adaptive-model training pipeline”) that supports collecting data, training, optimizing, and deploying an adaptive eye tracking model that is a customized eye tracking model based on a set of features of an identified deployment environment. The adaptive-model engine supports ensembling the adaptive eye tracking model, which may be trained on gaze vector estimation in surround environments and ensembled based on a plurality of eye tracking variant models and a plurality of facial landmark neural network metrics.
Type: Application
Filed: May 13, 2021
Publication date: November 17, 2022
Inventors: Nuri Murat Arar, Niranjan Avadhanam, Hairong Jiang, Nishant Puri, Rajath Shetty, Shagan Sah
-
Patent number: 11487968
Abstract: Systems and methods for more accurate and robust determination of subject characteristics from an image of the subject. One or more machine learning models receive as input an image of a subject, and output both facial landmarks and associated confidence values. Confidence values represent the degrees to which portions of the subject's face corresponding to those landmarks are occluded, i.e., the amount of uncertainty in the position of each landmark. These landmark points and their associated confidence values, and/or associated information, may then be input to another set of one or more machine learning models which may output any facial analysis quantity or quantities, such as the subject's gaze direction, head pose, drowsiness state, cognitive load, or distraction state.
Type: Grant
Filed: August 27, 2020
Date of Patent: November 1, 2022
Assignee: NVIDIA Corporation
Inventors: Nuri Murat Arar, Niranjan Avadhanam, Nishant Puri, Shagan Sah, Rajath Shetty, Sujay Yadawadkar, Pavlo Molchanov