Patents by Inventor Stephen Walter Tyree

Stephen Walter Tyree has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20240005547
    Abstract: Apparatuses, systems, and techniques to determine a pose of an object from a plurality of images. In at least one embodiment, the pose of an object is determined from at least two images of a video sequence using one or more neural networks, which produce a distribution of pose information that is filtered to determine the current pose. (An illustrative sketch of this kind of temporal pose filtering appears after this listing.)
    Type: Application
    Filed: May 23, 2022
    Publication date: January 4, 2024
    Inventors: Yunzhi Lin, Jonathan Tremblay, Stephen Walter Tyree, Stanley Thomas Birchfield
  • Publication number: 20220277472
    Abstract: Apparatuses, systems, and techniques to determine a pose and relative dimensions of an object from an image. In at least one embodiment, a pose and relative dimensions of an object are determined from an image based at least in part on, for example, features of the image.
    Type: Application
    Filed: September 9, 2021
    Publication date: September 1, 2022
    Inventors: Stanley Thomas Birchfield, Jonathan Tremblay, Yunzhi Lin, Stephen Walter Tyree
  • Patent number: 11417063
    Abstract: One or more images (e.g., images taken from one or more cameras) may be received, where each of the one or more images may depict a two-dimensional (2D) view of a three-dimensional (3D) scene. Additionally, the one or more images may be utilized to determine a three-dimensional (3D) representation of a scene. This representation may help an entity navigate an environment represented by the 3D scene.
    Type: Grant
    Filed: February 22, 2021
    Date of Patent: August 16, 2022
    Assignee: NVIDIA Corporation
    Inventors: Yunzhi Lin, Jonathan Tremblay, Stephen Walter Tyree, Stanley Thomas Birchfield
  • Patent number: 11315018
    Abstract: A method, computer readable medium, and system are disclosed for neural network pruning. The method includes the steps of receiving first-order gradients of a cost function relative to layer parameters for a trained neural network and computing a pruning criterion for each layer parameter based on the first-order gradient corresponding to the layer parameter, where the pruning criterion indicates an importance of each neuron that is included in the trained neural network and is associated with the layer parameter. The method includes the additional steps of identifying at least one neuron having a lowest importance and removing the at least one neuron from the trained neural network to produce a pruned neural network. (A sketch of such a first-order pruning criterion appears after this listing.)
    Type: Grant
    Filed: October 17, 2017
    Date of Patent: April 26, 2022
    Assignee: NVIDIA Corporation
    Inventors: Pavlo Molchanov, Stephen Walter Tyree, Tero Tapani Karras, Timo Oskari Aila, Jan Kautz
  • Publication number: 20220068024
    Abstract: One or more images (e.g., images taken from one or more cameras) may be received, where each of the one or more images may depict a two-dimensional (2D) view of a three-dimensional (3D) scene. Additionally, the one or more images may be utilized to determine a three-dimensional (3D) representation of a scene. This representation may help an entity navigate an environment represented by the 3D scene.
    Type: Application
    Filed: February 22, 2021
    Publication date: March 3, 2022
    Inventors: Yunzhi Lin, Jonathan Tremblay, Stephen Walter Tyree, Stanley Thomas Birchfield
  • Publication number: 20200311855
    Abstract: Pose estimation generally refers to a computer vision technique that determines the pose of some object, usually with respect to a particular camera. Pose estimation has many applications, but is particularly useful in the context of robotic manipulation systems. To date, robotic manipulation systems have required a camera to be installed on the robot itself (i.e., a camera-in-hand) for capturing images of the object and/or a camera external to the robot for capturing images of the object. Unfortunately, the camera-in-hand has a limited field of view for capturing objects, whereas the external camera, which may have a greater field of view, requires costly calibration each time the camera is even slightly moved. Similar issues apply when estimating the pose of any object with respect to another object (which may itself be moving or stationary). The present disclosure avoids these issues and provides object-to-object pose estimation from a single image.
    Type: Application
    Filed: June 15, 2020
    Publication date: October 1, 2020
    Inventors: Jonathan Tremblay, Stephen Walter Tyree, Stanley Thomas Birchfield
  • Patent number: 10783394
    Abstract: A method, computer readable medium, and system are disclosed to generate coordinates of landmarks within images. The landmark locations may be identified on an image of a human face and used for emotion recognition, face identity verification, eye gaze tracking, pose estimation, etc. A transform is applied to input image data to produce transformed input image data. The transform is also applied to predicted coordinates for landmarks of the input image data to produce transformed predicted coordinates. A neural network model processes the transformed input image data to generate additional landmarks of the transformed input image data and additional predicted coordinates for each one of the additional landmarks. Parameters of the neural network model are updated to reduce differences between the transformed predicted coordinates and the additional predicted coordinates. (A sketch of such a transform-consistency objective appears after this listing.)
    Type: Grant
    Filed: June 12, 2018
    Date of Patent: September 22, 2020
    Assignee: NVIDIA Corporation
    Inventors: Pavlo Molchanov, Stephen Walter Tyree, Jan Kautz, Sina Honari
  • Patent number: 10783393
    Abstract: A method, computer readable medium, and system are disclosed for sequential multi-tasking to generate coordinates of landmarks within images. The landmark locations may be identified on an image of a human face and used for emotion recognition, face identity verification, eye gaze tracking, pose estimation, etc. A neural network model processes input image data to generate pixel-level likelihood estimates for landmarks in the input image data and a soft-argmax function computes predicted coordinates of each landmark based on the pixel-level likelihood estimates. (A sketch of the soft-argmax step appears after this listing.)
    Type: Grant
    Filed: June 12, 2018
    Date of Patent: September 22, 2020
    Assignee: NVIDIA Corporation
    Inventors: Pavlo Molchanov, Stephen Walter Tyree, Jan Kautz, Sina Honari
  • Publication number: 20180365512
    Abstract: A method, computer readable medium, and system are disclosed to generate coordinates of landmarks within images. The landmark locations may be identified on an image of a human face and used for emotion recognition, face identity verification, eye gaze tracking, pose estimation, etc. A transform is applied to input image data to produce transformed input image data. The transform is also applied to predicted coordinates for landmarks of the input image data to produce transformed predicted coordinates. A neural network model processes the transformed input image data to generate additional landmarks of the transformed input image data and additional predicted coordinates for each one of the additional landmarks. Parameters of the neural network model are updated to reduce differences between the transformed predicted coordinates and the additional predicted coordinates.
    Type: Application
    Filed: June 12, 2018
    Publication date: December 20, 2018
    Inventors: Pavlo Molchanov, Stephen Walter Tyree, Jan Kautz, Sina Honari
  • Publication number: 20180365532
    Abstract: A method, computer readable medium, and system are disclosed for sequential multi-tasking to generate coordinates of landmarks within images. The landmark locations may be identified on an image of a human face and used for emotion recognition, face identity verification, eye gaze tracking, pose estimation, etc. A neural network model processes input image data to generate pixel-level likelihood estimates for landmarks in the input image data and a soft-argmax function computes predicted coordinates of each landmark based on the pixel-level likelihood estimates.
    Type: Application
    Filed: June 12, 2018
    Publication date: December 20, 2018
    Inventors: Pavlo Molchanov, Stephen Walter Tyree, Jan Kautz, Sina Honari
  • Patent number: 10157309
    Abstract: A method, computer readable medium, and system are disclosed for detecting and classifying hand gestures. The method includes the steps of receiving an unsegmented stream of data associated with a hand gesture, extracting spatio-temporal features from the unsegmented stream by a three-dimensional convolutional neural network (3DCNN), and producing a class label for the hand gesture based on the spatio-temporal features. (A sketch of a 3D convolutional gesture classifier appears after this listing.)
    Type: Grant
    Filed: January 9, 2017
    Date of Patent: December 18, 2018
    Assignee: NVIDIA Corporation
    Inventors: Pavlo Molchanov, Xiaodong Yang, Shalini De Mello, Kihwan Kim, Stephen Walter Tyree, Jan Kautz
  • Publication number: 20180114114
    Abstract: A method, computer readable medium, and system are disclosed for neural network pruning. The method includes the steps of receiving first-order gradients of a cost function relative to layer parameters for a trained neural network and computing a pruning criterion for each layer parameter based on the first-order gradient corresponding to the layer parameter, where the pruning criterion indicates an importance of each neuron that is included in the trained neural network and is associated with the layer parameter. The method includes the additional steps of identifying at least one neuron having a lowest importance and removing the at least one neuron from the trained neural network to produce a pruned neural network.
    Type: Application
    Filed: October 17, 2017
    Publication date: April 26, 2018
    Inventors: Pavlo Molchanov, Stephen Walter Tyree, Tero Tapani Karras, Timo Oskari Aila, Jan Kautz
  • Publication number: 20170206405
    Abstract: A method, computer readable medium, and system are disclosed for detecting and classifying hand gestures. The method includes the steps of receiving an unsegmented stream of data associated with a hand gesture, extracting spatio-temporal features from the unsegmented stream by a three-dimensional convolutional neural network (3DCNN), and producing a class label for the hand gesture based on the spatio-temporal features.
    Type: Application
    Filed: January 9, 2017
    Publication date: July 20, 2017
    Inventors: Pavlo Molchanov, Xiaodong Yang, Shalini De Mello, Kihwan Kim, Stephen Walter Tyree, Jan Kautz
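
Publication 20240005547 describes estimating an object's pose from at least two video frames with one or more neural networks and filtering the resulting distribution of pose information. The abstract does not specify the network or the filter, so the sketch below is only a minimal illustration: it assumes a hypothetical `pose_net(frame)` that returns weighted pose hypotheses and applies simple exponential smoothing across frames.

```python
# Minimal illustrative sketch (not the patented method). Assumes a hypothetical
# `pose_net(frame)` that returns a list of (translation, quaternion, weight)
# pose hypotheses for the tracked object in one video frame.
import numpy as np

def select_pose(hypotheses):
    """Pick the highest-weighted hypothesis from the predicted pose distribution."""
    t, q, _ = max(hypotheses, key=lambda h: h[2])
    return np.asarray(t, dtype=float), np.asarray(q, dtype=float)

def smooth_pose(prev, curr, alpha=0.8):
    """Exponentially smooth translation and rotation (quaternion) across frames."""
    if prev is None:
        return curr
    t = alpha * curr[0] + (1.0 - alpha) * prev[0]
    q = alpha * curr[1] + (1.0 - alpha) * prev[1]
    return t, q / np.linalg.norm(q)

def track(frames, pose_net):
    """Yield a filtered 6-DoF pose estimate for each frame of a video sequence."""
    pose = None
    for frame in frames:
        pose = smooth_pose(pose, select_pose(pose_net(frame)))
        yield pose
```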
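
Patent 11315018 (and the corresponding publication 20180114114) describes computing a pruning criterion for each layer parameter from first-order gradients of the cost function and removing the least important neurons. The sketch below shows one common way to realize such a criterion, scoring each output neuron by the summed |parameter × gradient| of its weights; the exact criterion and pruning schedule in the patent may differ.

```python
# Sketch of a first-order pruning criterion: score each output neuron by
# sum_j |w_ij * dC/dw_ij| and mask the lowest-scoring neuron.
import torch
import torch.nn as nn

def neuron_importance(layer: nn.Linear) -> torch.Tensor:
    """First-order importance per output neuron of a linear layer."""
    assert layer.weight.grad is not None, "run loss.backward() first"
    return (layer.weight * layer.weight.grad).abs().sum(dim=1)

def prune_least_important(layer: nn.Linear) -> int:
    """Zero the weights (and bias) of the least important output neuron."""
    scores = neuron_importance(layer)
    idx = int(scores.argmin())
    with torch.no_grad():
        layer.weight[idx].zero_()
        if layer.bias is not None:
            layer.bias[idx].zero_()
    return idx

# Usage on a toy network: compute gradients of the cost, then prune one neuron.
model = nn.Sequential(nn.Linear(8, 16), nn.ReLU(), nn.Linear(16, 2))
x, y = torch.randn(32, 8), torch.randint(0, 2, (32,))
loss = nn.functional.cross_entropy(model(x), y)
loss.backward()
pruned = prune_least_important(model[0])
print(f"pruned neuron {pruned} of the first layer")
```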
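
Patent 10783394 (publication 20180365512) describes applying a transform to both the input image and the coordinates predicted for it, then updating the model to reduce the difference between the transformed predictions and the predictions made on the transformed image. The sketch below illustrates that consistency objective with a simple pixel translation as the transform; `landmark_net` is a hypothetical model returning normalized (x, y) coordinates in [-1, 1], and the patented method may differ in detail.

```python
# Illustrative transform-consistency (equivariance) loss for landmark prediction.
import torch

def shift_image(img: torch.Tensor, dx: int) -> torch.Tensor:
    """Translate a (N, C, H, W) image dx pixels to the right (wrap-around padding)."""
    return torch.roll(img, shifts=dx, dims=-1)

def shift_coords(coords: torch.Tensor, dx: int, width: int) -> torch.Tensor:
    """Apply the same translation to normalized (N, K, 2) landmark coordinates."""
    shifted = coords.clone()
    shifted[..., 0] = shifted[..., 0] + 2.0 * dx / width
    return shifted

def consistency_loss(landmark_net, img: torch.Tensor, dx: int = 8) -> torch.Tensor:
    """Difference between transformed predictions and predictions on the transformed image."""
    coords = landmark_net(img)                              # (N, K, 2) on the original image
    coords_t = shift_coords(coords, dx, img.shape[-1])      # transform the predictions
    coords_on_t = landmark_net(shift_image(img, dx))        # predict on the transformed image
    return torch.nn.functional.mse_loss(coords_on_t, coords_t)
```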
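
Patent 10783393 (publication 20180365532) describes a soft-argmax function that converts pixel-level likelihood estimates into predicted landmark coordinates. The sketch below shows a standard differentiable soft-argmax over per-landmark heatmaps; the network that produces the heatmaps, and the sequential multi-tasking described in the abstract, are not shown.

```python
# Differentiable soft-argmax: expected pixel position under a softmax over the heatmap.
import torch

def soft_argmax(heatmaps: torch.Tensor) -> torch.Tensor:
    """heatmaps: (N, K, H, W) logits -> normalized (x, y) coordinates of shape (N, K, 2)."""
    n, k, h, w = heatmaps.shape
    probs = torch.softmax(heatmaps.reshape(n, k, -1), dim=-1).reshape(n, k, h, w)
    ys = torch.linspace(-1.0, 1.0, h, device=heatmaps.device)
    xs = torch.linspace(-1.0, 1.0, w, device=heatmaps.device)
    exp_y = (probs.sum(dim=3) * ys).sum(dim=2)   # expectation over rows
    exp_x = (probs.sum(dim=2) * xs).sum(dim=2)   # expectation over columns
    return torch.stack([exp_x, exp_y], dim=-1)
```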
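
Patent 10157309 (publication 20170206405) describes extracting spatio-temporal features from an unsegmented data stream with a three-dimensional convolutional neural network (3DCNN) and producing a class label for the hand gesture. The toy model below shows the basic 3D-convolution idea on a fixed-length window of frames; the patented network and its handling of unsegmented streams are more involved.

```python
# Toy 3DCNN gesture classifier over a short window of video frames.
import torch
import torch.nn as nn

class Gesture3DCNN(nn.Module):
    def __init__(self, num_classes: int = 10):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv3d(3, 16, kernel_size=3, padding=1), nn.ReLU(),
            nn.MaxPool3d(2),                               # halve time and space
            nn.Conv3d(16, 32, kernel_size=3, padding=1), nn.ReLU(),
            nn.AdaptiveAvgPool3d(1),                       # global spatio-temporal pooling
        )
        self.classifier = nn.Linear(32, num_classes)

    def forward(self, clip: torch.Tensor) -> torch.Tensor:
        """clip: (N, C, T, H, W) stack of frames -> (N, num_classes) logits."""
        return self.classifier(self.features(clip).flatten(1))

# Sliding such a window over an unsegmented stream yields per-window class scores.
model = Gesture3DCNN()
logits = model(torch.randn(1, 3, 16, 64, 64))   # one 16-frame RGB window
print(logits.argmax(dim=1))
```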