Patents by Inventor Tanmay Batra
Tanmay Batra has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 12211307
Abstract: In one implementation, a method of body pose estimation is performed at a device including one or more processors and non-transitory memory. The method includes obtaining a plurality of two-dimensional images of a body in a three-dimensional environment at a respective plurality of times. The method includes determining, for each of the plurality of two-dimensional images, the two-dimensional location in the two-dimensional image of one or more joints of the body at the respective plurality of times. The method includes determining, based on the two-dimensional locations, a plurality of three-dimensional locations in the three-dimensional environment of the one or more joints of the body at the respective plurality of times. The method includes determining, based on the three-dimensional locations, a plurality of updated three-dimensional locations in the three-dimensional environment of the one or more joints of the body at the respective plurality of times.
Type: Grant
Filed: March 16, 2022
Date of Patent: January 28, 2025
Assignee: APPLE INC.
Inventors: Tanmay Batra, Bharath Kumar Comandur Jagannathan Raghunathan, Stefano Alletto
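The abstract describes a lift-then-refine pipeline: per-frame 2D joint detections are turned into 3D joint locations, which are then updated over time. The sketch below illustrates one plausible reading in Python; the pinhole back-projection, the per-joint depth estimates, and the exponential smoothing used for the update step are assumptions, since the abstract does not say how the 3D locations are computed or refined.

```python
# Minimal sketch of a lift-then-refine body pose pipeline (assumed details).
import numpy as np

def backproject(joint_2d, depth, fx, fy, cx, cy):
    """Back-project a 2D joint (u, v) with an estimated depth into camera space."""
    u, v = joint_2d
    x = (u - cx) * depth / fx
    y = (v - cy) * depth / fy
    return np.array([x, y, depth])

def lift_sequence(joints_2d_per_frame, depths_per_frame, intrinsics):
    """Per-frame 2D joint locations -> per-frame 3D joint locations."""
    fx, fy, cx, cy = intrinsics
    return np.array([
        [backproject(j, d, fx, fy, cx, cy) for j, d in zip(frame, depths)]
        for frame, depths in zip(joints_2d_per_frame, depths_per_frame)
    ])  # shape: (num_frames, num_joints, 3)

def refine_sequence(joints_3d, alpha=0.6):
    """Update the 3D locations over time with simple exponential smoothing."""
    refined = np.copy(joints_3d)
    for t in range(1, len(refined)):
        refined[t] = alpha * joints_3d[t] + (1 - alpha) * refined[t - 1]
    return refined
```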
-
Patent number: 12033058
Abstract: In some implementations, initially training a first neural network includes inputting the training inputs and corresponding training labels into the first neural network to produce output labels, comparing the output labels to the corresponding training labels using a second neural network that learns and applies a comparison metric, and adjusting parameters of the first neural network based on the comparing. The device then inputs additional inputs into the first neural network to produce additional output labels and corresponding confidence values from the second neural network. The device selects, based on the confidence values, an automatically-labeled training set of data including a subset of the additional inputs and a corresponding subset of the additional output labels. During a second training stage, the device trains the first neural network and the second neural network using the automatically-labeled training set of data.
Type: Grant
Filed: May 24, 2019
Date of Patent: July 9, 2024
Assignee: Apple Inc.
Inventors: Peter Meier, Tanmay Batra
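Read loosely, this is a two-stage, self-labeling scheme: a task network is first trained against hand labels through a second network that learns the comparison metric, that second network then scores the task network's predictions on unlabeled inputs, and the confident pairs become an automatically labeled training set. The PyTorch sketch below is only one possible reading; the tiny network shapes, the exp(-error) agreement target used to train the metric network, and the 0.9 confidence cutoff are assumptions, not details from the patent.

```python
import torch
import torch.nn as nn

task_net = nn.Sequential(nn.Linear(16, 32), nn.ReLU(), nn.Linear(32, 4))
metric_net = nn.Sequential(nn.Linear(16 + 4, 32), nn.ReLU(), nn.Linear(32, 1), nn.Sigmoid())
opt = torch.optim.Adam([*task_net.parameters(), *metric_net.parameters()], lr=1e-3)

def train_step(x, y):
    """One training step: the second network compares output labels to labels."""
    pred = task_net(x)
    score = metric_net(torch.cat([x, pred], dim=-1)).squeeze(-1)      # learned comparison metric
    agreement = torch.exp(-((pred - y) ** 2).mean(dim=-1)).detach()   # assumed target for the metric
    metric_loss = ((score - agreement) ** 2).mean()                   # metric net learns to predict agreement
    task_loss = (1.0 - score).mean()                                  # task net adjusted based on the comparing
    opt.zero_grad()
    (metric_loss + task_loss).backward()
    opt.step()

def select_auto_labeled(unlabeled_x, threshold=0.9):
    """Keep additional inputs whose predictions the metric network scores as confident."""
    with torch.no_grad():
        pred = task_net(unlabeled_x)
        confidence = metric_net(torch.cat([unlabeled_x, pred], dim=-1)).squeeze(-1)
        keep = confidence > threshold
    return unlabeled_x[keep], pred[keep]
```

In this reading, the same train_step would be reused in the second stage, this time fed with the (input, pseudo-label) pairs returned by select_auto_labeled.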
-
Patent number: 11972607
Abstract: In one implementation, a method of generating a plane hypothesis is performed by a device including one or more processors, non-transitory memory, and a scene camera. The method includes obtaining an image of a scene including a plurality of pixels. The method includes obtaining a plurality of points of a point cloud based on the image of the scene. The method includes obtaining an object classification set based on the image of the scene. Each element of the object classification set includes a plurality of pixels respectively associated with a corresponding object in the scene. The method includes detecting a plane within the scene by identifying a subset of the plurality of points of the point cloud that correspond to a particular element of the object classification set.
Type: Grant
Filed: February 18, 2023
Date of Patent: April 30, 2024
Assignee: APPLE INC.
Inventors: Daniel Ulbricht, Angela Blechschmidt, Mohammad Haris Baig, Tanmay Batra, Eshan Verma, Amit Kumar Kc
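The key step is restricting the plane search to point-cloud points whose pixels were classified as a single object. A minimal NumPy sketch of that idea follows; the boolean class mask, the pixel-to-point correspondence array, and the SVD plane fit are assumptions made for illustration (a RANSAC fit would be the more robust choice in practice).

```python
import numpy as np

def fit_plane(points):
    """Least-squares plane through 3D points; returns (unit normal, point on plane)."""
    centroid = points.mean(axis=0)
    _, _, vt = np.linalg.svd(points - centroid)
    normal = vt[-1]                                 # direction of least variance
    return normal, centroid

def plane_for_object(point_cloud, pixel_of_point, class_mask):
    """Detect a plane from the subset of points whose pixels belong to one classified object.

    point_cloud:    (N, 3) points derived from the image
    pixel_of_point: (N, 2) integer (row, col) pixel each point projects to
    class_mask:     (H, W) boolean mask for one element of the object classification set
    """
    rows, cols = pixel_of_point[:, 0], pixel_of_point[:, 1]
    selected = point_cloud[class_mask[rows, cols]]  # points projecting into the object's pixels
    if len(selected) < 3:
        return None
    return fit_plane(selected)
```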
-
Patent number: 11954881
Abstract: In some implementations, a neural network is trained to perform a main task using a clustering constraint, for example, using both a main task training loss and a clustering training loss. Training inputs are inputted into a main task neural network to produce output labels predicting locations of the parts of the objects in the training inputs. Data from pooled layers of the main task neural network is inputted into a clustering neural network. The main task neural network and the clustering neural network are trained based on a main task loss from the main task neural network and a clustering loss from the clustering neural network. The main task loss is determined by comparing differences between the output labels and the training labels. The clustering loss encourages the clustering network to learn to label the parts of the objects individually, e.g., to learn groups corresponding to the object parts.
Type: Grant
Filed: July 17, 2019
Date of Patent: April 9, 2024
Assignee: Apple Inc.
Inventors: Peter Meier, Tanmay Batra
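One way to picture the joint objective is a main-task loss plus a clustering loss computed on pooled features, both backpropagated together. The PyTorch sketch below uses an entropy-style clustering surrogate (confident per-sample assignments, balanced groups across the batch); that surrogate, the loss weight, and the layer sizes are assumptions rather than the patented formulation.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

backbone = nn.Sequential(nn.Linear(32, 64), nn.ReLU())   # produces pooled features
head = nn.Linear(64, 10)                                  # main task: predict object part locations
cluster_net = nn.Linear(64, 8)                            # soft assignment to 8 part groups
opt = torch.optim.Adam(
    [*backbone.parameters(), *head.parameters(), *cluster_net.parameters()], lr=1e-3)

def train_step(x, y):
    pooled = backbone(x)
    main_loss = F.mse_loss(head(pooled), y)               # compare output labels with training labels
    assign = F.softmax(cluster_net(pooled), dim=-1)
    # Clustering loss: push each sample toward one confident group while keeping
    # groups balanced across the batch (a common surrogate for "learning groups").
    per_sample_entropy = -(assign * assign.clamp_min(1e-8).log()).sum(dim=-1).mean()
    mean_assign = assign.mean(dim=0)
    balance_entropy = (mean_assign * mean_assign.clamp_min(1e-8).log()).sum()
    clustering_loss = per_sample_entropy + balance_entropy
    loss = main_loss + 0.1 * clustering_loss
    opt.zero_grad(); loss.backward(); opt.step()
    return loss.item()
```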
-
Patent number: 11783552
Abstract: In one implementation, a method of including a person in a CGR experience or excluding the person from the CGR experience is performed by a device including one or more processors, non-transitory memory, and a scene camera. The method includes, while presenting a CGR experience, capturing an image of a scene; detecting, in the image of the scene, a person; and determining an identity of the person. The method includes determining, based on the identity of the person, whether to include the person in the CGR experience or exclude the person from the CGR experience. The method includes presenting the CGR experience based on the determination.
Type: Grant
Filed: December 21, 2021
Date of Patent: October 10, 2023
Assignee: APPLE INC.
Inventors: Daniel Ulbricht, Amit Kumar K C, Angela Blechschmidt, Chen-Yu Lee, Eshan Verma, Mohammad Haris Baig, Tanmay Batra
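At a high level the decision loop is: detect each person in the captured image, work out who they are, check that identity against some policy, and render accordingly. The sketch below is only a skeleton of that loop; the detector, the identity embedding, the allow-list of embeddings, and the cosine-similarity threshold are all placeholder assumptions.

```python
import numpy as np

def cosine(a, b):
    return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b) + 1e-8))

def present_cgr_frame(image, detect_people, embed_identity, allowed_embeddings,
                      render_with_person, render_without_person, threshold=0.7):
    """Per frame: detect people, identify them, and include or exclude each one."""
    frame = image
    for person_region in detect_people(image):
        identity = embed_identity(image, person_region)
        included = any(cosine(identity, ref) > threshold for ref in allowed_embeddings)
        if included:
            frame = render_with_person(frame, person_region)     # keep the person in the CGR experience
        else:
            frame = render_without_person(frame, person_region)  # e.g. occlude or replace the person
    return frame
```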
-
Patent number: 11710283
Abstract: Various implementations disclosed herein include devices, systems, and methods that enable faster and more efficient real-time physical object recognition, information retrieval, and updating of a CGR environment. In some implementations, the CGR environment is provided at a first device based on a classification of the physical object, image or video data including the physical object is transmitted by the first device to a second device, and the CGR environment is updated by the first device based on a response associated with the physical object received from the second device.
Type: Grant
Filed: October 22, 2021
Date of Patent: July 25, 2023
Assignee: Apple Inc.
Inventors: Eshan Verma, Daniel Ulbricht, Angela Blechschmidt, Mohammad Haris Baig, Chen-Yu Lee, Tanmay Batra
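The abstract splits the work between two devices: a quick local classification drives the CGR environment immediately, while the image is sent to a second device whose response later refines it. The Python sketch below captures that control flow only; the callback names and the use of a background thread are assumptions, not details from the patent.

```python
import threading

def run_recognition(image, classify_locally, send_to_second_device, update_environment):
    """Provide the CGR environment from a local classification, then refine it."""
    coarse_class = classify_locally(image)          # fast, on-device classification
    update_environment(stage="initial", info=coarse_class)

    def fetch_details():
        # Transmit the image (or video) to the second device and wait for a response
        # associated with the recognized physical object (e.g. retrieved information).
        response = send_to_second_device(image, coarse_class)
        update_environment(stage="refined", info=response)

    threading.Thread(target=fetch_details, daemon=True).start()
```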
-
Publication number: 20230206623
Abstract: In one implementation, a method of generating a plane hypothesis is performed by a device including one or more processors, non-transitory memory, and a scene camera. The method includes obtaining an image of a scene including a plurality of pixels. The method includes obtaining a plurality of points of a point cloud based on the image of the scene. The method includes obtaining an object classification set based on the image of the scene. Each element of the object classification set includes a plurality of pixels respectively associated with a corresponding object in the scene. The method includes detecting a plane within the scene by identifying a subset of the plurality of points of the point cloud that correspond to a particular element of the object classification set.
Type: Application
Filed: February 18, 2023
Publication date: June 29, 2023
Inventors: Daniel Ulbricht, Angela Blechschmidt, Mohammad Haris Baig, Tanmay Batra, Eshan Verma, Amit Kumar KC
-
Patent number: 11610397
Abstract: In one implementation, a method of generating a plane hypothesis is performed by a device including one or more processors, non-transitory memory, and a scene camera. The method includes obtaining an image of a scene including a plurality of pixels. The method includes obtaining a plurality of points of a point cloud based on the image of the scene. The method includes obtaining an object classification set based on the image of the scene. Each element of the object classification set includes a plurality of pixels respectively associated with a corresponding object in the scene. The method includes detecting a plane within the scene by identifying a subset of the plurality of points of the point cloud that correspond to a particular element of the object classification set.
Type: Grant
Filed: September 13, 2021
Date of Patent: March 21, 2023
Assignee: APPLE INC.
Inventors: Daniel Ulbricht, Angela Blechschmidt, Mohammad Haris Baig, Tanmay Batra, Eshan Verma, Amit Kumar KC
-
Patent number: 11403511
Abstract: In some implementations, at an electronic device, training a dual EDNN includes defining a data structure of attributes corresponding to defined parts of a task, processing a first instance of an input using a first EDNN to produce a first output while encoding a first set of the attributes in a first latent space, and processing a second instance of the input using a second EDNN to produce a second output while encoding attribute differences from attribute averages in a second latent space. The device then determines a second set of the attributes based on the attribute differences and the attribute averages. The device then adjusts parameters of the first and second EDNNs based on comparing the first instance of the input to the first output, the second instance of the input to the second output, and the first set of attributes to the second set of attributes.
Type: Grant
Filed: July 18, 2019
Date of Patent: August 2, 2022
Assignee: Apple Inc.
Inventors: Peter Meier, Tanmay Batra
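A loose way to read this is two encoder-decoder networks trained jointly: the first encodes the attributes directly, the second encodes only the differences from running attribute averages, and three comparisons (each input against its reconstruction, and the two attribute sets against each other) drive the parameter updates. The PyTorch sketch below follows that reading; treating "EDNN" as an encoder-decoder neural network, along with the layer sizes, the running average, and the equal loss weights, are all assumptions.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class EDNN(nn.Module):
    def __init__(self, dim=32, latent=8):
        super().__init__()
        self.encoder = nn.Sequential(nn.Linear(dim, 16), nn.ReLU(), nn.Linear(16, latent))
        self.decoder = nn.Sequential(nn.Linear(latent, 16), nn.ReLU(), nn.Linear(16, dim))
    def forward(self, x):
        z = self.encoder(x)
        return self.decoder(z), z

net_a, net_b = EDNN(), EDNN()
attribute_avg = torch.zeros(8)                     # running average of the attribute vector
opt = torch.optim.Adam([*net_a.parameters(), *net_b.parameters()], lr=1e-3)

def train_step(x):
    global attribute_avg
    out_a, attrs_a = net_a(x)                      # first EDNN encodes the attributes themselves
    out_b, attr_diff = net_b(x)                    # second EDNN encodes differences from the averages
    attrs_b = attribute_avg + attr_diff            # recover the second attribute set
    loss = (F.mse_loss(out_a, x)                   # first instance vs. first output
            + F.mse_loss(out_b, x)                 # second instance vs. second output
            + F.mse_loss(attrs_a, attrs_b))        # first attribute set vs. second attribute set
    opt.zero_grad(); loss.backward(); opt.step()
    attribute_avg = 0.99 * attribute_avg + 0.01 * attrs_a.mean(dim=0).detach()
    return loss.item()
```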
-
Patent number: 11315278
Abstract: In one implementation, a method of estimating the orientation of an object in an image is performed by a device including one or more processors, non-transitory memory, and a scene camera. The method includes obtaining an image of a scene including a plurality of pixels at a respective plurality of pixel locations and having a respective plurality of pixel values. The method includes determining a first set of pixel locations corresponding to a 2D boundary surrounding an object represented in the image and determining, based on the first set of pixel locations, a second set of pixel locations corresponding to a 3D boundary surrounding the object.
Type: Grant
Filed: September 24, 2019
Date of Patent: April 26, 2022
Assignee: APPLE INC.
Inventors: Daniel Ulbricht, Amit Kumar K C, Angela Blechschmidt, Chen-Yu Lee, Eshan Verma, Mohammad Haris Baig, Tanmay Batra
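The abstract only states that a 3D boundary is determined from the 2D boundary. The NumPy sketch below shows one concrete way such a step could look, back-projecting the pixels inside the 2D box with a depth map and a pinhole camera and taking the extent of the resulting points; the depth map, the intrinsics tuple, and the axis-aligned 3D box are assumptions made for illustration.

```python
import numpy as np

def box_2d_to_3d(box_2d, depth_map, intrinsics):
    """box_2d = (u_min, v_min, u_max, v_max); returns (min_xyz, max_xyz) in camera space."""
    fx, fy, cx, cy = intrinsics
    u_min, v_min, u_max, v_max = box_2d
    z = depth_map[v_min:v_max, u_min:u_max]
    vs, us = np.mgrid[v_min:v_max, u_min:u_max]
    x = (us - cx) * z / fx
    y = (vs - cy) * z / fy
    points = np.stack([x, y, z], axis=-1).reshape(-1, 3)
    points = points[points[:, 2] > 0]                  # ignore pixels with no depth
    if len(points) == 0:
        return None
    return points.min(axis=0), points.max(axis=0)      # axis-aligned 3D boundary around the object
```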
-
Publication number: 20220114796
Abstract: In one implementation, a method of including a person in a CGR experience or excluding the person from the CGR experience is performed by a device including one or more processors, non-transitory memory, and a scene camera. The method includes, while presenting a CGR experience, capturing an image of a scene; detecting, in the image of the scene, a person; and determining an identity of the person. The method includes determining, based on the identity of the person, whether to include the person in the CGR experience or exclude the person from the CGR experience. The method includes presenting the CGR experience based on the determination.
Type: Application
Filed: December 21, 2021
Publication date: April 14, 2022
Inventors: Daniel Ulbricht, Amit Kumar K C, Angela Blechschmidt, Chen-Yu Lee, Eshan Verma, Mohammad Haris Baig, Tanmay Batra
-
Patent number: 11295529
Abstract: In one implementation, a method of including a person in a CGR experience or excluding the person from the CGR experience is performed by a device including one or more processors, non-transitory memory, and a scene camera. The method includes, while presenting a CGR experience, capturing an image of a scene; detecting, in the image of the scene, a person; and determining an identity of the person. The method includes determining, based on the identity of the person, whether to include the person in the CGR experience or exclude the person from the CGR experience. The method includes presenting the CGR experience based on the determination.
Type: Grant
Filed: January 15, 2021
Date of Patent: April 5, 2022
Assignee: APPLE INC.
Inventors: Daniel Ulbricht, Amit Kumar K C, Angela Blechschmidt, Chen-Yu Lee, Eshan Verma, Mohammad Haris Baig, Tanmay Batra
-
Publication number: 20220044486
Abstract: Various implementations disclosed herein include devices, systems, and methods that enable faster and more efficient real-time physical object recognition, information retrieval, and updating of a CGR environment. In some implementations, the CGR environment is provided at a first device based on a classification of the physical object, image or video data including the physical object is transmitted by the first device to a second device, and the CGR environment is updated by the first device based on a response associated with the physical object received from the second device.
Type: Application
Filed: October 22, 2021
Publication date: February 10, 2022
Inventors: Eshan Verma, Daniel Ulbricht, Angela Blechschmidt, Mohammad Haris Baig, Chen-Yu Lee, Tanmay Batra
-
Publication number: 20210406541
Abstract: In one implementation, a method of generating a plane hypothesis is performed by a device including one or more processors, non-transitory memory, and a scene camera. The method includes obtaining an image of a scene including a plurality of pixels. The method includes obtaining a plurality of points of a point cloud based on the image of the scene. The method includes obtaining an object classification set based on the image of the scene. Each element of the object classification set includes a plurality of pixels respectively associated with a corresponding object in the scene. The method includes detecting a plane within the scene by identifying a subset of the plurality of points of the point cloud that correspond to a particular element of the object classification set.
Type: Application
Filed: September 13, 2021
Publication date: December 30, 2021
Inventors: Daniel Ulbricht, Angela Blechschmidt, Mohammad Haris Baig, Tanmay Batra, Eshan Verma, Amit Kumar KC
-
Patent number: 11189103
Abstract: Various implementations disclosed herein include devices, systems, and methods that enable faster and more efficient real-time physical object recognition, information retrieval, and updating of a CGR environment. In some implementations, the CGR environment is provided at a first device based on a classification of the physical object, image or video data including the physical object is transmitted by the first device to a second device, and the CGR environment is updated by the first device based on a response associated with the physical object received from the second device.
Type: Grant
Filed: July 9, 2020
Date of Patent: November 30, 2021
Assignee: Apple Inc.
Inventors: Eshan Verma, Daniel Ulbricht, Angela Blechschmidt, Mohammad Haris Baig, Chen-Yu Lee, Tanmay Batra
-
Patent number: 11132546
Abstract: In one implementation, a method of generating a plane hypothesis is performed by a head-mounted device (HMD) including one or more processors, non-transitory memory, and a scene camera. The method includes obtaining an image of a scene including a plurality of pixels. The method includes obtaining a point cloud based on the image of the scene and generating an object classification set based on the image of the scene, each element of the object classification set including a respective plurality of pixels classified as a respective object in the scene. The method includes generating a plane hypothesis based on the point cloud and the object classification set.
Type: Grant
Filed: September 25, 2020
Date of Patent: September 28, 2021
Assignee: APPLE INC.
Inventors: Daniel Ulbricht, Angela Blechschmidt, Mohammad Haris Baig, Tanmay Batra, Eshan Verma, Amit Kumar KC
-
Patent number: 11100720
Abstract: In one implementation, a method of generating a depth map is performed by a device including one or more processors, non-transitory memory, and a scene camera. The method includes generating, based on a first image and a second image, a first depth map of the second image. The method includes generating, based on the first depth map of the second image and pixel values of the second image, a second depth map of the second image.
Type: Grant
Filed: September 24, 2020
Date of Patent: August 24, 2021
Assignee: APPLE INC.
Inventors: Daniel Ulbricht, Amit Kumar K C, Angela Blechschmidt, Chen-Yu Lee, Eshan Verma, Mohammad Haris Baig, Tanmay Batra
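The two-step structure, a first depth map computed from the image pair and a second depth map computed from the first one plus the second image's pixel values, can be pictured as "estimate, then refine with the image as guidance". The sketch below uses OpenCV block matching for the first step and a simple joint-bilateral-style smoothing for the second; both choices, along with the baseline and focal values, are assumptions made only to illustrate the shape of the pipeline.

```python
import numpy as np
import cv2

def first_depth(ref_gray, other_gray, baseline=0.1, focal=500.0):
    """First depth map of the reference image (the abstract's 'second image') from two
    8-bit grayscale images, via block-matching disparity (an assumed method)."""
    stereo = cv2.StereoBM_create(numDisparities=64, blockSize=15)
    disparity = stereo.compute(ref_gray, other_gray).astype(np.float32) / 16.0
    return np.where(disparity > 0, focal * baseline / np.maximum(disparity, 1e-6), 0.0)

def refine_depth(depth, guide_gray, radius=2, sigma=10.0):
    """Second depth map: smooth the first one using the image's pixel values as guidance
    (a joint-bilateral-style refinement used here purely for illustration)."""
    acc = np.zeros_like(depth)
    weight_sum = np.zeros_like(depth)
    guide = guide_gray.astype(np.float32)
    for dy in range(-radius, radius + 1):
        for dx in range(-radius, radius + 1):
            shifted_depth = np.roll(np.roll(depth, dy, axis=0), dx, axis=1)
            shifted_guide = np.roll(np.roll(guide, dy, axis=0), dx, axis=1)
            w_color = np.exp(-((guide - shifted_guide) ** 2) / (2 * sigma ** 2))
            acc += w_color * shifted_depth
            weight_sum += w_color
    return acc / np.maximum(weight_sum, 1e-6)
```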
-
Publication number: 20210134067
Abstract: In one implementation, a method of including a person in a CGR experience or excluding the person from the CGR experience is performed by a device including one or more processors, non-transitory memory, and a scene camera. The method includes, while presenting a CGR experience, capturing an image of a scene; detecting, in the image of the scene, a person; and determining an identity of the person. The method includes determining, based on the identity of the person, whether to include the person in the CGR experience or exclude the person from the CGR experience. The method includes presenting the CGR experience based on the determination.
Type: Application
Filed: January 15, 2021
Publication date: May 6, 2021
Inventors: Daniel Ulbricht, Amit Kumar K C, Angela Blechschmidt, Chen-Yu Lee, Eshan Verma, Mohammad Haris Baig, Tanmay Batra
-
Publication number: 20210035367
Abstract: Various implementations disclosed herein include devices, systems, and methods that enable faster and more efficient real-time physical object recognition, information retrieval, and updating of a CGR environment. In some implementations, the CGR environment is provided at a first device based on a classification of the physical object, image or video data including the physical object is transmitted by the first device to a second device, and the CGR environment is updated by the first device based on a response associated with the physical object received from the second device.
Type: Application
Filed: July 9, 2020
Publication date: February 4, 2021
Inventors: Eshan Verma, Daniel Ulbricht, Angela Blechschmidt, Mohammad Haris Baig, Chen-Yu Lee, Tanmay Batra
-
Publication number: 20210019949
Abstract: In one implementation, a method of generating a depth map is performed by a device including one or more processors, non-transitory memory, and a scene camera. The method includes generating, based on a first image and a second image, a first depth map of the second image. The method includes generating, based on the first depth map of the second image and pixel values of the second image, a second depth map of the second image.
Type: Application
Filed: September 24, 2020
Publication date: January 21, 2021
Inventors: Daniel Ulbricht, Amit Kumar K C, Angela Blechschmidt, Chen-Yu Lee, Eshan Verma, Mohammad Haris Baig, Tanmay Batra