Patents by Inventor Rohit Kumar Pandey
Rohit Kumar Pandey has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Publication number: 20240290025
Abstract: A method comprises receiving a first sequence of images of a portion of a user, the first sequence of images being monocular images; generating an avatar based on the first sequence of images, the avatar being based on a model including a feature vector associated with a vertex; receiving a second sequence of images of the portion of the user; and based on the second sequence of images, modifying the avatar with a displacement of the vertex to represent a gesture of the avatar.
Type: Application
Filed: February 27, 2024
Publication date: August 29, 2024
Inventors: Yinda Zhang, Sean Ryan Francesco Fanello, Ziqian Bai, Feitong Tan, Zeng Huang, Kripasindhu Sarkar, Danhang Tang, Di Qiu, Abhimitra Meka, Ruofei Du, Mingsong Dou, Sergio Orts Escolano, Rohit Kumar Pandey, Thabo Beeler
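A minimal sketch of the core idea in this abstract: an avatar is a vertex mesh carrying per-vertex feature vectors, and a gesture is expressed by displacing those vertices. The displacement predictor below is a hypothetical stand-in (random weights, illustrative sizes), not the patented model, which would be driven by the second sequence of images.

```python
import numpy as np

rng = np.random.default_rng(0)

num_vertices, feature_dim = 5023, 32            # illustrative sizes, not from the patent
vertices = rng.normal(size=(num_vertices, 3))   # rest-pose vertex positions
features = rng.normal(size=(num_vertices, feature_dim))  # per-vertex feature vectors

def predict_displacements(features, gesture_code):
    """Hypothetical stand-in for the learned model mapping per-vertex
    features plus a gesture code to 3D vertex displacements."""
    w = rng.normal(scale=0.01, size=(features.shape[1] + gesture_code.shape[0], 3))
    x = np.concatenate([features, np.tile(gesture_code, (features.shape[0], 1))], axis=1)
    return x @ w

gesture_code = rng.normal(size=(8,))             # e.g. derived from the second image sequence
displacements = predict_displacements(features, gesture_code)
gestured_vertices = vertices + displacements     # avatar modified to represent the gesture
print(gestured_vertices.shape)                   # (5023, 3)
```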
-
Patent number: 12066282
Abstract: A lighting stage includes a plurality of lights that project alternating spherical color gradient illumination patterns onto an object or human performer at a predetermined frequency. The lighting stage also includes a plurality of cameras that capture images of an object or human performer corresponding to the alternating spherical color gradient illumination patterns. The lighting stage also includes a plurality of depth sensors that capture depth maps of the object or human performer at the predetermined frequency. The lighting stage also includes (or is associated with) one or more processors that implement a machine learning algorithm to produce a three-dimensional (3D) model of the object or human performer. The 3D model includes relighting parameters used to relight the 3D model under different lighting conditions.
Type: Grant
Filed: November 11, 2020
Date of Patent: August 20, 2024
Assignee: GOOGLE LLC
Inventors: Sean Ryan Francesco Fanello, Kaiwen Guo, Peter Christopher Lincoln, Philip Lindsley Davidson, Jessica L. Busch, Xueming Yu, Geoffrey Harvey, Sergio Orts Escolano, Rohit Kumar Pandey, Jason Dourgarian, Danhang Tang, Adarsh Prakash Murthy Kowdle, Emily B. Cooper, Mingsong Dou, Graham Fyffe, Christoph Rhemann, Jonathan James Taylor, Shahram Izadi, Paul Ernest Debevec
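A minimal sketch of what "relighting parameters" enable, assuming they reduce to a per-point albedo and surface normal (the learned parameters in the patent are richer than this): once the 3D model carries these quantities, relighting under a new directional light can be as simple as a Lambertian shade.

```python
import numpy as np

rng = np.random.default_rng(1)
num_points = 10000
albedo = rng.uniform(0.2, 0.9, size=(num_points, 3))        # per-point RGB albedo
normals = rng.normal(size=(num_points, 3))
normals /= np.linalg.norm(normals, axis=1, keepdims=True)    # unit surface normals

def relight(albedo, normals, light_dir, light_color):
    """Shade every point of the reconstructed 3D model under a new light."""
    light_dir = np.asarray(light_dir, dtype=float)
    light_dir /= np.linalg.norm(light_dir)
    ndotl = np.clip(normals @ light_dir, 0.0, None)[:, None]  # cosine falloff term
    return albedo * light_color * ndotl

lit = relight(albedo, normals, light_dir=[0.3, 0.8, 0.5], light_color=[1.0, 0.95, 0.9])
print(lit.shape)  # (10000, 3) relit point colors
```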
-
Patent number: 12026833
Abstract: Systems and methods are described for utilizing an image processing system with at least one processing device to perform operations including receiving a plurality of input images of a user, generating a three-dimensional mesh proxy based on a first set of features extracted from the plurality of input images and a second set of features extracted from the plurality of input images. The method may further include generating a neural texture based on a three-dimensional mesh proxy and the plurality of input images, generating a representation of the user including at least a neural texture, and sampling at least one portion of the neural texture from the three-dimensional mesh proxy. In response to providing the at least one sampled portion to a neural renderer, the method may include receiving, from the neural renderer, a synthesized image of the user that is previously not captured by the image processing system.
Type: Grant
Filed: October 28, 2020
Date of Patent: July 2, 2024
Assignee: Google LLC
Inventors: Ricardo Martin Brualla, Moustafa Meshry, Daniel Goldman, Rohit Kumar Pandey, Sofien Bouaziz, Ke Li
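A minimal sketch of the data flow in this abstract: a neural texture (a learned multi-channel image) is attached to a coarse 3D mesh proxy, a portion of it is sampled via the proxy's UV coordinates, and the sampled features are handed to a neural renderer. Both networks are stubs with random weights here; the UV lookup is a simple nearest-neighbour sample rather than whatever sampler the patent contemplates.

```python
import numpy as np

rng = np.random.default_rng(2)

tex_res, channels = 256, 16
neural_texture = rng.normal(size=(tex_res, tex_res, channels))  # learned in practice

def sample_neural_texture(texture, uv):
    """Nearest-neighbour lookup of texture features at UV coordinates in [0, 1)."""
    ij = np.clip((uv * texture.shape[0]).astype(int), 0, texture.shape[0] - 1)
    return texture[ij[:, 1], ij[:, 0]]               # (N, channels)

def neural_renderer_stub(sampled_features):
    """Hypothetical stand-in for the neural renderer that turns sampled
    features into RGB values of the synthesized image."""
    w = rng.normal(scale=0.1, size=(sampled_features.shape[1], 3))
    return 1.0 / (1.0 + np.exp(-(sampled_features @ w)))  # RGB in (0, 1)

uv_from_mesh_proxy = rng.uniform(size=(1024, 2))      # UVs rasterized from the 3D proxy
features = sample_neural_texture(neural_texture, uv_from_mesh_proxy)
rgb = neural_renderer_stub(features)
print(rgb.shape)  # (1024, 3)
```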
-
Publication number: 20240212106
Abstract: Apparatus and methods related to applying lighting models to images are provided. An example method includes receiving, via a computing device, an image comprising a subject. The method further includes relighting, via a neural network, a foreground of the image to maintain a consistent lighting of the foreground with a target illumination. The relighting is based on a per-pixel light representation indicative of a surface geometry of the foreground. The light representation includes a specular component and a diffuse component of surface reflection. The method additionally includes predicting, via the neural network, an output image comprising the subject in the relit foreground. One or more neural networks can be trained to perform one or more of the aforementioned aspects.
Type: Application
Filed: April 28, 2021
Publication date: June 27, 2024
Inventors: Chloe LeGendre, Paul Debevec, Sean Ryan Francesco Fanello, Rohit Kumar Pandey, Sergio Orts Escolano, Christian Haene, Sofien Bouaziz
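A minimal sketch of a per-pixel light representation with separate diffuse and specular components of surface reflection, assuming a per-pixel normal map of the foreground and a single target light direction. Analytic Lambert and Blinn-Phong terms stand in for the neural network described in the abstract; the shininess exponent and image size are illustrative.

```python
import numpy as np

rng = np.random.default_rng(3)
h, w = 4, 4
normals = rng.normal(size=(h, w, 3))
normals /= np.linalg.norm(normals, axis=-1, keepdims=True)    # foreground surface geometry

light_dir = np.array([0.0, 0.5, 0.8]); light_dir /= np.linalg.norm(light_dir)
view_dir = np.array([0.0, 0.0, 1.0])
half_vec = light_dir + view_dir; half_vec /= np.linalg.norm(half_vec)

diffuse = np.clip(normals @ light_dir, 0.0, None)             # (h, w) Lambertian term
specular = np.clip(normals @ half_vec, 0.0, None) ** 32       # (h, w) Blinn-Phong term

# Per-pixel light representation: one diffuse and one specular channel.
light_representation = np.stack([diffuse, specular], axis=-1)
print(light_representation.shape)  # (4, 4, 2)
```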
-
Publication number: 20240212325
Abstract: Systems and methods for training models to predict dense correspondences across images such as human images. A model may be trained using synthetic training data created from one or more 3D computer models of a subject. In addition, one or more geodesic distances derived from the surfaces of one or more of the 3D models may be used to generate one or more loss values, which may in turn be used in modifying the model's parameters during training.
Type: Application
Filed: March 6, 2024
Publication date: June 27, 2024
Inventors: Yinda Zhang, Feitong Tan, Danhang Tang, Mingsong Dou, Kaiwen Guo, Sean Ryan Francesco Fanello, Sofien Bouaziz, Cem Keskin, Ruofei Du, Rohit Kumar Pandey, Deqing Sun
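A minimal sketch of a geodesic loss for dense correspondence, assuming a precomputed matrix of geodesic distances between vertices of the 3D model used to render the synthetic training data. The per-pixel loss is the geodesic distance between the predicted and ground-truth surface points, so predictions that land far away along the surface are penalized more; the distance values below are placeholders.

```python
import numpy as np

rng = np.random.default_rng(4)
num_vertices = 500

# Symmetric geodesic distance matrix with zero diagonal (placeholder values;
# in practice these would be computed on the 3D model's surface).
geo = rng.uniform(0.0, 1.0, size=(num_vertices, num_vertices))
geo = 0.5 * (geo + geo.T)
np.fill_diagonal(geo, 0.0)

def geodesic_loss(pred_vertex_ids, gt_vertex_ids, geodesic_matrix):
    """Mean geodesic distance between predicted and ground-truth vertices."""
    return geodesic_matrix[pred_vertex_ids, gt_vertex_ids].mean()

pred = rng.integers(0, num_vertices, size=2048)   # per-pixel predicted vertex id
gt = rng.integers(0, num_vertices, size=2048)     # per-pixel ground-truth vertex id
print(geodesic_loss(pred, gt, geo))               # loss value used to update the model
```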
-
Patent number: 11954899
Abstract: Systems and methods for training models to predict dense correspondences across images such as human images. A model may be trained using synthetic training data created from one or more 3D computer models of a subject. In addition, one or more geodesic distances derived from the surfaces of one or more of the 3D models may be used to generate one or more loss values, which may in turn be used in modifying the model's parameters during training.
Type: Grant
Filed: March 11, 2021
Date of Patent: April 9, 2024
Assignee: GOOGLE LLC
Inventors: Yinda Zhang, Feitong Tan, Danhang Tang, Mingsong Dou, Kaiwen Guo, Sean Ryan Francesco Fanello, Sofien Bouaziz, Cem Keskin, Ruofei Du, Rohit Kumar Pandey, Deqing Sun
-
Publication number: 20240046618
Abstract: Systems and methods for training models to predict dense correspondences across images such as human images. A model may be trained using synthetic training data created from one or more 3D computer models of a subject. In addition, one or more geodesic distances derived from the surfaces of one or more of the 3D models may be used to generate one or more loss values, which may in turn be used in modifying the model's parameters during training.
Type: Application
Filed: March 11, 2021
Publication date: February 8, 2024
Inventors: Yinda Zhang, Feitong Tan, Danhang Tang, Mingsong Dou, Kaiwen Guo, Sean Ryan Francesco Fanello, Sofien Bouaziz, Cem Keskin, Ruofei Du, Rohit Kumar Pandey, Deqing Sun
-
Publication number: 20240020915
Abstract: Techniques include introducing a neural generator configured to produce novel faces that can be rendered at free camera viewpoints (e.g., at any angle with respect to the camera) and relit under an arbitrary high dynamic range (HDR) light map. A neural implicit intrinsic field takes a randomly sampled latent vector as input and produces as output per-point albedo, volume density, and reflectance properties for any queried 3D location. These outputs are aggregated via volumetric rendering to produce low-resolution albedo, diffuse shading, specular shading, and neural feature maps. The low-resolution maps are then upsampled to produce high-resolution maps and input into a neural renderer to produce relit images.
Type: Application
Filed: July 17, 2023
Publication date: January 18, 2024
Inventors: Yinda Zhang, Feitong Tan, Sean Ryan Francesco Fanello, Abhimitra Meka, Sergio Orts Escolano, Danhang Tang, Rohit Kumar Pandey, Jonathan James Taylor
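A minimal sketch of the volumetric aggregation step described here: per-point albedo and volume density queried along a camera ray are composited into a single low-resolution albedo sample using standard volume-rendering weights. The implicit intrinsic field is a stub with random weights rather than the patented network, and reflectance/feature channels are omitted.

```python
import numpy as np

rng = np.random.default_rng(5)

def intrinsic_field_stub(latent, points):
    """Hypothetical stand-in for the neural implicit intrinsic field:
    maps a latent vector and 3D points to (albedo, density)."""
    albedo = 0.5 + 0.5 * np.tanh(points @ rng.normal(size=(3, 3)) + latent[:3])
    density = np.exp(points @ rng.normal(size=(3,)) * 0.1)        # nonnegative density
    return albedo, density

def render_ray(latent, origin, direction, num_samples=64, far=4.0):
    """Standard volume rendering: accumulate albedo with density-based weights."""
    t = np.linspace(0.0, far, num_samples)
    delta = np.diff(t, append=t[-1] + (t[1] - t[0]))
    points = origin + t[:, None] * direction
    albedo, density = intrinsic_field_stub(latent, points)
    alpha = 1.0 - np.exp(-density * delta)                         # per-sample opacity
    trans = np.cumprod(np.concatenate([[1.0], 1.0 - alpha[:-1]]))  # transmittance
    weights = alpha * trans
    return (weights[:, None] * albedo).sum(axis=0)                 # aggregated albedo

latent = rng.normal(size=(64,))                    # randomly sampled latent vector
pixel_albedo = render_ray(latent, origin=np.zeros(3), direction=np.array([0.0, 0.0, 1.0]))
print(pixel_albedo)                                # one low-resolution albedo sample
```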
-
Patent number: 11868523
Abstract: Techniques of tracking a user's gaze include identifying a region of a display at which a gaze of a user is directed, the region including a plurality of pixels. By determining a region rather than a point, when the regions correspond to elements of a user interface, the improved technique enables a system to activate the element corresponding to the determined region. In some implementations, the system makes the determination using a classification engine including a convolutional neural network; such an engine takes as input images of the user's eye and outputs a list of probabilities that the gaze is directed to each of the regions.
Type: Grant
Filed: July 1, 2021
Date of Patent: January 9, 2024
Assignee: GOOGLE LLC
Inventors: Ivana Tosic Rodgers, Sean Ryan Francesco Fanello, Sofien Bouaziz, Rohit Kumar Pandey, Eric Aboussouan, Adarsh Prakash Murthy Kowdle
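A minimal sketch of region-based gaze classification: an eye image is mapped to one probability per screen region, and the UI element whose region receives the highest probability is activated. A linear layer with random weights stands in for the convolutional neural network of the abstract; region count and crop size are illustrative.

```python
import numpy as np

rng = np.random.default_rng(6)
num_regions = 9                                    # e.g. a 3x3 grid of UI elements
eye_image = rng.uniform(size=(36, 60))             # grayscale eye crop

def classify_gaze_region(image, num_regions):
    """Return a probability per region (softmax over stub logits)."""
    logits = image.reshape(-1) @ rng.normal(scale=0.01, size=(image.size, num_regions))
    exp = np.exp(logits - logits.max())
    return exp / exp.sum()

probs = classify_gaze_region(eye_image, num_regions)
activated_element = int(np.argmax(probs))          # element whose region won
print(probs.round(3), "-> activate element", activated_element)
```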
-
Publication number: 20230419600
Abstract: Example embodiments relate to techniques for volumetric performance capture with neural rendering. A technique may involve initially obtaining images that depict a subject from multiple viewpoints and under various lighting conditions using a light stage, and depth data corresponding to the subject using infrared cameras. A neural network may extract features of the subject from the images based on the depth data and map the features into a texture space (e.g., the UV texture space). A neural renderer can be used to generate an output image depicting the subject from a target view such that illumination of the subject in the output image aligns with the target view. The neural renderer may resample the features of the subject from the texture space to an image space to generate the output image.
Type: Application
Filed: November 5, 2020
Publication date: December 28, 2023
Inventors: Sean Ryan Francesco Fanello, Abhi Meka, Rohit Kumar Pandey, Christian Haene, Sergio Orts Escolano, Christoph Rhemann, Paul Debevec, Sofien Bouaziz, Thabo Beeler, Ryan Overbeck, Peter Barnum, Daniel Erickson, Philip Davidson, Yinda Zhang, Jonathan Taylor, Chloe LeGendre, Shahram Izadi
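A minimal sketch of the resampling step named in this abstract: subject features stored in a UV texture space are pulled back into image space for a target view, using a per-pixel UV map that a rasterizer would produce for that view. The feature-extraction and neural-rendering networks are out of scope, the UV map is random, and the lookup is nearest-neighbour for brevity.

```python
import numpy as np

rng = np.random.default_rng(7)
tex_res, channels = 128, 8
uv_feature_map = rng.normal(size=(tex_res, tex_res, channels))  # features in UV texture space

h, w = 64, 64
uv_for_target_view = rng.uniform(size=(h, w, 2))    # per-pixel UVs for the target view

def resample_to_image_space(uv_features, uv_map):
    """Nearest-neighbour resampling of UV-space features into image space."""
    ij = np.clip((uv_map * uv_features.shape[0]).astype(int), 0, uv_features.shape[0] - 1)
    return uv_features[ij[..., 1], ij[..., 0]]       # (h, w, channels)

image_space_features = resample_to_image_space(uv_feature_map, uv_for_target_view)
print(image_space_features.shape)  # (64, 64, 8) -> input to the neural renderer
```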
-
Publication number: 20230360182
Abstract: Apparatus and methods related to applying lighting models to images of objects are provided. An example method includes applying a geometry model to an input image to determine a surface orientation map indicative of a distribution of lighting on an object based on a surface geometry. The method further includes applying an environmental light estimation model to the input image to determine a direction of synthetic lighting to be applied to the input image. The method also includes applying, based on the surface orientation map and the direction of synthetic lighting, a light energy model to determine a quotient image indicative of an amount of light energy to be applied to each pixel of the input image. The method additionally includes enhancing, based on the quotient image, a portion of the input image. One or more neural networks can be trained to perform one or more of the aforementioned aspects.
Type: Application
Filed: May 17, 2021
Publication date: November 9, 2023
Inventors: Sean Ryan Francesco Fanello, Yun-Ta Tsai, Rohit Kumar Pandey, Paul Debevec, Michael Milne, Chloe LeGendre, Jonathan Tilton Barron, Christoph Rhemann, Sofien Bouaziz, Navin Padman Sarma
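A minimal sketch of quotient-image enhancement, assuming the surface orientation map is a per-pixel normal map and the light energy model is a simple clamped cosine term: the quotient image scales the light energy at each pixel and is multiplied into the input image. The strength value and directions are illustrative, and the trained networks of the abstract are not modeled.

```python
import numpy as np

rng = np.random.default_rng(8)
h, w = 4, 4
image = rng.uniform(size=(h, w, 3))                  # input image of the object
normals = rng.normal(size=(h, w, 3))
normals /= np.linalg.norm(normals, axis=-1, keepdims=True)   # surface orientation map

# Direction of synthetic lighting, e.g. from an environmental light estimation model.
light_dir = np.array([0.4, 0.3, 0.87]); light_dir /= np.linalg.norm(light_dir)

strength = 0.6                                       # how much synthetic light to add
quotient = 1.0 + strength * np.clip(normals @ light_dir, 0.0, None)  # (h, w) quotient image

enhanced = np.clip(image * quotient[..., None], 0.0, 1.0)    # enhanced portion of the image
print(enhanced.shape)  # (4, 4, 3)
```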
-
Patent number: 11710287
Abstract: Systems and methods are described for generating a plurality of three-dimensional (3D) proxy geometries of an object, generating, based on the plurality of 3D proxy geometries, a plurality of neural textures of the object, the neural textures defining a plurality of different shapes and appearances representing the object, providing the plurality of neural textures to a neural renderer, receiving, from the neural renderer and based on the plurality of neural textures, a color image and an alpha mask representing an opacity of at least a portion of the object, and generating a composite image based on the pose, the color image, and the alpha mask.
Type: Grant
Filed: August 4, 2020
Date of Patent: July 25, 2023
Assignee: GOOGLE LLC
Inventors: Ricardo Martin Brualla, Daniel Goldman, Sofien Bouaziz, Rohit Kumar Pandey, Matthew Brown
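A minimal sketch of the final compositing step: the color image and alpha (opacity) mask produced by the neural renderer are blended over a background with the standard "over" operator. Producing the color and alpha from 3D proxy geometries and neural textures is not shown, and all inputs below are random placeholders.

```python
import numpy as np

rng = np.random.default_rng(9)
h, w = 4, 4
color = rng.uniform(size=(h, w, 3))        # object color from the neural renderer
alpha = rng.uniform(size=(h, w, 1))        # per-pixel opacity of the object
background = rng.uniform(size=(h, w, 3))   # scene the object is composited into

composite = alpha * color + (1.0 - alpha) * background   # standard "over" compositing
print(composite.shape)  # (4, 4, 3)
```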
-
Publication number: 20230004216
Abstract: Techniques of tracking a user's gaze include identifying a region of a display at which a gaze of a user is directed, the region including a plurality of pixels. By determining a region rather than a point, when the regions correspond to elements of a user interface, the improved technique enables a system to activate the element corresponding to the determined region. In some implementations, the system makes the determination using a classification engine including a convolutional neural network; such an engine takes as input images of the user's eye and outputs a list of probabilities that the gaze is directed to each of the regions.
Type: Application
Filed: July 1, 2021
Publication date: January 5, 2023
Inventors: Ivana Tosic Rodgers, Sean Ryan Francesco Fanello, Sofien Bouaziz, Rohit Kumar Pandey, Eric Aboussouan, Adarsh Prakash Murthy Kowdle
-
Publication number: 20220130111
Abstract: Systems and methods are described for utilizing an image processing system with at least one processing device to perform operations including receiving a plurality of input images of a user, generating a three-dimensional mesh proxy based on a first set of features extracted from the plurality of input images and a second set of features extracted from the plurality of input images. The method may further include generating a neural texture based on a three-dimensional mesh proxy and the plurality of input images, generating a representation of the user including at least a neural texture, and sampling at least one portion of the neural texture from the three-dimensional mesh proxy. In response to providing the at least one sampled portion to a neural renderer, the method may include receiving, from the neural renderer, a synthesized image of the user that is previously not captured by the image processing system.
Type: Application
Filed: October 28, 2020
Publication date: April 28, 2022
Inventors: Ricardo Martin Brualla, Moustafa Meshry, Daniel Goldman, Rohit Kumar Pandey, Sofien Bouaziz, Ke Li
-
Patent number: 11269172
Abstract: Embodiments of the present disclosure disclose a system and method for reconstruction of a field of view (FOV). Initially, the presence of either a single object or distinct objects in the FOV of an image of a sample comprising one or more objects is determined based on the sharpness of the one or more objects. A single optimal representation of the FOV may be generated when the presence of a single object is determined. At least one of a single optimal representation and a depth-based enhanced representation of the FOV may be generated when the presence of distinct objects is determined. To generate the depth-based enhanced representation, one or more first optimal images associated with each of the distinct objects in the FOV may be retrieved. An optimal representation of each of the distinct objects is generated based on the corresponding one or more first optimal images. Further, the optimal representation of each of the distinct objects is placed at the corresponding optimal depth associated with the respective distinct object to generate the depth-based enhanced representation.
Type: Grant
Filed: February 8, 2018
Date of Patent: March 8, 2022
Assignee: SIGTUPLE TECHNOLOGIES PRIVATE LIMITED
Inventors: Harshit Pande, Abdul Aziz, Bharath Cheluvaraju, Tathagato Rai Dastidar, Apurv Anand, Rohit Kumar Pandey
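A minimal sketch of one building block of this idea: picking an object's optimal image and optimal depth from a focal stack of the same FOV, using a gradient-energy sharpness score. The sharpness measure, stack size, and data are assumptions for illustration; the single-versus-distinct decision and the final depth-based placement are not shown.

```python
import numpy as np

rng = np.random.default_rng(10)
num_depths, h, w = 11, 32, 32
focal_stack = rng.uniform(size=(num_depths, h, w))   # same FOV captured at different depths

def sharpness(img):
    """Gradient-energy sharpness score of an object image."""
    gy, gx = np.gradient(img)
    return float((gx ** 2 + gy ** 2).mean())

def optimal_representation(stack):
    """Return the sharpest slice and the index of its (optimal) depth."""
    scores = [sharpness(s) for s in stack]
    best = int(np.argmax(scores))
    return stack[best], best

best_image, optimal_depth_index = optimal_representation(focal_stack)
print(optimal_depth_index, best_image.shape)
```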
-
Publication number: 20220065620
Abstract: A lighting stage includes a plurality of lights that project alternating spherical color gradient illumination patterns onto an object or human performer at a predetermined frequency. The lighting stage also includes a plurality of cameras that capture images of an object or human performer corresponding to the alternating spherical color gradient illumination patterns. The lighting stage also includes a plurality of depth sensors that capture depth maps of the object or human performer at the predetermined frequency. The lighting stage also includes (or is associated with) one or more processors that implement a machine learning algorithm to produce a three-dimensional (3D) model of the object or human performer. The 3D model includes relighting parameters used to relight the 3D model under different lighting conditions.
Type: Application
Filed: November 11, 2020
Publication date: March 3, 2022
Inventors: Sean Ryan Francesco Fanello, Kaiwen Guo, Peter Christopher Lincoln, Philip Lindsley Davidson, Jessica L. Busch, Xueming Yu, Geoffrey Harvey, Sergio Orts Escolano, Rohit Kumar Pandey, Jason Dourgarian, Danhang Tang, Adarsh Prakash Murthy Kowdle, Emily B. Cooper, Mingsong Dou, Graham Fyffe, Christoph Rhemann, Jonathan James Taylor, Shahram Izadi, Paul Ernest Debevec
-
Publication number: 20220051485
Abstract: Systems and methods are described for generating a plurality of three-dimensional (3D) proxy geometries of an object, generating, based on the plurality of 3D proxy geometries, a plurality of neural textures of the object, the neural textures defining a plurality of different shapes and appearances representing the object, providing the plurality of neural textures to a neural renderer, receiving, from the neural renderer and based on the plurality of neural textures, a color image and an alpha mask representing an opacity of at least a portion of the object, and generating a composite image based on the pose, the color image, and the alpha mask.
Type: Application
Filed: August 4, 2020
Publication date: February 17, 2022
Inventors: Ricardo Martin Brualla, Daniel Goldman, Sofien Bouaziz, Rohit Kumar Pandey, Matthew Brown
-
Patent number: 11189027
Abstract: The disclosed subject matter relates to Peripheral Blood Smear (PBS) analysis and determines an area to be scanned in a PBS. A PBS analysing system captures a focused image at each of a plurality of positions in the PBS and determines Quality Indicators (QIs) in each focused image. Further, a region of the PBS where the QIs of the focused images satisfy predefined QI threshold limits is identified as the monolayer region of the PBS, and an initiation point in the monolayer region is determined based on the cell count value and co-ordinates of each of the plurality of positions located in the monolayer region. Finally, the area to be scanned in the monolayer region is determined based on the initiation point and a predefined scan pattern. Determining the area to be scanned in this way yields accurate and faster results.
Type: Grant
Filed: May 15, 2018
Date of Patent: November 30, 2021
Assignee: Sigtuple Technologies Private Limited
Inventors: Shreepad Potadar, Dheeraj Mundhra, Abhishek Shukla, Raghu G, Amrutha Muralidharan, Deepak Kapoor, Vijay Muralidharan, Nivedita Muthusubramanian, Bharath Cheluvaraju, Apurv Anand, Tathagato Rai Dastidar, Rohit Kumar Pandey
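A minimal sketch of the overall flow, under strong assumptions: the quality indicator per position is reduced to a single cell count, positions whose counts fall inside predefined threshold limits form the monolayer region, the initiation point is taken as the monolayer position nearest its centroid, and the scan area is a small serpentine pattern from that point. The thresholds, centroid heuristic, and scan pattern are illustrative choices, not the patented procedure.

```python
import numpy as np

rng = np.random.default_rng(11)
num_positions = 200
coords = rng.uniform(0, 50, size=(num_positions, 2))      # stage coordinates (mm)
cell_count = rng.integers(0, 400, size=num_positions)     # cells per focused image (QI proxy)

qi_low, qi_high = 80, 250                                  # predefined QI threshold limits
monolayer_mask = (cell_count >= qi_low) & (cell_count <= qi_high)
monolayer_coords = coords[monolayer_mask]

# Initiation point: monolayer position closest to the monolayer centroid (assumption).
centroid = monolayer_coords.mean(axis=0)
initiation_point = monolayer_coords[
    np.argmin(np.linalg.norm(monolayer_coords - centroid, axis=1))
]

# A simple serpentine scan pattern of fixed size starting at the initiation point.
scan_area = [(initiation_point[0] + dx, initiation_point[1] + dy)
             for dy in range(0, 5)
             for dx in (range(0, 5) if dy % 2 == 0 else range(4, -1, -1))]
print(initiation_point, len(scan_area))
```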
-
Publication number: 20210165204
Abstract: Embodiments of the present disclosure disclose a system and method for reconstruction of a field of view (FOV). Initially, the presence of either a single object or distinct objects in the FOV of an image of a sample comprising one or more objects is determined based on the sharpness of the one or more objects. A single optimal representation of the FOV may be generated when the presence of a single object is determined. At least one of a single optimal representation and a depth-based enhanced representation of the FOV may be generated when the presence of distinct objects is determined. To generate the depth-based enhanced representation, one or more first optimal images associated with each of the distinct objects in the FOV may be retrieved. An optimal representation of each of the distinct objects is generated based on the corresponding one or more first optimal images. Further, the optimal representation of each of the distinct objects is placed at the corresponding optimal depth associated with the respective distinct object to generate the depth-based enhanced representation.
Type: Application
Filed: February 8, 2018
Publication date: June 3, 2021
Applicant: SIGTUPLE TECHNOLOGIES PRIVATE LIMITED
Inventors: Harshit Pande, Abdul Aziz, Bharath Cheluvaraju, Tathagato Rai Dastidar, Apurv Anand, Rohit Kumar Pandey
-
Patent number: 10997457
Abstract: Methods, systems, and media for relighting images using predicted deep reflectance fields are provided.
Type: Grant
Filed: October 16, 2019
Date of Patent: May 4, 2021
Assignee: Google LLC
Inventors: Christoph Rhemann, Abhimitra Meka, Matthew Whalen, Jessica Lynn Busch, Sofien Bouaziz, Geoffrey Douglas Harvey, Andrea Tagliasacchi, Jonathan Taylor, Paul Debevec, Peter Joseph Denny, Sean Ryan Francesco Fanello, Graham Fyffe, Jason Angelo Dourgarian, Xueming Yu, Adarsh Prakash Murthy Kowdle, Julien Pascal Christophe Valentin, Peter Christopher Lincoln, Rohit Kumar Pandey, Christian Häne, Shahram Izadi