Patents by Inventor Sean Ryan Francesco Fanello
Sean Ryan Francesco Fanello has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Publication number: 20250238904
Abstract: Apparatus and methods related to light redistribution in images are provided. An example method includes receiving, by a computing device, an input image comprising a subject. The method further includes adjusting, by a neural network, one or more of a specular component or a diffuse component associated with the input image. The adjusting involves redistributing a per-pixel light energy of the input image. The method additionally includes predicting, by the neural network, an output image comprising the subject with the adjusted one or more of the specular component or the diffuse component.
Type: Application
Filed: October 22, 2021
Publication date: July 24, 2025
Inventors: Rohit Kumar Pandey, Chloe LeGendre, Sergio Orts Escolano, Sean Ryan Francesco Fanello, Paul Debevec, Navin Padman Sarma, Christian Haene
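The energy-redistribution idea in this abstract can be illustrated with a minimal sketch. The function below assumes a simple additive diffuse/specular split and a scalar `specular_gain`; these names and the linear rebalancing are illustrative assumptions, not the patented neural network.

```python
import numpy as np

def redistribute_light(diffuse, specular, specular_gain=0.5):
    """Rescale the specular component, then renormalize so the
    per-pixel light energy (diffuse + specular) is preserved."""
    energy = diffuse + specular              # per-pixel light energy
    new_specular = specular * specular_gain  # damp or boost highlights
    new_diffuse = energy - new_specular      # redistribute the remainder
    return new_diffuse, new_specular
```

The key invariant, per the abstract, is that the adjustment redistributes rather than creates light: the sum of the two components is unchanged at every pixel.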
-
Publication number: 20250227383
Abstract: A method including capturing, by a wearable device, a plurality of images each having a first resolution, processing, by the wearable device, the plurality of images to generate a first image having a second resolution, the second resolution being smaller than the first resolution, selecting, by the wearable device, a second image from the plurality of images having the first resolution based on a setting of the wearable device, and communicating, by the wearable device to a companion device, the first image and the second image. The method further includes processing, by the companion device, the first image, and merging, by the companion device, the processed first image with the second image to generate a high dynamic range (HDR) image.
Type: Application
Filed: July 26, 2022
Publication date: July 10, 2025
Inventors: Sean Ryan Francesco Fanello, Ahmet Cengiz Ă–ztireli, Sakar Khattar
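The split-processing idea above (wearable produces a small image from a burst, a companion device fuses it with one full-resolution frame) can be sketched as follows. The averaging, the stride-based downsampling, and the simple 50/50 blend are stand-ins for whatever processing the devices actually perform; all names are assumptions.

```python
import numpy as np

def merge_hdr(burst, selected_index=0, scale=4):
    """Average a burst of full-resolution frames, downsample the result
    (the smaller first image), then fuse it back with one selected
    full-resolution frame to approximate the merge step."""
    avg = burst.mean(axis=0)           # denoised estimate from the burst
    low = avg[::scale, ::scale]        # second, smaller resolution
    selected = burst[selected_index]   # full-resolution second image
    # Upsample the low-resolution image and blend with the selected frame.
    up = np.repeat(np.repeat(low, scale, axis=0), scale, axis=1)
    return 0.5 * up + 0.5 * selected
```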
-
Patent number: 12254406
Abstract: A method including, in a training phase, training a gaze prediction model including a first model and a second model, the first model and the second model being configured in conjunction to predict segmentation data based on training data, training a third model together with the first model and the second model, the third model being configured to predict a training characteristic using an output of the first model based on the training data, and in an operational phase, receiving operational data and predicting an operational characteristic using the trained first model and the trained third model.
Type: Grant
Filed: June 22, 2021
Date of Patent: March 18, 2025
Assignee: Google LLC
Inventors: Onur G. Guleryuz, Sean Ryan Francesco Fanello
-
Publication number: 20250045968
Abstract: Nonlinear peri-codec optimization for image and video coding includes obtaining a source image including pixel values expressed in a first defined image sample space, generating a neuralized image representing the source image, the neuralized image including pixel values that are expressed as neural latent space values, encoding the input image wherein the neural latent space values are used as pixel values in a second defined image sample space and the input image is in an operative image format of the encoder, such that a decoder decodes the encoded image to obtain a reconstructed image in the second defined image sample space, wherein the reconstructed image is a reconstructed neuralized image including reconstructed neural latent space values, such that a deneuralized reconstructed image corresponding to the source image is obtained by a nonlinear post-codec image processor in the first defined image sample space.
Type: Application
Filed: June 16, 2021
Publication date: February 6, 2025
Inventors: Onur G. Guleryuz, Ruofei Du, Hugues H. Hoppe, Sean Ryan Francesco Fanello, Philip Andrew Chou, Danhang Tang, Philip Davidson
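The peri-codec pipeline above (nonlinear map into a latent sample space, standard codec operating on latent values as if they were pixels, inverse map after decoding) can be sketched end to end. A simple power curve stands in for the learned neuralize/deneuralize transforms, and a uniform quantizer stands in for the encoder/decoder; all of this is illustrative.

```python
import numpy as np

def neuralize(img, gamma=0.5):
    # Nonlinear pre-codec mapping into the "latent" sample space.
    return img ** gamma

def deneuralize(latent, gamma=0.5):
    # Nonlinear post-codec inverse mapping back to the source space.
    return latent ** (1.0 / gamma)

def codec_roundtrip(img, levels=256):
    latent = neuralize(img)
    # A uniform quantizer stands in for the standard codec, which treats
    # latent values as ordinary pixel values in its own sample space.
    quantized = np.round(latent * (levels - 1)) / (levels - 1)
    return deneuralize(quantized)
```

Because quantization happens in the latent space, the reconstruction error in the source space depends on the shape of the nonlinear map, which is exactly the degree of freedom the optimization exploits.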
-
Publication number: 20240303908
Abstract: A method including generating a first vector based on a first grid and a three-dimensional (3D) position associated with a first implicit representation (IR) of a 3D object, generating at least one second vector based on at least one second grid and an upsampled first grid, decoding the first vector to generate a second IR of the 3D object, decoding the at least one second vector to generate at least one third IR of the 3D object, generating a composite IR of the 3D object based on the second IR of the 3D object and the at least one third IR of the 3D object, and generating a reconstructed volume representing the 3D object based on the composite IR of the 3D object.
Type: Application
Filed: April 30, 2021
Publication date: September 12, 2024
Inventors: Yinda Zhang, Danhang Tang, Ruofei Du, Zhang Chen, Kyle Genova, Sofien Bouaziz, Thomas Allen Funkhouser, Sean Ryan Francesco Fanello, Christian Haene
-
Publication number: 20240290025
Abstract: A method comprises receiving a first sequence of images of a portion of a user, the first sequence of images being monocular images; generating an avatar based on the first sequence of images, the avatar being based on a model including a feature vector associated with a vertex; receiving a second sequence of images of the portion of the user; and based on the second sequence of images, modifying the avatar with a displacement of the vertex to represent a gesture of the avatar.
Type: Application
Filed: February 27, 2024
Publication date: August 29, 2024
Inventors: Yinda Zhang, Sean Ryan Francesco Fanello, Ziqian Bai, Feitong Tan, Zeng Huang, Kripasindhu Sarkar, Danhang Tang, Di Qiu, Abhimitra Meka, Ruofei Du, Mingsong Dou, Sergio Orts Escolano, Rohit Kumar Pandey, Thabo Beeler
-
Patent number: 12066282
Abstract: A lighting stage includes a plurality of lights that project alternating spherical color gradient illumination patterns onto an object or human performer at a predetermined frequency. The lighting stage also includes a plurality of cameras that capture images of an object or human performer corresponding to the alternating spherical color gradient illumination patterns. The lighting stage also includes a plurality of depth sensors that capture depth maps of the object or human performer at the predetermined frequency. The lighting stage also includes (or is associated with) one or more processors that implement a machine learning algorithm to produce a three-dimensional (3D) model of the object or human performer. The 3D model includes relighting parameters used to relight the 3D model under different lighting conditions.
Type: Grant
Filed: November 11, 2020
Date of Patent: August 20, 2024
Assignee: GOOGLE LLC
Inventors: Sean Ryan Francesco Fanello, Kaiwen Guo, Peter Christopher Lincoln, Philip Lindsley Davidson, Jessica L. Busch, Xueming Yu, Geoffrey Harvey, Sergio Orts Escolano, Rohit Kumar Pandey, Jason Dourgarian, Danhang Tang, Adarsh Prakash Murthy Kowdle, Emily B. Cooper, Mingsong Dou, Graham Fyffe, Christoph Rhemann, Jonathan James Taylor, Shahram Izadi, Paul Ernest Debevec
-
Publication number: 20240212106
Abstract: Apparatus and methods related to applying lighting models to images are provided. An example method includes receiving, via a computing device, an image comprising a subject. The method further includes relighting, via a neural network, a foreground of the image to maintain a consistent lighting of the foreground with a target illumination. The relighting is based on a per-pixel light representation indicative of a surface geometry of the foreground. The light representation includes a specular component, and a diffuse component, of surface reflection. The method additionally includes predicting, via the neural network, an output image comprising the subject in the relit foreground. One or more neural networks can be trained to perform one or more of the aforementioned aspects.
Type: Application
Filed: April 28, 2021
Publication date: June 27, 2024
Inventors: Chloe LeGendre, Paul Debevec, Sean Ryan Francesco Fanello, Rohit Kumar Pandey, Sergio Orts Escolano, Christian Haene, Sofien Bouaziz
-
Publication number: 20240212325
Abstract: Systems and methods for training models to predict dense correspondences across images such as human images. A model may be trained using synthetic training data created from one or more 3D computer models of a subject. In addition, one or more geodesic distances derived from the surfaces of one or more of the 3D models may be used to generate one or more loss values, which may in turn be used in modifying the model's parameters during training.
Type: Application
Filed: March 6, 2024
Publication date: June 27, 2024
Inventors: Yinda Zhang, Feitong Tan, Danhang Tang, Mingsong Dou, Kaiwen Guo, Sean Ryan Francesco Fanello, Sofien Bouaziz, Cem Keskin, Ruofei Du, Rohit Kumar Pandey, Deqing Sun
-
Patent number: 11954899
Abstract: Systems and methods for training models to predict dense correspondences across images such as human images. A model may be trained using synthetic training data created from one or more 3D computer models of a subject. In addition, one or more geodesic distances derived from the surfaces of one or more of the 3D models may be used to generate one or more loss values, which may in turn be used in modifying the model's parameters during training.
Type: Grant
Filed: March 11, 2021
Date of Patent: April 9, 2024
Assignee: GOOGLE LLC
Inventors: Yinda Zhang, Feitong Tan, Danhang Tang, Mingsong Dou, Kaiwen Guo, Sean Ryan Francesco Fanello, Sofien Bouaziz, Cem Keskin, Ruofei Du, Rohit Kumar Pandey, Deqing Sun
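The geodesic-loss idea shared by this patent family can be sketched compactly: a wrong correspondence is penalized by the on-surface (geodesic) distance between the predicted and ground-truth vertices rather than uniformly. The precomputed distance matrix and the function names below are illustrative assumptions.

```python
import numpy as np

def geodesic_loss(pred_vertices, true_vertices, geodesic):
    """Mean geodesic distance between predicted and ground-truth
    vertex indices; `geodesic` is a precomputed V x V matrix of
    on-surface distances derived from the 3D model."""
    return float(np.mean(geodesic[pred_vertices, true_vertices]))
```

A nearby-but-wrong vertex thus incurs a small loss while a distant one incurs a large loss, which is the gradient signal the training procedure exploits.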
-
Publication number: 20240046618
Abstract: Systems and methods for training models to predict dense correspondences across images such as human images. A model may be trained using synthetic training data created from one or more 3D computer models of a subject. In addition, one or more geodesic distances derived from the surfaces of one or more of the 3D models may be used to generate one or more loss values, which may in turn be used in modifying the model's parameters during training.
Type: Application
Filed: March 11, 2021
Publication date: February 8, 2024
Inventors: Yinda Zhang, Feitong Tan, Danhang Tang, Mingsong Dou, Kaiwen Guo, Sean Ryan Francesco Fanello, Sofien Bouaziz, Cem Keskin, Ruofei Du, Rohit Kumar Pandey, Deqing Sun
-
Publication number: 20240020915
Abstract: Techniques include introducing a neural generator configured to produce novel faces that can be rendered at free camera viewpoints (e.g., at any angle with respect to the camera) and relit under an arbitrary high dynamic range (HDR) light map. A neural implicit intrinsic field takes a randomly sampled latent vector as input and produces as output per-point albedo, volume density, and reflectance properties for any queried 3D location. These outputs are aggregated via a volumetric rendering to produce low resolution albedo, diffuse shading, specular shading, and neural feature maps. The low resolution maps are then upsampled to produce high resolution maps and input into a neural renderer to produce relit images.
Type: Application
Filed: July 17, 2023
Publication date: January 18, 2024
Inventors: Yinda Zhang, Feitong Tan, Sean Ryan Francesco Fanello, Abhimitra Meka, Sergio Orts Escolano, Danhang Tang, Rohit Kumar Pandey, Jonathan James Taylor
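The aggregation step named in this abstract is standard emission-absorption volume rendering: per-point albedo samples along a ray are composited using their volume densities. The sketch below shows that compositing for a single ray; the learned intrinsic field that would supply the samples is not modeled.

```python
import numpy as np

def composite_ray(albedo, density, step=0.1):
    """Composite per-sample albedo values along one ray using volume
    densities (emission-absorption rendering with fixed step size)."""
    alpha = 1.0 - np.exp(-density * step)      # opacity of each sample
    # Transmittance: how much light survives to reach each sample.
    trans = np.cumprod(np.concatenate(([1.0], (1.0 - alpha)[:-1])))
    weights = trans * alpha                    # contribution per sample
    return np.sum(weights * albedo)
```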
-
Patent number: 11868523
Abstract: Techniques of tracking a user's gaze include identifying a region of a display at which a gaze of a user is directed, the region including a plurality of pixels. By determining a region rather than a point, when the regions correspond to elements of a user interface, the improved technique enables a system to activate the element corresponding to a determined region. In some implementations, the system makes the determination using a classification engine including a convolutional neural network; such an engine takes as input images of the user's eye and outputs a list of probabilities that the gaze is directed to each of the regions.
Type: Grant
Filed: July 1, 2021
Date of Patent: January 9, 2024
Assignee: GOOGLE LLC
Inventors: Ivana Tosic Rodgers, Sean Ryan Francesco Fanello, Sofien Bouaziz, Rohit Kumar Pandey, Eric Aboussouan, Adarsh Prakash Murthy Kowdle
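The final step the abstract describes, turning per-region scores into a probability list and activating the most likely element, can be sketched as follows. The convolutional network that would produce `logits` from eye images is elided, and the function and label names are assumptions.

```python
import numpy as np

def pick_region(logits, labels):
    """Convert per-region classifier scores into probabilities via a
    numerically stable softmax, then return the most likely UI element
    along with the full probability list."""
    exp = np.exp(logits - np.max(logits))  # subtract max for stability
    probs = exp / exp.sum()
    return labels[int(np.argmax(probs))], probs
```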
-
Publication number: 20230419600
Abstract: Example embodiments relate to techniques for volumetric performance capture with neural rendering. A technique may involve initially obtaining images that depict a subject from multiple viewpoints and under various lighting conditions using a light stage and depth data corresponding to the subject using infrared cameras. A neural network may extract features of the subject from the images based on the depth data and map the features into a texture space (e.g., the UV texture space). A neural renderer can be used to generate an output image depicting the subject from a target view such that illumination of the subject in the output image aligns with the target view. The neural renderer may resample the features of the subject from the texture space to an image space to generate the output image.
Type: Application
Filed: November 5, 2020
Publication date: December 28, 2023
Inventors: Sean Ryan Francesco Fanello, Abhi Meka, Rohit Kumar Pandey, Christian Haene, Sergio Orts Escolano, Christoph Rhemann, Paul Debevec, Sofien Bouaziz, Thabo Beeler, Ryan Overbeck, Peter Barnum, Daniel Erickson, Philip Davidson, Yinda Zhang, Jonathan Taylor, Chloe LeGendre, Shahram Izadi
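The resampling step at the end of this abstract, gathering features that live in UV texture space back into image space, can be sketched with a nearest-neighbor lookup. The neural renderer itself is elided, and all names below are assumptions.

```python
import numpy as np

def resample_to_image(texture_features, uv):
    """Gather texture-space features into image space using per-pixel
    (u, v) coordinates in [0, 1] (nearest-neighbor sampling)."""
    h, w, _ = texture_features.shape
    u = np.clip((uv[..., 0] * (w - 1)).round().astype(int), 0, w - 1)
    v = np.clip((uv[..., 1] * (h - 1)).round().astype(int), 0, h - 1)
    return texture_features[v, u]
```

A real renderer would use bilinear sampling and feed the resampled features through further network layers, but the texture-to-image gather is the structural core of the step.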
-
Publication number: 20230360182
Abstract: Apparatus and methods related to applying lighting models to images of objects are provided. An example method includes applying a geometry model to an input image to determine a surface orientation map indicative of a distribution of lighting on an object based on a surface geometry. The method further includes applying an environmental light estimation model to the input image to determine a direction of synthetic lighting to be applied to the input image. The method also includes applying, based on the surface orientation map and the direction of synthetic lighting, a light energy model to determine a quotient image indicative of an amount of light energy to be applied to each pixel of the input image. The method additionally includes enhancing, based on the quotient image, a portion of the input image. One or more neural networks can be trained to perform one or more of the aforementioned aspects.
Type: Application
Filed: May 17, 2021
Publication date: November 9, 2023
Inventors: Sean Ryan Francesco Fanello, Yun-Ta Tsai, Rohit Kumar Pandey, Paul Debevec, Michael Milne, Chloe LeGendre, Jonathan Tilton Barron, Christoph Rhemann, Sofien Bouaziz, Navin Padman Sarma
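The quotient-image step above can be sketched as a per-pixel multiplier derived from surface orientation and a synthetic light direction. Lambertian N·L shading stands in for the learned light-energy model, and the `ambient` floor and all names are illustrative assumptions.

```python
import numpy as np

def apply_quotient(image, orientation, light_dir, ambient=0.6):
    """Compute a per-pixel quotient from surface normals and a light
    direction, then multiply it into the input image to enhance it."""
    shading = np.clip(orientation @ light_dir, 0.0, 1.0)  # N . L
    quotient = ambient + (1.0 - ambient) * shading        # per-pixel energy
    return image * quotient
```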
-
Patent number: 11810313
Abstract: According to an aspect, a real-time active stereo system includes a capture system configured to capture stereo data, where the stereo data includes a first input image and a second input image, and a depth sensing computing system configured to predict a depth map. The depth sensing computing system includes a feature extractor configured to extract features from the first and second images at a plurality of resolutions, an initialization engine configured to generate a plurality of depth estimations, where each of the plurality of depth estimations corresponds to a different resolution, and a propagation engine configured to iteratively refine the plurality of depth estimations based on image warping and spatial propagation.Type: Grant
Filed: February 19, 2021
Date of Patent: November 7, 2023
Assignee: GOOGLE LLC
Inventors: Vladimir Tankovich, Christian Haene, Sean Ryan Francesco Fanello, Yinda Zhang, Shahram Izadi, Sofien Bouaziz, Adarsh Prakash Murthy Kowdle, Sameh Khamis
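The coarse-to-fine structure of the propagation engine can be sketched as an upsample-and-refine loop over per-resolution depth estimates. Simple averaging stands in for the learned warping and spatial propagation, so this is only a structural illustration; names are assumptions.

```python
import numpy as np

def refine_depth(estimates):
    """Start from the coarsest depth estimate, upsample it by 2x at
    each level, and blend it with the estimate at the next finer
    resolution (averaging stands in for learned refinement)."""
    depth = estimates[0]                    # coarsest hypothesis
    for finer in estimates[1:]:
        up = np.repeat(np.repeat(depth, 2, axis=0), 2, axis=1)
        depth = 0.5 * up + 0.5 * finer      # blend with finer estimate
    return depth
```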
-
Publication number: 20230154051
Abstract: Systems and methods are directed to encoding and/or decoding of the textures/geometry of a three-dimensional volumetric representation. An encoding computing system can obtain voxel blocks from a three-dimensional volumetric representation of an object. The encoding computing system can encode voxel blocks with a machine-learned voxel encoding model to obtain encoded voxel blocks. The encoding computing system can decode the encoded voxel blocks with a machine-learned voxel decoding model to obtain reconstructed voxel blocks. The encoding computing system can generate a reconstructed mesh representation of the object based at least in part on the one or more reconstructed voxel blocks. The encoding computing system can encode textures associated with the voxel blocks according to an encoding scheme and based at least in part on the reconstructed mesh representation of the object to obtain encoded textures.
Type: Application
Filed: April 17, 2020
Publication date: May 18, 2023
Inventors: Danhang Tang, Saurabh Singh, Cem Keskin, Phillip Andrew Chou, Christian Haene, Mingsong Dou, Sean Ryan Francesco Fanello, Jonathan Taylor, Andrea Tagliasacchi, Philip Lindsley Davidson, Yinda Zhang, Onur Gonen Guleryuz, Shahram Izadi, Sofien Bouaziz
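The encode/decode round trip over voxel blocks can be illustrated with trivial stand-ins for the machine-learned models: average pooling as the "encoder" and upsample-plus-threshold as the "decoder". Everything below is an assumption made for illustration, not the patented models.

```python
import numpy as np

def encode_block(block, k=2):
    """Stand-in 'encoder': average-pool a voxel occupancy block by k."""
    d, h, w = block.shape
    return block.reshape(d // k, k, h // k, k, w // k, k).mean(axis=(1, 3, 5))

def decode_block(code, k=2):
    """Stand-in 'decoder': upsample the code back to voxel resolution
    and threshold to occupancy."""
    up = code.repeat(k, axis=0).repeat(k, axis=1).repeat(k, axis=2)
    return (up > 0.5).astype(np.float32)
```

The point of the round trip is that texture encoding can be conditioned on the *reconstructed* geometry, so encoder and decoder agree on the surface even when the code is lossy.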
-
Publication number: 20230004216
Abstract: Techniques of tracking a user's gaze include identifying a region of a display at which a gaze of a user is directed, the region including a plurality of pixels. By determining a region rather than a point, when the regions correspond to elements of a user interface, the improved technique enables a system to activate the element corresponding to a determined region. In some implementations, the system makes the determination using a classification engine including a convolutional neural network; such an engine takes as input images of the user's eye and outputs a list of probabilities that the gaze is directed to each of the regions.
Type: Application
Filed: July 1, 2021
Publication date: January 5, 2023
Inventors: Ivana Tosic Rodgers, Sean Ryan Francesco Fanello, Sofien Bouaziz, Rohit Kumar Pandey, Eric Aboussouan, Adarsh Prakash Murthy Kowdle
-
Publication number: 20220405569
Abstract: A method including, in a training phase, training a gaze prediction model including a first model and a second model, the first model and the second model being configured in conjunction to predict segmentation data based on training data, training a third model together with the first model and the second model, the third model being configured to predict a training characteristic using an output of the first model based on the training data, and in an operational phase, receiving operational data and predicting an operational characteristic using the trained first model and the trained third model.
Type: Application
Filed: June 22, 2021
Publication date: December 22, 2022
Inventors: Onur G. Guleryuz, Sean Ryan Francesco Fanello
-
Patent number: 11328486
Abstract: A method includes receiving a first image including color data and depth data, determining a viewpoint associated with an augmented reality (AR) and/or virtual reality (VR) display displaying a second image, receiving at least one calibration image including an object in the first image, the object being in a different pose as compared to a pose of the object in the first image, and generating the second image based on the first image, the viewpoint and the at least one calibration image.
Type: Grant
Filed: April 29, 2020
Date of Patent: May 10, 2022
Assignee: Google LLC
Inventors: Anastasia Tkach, Ricardo Martin Brualla, Shahram Izadi, Shuoran Yang, Cem Keskin, Sean Ryan Francesco Fanello, Philip Davidson, Jonathan Taylor, Rohit Pandey, Andrea Tagliasacchi, Pavlo Pidlypenskyi