Patents by Inventor Christopher Richard Schroers

Christopher Richard Schroers has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

MULTI-CAMERA FACE SWAPPING

Publication number: 20240078726

Abstract: One embodiment of the present invention sets forth a technique for performing face swapping. The technique includes converting a first input image that depicts a first facial identity from a first viewpoint at a first time into a first latent representation and converting a second input image that depicts the first facial identity from a second viewpoint at the first time into a second latent representation. The technique also includes generating, via a first machine learning model, a first output image that depicts a second facial identity from the first viewpoint based on the first latent representation. The technique further includes generating, via the first machine learning model, a second output image that depicts the second facial identity from the second viewpoint based on the second latent representation.

Type: Application

Filed: September 7, 2022

Publication date: March 7, 2024

Inventors: Romann Matthew WEBER, Evan Matthew GOLDBERG, Jacek Krzysztof NARUNIEC, Christopher Richard SCHROERS
Codec Rate Distortion Compensating Downsampler

Publication number: 20230379475

Abstract: A system includes a machine learning (ML) model-based video downsampler configured to receive an input video sequence having a first display resolution, and to map the input video sequence to a lower resolution video sequence having a second display resolution lower than the first display resolution. The system also includes a neural network-based (NN-based) proxy video codec configured to transform the lower resolution video sequence into a decoded proxy bitstream. In addition, the system includes an upsampler configured to produce an output video sequence using the decoded proxy bitstream.

Type: Application

Filed: August 4, 2023

Publication date: November 23, 2023

Inventors: Christopher Richard Schroers, Roberto Gerson de Albuquerque Azevedo, Nicholas David Gregory, Yuanyi Xue, Scott Labrozzi, Abdelaziz Djelouah
TECHNIQUES FOR RESAMPLING IMAGES

Publication number: 20230377093

Abstract: Techniques are disclosed for resampling images. In some embodiments, a resampling model includes (1) one or more feature extraction layers that extract features from an input image and a degradation map; (2) one or more resampling layers that generate warped features from the extracted features and a warp grid; and (3) one or more prediction layers that generate, from the warped features, an output image or resampling kernels that can be applied to the input image to generate an output image. In some embodiments, the resampling model can be trained by applying degradation maps to output images in a training data set to generate corresponding input images, and training the resampling model using the input images and the corresponding output images.

Type: Application

Filed: May 19, 2023

Publication date: November 23, 2023

Inventors: Abdelaziz DJELOUAH, Michael Yves BERNASCONI, Farnood SALEHI, Christopher Richard SCHROERS
IDENTITY-PRESERVING IMAGE GENERATION USING DIFFUSION MODELS

Publication number: 20230377214

Abstract: One embodiment of the present invention sets forth a technique for performing identity-preserving image generation. The technique includes converting an identity image depicting a facial identity into an identity embedding. The technique further includes generating a combined embedding based on the identity embedding and a diffusion iteration identifier. The technique further includes converting, using a neural network and based on the combined embedding, a first input image that includes first noise into a first predicted image depicting one or more facial features that include one or more first facial identity features, wherein the one or more first facial identity features correspond to one or more respective second facial identity features of the identity image and are based at least on the identity embedding.

Type: Application

Filed: May 19, 2023

Publication date: November 23, 2023

Inventors: Manuel Jakob KANSY, Anton Julien RAËL, Jacek Krzysztof NARUNIEC, Christopher Richard SCHROERS, Romann Matthew WEBER
GENERATING AN IMAGE INCLUDING A SOURCE INDIVIDUAL

Publication number: 20230377213

Abstract: One embodiment of the present invention sets forth a technique for performing face swapping. The technique includes generating a latent representation of a first facial identity included in an input image. The technique further includes identifying a first identity-specific neural network layer associated with a second facial identity from a plurality of identity-specific neural network layers, wherein each neural network layer included in the plurality of identity-specific neural network layers is associated with a different facial identity. The technique further includes executing the first identity-specific neural network layer and one or more other neural network layers to generate one or more decoder input values corresponding to the latent representation. The technique further includes executing a decoder neural network that converts the one or more decoder input values into an output image depicting the second facial identity.

Type: Application

Filed: May 18, 2023

Publication date: November 23, 2023

Inventors: Jacek Krzysztof NARUNIEC, Manuel Jakob KANSY, Graziana MIGNONE, Christopher Richard SCHROERS, Romann Matthew WEBER
3D digital model surface rendering and conversion

Patent number: 11804007

Abstract: An image processing system includes a computing platform having processing hardware, a display, and a system memory storing a software code. The processing hardware is configured to execute the software code to receive a three-dimensional (3D) digital model, surround the 3D digital model with multiple virtual cameras oriented toward the 3D digital model, and generate, using the virtual cameras, a multiple renders of the 3D digital model. The processing hardware is further configured to execute the software code to generate a UV texture coordinate space for a surface projection of the 3D digital model, and to transfer, using the multiple renders, lighting color values for each of multiple surface portions of the 3D digital model to the UV texture coordinate space.

Type: Grant

Filed: March 31, 2021

Date of Patent: October 31, 2023

Assignee: Disney Enterprises, Inc.

Inventors: Dane M. Coffey, Siroberto Scerbo, Evan M. Goldberg, Christopher Richard Schroers, Daniel L Baker, Mark R. Mine, Erika Varis Doggett
TECHNIQUES FOR DENOISING VIDEOS

Publication number: 20230334626

Abstract: Techniques are disclosed for denoising videos. In some embodiments, video frames are denoised using a denoising model that includes an encoder-decoder architecture and attention modules. During training of the denoising model, the attention modules learn weightings to upweight certain dimensions of input features to help pixel registration, remove ghosting artifacts, and improve temporal consistency when the frames of a video are being denoised. The denoising model can also be used to train a student denoising model that has a same architecture as, but is smaller and faster than, the denoising model. After training, noisy video frames can be input into the denoising model and/or the student denoising model to generate corresponding denoised video frames.

Type: Application

Filed: April 14, 2022

Publication date: October 19, 2023

Inventors: Yang ZHANG, Tunc Ozan AYDIN, Christopher Richard SCHROERS
METHOD AND SYSTEM FOR LATENT-SPACE FACIAL FEATURE EDITING IN DEEP LEARNING BASED FACE SWAPPING

Publication number: 20230316587

Abstract: A computer-implemented method of changing a face within an output image or video frame that includes: receiving an input image that includes a face presenting a facial expression in a pose; processing the image with a neural network encoder to generate a latent space point that is an encoded representation of the image; decoding the latent space point to generate an initial output image in accordance with a desired facial identity but with the facial expression and pose of the face in the input image; identifying a feature of the facial expression in the initial output image to edit; applying an adjustment vector to a latent space point corresponding to the initial output image to generate an adjusted latent space point; and decoding the adjusted latent space point to generate an adjusted output image in accordance with the desired facial identity but with the facial expression and pose of the face in the input image altered in accordance with the adjustment vector

Type: Application

Filed: March 29, 2022

Publication date: October 5, 2023

Applicants: LUCASFILM ENTERTAINMENT COMPANY LTD. LLC, DISNEY ENTERPRISES, INC

Inventors: Sirak Ghebremusse, Stéphane Grabli, Jacek Krzysztof Naruniec, Romann Matthew Weber, Christopher Richard Schroers
METHOD AND SYSTEM FOR DEEP LEARNING BASED FACE SWAPPING WITH MULTIPLE ENCODERS

Publication number: 20230319223

Abstract: A computer-implemented method of changing a face within an output image or video frame includes: receiving an input image that includes a face presenting a facial expression in a pose; separately encoding different portions of the image by, for each separately encoded portion, generating a latent space point of the portion, thereby generating a plurality of multi-dimensional vectors where each multi-dimensional vector is an encoded representation of a different portion of the input image; concatenating the plurality of multi-dimensional vectors into a combined latent space vector; and decoding the combined latent space vector to generate the output image in accordance with a desired facial identity but with the facial expression and pose of the face in the input image

Type: Application

Filed: March 29, 2022

Publication date: October 5, 2023

Applicant: DISNEY ENTERPRISES, INC

Inventors: Jacek Krzysztof Naruniec, Romann Matthew Weber, Christopher Richard Schroers
Codec rate distortion compensating downsampler

Patent number: 11765360

Abstract: A system includes a machine learning (ML) model-based video downsampler configured to receive an input video sequence having a first display resolution, and to map the input video sequence to a lower resolution video sequence having a second display resolution lower than the first display resolution. The system also includes a neural network-based (NN-based) proxy video codec configured to transform the lower resolution video sequence into a decoded proxy bitstream. In addition, the system includes an upsampler configured to produce an output video sequence using the decoded proxy bitstream.

Type: Grant

Filed: October 13, 2021

Date of Patent: September 19, 2023

Assignees: Disney Enterprises, Inc., ETH Zurich

Inventors: Christopher Richard Schroers, Roberto Gerson de Albuquerque Azevedo, Nicholas David Gregory, Yuanyi Xue, Scott Labrozzi, Abdelaziz Djelouah
TECHNIQUES FOR PROCESSING VIDEOS USING TEMPORALLY-CONSISTENT TRANSFORMER MODEL

Publication number: 20230281757

Abstract: Techniques are disclosed for enhancing videos using a machine learning model that is a temporally-consistent transformer model. The machine learning model processes blocks of frames of a video in which the temporally first input video frame of each block of frames is a temporally second to last output video frame of a previous block of frames. After the machine learning model is trained, blocks of video frames, or features extracted from the video frames, can be warped using an optical flow technique and transformed using a wavelet transform technique. The transformed video frames are concatenated along a channel dimension and input into the machine learning model that generates corresponding processed video frames.

Type: Application

Filed: July 28, 2022

Publication date: September 7, 2023

Inventors: Yang ZHANG, Mingyang SONG, Tunc Ozan AYDIN, Christopher Richard SCHROERS
Data Object Classification Using an Optimized Neural Network

Publication number: 20230274138

Abstract: A system includes a computing platform having a hardware processor and a memory storing a software code and a neural network (NN) having multiple layers including a last activation layer and a loss layer. The hardware processor executes the software code to identify different combinations of layers for testing the NN, each combination including candidate function(s) for the last activation layer and candidate function(s) for the loss layer. For each different combination, the software code configures the NN based on the combination, inputs, into the configured NN, a training dataset including multiple data objects, receives, from the configured NN, a classification of the data objects, and generates a performance assessment for the combination based on the classification. The software code determines a preferred combination of layers for the NN including selected candidate functions for the last activation layer and the loss layer, based on a comparison of the performance assessments.

Type: Application

Filed: May 4, 2023

Publication date: August 31, 2023

Inventors: Hayko Jochen Wilhelm Riemenschneider, Leonhard Markus Helminger, Christopher Richard Schroers, Abdelaziz Djelouah
VIDEO REMASTERING VIA DEEP LEARNING

Publication number: 20230267706

Abstract: One embodiment of the present invention sets forth a technique for performing remastering of video content. The technique includes determining a first input frame corresponding to a first frame included in a first video and a first target frame corresponding to a second frame included in a second video based on one or more alignments between the first frame and the second frame. The technique also includes executing a machine learning model to convert the first input frame into a first output frame. The technique further includes training the machine learning model based on one or more losses associated with the first output frame and the first target frame.

Type: Application

Filed: February 21, 2023

Publication date: August 24, 2023

Inventors: Abdelaziz DJELOUAH, Shinobu HATTORI, Christopher Richard SCHROERS, Andrew John WAHLQUIST
Data object classification using an optimized neural network

Patent number: 11669723

Abstract: A system includes a computing platform having a hardware processor and a memory storing a software code and a neural network (NN) having multiple layers including a last activation layer and a loss layer. The hardware processor executes the software code to identify different combinations of layers for testing the NN, each combination including candidate function(s) for the last activation layer and candidate function(s) for the loss layer. For each different combination, the software code configures the NN based on the combination, inputs, into the configured NN, a training dataset including multiple data objects, receives, from the configured NN, a classification of the data objects, and generates a performance assessment for the combination based on the classification. The software code determines a preferred combination of layers for the NN including selected candidate functions for the last activation layer and the loss layer, based on a comparison of the performance assessments.

Type: Grant

Filed: September 16, 2022

Date of Patent: June 6, 2023

Assignees: Disney Enterprises, Inc., ETH Zürich (EIDGENÖSSISCHE TECHNISCHE HOCHSCHULE ZÜRICH)

Inventors: Hayko Jochen Wilhelm Riemenschneider, Leonhard Markus Helminger, Christopher Richard Schroers, Abdelaziz Djelouah
DEEP LEARNING FRAMEWORK FOR VIDEO REMASTERING

Publication number: 20230153952

Abstract: Restoration methods and systems are disclosed for video remastering. Techniques disclosed include receiving a video sequence. For each frame of the video sequence, techniques disclosed include encoding, by a degradation encoder, a video content associated with the frame into a latent vector. The latent vector is a representation of the degradation present in the video content; the degradation present in the video content includes one or more degradation types. Based on the latent vector and the video content, techniques disclosed further include generating, by a backbone network, one or more feature maps, and, then, restoring the frame based on the one or more feature maps.

Type: Application

Filed: February 11, 2022

Publication date: May 18, 2023

Applicant: DISNEY ENTERPRISES, INC.

Inventors: Abdelaziz Djelouah, Givi Meishvili, Christopher Richard Schroers, Shinobu Hattori
Method for temporal stabilization of landmark localization

Patent number: 11640676

Abstract: Various embodiments set forth systems and techniques for training a landmark model. The techniques include determining, using the landmark model, a first landmark in a set of first landmarks associated with a first image; performing, on the first image, a first perturbation to obtain a second image; determining, using the landmark model, a second landmark in a set of second landmarks associated with the second image; determining, based on a first distance between the first landmark and the second landmark, a first loss function; and updating, based on the first loss function, a first parameter of the landmark model.

Type: Grant

Filed: August 24, 2020

Date of Patent: May 2, 2023

Assignee: Disney Enterprises, Inc.

Inventors: Jacek Krzysztof Naruniec, Christopher Richard Schroers, Romann Matthew Weber
Codec Rate Distortion Compensating Downsampler

Publication number: 20230116696

Abstract: A system includes a machine learning (ML) model-based video downsampler configured to receive an input video sequence having a first display resolution, and to map the input video sequence to a lower resolution video sequence having a second display resolution lower than the first display resolution. The system also includes a neural network-based (NN-based) proxy video codec configured to transform the lower resolution video sequence into a decoded proxy bitstream. In addition, the system includes an upsampler configured to produce an output video sequence using the decoded proxy bitstream.

Type: Application

Filed: October 13, 2021

Publication date: April 13, 2023

Inventors: Christopher Richard Schroers, Roberto Gerson de Albuquerque Azevedo, Nicholas David Gregory, Yuanyi Xue, Scott Labrozzi, Abdelaziz Djelouah
Automated annotation of heterogeneous content

Patent number: 11599804

Abstract: A system includes a computing platform having a hardware processor, and a system memory storing a software code and a content labeling predictive model. The hardware processor is configured to execute the software code to scan a database to identify content assets stored in the database, parse metadata stored in the database to identify labels associated with the content assets, and generate a graph by creating multiple first links linking each of the content assets to its corresponding label or labels. The hardware processor is configured to further execute the software code to train, using the graph, the content labeling predictive model, to identify, using the trained content labeling predictive model, multiple second links among the content assets and the labels, and to annotate the content assets based on the second links.

Type: Grant

Filed: April 17, 2020

Date of Patent: March 7, 2023

Assignees: Disney Enterprises, Inc., ETH Zurich

Inventors: Hayko Jochen Wilhelm Riemenschneider, Leonhard Markus Helminger, Abdelaziz Djelouah, Christopher Richard Schroers
Re-Noising and Neural Network Based Image Enhancement

Publication number: 20230065392

Abstract: According to one implementation, a system for performing re-noising and neural network (NN) based image enhancement includes a computing platform having a processing hardware and a system memory storing a software code, a noise synthesizer, and an image restoration NN. The processing hardware is configured to execute the software code to receive a denoised image component and a noise component extracted from a degraded image, to generate, using the noise synthesizer and the noise component, synthesized noise corresponding to the noise component, and to interpolate, using the noise component and the synthesized noise, an output image noise. The processing hardware is further configured to execute the software code to enhance, using the image restoration NN, the denoised image component to provide an output image component, and to re-noise the output image component, using the output image noise, to produce an enhanced output image corresponding to the degraded image.

Type: Application

Filed: August 27, 2021

Publication date: March 2, 2023

Inventors: Abdelaziz Djelouah, Shinobu Hattori, Christopher Richard Schroers
Deinterlacing via deep learning

Patent number: 11570397

Abstract: One embodiment of the present invention sets forth a technique for performing deinterlacing. The technique includes separating a first interlaced video frame into a first sequence of fields ordered by time, the first sequence of fields including a first field. The technique also includes generating, by applying a deinterlacing network to a first field in the first sequence, a second field that is missing from the first sequence of fields and is complementary to the first field. The technique further includes constructing a progressive video frame based on the first field and the second field.

Type: Grant

Filed: July 10, 2020

Date of Patent: January 31, 2023

Assignees: DISNEY ENTERPRISES, INC., ETH ZÜRICH, (EIDGENÖSSISCHE TECHNISCHE HOCHSCHULE ZÜRICH)

Inventors: Michael Bernasconi, Daniel Konrad Dorda, Abdelaziz Djelouah, Shinobu Hattori, Christopher Richard Schroers

1 2 next