Patents by Inventor Leonhard Markus Helminger

Leonhard Markus Helminger has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

FACE SWAPPING WITH NEURAL NETWORK-BASED GEOMETRY REFINING

Publication number: 20250028787

Abstract: Various embodiments set forth systems and techniques for changing a face within an image. The techniques include receiving a first image including a face associated with a first facial identity; generating, via a machine learning model, at least a first texture map and a first position map based on the first image; rendering a second image including a face associated with a second facial identity based on the first texture map and the first position map, wherein the second facial identity is different from the first facial identity.

Type: Application

Filed: October 7, 2024

Publication date: January 23, 2025

Inventors: Jacek Krzysztof NARUNIEC, Derek Edward BRADLEY, Paulo Fabiano URNAU GOTARDO, Leonhard Markus HELMINGER, Christopher Andreas OTTO, Christopher Richard SCHROERS, Romann Matthew WEBER
Machine Learning Model-Based Video Compression

Publication number: 20250016382

Abstract: A system processing hardware executes a machine learning (ML) model-based video compression encoder to receive uncompressed video content and corresponding motion compensated video content, compare the uncompressed and motion compensated video content to identify an image space residual, transform the image space residual to a latent space representation of the uncompressed video content, and transform, using a trained image compression ML model, the motion compensated video content to a latent space representation of the motion compensated video content.

Type: Application

Filed: September 13, 2024

Publication date: January 9, 2025

Inventors: Abdelaziz Djelouah, Leonhard Markus Helminger, Roberto Gerson de Albuquerque Azevedo, Scott Labrozzi, Christopher Richard Schroers, Yuanyi Xue
Data object classification using an optimized neural network

Patent number: 12169778

Abstract: A system includes a computing platform having a hardware processor and a memory storing a software code and a neural network (NN) having multiple layers including a last activation layer and a loss layer. The hardware processor executes the software code to identify different combinations of layers for testing the NN, each combination including candidate function(s) for the last activation layer and candidate function(s) for the loss layer. For each different combination, the software code configures the NN based on the combination, inputs, into the configured NN, a training dataset including multiple data objects, receives, from the configured NN, a classification of the data objects, and generates a performance assessment for the combination based on the classification. The software code determines a preferred combination of layers for the NN including selected candidate functions for the last activation layer and the loss layer, based on a comparison of the performance assessments.

Type: Grant

Filed: May 4, 2023

Date of Patent: December 17, 2024

Assignees: Disney Enterprises, Inc., ETH Zürich (EIDGENÖSSISCHE TECHNISCHE HOCHSCHULE ZÜRICH)

Inventors: Hayko Jochen Wilhelm Riemenschneider, Leonhard Markus Helminger, Christopher Richard Schroers, Abdelaziz Djelouah
Machine learning model-based video compression

Patent number: 12120359

Abstract: A system processing hardware executes a machine learning (ML) model-based video compression encoder to receive uncompressed video content and corresponding motion compensated video content, compare the uncompressed and motion compensated video content to identify an image space residual, transform the image space residual to a latent space representation of the uncompressed video content, and transform, using a trained image compression ML model, the motion compensated video content to a latent space representation of the motion compensated video content.

Type: Grant

Filed: March 25, 2022

Date of Patent: October 15, 2024

Assignees: Disney Enterprises, Inc., ETH Zürich (EIDGENÖSSISCHE TECHNISCHE HOCHSCHULE ZÜRICH)

Inventors: Abdelaziz Djelouah, Leonhard Markus Helminger, Roberto Gerson De Albuquerque Azevedo, Scott Labrozzi, Christopher Richard Schroers, Yuanyi Xue
Face swapping with neural network-based geometry refining

Patent number: 12111880

Abstract: Various embodiments set forth systems and techniques for changing a face within an image. The techniques include receiving a first image including a face associated with a first facial identity; generating, via a machine learning model, at least a first texture map and a first position map based on the first image; rendering a second image including a face associated with a second facial identity based on the first texture map and the first position map, wherein the second facial identity is different from the first facial identity.

Type: Grant

Filed: September 24, 2021

Date of Patent: October 8, 2024

Assignees: DISNEY ENTERPRISES, INC., ETH Zurich (Eidgenssische Technische Hochschule Zurich)

Inventors: Jacek Krzysztof Naruniec, Derek Edward Bradley, Paulo Fabiano Urnau Gotardo, Leonhard Markus Helminger, Christopher Andreas Otto, Christopher Richard Schroers, Romann Matthew Weber
Image compression using normalizing flows

Patent number: 12087024

Abstract: According to one implementation, an image compression system includes a computing platform having a hardware processor and a system memory storing a software code. The hardware processor executes the software code to receive an input image, transform the input image to a latent space representation of the input image, and quantize the latent space representation of the input image to produce multiple quantized latents. The hardware processor further executes the software code to encode the quantized latents using a probability density function of the latent space representation of the input image, to generate a bitstream, and convert the bitstream into an output image corresponding to the input image. The probability density function of the latent space representation of the input image is obtained based on a normalizing flow mapping of one of the input image or the latent space representation of the input image.

Type: Grant

Filed: March 6, 2020

Date of Patent: September 10, 2024

Assignees: Disney Enterprises, Inc., ETH Zurich

Inventors: Abdelaziz Djelouah, Leonhard Markus Helminger, Scott Labrozzi, Yuanyi Xue, Erika Varis Doggett, Jared McPhillen, Christopher Richard Schroers
Microdosing For Low Bitrate Video Compression

Publication number: 20240283957

Abstract: A system includes a machine learning (ML) model-based video encoder configured to receive an uncompressed video sequence including multiple video frames, determine, from among the multiple video frames, a first video frame subset and a second video frame subset, encode the first video frame subset to produce a first compressed video frame subset, and identify a first decompression data for the first compressed video frame subset. The ML model-based video encoder is further configured to encode the second video frame subset to produce a second compressed video frame subset, and identify a second decompression data for the second compressed video frame subset. The first decompression data is specific to decoding the first compressed video frame subset but not the second compressed video frame subset, and the second decompression data is specific to decoding the second compressed video frame subset but not the first compressed video frame subset.

Type: Application

Filed: May 2, 2024

Publication date: August 22, 2024

Inventors: Abdelaziz Djelouah, Leonhard Markus Helminger, Roberto Gerson de Albuquerque Azevedo, Christopher Richard Schroers, Scott Labrozzi, Yuanyi Xue
Microdosing for low bitrate video compression

Patent number: 12010335

Abstract: A system includes a machine learning (ML) model-based video encoder configured to receive an uncompressed video sequence including multiple video frames, determine, from among the multiple video frames, a first video frame subset and a second video frame subset, encode the first video frame subset to produce a first compressed video frame subset, and identify a first decompression data for the first compressed video frame subset. The ML model-based video encoder is further configured to encode the second video frame subset to produce a second compressed video frame subset, and identify a second decompression data for the second compressed video frame subset. The first decompression data is specific to decoding the first compressed video frame subset but not the second compressed video frame subset, and the second decompression data is specific to decoding the second compressed video frame subset but not the first compressed video frame subset.

Type: Grant

Filed: March 25, 2022

Date of Patent: June 11, 2024

Assignee: Disney Enterprises, Inc.

Inventors: Abdelaziz Djelouah, Leonhard Markus Helminger, Roberto Gerson de Albuquerque Azevedo, Christopher Richard Schroers, Scott Labrozzi, Yuanyi Xue
Data Object Classification Using an Optimized Neural Network

Publication number: 20230274138

Abstract: A system includes a computing platform having a hardware processor and a memory storing a software code and a neural network (NN) having multiple layers including a last activation layer and a loss layer. The hardware processor executes the software code to identify different combinations of layers for testing the NN, each combination including candidate function(s) for the last activation layer and candidate function(s) for the loss layer. For each different combination, the software code configures the NN based on the combination, inputs, into the configured NN, a training dataset including multiple data objects, receives, from the configured NN, a classification of the data objects, and generates a performance assessment for the combination based on the classification. The software code determines a preferred combination of layers for the NN including selected candidate functions for the last activation layer and the loss layer, based on a comparison of the performance assessments.

Type: Application

Filed: May 4, 2023

Publication date: August 31, 2023

Inventors: Hayko Jochen Wilhelm Riemenschneider, Leonhard Markus Helminger, Christopher Richard Schroers, Abdelaziz Djelouah
Data object classification using an optimized neural network

Patent number: 11669723

Abstract: A system includes a computing platform having a hardware processor and a memory storing a software code and a neural network (NN) having multiple layers including a last activation layer and a loss layer. The hardware processor executes the software code to identify different combinations of layers for testing the NN, each combination including candidate function(s) for the last activation layer and candidate function(s) for the loss layer. For each different combination, the software code configures the NN based on the combination, inputs, into the configured NN, a training dataset including multiple data objects, receives, from the configured NN, a classification of the data objects, and generates a performance assessment for the combination based on the classification. The software code determines a preferred combination of layers for the NN including selected candidate functions for the last activation layer and the loss layer, based on a comparison of the performance assessments.

Type: Grant

Filed: September 16, 2022

Date of Patent: June 6, 2023

Assignees: Disney Enterprises, Inc., ETH Zürich (EIDGENÖSSISCHE TECHNISCHE HOCHSCHULE ZÜRICH)

Inventors: Hayko Jochen Wilhelm Riemenschneider, Leonhard Markus Helminger, Christopher Richard Schroers, Abdelaziz Djelouah
Automated annotation of heterogeneous content

Patent number: 11599804

Abstract: A system includes a computing platform having a hardware processor, and a system memory storing a software code and a content labeling predictive model. The hardware processor is configured to execute the software code to scan a database to identify content assets stored in the database, parse metadata stored in the database to identify labels associated with the content assets, and generate a graph by creating multiple first links linking each of the content assets to its corresponding label or labels. The hardware processor is configured to further execute the software code to train, using the graph, the content labeling predictive model, to identify, using the trained content labeling predictive model, multiple second links among the content assets and the labels, and to annotate the content assets based on the second links.

Type: Grant

Filed: April 17, 2020

Date of Patent: March 7, 2023

Assignees: Disney Enterprises, Inc., ETH Zurich

Inventors: Hayko Jochen Wilhelm Riemenschneider, Leonhard Markus Helminger, Abdelaziz Djelouah, Christopher Richard Schroers
Data Object Classification Using an Optimized Neural Network

Publication number: 20230009121

Abstract: A system includes a computing platform having a hardware processor and a memory storing a software code and a neural network (NN) having multiple layers including a last activation layer and a loss layer. The hardware processor executes the software code to identify different combinations of layers for testing the NN, each combination including candidate function(s) for the last activation layer and candidate function(s) for the loss layer. For each different combination, the software code configures the NN based on the combination, inputs, into the configured NN, a training dataset including multiple data objects, receives, from the configured NN, a classification of the data objects, and generates a performance assessment for the combination based on the classification. The software code determines a preferred combination of layers for the NN including selected candidate functions for the last activation layer and the loss layer, based on a comparison of the performance assessments.

Type: Application

Filed: September 16, 2022

Publication date: January 12, 2023

Inventors: Hayko Jochen Wilhelm Riemenschneider, Leonhard Markus Helminger, Christopher Richard Schroers, Abdelaziz Djelouah
FACE SWAPPING WITH NEURAL NETWORK-BASED GEOMETRY REFINING

Publication number: 20220374649

Abstract: Various embodiments set forth systems and techniques for changing a face within an image. The techniques include receiving a first image including a face associated with a first facial identity; generating, via a machine learning model, at least a first texture map and a first position map based on the first image; rendering a second image including a face associated with a second facial identity based on the first texture map and the first position map, wherein the second facial identity is different from the first facial identity.

Type: Application

Filed: September 24, 2021

Publication date: November 24, 2022

Inventors: Jacek Krzysztof NARUNIEC, Derek Edward BRADLEY, Paulo Fabiano Urnau GOTARDO, Leonhard Markus HELMINGER, Christopher Andreas OTTO, Christopher Richard SCHROERS, Romann Matthew WEBER
Microdosing For Low Bitrate Video Compression

Publication number: 20220337852

Abstract: A system includes a machine learning (ML) model-based video encoder configured to receive an uncompressed video sequence including multiple video frames, determine, from among the multiple video frames, a first video frame subset and a second video frame subset, encode the first video frame subset to produce a first compressed video frame subset, and identify a first decompression data for the first compressed video frame subset. The ML model-based video encoder is further configured to encode the second video frame subset to produce a second compressed video frame subset, and identify a second decompression data for the second compressed video frame subset. The first decompression data is specific to decoding the first compressed video frame subset but not the second compressed video frame subset, and the second decompression data is specific to decoding the second compressed video frame subset but not the first compressed video frame subset.

Type: Application

Filed: March 25, 2022

Publication date: October 20, 2022

Inventors: Abdelaziz Djelouah, Leonhard Markus Helminger, Roberto Gerson de Albuquerque Azevedo, Christopher Richard Schroers, Scott Labrozzi, Yuanyi Xue
Image enhancement using normalizing flows

Patent number: 11475543

Abstract: According to one implementation, an image enhancement system includes a computing platform including a hardware processor and a system memory storing a software code configured to provide a normalizing flow based generative model trained using an objective function. The hardware processor executes the software code to receive an input image, transform the input image to a latent space representation of the input image using the normalizing flow based generative model, and perform an optimization of the latent space representation of the input image to identify an enhanced latent space representation of the input image. The software code then uses the normalizing flow based generative model to reverse transform the enhanced latent space representation of the input image to an enhanced image corresponding to the input image.

Type: Grant

Filed: July 1, 2020

Date of Patent: October 18, 2022

Assignees: Disney Enterprises, Inc., ETH Zurich

Inventors: Abdelaziz Djelouah, Leonhard Markus Helminger, Michael Bernasconi, Christopher Richard Schroers
Data object classification using an optimized neural network

Patent number: 11475280

Abstract: A system includes a computing platform having a hardware processor and a memory storing a software code and a neural network (NN) having multiple layers including a last activation layer and a loss layer. The hardware processor executes the software code to identify different combinations of layers for testing the NN, each combination including candidate function(s) for the last activation layer and candidate function(s) for the loss layer. For each different combination, the software code configures the NN based on the combination, inputs, into the configured NN, a training dataset including multiple data objects, receives, from the configured NN, a classification of the data objects, and generates a performance assessment for the combination based on the classification. The software code determines a preferred combination of layers for the NN including selected candidate functions for the last activation layer and the loss layer, based on a comparison of the performance assessments.

Type: Grant

Filed: March 3, 2020

Date of Patent: October 18, 2022

Assignees: Disney Enterprises, Inc., ETH Zurich

Inventors: Hayko Jochen Wilhelm Riemenschneider, Leonhard Markus Helminger, Christopher Richard Schroers, Abdelaziz Djelouah
Machine Learning Model-Based Video Compression

Publication number: 20220329876

Abstract: A system processing hard e executes a machine learning (ML) model-based video compression encoder to receive uncompressed video content and corresponding motion compensated video content, compare the uncompressed and motion compensated video content to identify an image space residual, transform the image space residual to a latent space representation of the uncompressed video content, and transform, using a trained image compression ML model, the motion compensated video content to a latent space representation of the motion compensated video content.

Type: Application

Filed: March 25, 2022

Publication date: October 13, 2022

Inventors: Abdelaziz Djelouah, Leonhard Markus Helminger, Roberto Gerson de Albuquerque Azevedo, Scott Labrozzi, Christopher Richard Schroers, Yuanyi Xue
Three-dimensional geometry-based models for changing facial identities in video frames and images

Patent number: 11222466

Abstract: Techniques are disclosed for changing the identities of faces in video frames and images. In embodiments, three-dimensional (3D) geometry of a face is used to inform the facial identity change produced by an image-to-image translation model, such as a comb network model. In some embodiments, the model can take a two-dimensional (2D) texture map and/or a 3D displacement map associated with one facial identity as inputs and output another 2D texture map and/or 3D displacement map associated with a different facial identity. The other 2D texture map and/or 3D displacement map can then be used to render an image that includes the different facial identity.

Type: Grant

Filed: September 30, 2020

Date of Patent: January 11, 2022

Assignees: Disney Enterprises, Inc., ETH Zürich (Eidgenössische Technische Hochschule Zürich)

Inventors: Jacek Krzysztof Naruniec, Derek Edward Bradley, Thomas Etterlin, Paulo Fabiano Urnau Gotardo, Leonhard Markus Helminger, Christopher Richard Schroers, Romann Matthew Weber
Image Enhancement Using Normalizing Flows

Publication number: 20220005161

Abstract: According to one implementation, an image enhancement system includes a computing platform including a hardware processor and a system memory storing a software code configured to provide a normalizing flow based generative model trained using an objective function. The hardware processor executes the software code to receive an input image, transform the input image to a latent space representation of the input image using the normalizing flow based generative model, and perform an optimization of the latent space representation of the input image to identify an enhanced latent space representation of the input image. The software code then uses the normalizing flow based generative model to reverse transform the enhanced latent space representation of the input image to an enhanced image corresponding to the input image.

Type: Application

Filed: July 1, 2020

Publication date: January 6, 2022

Inventors: Abdelaziz Djelouah, Leonhard Markus Helminger, Michael Bernasconi, Christopher Richard Schroers
Automated Annotation Of Heterogeneous Content

Publication number: 20210326720

Abstract: A system includes a computing platform having a hardware processor, and a system memory storing a software code and a content labeling predictive model. The hardware processor is configured to execute the software code to scan a database to identify content assets stored in the database, parse metadata stored in the database to identify labels associated with the content assets, and generate a graph by creating multiple first links linking each of the content assets to its corresponding label or labels. The hardware processor is configured to further execute the software code to train, using the graph, the content labeling predictive model, to identify, using the trained content labeling predictive model, multiple second links among the content assets and the labels, and to annotate the content assets based on the second links.

Type: Application

Filed: April 17, 2020

Publication date: October 21, 2021

Inventors: Hayko Jochen Wilhelm Riemenschneider, Leonhard Markus Helminger, Abdelaziz Djelouah, Christopher Richard Schroers

1 2 next