Patents by Inventor Robin Rombach

Robin Rombach has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

CONTENT SYNTHESIS USING GENERATIVE ARTIFICIAL INTELLIGENCE MODEL

Publication number: 20250299380

Abstract: A method including receiving an input from a user interface of a device, the input indicating a desired characteristic of an image. The method including transmitting a prompt indicating the desired characteristic to a set of servers with a request to generate the image, causing the set of servers to: generate, using a set of encoding models, a prompt encoding based on the prompt; generate, using a first transformer block of a diffusion transformer model, a first prompt embedding and a first image embedding based on the prompt encoding and a noise input; generate, using a second transformer block of the diffusion transformer model, a second image embedding based on the first image embedding and the first prompt embedding; and generate the image based on the second image embedding. The method including receiving the image from the set of servers and presenting the image on a display of the device.

Type: Application

Filed: February 25, 2025

Publication date: September 25, 2025

Applicant: Stability AI Ltd

Inventors: Rahim Entezari, Patrick Esser, Robin Rombach, Andreas Blattmann
CONTENT SYNTHESIS USING LATENT ADVERSARIAL DIFFUSION DISTILLATION

Publication number: 20250299399

Abstract: A method including receiving a first representation of an image in a first latent space of a first machine learning model. The method further includes generating, by a second machine learning model based at least in part on the first representation, a second representation of the image in a second latent space of the second machine learning model. The method further includes updating, without generating an output image corresponding to the image, a set of weights of the second machine learning model based at least in part on the first representation and the second representation.

Type: Application

Filed: January 31, 2025

Publication date: September 25, 2025

Applicant: Stability AI Ltd

Inventors: Axel Sauer, Frederic Boesel, Tim Dockhorn, Andreas Blattmann, Patrick Esser, Robin Rombach
Generating voxel representations using one or more neural networks

Patent number: 12322068

Abstract: Apparatuses, systems, and techniques are presented to generate digital content. In at least one embodiment, one or more neural networks are used to generate a three-dimensional voxel representation of a scene based, at least in part, upon a plurality of two-dimensional images of the scene.

Type: Grant

Filed: September 8, 2022

Date of Patent: June 3, 2025

Assignee: NVIDIA Corporation

Inventors: Seung Wook Kim, Karsten Kreis, Daiqing Li, Robin Rombach, Sanja Fidler, Antonio Torralba Barriuso, Bradley Brown
HIGH-RESOLUTION VIDEO GENERATION USING IMAGE DIFFUSION MODELS

Publication number: 20250142145

Abstract: In various examples, systems and methods are disclosed relating to aligning images into frames of a first video using at least one first temporal attention layer of a neural network model. The first video has a first spatial resolution. A second video having a second spatial resolution is generated by up-sampling the first video using at least one second temporal attention layer of an up-sampler neural network model, wherein the second spatial resolution is higher than the first spatial resolution.

Type: Application

Filed: January 6, 2025

Publication date: May 1, 2025

Applicant: NVIDIA Corporation

Inventors: Karsten Julian Kreis, Robin Rombach, Andreas Blattmann, Seung Wook Kim, Huan Ling, Sanja Fidler, Tim Dockhorn
Content synthesis using generative Artificial Intelligence model

Patent number: 12271978

Abstract: A method including receiving a prompt describing a desired characteristic of an image. The method further including generating, using a set of encoding models, a prompt encoding based on the prompt. The method further including generating, using a first transformer block of a diffusion transformer model, a first prompt embedding and a first image embedding based on the prompt encoding and a noise input. The method further including generating, using a second transformer block of the diffusion transformer model, a second image embedding based on the first image embedding and the first prompt embedding. The method further including generating the image based on the second image embedding.

Type: Grant

Filed: September 11, 2024

Date of Patent: April 8, 2025

Assignee: Stability AI Ltd

Inventors: Rahim Entezari, Patrick Esser, Robin Rombach, Andreas Blattmann
High-resolution video generation using image diffusion models

Patent number: 12192547

Abstract: In various examples, systems and methods are disclosed relating to aligning images into frames of a first video using at least one first temporal attention layer of a neural network model. The first video has a first spatial resolution. A second video having a second spatial resolution is generated by up-sampling the first video using at least one second temporal attention layer of an up-sampler neural network model, wherein the second spatial resolution is higher than the first spatial resolution.

Type: Grant

Filed: March 10, 2023

Date of Patent: January 7, 2025

Assignee: NVIDIA Corporation

Inventors: Karsten Julian Kreis, Robin Rombach, Andreas Blattmann, Seung Wook Kim, Huan Ling, Sanja Fidler, Tim Dockhorn
HIGH-RESOLUTION VIDEO GENERATION USING IMAGE DIFFUSION MODELS

Publication number: 20240171788

Abstract: In various examples, systems and methods are disclosed relating to aligning images into frames of a first video using at least one first temporal attention layer of a neural network model. The first video has a first spatial resolution. A second video having a second spatial resolution is generated by up-sampling the first video using at least one second temporal attention layer of an up-sampler neural network model, wherein the second spatial resolution is higher than the first spatial resolution.

Type: Application

Filed: March 10, 2023

Publication date: May 23, 2024

Applicant: NVIDIA Corporation

Inventors: Karsten Julian Kreis, Robin Rombach, Andreas Blattmann, Seung Wook Kim, Huan Ling, Sanja Fidler, Tim Dockhorn

CONTENT SYNTHESIS USING GENERATIVE ARTIFICIAL INTELLIGENCE MODEL

CONTENT SYNTHESIS USING LATENT ADVERSARIAL DIFFUSION DISTILLATION

Generating voxel representations using one or more neural networks

HIGH-RESOLUTION VIDEO GENERATION USING IMAGE DIFFUSION MODELS

Content synthesis using generative Artificial Intelligence model

High-resolution video generation using image diffusion models

HIGH-RESOLUTION VIDEO GENERATION USING IMAGE DIFFUSION MODELS