Patents by Inventor Antonio Torralba Barriuso

Antonio Torralba Barriuso has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20240096064
    Abstract: Apparatuses, systems, and techniques to annotate images using neural models. In at least one embodiment, neural networks generate mask information from labels of one or more objects within one or more images identified by one or more other neural networks.
    Type: Application
    Filed: June 3, 2022
    Publication date: March 21, 2024
    Inventors: Daiqing Li, Huan Ling, Seung Wook Kim, Karsten Julian Kreis, Sanja Fidler, Antonio Torralba Barriuso
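A rough sketch of the two-network idea summarized in the abstract above. Nothing here comes from the patent itself: the class names, shapes, and architecture are illustrative stand-ins for "one network identifies labels, another generates mask information conditioned on them."

```python
import torch
import torch.nn as nn

class Labeler(nn.Module):
    """Stand-in for the network that identifies objects in an image."""
    def __init__(self, num_classes=10):
        super().__init__()
        self.backbone = nn.Sequential(
            nn.Conv2d(3, 16, 3, padding=1), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1), nn.Flatten(),
            nn.Linear(16, num_classes),
        )
    def forward(self, image):
        return self.backbone(image)  # (B, num_classes) label logits

class MaskHead(nn.Module):
    """Stand-in for the network that turns labels into mask information."""
    def __init__(self, num_classes=10):
        super().__init__()
        self.embed = nn.Linear(num_classes, 8)
        self.decode = nn.Sequential(
            nn.Conv2d(3 + 8, 16, 3, padding=1), nn.ReLU(),
            nn.Conv2d(16, 1, 1),
        )
    def forward(self, image, label_logits):
        b, _, h, w = image.shape
        cond = self.embed(label_logits)[:, :, None, None].expand(b, 8, h, w)
        return torch.sigmoid(self.decode(torch.cat([image, cond], dim=1)))

image = torch.randn(2, 3, 64, 64)
labels = Labeler()(image)           # labels from one network ...
masks = MaskHead()(image, labels)   # ... drive masks from another
print(masks.shape)                  # torch.Size([2, 1, 64, 64])
```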
  • Publication number: 20230377324
    Abstract: In various examples, systems and methods are disclosed relating to multi-domain generative adversarial networks with learned warp fields. Input data can be generated according to a noise function and provided as input to a generative machine-learning model. The generative machine-learning model can determine a plurality of output images each corresponding to one of a respective plurality of image domains. The generative machine-learning model can include at least one layer to generate a plurality of morph maps each corresponding to one of the respective plurality of image domains. The output images can be presented using a display device.
    Type: Application
    Filed: May 18, 2023
    Publication date: November 23, 2023
    Applicant: NVIDIA Corporation
    Inventors: Seung Wook Kim, Karsten Julian Kreis, Daiqing Li, Sanja Fidler, Antonio Torralba Barriuso
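A minimal sketch of the mechanism described in the abstract for 20230377324 above: a generator maps noise to a shared base image plus one learned warp field ("morph map") per image domain, and each domain's output is the base image resampled through its warp. All names, sizes, and the 0.1 warp scale are assumptions for illustration, not details from the patent.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class WarpFieldGenerator(nn.Module):
    def __init__(self, z_dim=64, num_domains=3, size=32):
        super().__init__()
        self.size, self.num_domains = size, num_domains
        self.base = nn.Linear(z_dim, 3 * size * size)                  # shared base image
        self.warps = nn.Linear(z_dim, num_domains * 2 * size * size)   # per-domain morph maps

    def forward(self, z):
        b, s = z.shape[0], self.size
        base = torch.tanh(self.base(z)).view(b, 3, s, s)
        flows = self.warps(z).view(b, self.num_domains, s, s, 2) * 0.1
        # Identity sampling grid in [-1, 1], as grid_sample expects.
        ys, xs = torch.meshgrid(
            torch.linspace(-1, 1, s), torch.linspace(-1, 1, s), indexing="ij")
        grid = torch.stack([xs, ys], dim=-1).expand(b, s, s, 2)
        outputs = [F.grid_sample(base, grid + flows[:, d], align_corners=True)
                   for d in range(self.num_domains)]
        return torch.stack(outputs, dim=1)   # (B, domains, 3, H, W)

z = torch.randn(4, 64)               # input data generated from a noise function
images = WarpFieldGenerator()(z)     # one output image per image domain
print(images.shape)                  # torch.Size([4, 3, 3, 32, 32])
```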
  • Publication number: 20230229919
    Abstract: In various examples, a generative model is used to synthesize datasets for use in training a downstream machine learning model to perform an associated task. The synthesized datasets may be generated by sampling a scene graph from a scene grammar—such as a probabilistic grammar—and applying the scene graph to the generative model to compute updated scene graphs more representative of object attribute distributions of real-world datasets. The downstream machine learning model may be validated against a real-world validation dataset, and the performance of the model on the real-world validation dataset may be used as an additional factor in further training or fine-tuning the generative model for generating the synthesized datasets specific to the task of the downstream machine learning model.
    Type: Application
    Filed: March 20, 2023
    Publication date: July 20, 2023
    Inventors: Amlan Kar, Aayush Prakash, Ming-Yu Liu, David Jesus Acuna Marrero, Antonio Torralba Barriuso, Sanja Fidler
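To make the pipeline in the abstract above concrete, here is a toy version of its first two steps: sampling a scene graph from a probabilistic grammar, then letting a (stand-in) generative model update node attributes toward real-world distributions. The grammar rules, attribute, and fixed shift are hypothetical examples, not the patent's.

```python
import random

# Hypothetical probabilistic grammar: each symbol maps to possible
# child expansions, each with a probability.
GRAMMAR = {
    "scene": [(["road"], 1.0)],
    "road":  [(["car"], 0.5), (["car", "car"], 0.3), ([], 0.2)],
}

def sample_scene_graph(symbol="scene"):
    """Sample a scene graph (nested dict) by expanding grammar rules."""
    node = {"type": symbol, "attrs": {"x": random.uniform(-1, 1)}, "children": []}
    rules = GRAMMAR.get(symbol, [])
    if rules:
        expansions, weights = zip(*rules)
        chosen = random.choices(expansions, weights=weights)[0]
        node["children"] = [sample_scene_graph(c) for c in chosen]
    return node

def refine_attributes(graph, shift=0.2):
    """Stand-in for the generative model that computes updated scene graphs
    whose attributes better match real-world datasets."""
    graph["attrs"]["x"] += shift
    for child in graph["children"]:
        refine_attributes(child, shift)
    return graph

graph = refine_attributes(sample_scene_graph())
print(graph["type"], len(graph["children"]))
```

The updated graphs would then be rendered into a synthetic dataset, and downstream validation performance would feed back into further training of the refinement model.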
  • Publication number: 20230134690
    Abstract: Approaches are presented for training an inverse graphics network. An image synthesis network can generate training data for an inverse graphics network. In turn, the inverse graphics network can teach the synthesis network about the physical three-dimensional (3D) controls. Such an approach can provide for accurate 3D reconstruction of objects from 2D images using the trained inverse graphics network, while requiring little annotation of the provided training data. Such an approach can extract and disentangle 3D knowledge learned by generative models by utilizing differentiable renderers, enabling a disentangled generative model to function as a controllable 3D “neural renderer,” complementing traditional graphics renderers.
    Type: Application
    Filed: November 7, 2022
    Publication date: May 4, 2023
    Inventors: Wenzheng Chen, Yuxuan Zhang, Sanja Fidler, Huan Ling, Jun Gao, Antonio Torralba Barriuso
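The core training loop implied by the abstract above can be sketched as follows: a synthesis network renders images from known 3D controls, giving free (image, control) pairs on which the inverse graphics network is trained, with no manual annotation. Both networks, the 4-dimensional control code, and the MSE loss are illustrative assumptions only.

```python
import torch
import torch.nn as nn

# Stand-in synthesis network: renders a tiny "image" from a 4-dim 3D control code.
synth = nn.Sequential(nn.Linear(4, 128), nn.ReLU(), nn.Linear(128, 3 * 16 * 16))
# Stand-in inverse graphics network: recovers the controls from the image.
inverse = nn.Sequential(nn.Flatten(), nn.Linear(3 * 16 * 16, 128), nn.ReLU(),
                        nn.Linear(128, 4))
opt = torch.optim.Adam(inverse.parameters(), lr=1e-3)

for step in range(200):
    controls = torch.rand(32, 4)                   # known physical 3D controls
    images = synth(controls).view(32, 3, 16, 16)   # synthesized training data
    pred = inverse(images)                         # reconstructed controls
    loss = nn.functional.mse_loss(pred, controls)  # supervision comes for free
    opt.zero_grad(); loss.backward(); opt.step()
```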
  • Patent number: 11610115
    Abstract: In various examples, a generative model is used to synthesize datasets for use in training a downstream machine learning model to perform an associated task. The synthesized datasets may be generated by sampling a scene graph from a scene grammar—such as a probabilistic grammar—and applying the scene graph to the generative model to compute updated scene graphs more representative of object attribute distributions of real-world datasets. The downstream machine learning model may be validated against a real-world validation dataset, and the performance of the model on the real-world validation dataset may be used as an additional factor in further training or fine-tuning the generative model for generating the synthesized datasets specific to the task of the downstream machine learning model.
    Type: Grant
    Filed: November 15, 2019
    Date of Patent: March 21, 2023
    Assignee: NVIDIA Corporation
    Inventors: Amlan Kar, Aayush Prakash, Ming-Yu Liu, David Jesus Acuna Marrero, Antonio Torralba Barriuso, Sanja Fidler
  • Publication number: 20220383570
    Abstract: In various examples, high-precision semantic image editing for machine learning systems and applications are described. For example, a generative adversarial network (GAN) may be used to jointly model images and their semantic segmentations based on a same underlying latent code. Image editing may be achieved by using segmentation mask modifications (e.g., provided by a user, or otherwise) to optimize the latent code to be consistent with the updated segmentation, thus effectively changing the original, e.g., RGB image. To improve efficiency of the system, and to not require optimizations for each edit on each image, editing vectors may be learned in latent space that realize the edits, and that can be directly applied on other images with or without additional optimizations. As a result, a GAN in combination with the optimization approaches described herein may simultaneously allow for high precision editing in real-time with straightforward compositionality of multiple edits.
    Type: Application
    Filed: May 27, 2022
    Publication date: December 1, 2022
    Inventors: Huan Ling, Karsten Kreis, Daiqing Li, Seung Wook Kim, Antonio Torralba Barriuso, Sanja Fidler
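The editing loop described in the abstract above reduces to latent-code optimization. A minimal sketch, with an assumed toy joint generator in place of a real GAN: decode one latent into an image and a segmentation, modify the segmentation, then optimize the latent to match the edited mask while staying close to the original image. The resulting latent offset is the reusable "editing vector."

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class JointGenerator(nn.Module):
    """Hypothetical joint model: one latent code yields both RGB and segmentation."""
    def __init__(self, z_dim=32, size=16, classes=4):
        super().__init__()
        self.rgb = nn.Linear(z_dim, 3 * size * size)
        self.seg = nn.Linear(z_dim, classes * size * size)
        self.size, self.classes = size, classes
    def forward(self, z):
        s = self.size
        return (torch.tanh(self.rgb(z)).view(-1, 3, s, s),
                self.seg(z).view(-1, self.classes, s, s))

gen = JointGenerator()
z0 = torch.randn(1, 32)                 # latent code of the original image
with torch.no_grad():
    orig_img, seg_logits = gen(z0)
edited_mask = seg_logits.argmax(1)      # stand-in for a user's mask modification
edited_mask[:, :8, :] = 2               # e.g. repaint the top region as class 2

z = z0.clone().requires_grad_(True)
opt = torch.optim.Adam([z], lr=0.05)
for _ in range(100):
    img, seg = gen(z)
    loss = (F.cross_entropy(seg, edited_mask)    # be consistent with the edit
            + 0.1 * F.mse_loss(img, orig_img))   # stay close to the original image
    opt.zero_grad(); loss.backward(); opt.step()

edit_vector = (z - z0).detach()  # learned editing direction, reusable on other latents
```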
  • Patent number: 11494976
    Abstract: Approaches are presented for training an inverse graphics network. An image synthesis network can generate training data for an inverse graphics network. In turn, the inverse graphics network can teach the synthesis network about the physical three-dimensional (3D) controls. Such an approach can provide for accurate 3D reconstruction of objects from 2D images using the trained inverse graphics network, while requiring little annotation of the provided training data. Such an approach can extract and disentangle 3D knowledge learned by generative models by utilizing differentiable renderers, enabling a disentangled generative model to function as a controllable 3D “neural renderer,” complementing traditional graphics renderers.
    Type: Grant
    Filed: March 5, 2021
    Date of Patent: November 8, 2022
    Assignee: NVIDIA Corporation
    Inventors: Wenzheng Chen, Yuxuan Zhang, Sanja Fidler, Huan Ling, Jun Gao, Antonio Torralba Barriuso
  • Publication number: 20220269937
    Abstract: Apparatuses, systems, and techniques to use one or more neural networks to generate one or more images based, at least in part, on one or more spatially-independent features within the one or more images. In at least one embodiment, the one or more neural networks determine spatially-independent information and spatially-dependent information of the one or more images and process the spatially-independent information and the spatially-dependent information to generate the one or more spatially-independent features and one or more spatially-dependent features within the one or more images.
    Type: Application
    Filed: February 24, 2021
    Publication date: August 25, 2022
    Inventors: Seung Wook Kim, Jonah Philion, Sanja Fidler, Antonio Torralba Barriuso
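One plausible reading of the abstract above, sketched with invented names and shapes (none taken from the patent): an encoder splits an image into a spatially-independent global code (no height/width axes) and a spatially-dependent feature map, and a decoder recombines the two to generate an image.

```python
import torch
import torch.nn as nn

class DisentanglingAutoencoder(nn.Module):
    def __init__(self):
        super().__init__()
        self.conv = nn.Conv2d(3, 16, 3, padding=1)
        self.to_global = nn.Sequential(nn.AdaptiveAvgPool2d(1), nn.Flatten(),
                                       nn.Linear(16, 8))   # spatially-independent
        self.to_spatial = nn.Conv2d(16, 8, 1)              # spatially-dependent
        self.decode = nn.Conv2d(16, 3, 3, padding=1)

    def forward(self, x):
        h = torch.relu(self.conv(x))
        g = self.to_global(h)                    # (B, 8): global information
        s = self.to_spatial(h)                   # (B, 8, H, W): layout information
        g_map = g[:, :, None, None].expand(-1, -1, *s.shape[2:])
        return self.decode(torch.cat([s, g_map], dim=1)), g, s

img = torch.randn(2, 3, 32, 32)
recon, g, s = DisentanglingAutoencoder()(img)
print(recon.shape, g.shape, s.shape)
```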
  • Publication number: 20220083807
    Abstract: Apparatuses, systems, and techniques to determine pixel-level labels of a synthetic image. In at least one embodiment, the synthetic image is generated by one or more generative networks and the pixel-level labels are generated using a combination of data output by a plurality of layers of the generative networks.
    Type: Application
    Filed: September 14, 2020
    Publication date: March 17, 2022
    Inventors: Yuxuan Zhang, Huan Ling, Jun Gao, Wenzheng Chen, Antonio Torralba Barriuso, Sanja Fidler
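The key move in the abstract above is combining data output by several generator layers to label pixels. A hedged sketch with a toy generator (the architecture and the 5-class head are assumptions): upsample each intermediate feature map to image resolution, concatenate them per pixel, and run a small pixel-level classifier on the stack.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class TinyGenerator(nn.Module):
    """Toy generator that exposes its intermediate layer outputs."""
    def __init__(self, z_dim=32):
        super().__init__()
        self.l1 = nn.ConvTranspose2d(z_dim, 16, 4)                    # 1x1 -> 4x4
        self.l2 = nn.ConvTranspose2d(16, 8, 4, stride=2, padding=1)   # 4x4 -> 8x8
        self.l3 = nn.ConvTranspose2d(8, 3, 4, stride=2, padding=1)    # 8x8 -> 16x16
    def forward(self, z):
        f1 = torch.relu(self.l1(z[:, :, None, None]))
        f2 = torch.relu(self.l2(f1))
        return torch.tanh(self.l3(f2)), [f1, f2]

gen = TinyGenerator()
img, feats = gen(torch.randn(2, 32))
h, w = img.shape[-2:]
# Upsample every layer's features to image resolution and stack them per pixel.
stacked = torch.cat(
    [F.interpolate(f, size=(h, w), mode="bilinear", align_corners=False)
     for f in feats], dim=1)                            # (B, 16 + 8, 16, 16)
pixel_classifier = nn.Conv2d(stacked.shape[1], 5, 1)    # 5 label classes
labels = pixel_classifier(stacked).argmax(1)            # (B, 16, 16) pixel-level labels
print(labels.shape)
```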
  • Publication number: 20210390778
    Abstract: Apparatuses, systems, and techniques are presented to generate a simulated environment. In at least one embodiment, one or more neural networks are used to generate a simulated environment based, at least in part, on stored information associated with objects within the simulated environment.
    Type: Application
    Filed: June 10, 2020
    Publication date: December 16, 2021
    Inventors: Seung Wook Kim, Sanja Fidler, Jonah Philion, Antonio Torralba Barriuso
  • Publication number: 20210279952
    Abstract: Approaches are presented for training an inverse graphics network. An image synthesis network can generate training data for an inverse graphics network. In turn, the inverse graphics network can teach the synthesis network about the physical three-dimensional (3D) controls. Such an approach can provide for accurate 3D reconstruction of objects from 2D images using the trained inverse graphics network, while requiring little annotation of the provided training data. Such an approach can extract and disentangle 3D knowledge learned by generative models by utilizing differentiable renderers, enabling a disentangled generative model to function as a controllable 3D “neural renderer,” complementing traditional graphics renderers.
    Type: Application
    Filed: March 5, 2021
    Publication date: September 9, 2021
    Inventors: Wenzheng Chen, Yuxuan Zhang, Sanja Fidler, Huan Ling, Jun Gao, Antonio Torralba Barriuso
  • Publication number: 20200302250
    Abstract: A generative model can be used for generation of spatial layouts and graphs. Such a model can progressively grow these layouts and graphs based on local statistics, where nodes can represent spatial control points of the layout, and edges can represent segments or paths between nodes, such as may correspond to road segments. A generative model can utilize an encoder-decoder architecture where the encoder is a recurrent neural network (RNN) that encodes local incoming paths into a node and the decoder is another RNN that generates outgoing nodes and edges connecting an existing node to the newly generated nodes. Generation is done iteratively, and can finish once all nodes are visited or another end condition is satisfied. Such a model can generate layouts by additionally conditioning on a set of attributes, giving control to a user in generating the layout.
    Type: Application
    Filed: March 20, 2020
    Publication date: September 24, 2020
    Inventors: Hang Chu, Daiqing Li, David Jesus Acuna Marrero, Amlan Kar, Maria Shugrina, Ming-Yu Liu, Antonio Torralba Barriuso, Sanja Fidler
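The encoder-decoder recurrence in the abstract above can be sketched directly. Assumed details (GRUs, a stop logit, the 0.5 threshold) are illustrative only: an encoder RNN summarizes the incoming path into a node state, and a decoder RNN emits outgoing node positions one at a time until an end condition is satisfied.

```python
import torch
import torch.nn as nn

class PathEncoder(nn.Module):
    """Encode the local incoming path (a sequence of 2D points) into a node state."""
    def __init__(self, hidden=32):
        super().__init__()
        self.rnn = nn.GRU(2, hidden, batch_first=True)
    def forward(self, path):         # path: (B, T, 2)
        _, h = self.rnn(path)
        return h                     # (1, B, hidden)

class NodeDecoder(nn.Module):
    """Generate outgoing nodes (and implicitly edges back to this node)."""
    def __init__(self, hidden=32):
        super().__init__()
        self.cell = nn.GRUCell(2, hidden)
        self.out = nn.Linear(hidden, 3)          # (x, y, stop-logit)
    def forward(self, h0, max_nodes=8):
        h = h0[0]                                # (B, hidden)
        prev = torch.zeros(h.shape[0], 2)        # start token at the node
        nodes = []
        for _ in range(max_nodes):
            h = self.cell(prev, h)
            x, y, stop = self.out(h).unbind(-1)
            if torch.sigmoid(stop).mean() > 0.5:     # end condition satisfied
                break
            prev = torch.stack([x, y], dim=-1)
            nodes.append(prev)
        return nodes

incoming = torch.randn(1, 5, 2)       # a 5-point incoming road segment
state = PathEncoder()(incoming)
outgoing = NodeDecoder()(state)       # newly generated neighbor nodes
print(len(outgoing))
```

In the full scheme, generation repeats from each new node until all nodes are visited, optionally conditioned on user-supplied attributes.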
  • Publication number: 20200160178
    Abstract: In various examples, a generative model is used to synthesize datasets for use in training a downstream machine learning model to perform an associated task. The synthesized datasets may be generated by sampling a scene graph from a scene grammar—such as a probabilistic grammar—and applying the scene graph to the generative model to compute updated scene graphs more representative of object attribute distributions of real-world datasets. The downstream machine learning model may be validated against a real-world validation dataset, and the performance of the model on the real-world validation dataset may be used as an additional factor in further training or fine-tuning the generative model for generating the synthesized datasets specific to the task of the downstream machine learning model.
    Type: Application
    Filed: November 15, 2019
    Publication date: May 21, 2020
    Inventors: Amlan Kar, Aayush Prakash, Ming-Yu Liu, David Jesus Acuna Marrero, Antonio Torralba Barriuso, Sanja Fidler
  • Patent number: 9754177
    Abstract: One or more aspects of the subject disclosure are directed towards identifying objects within an image via image searching/matching. In one aspect, an image is processed into bounding boxes, with the bounding boxes further processed to each surround a possible object. A sub-image of pixels corresponding to the bounding box is featurized for matching with tagged database images. The information (tags) associated with any matched images is processed to identify/categorize the sub-image and thus the object corresponding thereto.
    Type: Grant
    Filed: June 21, 2013
    Date of Patent: September 5, 2017
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Ce Liu, Yair Weiss, Antonio Torralba Barriuso
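The pipeline in the abstract above, sketched end to end with deliberately simple stand-ins (a color-histogram featurizer and nearest-neighbor matching; the real system's features and matcher are not specified here): crop each bounding box, featurize the sub-image, match against a tagged database, and transfer the best match's tag.

```python
import numpy as np

def featurize(patch, bins=8):
    """Toy featurizer: a normalized color histogram of the sub-image."""
    hist, _ = np.histogramdd(patch.reshape(-1, 3), bins=(bins,) * 3,
                             range=((0, 256),) * 3)
    flat = hist.ravel()
    return flat / (flat.sum() + 1e-9)

def identify(image, boxes, db_features, db_tags):
    """Match each bounding-box sub-image to the closest tagged database
    image and transfer that image's tag to the object."""
    tags = []
    for x0, y0, x1, y1 in boxes:
        feat = featurize(image[y0:y1, x0:x1])
        dists = np.linalg.norm(db_features - feat, axis=1)
        tags.append(db_tags[int(dists.argmin())])
    return tags

rng = np.random.default_rng(0)
image = rng.integers(0, 256, (120, 160, 3), dtype=np.uint8)
boxes = [(10, 10, 50, 60), (70, 20, 150, 100)]      # each surrounds a possible object
db_features = np.stack([featurize(rng.integers(0, 256, (40, 40, 3)))
                        for _ in range(5)])
db_tags = ["car", "dog", "tree", "chair", "person"]
print(identify(image, boxes, db_features, db_tags))
```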
  • Publication number: 20140376819
    Abstract: One or more aspects of the subject disclosure are directed towards identifying objects within an image via image searching/matching. In one aspect, an image is processed into bounding boxes, with the bounding boxes further processed to each surround a possible object. A sub-image of pixels corresponding to the bounding box is featurized for matching with tagged database images. The information (tags) associated with any matched images is processed to identify/categorize the sub-image and thus the object corresponding thereto.
    Type: Application
    Filed: June 21, 2013
    Publication date: December 25, 2014
    Inventors: Ce Liu, Yair Weiss, Antonio Torralba Barriuso