Patents by Inventor Ming-Yu Liu

Ming-Yu Liu has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

LEARNING TO GENERATE SYNTHETIC DATASETS FOR TRAINING NEURAL NETWORKS

Publication number: 20230229919

Abstract: In various examples, a generative model is used to synthesize datasets for use in training a downstream machine learning model to perform an associated task. The synthesized datasets may be generated by sampling a scene graph from a scene grammar—such as a probabilistic grammar— and applying the scene graph to the generative model to compute updated scene graphs more representative of object attribute distributions of real-world datasets. The downstream machine learning model may be validated against a real-world validation dataset, and the performance of the model on the real-world validation dataset may be used as an additional factor in further training or fine-tuning the generative model for generating the synthesized datasets specific to the task of the downstream machine learning model.

Type: Application

Filed: March 20, 2023

Publication date: July 20, 2023

Inventors: Amlan Kar, Aayush Prakash, Ming-Yu Liu, David Jesus Acuna Marrero, Antonio Torralba Barriuso, Sanja Fidler
IMAGE GENERATION USING ONE OR MORE NEURAL NETWORKS

Publication number: 20230153949

Abstract: Apparatuses, systems, and techniques are presented to generate one or more images. In at least one embodiment, one or more neural networks are used to generate one or more images based, at least in part, on one or noise values.

Type: Application

Filed: November 12, 2021

Publication date: May 18, 2023

Inventors: Xun Huang, Zinan Lin, Ming-Yu Liu
GENERATING IMAGES OF OBJECT MOTION USING ONE OR MORE NEURAL NETWORKS

Publication number: 20230147641

Abstract: Apparatuses, systems, and techniques are presented to reconstruct one or more images. In at least one embodiment, one or more neural networks are used to generate one or more images of one or more objects based, at least in part, on input indicating motion of the one or more objects.

Type: Application

Filed: November 9, 2021

Publication date: May 11, 2023

Inventors: Ting-Chun Wang, Tim Brooks, Ming-Yu Liu, Tero Karras, Jaakko Lehtinen
Iterative spatio-temporal action detection in video

Patent number: 11631239

Abstract: Iterative prediction systems and methods for the task of action detection process an inputted sequence of video frames to generate an output of both action tubes and respective action labels, wherein the action tubes comprise a sequence of bounding boxes on each video frame. An iterative predictor processes large offsets between the bounding boxes and the ground-truth.

Type: Grant

Filed: April 22, 2021

Date of Patent: April 18, 2023

Assignee: NVIDIA CORPORATION

Inventors: Xiaodong Yang, Ming-Yu Liu, Jan Kautz, Fanyi Xiao, Xitong Yang
GENERATIVE ADVERSARIAL NEURAL NETWORK ASSISTED VIDEO RECONSTRUCTION

Publication number: 20230110206

Abstract: A latent code defined in an input space is processed by the mapping neural network to produce an intermediate latent code defined in an intermediate latent space. The intermediate latent code may be used as appearance vector that is processed by the synthesis neural network to generate an image. The appearance vector is a compressed encoding of data, such as video frames including a person's face, audio, and other data. Captured images may be converted into appearance vectors at a local device and transmitted to a remote device using much less bandwidth compared with transmitting the captured images. A synthesis neural network at the remote device reconstructs the images for display.

Type: Application

Filed: December 12, 2022

Publication date: April 13, 2023

Inventors: Tero Tapani Karras, Samuli Matias Laine, David Patrick Luebke, Jaakko T. Lehtinen, Miika Samuli Aittala, Timo Oskari Aila, Ming-Yu Liu, Arun Mohanray Mallya, Ting-Chun Wang
Generative adversarial neural network assisted compression and broadcast

Patent number: 11625613

Abstract: A latent code defined in an input space is processed by the mapping neural network to produce an intermediate latent code defined in an intermediate latent space. The intermediate latent code may be used as appearance vector that is processed by the synthesis neural network to generate an image. The appearance vector is a compressed encoding of data, such as video frames including a person's face, audio, and other data. Captured images may be converted into appearance vectors at a local device and transmitted to a remote device using much less bandwidth compared with transmitting the captured images. A synthesis neural network at the remote device reconstructs the images for display.

Type: Grant

Filed: January 7, 2021

Date of Patent: April 11, 2023

Assignee: NVIDIA Corporation

Inventors: Tero Tapani Karras, Samuli Matias Laine, David Patrick Luebke, Jaakko T. Lehtinen, Miika Samuli Aittala, Timo Oskari Aila, Ming-Yu Liu, Arun Mohanray Mallya, Ting-Chun Wang
Generative adversarial neural network assisted reconstruction

Patent number: 11610122

Abstract: A latent code defined in an input space is processed by the mapping neural network to produce an intermediate latent code defined in an intermediate latent space. The intermediate latent code may be used as appearance vector that is processed by the synthesis neural network to generate an image. The appearance vector is a compressed encoding of data, such as video frames including a person's face, audio, and other data. Captured images may be converted into appearance vectors at a local device and transmitted to a remote device using much less bandwidth compared with transmitting the captured images. A synthesis neural network at the remote device reconstructs the images for display.

Type: Grant

Filed: January 7, 2021

Date of Patent: March 21, 2023

Assignee: NVIDIA Corporation

Inventors: Tero Tapani Karras, Samuli Matias Laine, David Patrick Luebke, Jaakko T. Lehtinen, Miika Samuli Aittala, Timo Oskari Aila, Ming-Yu Liu, Arun Mohanray Mallya, Ting-Chun Wang
Generative adversarial neural network assisted video compression and broadcast

Patent number: 11610435

Abstract: A latent code defined in an input space is processed by the mapping neural network to produce an intermediate latent code defined in an intermediate latent space. The intermediate latent code may be used as appearance vector that is processed by the synthesis neural network to generate an image. The appearance vector is a compressed encoding of data, such as video frames including a person's face, audio, and other data. Captured images may be converted into appearance vectors at a local device and transmitted to a remote device using much less bandwidth compared with transmitting the captured images. A synthesis neural network at the remote device reconstructs the images for display.

Type: Grant

Filed: October 13, 2020

Date of Patent: March 21, 2023

Assignee: NVIDIA Corporation

Inventors: Tero Tapani Karras, Samuli Matias Laine, David Patrick Luebke, Jaakko T. Lehtinen, Miika Samuli Aittala, Timo Oskari Aila, Ming-Yu Liu, Arun Mohanray Mallya, Ting-Chun Wang
Learning to generate synthetic datasets for training neural networks

Patent number: 11610115

Abstract: In various examples, a generative model is used to synthesize datasets for use in training a downstream machine learning model to perform an associated task. The synthesized datasets may be generated by sampling a scene graph from a scene grammar—such as a probabilistic grammar—and applying the scene graph to the generative model to compute updated scene graphs more representative of object attribute distributions of real-world datasets. The downstream machine learning model may be validated against a real-world validation dataset, and the performance of the model on the real-world validation dataset may be used as an additional factor in further training or fine-tuning the generative model for generating the synthesized datasets specific to the task of the downstream machine learning model.

Type: Grant

Filed: November 15, 2019

Date of Patent: March 21, 2023

Assignee: NVIDIA Corporation

Inventors: Amlan Kar, Aayush Prakash, Ming-Yu Liu, David Jesus Acuna Marrero, Antonio Torralba Barriuso, Sanja Fidler
Generative adversarial neural network assisted video reconstruction

Patent number: 11580395

Abstract: A latent code defined in an input space is processed by the mapping neural network to produce an intermediate latent code defined in an intermediate latent space. The intermediate latent code may be used as appearance vector that is processed by the synthesis neural network to generate an image. The appearance vector is a compressed encoding of data, such as video frames including a person's face, audio, and other data. Captured images may be converted into appearance vectors at a local device and transmitted to a remote device using much less bandwidth compared with transmitting the captured images. A synthesis neural network at the remote device reconstructs the images for display.

Type: Grant

Filed: October 13, 2020

Date of Patent: February 14, 2023

Assignee: NVIDIA Corporation

Inventors: Tero Tapani Karras, Samuli Matias Laine, David Patrick Luebke, Jaakko T. Lehtinen, Miika Samuli Aittala, Timo Oskari Aila, Ming-Yu Liu, Arun Mohanray Mallya, Ting-Chun Wang
CONDITIONAL IMAGE GENERATION USING ONE OR MORE NEURAL NETWORKS

Publication number: 20230045076

Abstract: Apparatuses, systems, and techniques are presented to generate one or more images. In at least one embodiment, one or more neural networks are used to generate one or more images based, at least in part, upon one or more input types.

Type: Application

Filed: July 29, 2021

Publication date: February 9, 2023

Inventors: Xun Huang, Arun Mallya, Ting Wang, Ming-Yu Liu
SYNTHESIZING VIDEO FROM AUDIO USING ONE OR MORE NEURAL NETWORKS

Publication number: 20230035306

Abstract: Apparatuses, systems, and techniques are presented to generate media content.

Type: Application

Filed: July 21, 2021

Publication date: February 2, 2023

Inventors: Ming-Yu Liu, Koki Nagano, Yeongho Seol, Jose Rafael Valle Gomes da Costa, Jaewoo Seo, Ting-Chun Wang, Arun Mallya, Sameh Khamis, Wei Ping, Rohan Badlani, Kevin Jonathan Shih, Bryan Catanzaro, Simon Yuen, Jan Kautz
SYNTHESIZING HIGH RESOLUTION 3D SHAPES FROM LOWER RESOLUTION REPRESENTATIONS FOR SYNTHETIC DATA GENERATION SYSTEMS AND APPLICATIONS

Publication number: 20220392162

Abstract: In various examples, a deep three-dimensional (3D) conditional generative model is implemented that can synthesize high resolution 3D shapes using simple guides—such as coarse voxels, point clouds, etc.—by marrying implicit and explicit 3D representations into a hybrid 3D representation. The present approach may directly optimize for the reconstructed surface, allowing for the synthesis of finer geometric details with fewer artifacts. The systems and methods described herein may use a deformable tetrahedral grid that encodes a discretized signed distance function (SDF) and a differentiable marching tetrahedral layer that converts the implicit SDF representation to an explicit surface mesh representation. This combination allows joint optimization of the surface geometry and topology as well as generation of the hierarchy of subdivisions using reconstruction and adversarial losses defined explicitly on the surface mesh.

Type: Application

Filed: April 11, 2022

Publication date: December 8, 2022

Inventors: Tianchang Shen, Jun Gao, Kangxue Yin, Ming-Yu Liu, Sanja Fidler
SYNTHESIZING VIDEO FROM AUDIO USING ONE OR MORE NEURAL NETWORKS

Publication number: 20220374637

Abstract: Apparatuses, systems, and techniques are presented to reduce an amount of data to be transmitted for media content. In at least one embodiment, one or more neural networks are used to generate video and audio information corresponding to one or more people based, at least in part, on at least one image and voice information corresponding to the one or more people.

Type: Application

Filed: May 20, 2021

Publication date: November 24, 2022

Inventors: Ming-Yu LIU, Ting-Chun WANG, Arun MALLYA
Using residual video data resulting from a compression of original video data to improve a decompression of the original video data

Patent number: 11496773

Abstract: A method, computer readable medium, and system are disclosed for identifying residual video data. This data describes data that is lost during a compression of original video data. For example, the original video data may be compressed and then decompressed, and this result may be compared to the original video data to determine the residual video data. This residual video data is transformed into a smaller format by means of encoding, binarizing, and compressing, and is sent to a destination. At the destination, the residual video data is transformed back into its original format and is used during the decompression of the compressed original video data to improve a quality of the decompressed original video data.

Type: Grant

Filed: June 18, 2021

Date of Patent: November 8, 2022

Assignee: NVIDIA CORPORATION

Inventors: Yi-Hsuan Tsai, Ming-Yu Liu, Deqing Sun, Ming-Hsuan Yang, Jan Kautz
CONTEXT-AWARE SYNTHESIS AND PLACEMENT OF OBJECT INSTANCES

Publication number: 20220335672

Abstract: One embodiment of a method includes applying a first generator model to a semantic representation of an image to generate an affine transformation, where the affine transformation represents a bounding box associated with at least one region within the image. The method further includes applying a second generator model to the affine transformation and the semantic representation to generate a shape of an object. The method further includes inserting the object into the image based on the bounding box and the shape.

Type: Application

Filed: January 26, 2022

Publication date: October 20, 2022

Inventors: Donghoon LEE, Sifei LIU, Jinwei GU, Ming-Yu LIU, Jan KAUTZ
MACHINE LEARNING TRAINING IN LOGARITHMIC NUMBER SYSTEM

Publication number: 20220261650

Abstract: An end-to-end low-precision training system based on a multi-base logarithmic number system and a multiplicative weight update algorithm. The multi-base logarithmic number system is applied to update weights of the neural network, with different bases of the multi-base logarithmic number system utilized between calculation of weight updates, calculation of feed-forward signals, and calculation of feedback signals. The LNS expresses a high dynamic range and computational energy efficiency, making it advantageous for on-board training in energy-constrained edge devices.

Type: Application

Filed: June 11, 2021

Publication date: August 18, 2022

Applicant: NVIDIA Corp.

Inventors: Jiawei Zhao, Steve Haihang Dai, Rangharajan Venkatesan, Ming-Yu Liu, William James Dally, Anima Anandkumar
IMAGE SEGMENTATION USING A NEURAL NETWORK TRANSLATION MODEL

Publication number: 20220254029

Abstract: The neural network includes an encoder, a common decoder, and a residual decoder. The encoder encodes input images into a latent space. The latent space disentangles unique features from other common features. The common decoder decodes common features resident in the latent space to generate translated images which lack the unique features. The residual decoder decodes unique features resident in the latent space to generate image deltas corresponding to the unique features. The neural network combines the translated images with the image deltas to generate combined images that may include both common features and unique features. The combined images can be used to drive autoencoding. Once training is complete, the residual decoder can be modified to generate segmentation masks that indicate any regions of a given input image where a unique feature resides.

Type: Application

Filed: October 13, 2021

Publication date: August 11, 2022

Inventors: Eugene Vorontsov, Wonmin Byeon, Shalini De Mello, Varun Jampani, Ming-Yu Liu, Pavlo Molchanov
IMAGE SYNTHESIS USING ONE OR MORE NEURAL NETWORKS

Publication number: 20220237838

Abstract: Apparatuses, systems, and techniques are presented to synthesize representations. In at least one embodiment, one or more neural networks are used to generate one or more representations of one or more objects based, at least in part, upon one or more structural features and one or more appearance features for the one or more objects.

Type: Application

Filed: January 27, 2021

Publication date: July 28, 2022

Inventors: Ming-Yu Liu, Xun Huang
GENERATION OF MOVING THREE DIMENSIONAL MODELS USING MOTION TRANSFER

Publication number: 20220207770

Abstract: Apparatuses, systems, and techniques to produce an image of a first subject positioned in a pose demonstrated by an image of a second subject. In at least one embodiment, an image of a first subject can be generated from a variety of points of view.

Type: Application

Filed: February 2, 2021

Publication date: June 30, 2022

Inventors: Ming-Yu Liu, Ting-Chun Wang, Xihui Liu

prev 1 2 3 4 5 6 … next