Patents by Inventor Ming-Hsuan Yang

Ming-Hsuan Yang has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Real time perspective correction on faces

Patent number: 12373958

Abstract: Apparatus and methods related to image processing are provided. A computing device can determine a first image area of an image, such as an image captured by a camera. The computing device can determine a warping mesh for the image with a first portion of the warping mesh associated with the first image area. The computing device can determine a cost function for the warping mesh by: determining first costs associated with the first portion of the warping mesh that include costs associated with face-related transformations of the first image area to correct geometric distortions. The computing device can determine an optimized mesh based on optimizing the cost function. The computing device can modify the first image area based on the optimized mesh.

Type: Grant

Filed: August 30, 2021

Date of Patent: July 29, 2025

Assignee: Google LLC

Inventors: Yichang Shih, Chia-Kai Liang, Wei-Sheng Lai, Ming-Hsuan Yang, Siargey Pisarchyk, Ryhor Karpiak
DIFFUSION-GUIDED THREE-DIMENSIONAL RECONSTRUCTION

Publication number: 20240412458

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for editing images based on decoder-based accumulative score sampling (DASS) losses.

Type: Application

Filed: June 12, 2024

Publication date: December 12, 2024

Inventors: Varun Jampani, Chun-Han Yao, Amit Raj, Wei-Chih Hung, Ming-Hsuan Yang, Michael Rubinstein, Yuanzhen Li
Compression-Informed Video Super-Resolution

Publication number: 20240022760

Abstract: Example aspects of the present disclosure are directed to systems and methods which feature a machine-learned video super-resolution (VSR) model which has been trained using a bi-directional training approach. In particular, the present disclosure provides a compression-informed (e.g., compression-aware) super-resolution model that can perform well on real-world videos with different levels of compression. Specifically, example models described herein can include three modules to robustly restore the missing information caused by video compression. First, a bi-directional recurrent module can be used to reduce the accumulated warping error from the random locations of the intra-frame from compressed video frames. Second, a detail-aware flow estimation module can be added to enable recovery of high resolution (HR) flow from compressed low resolution (LR) frames. Finally, a Laplacian enhancement module can add high-frequency information to the warped HR frames washed out by video encoding.

Type: Application

Filed: August 5, 2021

Publication date: January 18, 2024

Inventors: Yinxiao Li, Peyman Milanfar, Feng Yang, Ce Liu, Ming-Hsuan Yang, Pengchong Jin
Learnable cost volume for determining pixel correspondence

Patent number: 11790550

Abstract: A method includes obtaining a first plurality of feature vectors associated with a first image and a second plurality of feature vectors associated with a second image. The method also includes generating a plurality of transformed feature vectors by transforming each respective feature vector of the first plurality of feature vectors by a kernel matrix trained to define an elliptical inner product space. The method additionally includes generating a cost volume by determining, for each respective transformed feature vector of the plurality of transformed feature vectors, a plurality of inner products, wherein each respective inner product of the plurality of inner products is between the respective transformed feature vector and a corresponding candidate feature vector of a corresponding subset of the second plurality of feature vectors. The method further includes determining, based on the cost volume, a pixel correspondence between the first image and the second image.

Type: Grant

Filed: July 8, 2020

Date of Patent: October 17, 2023

Assignee: Google LLC

Inventors: Taihong Xiao, Deqing Sun, Ming-Hsuan Yang, Qifei Wang, Jinwei Yuan
Bilateral convolution layer network for processing point clouds

Patent number: 11636668

Abstract: A method includes filtering a point cloud transformation of a 3D object to generate a 3D lattice and processing the 3D lattice through a series of bilateral convolution networks (BCL), each BCL in the series having a lower lattice feature scale than a preceding BCL in the series. The output of each BCL in the series is concatenated to generate an intermediate 3D lattice. Further filtering of the intermediate 3D lattice generates a first prediction of features of the 3D object.

Type: Grant

Filed: May 22, 2018

Date of Patent: April 25, 2023

Inventors: Varun Jampani, Hang Su, Deqing Sun, Ming-Hsuan Yang, Jan Kautz
System and method for providing unsupervised domain adaptation for spatio-temporal action localization

Patent number: 11580743

Abstract: A system and method for providing unsupervised domain adaption for spatio-temporal action localization that includes receiving video data associated with a source domain and a target domain that are associated with a surrounding environment of a vehicle. The system and method also include analyzing the video data associated with the source domain and the target domain and determining a key frame of the source domain and a key frame of the target domain. The system and method additionally include completing an action localization model to model a temporal context of actions occurring within the key frame of the source domain and the key frame of the target domain and completing an action adaption model to localize individuals and their actions and to classify the actions based on the video data. The system and method further include combining losses to complete spatio-temporal action localization of individuals and actions.

Type: Grant

Filed: March 25, 2022

Date of Patent: February 14, 2023

Assignee: HONDA MOTOR CO., LTD.

Inventors: Yi-Ting Chen, Behzad Dariush, Nakul Agarwal, Ming-Hsuan Yang
RESOURCE CONSTRAINED NEURAL NETWORK ARCHITECTURE SEARCH

Publication number: 20220414425

Abstract: Methods, and systems, including computer programs encoded on computer storage media for neural network architecture search.

Type: Application

Filed: August 19, 2022

Publication date: December 29, 2022

Applicant: Google LLC

Inventors: Ming-Hsuan Yang, Xiaojie Jin, Joshua Foster Slocum, Shengyang Dai, Jiang Wang
Using residual video data resulting from a compression of original video data to improve a decompression of the original video data

Patent number: 11496773

Abstract: A method, computer readable medium, and system are disclosed for identifying residual video data. This data describes data that is lost during a compression of original video data. For example, the original video data may be compressed and then decompressed, and this result may be compared to the original video data to determine the residual video data. This residual video data is transformed into a smaller format by means of encoding, binarizing, and compressing, and is sent to a destination. At the destination, the residual video data is transformed back into its original format and is used during the decompression of the compressed original video data to improve a quality of the decompressed original video data.

Type: Grant

Filed: June 18, 2021

Date of Patent: November 8, 2022

Assignee: NVIDIA CORPORATION

Inventors: Yi-Hsuan Tsai, Ming-Yu Liu, Deqing Sun, Ming-Hsuan Yang, Jan Kautz
Resource constrained neural network architecture search

Patent number: 11443162

Abstract: Methods, and systems, including computer programs encoded on computer storage media for neural network architecture search.

Type: Grant

Filed: August 23, 2019

Date of Patent: September 13, 2022

Assignee: Google LLC

Inventors: Ming-Hsuan Yang, Xiaojie Jin, Joshua Foster Slocum, Shengyang Dai, Jiang Wang
Abnormal packet detection apparatus and method

Patent number: 11425094

Abstract: An abnormal packet detection apparatus and method are provided. The abnormal packet detection apparatus stores a whitelist corresponding to a protocol port, wherein the whitelist includes at least one legal packet record. Each legal packet record includes a legal packet length, a legal source address, and a legal variation position set, and corresponds to a reference packet. The abnormal packet detection apparatus determines that a current packet length and a current source address of a to-be-analyzed packet are respectively the same as the legal packet length and the legal source address of a reference packet record among the at least one legal packet record, determines a current variation position of the to-be-analyzed packet by comparing the to-be-analyzed packet with the reference packet corresponding to the reference packet record, and generates a detection result by comparing the current variation position with the legal variation position set of the reference packet record.

Type: Grant

Filed: November 23, 2020

Date of Patent: August 23, 2022

Assignee: INSTITUTE FOR INFORMATION INDUSTRY

Inventors: Yu-Ting Tsou, Ding-Jie Huang, Chih-Ta Lin, Ming-Hsuan Yang, Mei-Lin Li, Saranchon Lammongkol, Chin-Fang Mao
System and method for providing unsupervised domain adaptation for spatio-temporal action localization

Patent number: 11403850

Abstract: A system and method for providing unsupervised domain adaption for spatio-temporal action localization that includes receiving video data associated with a surrounding environment of a vehicle. The system and method also include completing an action localization model to model a temporal context of actions occurring within the surrounding environment of the vehicle based on the video data and completing an action adaption model to localize individuals and their actions and to classify the actions based on the video data. The system and method further include combining losses from the action localization model and the action adaption model to complete spatio-temporal action localization of individuals and actions that occur within the surrounding environment of the vehicle.

Type: Grant

Filed: February 28, 2020

Date of Patent: August 2, 2022

Assignee: Honda Motor Co., Ltd.

Inventors: Yi-Ting Chen, Behzad Dariush, Nakul Agarwal, Ming-Hsuan Yang
SYSTEM AND METHOD FOR PROVIDING UNSUPERVISED DOMAIN ADAPTATION FOR SPATIO-TEMPORAL ACTION LOCALIZATION

Publication number: 20220215661

Abstract: A system and method for providing unsupervised domain adaption for spatio-temporal action localization that includes receiving video data associated with a source domain and a target domain that are associated with a surrounding environment of a vehicle. The system and method also include analyzing the video data associated with the source domain and the target domain and determining a key frame of the source domain and a key frame of the target domain. The system and method additionally include completing an action localization model to model a temporal context of actions occurring within the key frame of the source domain and the key frame of the target domain and completing an action adaption model to localize individuals and their actions and to classify the actions based on the video data. The system and method further include combining losses to complete spatio-temporal action localization of individuals and actions.

Type: Application

Filed: March 25, 2022

Publication date: July 7, 2022

Inventors: Yi-Ting CHEN, Behzad DARIUSH, Nakul AGARWAL, Ming-Hsuan YANG
Learnable Cost Volume for Determining Pixel Correspondence

Publication number: 20220189051

Abstract: A method includes obtaining a first plurality of feature vectors associated with a first image and a second plurality of feature vectors associated with a second image. The method also includes generating a plurality of transformed feature vectors by transforming each respective feature vector of the first plurality of feature vectors by a kernel matrix trained to define an elliptical inner product space. The method additionally includes generating a cost volume by determining, for each respective transformed feature vector of the plurality of transformed feature vectors, a plurality of inner products, wherein each respective inner product of the plurality of inner products is between the respective transformed feature vector and a corresponding candidate feature vector of a corresponding subset of the second plurality of feature vectors. The method further includes determining, based on the cost volume, a pixel correspondence between the first image and the second image.

Type: Application

Filed: July 8, 2020

Publication date: June 16, 2022

Inventors: Taihong Xiao, Deqing Sun, Ming-Hsuan Yang, Qifei Wang, Jinwei Yuan
ABNORMAL PACKET DETECTION APPARATUS AND METHOD

Publication number: 20220131833

Abstract: An abnormal packet detection apparatus and method are provided. The abnormal packet detection apparatus stores a whitelist corresponding to a protocol port, wherein the whitelist includes at least one legal packet record. Each legal packet record includes a legal packet length, a legal source address, and a legal variation position set, and corresponds to a reference packet. The abnormal packet detection apparatus determines that a current packet length and a current source address of a to-be-analyzed packet are respectively the same as the legal packet length and the legal source address of a reference packet record among the at least one legal packet record, determines a current variation position of the to-be-analyzed packet by comparing the to-be-analyzed packet with the reference packet corresponding to the reference packet record, and generates a detection result by comparing the current variation position with the legal variation position set of the reference packet record.

Type: Application

Filed: November 23, 2020

Publication date: April 28, 2022

Inventors: Yu-Ting TSOU, Ding-Jie HUANG, Chih-Ta LIN, Ming-Hsuan YANG, Mei-Lin LI, Saranchon LAMMONGKOL, Chin-Fang MAO
Real Time Perspective Correction on Faces

Publication number: 20220058808

Abstract: Apparatus and methods related to image processing are provided. A computing device can determine a first image area of an image, such as an image captured by a camera. The computing device can determine a warping mesh for the image with a first portion of the warping mesh associated with the first image area. The computing device can determine a cost function for the warping mesh by: determining first costs associated with the first portion of the warping mesh that include costs associated with face-related transformations of the first image area to correct geometric distortions. The computing device can determine an optimized mesh based on optimizing the cost function. The computing device can modify the first image area based on the optimized mesh.

Type: Application

Filed: August 30, 2021

Publication date: February 24, 2022

Inventors: Yichang Shih, Chia-Kai Liang, Wei-Sheng Lai, Ming-Hsuan Yang, Siargey Pisarchyk, Ryhor Karpiak
Training a neural network to predict superpixels using segmentation-aware affinity loss

Patent number: 11256961

Abstract: Segmentation is the identification of separate objects within an image. An example is identification of a pedestrian passing in front of a car, where the pedestrian is a first object and the car is a second object. Superpixel segmentation is the identification of regions of pixels within an object that have similar properties. An example is identification of pixel regions having a similar color, such as different articles of clothing worn by the pedestrian and different components of the car. A pixel affinity neural network (PAN) model is trained to generate pixel affinity maps for superpixel segmentation. The pixel affinity map defines the similarity of two points in space. In an embodiment, the pixel affinity map indicates a horizontal affinity and vertical affinity for each pixel in the image. The pixel affinity map is processed to identify the superpixels.

Type: Grant

Filed: July 6, 2020

Date of Patent: February 22, 2022

Assignee: NVIDIA Corporation

Inventors: Wei-Chih Tu, Ming-Yu Liu, Varun Jampani, Deqing Sun, Ming-Hsuan Yang, Jan Kautz
USING RESIDUAL VIDEO DATA RESULTING FROM A COMPRESSION OF ORIGINAL VIDEO DATA TO IMPROVE A DECOMPRESSION OF THE ORIGINAL VIDEO DATA

Publication number: 20210314629

Abstract: A method, computer readable medium, and system are disclosed for identifying residual video data. This data describes data that is lost during a compression of original video data. For example, the original video data may be compressed and then decompressed, and this result may be compared to the original video data to determine the residual video data. This residual video data is transformed into a smaller format by means of encoding, binarizing, and compressing, and is sent to a destination. At the destination, the residual video data is transformed back into its original format and is used during the decompression of the compressed original video data to improve a quality of the decompressed original video data.

Type: Application

Filed: June 18, 2021

Publication date: October 7, 2021

Inventors: Yi-Hsuan Tsai, Ming-Yu Liu, Deqing Sun, Ming-Hsuan Yang, Jan Kautz
Real time perspective correction on faces

Patent number: 11132800

Abstract: Apparatus and methods related to image processing are provided. A computing device can determine a first image area of an image, such as an image captured by a camera. The computing device can determine a warping mesh for the image with a first portion of the warping mesh associated with the first image area. The computing device can determine a cost function for the warping mesh by: determining first costs associated with the first portion of the warping mesh that include costs associated with face-related transformations of the first image area to correct geometric distortions. The computing device can determine an optimized mesh based on optimizing the cost function. The computing device can modify the first image area based on the optimized mesh.

Type: Grant

Filed: October 2, 2019

Date of Patent: September 28, 2021

Assignee: Google LLC

Inventors: Yichang Shih, Chia-Kai Liang, Wei-Sheng Lai, Ming-Hsuan Yang, Siargey Pisarchyk, Ryhor Karpiak
Using residual video data resulting from a compression of original video data to improve a decompression of the original video data

Patent number: 11082720

Abstract: A method, computer readable medium, and system are disclosed for identifying residual video data. This data describes data that is lost during a compression of original video data. For example, the original video data may be compressed and then decompressed, and this result may be compared to the original video data to determine the residual video data. This residual video data is transformed into a smaller format by means of encoding, binarizing, and compressing, and is sent to a destination. At the destination, the residual video data is transformed back into its original format and is used during the decompression of the compressed original video data to improve a quality of the decompressed original video data.

Type: Grant

Filed: November 14, 2018

Date of Patent: August 3, 2021

Assignee: NVIDIA CORPORATION

Inventors: Yi-Hsuan Tsai, Ming-Yu Liu, Deqing Sun, Ming-Hsuan Yang, Jan Kautz
Computer Vision Systems and Methods for Diverse Image-to-Image Translation Via Disentangled Representations

Publication number: 20210224947

Abstract: Computer vision systems and methods for image to image translation are provided. The system receives a first input image and a second input image and applies a content adversarial loss function to the first input image and the second input image to determine a disentanglement representation of the first input image and a disentanglement representation of the second input image. The system trains a network to generate at least one output image by applying a cross cycle consistency loss function to the first disentanglement representation and the second disentanglement representation to perform multimodal mapping between the first input image and the second input image.

Type: Application

Filed: January 19, 2021

Publication date: July 22, 2021

Applicant: Insurance Services Office, Inc.

Inventors: Hsin-Ying Lee, Hung-Yu Tseng, Jia-Bin Huang, Maneesh Kumar Singh, Ming-Hsuan Yang

1 2 3 4 next