Patents by Inventor Nam Ling

Nam Ling has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Generative video compression with a transformer-based discriminator

Patent number: 12530588

Abstract: A method, an apparatus, and a non-transitory computer-readable storage medium for video compression using a generative adversarial network (GAN) are provided. The method includes obtaining, by a generator of the GAN, a reconstructed target frame based on a reference frame and a raw target frame to be reconstructed; concatenating, by a transformer-based discriminator of the GAN, the reference frame, the raw target frame and the reconstructed target frame to obtain paired data; determining, by the transformer-based discriminator of the GAN, whether the paired data is real or fake to guide reconstruction of the raw target frame; and determining a generator loss and a transformer-based discriminator loss, and performing gradient back propagation and updating network parameters of the GAN based on the generator loss and the transformer-based discriminator loss.

Type: Grant

Filed: October 21, 2022

Date of Patent: January 20, 2026

Assignees: Beijing Dajia Internet Information Technology Co., Ltd., Santa Clara University

Inventors: Pengli Du, Ying Liu, Nam Ling, Yongxiong Ren, Lingzhi Liu
Class-specific neural network for video compressed sensing

Patent number: 12394103

Abstract: A class-specific neural network for video compressed sensing and methods for training and testing the class-specific neural network are provided. The class-specific neural network includes a Gaussian-mixture model (GMM) and a plurality of encoders, where the GMM classifies video frame blocks with a plurality of clusters and assigns the video frame blocks to the plurality of clusters. Further, the plurality of encoders receive the video frame blocks and generate a plurality of compressed-sensed frame block vectors, where the plurality of encoders correspond to the plurality of clusters.

Type: Grant

Filed: March 15, 2022

Date of Patent: August 19, 2025

Assignees: KWAI INC., SANTA CLARA UNIVERSITY

Inventors: Yifei Pei, Ying Liu, Nam Ling, Lingzhi Liu, Yongxiong Ren, Ming Kai Hsu
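
The routing idea in the abstract above (a GMM assigns each video frame block to a cluster, and the encoder for that cluster produces the compressed-sensed vector) can be illustrated with a minimal sketch. Everything concrete here is an assumption for illustration only: the diagonal-covariance GMM parameters are taken as already fitted, each "encoder" is reduced to a single linear measurement matrix rather than a trained neural network, and the function names are hypothetical.

```python
import math

def log_gaussian_diag(x, mean, var):
    # Log-density of a Gaussian with diagonal covariance.
    return -0.5 * sum(
        math.log(2 * math.pi * v) + (xi - m) ** 2 / v
        for xi, m, v in zip(x, mean, var)
    )

def assign_cluster(block, weights, means, variances):
    # Pick the mixture component with the highest posterior score.
    scores = [
        math.log(w) + log_gaussian_diag(block, m, v)
        for w, m, v in zip(weights, means, variances)
    ]
    return max(range(len(scores)), key=scores.__getitem__)

def encode_block(block, encoders, weights, means, variances):
    # Route the block to its cluster's encoder (here: one linear layer,
    # a stand-in for the per-cluster encoder networks).
    k = assign_cluster(block, weights, means, variances)
    measurements = [sum(a * b for a, b in zip(row, block)) for row in encoders[k]]
    return k, measurements

# Two hypothetical clusters over flattened 4-pixel blocks.
weights = [0.5, 0.5]
means = [[0, 0, 0, 0], [10, 10, 10, 10]]
variances = [[1, 1, 1, 1], [1, 1, 1, 1]]
encoders = [
    [[1, 0, 0, 0], [0, 1, 0, 0]],    # encoder for cluster 0
    [[1, 1, 1, 1], [1, -1, 1, -1]],  # encoder for cluster 1
]
cluster, vector = encode_block([9, 11, 10, 10], encoders, weights, means, variances)
```

In this toy setup the block lies near the second cluster mean, so it is compressed from four values to two by the second encoder.
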
End-to-end deep generative network for low bitrate image coding

Patent number: 12327385

Abstract: A neural network system, a method and an apparatus for image compression are provided. The neural network may include a generator including an encoder, an entropy estimator, and a decoder, where the encoder receives an input image and generates an encoder output, a plurality of quantized feature entries are obtained based on the encoder output outputted at a last encoder block, the entropy estimator receives the plurality of quantized feature entries and calculates an entropy loss based on the plurality of quantized feature entries, and the decoder receives the plurality of quantized feature entries and generates a reconstructed image. Furthermore, the neural network may include a discriminator that determines whether the reconstructed image is different from the input image based on a discriminator loss. Moreover, the generator may determine whether content of the reconstructed image matches content of the input image based on a generator loss including the entropy loss.

Type: Grant

Filed: October 19, 2022

Date of Patent: June 10, 2025

Assignees: SANTA CLARA UNIVERSITY, KWAI INC.

Inventors: Yifei Pei, Ying Liu, Nam Ling, Yongxiong Ren, Lingzhi Liu
Generative adversarial network for video compression

Patent number: 12058312

Abstract: A method and an apparatus for video processing are provided. The method includes that a decoding terminal receives a plurality of coded video frames coded using one or more generative adversarial networks (GANs), receives network parameters related to the one or more GANs, and decodes the plurality of coded video frames using GANs based on the network parameters. Further, the one or more GANs respectively implement one or more video coding functions including reference-frame coding, motion-compensated frame prediction, and residue-frame coding.

Type: Grant

Filed: October 6, 2021

Date of Patent: August 6, 2024

Assignees: KWAI INC., SANTA CLARA UNIVERSITY

Inventors: Pengli Du, Ying Liu, Nam Ling, Lingzhi Liu, Yongxiong Ren, Ming Kai Hsu
END-TO-END DEEP GENERATIVE NETWORK FOR LOW BITRATE IMAGE CODING

Publication number: 20240185473

Abstract: A neural network system, a method and an apparatus for image compression are provided. The neural network may include a generator including an encoder, an entropy estimator, and a decoder, where the encoder receives an input image and generates an encoder output, a plurality of quantized feature entries are obtained based on the encoder output outputted at a last encoder block, the entropy estimator receives the plurality of quantized feature entries and calculates an entropy loss based on the plurality of quantized feature entries, and the decoder receives the plurality of quantized feature entries and generates a reconstructed image. Furthermore, the neural network may include a discriminator that determines whether the reconstructed image is different from the input image based on a discriminator loss. Moreover, the generator may determine whether content of the reconstructed image matches content of the input image based on a generator loss including the entropy loss.

Type: Application

Filed: October 19, 2022

Publication date: June 6, 2024

Applicants: SANTA CLARA UNIVERSITY, KWAI INC.

Inventors: Yifei PEI, Ying LIU, Nam LING, Yongxiong REN, Lingzhi LIU
GENERATIVE VIDEO COMPRESSION WITH A TRANSFORMER-BASED DISCRIMINATOR

Publication number: 20240185075

Abstract: A method, an apparatus, and a non-transitory computer-readable storage medium for video compression using a generative adversarial network (GAN) are provided. The method includes obtaining, by a generator of the GAN, a reconstructed target frame based on a reference frame and a raw target frame to be reconstructed; concatenating, by a transformer-based discriminator of the GAN, the reference frame, the raw target frame and the reconstructed target frame to obtain paired data; determining, by the transformer-based discriminator of the GAN, whether the paired data is real or fake to guide reconstruction of the raw target frame; and determining a generator loss and a transformer-based discriminator loss, and performing gradient back propagation and updating network parameters of the GAN based on the generator loss and the transformer-based discriminator loss.

Type: Application

Filed: October 21, 2022

Publication date: June 6, 2024

Applicants: SANTA CLARA UNIVERSITY, KWAI INC.

Inventors: Pengli DU, Ying LIU, Nam LING, Yongxiong REN, Lingzhi LIU
GENERATIVE ADVERSARIAL NETWORK FOR VIDEO COMPRESSION

Publication number: 20230105436

Abstract: A method and an apparatus for video processing are provided. The method includes that a decoding terminal receives a plurality of coded video frames coded using one or more generative adversarial networks (GANs), receives network parameters related to the one or more GANs, and decodes the plurality of coded video frames using GANs based on the network parameters. Further, the one or more GANs respectively implement one or more video coding functions including reference-frame coding, motion-compensated frame prediction, and residue-frame coding.

Type: Application

Filed: October 6, 2021

Publication date: April 6, 2023

Applicants: KWAI INC., SANTA CLARA UNIVERSITY

Inventors: Pengli DU, Ying LIU, Nam LING, Lingzhi LIU, Yongxiong REN, Ming Kai HSU
CLASS-SPECIFIC NEURAL NETWORK FOR VIDEO COMPRESSED SENSING

Publication number: 20220292727

Abstract: A class-specific neural network for video compressed sensing and methods for training and testing the class-specific neural network are provided. The class-specific neural network includes a Gaussian-mixture model (GMM) and a plurality of encoders, where the GMM classifies video frame blocks with a plurality of clusters and assigns the video frame blocks to the plurality of clusters. Further, the plurality of encoders receive the video frame blocks and generate a plurality of compressed-sensed frame block vectors, where the plurality of encoders correspond to the plurality of clusters.

Type: Application

Filed: March 15, 2022

Publication date: September 15, 2022

Applicants: KWAI INC., SANTA CLARA UNIVERSITY

Inventors: Yifei PEI, Ying LIU, Nam LING, Lingzhi LIU, Yongxiong REN, Ming Kai HSU
3D SEPARABLE DEEP CONVOLUTIONAL NEURAL NETWORK FOR MOVING OBJECT DETECTION

Publication number: 20220164630

Abstract: A method for detecting moving objects in video frames, an apparatus and a non-transitory computer-readable storage medium thereof are provided. The method includes that: an encoder in a 3-dimensional (3D) separable convolutional neural network with multi-input multi-output (3DS_MM) receives a first input including multiple video frames, where the encoder includes a plurality of encoder layers including 3D separable convolutional neural network (CNN) layers; the encoder generates a first encoder output; and a decoder in the 3DS_MM receives the first encoder output and generates a first output including multiple first binary masks related to the first input, where the decoder includes a plurality of decoder layers comprising 3D separable transposed CNN layers.

Type: Application

Filed: November 22, 2021

Publication date: May 26, 2022

Applicants: KWAI INC., SANTA CLARA UNIVERSITY

Inventors: Bingxin HOU, Ying LIU, Nam LING, Lingzhi LIU, Yongxiong REN, Ming Kai HSU
Mode dependent intra smoothing filter table mapping methods for non-square prediction units

Patent number: 11190809

Abstract: An apparatus including a memory operably coupled to a processor. The processor is configured to determine whether to use an intra smoothing filter for a rectangular prediction unit (PU), wherein a width of the rectangular PU is different from a height of the rectangular PU.

Type: Grant

Filed: March 2, 2020

Date of Patent: November 30, 2021

Assignee: Futurewei Technologies, Inc.

Inventors: Guichun Li, Lingzhi Liu, Changcai Lai, Nam Ling, Jianhua Zheng, Chen-Xiong Zhang
Non-MPM mode coding for intra prediction in video coding

Patent number: 10764577

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for performing intra coding predictions. An intra-coding process applied to pixels in a frame of media is determined. Whether the intra-coding process corresponds to at least one of the most probable modes is determined. In response to determining the intra-coding process does not correspond to the at least one of the most probable modes, four angular prediction modes are extracted from a list of prediction modes. A set of candidate modes based on the four angular prediction modes is determined. A pre-defined order of the set of candidate modes is determined, wherein each candidate mode of the set of candidate modes is included in a ranked order and signaled with a particular fixed length coding, and wherein a length of the particular fixed length coding increases based on the pre-defined order of the set of candidate modes.

Type: Grant

Filed: October 25, 2019

Date of Patent: September 1, 2020

Assignees: Futurewei Technologies, Inc., Santa Clara University

Inventors: Minqiang Jiang, Taru Kanchan, Jianhua Zheng, Nam Ling, Chen-Xiong Zhang
Mode Dependent Intra Smoothing Filter Table Mapping Methods for Non-Square Prediction Units

Publication number: 20200204830

Abstract: An apparatus including a memory operably coupled to a processor. The processor is configured to determine whether to use an intra smoothing filter for a rectangular prediction unit (PU), wherein a width of the rectangular PU is different from a height of the rectangular PU.

Type: Application

Filed: March 2, 2020

Publication date: June 25, 2020

Inventors: Guichun Li, Lingzhi Liu, Changcai Lai, Nam Ling, Jianhua Zheng, Chen-Xiong Zhang
Mode dependent intra smoothing filter table mapping methods for non-square prediction units

Patent number: 10645422

Abstract: An apparatus including a memory operably coupled to a processor. The processor is configured to select an intra smoothing filter for a rectangular prediction unit (PU) based on a lookup table (LUT) used for square PUs, wherein a width of the rectangular PU is different from a height of the rectangular PU.

Type: Grant

Filed: June 7, 2018

Date of Patent: May 5, 2020

Assignee: Futurewei Technologies, Inc.

Inventors: Guichun Li, Lingzhi Liu, Changcai Lai, Nam Ling, Jianhua Zheng, Chen-Xiong Zhang
NON-MPM MODE CODING FOR INTRA PREDICTION IN VIDEO CODING

Publication number: 20200137385

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for performing intra coding predictions. An intra-coding process applied to pixels in a frame of media is determined. Whether the intra-coding process corresponds to at least one of the most probable modes is determined. In response to determining the intra-coding process does not correspond to the at least one of the most probable modes, four angular prediction modes are extracted from a list of prediction modes. A set of candidate modes based on the four angular prediction modes is determined. A pre-defined order of the set of candidate modes is determined, wherein each candidate mode of the set of candidate modes is included in a ranked order and signaled with a particular fixed length coding, and wherein a length of the particular fixed length coding increases based on the pre-defined order of the set of candidate modes.

Type: Application

Filed: October 25, 2019

Publication date: April 30, 2020

Inventors: Minqiang JIANG, Taru KANCHAN, Jianhua ZHENG, Nam LING, Chen-Xiong ZHANG
Systems, methods, and devices for image coding

Patent number: 10587900

Abstract: System and method embodiments for image coding are disclosed. In an embodiment, a method in a data processing system for image encoding includes determining a sparsity constraint according to a dimension of an input image signal. The method also includes iteratively determining a plurality of approximations to the input image signal. Each iteration provides an approximation of the input image signal. Each approximation includes a set of dictionary element indices and coefficients. The dictionary is an over-complete dictionary. Iterations of the determining step are terminated when a number of iterations is equal to the sparsity constraint. The method also includes selecting one of the plurality of approximations according to a minimum rate-distortion cost. The method also includes determining an encoded image signal according to non-zero coefficients and corresponding indices for each non-zero coefficient in the selected approximation.

Type: Grant

Filed: February 15, 2017

Date of Patent: March 10, 2020

Assignees: FUTUREWEI TECHNOLOGIES, INC., Santa Clara University

Inventors: Minqiang Jiang, Jianhua Zheng, Madhusudan Kalluri, Nam Ling, Chen-Xiong Zhang
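
The encoding loop described in the abstract above (iterate up to a sparsity constraint, keep one approximation per iteration, then pick the approximation with the lowest rate-distortion cost) can be sketched with plain greedy matching pursuit. This is only an illustration under stated assumptions: the abstract does not name the pursuit algorithm, the rule in `sparsity_constraint`, the rate model `bits_per_term`, and the multiplier `lam` are all hypothetical, and dictionary atoms are assumed to be unit-norm.

```python
def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def sparsity_constraint(dim):
    # Hypothetical rule: allow at most half as many terms as the signal dimension.
    return max(1, dim // 2)

def matching_pursuit_rd(signal, dictionary, lam=0.1, bits_per_term=8.0):
    """Return {atom index: coefficient} for the approximation with minimum RD cost."""
    k_max = sparsity_constraint(len(signal))
    residual = list(signal)
    coeffs = {}
    best_cost, best_coeffs = None, {}
    for _ in range(k_max):
        # Select the unit-norm atom most correlated with the current residual.
        idx = max(range(len(dictionary)),
                  key=lambda j: abs(dot(residual, dictionary[j])))
        c = dot(residual, dictionary[idx])
        coeffs[idx] = coeffs.get(idx, 0.0) + c
        residual = [r - c * a for r, a in zip(residual, dictionary[idx])]
        # Assumed cost model: squared error plus lam times a flat per-term rate.
        cost = dot(residual, residual) + lam * bits_per_term * len(coeffs)
        if best_cost is None or cost < best_cost:
            best_cost, best_coeffs = cost, dict(coeffs)
    # Only non-zero coefficients and their indices would be encoded.
    return {i: c for i, c in best_coeffs.items() if c != 0.0}

# Over-complete dictionary for 4-sample signals: 5 unit-norm atoms.
atoms = [
    [1.0, 0.0, 0.0, 0.0],
    [0.0, 1.0, 0.0, 0.0],
    [0.0, 0.0, 1.0, 0.0],
    [0.0, 0.0, 0.0, 1.0],
    [0.5, 0.5, 0.5, 0.5],
]
selected = matching_pursuit_rd([2.0, 2.0, 2.0, 2.0], atoms)
```

Here the flat signal is captured exactly by the fifth atom after one iteration, so the second iteration only adds rate and the one-term approximation wins on cost.
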
Illumination compensation (IC) refinement based on positional pairings among pixels

Patent number: 10554967

Abstract: An apparatus comprises a receiver configured to receive video views comprising a reference view and a current view, wherein the reference view comprises a reference block and the current view comprises a current block, and a processor coupled to the receiver and configured to determine neighboring reference pixels associated with the reference block, determine neighboring current pixels associated with the current block, determine a first positional pairing between the neighboring reference pixels and the neighboring current pixels, determine a second positional pairing between the neighboring reference pixels and the neighboring current pixels, and determine an optimal pairing from between the first positional pairing and the second positional pairing.

Type: Grant

Filed: March 20, 2015

Date of Patent: February 4, 2020

Inventors: Zhouye Gu, Jianhua Zheng, Nam Ling, Chen-Xiong Zhang
System and method for estimating view synthesis distortion

Patent number: 10326995

Abstract: System and method embodiments are provided for achieving improved View Synthesis Distortion (VSD) calculation and more accurate distortion estimation of encoded video frames. An embodiment method includes obtaining a depth map value for a video frame and determining a weighting factor for depth distortion in accordance with the depth map value. The weighting factor maps a pixel range of the depth map value to an output function having higher values for closer image objects and lower values for farther image objects. The VSD for the video frame is then calculated as a function of absolute horizontal texture gradients weighted by a depth distortion value and the weighting factor determined in accordance with the depth map value.

Type: Grant

Filed: June 30, 2017

Date of Patent: June 18, 2019

Assignees: Futurewei Technologies, Inc., Santa Clara University

Inventors: Zhouye Gu, Nam Ling, Chen-Xiong Zhang, Jianhua Zheng
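
As a rough illustration of the calculation in the abstract above, the sketch below combines absolute horizontal texture gradients, a per-pixel depth distortion, and a depth-dependent weighting factor. The specifics are assumptions rather than the patented formula: the squared-sum form and the constant `alpha` follow a common way of writing VSD, larger depth-map values are assumed to mean closer objects, and `depth_weight` is a hypothetical linear mapping chosen only because it increases for closer objects.

```python
def depth_weight(depth_value, max_depth=255):
    # Hypothetical mapping: 1.0 for the farthest value, 2.0 for the closest.
    return 1.0 + depth_value / max_depth

def view_synthesis_distortion(orig_depth, coded_depth, texture, alpha=0.5):
    height, width = len(texture), len(texture[0])
    total = 0.0
    for y in range(height):
        for x in range(width):
            # Absolute horizontal texture gradients, clamped at the frame edges.
            left = texture[y][max(x - 1, 0)]
            right = texture[y][min(x + 1, width - 1)]
            grad = abs(texture[y][x] - left) + abs(texture[y][x] - right)
            depth_err = abs(orig_depth[y][x] - coded_depth[y][x])
            weight = depth_weight(orig_depth[y][x])
            total += (0.5 * alpha * weight * depth_err * grad) ** 2
    return total

texture = [[0, 10, 0]]
near = view_synthesis_distortion([[255, 255, 255]], [[255, 253, 255]], texture)
far = view_synthesis_distortion([[0, 0, 0]], [[0, 2, 0]], texture)
```

With the same depth error and texture, the nearer block yields a larger distortion than the farther one, which is the behavior the weighting factor is meant to produce.
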
Method and apparatus of depth prediction mode selection

Patent number: 10306266

Abstract: A method, an apparatus and a decoder for decoding a block of a depth map are provided. An ordered list of decoding modes is obtained, wherein the ordered list of decoding modes comprises a plurality of decoding modes each of which is capable of being used for decoding of the block. A plurality of depth modeling modes (DMMs) each of which is capable of being used for decoding of the block are obtained. Whether a DMM of the plurality of DMMs is to be added into the ordered list of decoding modes is then determined in accordance with a decision condition.

Type: Grant

Filed: November 21, 2016

Date of Patent: May 28, 2019

Assignee: FUTUREWEI TECHNOLOGIES, INC.

Inventors: Zhouye Gu, Jianhua Zheng, Nam Ling, Chen-Xiong Zhang
Reference pixel selection and filtering for intra coding of depth map

Patent number: 10129542

Abstract: A video codec configured to receive a current block and a plurality of neighboring pixels, wherein the current block comprises a first partition and a second partition, select one or more reference pixels from the plurality of neighboring pixels, and predict a plurality of pixels located in the second partition based on the reference pixels.

Type: Grant

Filed: October 16, 2014

Date of Patent: November 13, 2018

Assignees: Futurewei Technologies, Inc., Santa Clara University

Inventors: Zhouye Gu, Jianhua Zheng, Nam Ling, Philipp Zhang
Mode Dependent Intra Smoothing Filter Table Mapping Methods for Non-Square Prediction Units

Publication number: 20180295386

Abstract: An apparatus including a memory operably coupled to a processor. The processor is configured to select an intra smoothing filter for a rectangular prediction unit (PU) based on a lookup table (LUT) used for square PUs, wherein a width of the rectangular PU is different from a height of the rectangular PU.

Type: Application

Filed: June 7, 2018

Publication date: October 11, 2018

Inventors: Guichun Li, Lingzhi Liu, Changcai Lai, Nam Ling, Jianhua Zheng, Chen-Xiong Zhang
