Patents by Inventor Minghai Qin

Minghai Qin has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Method and system for processing video content

Patent number: 11445200

Abstract: Embodiments of the disclosure provide systems and methods for processing video content. The method can include: receiving raw video data of a video; determining a texture complexity for the video based on the raw video data; determining an encoding mode for the raw video data based on the texture complexity; and encoding the raw video data using the determined encoding mode.

Type: Grant

Filed: May 12, 2020

Date of Patent: September 13, 2022

Assignee: Alibaba Group Holding Limited

Inventors: Minghai Qin, Guanlin Wu, Tae Meon Bae, Sicheng Li, Yuanwei Fang, Yen-kuang Chen
FREQUENCY SPECIFIC COMPRESSION AND COMPENSATION TECHNIQUE IN IMAGE PROCESSING

Publication number: 20220286711

Abstract: The present disclosure relates to a method for compensating an image. The method comprises estimating transform coefficients of a frequency component for a first image based on the first image, performing a dot multiplication operation between the estimated transform coefficients and a basis function associated with the frequency component to generate a compensation image, and combining the first image and the compensation image to generate a combined image.

Type: Application

Filed: May 24, 2022

Publication date: September 8, 2022

Inventors: Zihao Liu, Sicheng Li, Minghai Qin, Yen-kuang Chen
Storing neural networks and weights for neural networks

Patent number: 11436482

Abstract: Systems and methods are disclosed for storing neural networks and weights for neural networks. In some implementations, a method is provided. The method includes storing a plurality of weights of a neural network comprising a plurality of nodes and a plurality of connections between the plurality of nodes. Each weight of the plurality of weights is associated with a connection of the plurality of connections. The neural network comprises a binarized neural network. The method also includes receiving input data to be processed by the neural network. The method further includes determining whether a set of weights of the plurality of weights comprises one or more errors. The method further includes refraining from using the set of weights to process the input data using the neural network in response to determining that the set of weights comprises the one or more errors.

Type: Grant

Filed: October 29, 2018

Date of Patent: September 6, 2022

Assignee: Western Digital Technologies, Inc.

Inventor: Minghai Qin
Noise injection training for memory-based learning

Patent number: 11403529

Abstract: The system described herein can include neural networks with noise-injection layers. The noise-injection layers can enable the neural networks to be trained such that the neural networks are able to maintain their classification and prediction performance in the presence of noisy data signals. Once trained, the parameters from the neural networks with noise-injection layers can be used in the neural networks of systems that include resistive random-access memory (ReRAM), memristors, or phase change memory (PCM), which use analog signals that can introduce noise into the system. The use of ReRAM, memristors, or PCM can enable large-scale parallelism that improves the speed and computational efficiency of neural network training and classification. Using the parameters from the neural networks trained with noise-injection layers enables the neural networks to make robust predictions and calculations in the presence of noisy data.

Type: Grant

Filed: June 28, 2018

Date of Patent: August 2, 2022

Assignee: Western Digital Technologies, Inc.

Inventors: Minghai Qin, Dejan Vucinic
Techniques to dynamically gate encoded image components for artificial intelligence tasks

Patent number: 11403783

Abstract: A system for processing encoded image components for artificial intelligence tasks. The system can include one or more compute units, one or more controllers and memory. The one or more controllers can include one or more micro-op schedulers and one or more channel switches. The one or more compute units can be configured to process components of the transformed domain image data according to one or more micro-operations for an artificial intelligence task. The one or more channel switches can be configured to selectively control the transfer of the components of transformed domain image data to the one or more compute units based on one or more gating flags. The one or more channel switches can also be configured to selectively control generation of the one or more micro-operations by the one or more micro-op schedulers based on the one or more gating flags.

Type: Grant

Filed: November 14, 2019

Date of Patent: August 2, 2022

Assignee: Alibaba Group Holding Limited

Inventors: Kai Xu, Minghai Qin, Yuhao Wang, Fei Sun, Yen-kuang Chen, Yuan Xie
Static channel filtering in frequency domain

Patent number: 11403782

Abstract: Methods and systems are provided for implementing static channel filtering operations upon image datasets transformed to frequency domain representations, including decoding images of an image dataset to generate a frequency domain representation of the image dataset; discarding coefficient values of one or more particular frequency channels of each image of the image dataset in a frequency domain representation; and transporting the image dataset in a frequency domain representation to one or more special-purpose processor(s). Methods and systems of the present disclosure may enable a filtered image dataset to be input to a second layer of a learning model, bypassing a first layer, or may enable a learning model to be designed with a reduced-size first layer. This may achieve benefits such as reducing computational overhead and time of machine learning training and inference computations, reducing volume of image data input into the learning model, and reducing convergence time.

Type: Grant

Filed: December 31, 2019

Date of Patent: August 2, 2022

Assignee: Alibaba Group Holding Limited

Inventors: Kai Xu, Fei Sun, Minghai Qin, Yen-kuang Chen
Method and apparatus for efficient application screen compression

Patent number: 11386873

Abstract: A method of encoding an application screen comprises partitioning graphic data into a plurality of graphic layers and classifying each of the plurality of graphic layers as either a screen content (SC) or a non-screen content (non-SC) layer. Further, the method comprises rendering and encoding the one or more SC layers using a first codec and the one or more non-SC layers using a second codec.

Type: Grant

Filed: April 1, 2020

Date of Patent: July 12, 2022

Assignee: Alibaba Group Holding Limited

Inventors: Tae Meon Bae, Sicheng Li, Yen-kuang Chen, Guanlin Wu, Shaolin Xie, Minghai Qin, Qinggang Zhou
Region-of-interest based video encoding

Patent number: 11388423

Abstract: A video processing unit can include a non-object-based region-of-interest detection neural network, a threshold selection module and a region-of-interest map generator. The non-object-based region-of-interest detection neural network can be configured to receive a video frame and generate a plurality of candidate non-object-based region-of-interest blocks. The threshold selection module can be configured to receive the plurality of candidate non-object-based region-of-interest blocks and identify a plurality of selected region-of-interest blocks based on a predetermined threshold. The region-of-interest map generator can be configured to receive the selected non-object-based region-of-interest blocks and generate a region-of-interest map.

Type: Grant

Filed: March 23, 2020

Date of Patent: July 12, 2022

Assignee: Alibaba Group Holding Limited

Inventors: Minghai Qin, Sicheng Li, Guanlin Wu, Tae Meon Bae, Yen-kuang Chen
METHOD AND SYSTEM FOR MICROARCHITECTURE-AWARE PROGRAM SAMPLING

Publication number: 20220215241

Abstract: This application describes methods, systems, and apparatus, including computer programs encoded on computer storage media, for microarchitecture-aware program sampling. An exemplary method includes receiving one or more traces collected from one or more microarchitectures executing a computer program for evaluating hardware configurations; training a machine learning (ML) model with multi-task learning based on the one or more traces as one or more training tasks; generating a plurality of embedded vectors representing the computer program; and updating, based on the trained ML model, the plurality of embedded vectors.

Type: Application

Filed: January 5, 2021

Publication date: July 7, 2022

Inventors: Yuanwei FANG, Minghai QIN, Yen-kuang CHEN
Frequency specific compression and compensation technique in image processing

Patent number: 11375241

Abstract: The present disclosure relates to a method for compensating an image. The method comprises estimating transform coefficients of a frequency component for a first image based on the first image, performing a dot multiplication operation between the estimated transform coefficients and a basis function associated with the frequency component to generate a compensation image, and combining the first image and the compensation image to generate a combined image.

Type: Grant

Filed: October 20, 2020

Date of Patent: June 28, 2022

Assignee: Alibaba Group Holding Limited

Inventors: Zihao Liu, Sicheng Li, Minghai Qin, Yen-kuang Chen
Using selected components of frequency domain image data in artificial intelligence tasks

Patent number: 11366979

Abstract: Image data is accessed. The image data includes frequency domain components. A subset of the frequency domain components is selected based on the relative importance of the frequency domain components. Only the subset of the frequency domain components is provided to an accelerator that executes a neural network to perform an artificial intelligence task using the subset of frequency domain components.

Type: Grant

Filed: November 14, 2019

Date of Patent: June 21, 2022

Assignee: Alibaba Group Holding Limited

Inventors: Yuhao Wang, Minghai Qin, Yen-Kuang Chen
SUPER-SPARSE IMAGE COMPRESSION USING CROSS-BAR NON-VOLATILE MEMORY DEVICE

Publication number: 20220171992

Abstract: Exemplary methods and apparatus are disclosed that implement super-sparse image/video compression by storing image dictionary elements within a cross-bar resistive random access memory (ReRAM) array (or other suitable cross-bar NVM array). In illustrative examples, each column of the cross-bar ReRAM array stores the values for one dictionary element (such as one 4×4 dictionary element). Methods and apparatus are described for training (configuring) the cross-bar ReRAM array to generate and store the dictionary elements by sequentially applying patches from training images to the array using an unstructured Hebbian training procedure. Additionally, methods and apparatus are described for compressing an input image by applying patches from the input image to the ReRAM array to read out cross-bar column indices identifying the columns storing the various dictionary elements that best fit the image. This may be done in parallel using a set of ReRAM arrays.

Type: Application

Filed: February 9, 2022

Publication date: June 2, 2022

Inventors: Wen Ma, Minghai Qin, Won Ho Choi, Pi-Feng Chiu, Martin Lueker-Boden
METHOD AND SYSTEM FOR CHARACTERISTIC-BASED VIDEO PROCESSING

Publication number: 20220147567

Abstract: A method and apparatus for characteristic-based video processing include: in response to receiving a region of a picture of a video sequence, determining a characteristic in the region, the region being independent of other regions of the picture for video coding; determining a class associated with the region based on the characteristic, the class being selected from a plurality of classes; and encoding the region using a parameter set associated with the class, the parameter set being selected from a plurality of parameter sets for video coding at different quality levels.

Type: Application

Filed: January 21, 2022

Publication date: May 12, 2022

Inventors: Shaolin XIE, Minghai QIN, Yen-kuang CHEN, Tae Meon BAE, Qinggang ZHOU
Realization of binary neural networks in NAND memory arrays

Patent number: 11328204

Abstract: Use of a NAND array architecture to realize a binary neural network (BNN) allows for matrix multiplication and accumulation to be performed within the memory array. A unit synapse for storing a weight of a BNN is stored in a pair of series connected memory cells. A binary input is applied as a pattern of voltage values on a pair of word lines connected to the unit synapse to perform the multiplication of the input with the weight by determining whether or not the unit synapse conducts. The results of such multiplications are determined by a sense amplifier, with the results accumulated by a counter.

Type: Grant

Filed: March 28, 2019

Date of Patent: May 10, 2022

Assignee: SanDisk Technologies LLC

Inventors: Won Ho Choi, Pi-Feng Chiu, Wen Ma, Minghai Qin, Gerrit Jan Hemink, Martin Lueker-Boden
FREQUENCY SPECIFIC COMPRESSION AND COMPENSATION TECHNIQUE IN IMAGE PROCESSING

Publication number: 20220124375

Abstract: The present disclosure relates to a method for compensating an image. The method comprises estimating transform coefficients of a frequency component for a first image based on the first image, performing a dot multiplication operation between the estimated transform coefficients and a basis function associated with the frequency component to generate a compensation image, and combining the first image and the compensation image to generate a combined image.

Type: Application

Filed: October 20, 2020

Publication date: April 21, 2022

Inventors: Zihao Liu, Sicheng Li, Minghai Qin, Yen-kuang Chen
INTELLIGENT COMPUTING RESOURCES ALLOCATION FOR FEATURE NETWORK BASED ON FEATURE PROPAGATION

Publication number: 20220103831

Abstract: The present disclosure relates to a method for scheduling computation resources for generating feature maps for video. The method comprises determining runtime for generating feature maps of a reference picture and a predicted picture, determining available computation resources for generating the feature maps, and allocating, based on the runtime, one or more computation resources among the available computation resources for generating the feature maps such that the feature maps are generated at regular time intervals.

Type: Application

Filed: September 30, 2020

Publication date: March 31, 2022

Inventors: Sicheng Li, Yuanwei Fang, Minghai Qin, Yen-kuang Chen
ESTIMATED OPTIMAL VIDEO ENCODING PARAMETER SET SYSTEMS AND METHODS

Publication number: 20220094961

Abstract: The systems and methods are configured to efficiently and effectively determine or find an estimated optimal encoding parameter set. In one embodiment, a video encoding parameter set estimation method comprises: performing an offline encoding parameter set characteristic prediction process that determines an estimate of a candidate encoding parameter set characteristic; and performing an encoding parameter set search process that identifies a predicted or estimated optimal video encoding parameter set. The encoding parameter set search process can include applying a constraint to the candidate encoding parameter set characteristic; and determining if the candidate encoding parameter set meets an objective, wherein the determining is performed if the constraint is satisfied. The candidate encoding parameter set characteristic can be an estimated encoding time of the candidate encoding parameter set. The objective can be the best video quality out of a plurality of candidate encoding parameter sets.

Type: Application

Filed: May 4, 2021

Publication date: March 24, 2022

Inventors: Tae Meon BAE, Minghai QIN, Yen-kuang CHEN, Guanlin WU, Sicheng LI
Super-sparse image compression using cross-bar non-volatile memory device

Patent number: 11275968

Abstract: Exemplary methods and apparatus are disclosed that implement super-sparse image/video compression by storing image dictionary elements within a cross-bar resistive random access memory (ReRAM) array (or other suitable cross-bar NVM array). In illustrative examples, each column of the cross-bar ReRAM array stores the values for one dictionary element (such as one 4×4 dictionary element). Methods and apparatus are described for training (configuring) the cross-bar ReRAM array to generate and store the dictionary elements by sequentially applying patches from training images to the array using an unstructured Hebbian training procedure. Additionally, methods and apparatus are described for compressing an input image by applying patches from the input image to the ReRAM array to read out cross-bar column indices identifying the columns storing the various dictionary elements that best fit the image. This may be done in parallel using a set of ReRAM arrays.

Type: Grant

Filed: February 13, 2019

Date of Patent: March 15, 2022

Assignee: Western Digital Technologies, Inc.

Inventors: Wen Ma, Minghai Qin, Won Ho Choi, Pi-Feng Chiu, Martin Lueker-Boden
Region of interest quality controllable video coding techniques

Patent number: 11277626

Abstract: Video coding techniques including differential bit rate or quality coding of one or more regions of interest and one or more non-regions of interest based on information including one or more of coordinates of the one or more regions of interest, a target complexity, residual encoder bit data, a requested quality, a difference between the current video data frame and a reconstructed video data frame, a target quality, a requested bit rate, frame target bit allocation, and an as-encoded bit rate.

Type: Grant

Filed: February 21, 2020

Date of Patent: March 15, 2022

Assignee: Alibaba Group Holding Limited

Inventors: Guanlin Wu, Minghai Qin, Tae Meon Bae, Sicheng Li, Yuanwei Fang, Yen-Kuang Chen
MULTI-LEVEL SPARSE NEURAL NETWORKS WITH DYNAMIC REROUTING

Publication number: 20220076095

Abstract: Systems and methods for providing a neural network with multiple sparsity levels include sparsifying a matrix associated with the neural network to form a first sparse matrix; training the neural network using the first sparse matrix to form a second sparse matrix by fixing values and locations of non-zero elements of the first sparse matrix and updating a zero-value element of the first sparse matrix to be a non-zero value, wherein non-zero elements of the second sparse matrix include the non-zero elements of the first sparse matrix; and outputting the second sparse matrix for executing the neural network.

Type: Application

Filed: September 4, 2020

Publication date: March 10, 2022

Inventors: Minghai QIN, Tianyun ZHANG, Fei SUN, Yen-Kuang CHEN
