Patents by Inventor Minghai Qin

Minghai Qin has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

REDUCING DATA COMMUNICATIONS IN DISTRIBUTED INFERENCE SCHEMES

Publication number: 20240095540

Abstract: Methods and apparatus for processing data in a distributed inference scheme based on sparse inputs. An example method includes receiving an input at a first node. A first sparsified input is generated for a second node based on a set of features associated with the second node, which are identified based on a weight mask having non-zero values for weights associated with features upon which processing by the second node depends and zeroed values for weights associated with other features. The first sparsified input is transmitted to the second node for generating an output of the second node. A second sparsified input is received from the second node and combined into a combined input. The combined input is processed into an output of the first node. The neural network is configured to generate an inference based on processing the outputs of the first node and the second node.

Type: Application

Filed: June 28, 2023

Publication date: March 21, 2024

Applicant: Western Digital Technologies, Inc.

Inventors: Minghai QIN, Jaco HOFMANN, Chao SUN, Qingbo WANG, Dejan VUCINIC
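
The masking step described in the abstract above can be sketched as follows. This is only an illustration under stated assumptions, not the method claimed in the application: the function names, the dictionary-based sparse format, and the convention that each mask row is one unit of the receiving node are all hypothetical.

```python
from typing import Dict, List, Sequence


def needed_features(weight_mask: Sequence[Sequence[float]]) -> List[int]:
    """Return the indices of input features the receiving node depends on.

    Assumption: each mask row corresponds to one unit of the receiving node,
    and a non-zero entry in column j means the unit reads feature j.
    """
    n_features = len(weight_mask[0])
    return [j for j in range(n_features)
            if any(row[j] != 0 for row in weight_mask)]


def sparsify_for_node(x: Sequence[float],
                      weight_mask: Sequence[Sequence[float]]) -> Dict[int, float]:
    """Keep only the features the receiving node needs (index -> value)."""
    return {j: x[j] for j in needed_features(weight_mask)}


def combine_inputs(size: int, *sparse_inputs: Dict[int, float]) -> List[float]:
    """Merge sparse inputs into one dense vector; missing features stay 0."""
    dense = [0.0] * size
    for sparse in sparse_inputs:
        for j, value in sparse.items():
            dense[j] = value
    return dense


if __name__ == "__main__":
    mask_of_second_node = [[0, 1, 0, 1],
                           [0, 0, 0, 1]]
    x = [10, 20, 30, 40]
    to_send = sparsify_for_node(x, mask_of_second_node)
    print(to_send)  # only features 1 and 3 are transmitted
    print(combine_inputs(4, {0: 1.0}, to_send))
```

Only two of the four features cross the link in this toy case, which is the communication saving the abstract refers to.
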
Transcoder target bitrate prediction techniques

Patent number: 11792408

Abstract: Transcoding bitrate prediction techniques can include receiving a first encoded content. A transcoder bitrate can be estimated based on regression over a video quality estimator of the first encoded content and a second encoded content. The estimated transcoder bitrate can be utilized to transcode the first encoded content into the second encoded content.

Type: Grant

Filed: March 30, 2020

Date of Patent: October 17, 2023

Assignee: Alibaba Group Holding Limited

Inventors: Tae Meon Bae, Minghai Qin, Yen-kuang Chen, Guanlin Wu, Sicheng Li
Frequency specific compression and compensation technique in image processing

Patent number: 11785261

Abstract: The present disclosure relates to a method for compensating an image. The method comprises estimating transform coefficients of a frequency component for a first image based on the first image, performing a dot multiplication operation between the estimated transform coefficients and a basis function associated with the frequency component to generate a compensation image, and combining the first image and the compensation image to generate a combined image.

Type: Grant

Filed: May 24, 2022

Date of Patent: October 10, 2023

Assignee: Alibaba Group Holding Limited

Inventors: Zihao Liu, Sicheng Li, Minghai Qin, Yen-Kuang Chen
SYSTEMS AND METHODS FOR NEURAL NETWORK TRAINING WITH WEIGHT SPARSITY

Publication number: 20230306257

Abstract: Neural network (NN) model training techniques can include computing activations in a forward pass using a sparse weight matrix that is transpose invariant. The neural network (NN) model training techniques can further include computing activation gradients and weight gradients in a backward pass using the sparse weight matrix.

Type: Application

Filed: July 15, 2022

Publication date: September 28, 2023

Inventors: Fei SUN, Minghai QIN, Haoran LI, Guocai ZHU, Yuan GAO, Guyue HUANG, Yawen ZHANG
Hardware accelerated discretized neural network

Patent number: 11741188

Abstract: An innovative low-bit-width device may include a first digital-to-analog converter (DAC), a second DAC, a plurality of non-volatile memory (NVM) weight arrays, one or more analog-to-digital converters (ADCs), and a neural circuit. The first DAC is configured to convert a digital input signal into an analog input signal. The second DAC is configured to convert a digital previous hidden state (PHS) signal into an analog PHS signal. NVM weight arrays are configured to compute vector matrix multiplication (VMM) arrays based on the analog input signal and the analog PHS signal. The NVM weight arrays are coupled to the first DAC and the second DAC. The one or more ADCs are coupled to the plurality of NVM weight arrays and are configured to convert the VMM arrays into digital VMM values. The neural circuit is configured to process the digital VMM values into a new hidden state.

Type: Grant

Filed: July 8, 2021

Date of Patent: August 29, 2023

Assignee: WESTERN DIGITAL TECHNOLOGIES, INC.

Inventors: Wen Ma, Pi-Feng Chiu, Minghai Qin, Won Ho Choi, Martin Lueker-Boden
SCENE AWARE VIDEO CONTENT ENCODING

Publication number: 20230199192

Abstract: Scene aware video content encoding techniques can determine if video content is a given content type and is one of one or more given titles that include one or more given scenes. The one or more given scenes of the video content of the given type and given one of the titles can be encoded using corresponding scene-specific encoding parameter values, and the non-given scenes can be encoded using one or more general encoding parameter values. The one or more given titles can be selected based on a rate of streaming of various video content titles of the given type.

Type: Application

Filed: October 11, 2022

Publication date: June 22, 2023

Inventors: Tae Meon BAE, Minghai QIN, Guanlin WU, Yen-kuang CHEN, Qinggang ZHOU, Shaolin XIE
Symbol-based variable node updates for binary LDPC codes

Patent number: 11663077

Abstract: Systems and methods for implementing data protection techniques with symbol-based variable node updates for binary low-density parity-check (LDPC) codes are described. A semiconductor memory (e.g., a NAND flash memory) may read a set of data from a set of memory cells, determine a set of data state probabilities for the set of data based on sensed threshold voltages for the set of memory cells, generate a valid codeword for the set of data using an iterative LDPC decoding with symbol-based variable node updates and the set of data state probabilities, and store the valid codeword within the semiconductor memory or transfer the valid codeword from the semiconductor memory. The iterative LDPC decoding may utilize a message passing algorithm in which outgoing messages from a plurality of multi-variable nodes are generated using incoming messages (e.g., log-likelihood ratios or L-values) from a plurality of check nodes.

Type: Grant

Filed: June 23, 2020

Date of Patent: May 30, 2023

Assignee: Western Digital Technologies, Inc.

Inventor: Minghai Qin
PRUNING HARDWARE UNIT FOR TRAINING NEURAL NETWORK

Publication number: 20230109617

Abstract: A system for pruning weights during training of a neural network includes a configurable pruning hardware unit that is configured to: receive, from a neural network training engine, inputs including the weights, gradients associated with the weights, and a prune indicator per weight; select unpruned weights for pruning; prune the unpruned weights selected for pruning; update the prune indicator per weight for the weights that are selected and pruned; and provide the updated prune indicator to the training engine for the next iteration or epoch. The pruning hardware unit can be configured to perform incremental pruning or non-incremental pruning.

Type: Application

Filed: December 9, 2022

Publication date: April 6, 2023

Inventors: Tianchan GUAN, Yuan GAO, Hongzhong ZHENG, Minghai QIN, Chunsheng LIU, Dimin NIU
Apparatus and method for representation of a sparse matrix in a neural network

Patent number: 11586601

Abstract: The present disclosure relates to a method and an apparatus for representation of a sparse matrix in a neural network. In some embodiments, an exemplary operation unit includes a buffer for storing a representation of a sparse matrix in a neural network, a sparse engine communicatively coupled with the buffer, and a processing array communicatively coupled with the sparse engine. The sparse engine includes circuitry to: read the representation of the sparse matrix from the buffer, the representation comprising a first level bitmap, a second level bitmap, and an element array; decompress the first level bitmap to determine whether a block of the sparse matrix comprises a non-zero element; and in response to the block comprising a non-zero element, decompress the second level bitmap using the element array to obtain the block of the sparse matrix. The processing array includes circuitry to execute the neural network with the sparse matrix.

Type: Grant

Filed: February 5, 2020

Date of Patent: February 21, 2023

Assignee: Alibaba Group Holding Limited

Inventors: Zhibin Xiao, Xiaoxin Fan, Minghai Qin
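
A two-level bitmap representation like the one named in the abstract above might look like the sketch below. The layout is an assumption made for illustration (row-major flattening, fixed-size blocks, one second-level bitmap per non-empty block); the abstract does not specify these details, and the function names are hypothetical.

```python
from typing import List, Tuple

Matrix = List[List[int]]


def encode(matrix: Matrix, block: int) -> Tuple[List[int], List[List[int]], List[int]]:
    """Encode a matrix as (first-level bitmap, second-level bitmaps, elements).

    Assumed layout: the matrix is flattened row-major and cut into blocks of
    `block` elements. The first-level bitmap has one bit per block (1 = block
    holds a non-zero). Each non-empty block gets a second-level bitmap with
    one bit per element. Non-zero values are stored in order of appearance.
    """
    flat = [v for row in matrix for v in row]
    level1: List[int] = []
    level2: List[List[int]] = []
    elements: List[int] = []
    for start in range(0, len(flat), block):
        chunk = flat[start:start + block]
        if any(v != 0 for v in chunk):
            level1.append(1)
            level2.append([1 if v != 0 else 0 for v in chunk])
            elements.extend(v for v in chunk if v != 0)
        else:
            level1.append(0)  # empty block: nothing stored at level two
    return level1, level2, elements


def decode(level1: List[int], level2: List[List[int]], elements: List[int],
           rows: int, cols: int, block: int) -> Matrix:
    """Rebuild the dense matrix from the two bitmaps and the element array."""
    total = rows * cols
    flat: List[int] = []
    bitmaps = iter(level2)
    values = iter(elements)
    for i, bit in enumerate(level1):
        size = min(block, total - i * block)
        if bit == 0:
            flat.extend([0] * size)  # skip the block without touching level two
        else:
            flat.extend(next(values) if b else 0 for b in next(bitmaps))
    return [flat[r * cols:(r + 1) * cols] for r in range(rows)]


if __name__ == "__main__":
    m = [[0, 0, 0, 0], [5, 0, 0, 7], [0, 0, 0, 0], [0, 3, 0, 0]]
    l1, l2, el = encode(m, block=4)
    print(l1, l2, el)
    assert decode(l1, l2, el, rows=4, cols=4, block=4) == m
```

The first-level bitmap lets the decoder skip all-zero blocks without reading any second-level data, which is where the saving comes from on very sparse matrices.
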
Video encoding technique utilizing user guided information in cloud environment

Patent number: 11582478

Abstract: The present disclosure relates to a computer-implemented method for processing video data. The method comprises receiving a user input corresponding to a first picture of the video data, generating, based on the user input, prediction information of the first picture with respect to a reference picture of the video data, and encoding the first picture using the prediction information.

Type: Grant

Filed: September 8, 2020

Date of Patent: February 14, 2023

Assignee: Alibaba Group Holding Limited

Inventors: Yuhao Wang, Minghai Qin, Jian Lou, Yen-Kuang Chen
Data preprocessing and data augmentation in frequency domain

Patent number: 11570477

Abstract: Methods and systems are provided for implementing preprocessing operations and augmentation operations upon image datasets transformed to frequency domain representations, including decoding images of an image dataset to generate a frequency domain representation of the image dataset; performing a resizing operation based on resizing factors on the image dataset in a frequency domain representation; performing a reshaping operation based on reshaping factors on the image dataset in a frequency domain representation; and performing a cropping operation on the image dataset in a frequency domain representation. The methods and systems may further include performing an augmentation operation on the image dataset in a frequency domain representation. Methods and systems of the present disclosure may free learning models from computational overhead caused by transforming image datasets into frequency domain representations.

Type: Grant

Filed: December 31, 2019

Date of Patent: January 31, 2023

Assignee: Alibaba Group Holding Limited

Inventors: Kai Xu, Fei Sun, Minghai Qin, Yen-kuang Chen
Variable input size techniques for neural networks

Patent number: 11568252

Abstract: A neural network, trained on a plurality of random size data samples, can receive a plurality of inference data samples including samples of different sizes. The neural network can generate feature maps of the plurality of inference data samples. Pooling can be utilized to generate feature maps having a fixed size. The fixed size feature maps can be utilized to generate an indication of a class for each of the plurality of inference data samples.

Type: Grant

Filed: June 29, 2020

Date of Patent: January 31, 2023

Assignee: ALIBABA GROUP HOLDING LIMITED

Inventors: Minghai Qin, Yen-Kuang Chen, Zhenzhen Wang, Fei Sun
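
The pooling step in the abstract above, which turns feature maps of different sizes into a fixed size, can be illustrated with a one-dimensional adaptive max pool. This is a sketch under assumptions: the abstract does not say which pooling operator or bin rule is used, and the function name and bin boundaries below are illustrative choices.

```python
from typing import List, Sequence


def adaptive_max_pool_1d(features: Sequence[float], out_size: int) -> List[float]:
    """Pool a sequence of any positive length down (or up) to `out_size` values.

    Bin i covers indices [floor(i*n/out_size), ceil((i+1)*n/out_size)), so
    every bin is non-empty and the bins together cover the whole input.
    """
    n = len(features)
    if n == 0 or out_size <= 0:
        raise ValueError("need a non-empty input and a positive output size")
    pooled = []
    for i in range(out_size):
        start = (i * n) // out_size
        end = -((-(i + 1) * n) // out_size)  # integer ceiling division
        pooled.append(max(features[start:end]))
    return pooled


if __name__ == "__main__":
    # Inputs of different lengths all come out with three values.
    for sample in ([1, 5, 2, 8, 3, 7], [4, 9, 1], [1, 2, 3, 4, 5, 6, 7]):
        print(adaptive_max_pool_1d(sample, 3))
```

Because the output length no longer depends on the input length, a fixed-size classifier head can follow the pooling layer regardless of sample size.
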
Vector-vector multiplication techniques for processing systems

Patent number: 11568021

Abstract: Vector-vector multiplication or matrix-matrix multiplication computation on computing systems can include computing a first portion of a vector-vector multiplication product based on a most-significant-bit set of a first vector and a most-significant-bit set of a second vector, and determining if the first portion of the vector-vector multiplication product is less than a threshold. If the first portion of the vector-vector multiplication product is not less than the threshold, a remaining portion of the vector-vector multiplication product can be computed, and a rectified linear vector-vector multiplication product can be determined for the sum of the first portion of the vector-vector multiplication product and the remaining portion of the vector-vector multiplication product.

Type: Grant

Filed: February 21, 2020

Date of Patent: January 31, 2023

Assignee: Alibaba Group Holding Limited

Inventors: Minghai Qin, Zhibin Xiao, Chunsheng Liu
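
The most-significant-bits-first idea in the abstract above can be sketched with integer operands. Several details here are assumptions rather than anything the abstract states: the operands are Python integers split at bit position `k`, the output is 0 when the first portion falls below the threshold, and the function name and parameters are hypothetical.

```python
from typing import Sequence


def msb_first_relu_dot(a: Sequence[int], w: Sequence[int],
                       k: int, threshold: int) -> int:
    """ReLU(dot(a, w)) with an early exit based on the high-order bits.

    Each operand v is split as v = (v_hi << k) + v_lo with 0 <= v_lo < 2**k.
    Python's >> floors toward negative infinity, so the identity also holds
    for negative integers.
    """
    lo_mask = (1 << k) - 1
    a_hi = [v >> k for v in a]
    a_lo = [v & lo_mask for v in a]
    w_hi = [v >> k for v in w]
    w_lo = [v & lo_mask for v in w]

    # First portion: product of the most-significant-bit sets only.
    first = sum(x * y for x, y in zip(a_hi, w_hi)) << (2 * k)
    if first < threshold:
        # Assumed behaviour: treat the output as rectified to zero and skip
        # the remaining work. This is an approximation, because the skipped
        # terms could in principle still make the full product positive.
        return 0

    # Remaining portion: cross terms plus the low-by-low term.
    cross = sum(ah * wl + al * wh
                for ah, al, wh, wl in zip(a_hi, a_lo, w_hi, w_lo)) << k
    low = sum(x * y for x, y in zip(a_lo, w_lo))
    return max(0, first + cross + low)


if __name__ == "__main__":
    print(msb_first_relu_dot([100, 50], [3, 20], k=4, threshold=0))
    print(msb_first_relu_dot([100, -50], [3, 20], k=4, threshold=0))
```

Choosing the threshold trades accuracy for work saved: a very negative threshold never exits early and reproduces the exact rectified product, while a threshold near zero skips more computation at the risk of zeroing a small positive result.
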
HIERARCHICAL METHODS AND SYSTEMS FOR STORING DATA

Publication number: 20230021511

Abstract: Disclosed are systems and methods that determine whether instances of data (e.g., forward activations, backward derivatives of activations) that are used to train deep neural networks are to be stored on-chip or off-chip. The disclosed systems and methods are also used to prune the data (discard or delete selected instances of data). A system includes a hierarchical arrangement of on-chip and off-chip memories, and also includes a hierarchical arrangement of data selector devices that are used to decide whether to discard data and where in the system the data is to be discarded.

Type: Application

Filed: October 4, 2022

Publication date: January 26, 2023

Inventors: Minghai QIN, Chunsheng LIU, Zhibin XIAO, Tianchan GUAN, Yuan GAO
Methods to tolerate programming and retention errors of crossbar memory arrays

Patent number: 11556616

Abstract: Systems and methods for reducing the impact of defects within a crossbar memory array when performing multiplication operations in which multiple control lines are concurrently selected are described. A group of memory cells within the crossbar memory array may be controlled by a local word line that is controlled by a local word line gating unit that may be configured to prevent the local word line from being biased to a selected word line voltage during an operation; the local word line may instead be set to a disabling voltage during the operation such that the memory cell currents through the group of memory cells are eliminated. If a defect has caused a short within one of the memory cells of the group of memory cells, then the local word line gating unit may be programmed to hold the local word line at the disabling voltage during multiplication operations.

Type: Grant

Filed: October 17, 2019

Date of Patent: January 17, 2023

Assignee: SanDisk Technologies LLC

Inventors: Minghai Qin, Pi-Feng Chiu, Wen Ma, Won Ho Choi
Method and system for video transcoding based on spatial or temporal importance

Patent number: 11528493

Abstract: Methods and apparatuses for video transcoding based on spatial or temporal importance include: in response to receiving an encoded video bitstream, decoding a picture from the encoded video bitstream; determining a first level of spatial importance for a first region of a background of the picture based on an image segmentation technique; applying to the first region a first resolution-enhancement technique associated with the first level of spatial importance for increasing resolution of the first region by a scaling factor, wherein the first resolution-enhancement technique is selected from a set of resolution-enhancement techniques having different computational complexity levels; and encoding the first region using a video coding standard.

Type: Grant

Filed: May 6, 2020

Date of Patent: December 13, 2022

Assignee: Alibaba Group Holding Limited

Inventors: Tae Meon Bae, Shaolin Xie, Minghai Qin, Yen-kuang Chen, Guanlin Wu, Qinggang Zhou
Apparatuses and methods for map reduce

Patent number: 11500811

Abstract: The present disclosure relates to a method and an apparatus for map reduce. In some embodiments, an exemplary processing unit includes: a 2-dimensional (2D) processing element (PE) array comprising a plurality of PEs, each PE comprising a first input and a second input, the first inputs of the PEs in a linear array in a first dimension of the PE array being connected in series and the second inputs of the PEs in a linear array in a second dimension of the PE array being connected in parallel, each PE being configured to perform an operation on data from the first input or second input; and a plurality of reduce tree units, each reduce tree unit being coupled with the PEs in a linear array in the first dimension or the second dimension of the PE array and configured to perform a first reduction operation.

Type: Grant

Filed: June 12, 2020

Date of Patent: November 15, 2022

Assignee: Alibaba Group Holding Limited

Inventors: Yuanwei Fang, Tae Meon Bae, Sicheng Li, Minghai Qin, Guanlin Wu, Yen-kuang Chen
Scene aware video content encoding

Patent number: 11470327

Abstract: Scene aware video content encoding techniques can determine if video content is a given content type and is one of one or more given titles that include one or more given scenes. The one or more given scenes of the video content of the given type and given one of the titles can be encoded using corresponding scene-specific encoding parameter values, and the non-given scenes can be encoded using one or more general encoding parameter values. The one or more given titles can be selected based on a rate of streaming of various video content titles of the given type.

Type: Grant

Filed: March 30, 2020

Date of Patent: October 11, 2022

Assignee: Alibaba Group Holding Limited

Inventors: Tae Meon Bae, Minghai Qin, Guanlin Wu, Yen-kuang Chen, Qinggang Zhou, Shaolin Xie
METHOD AND APPARATUS FOR EFFICIENT APPLICATION SCREEN COMPRESSION

Publication number: 20220301523

Abstract: A method of encoding an application screen comprises partitioning graphic data into a plurality of graphic layers and classifying each of the plurality of graphic layers as either a screen content (SC) or a non-screen content (non-SC) layer. Further, the method comprises rendering and encoding the one or more SC layers using a first codec and the one or more non-SC layers using a second codec.

Type: Application

Filed: June 6, 2022

Publication date: September 22, 2022

Inventors: Tae Meon BAE, Sicheng LI, Yen-kuang CHEN, Guanlin WU, Shaolin XIE, Minghai QIN, Qinggang ZHOU
Method and system for executing neural network

Patent number: 11443163

Abstract: The present disclosure provides methods and systems for executing a neural network. The method includes: receiving a plurality of input vectors of input data; generating, among the plurality of input vectors, an estimation value associated with a subset of an input vector based on a weight vector of the activation function; determining whether the estimation value associated with the subset of the input vector satisfies a threshold condition; and determining an output of the activation function based on the estimation value.

Type: Grant

Filed: October 11, 2019

Date of Patent: September 13, 2022

Assignee: Alibaba Group Holding Limited

Inventors: Minghai Qin, Chunsheng Liu
