Patents by Inventor Minghai Qin
Minghai Qin has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Publication number: 20240095540Abstract: Methods and apparatus for processing data in a distributed inference scheme based on sparse inputs. An example method includes receiving an input at a first node. A first sparsified input is generated for a second node based on a set of features associated with the second node, which are identified based on a weight mask having non-zero values for weights associated with features upon which processing by the second node depends and zeroed values for weights associated with other features. The first sparsified input is transmitted to the second node for generating an output of the second node. A second sparsified input is received from the second node and combined into a combined input. The combined input is processed into an output of the first node. The neural network is configured to generate an inference based on processing the outputs of the first node and the second node.Type: ApplicationFiled: June 28, 2023Publication date: March 21, 2024Applicant: Western Digital Technologies, Inc.Inventors: Minghai QIN, Jaco HOFMANN, Chao SUN, Qingbo WANG, Dejan VUCINIC
-
Patent number: 11792408Abstract: Transcoding bitrate prediction techniques can include receiving a first encoded content. A transcoder bitrate can be estimated based on regression over a video quality estimator of the first encoded content and a second encoded content. The estimated transcoder bitrate can be utilized to transcoding the first encoded content into the second encoded.Type: GrantFiled: March 30, 2020Date of Patent: October 17, 2023Assignee: Alibaba Group Holding LimitedInventors: Tae Meon Bae, Minghai Qin, Yen-kuang Chen, Guanlin Wu, Sicheng Li
-
Patent number: 11785261Abstract: The present disclosure relates to a method for compensating an image. The method comprises estimating transform coefficients of a frequency component for a first image based on the first image, performing a dot multiplication operation between the estimated transform coefficients and a basis function associated with the frequency component to generate a compensation image, and combining the first image and the compensation image to generate a combined image.Type: GrantFiled: May 24, 2022Date of Patent: October 10, 2023Assignee: Alibaba Group Holding LimitedInventors: Zihao Liu, Sicheng Li, Minghai Qin, Yen-Kuang Chen
-
Publication number: 20230306257Abstract: Neural network (NN) model training techniques can include computing activations in a forward pass using a sparse weight matrix that is transpose invariant. The neural network (NN) model training techniques can further include computing activation gradients and weight gradients in a backward pass using the sparse weight matrix.Type: ApplicationFiled: July 15, 2022Publication date: September 28, 2023Inventors: Fei SUN, Minghai QIN, Haoran LI, Guocai ZHU, Yuan GAO, Guyue HUANG, Yawen ZHANG
-
Patent number: 11741188Abstract: An innovative low-bit-width device may include a first digital-to-analog converter (DAC), a second DAC, a plurality of non-volatile memory (NVM) weight arrays, one or more analog-to-digital converters (ADCs), and a neural circuit. The first DAC is configured to convert a digital input signal into an analog input signal. The second DAC is configured to convert a digital previous hidden state (PHS) signal into an analog PHS signal. NVM weight arrays are configured to compute vector matrix multiplication (VMM) arrays based on the analog input signal and the analog PHS signal. The NVM weight arrays are coupled to the first DAC and the second DAC. The one or more ADCs are coupled to the plurality of NVM weight arrays and are configured to convert the VMM arrays into digital VMM values. The neural circuit is configured to process the digital VMM values into a new hidden state.Type: GrantFiled: July 8, 2021Date of Patent: August 29, 2023Assignee: WESTERN DIGITAL TECHNOLOGIES, INC.Inventors: Wen Ma, Pi-Feng Chiu, Minghai Qin, Won Ho Choi, Martin Lueker-Boden
-
Publication number: 20230199192Abstract: Scene aware video content encoding techniques can determine if video content is a given content type and is one of one or more given titles that include one or more given scenes. The one or more given scenes of the video content of the given type and given one of the titles can be encoded using corresponding scenes specific encoding parameter values, and the non-given scenes can be encoded using one or more general encoding parameter values. The one or more given titles can be selected based on a rate of streaming of various video content titles of the given type.Type: ApplicationFiled: October 11, 2022Publication date: June 22, 2023Inventors: Tae Meon BAE, Minghai QIN, Guanlin WU, Yen-kuang CHEN, Qinggang ZHOU, Shaolin XIE
-
Patent number: 11663077Abstract: Systems and methods for implementing data protection techniques with symbol-based variable node updates for binary low-density parity-check (LDPC) codes are described. A semiconductor memory (e.g., a NAND flash memory) may read a set of data from a set of memory cells, determine a set of data state probabilities for the set of data based on sensed threshold voltages for the set of memory cells, generate a valid codeword for the set of data using an iterative LDPC decoding with symbol-based variable node updates and the set of data state probabilities, and store the valid codeword within the semiconductor memory or transfer the valid codeword from the semiconductor memory. The iterative LDPC decoding may utilize a message passing algorithm in which outgoing messages from a plurality of multi-variable nodes are generated using incoming messages (e.g., log-likelihood ratios or L-values) from a plurality of check nodes.Type: GrantFiled: June 23, 2020Date of Patent: May 30, 2023Assignee: Western Digital Technologies, Inc.Inventor: Minghai Qin
-
Publication number: 20230109617Abstract: A system for pruning weights during training of a neural network includes a configurable pruning hardware unit that is configured to: receive, from a neural network training engine, inputs including the weights, gradients associated with the weights, and a prune indicator per weight; select unpruned weights for pruning; prune the unpruned weights selected for pruning; update the prune indicator per weight for the weights that are selected and pruned; and provide the updated prune indicator to the training engine for the next iteration or epoch. The pruning hardware unit can be configured to perform incremental pruning or non-incremental pruning.Type: ApplicationFiled: December 9, 2022Publication date: April 6, 2023Inventors: Tianchan GUAN, Yuan GAO, Hongzhong ZHENG, Minghai QIN, Chunsheng LIU, Dimin NIU
-
Patent number: 11586601Abstract: The present disclosure relates to a method and an apparatus for representation of a sparse matrix in a neural network. In some embodiments, an exemplary operation unit includes a buffer for storing a representation of a sparse matrix in a neural network, a sparse engine communicatively coupled with the buffer, and a processing array communicatively coupled with the sparse engine. The sparse engine includes circuitry to: read the representation of the sparse matrix from the buffer, the representation comprising a first level bitmap, a second level bitmap, and an element array; decompress the first level bitmap to determine whether a block of the sparse matrix comprises a non-zero element; and in response to the block comprising a non-zero element, decompress the second level bitmap using the element array to obtain the block of the sparse matrix. The processing array includes circuitry to execute the neural network with the sparse matrix.Type: GrantFiled: February 5, 2020Date of Patent: February 21, 2023Assignee: Alibaba Group Holding LimitedInventors: Zhibin Xiao, Xiaoxin Fan, Minghai Qin
-
Patent number: 11582478Abstract: The present disclosure relates to a computer-implemented method for processing video data. The method comprises receiving a user input corresponding to a first picture of the video data, generating, based on the user input, prediction information of the first picture with respect a reference picture of the video data, and encoding the first picture using the prediction information.Type: GrantFiled: September 8, 2020Date of Patent: February 14, 2023Assignee: Alibaba Group Holding LimitedInventors: Yuhao Wang, Minghai Qin, Jian Lou, Yen-Kuang Chen
-
Patent number: 11570477Abstract: Methods and systems are provided for implementing preprocessing operations and augmentation operations upon image datasets transformed to frequency domain representations, including decoding images of an image dataset to generate a frequency domain representation of the image dataset; performing a resizing operation based on resizing factors on the image dataset in a frequency domain representation; performing a reshaping operation based on reshaping factors on the image dataset in a frequency domain representation; and performing a cropping operation on the image dataset in a frequency domain representation. The methods and systems may further include performing an augmentation operation on the image dataset in a frequency domain representation. Methods and systems of the present disclosure may free learning models from computational overhead caused by transforming image datasets into frequency domain representations.Type: GrantFiled: December 31, 2019Date of Patent: January 31, 2023Assignee: Alibaba Group Holding LimitedInventors: Kai Xu, Fei Sun, Minghai Qin, Yen-kuang Chen
-
Patent number: 11568252Abstract: A neural network, trained on a plurality of random size data samples, can receive a plurality of inference data samples including samples of different sizes. The neural network can generate feature maps of the plurality of inference data samples. Pooling can be utilized to generate feature maps having a fixed size. The fixed size feature maps can be utilized to generate an indication of a class for each of the plurality of inference data samples.Type: GrantFiled: June 29, 2020Date of Patent: January 31, 2023Assignee: ALIBABA GROUP HOLDING LIMITEDInventors: Minghai Qin, Yen-Kuang Chen, Zhenzhen Wang, Fei Sun
-
Patent number: 11568021Abstract: Vector-vector multiplication or matrix-matrix multiplication computation on computing systems can include computing a first portion of a vector-vector multiplication product based on a most-significant-bit set of a first vector and a most-significant-bit set of a second vector, and determining if the first portion of the vector-vector multiplication product is less than a threshold. If the first partial vector-vector multiplication product is not less than the threshold, a remaining portion of the vector-vector multiplication product can be computed, and a rectified linear vector-vector multiplication product can be determined for the sum of the first portion of the vector-vector multiplication product and the remaining portion of the vector-vector multiplication product.Type: GrantFiled: February 21, 2020Date of Patent: January 31, 2023Assignee: Alibaba Group Holding LimitedInventors: Minghai Qin, Zhibin Xiao, Chunsheng Liu
-
Publication number: 20230021511Abstract: Disclosed are systems and methods that determine whether instances of data (e.g., forward activations, backward derivatives of activations) that are used to train deep neural networks are to be stored on-chip or off-chip. The disclosed systems and methods are also used to prune the data (discard or delete selected instances of data). A system includes a hierarchical arrangement of on-chip and off-chip memories, and also includes a hierarchical arrangement of data selector devices that are used to decide whether to discard data and where in the system the data is to be discarded.Type: ApplicationFiled: October 4, 2022Publication date: January 26, 2023Inventors: Minghai QIN, Chunsheng LIU, Zhibin XIAO, Tianchan GUAN, Yuan GAO
-
Patent number: 11556616Abstract: Systems and methods for reducing the impact of defects within a crossbar memory array when performing multiplication operations in which multiple control lines are concurrently selected are described. A group of memory cells within the crossbar memory array may be controlled by a local word line that is controlled by a local word line gating unit that may be configured to prevent the local word line from being biased to a selected word line voltage during an operation; the local word line may instead be set to a disabling voltage during the operation such that the memory cell currents through the group of memory cells are eliminated. If a defect has caused a short within one of the memory cells of the group of memory cells, then the local word line gating unit may be programmed to hold the local word line at the disabling voltage during multiplication operations.Type: GrantFiled: October 17, 2019Date of Patent: January 17, 2023Assignee: SanDisk Technologies LLCInventors: Minghai Qin, Pi-Feng Chiu, Wen Ma, Won Ho Choi
-
Patent number: 11528493Abstract: Methods and apparatuses for video transcoding based on spatial or temporal importance include: in response to receiving an encoded video bitstream, decoding a picture from the encoded video bitstream; determining a first level of spatial importance for a first region of a background of the picture based on an image segmentation technique; applying to the first region a first resolution-enhancement technique associated with the first level of spatial importance for increasing resolution of the first region by a scaling factor, wherein the first resolution-enhancement technique is selected from a set of resolution-enhancement techniques having different computational complexity levels; and encoding the first region using a video coding standard.Type: GrantFiled: May 6, 2020Date of Patent: December 13, 2022Assignee: Alibaba Group Holding LimitedInventors: Tae Meon Bae, Shaolin Xie, Minghai Qin, Yen-kuang Chen, Guanlin Wu, Qinggang Zhou
-
Patent number: 11500811Abstract: The present disclosure relates to a method and an apparatus for map reduce. In some embodiments, an exemplary processing unit includes: a 2-dimensional (2D) processing element (PE) array comprising a plurality of PEs, each PE comprising a first input and a second input, the first inputs of the PEs in a linear array in a first dimension of the PE array being connected in series and the second inputs of the PEs in a linear array in a second dimension of the PE array being connected in parallel, each PE being configured to perform an operation on data from the first input or second input; and a plurality of reduce tree units, each reduce tree unit being coupled with the PEs in a linear array in the first dimension or the second dimension of the PE array and configured to perform a first reduction operation.Type: GrantFiled: June 12, 2020Date of Patent: November 15, 2022Assignee: Alibaba Group Holding LimitedInventors: Yuanwei Fang, Tae Meon Bae, Sicheng Li, Minghai Qin, Guanlin Wu, Yen-kuang Chen
-
Patent number: 11470327Abstract: Scene aware video content encoding techniques can determine if video content is a given content type and is one of one or more given titles that include one or more given scenes. The one or more given scenes of the video content of the given type and given one of the titles can be encoded using corresponding scenes specific encoding parameter values, and the non-given scenes can be encoded using one or more general encoding parameter values. The one or more given titles can be selected based on a rate of streaming of various video content titles of the given type.Type: GrantFiled: March 30, 2020Date of Patent: October 11, 2022Assignee: Alibaba Group Holding LimitedInventors: Tae Meon Bae, Minghai Qin, Guanlin Wu, Yen-kuang Chen, Qinggang Zhou, Shaolin Xie
-
Publication number: 20220301523Abstract: A method for of encoding an application screen comprises partitioning graphic data into a plurality of graphic layers and classifying each of the plurality of graphic layers as either a screen content (SC) or a non-screen content (non-SC) layer. The method further comprises classifying each of the plurality of graphic layers as either a screen content (SC) or a non-screen content (non-SC) layer. Further, the method comprises rendering and encoding the one or more SC layers using a first codec and the one or more non-SC layers using a second codec.Type: ApplicationFiled: June 6, 2022Publication date: September 22, 2022Inventors: Tae Meon BAE, Sicheng LI, Yen-kuang CHEN, Guanlin WU, Shaolin XIE, Minghai QIN, Qinggang ZHOU
-
Patent number: 11443163Abstract: The present disclosure provides methods and systems for executing a neural network. The method includes: receiving a plurality of input vectors of input data; generating, among the plurality of input vectors, an estimation value associated with a subset of an input vector based on a weight vector of the activation function; determining whether the estimation value associated with the subset of the input vector satisfies a threshold condition; and determining an output of the activation function based on the estimation value.Type: GrantFiled: October 11, 2019Date of Patent: September 13, 2022Assignee: Alibaba Group Holding LimitedInventors: Minghai Qin, Chunsheng Liu