Patents by Inventor Maohua ZHU

Maohua ZHU has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Systems and methods for providing vector-wise sparsity in a neural network

Patent number: 12141699

Abstract: The present disclosure relates to systems and methods for providing vector-wise sparsity in neural networks. In some embodiments, an exemplary method for providing vector-wise sparsity in a neural network, comprises: dividing a matrix associated with the neural network into a plurality of vectors; selecting a first subset of non-zero elements from the plurality of vectors to form a pruned matrix; and outputting the pruned matrix for executing the neural network using the pruned matrix.

Type: Grant

Filed: July 23, 2020

Date of Patent: November 12, 2024

Assignee: Alibaba Group Holding Limited

Inventors: Maohua Zhu, Tao Zhang, Zhenyu Gu, Yuan Xie
Systems and methods for providing block-wise sparsity in a neural network

Patent number: 11755903

Abstract: The present disclosure relates to systems and methods for providing block-wise sparsity in neural networks. In one implementation, a system for providing block-wise sparsity in a neural network may include at least one memory storing instructions and at least one processor configured to execute the instructions to: divide a matrix of weights associated with a neural network into a plurality of blocks; extract non-zero elements from one or more of the plurality of blocks; re-encode the extracted non-zero elements as vectors with associated coordinates of the extracted non-zero elements within the one or more blocks; enforce input sparsity in the neural network corresponding to the associated coordinates; and execute the neural network using the vectors and the enforced input sparsity.

Type: Grant

Filed: July 24, 2019

Date of Patent: September 12, 2023

Assignee: Alibaba Group Holding Limited

Inventors: Maohua Zhu, Zhenyu Gu, Yuan Xie
SYSTEMS AND METHODS FOR PROVIDING VECTOR-WISE SPARSITY IN A NEURAL NETWORK

Publication number: 20210065005

Abstract: The present disclosure relates to systems and methods for providing vector-wise sparsity in neural networks. In some embodiments, an exemplary method for providing vector-wise sparsity in a neural network, comprises: dividing a matrix associated with the neural network into a plurality of vectors; selecting a first subset of non-zero elements from the plurality of vectors to form a pruned matrix; and outputting the pruned matrix for executing the neural network using the pruned matrix.

Type: Application

Filed: July 23, 2020

Publication date: March 4, 2021

Inventors: Maohua Zhu, Tao Zhang, Zhenyu Gu, Yuan Xie
SYSTEMS AND METHODS FOR PROVIDING BLOCK-WISE SPARSITY IN A NEURAL NETWORK

Publication number: 20210027156

Abstract: The present disclosure relates to systems and methods for providing block-wise sparsity in neural networks. In one implementation, a system for providing block-wise sparsity in a neural network may include at least one memory storing instructions and at least one processor configured to execute the instructions to: divide a matrix of weights associated with a neural network into a plurality of blocks; extract non-zero elements from one or more of the plurality of blocks; re-encode the extracted non-zero elements as vectors with associated coordinates of the extracted non-zero elements within the one or more blocks; enforce input sparsity in the neural network corresponding to the associated coordinates; and execute the neural network using the vectors and the enforced input sparsity.

Type: Application

Filed: July 24, 2019

Publication date: January 28, 2021

Inventors: Maohua ZHU, Zhenyu GU, Yuan XIE

Systems and methods for providing vector-wise sparsity in a neural network

Systems and methods for providing block-wise sparsity in a neural network

SYSTEMS AND METHODS FOR PROVIDING VECTOR-WISE SPARSITY IN A NEURAL NETWORK

SYSTEMS AND METHODS FOR PROVIDING BLOCK-WISE SPARSITY IN A NEURAL NETWORK