Patents by Inventor Maohua ZHU

Maohua ZHU has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 12141699
    Abstract: The present disclosure relates to systems and methods for providing vector-wise sparsity in neural networks. In some embodiments, an exemplary method for providing vector-wise sparsity in a neural network, comprises: dividing a matrix associated with the neural network into a plurality of vectors; selecting a first subset of non-zero elements from the plurality of vectors to form a pruned matrix; and outputting the pruned matrix for executing the neural network using the pruned matrix.
    Type: Grant
    Filed: July 23, 2020
    Date of Patent: November 12, 2024
    Assignee: Alibaba Group Holding Limited
    Inventors: Maohua Zhu, Tao Zhang, Zhenyu Gu, Yuan Xie
  • Patent number: 11755903
    Abstract: The present disclosure relates to systems and methods for providing block-wise sparsity in neural networks. In one implementation, a system for providing block-wise sparsity in a neural network may include at least one memory storing instructions and at least one processor configured to execute the instructions to: divide a matrix of weights associated with a neural network into a plurality of blocks; extract non-zero elements from one or more of the plurality of blocks; re-encode the extracted non-zero elements as vectors with associated coordinates of the extracted non-zero elements within the one or more blocks; enforce input sparsity in the neural network corresponding to the associated coordinates; and execute the neural network using the vectors and the enforced input sparsity.
    Type: Grant
    Filed: July 24, 2019
    Date of Patent: September 12, 2023
    Assignee: Alibaba Group Holding Limited
    Inventors: Maohua Zhu, Zhenyu Gu, Yuan Xie
  • Publication number: 20210065005
    Abstract: The present disclosure relates to systems and methods for providing vector-wise sparsity in neural networks. In some embodiments, an exemplary method for providing vector-wise sparsity in a neural network, comprises: dividing a matrix associated with the neural network into a plurality of vectors; selecting a first subset of non-zero elements from the plurality of vectors to form a pruned matrix; and outputting the pruned matrix for executing the neural network using the pruned matrix.
    Type: Application
    Filed: July 23, 2020
    Publication date: March 4, 2021
    Inventors: Maohua Zhu, Tao Zhang, Zhenyu Gu, Yuan Xie
  • Publication number: 20210027156
    Abstract: The present disclosure relates to systems and methods for providing block-wise sparsity in neural networks. In one implementation, a system for providing block-wise sparsity in a neural network may include at least one memory storing instructions and at least one processor configured to execute the instructions to: divide a matrix of weights associated with a neural network into a plurality of blocks; extract non-zero elements from one or more of the plurality of blocks; re-encode the extracted non-zero elements as vectors with associated coordinates of the extracted non-zero elements within the one or more blocks; enforce input sparsity in the neural network corresponding to the associated coordinates; and execute the neural network using the vectors and the enforced input sparsity.
    Type: Application
    Filed: July 24, 2019
    Publication date: January 28, 2021
    Inventors: Maohua ZHU, Zhenyu GU, Yuan XIE