Patents by Inventor Ao Ren

Ao Ren has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 12073317
    Abstract: Embodiments of the disclosure provide methods and systems for processing a neural network associated with an input matrix having a first number of elements. The method can include: dividing the input matrix into a plurality of vectors, each vector having a second number of elements; grouping the plurality of vectors into a first group of vectors and a second group of vectors; and pruning the first group of vectors and the second group of vectors.
    Type: Grant
    Filed: January 7, 2020
    Date of Patent: August 27, 2024
    Assignee: Alibaba Group Holding Limited
    Inventors: Ao Ren, Tao Zhang, Yuhao Wang, Yuan Xie
  • Patent number: 11816574
    Abstract: An input weight pattern of a machine learning model may be received. The input weight pattern may be pruned to produce an output weight pattern based on a predetermined pruning algorithm. The pruning algorithm may include partitioning the input weight pattern into a plurality of sub-patterns, each row of the input weight pattern including sub-rows of a first number of sub-patterns, and each column of the input weight pattern including sub-columns of a second number of sub-patterns; and pruning sub-columns and sub-rows from the plurality of sub-patterns to achieve predetermined column and row sparsities respectively, with a constraint that at least one sub-row in each row of the input weight pattern is not pruned. The output weight pattern may further be compressed to produce a compact weight pattern. The compact weight pattern has lower memory and computational overheads as compared to the input weight pattern for the machine learning model.
    Type: Grant
    Filed: October 25, 2019
    Date of Patent: November 14, 2023
    Assignee: Alibaba Group Holding Limited
    Inventors: Ao Ren, Yuhao Wang, Tao Zhang, Yuan Xie
  • Publication number: 20210406654
    Abstract: The accuracy of multiple stages within an artificial neural network is substantially improved while at the same time utilizing approximately the same number of floating-point operations per second (FLOPS) as prior art neural network stages by filtering the input with large sparse weight matrices and large sparse weight arrays.
    Type: Application
    Filed: June 29, 2020
    Publication date: December 30, 2021
    Inventors: Fei SUN, Ao REN
  • Publication number: 20210209462
    Abstract: Embodiments of the disclosure provide methods and systems for processing a neural network associated with an input matrix having a first number of elements. The method can include: dividing the input matrix into a plurality of vectors, each vector having a second number of elements; grouping the plurality of vectors into a first group of vectors and a second group of vectors; and pruning the first group of vectors and the second group of vectors.
    Type: Application
    Filed: January 7, 2020
    Publication date: July 8, 2021
    Inventors: Ao REN, Tao ZHANG, Yuhao WANG, Yuan XIE
  • Publication number: 20210125071
    Abstract: An input weight pattern of a machine learning model may be received. The input weight pattern may be pruned to produce an output weight pattern based on a predetermined pruning algorithm. The pruning algorithm may include partitioning the input weight pattern into a plurality of sub-patterns, each row of the input weight pattern including sub-rows of a first number of sub-patterns, and each column of the input weight pattern including sub-columns of a second number of sub-patterns; and pruning sub-columns and sub-rows from the plurality of sub-patterns to achieve predetermined column and row sparsities respectively, with a constraint that at least one sub-row in each row of the input weight pattern is not pruned. The output weight pattern may further be compressed to produce a compact weight pattern. The compact weight pattern has lower memory and computational overheads as compared to the input weight pattern for the machine learning model.
    Type: Application
    Filed: October 25, 2019
    Publication date: April 29, 2021
    Inventors: Ao Ren, Yuhao Wang, Tao Zhang, Yuan Xie