Patents by Inventor Ao Ren

Ao Ren has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Method and system for processing a neural network

Patent number: 12073317

Abstract: Embodiments of the disclosure provide methods and systems for processing a neural network associated with an input matrix having a first number of elements. The method can include: dividing the input matrix into a plurality of vectors, each vector having a second number of elements; grouping the plurality of vectors into a first group of vectors and a second group of vectors; and pruning the first group of vectors and the second group of vectors.

Type: Grant

Filed: January 7, 2020

Date of Patent: August 27, 2024

Assignee: Alibaba Group Holding Limited

Inventors: Ao Ren, Tao Zhang, Yuhao Wang, Yuan Xie
Structured pruning for machine learning model

Patent number: 11816574

Abstract: An input weight pattern of a machine learning model may be received. The input weight pattern may be pruned to produce an output weight pattern based on a predetermined pruning algorithm. The pruning algorithm may include partitioning the input weight pattern into a plurality of sub-patterns, each row of the input weight pattern including sub-rows of a first number of sub-patterns, and each column of the input weight pattern including sub-columns of a second number of sub-patterns; and pruning sub-columns and sub-rows from the plurality of sub-patterns to achieve predetermined column and row sparsities respectively, with a constraint that at least one sub-row in each row of the input weight pattern is not pruned. The output weight pattern may further be compressed to produce a compact weight pattern. The compact weight pattern has lower memory and computational overheads as compared to the input weight pattern for the machine learning model.

Type: Grant

Filed: October 25, 2019

Date of Patent: November 14, 2023

Assignee: Alibaba Group Holding Limited

Inventors: Ao Ren, Yuhao Wang, Tao Zhang, Yuan Xie
ARTIFICIAL NEURAL NETWORK WITH SPARSE WEIGHTS

Publication number: 20210406654

Abstract: The accuracy of multiple stages within an artificial neural network is substantially improved while at the same time utilizing approximately the same number of floating-point operations per second (FLOPS) as prior art neural network stages by filtering the input with large sparse weight matrices and large sparse weight arrays.

Type: Application

Filed: June 29, 2020

Publication date: December 30, 2021

Inventors: Fei SUN, Ao REN
METHOD AND SYSTEM FOR PROCESSING A NEURAL NETWORK

Publication number: 20210209462

Abstract: Embodiments of the disclosure provide methods and systems for processing a neural network associated with an input matrix having a first number of elements. The method can include: dividing the input matrix into a plurality of vectors, each vector having a second number of elements; grouping the plurality of vectors into a first group of vectors and a second group of vectors; and pruning the first group of vectors and the second group of vectors.

Type: Application

Filed: January 7, 2020

Publication date: July 8, 2021

Inventors: Ao REN, Tao ZHANG, Yuhao WANG, Yuan XIE
Structured Pruning for Machine Learning Model

Publication number: 20210125071

Abstract: An input weight pattern of a machine learning model may be received. The input weight pattern may be pruned to produce an output weight pattern based on a predetermined pruning algorithm. The pruning algorithm may include partitioning the input weight pattern into a plurality of sub-patterns, each row of the input weight pattern including sub-rows of a first number of sub-patterns, and each column of the input weight pattern including sub-columns of a second number of sub-patterns; and pruning sub-columns and sub-rows from the plurality of sub-patterns to achieve predetermined column and row sparsities respectively, with a constraint that at least one sub-row in each row of the input weight pattern is not pruned. The output weight pattern may further be compressed to produce a compact weight pattern. The compact weight pattern has lower memory and computational overheads as compared to the input weight pattern for the machine learning model.

Type: Application

Filed: October 25, 2019

Publication date: April 29, 2021

Inventors: Ao Ren, Yuhao Wang, Tao Zhang, Yuan Xie

Method and system for processing a neural network

Structured pruning for machine learning model

ARTIFICIAL NEURAL NETWORK WITH SPARSE WEIGHTS

METHOD AND SYSTEM FOR PROCESSING A NEURAL NETWORK

Structured Pruning for Machine Learning Model