Patents by Inventor Layali Rashid

Layali Rashid has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Accelerator for dense and sparse matrix computations

Patent number: 11562047

Abstract: A method of increasing computer hardware efficiency of a matrix computation. The method comprises receiving at a computer processing device, digital signals encoding one or more operations of the matrix computation, each operation including one or more operands. The method further comprises, responsive to determining, by a sparse data check device of the computer processing machine, that an operation of the matrix computation includes all dense operands, forwarding the operation to a dense computation device of the computer processing machine configured to perform the operation of the matrix computation based on the dense operands. The method further comprises, responsive to determining, by the sparse data check device, that an operation of the matrix computation includes one or more sparse operands, forwarding the operation to a sparse computation device configured to perform the operation of the matrix computation.

Type: Grant

Filed: April 29, 2020

Date of Patent: January 24, 2023

Assignee: Microsoft Technology Licensing, LLC

Inventors: Layali Rashid, Saurabh M. Kulkarni, Marc Tremblay
Executing large artificial intelligence models on memory-constrained devices

Patent number: 11520592

Abstract: Methods, systems, apparatuses, and computer program products are described herein that enable execution of a large AI model on a memory-constrained target device that is communicatively connected to a parameter server, which stores a master copy of the AI model. The AI model may be dissected into smaller portions (e.g., layers or sub-layers), and each portion may be executed as efficiently as possible on the target device. After execution of one portion of the AI model is finished, another portion of the AI model may be downloaded and executed at the target device. To improve efficiency, the input samples may be divided into microbatches, and a plurality of microbatches executing in sequential order may form a minibatch. The size of the group of microbatches or minibatch can be manually or automatically adjusted to reduce the communication overhead.

Type: Grant

Filed: September 20, 2019

Date of Patent: December 6, 2022

Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC

Inventors: Bharadwaj Pudipeddi, Marc Tremblay, Gautham Popuri, Layali Rashid, Tiyasa Mitra, Mohit Mittal, Maral Mesmakhosroshahi
EXECUTING LARGE ARTIFICIAL INTELLIGENCE MODELS ON MEMORY-CONSTRAINED DEVICES

Publication number: 20220276871

Abstract: Methods, systems, apparatuses, and computer program products are described herein that enable execution of a large AI model on a memory-constrained target device that is communicatively connected to a parameter server, which stores a master copy of the AI model. The AI model may be dissected into smaller portions (e.g., layers or sub-layers), and each portion may be executed as efficiently as possible on the target device. After execution of one portion of the AI model is finished, another portion of the AI model may be downloaded and executed at the target device. To improve efficiency, the input samples may be divided into microbatches, and a plurality of microbatches executing in sequential order may form a minibatch. The size of the group of microbatches or minibatch can be manually or automatically adjusted to reduce the communication overhead.

Type: Application

Filed: May 18, 2022

Publication date: September 1, 2022

Inventors: Bharadwaj PUDIPEDDI, Marc TREMBLAY, Gautham POPURI, Layali RASHID, Tiyasa MITRA, Mohit MITTAL, Maral MESMAKHOSROSHAHI
ACCELERATOR FOR DENSE AND SPARSE MATRIX COMPUTATIONS

Publication number: 20210240797

Abstract: A method of increasing computer hardware efficiency of a matrix computation. The method comprises receiving at a computer processing device, digital signals encoding one or more operations of the matrix computation, each operation including one or more operands. The method further comprises, responsive to determining, by a sparse data check device of the computer processing machine, that an operation of the matrix computation includes all dense operands, forwarding the operation to a dense computation device of the computer processing machine configured to perform the operation of the matrix computation based on the dense operands. The method further comprises, responsive to determining, by the sparse data check device, that an operation of the matrix computation includes one or more sparse operands, forwarding the operation to a sparse computation device configured to perform the operation of the matrix computation.

Type: Application

Filed: April 29, 2020

Publication date: August 5, 2021

Applicant: Microsoft Technology Licensing, LLC

Inventors: Layali RASHID, Saurabh M. KULKARNI, Marc TREMBLAY
EXECUTING LARGE ARTIFICIAL INTELLIGENCE MODELS ON MEMORY-CONSTRAINED DEVICES

Publication number: 20210019151

Abstract: Methods, systems, apparatuses, and computer program products are described herein that enable execution of a large AI model on a memory-constrained target device that is communicatively connected to a parameter server, which stores a master copy of the AI model. The AI model may be dissected into smaller portions (e.g., layers or sub-layers), and each portion may be executed as efficiently as possible on the target device. After execution of one portion of the AI model is finished, another portion of the AI model may be downloaded and executed at the target device. To improve efficiency, the input samples may be divided into microbatches, and a plurality of microbatches executing in sequential order may form a minibatch. The size of the group of microbatches or minibatch can be manually or automatically adjusted to reduce the communication overhead.

Type: Application

Filed: September 20, 2019

Publication date: January 21, 2021

Inventors: Bharadwaj Pudipeddi, Marc Tremblay, Gautham Popuri, Layali Rashid, Tiyasa Mitra, III, Mohit Mittal, Maral Mesmakhosroshahi

Accelerator for dense and sparse matrix computations

Executing large artificial intelligence models on memory-constrained devices

EXECUTING LARGE ARTIFICIAL INTELLIGENCE MODELS ON MEMORY-CONSTRAINED DEVICES

ACCELERATOR FOR DENSE AND SPARSE MATRIX COMPUTATIONS

EXECUTING LARGE ARTIFICIAL INTELLIGENCE MODELS ON MEMORY-CONSTRAINED DEVICES