Patents by Inventor Steven Karl REINHARDT

Steven Karl REINHARDT has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Semi-programmable and reconfigurable co-accelerator for a deep neural network with normalization or non-linearity

Patent number: 12124391

Abstract: The present disclosure relates to devices for using a configurable stacked architecture for a fixed function datapath with an accelerator for accelerating an operation or a layer of a deep neural network (DNN). The stacked architecture may have a fixed function datapath that includes one or more configurable micro-execution units that execute a series of vector, scalar, reduction, broadcasting, and normalization operations for a DNN layer operation. The fixed function datapath may be customizable based on the DNN or the operation.

Type: Grant

Filed: July 5, 2023

Date of Patent: October 22, 2024

Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC

Inventors: Stephen Sangho Youn, Steven Karl Reinhardt, Jeremy Halden Fowers, Lok Chand Koppaka, Kalin Ovtcharov
Banked memory architecture for multiple parallel datapath channels in an accelerator

Patent number: 12079137

Abstract: The present disclosure relates to devices and methods for using a banked memory structure with accelerators. The devices and methods may segment and isolate dataflows in datapath and memory of the accelerator. The devices and methods may provide each data channel with its own register memory bank. The devices and methods may use a memory address decoder to place the local variables in the proper memory bank.

Type: Grant

Filed: May 30, 2023

Date of Patent: September 3, 2024

Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC

Inventors: Stephen Sangho Youn, Steven Karl Reinhardt, Hui Geng
Deep neural network accelerator with independent datapaths for simultaneous processing of different classes of operations

Patent number: 12073310

Abstract: Deep neural network accelerators (DNNs) with independent datapaths for simultaneous processing of different classes of operations and related methods are described. An example DNN accelerator includes an instruction dispatcher for receiving chains of instructions having both instructions for performing a first class of operations and a second class of operations corresponding to a neural network model. The DNN accelerator further includes a first datapath and a second datapath, where each is configured to execute at least one instruction chain locally before outputting any results. The instruction dispatcher is configured to forward instructions for performing the first class of operations to the first datapath and forward instructions for performing the second class of operations to the second datapath to overlap in time a performance of at least a subset of the first class of operations with a performance of at least a subset of the second class of operations.

Type: Grant

Filed: April 1, 2020

Date of Patent: August 27, 2024

Assignee: Microsoft Technology Licensing, LLC

Inventors: Stephen Sangho Youn, Lok Chand Koppaka, Steven Karl Reinhardt
SEMI-PROGRAMMABLE AND RECONFIGURABLE CO-ACCELERATOR FOR A DEEP NEURAL NETWORK WITH NORMALIZATION OR NON-LINEARITY

Publication number: 20230342320

Abstract: The present disclosure relates to devices for using a configurable stacked architecture for a fixed function datapath with an accelerator for accelerating an operation or a layer of a deep neural network (DNN). The stacked architecture may have a fixed function datapath that includes one or more configurable micro-execution units that execute a series of vector, scalar, reduction, broadcasting, and normalization operations for a DNN layer operation. The fixed function datapath may be customizable based on the DNN or the operation.

Type: Application

Filed: July 5, 2023

Publication date: October 26, 2023

Inventors: Stephen Sangho YOUN, Steven Karl REINHARDT, Jeremy Halden FOWERS, Lok Chand KOPPAKA, Kalin OVTCHAROV
BANKED MEMORY ARCHITECTURE FOR MULTIPLE PARALLEL DATAPATH CHANNELS IN AN ACCELERATOR

Publication number: 20230305967

Abstract: The present disclosure relates to devices and methods for using a banked memory structure with accelerators. The devices and methods may segment and isolate dataflows in datapath and memory of the accelerator. The devices and methods may provide each data channel with its own register memory bank. The devices and methods may use a memory address decoder to place the local variables in the proper memory bank.

Type: Application

Filed: May 30, 2023

Publication date: September 28, 2023

Inventors: Stephen Sangho YOUN, Steven Karl REINHARDT, Hui GENG
Semi-programmable and reconfigurable co-accelerator for a deep neural network with normalization or non-linearity

Patent number: 11734214

Abstract: The present disclosure relates to devices for using a configurable stacked architecture for a fixed function datapath with an accelerator for accelerating an operation or a layer of a deep neural network (DNN). The stacked architecture may have a fixed function datapath that includes one or more configurable micro-execution units that execute a series of vector, scalar, reduction, broadcasting, and normalization operations for a DNN layer operation. The fixed function datapath may be customizable based on the DNN or the operation.

Type: Grant

Filed: March 25, 2021

Date of Patent: August 22, 2023

Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC

Inventors: Stephen Sangho Youn, Steven Karl Reinhardt, Jeremy Halden Fowers, Lok Chand Koppaka, Kalin Ovtcharov
Banked memory architecture for multiple parallel datapath channels in an accelerator

Patent number: 11704251

Abstract: The present disclosure relates to devices and methods for using a banked memory structure with accelerators. The devices and methods may segment and isolate dataflows in datapath and memory of the accelerator. The devices and methods may provide each data channel with its own register memory bank. The devices and methods may use a memory address decoder to place the local variables in the proper memory bank.

Type: Grant

Filed: April 27, 2022

Date of Patent: July 18, 2023

Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC

Inventors: Stephen Sangho Youn, Steven Karl Reinhardt, Hui Geng
BANKED MEMORY ARCHITECTURE FOR MULTIPLE PARALLEL DATAPATH CHANNELS IN AN ACCELERATOR

Publication number: 20220253384

Abstract: The present disclosure relates to devices and methods for using a banked memory structure with accelerators. The devices and methods may segment and isolate dataflows in datapath and memory of the accelerator. The devices and methods may provide each data channel with its own register memory bank. The devices and methods may use a memory address decoder to place the local variables in the proper memory bank.

Type: Application

Filed: April 27, 2022

Publication date: August 11, 2022

Inventors: Stephen Sangho YOUN, Steven Karl REINHARDT, Hui GENG
SEMI-PROGRAMMABLE AND RECONFIGURABLE CO-ACCELERATOR FOR A DEEP NEURAL NETWORK WITH NORMALIZATION OR NON-LINEARITY

Publication number: 20220245083

Abstract: The present disclosure relates to devices for using a configurable stacked architecture for a fixed function datapath with an accelerator for accelerating an operation or a layer of a deep neural network (DNN). The stacked architecture may have a fixed function datapath that includes one or more configurable micro-execution units that execute a series of vector, scalar, reduction, broadcasting, and normalization operations for a DNN layer operation. The fixed function datapath may be customizable based on the DNN or the operation.

Type: Application

Filed: March 25, 2021

Publication date: August 4, 2022

Inventors: Stephen Sangho YOUN, Steven Karl REINHARDT, Jeremy Halden FOWERS, Lok Chand KOPPAKA, Kalin OVTCHAROV
Banked memory architecture for multiple parallel datapath channels in an accelerator

Patent number: 11347652

Abstract: The present disclosure relates to devices and methods for using a banked memory structure with accelerators. The devices and methods may segment and isolate dataflows in datapath and memory of the accelerator. The devices and methods may provide each data channel with its own register memory bank. The devices and methods may use a memory address decoder to place the local variables in the proper memory bank.

Type: Grant

Filed: November 13, 2020

Date of Patent: May 31, 2022

Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC

Inventors: Stephen Sangho Youn, Steven Karl Reinhardt, Hui Geng
BANKED MEMORY ARCHITECTURE FOR MULTIPLE PARALLEL DATAPATH CHANNELS IN AN ACCELERATOR

Publication number: 20220066943

Abstract: The present disclosure relates to devices and methods for using a banked memory structure with accelerators. The devices and methods may segment and isolate dataflows in datapath and memory of the accelerator. The devices and methods may provide each data channel with its own register memory bank. The devices and methods may use a memory address decoder to place the local variables in the proper memory bank.

Type: Application

Filed: November 13, 2020

Publication date: March 3, 2022

Inventors: Stephen Sangho YOUN, Steven Karl REINHARDT, Hui GENG
DEEP NEURAL NETWORK ACCELERATOR WITH INDEPENDENT DATAPATHS FOR SIMULTANEOUS PROCESSING OF DIFFERENT CLASSES OF OPERATIONS

Publication number: 20210312266

Abstract: Deep neural network accelerators (DNNs) with independent datapaths for simultaneous processing of different classes of operations and related methods are described. An example DNN accelerator includes an instruction dispatcher for receiving chains of instructions having both instructions for performing a first class of operations and a second class of operations corresponding to a neural network model. The DNN accelerator further includes a first datapath and a second datapath, where each is configured to execute at least one instruction chain locally before outputting any results. The instruction dispatcher is configured to forward instructions for performing the first class of operations to the first datapath and forward instructions for performing the second class of operations to the second datapath to overlap in time a performance of at least a subset of the first class of operations with a performance of at least a subset of the second class of operations.

Type: Application

Filed: April 1, 2020

Publication date: October 7, 2021

Applicant: Microsoft Technology Licensing, LLC

Inventors: Stephen Sangho YOUN, Lok Chand KOPPAKA, Steven Karl REINHARDT
Tensor-based hardware accelerator including a scalar-processing unit

Patent number: 10997116

Abstract: A computing system is described herein that expedites deep neural network (DNN) operations or other processing operations using a hardware accelerator. The hardware accelerator, in turn, includes a tensor-processing engine that works in conjunction with a scalar-processing unit (SPU). The tensor-processing engine handles various kinds of tensor-based operations required by the DNN, such as multiplying vectors by matrices, combining vectors with other vectors, transforming individual vectors, etc. The SPU performs scalar-based operations, such as forming the reciprocal of a scalar, generating the square root of a scalar, etc. According to one illustrative implementation, the computing system uses the same vector-based programmatic interface to interact with both the tensor-processing engine and the SPU.

Type: Grant

Filed: August 6, 2019

Date of Patent: May 4, 2021

Assignee: Microsoft Technology Licensing, LLC

Inventors: Steven Karl Reinhardt, Joseph Anthony Mayer, II, Dan Zhang
Tensor-Based Hardware Accelerator including a Scalar-Processing Unit

Publication number: 20210042260

Abstract: A computing system is described herein that expedites deep neural network (DNN) operations or other processing operations using a hardware accelerator. The hardware accelerator, in turn, includes a tensor-processing engine that works in conjunction with a scalar-processing unit (SPU). The tensor-processing engine handles various kinds of tensor-based operations required by the DNN, such as multiplying vectors by matrices, combining vectors with other vectors, transforming individual vectors, etc. The SPU performs scalar-based operations, such as forming the reciprocal of a scalar, generating the square root of a scalar, etc. According to one illustrative implementation, the computing system uses the same vector-based programmatic interface to interact with both the tensor-processing engine and the SPU.

Type: Application

Filed: August 6, 2019

Publication date: February 11, 2021

Inventors: Steven Karl REINHARDT, Joseph Anthony MAYER, II, Dan ZHANG
Tensor processor instruction set architecture

Patent number: 10372456

Abstract: A hardware accelerator having an efficient instruction set is disclosed. An apparatus may comprise logic configured to access a first and a second machine instruction. The second machine instruction may be missing a tensor operand needed to execute the second machine instruction. The logic may be further configured to execute the first machine instruction, resulting in a tensor. The logic may be further configured to execute the second machine instruction using the resultant tensor as the missing tensor operand.

Type: Grant

Filed: May 24, 2017

Date of Patent: August 6, 2019

Assignee: Microsoft Technology Licensing, LLC

Inventors: Jeremy Halden Fowers, Kalin Ovtcharov, Steven Karl Reinhardt, Eric Sen Chung, Ming Gang Liu
Tensor register files

Patent number: 10338925

Abstract: Tensor register files in a hardware accelerator are disclosed. An apparatus may comprise tensor operation calculators each configured to perform a type of tensor operation. The apparatus may also comprises tensor register files, each of which is associated with one of the tensor operation calculators. The apparatus may also comprises logic configured to store respective ones of the tensors in the plurality of tensor register files in accordance with the type of tensor operation to be performed on the respective tensors. The apparatus may also control read access to tensor register files based on a type of tensor operation that a machine instruction is to perform.

Type: Grant

Filed: May 24, 2017

Date of Patent: July 2, 2019

Assignee: Microsoft Technology Licensing, LLC

Inventors: Jeremy Halden Fowers, Steven Karl Reinhardt, Kalin Ovtcharov, Eric Sen Chung
Multifunction vector processor circuits

Patent number: 10331445

Abstract: A processor circuit is provided that includes an input terminal and an output terminal, a plurality of vector processor operation circuits, a selector circuit coupled to the input terminal, the output terminal, and each of the vector processor operation circuits, and a scheduler circuit adapted to control the selector circuit to configure a vector processing pipeline comprising zero, one or more of the vector processor operation circuits in any order between the input terminal and the output terminal.

Type: Grant

Filed: May 24, 2017

Date of Patent: June 25, 2019

Assignee: Microsoft Technology Licensing, LLC

Inventors: Jeremy Halden Fowers, Ming Gang Liu, Kalin Ovtcharov, Steven Karl Reinhardt, Eric Sen Chung
Tensor Register Files

Publication number: 20180341483

Abstract: Tensor register files in a hardware accelerator are disclosed. An apparatus may comprise tensor operation calculators each configured to perform a type of tensor operation. The apparatus may also comprises tensor register files, each of which is associated with one of the tensor operation calculators. The apparatus may also comprises logic configured to store respective ones of the tensors in the plurality of tensor register files in accordance with the type of tensor operation to be performed on the respective tensors. The apparatus may also control read access to tensor register files based on a type of tensor operation that a machine instruction is to perform.

Type: Application

Filed: May 24, 2017

Publication date: November 29, 2018

Applicant: Microsoft Technology Licensing, LLC

Inventors: Jeremy Halden FOWERS, Steven Karl REINHARDT, Kalin OVTCHAROV, Eric Sen CHUNG
MULTIFUNCTION VECTOR PROCESSOR CIRCUITS

Publication number: 20180341486

Abstract: A processor circuit is provided that includes an input terminal and an output terminal, a plurality of vector processor operation circuits, a selector circuit coupled to the input terminal, the output terminal, and each of the vector processor operation circuits, and a scheduler circuit adapted to control the selector circuit to configure a vector processing pipeline comprising zero, one or more of the vector processor operation circuits in any order between the input terminal and the output terminal.

Type: Application

Filed: May 24, 2017

Publication date: November 29, 2018

Applicant: Microsoft Technology Licensing, LLC

Inventors: Jeremy Halden FOWERS, Ming Gang LIU, Kalin OVTCHAROV, Steven Karl REINHARDT, Eric Sen CHUNG
Tensor Processor Instruction Set Architecture

Publication number: 20180341484

Abstract: A hardware accelerator having an efficient instruction set is disclosed. An apparatus may comprise logic configured to access a first and a second machine instruction. The second machine instruction may be missing a tensor operand needed to execute the second machine instruction. The logic may be further configured to execute the first machine instruction, resulting in a tensor. The logic may be further configured to execute the second machine instruction using the resultant tensor as the missing tensor operand.

Type: Application

Filed: May 24, 2017

Publication date: November 29, 2018

Applicant: Microsoft Technology Licensing, LLC

Inventors: Jeremy Halden FOWERS, Kalin OVTCHAROV, Steven Karl REINHARDT, Eric Sen CHUNG, Ming Gang LIU