Patents by Inventor Dana Michelle Vantrease

Dana Michelle Vantrease has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Runtime augmentation of engine instructions

Patent number: 10664282

Abstract: Methods for repeated execution of program code by an execution engine are provided. In order to execute large programs, the instruction buffer of an execution engine may be refilled may times with program code to complete one execution of the program. At completion of program execution, the program code needed to begin re-execution of the program is no longer in the instruction buffer. A runtime driver program can load instructions into the instruction buffer, or can cause instructions to be loaded. Once the instructions are loaded, the execution engine may be able to re-execute the instructions without needing further assistance from the runtime driver.

Type: Grant

Filed: February 4, 2019

Date of Patent: May 26, 2020

Assignee: Amazon Technologies, Inc.

Inventors: Ilya Minkin, Ron Diamant, Mohammad El-Shabani, Dana Michelle Vantrease
PERFORMING CONCURRENT OPERATIONS IN A PROCESSING ELEMENT

Publication number: 20200050582

Abstract: A processing element (PE) of a systolic array can perform neural networks computations on two or more data elements of an input data set using the same weight. Thus, two or more output data elements corresponding to an output data set may be generated. Based on the size of the input data set and an input data type, the systolic array can process a single data element or multiple data elements in parallel.

Type: Application

Filed: October 15, 2019

Publication date: February 13, 2020

Inventors: Dana Michelle Vantrease, Ron Diamant
Performing concurrent operations in a processing element

Patent number: 10459876

Abstract: A processing element (PE) of a systolic array can perform neural networks computations in parallel on two or more sequential data elements of an input data set using the same weight. Thus, two or more output data elements corresponding to an output data set may be generated in parallel. Based on the size of the input data set and an input data type, the systolic array can process a single data element or multiple data elements in parallel.

Type: Grant

Filed: January 31, 2018

Date of Patent: October 29, 2019

Assignee: Amazon Technologies, Inc.

Inventors: Dana Michelle Vantrease, Ron Diamant
PROCESSING FOR MULTIPLE INPUT DATA SETS

Publication number: 20190294968

Abstract: Disclosed herein are techniques for performing multi-layer neural network processing for multiple contexts. In one embodiment, a computing engine is set in a first configuration to implement a second layer of a neural network and to process first data related to a first context to generate first context second layer output. The computing engine can be switched from the first configuration to a second configuration to implement a first layer of the neural network. The computing engine can be used to process second data related to a second context to generate second context first layer output. The computing engine can be set to a third configuration to implement a third layer of the neural network to process the first context second layer output and the second context first layer output to generate a first processing result of the first context and a second processing result of the second context.

Type: Application

Filed: March 22, 2018

Publication date: September 26, 2019

Inventors: Dana Michelle Vantrease, Ron Diamant, Thomas A. Volpe, Randy Huang
ACCELERATED QUANTIZED MULTIPLY-AND-ADD OPERATIONS

Publication number: 20190294413

Abstract: Disclosed herein are techniques for accelerating convolution operations or other matrix multiplications in applications such as neural network. A computer-implemented method includes receiving low-precision inputs for a convolution operation from a storage device, and subtracting a low-precision value representing a high-precision zero value from the low-precision inputs to generate difference values, where the low-precision inputs are asymmetrically quantized from high-precision inputs. The method also includes performing multiplication and summation operations on the difference values to generate a sum of products, and generating a high-precision output by scaling the sum of products with a scaling factor.

Type: Application

Filed: March 23, 2018

Publication date: September 26, 2019

Inventors: Dana Michelle Vantrease, Randy Huang, Ron Diamant, Thomas Elmer, Sundeep Amirineni
SCHEDULING NETWORK COMPUTATIONS

Publication number: 20190294959

Abstract: Disclosed herein are techniques for scheduling and executing multi-layer neural network computations for multiple contexts. In one embodiment, a method comprises determining a set of computation tasks to be executed, the set of computation tasks including a first computation task and a second computation task, as well as a third computation task and a fourth computation task to provide input data for the first and second computation tasks; determining a first execution batch comprising the first and second computation tasks; determining a second execution batch comprising at least the third computation task to be executed before the first execution batch; determining whether to include the fourth computation task in the second execution batch based on whether the memory device has sufficient capacity to hold input data and output data of both of the third and fourth computation; executing the second execution batch followed by the first execution batch.

Type: Application

Filed: March 22, 2018

Publication date: September 26, 2019

Inventors: Dana Michelle Vantrease, Ron Diamant, Thomas A. Volpe, Randy Huang
Reducing traffic in hierarchical cache interconnects

Patent number: 10402329

Abstract: A congestion controller may be configured to control traffic on an interconnect between a higher level cache and a lower level cache. The lower level cache may also be coupled to a main memory. The congestion controller may be configured to reduce congestion on the interconnect by blocking transactions that include writing of data to the lower level cache if the data has not been modified relative to a copy of the data in the main memory. The congestion controller may also be configured to control the traffic by blocking certain transactions in a controlled manner for traffic shaping or for performance features.

Type: Grant

Filed: September 26, 2017

Date of Patent: September 3, 2019

Assignee: Amazon Technologies, Inc.

Inventor: Dana Michelle Vantrease
Performing concurrent operations in a processing element

Publication number: 20190236049

Abstract: A processing element (PE) of a systolic array can perform neural networks computations in parallel on two or more sequential data elements of an input data set using the same weight. Thus, two or more output data elements corresponding to an output data set may be generated in parallel. Based on the size of the input data set and an input data type, the systolic array can process a single data element or multiple data elements in parallel.

Type: Application

Filed: January 31, 2018

Publication date: August 1, 2019

Inventors: Dana Michelle Vantrease, Ron Diamant

prev 1 2