Patents by Inventor Elliot H. Mednick

Elliot H. Mednick has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Instruction subset implementation for low power operation

Patent number: 11880260

Abstract: A heterogeneous processor system includes a first processor implementing an instruction set architecture (ISA) including a set of ISA features and configured to support a first subset of the set of ISA features. The heterogeneous processor system also includes a second processor implementing the ISA including the set of ISA features and configured to support a second subset of the set of ISA features, wherein the first subset and the second subset of the set of ISA features are different from each other. When the first subset includes an entirety of the set of ISA features, the lower-feature second processor is configured to execute an instruction thread by consuming less power and with lower performance than the first processor.

Type: Grant

Filed: June 25, 2020

Date of Patent: January 23, 2024

Assignee: Advanced Micro Devices, Inc.

Inventors: Elliot H. Mednick, Edward McLellan
INPUT/OUTPUT STUTTER WAKE ALIGNMENT

Publication number: 20240004721

Abstract: An apparatus and method for efficiently performing power management for a multi-client computing system. In various implementations, a computing system includes multiple clients that process tasks corresponding to applications. The clients store generated requests of a particular type while processing tasks. A client receives an indication specifying that another client is having requests of the particular type being serviced. In response to receiving this indication, the client inserts a first urgency level in one or more stored requests of the particular type prior to sending the requests for servicing. When the client determines a particular time interval has elapsed, the client sends an indication to other clients specifying that requests of the particular type are being serviced. The client also inserts a second urgency level different from the first urgency level in one or more stored requests of the particular type prior to sending the requests for servicing.

Type: Application

Filed: June 29, 2022

Publication date: January 4, 2024

Inventors: Indrani Paul, Alexander J. Branover, Benjamin Tsien, Elliot H. Mednick
METHOD OF TASK TRANSITION BETWEEN HETEROGENOUS PROCESSORS

Publication number: 20230185623

Abstract: A method, system, and apparatus determines whether a task should be relocated from a first processor to a second processor by comparing performance metrics to associated thresholds or by using other indications. The task is relocated from the first processor to the second processor and executed on the second processor based on the com paring.

Type: Application

Filed: February 3, 2023

Publication date: June 15, 2023

Applicant: Advanced Micro Devices, Inc.

Inventors: Alexander J. Branover, Benjamin Tsien, Elliot H. Mednick
Method of task transition between heterogenous processors

Patent number: 11586472

Abstract: A method, system, and apparatus determines that one or more tasks should be relocated from a first processor to a second processor by comparing performance metrics to associated thresholds or by using other indications. To relocate the one or more tasks from the first processor to the second processor, the first processor is stalled and state information from the first processor is copied to the second processor. The second processor uses the state information and then services incoming tasks instead of the first processor.

Type: Grant

Filed: December 10, 2019

Date of Patent: February 21, 2023

Assignee: Advanced Micro Devices, Inc.

Inventors: Alexander J. Branover, Benjamin Tsien, Elliot H. Mednick
Word type/boundary propagation with memory performance applications

Patent number: 11347650

Abstract: A method includes, for each data value in a set of one or more data values, determining a boundary between a high order portion of the data value and a low order portion of the data value, storing the low order portion at a first memory location utilizing a low data fidelity storage scheme, and storing the high order portion at a second memory location utilizing a high data fidelity storage scheme for recording data at a higher data fidelity than the low data fidelity storage scheme.

Type: Grant

Filed: February 7, 2018

Date of Patent: May 31, 2022

Assignee: Advanced Micro Devices, Inc.

Inventors: David A. Roberts, Elliot H. Mednick
Dynamic, variable bit-width numerical precision on field-programmable gate arrays for machine learning tasks

Patent number: 11216250

Abstract: A method includes providing a set of one or more computational units implemented in a set of one or more field programmable gate array (FPGA) devices, where the set of one or more computational units is configured to generate a plurality of output values based on one or more input values. The method further includes, for each computational unit of the set of computational units, performing a first calculation in the computational unit using a first number representation, where a first output of the plurality of output values is based on the first calculation, determining a second number representation based on the first output value, and performing a second calculation in the computational unit using the second number representation, where a second output of the plurality of output values is based on the second calculation.

Type: Grant

Filed: December 6, 2017

Date of Patent: January 4, 2022

Assignee: Advanced Micro Devices, Inc.

Inventors: Nicholas P. Malaya, Elliot H. Mednick
METHOD OF TASK TRANSITION BETWEEN HETEROGENOUS PROCESSORS

Publication number: 20210173715

Abstract: A method, system, and apparatus determines that one or more tasks should be relocated from a first processor to a second processor by comparing performance metrics to associated thresholds or by using other indications. To relocate the one or more tasks from the first processor to the second processor, the first processor is stalled and state information from the first processor is copied to the second processor. The second processor uses the state information and then services incoming tasks instead of the first processor.

Type: Application

Filed: December 10, 2019

Publication date: June 10, 2021

Applicant: Advanced Micro Devices, Inc.

Inventors: Alexander J. Branover, Benjamin Tsien, Elliot H. Mednick
METHOD AND APPARATUS FOR SERVICING AN INTERRUPT

Publication number: 20200409762

Abstract: A method and apparatus for servicing a task in a computer system includes receiving the task and if the task is serviceable without waking the fabric, servicing the task by a first service stage entity. If the task is not serviceable by the first service stage entity, the task is serviced by a first processing unit without waking a second processing unit. If the task is not serviceable by the first processing unit, the task is serviced by the second processing unit.

Type: Application

Filed: June 26, 2019

Publication date: December 31, 2020

Applicant: Advanced Micro Devices, Inc.

Inventors: Alexander J. Branover, Elliot H. Mednick, Benjamin Tsien
INSTRUCTION SUBSET IMPLEMENTATION FOR LOW POWER OPERATION

Publication number: 20200393887

Abstract: A heterogeneous processor system includes a first processor implementing an instruction set architecture (ISA) including a set of ISA features and configured to support a first subset of the set of ISA features. The heterogeneous processor system also includes a second processor implementing the ISA including the set of ISA features and configured to support a second subset of the set of ISA features, wherein the first subset and the second subset of the set of ISA features are different from each other. When the first subset includes an entirety of the set of ISA features, the lower-feature second processor is configured to execute an instruction thread by consuming less power and with lower performance than the first processor.

Type: Application

Filed: June 25, 2020

Publication date: December 17, 2020

Inventors: Elliot H. MEDNICK, Edward MCLELLAN
Instruction subset implementation for low power operation

Patent number: 10698472

Abstract: A heterogeneous processor system includes a first processor implementing an instruction set architecture (ISA) including a set of ISA features and configured to support a first subset of the set of ISA features. The heterogeneous processor system also includes a second processor implementing the ISA including the set of ISA features and configured to support a second subset of the set of ISA features, wherein the first subset and the second subset of the set of ISA features are different from each other. When the first subset includes an entirety of the set of ISA features, the lower-feature second processor is configured to execute an instruction thread by consuming less power and with lower performance than the first processor.

Type: Grant

Filed: October 27, 2017

Date of Patent: June 30, 2020

Assignee: ADVANCED MICRO DEVICES, INC.

Inventors: Elliot H. Mednick, Edward McLellan
Preemptive cache writeback with transaction support

Patent number: 10452548

Abstract: A method of preemptive cache writeback includes transmitting, from a first cache controller of a first cache to a second cache controller of a second cache, an unused bandwidth message representing an unused bandwidth between the first cache and the second cache during a first cycle. During a second cycle, a cache line containing dirty data is preemptively written back from the second cache to the first cache based on the unused bandwidth message. Further, the cache line in the second cache is written over in response to a cache miss to the second cache.

Type: Grant

Filed: September 28, 2017

Date of Patent: October 22, 2019

Assignee: Advanced Micro Devices, Inc.

Inventors: David A. Roberts, Elliot H. Mednick
WORD TYPE/BOUNDARY PROPAGATION WITH MEMORY PERFORMANCE APPLICATIONS

Publication number: 20190243772

Abstract: A method includes, for each data value in a set of one or more data values, determining a boundary between a high order portion of the data value and a low order portion of the data value, storing the low order portion at a first memory location utilizing a low data fidelity storage scheme, and storing the high order portion at a second memory location utilizing a high data fidelity storage scheme for recording data at a higher data fidelity than the low data fidelity storage scheme.

Type: Application

Filed: February 7, 2018

Publication date: August 8, 2019

Inventors: David A. Roberts, Elliot H. Mednick
DYNAMIC, VARIABLE BIT-WIDTH NUMERICAL PRECISION ON FPGAS FOR MACHINE LEARNING TASKS

Publication number: 20190171420

Abstract: A method includes providing a set of one or more computational units implemented in a set of one or more field programmable gate array (FPGA) devices, where the set of one or more computational units is configured to generate a plurality of output values based on one or more input values. The method further includes, for each computational unit of the set of computational units, performing a first calculation in the computational unit using a first number representation, where a first output of the plurality of output values is based on the first calculation, determining a second number representation based on the first output value, and performing a second calculation in the computational unit using the second number representation, where a second output of the plurality of output values is based on the second calculation.

Type: Application

Filed: December 6, 2017

Publication date: June 6, 2019

Inventors: Nicholas P. Malaya, Elliot H. Mednick
Hybrid analog-digital floating point number representation and arithmetic

Patent number: 10289413

Abstract: A hybrid floating-point arithmetic processor includes a scheduler, a hybrid register file, and a hybrid arithmetic operation circuit. The scheduler has an input for receiving floating-point instructions, and an output for providing decoded register numbers in response to the floating-point instructions. The hybrid register file is coupled to the scheduler and contains circuitry for storing a plurality of floating-point numbers each represented by a digital sign bit, a digital exponent, and an analog mantissa. The hybrid register file has an output for providing selected ones of the plurality of floating-point numbers in response to the decoded register numbers. The hybrid arithmetic operation circuit is coupled to the scheduler and to the hybrid register file, for performing a hybrid arithmetic operation between two floating-point numbers selected by the scheduler and providing a hybrid result represented by a result digital sign bit, a result digital exponent, and a result analog mantissa.

Type: Grant

Filed: December 15, 2017

Date of Patent: May 14, 2019

Assignee: Advanced Micro Devices, Inc.

Inventors: David A. Roberts, Elliot H. Mednick, David John Cownie
INSTRUCTION SUBSET IMPLEMENTATION FOR LOW POWER OPERATION

Publication number: 20190129489

Abstract: A heterogeneous processor system includes a first processor implementing an instruction set architecture (ISA) including a set of ISA features and configured to support a first subset of the set of ISA features. The heterogeneous processor system also includes a second processor implementing the ISA including the set of ISA features and configured to support a second subset of the set of ISA features, wherein the first subset and the second subset of the set of ISA features are different from each other. When the first subset includes an entirety of the set of ISA features, the lower-feature second processor is configured to execute an instruction thread by consuming less power and with lower performance than the first processor.

Type: Application

Filed: October 27, 2017

Publication date: May 2, 2019

Inventors: Elliot H. MEDNICK, Edward MCLELLAN
HYBRID ANALOG-DIGITAL FLOATING POINT NUMBER REPRESENTATION AND ARITHMETIC

Publication number: 20190102175

Abstract: A hybrid floating-point arithmetic processor includes a scheduler, a hybrid register file, and a hybrid arithmetic operation circuit. The scheduler has an input for receiving floating-point instructions, and an output for providing decoded register numbers in response to the floating-point instructions. The hybrid register file is coupled to the scheduler and contains circuitry for storing a plurality of floating-point numbers each represented by a digital sign bit, a digital exponent, and an analog mantissa. The hybrid register file has an output for providing selected ones of the plurality of floating-point numbers in response to the decoded register numbers. The hybrid arithmetic operation circuit is coupled to the scheduler and to the hybrid register file, for performing a hybrid arithmetic operation between two floating-point numbers selected by the scheduler and providing a hybrid result represented by a result digital sign bit, a result digital exponent, and a result analog mantissa.

Type: Application

Filed: December 15, 2017

Publication date: April 4, 2019

Applicant: Advanced Micro Devices, Inc.

Inventors: David A. Roberts, Elliot H. Mednick, David John Cownie
PREEMPTIVE CACHE WRITEBACK WITH TRANSACTION SUPPORT

Publication number: 20190095330

Abstract: A method of preemptive cache writeback includes transmitting, from a first cache controller of a first cache to a second cache controller of a second cache, an unused bandwidth message representing an unused bandwidth between the first cache and the second cache during a first cycle. During a second cycle, a cache line containing dirty data is preemptively written back from the second cache to the first cache based on the unused bandwidth message. Further, the cache line in the second cache is written over in response to a cache miss to the second cache.

Type: Application

Filed: September 28, 2017

Publication date: March 28, 2019

Inventors: David A. ROBERTS, Elliot H. MEDNICK
Instruction set and micro-architecture supporting asynchronous memory access

Patent number: 10209991

Abstract: A system and method for reducing latencies of main memory data accesses are described. A non-blocking load (NBLD) instruction identifies an address of requested data and a subroutine. The subroutine includes instructions dependent on the requested data. A processing unit verifies that address translations are available for both the address and the subroutine. The processing unit continues processing instructions with no stalls caused by younger-in-program-order instructions waiting for the requested data. The non-blocking load unit performs a cache coherent data read request on behalf of the NBLD instruction and requests that the processing unit perform an asynchronous jump to the subroutine upon return of the requested data from lower-level memory.

Type: Grant

Filed: November 16, 2016

Date of Patent: February 19, 2019

Assignees: Advanced Micro Devices, Inc., ATI Technologies ULC

Inventors: Meenakshi Sundaram Bhaskaran, Elliot H. Mednick, David A. Roberts, Anthony Asaro, Amin Farmahini-Farahani
Virtual FPGA management and optimization system

Patent number: 10164639

Abstract: A macro scheduler includes a resource tracking module configured to update a database enumerating a plurality of macro components of a set of field programmable gate array (FPGA) devices, a communication interface configured to receive from a first client device a first design definition indicating one or more specified macro components for a design, resource allocation logic configured to allocate a first set of macro components for the design by allocating one of the plurality of macro components for each of the one or more specified macro components indicated in the first design definition, and configuration logic configured to implement the design in the set of FPGA devices by configuring the first set of allocated macro components according to the first design definition.

Type: Grant

Filed: November 14, 2017

Date of Patent: December 25, 2018

Assignee: Advanced Micro Devices, Inc.

Inventors: David A. Roberts, Andrew G. Kegel, Elliot H. Mednick
INSTRUCTION SET AND MICRO-ARCHITECTURE SUPPORTING ASYNCHRONOUS MEMORY ACCESS

Publication number: 20170212760

Abstract: A system and method for reducing latencies of main memory data accesses are described. A non-blocking load (NBLD) instruction identifies an address of requested data and a subroutine. The subroutine includes instructions dependent on the requested data. A processing unit verifies that address translations are available for both the address and the subroutine. The processing unit continues processing instructions with no stalls caused by younger-in-program-order instructions waiting for the requested data. The non-blocking load unit performs a cache coherent data read request on behalf of the NBLD instruction and requests that the processing unit perform an asynchronous jump to the subroutine upon return of the requested data from lower-level memory.

Type: Application

Filed: November 16, 2016

Publication date: July 27, 2017

Inventors: Meenakshi Sundaram Bhaskaran, Elliot H. Mednick, David A. Roberts, Anthony Asaro, Amin Farmahini-Farahani

1 2 next