Patents by Inventor Randy RAMSEY

Randy RAMSEY has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

DYNAMIC DISPATCH FOR WORKGROUP DISTRIBUTION

Publication number: 20240320780

Abstract: Systems, methods, and techniques dynamically utilize load balancing for workgroup assignments between a group of shader engines by a command processor of a graphics processing unit (GPU). Based on one or more commands received for execution, a plurality of workgroups is generated for assignment to a plurality of shader engines for processing, each shader engine including a respective quantity of active compute units. Each workgroup of the plurality of workgroups is dynamically assigned to a respective shader engine for execution based at least in part on indications of available resources respectively associated with each of the shader engines. In various embodiments, the indications of available resources may include physical parameters regarding each shader engine, as well as current status information regarding the processing of workgroups assigned to each shader engine.

Type: Application

Filed: March 12, 2024

Publication date: September 26, 2024

Inventors: Randy RAMSEY, Yash UKIDAVE
Method and apparatus for implementing a rasterizer in GPU operations

Patent number: 11954757

Abstract: An apparatus, such as a graphical processing unit (GPU), includes one or more processors configured to determine a plurality of first locality information of a received wave at a processing unit and to select a first processing element of a plurality of processing elements, the first processing unit having a plurality of second locality information from a previous wave that matches the plurality of first locality information to execute the received wave.

Type: Grant

Filed: December 28, 2021

Date of Patent: April 9, 2024

Assignee: Advanced Micro Devices, Inc.

Inventors: Yash Ukidave, Randy Ramsey, Sukanya Chavan, Zhongliang Chen
Dynamic dispatch for workgroup distribution

Patent number: 11941723

Abstract: Systems, methods, and techniques dynamically utilize load balancing for workgroup assignments between a group of shader engines by a command processor of a graphics processing unit (GPU). Based on one or more commands received for execution, a plurality of workgroups is generated for assignment to a plurality of shader engines for processing, each shader engine including a respective quantity of active compute units. Each workgroup of the plurality of workgroups is dynamically assigned to a respective shader engine for execution based at least in part on indications of available resources respectively associated with each of the shader engines. In various embodiments, the indications of available resources may include physical parameters regarding each shader engine, as well as current status information regarding the processing of workgroups assigned to each shader engine.

Type: Grant

Filed: December 29, 2021

Date of Patent: March 26, 2024

Assignee: Advanced Micro Devices, Inc.

Inventors: Randy Ramsey, Yash Ukidave
METHOD AND APPARATUS FOR IMPLEMENTING A RASTERIZER IN GPU OPERATIONS

Publication number: 20230206381

Abstract: An apparatus, such as a graphical processing unit (GPU), includes one or more processors configured to determine a plurality of first locality information of a received wave at a processing unit and to select a first processing element of a plurality of processing elements, the first processing unit having a plurality of second locality information from a previous wave that matches the plurality of first locality information to execute the received wave.

Type: Application

Filed: December 28, 2021

Publication date: June 29, 2023

Inventors: Yash UKIDAVE, Randy RAMSEY, Sukanya CHAVAN, Zhongliang CHEN
DYNAMIC DISPATCH FOR WORKGROUP DISTRIBUTION

Publication number: 20230206382

Abstract: Systems, methods, and techniques dynamically utilize load balancing for workgroup assignments between a group of shader engines by a command processor of a graphics processing unit (GPU). Based on one or more commands received for execution, a plurality of workgroups is generated for assignment to a plurality of shader engines for processing, each shader engine including a respective quantity of active compute units. Each workgroup of the plurality of workgroups is dynamically assigned to a respective shader engine for execution based at least in part on indications of available resources respectively associated with each of the shader engines. In various embodiments, the indications of available resources may include physical parameters regarding each shader engine, as well as current status information regarding the processing of workgroups assigned to each shader engine.

Type: Application

Filed: December 29, 2021

Publication date: June 29, 2023

Inventors: Randy RAMSEY, Yash UKIDAVE
PRIORITY INVERSION MITIGATION

Publication number: 20230205602

Abstract: Parallel processors typically allocate resources to workloads based on workload priority. Priority inversion of resource allocation between workloads of different priorities reduces the operating efficiency of a parallel processor in some cases. A parallel processor mitigates priority inversion by soft-locking resources to prevent their allocation for the processing of lower priority workloads. Soft-locking is enabled responsive to a soft-lock condition, such as one or more priority inversion heuristics exceeding corresponding thresholds or multiple failed allocations of higher priority workloads within a time period. In some cases, priority inversion heuristics include quantities of higher priority workloads and lower priority workloads that are in-flight or incoming, ratios between such quantities, quantities of render targets, or a combination of these.

Type: Application

Filed: December 28, 2021

Publication date: June 29, 2023

Inventors: Yash UKIDAVE, Randy Ramsey, Nishank Pathak, Baturay Turkmen
VARIABLE DISPATCH WALK

Publication number: 20230195509

Abstract: A processing unit performs a dispatch walk of a set of thread groups based on a programmable access pattern. The access pattern is stored at a table that is programmed with the access pattern based upon a specified command. By using the command to program the table with different access patterns, the dispatch order of the set of thread groups is adapted to better suit the processing of different data sets, thereby reducing power consumption at the processing unit, and improving overall processing efficiency.

Type: Application

Filed: December 21, 2021

Publication date: June 22, 2023

Inventors: Saurabh Sharma, Jeremy Lukacs, Hashem Hashemi, Gianpaolo Tommasi, Guennadi Riguer, Mark Fowler, Randy Ramsey
VARIABLE DISPATCH WALK FOR SUCCESSIVE CACHE ACCESSES

Publication number: 20230195626

Abstract: A processing system is configured to translate a first cache access pattern of a dispatch of work items to a cache access pattern that facilitates consumption of data stored at a cache of a parallel processing unit by a subsequent access before the data is evicted to a more remote level of the memory hierarchy. For consecutive cache accesses having read-after-read data locality, in some embodiments the processing system translates the first cache access pattern to a space-filling curve. In some embodiments, for consecutive accesses having read-after-write data locality, the processing system translates a first typewriter cache access pattern that proceeds in ascending order for a first access to a reverse typewriter cache access pattern that proceeds in descending order for a subsequent cache access. By translating the cache access pattern based on data locality, the processing system increases the hit rate of the cache.

Type: Application

Filed: December 21, 2021

Publication date: June 22, 2023

Inventors: Saurabh Sharma, Jeremy Lukacs, Hashem Hashemi, Gianpaolo Tommasi, Guennadi Riguer, Mark Fowler, Randy Ramsey
Selectively dispatching waves based on accumulators holding behavioral characteristics of waves currently executing

Patent number: 11397578

Abstract: An apparatus such as a graphics processing unit (GPU) includes a plurality of processing elements configured to concurrently execute a plurality of first waves and accumulators associated with the plurality of processing elements. The accumulators are configured to store accumulated values representative of behavioral characteristics of the plurality of first waves that are concurrently executing on the plurality of processing elements. The apparatus also includes a dispatcher configured to dispatch second waves to the plurality of processing elements based on comparisons of values representative of behavioral characteristics of the second waves and the accumulated values stored in the accumulators. In some cases, the behavioral characteristics of the plurality of first waves comprise at least one of fetch bandwidths, usage of an arithmetic logic unit (ALU), and number of export operations.

Type: Grant

Filed: August 30, 2019

Date of Patent: July 26, 2022

Assignee: Advanced Micro Devices, Inc.

Inventors: Randy Ramsey, William David Isenberg, Michael Mantor
Exception handler for sampling draw dispatch identifiers

Patent number: 11386518

Abstract: The address of the draw or dispatch packet responsible for creating an exception is tied to a shader/wavefront back to the draw command from which it originated. In various embodiments, a method of operating a graphics pipeline and exception handling includes receiving, at a command processor of a graphics processing unit (GPU), an exception signal indicating an occurrence of a pipeline exception at a shader stage of a graphics pipeline. The shader stage generates an exception signal in response to a pipeline exception and transmits the exception signal to the command processor. The command processor determines, based on the exception signal, an address of a command packet responsible for the occurrence of the pipeline exception.

Type: Grant

Filed: September 24, 2019

Date of Patent: July 12, 2022

Assignee: ADVANCED MICRO DEVICES, INC.

Inventors: Michael Mantor, Alexander Fuad Ashkar, Randy Ramsey, Mangesh P. Nijasure, Brian Emberling
EXCEPTION HANDLER FOR SAMPLING DRAW DISPATCH IDENTIFIERS

Publication number: 20210090205

Abstract: The address of the draw or dispatch packet responsible for creating an exception is tied to a shader/wavefront back to the draw command from which it originated. In various embodiments, a method of operating a graphics pipeline and exception handling includes receiving, at a command processor of a graphics processing unit (GPU), an exception signal indicating an occurrence of a pipeline exception at a shader stage of a graphics pipeline. The shader stage generates an exception signal in response to a pipeline exception and transmits the exception signal to the command processor. The command processor determines, based on the exception signal, an address of a command packet responsible for the occurrence of the pipeline exception.

Type: Application

Filed: September 24, 2019

Publication date: March 25, 2021

Inventors: Michael MANTOR, Alexander Fuad ASHKAR, Randy RAMSEY, Mangesh P. NIJASURE, Brian EMBERLING
ACCUMULATORS FOR BEHAVIORAL CHARACTERISTICS OF WAVES

Publication number: 20210064366

Abstract: An apparatus such as a graphics processing unit (GPU) includes a plurality of processing elements configured to concurrently execute a plurality of first waves and accumulators associated with the plurality of processing elements. The accumulators are configured to store accumulated values representative of behavioral characteristics of the plurality of first waves that are concurrently executing on the plurality of processing elements. The apparatus also includes a dispatcher configured to dispatch second waves to the plurality of processing elements based on comparisons of values representative of behavioral characteristics of the second waves and the accumulated values stored in the accumulators. In some cases, the behavioral characteristics of the plurality of first waves comprise at least one of fetch bandwidths, usage of an arithmetic logic unit (ALU), and number of export operations.

Type: Application

Filed: August 30, 2019

Publication date: March 4, 2021

Inventors: Randy RAMSEY, William David ISENBERG, Michael MANTOR
Polymer concrete having high bond strength and long working time

Patent number: 4816503

Abstract: Polymer concretes having a high bond strength and/or long working times are made from a curable composition of norbornyl modified unsaturated polyester or polyesteramide resins blended with a polymerizable monomer such as styrene, an aggregate mixture such as sand and gravel and an effective amount of styrene acrylonitrile copolymers, styrene alphamethylstyrene copolymers, or a styrene acrylonitrile copolymer mixture with no more than 25% by weight polystyrene.

Type: Grant

Filed: April 8, 1987

Date of Patent: March 28, 1989

Assignee: The Dow Chemical Company

Inventors: William C. Cunningham, Randy A. Ramsey, Randal E. Autenrieth