Patents Assigned to Advanced Micro Devics, Inc.

Core Activation and Deactivation for a Multi-Core Processor

Publication number: 20230315191

Abstract: Core activation and deactivation for a multi-core processor is described. In accordance with the described techniques, a processor having multiple cores operates using a first core configuration. A request to switch from the first core configuration to a second core configuration is received. Responsive to the request, a switch from the first core configuration to the second core configuration occurs by adjusting a number of active cores of the processor without rebooting.

Type: Application

Filed: March 30, 2022

Publication date: October 5, 2023

Applicant: Advanced Micro Devices, Inc.

Inventors: William Robert Alverson, Amitabh Mehra, Jerry Anton Ahrens, Grant Evan Ley, Anil Harwani, Joshua Taylor Knight
Throttling shaders based on resource usage in a graphics pipeline

Patent number: 11776085

Abstract: A processing system includes a graphics pipeline that executes a first shader of a first type and a second shader of a second type. In some cases, the first shader is a geometry shader and the second shader is a pixel shader. The processing system also includes buffers that hold primitives generated by the first shader and provide the primitives to the second shader. The processing system also includes a primitive hub that monitors fullness of the buffers. Launching of waves from the first shader is throttled based on the fullness of the buffers. A shader processor input (SPI) selectively throttles the waves launched by the geometry shader based on a signal from the primitive hub indicating the fullness, an indication of relative resource usage of geometry waves and pixel waves in the graphics pipeline, or an indication of lifetimes of the geometry waves.

Type: Grant

Filed: December 16, 2020

Date of Patent: October 3, 2023

Assignee: Advanced Micro Devices, Inc.

Inventors: Nishank Pathak, Randy Wayne Ramsey, Tad Litwiller, Rex Eldon McCrary
Encoded enable clock gaters

Patent number: 11776599

Abstract: A processing device is provided which includes a processor and a data storage structure. The data storage structure comprises a data storage array comprising a plurality of lines. Each line comprises at least one A latch configured to store a data bit and a clock gater. The data storage structure also comprises a write data B latch configured to store, over different clock cycles, a different data bit, each to be written to the at least one A latch of one of the plurality of lines. The data storage structure also comprises a plurality of write index B latches shared by the clock gaters of the lines. The write index B latches are configured to store, over the different clock cycles, combinations of index bits having values which index one of the lines to which a corresponding data bit is to be stored.

Type: Grant

Filed: September 24, 2021

Date of Patent: October 3, 2023

Assignee: Advanced Micro Devices, Inc.

Inventor: Patrick J. Shyvers
Power saving through delayed message processing

Patent number: 11775043

Abstract: Systems and methods are disclosed for reducing the power consumption of a system. Techniques are described that queue a message, sent by a source engine of the system, in a queue of a destination engine of the system that is in a sleep mode. Then, a priority level associated with the queued message is determined. If the priority level is at a maximum level, the destination engine is brought into an active mode. If the priority level is at an intermediate level, the destination engine is brought into an active mode when a time, associated with the intermediate level, has elapsed. When the destination engine is brought into an active mode it processes all messages accumulated in its queue in an order determined by their associated priority levels.

Type: Grant

Filed: September 24, 2021

Date of Patent: October 3, 2023

Assignee: Advanced Micro Devices, Inc.

Inventor: Vidyashankar Viswanathan
Cross FET SRAM cell layout

Patent number: 11778803

Abstract: A system and method for efficiently creating layout for memory bit cells are described. In various implementations, a memory bit cell uses Cross field effect transistors (FETs) that include vertically stacked gate all around (GAA) transistors with conducting channels oriented in an orthogonal direction between them. The channels of the vertically stacked transistors use opposite doping polarities. The memory bit cell includes one of a read bit line and a write word line routed in no other metal layer other than a local interconnect layer. In addition, a six transistor (6T) random access data storage of the given memory bit cell consumes a planar area above a silicon substrate of four transistors.

Type: Grant

Filed: September 29, 2021

Date of Patent: October 3, 2023

Assignee: Advanced Micro Devices, Inc.

Inventor: Richard T. Schultz
Runtime extension for neural network training with heterogeneous memory

Patent number: 11775799

Abstract: Systems, apparatuses, and methods for managing buffers in a neural network implementation with heterogeneous memory are disclosed. A system includes a neural network coupled to a first memory and a second memory. The first memory is a relatively low-capacity, high-bandwidth memory while the second memory is a relatively high-capacity, low-bandwidth memory. During a forward propagation pass of the neural network, a run-time manager monitors the usage of the buffers for the various layers of the neural network. During a backward propagation pass of the neural network, the run-time manager determines how to move the buffers between the first and second memories based on the monitored buffer usage during the forward propagation pass. As a result, the run-time manager is able to reduce memory access latency for the layers of the neural network during the backward propagation pass.

Type: Grant

Filed: November 19, 2018

Date of Patent: October 3, 2023

Assignee: Advanced Micro Devices, Inc.

Inventors: Georgios Mappouras, Amin Farmahini-Farahani, Sudhanva Gurumurthi, Abhinav Vishnu, Gabriel H. Loh
Error-Tolerant Memory System for Machine Learning Systems

Publication number: 20230305923

Abstract: A memory system uses error detection codes to detect when errors have occurred in a region of memory. A count of the number of errors is kept and a notification is output in response to the number of errors satisfying a threshold value. The notification is an indication to a host (e.g., a program accessing or managing a machine learning system) that the threshold number of errors have been detected in the region of memory. As long as the number of errors that have been detected in the region of memory remains under the threshold number no notification need be output to the host.

Type: Application

Filed: March 25, 2022

Publication date: September 28, 2023

Applicant: Advanced Micro Devices, Inc.

Inventors: Sudhanva Gurumurthi, Ganesh Suryanarayan Dasika
ELECTRONIC DEVICE INCLUDING DIES AND AN INTERCONNECT COUPLED TO THE DIES AND PROCESSES OF FORMING THE SAME

Publication number: 20230307405

Abstract: An electronic device can include a first die, a second die, and an interconnect. The first die or the second die has a principal function as a power module or a memory. The first die includes a first bond pad, and the second die includes a second bond pad. The device sides of the first and second dies are along the same sides as the first and second bond pads. In an embodiment, the first die and the second die are in a chip first, die face-up configuration. The first and the second bond pads are electrically connected along a first solderless connection that includes the interconnect. In another embodiment, each material within the electrical connection between the first and the second bond pads has a flow point or melting point temperature of at least 300° C.

Type: Application

Filed: March 25, 2022

Publication date: September 28, 2023

Applicant: Advanced Micro Devices, Inc.

Inventors: Lei Fu, Raja Swaminathan, Brett P. Wilkerson
Array of Pointers Prefetching

Publication number: 20230305849

Abstract: Array of pointers prefetching is described. In accordance with described techniques, a pointer target instruction is detected by identifying that a destination location of a load instruction is used in an address compute for a memory operation and the load instruction is included in a sequence of load instructions having addresses separated by a step size. An instruction for fetching data of a future load instruction is injected in an instruction stream of a processor. The data of the future load instruction is stored in a temporary register. An additional instruction is injected in the instruction stream for prefetching a pointer target based on an address of the memory operation and the data of the future load instruction.

Type: Application

Filed: March 25, 2022

Publication date: September 28, 2023

Applicant: Advanced Micro Devices, Inc.

Inventors: Chetana N. Keltcher, Alok Garg, Paul S. Keltcher
Processing unit with mixed precision operations

Patent number: 11768664

Abstract: A graphics processing unit (GPU) implements operations, with associated op codes, to perform mixed precision mathematical operations. The GPU includes an arithmetic logic unit (ALU) with different execution paths, wherein each execution path executes a different mixed precision operation. By implementing mixed precision operations at the ALU in response to designate op codes that delineate the operations, the GPU efficiently increases the precision of specified mathematical operations while reducing execution overhead.

Type: Grant

Filed: October 2, 2019

Date of Patent: September 26, 2023

Assignee: Advanced Micro Devices, Inc.

Inventors: Bin He, Michael Mantor, Jiasheng Chen
Re-reference indicator for re-reference interval prediction cache replacement policy

Patent number: 11768778

Abstract: Techniques for performing cache operations are provided. The techniques include tracking re-references for cache lines of a cache, detecting that eviction is to occur, and selecting a cache line for eviction from the cache based on a re-reference indication.

Type: Grant

Filed: September 30, 2021

Date of Patent: September 26, 2023

Assignee: Advanced Micro Devices, Inc.

Inventor: Paul J. Moyer
Cache management based on access type priority

Patent number: 11768779

Abstract: Systems, apparatuses, and methods for cache management based on access type priority are disclosed. A system includes at least a processor and a cache. During a program execution phase, certain access types are more likely to cause demand hits in the cache than others. Demand hits are load and store hits to the cache. A run-time profiling mechanism is employed to find which access types are more likely to cause demand hits. Based on the profiling results, the cache lines that will likely be accessed in the future are retained based on their most recent access type. The goal is to increase demand hits and thereby improve system performance. An efficient cache replacement policy can potentially reduce redundant data movement, thereby improving system performance and reducing energy consumption.

Type: Grant

Filed: December 16, 2019

Date of Patent: September 26, 2023

Assignee: Advanced Micro Devices, Inc.

Inventors: Jieming Yin, Yasuko Eckert, Subhash Sethumurugan
Techniques for handling cache coherency traffic for contended semaphores

Patent number: 11768771

Abstract: The techniques described herein improve cache traffic performance in the context of contended lock instructions. More specifically, each core maintains a lock address contention table that stores addresses corresponding to contended lock instructions. The lock address contention table also includes a state value that indicates progress through a series of states meant to track whether a load by the core in a spin-loop associated with semaphore acquisition has obtained the semaphore in an exclusive state. Upon detecting that a load in a spin-loop has obtained the semaphore in an exclusive state, the core responds to incoming requests for access to the semaphore with negative acknowledgments. This allows the core to maintain the semaphore cache line in an exclusive state, which allows it to acquire the semaphore faster and to avoid transmitting that cache line to other cores unnecessarily.

Type: Grant

Filed: December 9, 2021

Date of Patent: September 26, 2023

Assignee: Advanced Micro Devices, Inc.

Inventors: John M. King, Gregory W. Smaus
Low latency long short-term memory inference with sequence interleaving

Patent number: 11769041

Abstract: Systems, apparatuses, and methods for implementing a low latency long short-term memory (LSTM) machine learning engine using sequence interleaving techniques are disclosed. A computing system includes at least a host processing unit, a machine learning engine, and a memory. The host processing unit detects a plurality of sequences which will be processed by the machine learning engine. The host processing unit interleaves the sequences into data blocks and stores the data blocks in the memory. When the machine learning engine receives a given data block, the machine learning engine performs, in parallel, a plurality of matrix multiplication operations on the plurality of sequences in the given data block and a plurality of coefficients. Then, the outputs of the matrix multiplication operations are coupled to one or more LSTM layers.

Type: Grant

Filed: October 31, 2018

Date of Patent: September 26, 2023

Assignees: Advanced Micro Devices, Inc., ATI Technologies ULC

Inventors: Sateesh Lagudu, Lei Zhang, Allen H. Rush
DISTRIBUTED VISIBILITY STREAM GENERATION FOR COARSE GRAIN BINNING

Publication number: 20230298261

Abstract: Techniques for performing rendering operations are disclosed herein. The techniques include performing two-level primitive batch binning in parallel across multiple rendering engines, wherein tiles for subdividing coarse-level work across the rendering engines have the same size as tiles for performing coarse binning.

Type: Application

Filed: June 21, 2022

Publication date: September 21, 2023

Applicant: Advanced Micro Devices, Inc.

Inventors: Michael John Livesley, Ruijin Wu, Mangesh P. Nijasure
Load Dependent Branch Prediction

Publication number: 20230297381

Abstract: Load dependent branch prediction is described. In accordance with described techniques, a load dependent branch instruction is detected by identifying that a destination location of a load instruction is used in an operation for determining whether a conditional branch is taken or not taken. The load instruction is included in a sequence of load instructions having addresses separated by a step size. An instruction is injected in an instruction stream of a processor for fetching data of a future load instruction using an address of the load instruction offset by a distance based on the step size. An additional instruction is injected in the instruction stream of the processor for precomputing an outcome of a load dependent branch using an address computed based on an address of the operation and the data of the future load instruction.

Type: Application

Filed: March 21, 2022

Publication date: September 21, 2023

Applicant: Advanced Micro Devices, Inc.

Inventors: Chetana N. Keltcher, Alok Garg, Paul S Keltcher
STACK-BASED RAY TRAVERSAL WITH DYNAMIC MULTIPLE-NODE ITERATIONS

Publication number: 20230298256

Abstract: A technique for performing ray tracing operations is provided. The technique includes, in response to detecting that a threshold number of traversal stage work-items of a wavefront have terminated, increasing intersection test parallelization for non-terminated work-items.

Type: Application

Filed: June 20, 2022

Publication date: September 21, 2023

Applicants: Advanced Micro Devices, Inc., ATI Technologies ULC

Inventors: Daniel James Skinner, Michael John Livesley, David William John Pankratz
Adaptive biasing circuit for serial communication interfaces

Patent number: 11764789

Abstract: Systems and techniques for applying voltage biases to gates of driver circuitry of an integrated circuit (IC) based on a detected bus voltage, IC supply voltage, or both are used to mitigate Electrical Over-Stress (EOS) issues in components of the driver circuitry caused, for instance, by high bus voltages in serial communication systems relative to maximum operating voltages of those components. A driver bias generator selectively applies bias voltages at gates of transistors of a stacked driver structure of an IC to prevent the voltage drop across any given transistor of the stacked driver structure from exceeding a predetermined threshold associated with the maximum operating voltage range of the transistors.

Type: Grant

Filed: September 28, 2021

Date of Patent: September 19, 2023

Assignee: Advanced Micro Devices, Inc.

Inventors: Rajesh Mangalore Anand, Prasant Kumar Vallur, Piyush Gupta, Girish Anathahalli Singrigowda, Jagadeesh Anathahalli Singrigowda
Cuckoo filters and cuckoo hash tables with biasing, compression, and decoupled logical sparsity

Patent number: 11762828

Abstract: A method includes, for each key of a plurality of keys, identifying from a set of buckets a first bucket for the key based on a first hash function, and identifying from the set of buckets a second bucket for the key based on a second hash function. An entry for the key is stored in a bucket selected from one of the first bucket and the second bucket. The entry is inserted in a sequence of entries in a memory block. A position of the entry in the sequence of entries corresponds to the selected bucket. For each bucket in the set of buckets, an indication of a number of entries in the bucket is recorded.

Type: Grant

Filed: August 17, 2018

Date of Patent: September 19, 2023

Assignee: Advanced Micro Devices, Inc.

Inventors: Alexander D. Breslow, Nuwan S. Jayasena
Method and apparatus for a dram cache tag prefetcher

Patent number: 11762777

Abstract: Devices and methods for cache prefetching are provided. A device is provided which comprises memory and a processor. The memory comprises a DRAM cache, a cache dedicated to the processor and one or more intermediate caches between the dedicated cache and the DRAM cache. The processor is configured to issue prefetch requests to prefetch data, issue data access requests to fetch the data and when one or more previously issued prefetch requests are determined to be inaccurate, issue a prefetch request to prefetch a tag, corresponding to the memory address of requested data in the DRAM cache. A tag look-up is performed at the DRAM cache without performing tag look-ups at the dedicated cache or the intermediate caches. The tag is prefetched from the DRAM cache without prefetching the requested data.

Type: Grant

Filed: March 31, 2021

Date of Patent: September 19, 2023

Assignee: Advanced Micro Devices, Inc.

Inventors: Jagadish B. Kotra, Marko Scrbak, Matthew Raymond Poremba

prev … 37 38 39 40 41 42 43 44 45 … next