Patents Assigned to Advanced Micro Devices
-
Patent number: 12367153Abstract: The disclosed method includes detecting a stalled memory request, for a first level cache within a cache hierarchy of a processor, that includes an outstanding memory request associated with a second level cache within the cache hierarchy that is higher than the first level cache. The method further includes indicating, to the second level cache, that the first level cache is experiencing a starvation issue due to stalled memory requests to enable the second level cache to perform starvation-remediation actions. Various other methods, systems, and computer-readable media are also disclosed.Type: GrantFiled: September 30, 2022Date of Patent: July 22, 2025Assignee: Advanced Micro Devices, Inc.Inventors: Sankaranarayanan Gurumurthy, Anil Harwani
-
Publication number: 20250231606Abstract: Systems, methods, and devices for integrated circuit power management. A mode of a power management state is entered, from the power management state, in response to an entry condition of the mode. A device that is otherwise powered off in the power management state is powered on in the mode of the power management state. In some implementations, the device includes a communications path between a second device and a third device. In some implementations, the device is in a power domain that is powered off in the power management state. In some implementations, the power domain is powered off in the mode. In some implementations, the device is powered on in the mode via a power rail that is specific to the mode. In some implementations, the entry condition of the mode includes an amount of data stored for display in a display buffer falling below a threshold amount.Type: ApplicationFiled: April 7, 2025Publication date: July 17, 2025Applicant: Advanced Micro Devices, Inc.Inventors: Benjamin Tsien, Indrani Paul, Alexander J. Branover, Thomas J. Gibney, Mihir Shaileshbhai Doctor, John P. Petry, Stephen V. Kosonocky, Christopher T. Weaver
-
Patent number: 12360912Abstract: An approach provides indirect addressing support for PIM. Indirect PIM commands include address translation information that allows memory modules to perform indirect addressing. Processing logic in a memory module processes an indirect PIM command and retrieves, from a first memory location, a virtual address of a second memory location. The processing logic calculates a corresponding physical address for the virtual address using the address translation information included with the indirect PIM command and retrieves, from the second memory location, a virtual address of a third memory location. This process is repeated any number of times until one or more indirection stop criteria are satisfied. The indirection stop criteria stop the process when work has been completed normally or to prevent errors. Implementations include the processing logic in the memory module working in cooperation with a memory controller to perform indirect addressing.Type: GrantFiled: December 23, 2021Date of Patent: July 15, 2025Assignee: Advanced Micro Devices, Inc.Inventors: Matthew R. Poremba, Alexandru Dutu, Sooraj Puthoor
-
Patent number: 12360927Abstract: Systems, apparatuses, and methods for implementing duplicated registers for access by initiators across multiple semiconductor dies are disclosed. A system includes multiple initiators on multiple semiconductor dies of a chiplet processor. One of the semiconductor dies is the master die, and this master die has copies of registers which can be accessed by the multiple initiators on the multiple semiconductor dies. When a given initiator on a given secondary die generates a register access, the register access is routed to the master die and a particular duplicate copy of the register maintained for the given secondary die. From the point of view of software, the multiple semiconductor dies appear as a single die, and the multiple initiators appear as a single initiator. Multiple types of registers can be maintained by the master die, with a flush register being one of the register types.Type: GrantFiled: March 28, 2024Date of Patent: July 15, 2025Assignees: Advanced Micro Devices, Inc., ATI Technologies ULCInventors: Haikun Dong, Kostantinos Danny Christidis, Ling-Ling Wang, Minhua Wu, Gaojian Cong, Rui Wang
-
Patent number: 12360896Abstract: Data routing for efficient decompressor use is described. In accordance with the described techniques, a cache controller receives requests from multiple requestors for elements of data stored in a compressed format in a cache. The requests include at least a first request from a first requestor and a second request from a second requestor. A decompression routing system identifies a redundant element of data requested by both the first requestor and the second requestor and causes decompressors to decompress the requested elements of data. The decompression includes performing a single decompression of the redundant element. After the decompression, the decompression routing system routes the decompressed elements to the plurality of requestors, which includes routing the decompressed redundant element to both the first requestor and the second requestor.Type: GrantFiled: October 25, 2023Date of Patent: July 15, 2025Assignees: Advanced Micro Devices, Inc., Samsung Electronics Co., LtdInventors: Jeffrey Christopher Allan, Balakrishnan Sundararaman, Jeongae Park, Wilson Wai Lun Fung, Zhenhong Liu
-
Patent number: 12360907Abstract: A method for performing prefetching operations is disclosed. The method includes storing a recorded access pattern indicating a set of accesses for a region; in response to an access within the region, fetching the recorded access pattern; and performing prefetching based on the access pattern.Type: GrantFiled: September 30, 2022Date of Patent: July 15, 2025Assignee: Advanced Micro Devices, Inc.Inventors: Gabriel H. Loh, Marko Scrbak, Akhil Arunkumar, John Kalamatianos
-
Patent number: 12361628Abstract: A graphics processing unit (GPU) of a processing system is partitioned into multiple dies (referred to as GPU chiplets) that are configurable to collectively function and interface with an application as a single GPU in a first mode and as multiple GPUs in a second mode. By dividing the GPU into multiple GPU chiplets, the processing system flexibly and cost-effectively configures an amount of active GPU physical resources based on an operating mode. In addition, a configurable number of GPU chiplets are assembled into a single GPU, such that multiple different GPUs having different numbers of GPU chiplets can be assembled using a small number of tape-outs and a multiple-die GPU can be constructed out of GPU chiplets that implement varying generations of technology.Type: GrantFiled: December 8, 2022Date of Patent: July 15, 2025Assignee: Advanced Micro Devices, Inc.Inventors: Mark Fowler, Samuel Naffziger, Michael Mantor, Mark Leather
-
Patent number: 12360804Abstract: A processing system flexibly schedules workgroups across kernels based on data dependencies between workgroups to enhance processing efficiency. The workgroups are partitioned into subsets based on the data dependencies and workgroups of a first subset that produces data are scheduled to execute immediately before workgroups of a second subset that consumes the data generated by the first subset. Thus, the processing system does not execute one kernel at a time, but instead schedules workgroups across kernels based on data dependencies across kernels. By limiting the sizes of the subsets to the amount of data that can be stored at local caches, the processing system increases the probability that data to be consumed by workgroups of a subset will be resident in a local cache and will not require a memory access.Type: GrantFiled: December 30, 2022Date of Patent: July 15, 2025Assignee: Advanced Micro Devices, Inc.Inventor: Harris Gasparakis
-
Publication number: 20250224982Abstract: In accordance with the described techniques, a scalable input/output virtualization (SIOV) device includes multiple hardware queues, backend hardware resources, and a command processor running scheduling firmware. The scheduling firmware selects a shared work queue of multiple shared work queues managed by the scheduling firmware from which to dispatch tasks based on one or dispatch policies. In addition, the scheduling firmware selects a hardware queue of the multiple hardware queues in which to enqueue the tasks based on one or more queue policies. Further, the scheduler dispatches the tasks from the shared work queue to the hardware queue, and the tasks are read from the hardware queue by the backend hardware resources for execution.Type: ApplicationFiled: January 10, 2024Publication date: July 10, 2025Applicant: Advanced Micro Devices, Inc.Inventors: Anthony Thomas Gutierrez, Stephen Alexander Zekany, Ali Arda Eker
-
Publication number: 20250225017Abstract: A system comprises a machine check architecture and a processor. The machine check architecture is configured to log hardware errors. The processor is configured to obtain a log of one or more of the hardware errors from the machine check architecture and/or to generate a copy of the log. The processor is further configured to either (1) deliver the log to an in-band agent and the copy of the log to an out-of-band agent or (2) deliver the copy of the log to the in-band agent and the log to the out-of-band agent. Various other devices, systems, and methods are also disclosed.Type: ApplicationFiled: March 31, 2025Publication date: July 10, 2025Applicants: ATI Technologies ULC, Advanced Micro Devices, Inc.Inventors: Vilas Sridharan, Vamsi Krishna Alla, Maher Mounir Moghabghab, Kabita Rani Saha, Carlos Vallin, Vignesh Vaidhyanathan Seshan
-
Patent number: 12354183Abstract: A primary processing unit includes queues configured to store commands prior to execution in corresponding pipelines. The primary processing unit also includes a first table configured to store entries indicating dependencies between commands that are to be executed on different ones of a plurality of processing units that include the primary processing unit and one or more secondary processing units. The primary processing unit also includes a scheduler configured to release commands in response to resolution of the dependencies. In some cases, a first one of the secondary processing units schedules the first command for execution in response to resolution of a dependency on a second command executing in a second one of the secondary processing units. The second one of the secondary processing units notifies the primary processing unit in response to completing execution of the second command.Type: GrantFiled: May 29, 2024Date of Patent: July 8, 2025Assignee: Advanced Micro Devices, Inc.Inventor: Rex Eldon McCrary
-
Patent number: 12353338Abstract: A data processing node includes a processor element and a data fabric circuit. The data fabric circuit is coupled to the processor element and to a local memory element and includes a crossbar switch. The data fabric circuit is operable to bypass the crossbar switch for memory access requests between the processor element and the local memory element.Type: GrantFiled: June 29, 2023Date of Patent: July 8, 2025Assignee: Advanced Micro Devices, Inc.Inventor: Gabriel H. Loh
-
Publication number: 20250217120Abstract: Using artificial intelligence (AI)-based techniques to guide instruction scheduling in a compiler can improve the efficiency and code generation quality of the compiler. AI-guided scheduling of a basic block of a computer program can include obtaining first and second representations of the basic block; selecting K instruction scheduling procedures from a set of N instruction scheduling procedures based on analysis of the first representation of the basic block by a model, where 1?K<N and N?2; generating K candidate schedules of the basic block, including applying the K instruction scheduling procedures to the second representation of the basic block, and ordering the instructions of the second representation of the basic block in accordance with a candidate schedule included in the K candidate schedules.Type: ApplicationFiled: December 28, 2023Publication date: July 3, 2025Applicant: Advanced Micro Devices, Inc.Inventors: Jake Matthew Daly, Ian Charles Colbert, Andrei Rudolfovich Yershov, Ryan Mitchell, Robert A. Gottlieb, Norman Rubin
-
Publication number: 20250217298Abstract: A method for reducing cache fills can include training a filter, by at least one processor and in response to at least one of eviction or rewrite of one or more entries of a cache, the filter indicating one or more cache loads from which the one or more entries were previously filled. The method can also include preventing, by the at least one processor and based on the trained filter, one or more subsequent fills to the cache from the one or more cache loads. Various other methods and systems are also disclosed.Type: ApplicationFiled: December 28, 2023Publication date: July 3, 2025Applicant: Advanced Micro Devices, Inc.Inventors: Alok Garg, Matthew Sobel, Alice Danielle Kivity
-
Publication number: 20250218104Abstract: A device that defines and uses a bounding volume for testing for ray intersections with a displaced micro-mesh. The bounding volume is indirectly based on a twisted prism composed of two triangles and three bilinear patches that bounds the displaced micro-mesh. Instead of detecting intersection with the bilinear patches directly, tetrahedrons that circumscribe the bilinear patches can be used instead. The two bases and the three tetrahedra make fourteen triangles. The device tests for potential intersection with the displaced micro-mesh by testing for an intersection with any of the fourteen triangles. Various other methods and systems are also disclosed.Type: ApplicationFiled: December 28, 2023Publication date: July 3, 2025Applicant: Advanced Micro Devices, Inc.Inventors: David Kirk McAllister, Andrew Erin Kensler, Holger Gruen
-
Publication number: 20250216889Abstract: The disclosed device includes various circuit blocks and a clock tree for sending a clock signal to the circuit blocks. The clock tree includes various clock drivers. The device also includes a control circuit that power gates, in response to one of the circuit blocks being power gated, a portion of the clock tree that includes one of the clock drivers. Various other methods, systems, and computer-readable media are also disclosed.Type: ApplicationFiled: December 28, 2023Publication date: July 3, 2025Applicant: Advanced Micro Devices, Inc.Inventors: Benjamin Tsien, Pravesh Gupta, Madhusudan Chilakam, Jeffrey Lynn Freeman, Indrani Paul, Guhan Krishnan, Ann M. Ling, Chandana Yerneni
-
Publication number: 20250217185Abstract: A computer-implemented method for physical core-specific wear-based task scheduling can include obtaining a wear metric for each physical core based of the plurality of physical cores of the at least one integrated circuit, wherein the wear metric is indicative of a physical condition of each physical core. The computer-implemented method can then schedule a plurality of tasks across at least one physical core of the plurality of physical cores based at least in part on the wear metric of each physical core of the plurality of cores. Various other methods, systems, and computer-readable media are also disclosed.Type: ApplicationFiled: December 28, 2023Publication date: July 3, 2025Applicant: Advanced Micro Devices, Inc.Inventors: Moumita Dey, Heather Lynn Hanson, Aarti Choudhary, Srilatha Manne
-
Publication number: 20250216888Abstract: Temporary system adjustment for component overclocking is described. In accordance with the described techniques, a processor and/or memory are operated according to first settings. During operation of the processor and/or the memory according to the first settings, a signal triggers a temporary adjustment of operation of the processor and/or the memory according to second settings. Responsive to the request, operation of the processor and/or the memory is switched to the second settings without rebooting. After a duration, operation of the processor and/or the memory is switched back to the first settings. In one or more implementations, at least one of the first settings or the second settings overclock the processor and/or the memory.Type: ApplicationFiled: December 28, 2023Publication date: July 3, 2025Applicants: Advanced Micro Devices, Inc., ATI TECHNOLOGIES ULCInventors: Wayne Paul Rodrigue, Grant Evan Ley, Jerry Anton Ahrens, JR., Coralie So, Xianglong Du, Nicholas Carmine DeFiore, Ronald James Baughman, Joshua Taylor Knight, William Robert Alverson
-
Publication number: 20250217692Abstract: A quantum computing device includes a plurality of quantum parallel processing units (Q-PPUs) configured to execute a set of quantum instructions of a quantum application program. The quantum computing device includes an adaptive quantum instruction scheduler to dynamically distribute the set of quantum instructions to the plurality of Q-PPUs based, at least in part, upon a measured probability of a desired result of executing the set of quantum instructions of the quantum application program and a decoherence time of a qubit.Type: ApplicationFiled: December 28, 2023Publication date: July 3, 2025Applicant: Advanced Micro Devices, Inc.Inventors: DaZheng WANG, Jie ZHANG, Zhenyu XU
-
Publication number: 20250217287Abstract: An example device can include at least one network controller configured to receive a data request and to retrieve data based on the data request, and a cache agent configured to receive a data access parameter based on the data request, and reconfigure a cache for at least one memory cache based on the data access parameter. The data request can be received from a computer device and the data can be retrieved from at least one memory device. An example data access parameter can include a latency of at least one network-attached memory device to retrieve data from the at least one memory device based on the data request. An example device can further comprises a flit profiler configured to determine the data access parameter. Various other methods, systems, and computer-readable media are also disclosed.Type: ApplicationFiled: December 28, 2023Publication date: July 3, 2025Applicant: Advanced Micro Devices, Inc.Inventors: Sergey Blagodurov, Vamsee Reddy Kommareddy, Pratik Mishra, Nathaniel Morris, Kevin Y. Cheng