Patents Assigned to Advanced Micro Devices
-
Patent number: 12333309Abstract: A coprocessor such as a floating-point unit includes a pipeline that is partitioned into a first portion and a second portion. A controller is configured to provide control signals to the first portion and the second portion of the pipeline. A first physical distance traversed by control signals propagating from the controller to the first portion of the pipeline is shorter than a second physical distance traversed by control signals propagating from the controller to the second portion of the pipeline. A scheduler is configured to cause a physical register file to provide a first subset of bits of an instruction to the first portion at a first time. The physical register file provides a second subset of the bits of the instruction to the second portion at a second time subsequent to the first time.Type: GrantFiled: June 16, 2023Date of Patent: June 17, 2025Assignee: Advanced Micro Devices, Inc.Inventors: Jay Fleischman, Michael Estlick, Michael Christopher Sedmak, Erik Swanson, Sneha V. Desai
-
Patent number: 12332824Abstract: The disclosed semiconductor package includes a first chiplet area for receiving a first chiplet, a second chiplet area for receiving a second chiplet, and a host die coupled to the first and second chiplet areas. The semiconductor package also includes an interconnect directly coupling the first chiplet area to the second chiplet area. Various other methods, systems, and computer-readable media are also disclosed.Type: GrantFiled: December 28, 2022Date of Patent: June 17, 2025Assignee: Advanced Micro Devices, Inc.Inventors: Gabriel Hsiuwei Loh, Todd David Basso
-
Patent number: 12333158Abstract: A data processor is adapted to couple to a memory. The data processor includes a memory operation array, a power engine, and an initialization circuit. The memory operation array includes a command portion and a data portion. The power engine has an input for receiving power state change request signals and an output for providing memory operations responsive to instructions stored in the command portion. The initialization circuit populates the data portion such that consecutive memory operations are separated by an amount corresponding to a predetermined minimum timing parameter.Type: GrantFiled: June 29, 2022Date of Patent: June 17, 2025Assignee: Advanced Micro Devices, Inc.Inventors: Jean J. Chittilappilly, Kevin M. Brandl, Michael L. Choate
-
Patent number: 12332722Abstract: An apparatus and method for efficient power management of multiple integrated circuits. In various implementations, a computing system includes an integrated circuit with a security processor. The security processor determines the integrated circuit transitions to an active state from a sleep state that is not intended to maintain configuration information to return to the active state without restarting an operating system. In the sleep state, multiple components of the integrated circuit have a power supply reference level turned off, which provides low power consumption for the integrated circuit. The security processor performs the bootup operation using information stored in persistent on-chip memory. By not using information stored in off-chip memory, the security processor reduces the latency of the transition.Type: GrantFiled: March 24, 2023Date of Patent: June 17, 2025Assignees: Advanced Micro Devices, Inc., ATI Technologies ULCInventors: Gia Tung Phan, Randall Brown, Ashish Jain
-
Publication number: 20250190367Abstract: According to one aspect, a system includes a processor core, a history buffer, a history buffer logic, and an interrupt controller. The processor core is configured to issue a trigger that causes the history buffer logic to, after a fixed time, flush existing entries in the history buffer and start adding new entries into the history buffer. According to another aspect, a system includes a processor core that includes a load store tracker buffer and a prefetch engine. The buffer is configured to track a critical section memory location associated with critical section data fetched by the processor core. The prefetch engine is configured to obtain the critical section memory location from a previous load store tracker buffer associated with a previous lock holder processor core. The system also includes a lock manager configured to signal the buffer to start and stop tracking the critical section memory.Type: ApplicationFiled: December 12, 2023Publication date: June 12, 2025Applicant: Advanced Micro Devices, Inc.Inventors: Shaju Abraham, Akash A, Naveen M, Shreeroop Ajaykumar
-
Publication number: 20250190221Abstract: A disclosed method may include initializing a deep learning recommendation model (DLRM) comprising a plurality of embedding tables, each embedding table comprising a plurality of embeddings. The method may also include receiving input data associated with accessing embeddings from the plurality of embedding tables and applying a parallelization strategy to process the plurality of embedding tables, the parallelization strategy configured to improve performance by distributing computational workloads and optimizing memory access. The method may also include processing the embeddings based on the input data in accordance with the parallelization strategy, the processing comprising aggregating embeddings accessed from the plurality of embedding tables. The method may also include generating, for further processing, output data based on the processed embeddings. Various other methods, systems, and computer-readable media are also disclosed.Type: ApplicationFiled: December 9, 2024Publication date: June 12, 2025Applicants: Advanced Micro Devices, Inc., Xilinx, Inc.Inventors: Krishnakumar Nair, Meenakshi Arunachalam, John Kalamatianos, Rishabh Jain, Varun Agrawal, Avinash Chandra Pandey, Siddappa Yallappa Karabannavar, Ashish Sirasao, Elliott Delaye
-
Publication number: 20250190253Abstract: The disclosed device includes multiple physical processor cores including enabled cores and disabled cores. The device also includes a controller that can track a total aging value for each core and facilitate swapping out enabled cores for disabled cores to manage core lifespans. Various other methods, systems, and computer-readable media are also disclosed.Type: ApplicationFiled: June 12, 2023Publication date: June 12, 2025Applicant: Advanced Micro Devices, Inc.Inventors: Srilatha Manne, Madhu Saravana Sibi Govindan
-
Publication number: 20250190282Abstract: An example system includes a plurality of wait queues implemented in a first portion of a memory unit configured to store a plurality of lock requests. The system includes a lock head array implemented in a second portion of the memory unit configured to store a plurality of lock heads. Each wait queue of the plurality of wait queues is mapped to one lock head in the lock head array. The system includes a lock waiter array implemented in a third portion of the memory unit configured to store a number of entries corresponding to each lock head in the lock head array. The system also includes a spinlock controller including hardware circuitry configured to execute a lock logic responsive to a lock acquire request of the plurality of lock requests, and to execute an unlock logic responsive to a lock release request of the plurality of lock requests.Type: ApplicationFiled: December 11, 2023Publication date: June 12, 2025Applicant: Advanced Micro Devices, Inc.Inventors: Shreeroop Ajaykumar, Shaju Abraham, Naveen M
-
Publication number: 20250191600Abstract: The disclosed computer-implemented method includes transforming, from a time domain into a frequency domain, a sound signal into a transformed sound signal. The transformed sound signal has a phase component and a magnitude component. The method also includes filtering the phase component of the transformed sound signal by applying a quantized mask from a machine-learning model to the phase component, and generating a filtered sound signal by transforming, from the frequency domain into the time domain, the transformed sound signal comprising the magnitude component and the filtered phase component. Various other methods, systems, and computer-readable media are also disclosed.Type: ApplicationFiled: December 23, 2022Publication date: June 12, 2025Applicants: Advanced Micro Devices, Inc., ATI Technologies ULCInventors: Carl Wakeland, Geoffrey Park
-
Publication number: 20250192763Abstract: The disclosed device can include a dual-tail sampler. The dual-tail sampler can include a first stage with an input pair, a cross-coupled load circuit, a precharge device between drain nodes of the input pair, and at least one pass-gate switch between the input pair and the cross-coupled load circuit. Various other devices and systems are also disclosed.Type: ApplicationFiled: May 4, 2023Publication date: June 12, 2025Applicant: Advanced Micro Devices, Inc.Inventors: Raghavendra Rukmani Gowrishankar, Kamlesh Satyadev Singh
-
Publication number: 20250194019Abstract: Disclosed is a computer-implemented method for degassing for the manufacture of a high speed design. The method includes analyzing information related to a printed circuit board (PCB) that includes a set of layers and a plurality of voids. The method identifies a void from the plurality of voids, where the void has a position among the PCB set of layers. The method determines a radius associated with the identified void, where the radius is based on a center of the identified void. The method performs a trace selection and executes a shift algorithm based on the trace selection, where the shift algorithm includes a modification of the information related to the void. As a result, the method can generate a grid for degassing based on execution of the shift algorithm. Various other methods, systems, and computer-readable media are also disclosed.Type: ApplicationFiled: April 26, 2023Publication date: June 12, 2025Applicant: Advanced Micro Devices, Inc.Inventor: Supatta Niramarnkarn
-
Publication number: 20250191619Abstract: A technique is provided. The technique includes identifying memory cells, of a set of memory cells to power down, based on a set of priorities for the set of memory cells; powering down the identified memory cells in accordance with the set of priorities, resulting in powered down memory cells; and performing processing in accordance with the powered down memory cells.Type: ApplicationFiled: December 8, 2023Publication date: June 12, 2025Applicant: Advanced Micro Devices, Inc.Inventor: Ali Haidous
-
Publication number: 20250192024Abstract: A semiconductor device includes a first metal layer including a plurality of first ground wire pairs alternating with a plurality of first power wire pairs and a second metal layer including a plurality of second ground wire pairs alternating with a plurality of second power wire pairs. A metal-insulator-metal capacitor (MIMCAP) is between the first metal layer and the second metal layer. A group of ground vias connects a pair of the first ground wire pairs with a pair of the second ground wire pairs. The group of ground vias can also connect to a ground plate of the MIMCAP. A group of power vias connects a pair of the first power wire pairs with a pair of the second power wire pairs. The group of power vias can also connect to a power plate of the MIMCAP. Various other methods and systems are also disclosed.Type: ApplicationFiled: December 6, 2023Publication date: June 12, 2025Applicant: Advanced Micro Devices, Inc.Inventors: Douglas Stirrett, Thomas Michael Daum, Jeffrey Lucas
-
Patent number: 12327124Abstract: An array processor includes processor element arrays distributed in rows and columns. The processor element arrays perform operations on parameter values. The array processor also includes memory interfaces that broadcast sets of the parameter values to mutually exclusive subsets of the rows and columns of the processor element arrays. In some cases, the array processor includes single-instruction-multiple-data (SIMD) units including subsets of the processor element arrays in corresponding rows, workgroup processors (WGPs) including subsets of the SIMD units, and a memory fabric configured to interconnect with an external memory that stores the parameter values. The memory interfaces broadcast the parameter values to the SIMD units that include the processor element arrays in rows associated with the memory interfaces and columns of processor element arrays that are implemented across the SIMD units in the WGPs. The memory interfaces access the parameter values from the external memory via the memory fabric.Type: GrantFiled: March 30, 2023Date of Patent: June 10, 2025Assignee: Advanced Micro Devices, Inc.Inventors: Sateesh Lagudu, Allen H. Rush, Michael Mantor, Arun Vaidyanathan Ananthanarayan, Prasad Nagabhushanamgari, Maxim V. Kazakov
-
Patent number: 12327580Abstract: A memory device includes a memory circuitry includes a first transmission grate, a first capacitor, a second transmission gate, and a second capacitor. The first transmission gate includes a first transistor connected between a first node and a second node. The first transistor having a gate terminal connected to a first clock node. The first clock node configured to receive a first clock signal. The first capacitor is connected between the second node and a first voltage node. The first capacitor is a ferroelectric capacitor. The second transmission gate includes a second transistor connected between the second node and a third node. The second transistor has a gate terminal connected to the first clock node. The second capacitor is connected between the third node and a second voltage node.Type: GrantFiled: June 29, 2023Date of Patent: June 10, 2025Assignee: Advanced Micro Devices, Inc.Inventors: Divya Madapusi Srinivas Prasad, Michael Ignatowski
-
Patent number: 12327608Abstract: A static random-access memory (SRAM) circuit includes an SRAM bitcell coupled to a word line, a bit line and a complementary bit line. A precharge circuit is coupled to the bit line and the complementary bit line and includes a precharge input. A first keeper transistor is coupled to the bit line and a second keeper transistor is coupled to the complementary bit line. A write driver circuit includes a select input receiving a select signal, a write data input, and a write data compliment input, and is operable to write a data bit to the SRAM bitcell. A combinatorial logic circuit provides a precharge signal to the precharge circuit based on the select signal and a bit line precharge signal.Type: GrantFiled: December 29, 2022Date of Patent: June 10, 2025Assignees: Advanced Micro Devices, Inc., ATI Technologies ULCInventors: Russell Schreiber, Sahilpreet Singh
-
Publication number: 20250183226Abstract: A semiconductor device includes a first logic die comprising: a clock source configured to generate a clock signal; and a first clock mesh for receiving the clock signal from the clock source. The device includes a second logic die stacked over the first logic die, the second logic die comprising: a second clock mesh for receiving the clock signal from the clock source. The device includes a plurality of conductive connections between the first clock mesh and the second clock mesh to transmit the clock signal from the first clock mesh to the second clock mesh. Various other methods and systems are also disclosed.Type: ApplicationFiled: December 1, 2023Publication date: June 5, 2025Applicant: Advanced Micro Devices, Inc.Inventors: Russell Schreiber, John Wuu, Spence Oliver
-
Publication number: 20250181384Abstract: Task graph control techniques for data transfer are described. The task graph control techniques are usable to aggregate data from multiple tasks into an aggregated data transfer, thereby improving operational efficiency and device performance. In a first example, a runtime scheduler executed on a command processor is implemented to select a node during execution of tasks of the task graph. The selected node is assigned by the runtime schedule to transfer aggregated data from that node and parent of that node. In a second example, a compiler of a host device is tasked with generating the task graph. As part of generating the task graph, the compiler also inserts one or more data transfer nodes. The location of the data transfer node within the task graph by the compiler is used to specify when a data transfer is to be performed.Type: ApplicationFiled: November 30, 2023Publication date: June 5, 2025Applicant: Advanced Micro Devices, Inc.Inventors: Ali Arda Eker, Anthony Thomas Gutierrez, Stephen Alexander Zekany
-
Publication number: 20250182377Abstract: A technique for performing inside/outside testing is provided. To determine if a ray intersects a convex polygon, an inside/outside test is commonly performed by checking which side of an edge the ray passes. By efficiently sharing edge test results among polygons with shared edges, inside/outside testing for groups of polygons can be made more efficient. This optimization can be achieved using either full precision floating-point math or reduced precision (e.g., fixed-point math) to make hardware-based testing more cost-effective.Type: ApplicationFiled: December 1, 2023Publication date: June 5, 2025Applicant: Advanced Micro Devices, Inc.Inventor: Andrew Erin Kensler
-
Patent number: 12321744Abstract: A computer-implemented method for hardware gather optimization can include identifying, by at least one processor, one or more gather instructions that retrieve data from contiguous memory locations. The method can additionally include converting, by the at least one processor, the one or more gather instructions into one or more strided load instructions in response to the identification. The method can also include loading, by the at least one processor, data retrieved using the one or more strided load instructions into one or more vector registers. Various other methods, systems, and computer-readable media are also disclosed.Type: GrantFiled: June 27, 2023Date of Patent: June 3, 2025Assignee: Advanced Micro Devices, Inc.Inventor: Ashish Jha