Patents Assigned to Advanced Micro Devices
-
Patent number: 9965392Abstract: Existing multiprocessor computing systems often have insufficient memory coherency and, consequently, are unable to efficiently utilize separate memory systems. Specifically, a CPU cannot effectively write to a block of memory and then have a GPU access that memory unless there is explicit synchronization. In addition, because the GPU is forced to statically split memory locations between itself and the CPU, existing multiprocessor computing systems are unable to efficiently utilize the separate memory systems. Embodiments described herein overcome these deficiencies by receiving a notification within the GPU that the CPU has finished processing data that is stored in coherent memory, and invalidating data in the CPU caches that the GPU has finished processing from the coherent memory. Embodiments described herein also include dynamically partitioning a GPU memory into coherent memory and local memory through use of a probe filter.Type: GrantFiled: August 24, 2016Date of Patent: May 8, 2018Assignees: Advanced Micro Devices, Inc., ATI Technologies ULCInventors: Anthony Asaro, Kevin Normoyle, Mark Hummel
-
Patent number: 9965134Abstract: A method and apparatus provides a user an interface for a file system. In one example, the method and apparatus displays the file as a visualized object, e.g., a graphical representation of the file as a real life object, receives selection of visualized objects and activates data elements represented by the visualized objects. The visualization of the file may be determined based on visualizer identification information associated with the file. For the activated data elements, the method and apparatus displays tool interfaces, in combination with the visualized objects. The tool interfaces may be selectively displayed for the activated data element base on tool identification information associated with a data type of the data element. Furthermore, the method and apparatus can process the activated data elements using the selected tool actions from different programs.Type: GrantFiled: September 14, 2012Date of Patent: May 8, 2018Assignee: Advanced Micro Devices, Inc.Inventor: David M. Lynch
-
Patent number: 9965329Abstract: The methods and apparatus can assign processing core workloads to processing cores from a heterogeneous instruction set architectures (ISA) pool of available processing cores based on processing core metric results. For example, the method and apparatus can obtain processing core metric results for one or more processing cores, such as processing cores within general purpose processors, from a heterogeneous ISA pool of available processing cores. The method and apparatus can also obtain one or more processing core workloads, such as software applications or software processes, from a pool of available processing core workloads to be assigned. The method and apparatus can then assign one or more processing core workloads that have higher priority than others from the pool of available processing core workloads to a processing core from the heterogeneous ISA pool of available processing cores based on its processing core metric result.Type: GrantFiled: October 12, 2015Date of Patent: May 8, 2018Assignee: Advanced Micro Devices, Inc.Inventor: Sergey Blagodurov
-
Publication number: 20180121386Abstract: A super single instruction, multiple data (SIMD) computing structure and a method of executing instructions in the super-SIMD is disclosed. The super-SIMD structure is capable of executing more than one instruction from a single or multiple thread and includes a plurality of vector general purpose registers (VGPRs), a first arithmetic logic unit (ALU), the first ALU coupled to the plurality of VGPRs, a second ALU, the second ALU coupled to the plurality of VGPRs, and a destination cache (Do$) that is coupled via bypass and forwarding logic to the first ALU, the second ALU and receiving an output of the first ALU and the second ALU. The Do$ holds multiple instructions results to extend an operand by-pass network to save read and write transactions power. A compute unit (CU) and a small CU including a plurality of super-SIMDs are also disclosed.Type: ApplicationFiled: November 17, 2016Publication date: May 3, 2018Applicant: Advanced Micro Devices, Inc.Inventors: Jiasheng Chen, Angel E. Socarras, Michael Mantor, YunXiao Zou, Bin He
-
Publication number: 20180124404Abstract: A method and a non-transitory computer readable medium for decompressing an image including one or more regions are presented. A region of the image is selected to be decoded. The region and metadata associated with the region are decoded, the metadata including transformation and quantization settings used to compress the region. A reconstruction transformation is applied to the region using the transformation and quantization settings.Type: ApplicationFiled: December 18, 2017Publication date: May 3, 2018Applicant: Advanced Micro Devices, Inc.Inventors: Andrew S. Pomianowski, Konstantine Iourcha
-
Patent number: 9959122Abstract: A method includes allocating a first single-cycle instruction to a first pipeline that picks single-cycle instructions for execution in program order. The method further includes marking at least one source register of the first single-cycle instruction as ready for execution in the first pipeline in response to all older single-cycle instructions allocated to the first pipeline being ready and eligible to be picked for execution. An apparatus includes a decoder to decode a first single-cycle instruction and to allocate the first single-cycle instruction to a first pipeline. The apparatus further includes a scheduler to pick single-cycle instructions for execution by the first pipeline in program order and to mark at least one source register of the first single-cycle instruction as ready for execution in the first pipeline in response to determining that all older single-cycle instructions allocated to the first pipeline are ready and eligible.Type: GrantFiled: April 24, 2013Date of Patent: May 1, 2018Assignee: Advanced Micro Devices, Inc.Inventors: Michael D. Estlick, Jay E. Fleischman, Kevin A. Hurd, Mark M. Gibson, Kelvin D. Goveas, Brian M. Lay
-
Patent number: 9958921Abstract: A method includes controlling a power limit of a computing system based on a determined skin temperature of at least one location on an outer surface of a device housing the computing system. A processor includes a processing unit and a power management controller to control a power limit of the processing unit based on a determined skin temperature of at least one location on an outer surface of a device housing the processor.Type: GrantFiled: March 9, 2015Date of Patent: May 1, 2018Assignee: Advanced Micro Devices, Inc.Inventors: Ashish Jain, Benjamin David Bates, Ali Akbar Merrikh, Samuel David Naffziger, Steven Frederick Liepe, Madhu Saravana Sibi Govindan
-
Publication number: 20180113814Abstract: A method and apparatus for reducing dynamic power consumption in a multi-thread content-addressable memory (CAM) is described. The disclosed apparatus includes a first input configured to receive a first virtual address corresponding to a first thread, a second input configured to receive a second virtual address corresponding to a second thread, a register bank including a plurality of registers each configured to store a binary word mapped to one of a plurality of physical addresses, a first comparator bank including a first plurality of comparators each coupled to one of the plurality of registers in a fully-associative configuration and configured to determine whether a first match is present, and a second comparator bank including a second plurality of comparators each coupled to one of the plurality of registers in a fully-associative configuration and configured to determine whether a second match is present.Type: ApplicationFiled: December 21, 2017Publication date: April 26, 2018Applicant: Advanced Micro Devices, Inc.Inventor: Anthony J. Bybell
-
Publication number: 20180113648Abstract: A data processing system includes a memory channel and a data processor coupled to the memory channel. The data processor includes a memory controller coupled to the memory channel and is adapted to access at least one rank of double data rate memory. The memory controller includes a command queue for storing received memory access requests, and an arbiter for picking memory access requests from the command queue, and then providing the memory access requests to the memory channel. The memory access requests are selected based on predetermined criteria, and in response to a mode register access request to quiesce pending operations. Additionally, the memory controller includes a mode register access controller that in response to the mode register access request, generates at least one corresponding mode register set command to a memory bus. The memory controller then relinquishes control of the memory bus to the arbiter thereafter.Type: ApplicationFiled: October 21, 2016Publication date: April 26, 2018Applicant: Advanced Micro Devices, Inc.Inventors: Kevin M. Brandl, Scott P. Murphy, James R. Magro, Paramjit K. Lubana
-
Publication number: 20180113709Abstract: A method and apparatus for performing a multi-precision computation in a plurality of arithmetic logic units (ALUs) includes pairing a first Single Instruction/Multiple Data (SIMD) block channel device with a second SIMD block channel device to create a first block pair having one-level staggering between the first and second channel devices. A third SIMD block channel device is paired with a fourth SIMD block channel device to create a second block pair having one-level staggering between the third and fourth channel devices. A plurality of source inputs are received at the first block pair and the second block pair. The first block pair computes a first result, and the second block pair computes a second result.Type: ApplicationFiled: November 3, 2016Publication date: April 26, 2018Applicant: Advanced Micro Devices, Inc.Inventors: Bin He, YunXiao Zou, Jiasheng Chen, Michael Mantor
-
Patent number: 9953687Abstract: An interlock circuit utilizes a single combinatorial pseudo-dynamic logic gate to take inputs from two voltage domains at the same time without requiring either input to be level shifted. The interlock design allows hold timing to be met across a large voltage range of both supplies in a dual-voltage supply environment while not significantly hurting setup time by having much lower latency than the latency of a level shifter.Type: GrantFiled: October 21, 2016Date of Patent: April 24, 2018Assignee: Advanced Micro Devices, Inc.Inventors: John J. Wuu, Ryan Freese, Russell J. Schreiber
-
Publication number: 20180107627Abstract: Methods, devices, and systems for transmitting data over a computer communications network are disclosed. A queue of communications commands can be pre-generated using a central processing unit (CPU) and stored in a device memory of a network interface controller (NIC). Thereafter, if a graphics processing unit (GPU) has data to communicate to a remote GPU, it can store the data in a send buffer, where the location in the buffer is pointed to by a pre-generated command. The GPU can then signal to the interface device that the data is ready, triggering execution of the pre-generated command to send the data.Type: ApplicationFiled: October 18, 2016Publication date: April 19, 2018Applicant: Advanced Micro Devices, Inc.Inventors: Michael W. LeBeane, Steven K. Reinhardt
-
Patent number: 9946278Abstract: A processor system includes first and second regulators for regulating an adjusted supply voltage. In one embodiment, the regulator system comprises a digital low-dropout (DLDO) control system comprising first and second regulators that generate a plurality of control signals to regulate an adjusted power supply voltage and that generate a charge when a droop level falls below a droop threshold value. The first regulator implements a first control loop and the second regulator implements a second and much faster acting control loop. A supply adjustment block with the two regulators and control loops are provided for each processor core allowing different cores to have different regulated supply levels all based on one common supply.Type: GrantFiled: May 22, 2015Date of Patent: April 17, 2018Assignee: Advanced Micro Devices, Inc.Inventors: Miguel Rodriguez, Stephen Victor Kosonocky, Ravinder Reddy Rachala
-
Patent number: 9946646Abstract: A system for managing cache utilization includes a processor core, a lower-level cache, and a higher-level cache. In response to activating the higher-level cache, the system counts lower-level cache victims evicted from the lower-level cache. While a count of the lower-level cache victims is not greater than a threshold number, the system transfers each lower-level cache victim to a system memory without storing the lower-level cache victim to the higher-level cache. When the count of the lower-level cache victims is greater than the threshold number, the system writes each lower-level cache victim to the higher-level cache. In this manner, if the higher-level cache is deactivated before the threshold number of lower-level cache victims is reached, the higher-level cache is empty and thus may be deactivated without flushing.Type: GrantFiled: September 6, 2016Date of Patent: April 17, 2018Assignee: Advanced Micro Devices, Inc.Inventor: William L. Walker
-
Patent number: 9947386Abstract: A method of managing thermal levels in a memory system may include determining an expected thermal level associated with each of a plurality of locations in a memory structure, and for each operation of a plurality of operations addressed to the memory structure, assigning the operation to a target location of the plurality of physical locations in the memory structure based on a thermal penalty associated with the operation and the expected thermal level associated with the target location.Type: GrantFiled: September 21, 2014Date of Patent: April 17, 2018Assignee: Advanced Micro Devices, Inc.Inventors: Manish Arora, Indrani Paul, Yasuko Eckert, Nuwan Jayasena, Dong Ping Zhang
-
Patent number: 9946319Abstract: The described embodiments include a computing device with a first entity and a second entity. In the computing device, a management controller dynamically sets a power-state limit for the first entity based on a performance coupling and a thermal coupling between the first entity and the second entity.Type: GrantFiled: October 21, 2013Date of Patent: April 17, 2018Assignee: ADVANCED MICRO DEVICES, INC.Inventors: Indrani Paul, Manish Arora, Srilatha Manne, William L. Bircher
-
Patent number: 9940247Abstract: The present application describes embodiments of a method and apparatus for concurrently accessing dirty bits in a cache. One embodiment of the apparatus includes a cache configurable to store a plurality of lines. The lines are grouped into a plurality of subsets the plurality of lines. This embodiment of the apparatus also includes a plurality of dirty bits associated with the plurality of lines and first circuitry configurable to concurrently access the plurality of dirty bits associated with at least one of the plurality of subsets of lines.Type: GrantFiled: June 26, 2012Date of Patent: April 10, 2018Assignee: Advanced Micro Devices, Inc.Inventor: William L. Walker
-
Patent number: 9934148Abstract: A memory module stores memory access metadata reflecting information about memory accesses to the memory module. The memory access metadata can indicate the number of times a particular unit of data (e.g., a row of data, a unit of data corresponding to a cache line, and the like) has been read, written, had one or more of its bits flipped, and the like. Modifications to the embedded access metadata can be made by a control module at the memory module itself, thereby reducing overhead at a processor core. In addition, the control module can be configured to record different access metadata for different memory locations of the memory module.Type: GrantFiled: June 23, 2015Date of Patent: April 3, 2018Assignee: Advanced Micro Devices, Inc.Inventors: David A. Roberts, Sergey Blagodurov
-
Patent number: 9934551Abstract: Embodiments of the present invention are directed to improving the performance of anti-aliased image rendering. One embodiment is a method of rendering a pixel from an anti-aliased image. The method includes: storing a first set and a second set of samples from a plurality of anti-aliased samples of the pixel respectively in a first memory and a second memory; and rendering a determined number of said samples from one of only the first set or the first and second sets. Corresponding system and computer program product embodiments are also disclosed.Type: GrantFiled: September 30, 2016Date of Patent: April 3, 2018Assignee: Advanced Micro Devices, Inc.Inventor: Mark Fowler
-
Publication number: 20180088858Abstract: A processing apparatus is provided that includes NVRAM and one or more processors configured to process a first set and a second set of instructions according to a hierarchical processing scope and process a scoped persistence barrier residing in the program after the first instruction set and before the second instruction set. The barrier includes an instruction to cause first data to persist in the NVRAM before second data persists in the NVRAM. The first data results from execution of each of the first set of instructions processed according to the one hierarchical processing scope. The second data results from execution of each of the second set of instructions processed according to the one hierarchical processing scope. The processing apparatus also includes a controller configured to cause the first data to persist in the NVRAM before the second data persists in the NVRAM based on the scoped persistence barrier.Type: ApplicationFiled: September 23, 2016Publication date: March 29, 2018Applicant: Advanced Micro Devices, Inc.Inventors: Arkaprava Basu, Mitesh R. Meswani, Dibakar Gope, Sooraj Puthoor