Patents Assigned to Advanced Micro Devices
  • Patent number: 9679345
    Abstract: A frame pacing method, computer program product, and computing system are provided for graphics processing. A method and system for frame pacing adds a delay which evenly spaces out the display of the subsequent frames, and a measurement mechanism which measures and adjusts the delay as application workload changes in an evenly spaced manner.
    Type: Grant
    Filed: August 6, 2015
    Date of Patent: June 13, 2017
    Assignee: ADVANCED MICRO DEVICES, INC.
    Inventors: Jonathan Lawrence Campbell, Mitchell H. Singer, Yuping Shen, Yue Zhuo
  • Publication number: 20170161212
    Abstract: Described is a method and apparatus for application migration between a dockable device and a docking station in a seamless manner. The dockable device includes a processor and the docking station includes a high-performance processor. The method includes determining a docking state of a dockable device while at least an application is running. Application migration from the dockable device to a docking station is initiated when the dockable device is moving to a docked state. Application migration from the docking station to the dockable device is initiated when the dockable device is moving to an undocked state. The application continues to run during the application migration from the dockable device to the docking station or during the application migration from the docking station to the dockable device.
    Type: Application
    Filed: December 2, 2015
    Publication date: June 8, 2017
    Applicant: Advanced Micro Devices, Inc.
    Inventors: Jonathan Lawrence Campbell, Yuping Shen
  • Publication number: 20170161114
    Abstract: A computing device is disclosed. The computing device includes an Accelerated Processing Unit (APU) including at least a first Heterogeneous System Architecture (HSA) computing device and at least a second HSA computing device, the second computing device being a different type than the first computing device, and an HSA Memory Management Unit (HMMU) allowing the APU to communicate with at least one memory. The computing task is enqueued on an HSA-managed queue that is set to run on the at least first HSA computing device or the at least second HSA computing device. The computing task is re-enqueued on the HSA-managed queue based on a repetition flag that triggers the number of times the computing task is re-enqueued. The repetition field is decremented each time the computing task is re-enqueued. The repetition field may include a special value (e.g., ?1) to allow re-enqueuing of the computing task indefinitely.
    Type: Application
    Filed: December 8, 2015
    Publication date: June 8, 2017
    Applicant: Advanced Micro Devices, Inc.
    Inventors: Walter B. Benton, Steven K. Reinhardt
  • Publication number: 20170163282
    Abstract: Methods and apparatus are described. A method, implemented in a decoder, includes receiving two or more signals from an encoder over two or more respective wires. At least one of the two or more signals includes at least one code that was recoded by the encoder. The decoder receives a recoding table. The recoding table provides a mapping indicating the recoding for each code that was recoded by the encoder in the received two or more signals. The decoder decodes the two or more received signals using the received recoding table.
    Type: Application
    Filed: February 17, 2017
    Publication date: June 8, 2017
    Applicant: Advanced Micro Devices, Inc.
    Inventor: Greg Sadowski
  • Patent number: 9673849
    Abstract: Systems, apparatuses, and methods for performing common mode extraction for data communication are disclosed. A circuit is configured to receive a single-ended data signal on a first input port and couple the data signal to a positive input terminal of a receiver component. The circuit is also configured to receive a differential clock signal on second and third input ports and generate a reference signal from the differential clock signal. In one embodiment, the reference signal is generated from an average of the differential clock signal. The circuit is configured to couple the reference signal to a negative input terminal of the receiver component. In one embodiment, the receiver component is an amplifier.
    Type: Grant
    Filed: August 16, 2016
    Date of Patent: June 6, 2017
    Assignee: Advanced Micro Devices, Inc.
    Inventors: Milam Paraschou, Edoardo Prete
  • Patent number: 9674523
    Abstract: Methods and apparatus for transcoding digital video data are disclosed. In an embodiment, a transcoder (300) decodes a digital video block (304) using a first coding scheme, such as 8×8 MPEG-2/4, to produce domain transformed data (306) and a motion vector (308). The transcoder (300) then estimates an energy level of each sub-block in the digital video block (304) in the frequency domain (as opposed to the spatial domain), thereby reducing or eliminating the need for motion compensation. For each sub-block with an estimated energy level below a desired threshold (e.g., likely an all-zero sub-block), the transcoder (300) transcodes the sub-block by converting the motion vector (308) from the first coding scheme (e.g., MPEG-2/4) to the second coding scheme (e.g., H.264) (e.g., convert 8×8 MPEG-2/4 vector to 4×4 H.264 vector or reuse the MPEG-2/4 vector if all four sub-blocks are AZB and coding in H.264 as an 8×8 block). The transcoded sub-block may then be used (e.g., stored or transmitted).
    Type: Grant
    Filed: November 14, 2012
    Date of Patent: June 6, 2017
    Assignee: Advanced Micro Devices, Inc.
    Inventor: Wei-Lien Hsu
  • Patent number: 9671767
    Abstract: A system and method for efficient management of operating modes within an integrated circuit (IC) for optimal power and performance targets. A semiconductor chip includes one or more processing units each of which operates with respective operating parameters. One or more temperature sensors are included to measure a temperature of the one or more processing units during operation. When the measured temperature exceeds a threshold, a power manager on the chip determines a temperature headroom utilizing temperature values based on worst-case ambient temperature. When the measured temperature does not exceed the threshold, the power manager determines the temperature headroom utilizing at least one temperature value based on room ambient temperature. Following, the power manager adjusts the respective operating parameters based on at least the temperature headroom.
    Type: Grant
    Filed: May 14, 2014
    Date of Patent: June 6, 2017
    Assignee: Advanced Micro Devices, Inc.
    Inventors: Samuel D. Naffziger, Benjamin D. Bates, Praveen K. Dongara
  • Patent number: 9672161
    Abstract: The described embodiments include a cache controller that configures a cache management mechanism. In the described embodiments, the cache controller is configured to monitor at least one structure associated with a cache to determine at least one cache block that may be accessed during a future access in the cache. Based on the determination of the at least one cache block that may be accessed during a future access in the cache, the cache controller configures the cache management mechanism.
    Type: Grant
    Filed: December 9, 2012
    Date of Patent: June 6, 2017
    Assignee: ADVANCED MICRO DEVICES, INC.
    Inventors: Gabriel H. Loh, Yasuko Eckert
  • Publication number: 20170147608
    Abstract: A method and apparatus for performing a search in a processor-in-memory (PIM) system having a first processor and at least one memory module includes receiving one or more images by the first processor. The first processor sends a query for a search of memory for a matching image to the one or more images to at least one memory module, which searches memory in the memory module, in response to the received query. The at least one memory module sends the results of the search to the first processor, and the first processor performs a comparison of the received results from the at least one memory module to the received one or more images.
    Type: Application
    Filed: November 23, 2015
    Publication date: May 25, 2017
    Applicant: Advanced Micro Devices, Inc.
    Inventor: Dong Ping Zhang
  • Patent number: 9658960
    Abstract: A method and apparatus for controlling affinity of subcaches is disclosed. When a core compute unit evicts a line of victim data, a prioritized search for space allocation on available subcaches is executed, in order of proximity between the subcache and the compute unit. The victim data may be injected into an adjacent subcache if space is available. Otherwise, a line may be evicted from the adjacent subcache to make room for the victim data or the victim data may be sent to the next closest subcache. To retrieve data, a core compute unit sends a Tag Lookup Request message directly to the nearest subcache as well as to a cache controller, which controls routing of messages to all of the subcaches. A Tag Lookup Response message is sent back to the cache controller to indicate if the requested data is located in the nearest sub-cache.
    Type: Grant
    Filed: December 22, 2010
    Date of Patent: May 23, 2017
    Assignee: ADVANCED MICRO DEVICES, INC.
    Inventor: Greggory D. Donley
  • Patent number: 9658895
    Abstract: The present disclosure relates to a method and system for configuring a computing system, such as a cloud computing system. A method includes providing a user interface comprising selectable boot-time configuration data and selecting, based on at least one user selection of the boot-time configuration data, a boot-time configuration for at least one node of a cluster of nodes of the computing system. The method further includes configuring the at least one node of the cluster of nodes with the selected boot-time configuration to modify at least one boot-time parameter of the at least one node.
    Type: Grant
    Filed: August 7, 2012
    Date of Patent: May 23, 2017
    Assignee: Advanced Micro Devices, Inc.
    Inventors: Mauricio Breternitz, Keith A. Lowery, Patryk Kaminski, Anton Chernoff
  • Patent number: 9659928
    Abstract: By forming a trench isolation structure after providing a high-k dielectric layer stack, direct contact of oxygen-containing insulating material of a top surface of the trench isolation structure with the high-k dielectric material in shared polylines may be avoided. This technique is self-aligned, thereby enabling further device scaling without requiring very tight lithography tolerances. After forming the trench isolation structure, the desired electrical connection across the trench isolation structure may be re-established by providing a further conductive material.
    Type: Grant
    Filed: March 11, 2015
    Date of Patent: May 23, 2017
    Assignee: Advanced Micro Devices, Inc.
    Inventors: Andy Wei, Roman Boschke, Markus Forsberg
  • Patent number: 9658663
    Abstract: A three-dimensional (3-D) processor stack includes a plurality of processor cores implemented in a plurality of layers. A controller is to selectively throttle one or more of a plurality of processor cores in response to detecting a thermal event. The controller selectively throttles the one or more of the plurality of processor cores based on values of thermal couplings between the plurality of layers and based on measures of criticality of threads executing on the plurality of processor cores.
    Type: Grant
    Filed: September 22, 2015
    Date of Patent: May 23, 2017
    Assignee: Advanced Micro Devices, Inc.
    Inventors: Wei Huang, Manish Arora, Yasuko Eckert, Indrani Paul
  • Publication number: 20170139748
    Abstract: A system and methods embodying some aspects of the present embodiments for efficient load balancing using predication flags are provided. The load balancing system includes a first processing unit, a second processing unit, and a shared queue. The first processing unit is in communication with a first queue. The second processing unit is in communication with a second queue. The first and second queues are each configured to hold a packet. The shared queue is configured to maintain a work assignment, wherein the work assignment is to be processed by either the first or second processing unit.
    Type: Application
    Filed: January 31, 2017
    Publication date: May 18, 2017
    Applicant: Advanced Micro Devices, Inc.
    Inventors: Vinod TIPPARAJU, Lee Howes, Thomas Scogland
  • Patent number: 9652390
    Abstract: Apparatus, computer readable medium, integrated circuit, and method of moving a plurality of data items to a first cache or a second cache are presented. The method includes receiving an indication that the first cache requested the plurality of data items. The method includes storing information indicating that the first cache requested the plurality of data items. The information may include an address for each of the plurality of data items. The method includes determining based at least on the stored information to move the plurality of data items to the second cache. The method includes moving the plurality of data items to the second cache. The method may include determining a time interval between receiving the indication that the first cache requested the plurality of data items and moving the plurality of data items to the second cache. A scratch pad memory is disclosed.
    Type: Grant
    Filed: August 5, 2014
    Date of Patent: May 16, 2017
    Assignee: Advanced Micro Devices, Inc.
    Inventors: JunLi Gu, Bradford M. Beckmann, Yuan Xie
  • Patent number: 9652019
    Abstract: A system and method for efficient management of operating modes within an integrated circuit (IC) for optimal power and performance targets. A semiconductor chip includes processing units each of which operates with respective operating parameters. Temperature sensors are included to measure a temperature of the one or more processing units during operation. A power manager determines a calculated power value independent of thermal conditions and current draw. The power manager reads each of a first thermal design power (TDP) value for the processing units and a second TDP value for a platform housing the semiconductor chip. The power manager determines a ratio of the first TDP value to the second TDP value. Additionally, the power manager determines another ratio of the first TDP value to the calculated power value. Using the measured temperature, the ratios and the calculated power value, the power manager determines a manner to adjust the operating parameters.
    Type: Grant
    Filed: June 2, 2014
    Date of Patent: May 16, 2017
    Assignees: Advanced Micro Devices, Inc., ATI Technologies ULC
    Inventors: Praveen K. Dongara, Aniruddha Dasgupta, Adam Clark
  • Patent number: 9652305
    Abstract: A processor includes an execution unit to execute instructions and a scheduler unit to store a queue of instructions for execution by the execution unit. The scheduler unit includes a wake array including a plurality of source slots to store source identifiers for sources associated with the instructions, a picker to schedule a particular instruction for execution in the execution unit, broadcast a destination identifier associated with the particular instruction to a first subset of the source slots, and a delay element to receive the destination identifier broadcast by the picker and communicate a delayed version of the destination identifier to a second subset of the source slots different from the first subset.
    Type: Grant
    Filed: August 6, 2014
    Date of Patent: May 16, 2017
    Assignee: Advanced Micro Devices, Inc.
    Inventors: Srikanth Arekapudi, Emil Talpes, Sahil Arora
  • Patent number: 9645854
    Abstract: A method, system and article of manufacture for balancing a workload on heterogeneous processing devices. The method comprising accessing a memory storage of a processor of one type by a dequeuing entity associated with a processor of a different type, identifying a task from a plurality of tasks within the memory that can be processed by the processor of the different type, synchronizing a plurality of dequeuing entities capable of accessing the memory storage, and dequeuing the task form the memory storage.
    Type: Grant
    Filed: November 2, 2011
    Date of Patent: May 9, 2017
    Assignee: Advanced Micro Devices, Inc.
    Inventors: Benjamin Thomas Sander, Michael Houston, Newton Cheung, Keith Lowery
  • Publication number: 20170123670
    Abstract: A memory-to-memory copy operation control system includes a processor configured to receive an instruction to perform a memory-to-memory copy operation and a memory module network in communication with the processor. The memory module network has a plurality of memory modules that include a proximal memory module in direct communication with the processor and one or more additional memory modules in communication with the processor via the proximal memory module. The system also includes a memory controller in communication with the processor and the network of memory modules. The processor is configured to issue a first command causing data to be copied from a first memory module to a second memory module without sending the data to the processor or the memory controller.
    Type: Application
    Filed: October 28, 2015
    Publication date: May 4, 2017
    Applicant: ADVANCED MICRO DEVICES, INC.
    Inventors: Nuwan Jayasena, David A. Roberts
  • Patent number: 9639371
    Abstract: A system and method for efficiently processing instructions in hardware parallel execution lanes within a processor. In response to a given divergent point within an identified loop, a compiler generates code wherein when executed determines a size of a next very large instruction world (VLIW) to process and determine multiple pointer values to store in multiple corresponding PC registers in a target processor. The updated PC registers point to instructions intermingled from different basic blocks between the given divergence point and a corresponding convergence point. The target processor includes a single instruction multiple data (SIMD) micro-architecture. The assignment for a given lane is based on branch direction found at runtime for the given lane at the given divergent point. The processor includes a vector register for mapping PC registers to execution lanes.
    Type: Grant
    Filed: January 29, 2013
    Date of Patent: May 2, 2017
    Assignee: Advanced Micro Devices, Inc.
    Inventor: Reza Yazdani