Patents Assigned to Advanced Micros Devices, Inc.
-
Patent number: 10146282Abstract: The present disclosure relates to a method and system for securing a performance state change of one or more processors. A disclosed method includes detecting a request to change a current performance state of a processor to a target performance state, and adjusting an operating level tolerance range of the current performance state to include operating levels associated with a transition from the current performance state to the target performance state. A disclosed system includes an operating system module operative to transmit a request for a performance state change of at least one processing core. The system includes performance state control logic operative to change the performance state of the at least one processing core based on the request.Type: GrantFiled: October 30, 2014Date of Patent: December 4, 2018Assignee: Advanced Micro Devices, Inc.Inventors: Jonathan Hauke, Benjamin Tsien, Denis Rystsov
-
Patent number: 10147721Abstract: Various on-die-precision-resistor arrays, and methods of making and calibrating the same are disclosed. In one aspect, an apparatus is provided that includes a semiconductor chip and a precision resistor array on the semiconductor chip. A replica precision resistor array is on the semiconductor chip. The replica precision resistor array is configured to mimic the resistance behavior of the precision resistor array and has a characteristic resistance that is a function of temperature. The semiconductor chip is configured to calibrate the precision resistor array using the characterized resistance as a function of temperature, a resistance offset of the precision resistor array relative to the characterized resistance as a function of temperature, and a temperature of the precision resistor array.Type: GrantFiled: December 20, 2017Date of Patent: December 4, 2018Assignee: Advanced Micro Devices, Inc.Inventors: Sridhar V. Gada, Sonu Arora
-
Patent number: 10146698Abstract: A method and apparatus for reducing dynamic power consumption in a multi-thread content-addressable memory (CAM) is described. The disclosed apparatus includes a first input configured to receive a first virtual address corresponding to a first thread, a second input configured to receive a second virtual address corresponding to a second thread, a register bank including a plurality of registers each configured to store a binary word mapped to one of a plurality of physical addresses, a first comparator bank including a first plurality of comparators each coupled to one of the plurality of registers in a fully-associative configuration and configured to determine whether a first match is present, and a second comparator bank including a second plurality of comparators each coupled to one of the plurality of registers in a fully-associative configuration and configured to determine whether a second match is present.Type: GrantFiled: December 21, 2017Date of Patent: December 4, 2018Assignee: Advanced Micro Devices, Inc.Inventor: Anthony J. Bybell
-
Patent number: 10146549Abstract: A method, system, and computer program product synchronize a group of workitems executing an instruction stream on a processor. The processor is yielded by a first workitem responsive to a synchronization instruction in the instruction stream. A first one of a plurality of program counters is updated to point to a next instruction following the synchronization instruction in the instruction stream to be executed by the first workitem. A second workitem is run on the processor after the yielding.Type: GrantFiled: November 6, 2017Date of Patent: December 4, 2018Assignee: Advanced Micro Devices, Inc.Inventors: Lee W. Howes, Benedict R. Gaster, Michael C. Houston
-
Patent number: 10146504Abstract: Systems, apparatuses, and methods for performing a division operation are disclosed. In one embodiment, a processor includes at least one arithmetic logic unit and a register file. In response to detecting a request to perform a division operation between a dividend and a divisor, the processor generates an initial approximation of the reciprocal of the divisor. Then, the processor converts the initial approximation of the reciprocal of the divisor into a fractional fixed point representation. The processor also introduces a small error into the initial approximation of the reciprocal of the divisor. Then, the processor implements one or more Newton-Raphson iterations for refining the approximation of the reciprocal and then multiplies the final reciprocal value by the dividend to generate the quotient.Type: GrantFiled: February 24, 2017Date of Patent: December 4, 2018Assignee: Advanced Micro Devices, Inc.Inventor: Nicolai Hähnle
-
Patent number: 10146575Abstract: Methods, systems and computer-readable mediums for task scheduling on an accelerated processing device (APD) are provided. In an embodiment, a method comprises: enqueuing one or more tasks in a memory storage module based on the APD; using a software-based enqueuing module; and dequeuing the one or more tasks from the memory storage module using a hardware-based command processor, wherein the command processor forwards the one or more tasks to the shader cote.Type: GrantFiled: August 29, 2016Date of Patent: December 4, 2018Assignee: ADVANCED MICRO DEVICES, INC.Inventors: Benjamin Thomas Sander, Michael Houston, Newton Cheung, Keith Lowery
-
Publication number: 20180341613Abstract: A method and apparatus of integrating memory stacks includes providing a first memory die of a first memory technology and a second memory die of a second memory technology. A first logic die is in communication with the first memory die of the first memory technology, and includes a first memory controller including a first memory control function for interpreting requests in accordance with a first protocol for the first memory technology. A second logic die is in communication with the second memory die of the second memory technology and includes a second memory controller including a second memory control function for interpreting requests in accordance with a second protocol for the second memory technology. A memory operation request is received at the first or second memory controller, and the memory operation request is performed in accordance with the associated first memory protocol or the second memory protocol.Type: ApplicationFiled: May 25, 2017Publication date: November 29, 2018Applicant: Advanced Micro Devices, Inc.Inventors: Dmitri Yudanov, Michael Ignatowski
-
Publication number: 20180343470Abstract: Described herein is a method and apparatus for using cube mapping and mapping metadata with encoders. Video data, such as 360° video data, is sent by a capturing device to an application, such as video editing software, which generates cube mapped video data and mapping metadata from the 360° video data. An encoder then applies the mapping metadata to the cube mapped video data to minimize or eliminate search regions when performing motion estimation, minimize or eliminate neighbor regions when performing intra coding prediction and assign zero weights to edges having no relational meaning.Type: ApplicationFiled: May 25, 2017Publication date: November 29, 2018Applicant: Advanced Micro Devices, Inc.Inventor: Michael L. Schmit
-
Publication number: 20180342099Abstract: A method, a system, and a computer-readable storage medium directed to performing high-speed parallel tessellation of 3D surface patches are disclosed. The method includes generating a plurality of primitives in parallel. Each primitive in the plurality is generated by a sequence of functional blocks, in which each sequence acts independently of all the other sequences.Type: ApplicationFiled: August 7, 2018Publication date: November 29, 2018Applicant: Advanced Micro Devices, Inc.Inventors: Timour T. Paltashev, Boris Prokopenko, Vladimir V. Kibardin
-
Publication number: 20180343430Abstract: A method and apparatus of precomputing includes capturing a first image by a first image capturing device. An image space for the first image is defined and pixels in the image space are analyzed for validity. Valid pixels are stored as valid pixel groups and the valid pixel groups are processed.Type: ApplicationFiled: May 25, 2017Publication date: November 29, 2018Applicant: Advanced Micro Devices, Inc.Inventors: Michael L. Schmit, Radhakrishna Giduthuri, Kiriti Nagesh Gowda
-
Patent number: 10142258Abstract: Methods and apparatus of delegating instructions or data from a CU to an NOC node in a network on chip (NOC) is disclosed. The NOC node executes the delegated instructions or processes the delegated data. An NOC controller (NCC), which is operatively coupled to the CU and the NOC node, facilitates delegating the instructions or data from the CU to the NOC node.Type: GrantFiled: April 8, 2016Date of Patent: November 27, 2018Assignee: Advanced Micro Devices, Inc.Inventors: Greg Sadowski, Edward McLellan
-
Patent number: 10140123Abstract: A graphics processing unit is disclosed, the graphics processing unit having a processor having one or more SIMD processing units, and a local data share corresponding to one of the one or more SIMD processing units, the local data share comprising one or more low latency accessible memory regions for each group of threads assigned to one or more execution wavefronts, and a global data share comprising one or more low latency memory regions for each group of threads.Type: GrantFiled: April 10, 2017Date of Patent: November 27, 2018Assignee: ADVANCED MICRO DEVICES, INC.Inventors: Michael J. Mantor, Brian Emberling
-
Patent number: 10133574Abstract: A system-on-a-chip includes a plurality of instruction processors and a hardware block such as a system management unit. The hardware block accesses values of performance counters associated with the plurality of instruction processors and modifies one or more operating points of one or more of the plurality of instruction processors based on comparisons of the instruction arrival rates and the instruction service rates to achieve optimized system metrics.Type: GrantFiled: June 14, 2016Date of Patent: November 20, 2018Assignee: Advanced Micro Devices, Inc.Inventors: Akanksha Jain, Wei Huang, Indrani Paul
-
Patent number: 10134102Abstract: A GPU is configured to read and process data produced by a compute shader via the one or more ring buffers and pass the resulting processed data to a vertex shader as input. The GPU is further configured to allow the compute shader and vertex shader to write through a cache. Each ring buffer is configured to synchronize the compute shader and the vertex shader to prevent processed data generated by the compute shader that is written to a particular ring buffer from being overwritten before the data is accessed by the vertex shader. It is emphasized that this abstract is provided to comply with the rules requiring an abstract that will allow a searcher or other reader to quickly ascertain the subject matter of the technical disclosure. It is submitted with the understanding that it will not be used to interpret or limit the scope or meaning of the claims.Type: GrantFiled: June 5, 2014Date of Patent: November 20, 2018Assignees: SONY INTERACTIVE ENTERTAINMENT INC., ADVANCED MICRO DEVICES, INC.Inventors: Mark Evan Cerny, David Simpson, Jason Scanlin, Michael Mantor
-
Patent number: 10134355Abstract: A processor performs vertex coloring for a graph based at least in part on the degree of each vertex of the graph and based at least in part with another coloring approach, such as comparison of random values assigned to the vertices. For each vertex in the graph, a processor determines whether the degree of the vertex is a local maximum; that is, whether the degree of the vertex is greater than the degree of each of its connected vertices. Each vertex having a local-maximum degree is assigned a specified or randomly selected color, and is then omitted from future iterations of the coloring process. After a stop criterion is met, the processor assigns random values to the remaining uncolored vertices and assigns colors based on comparisons of the random values.Type: GrantFiled: May 22, 2015Date of Patent: November 20, 2018Assignee: Advanced Micro Devices, Inc.Inventor: Shuai Che
-
Patent number: 10133672Abstract: Described is a system and method for efficient pointer chasing in systems having a single memory node or a network of memory nodes. In particular, a pointer chasing command is sent along with a memory request by an issuing node to a memory node. The pointer chasing command indicates the number of interdependent memory accesses and information needed for the identified interdependent memory accesses. An address computing unit associated with the memory node determines the relevant memory address for an interdependent memory access absent further interaction with the issuing node or without having to return to the issuing node.Type: GrantFiled: September 15, 2016Date of Patent: November 20, 2018Assignee: Advanced Micro Devices, Inc.Inventors: Paula Aguilera Diez, Amin Farmahini-Farahani, Nuwan Jayasena
-
Patent number: 10133678Abstract: In some embodiments, a method of managing cache memory includes identifying a group of cache lines in a cache memory, based on a correlation between the cache lines. The method also includes tracking evictions of cache lines in the group from the cache memory and, in response to a determination that a criterion regarding eviction of cache lines in the group from the cache memory is satisfied, selecting one or more (e.g., all) remaining cache lines in the group for eviction.Type: GrantFiled: August 28, 2013Date of Patent: November 20, 2018Assignee: ADVANCED MICRO DEVICES, INC.Inventors: Yasuko Eckert, Syed Ali Jafri, Srilatha Manne, Gabriel Loh
-
Patent number: 10127044Abstract: A processor, a device, and a non-transitory computer readable medium for performing branch prediction in a processor are presented. The processor includes a front end unit. The front end unit includes a level 1 branch target buffer (BTB), a BTB index predictor (BIP), and a level 1 hash perceptron (HP). The BTB is configured to predict a target address. The BIP is configured to generate a prediction based on a program counter and a global history, wherein the prediction includes a speculative partial target address, a global history value, a global history shift value, and a way prediction. The HP is configured to predict whether a branch instruction is taken or not taken.Type: GrantFiled: October 24, 2014Date of Patent: November 13, 2018Assignee: Advanced Micro Devices, Inc.Inventors: Douglas Williams, Sahil Arora, Nikhil Gupta, Wei-Yu Chen, Debjit Das Sarma, Marius Evers
-
Publication number: 20180321946Abstract: A method for use in a processor for arbitrating between multiple processes to select wavefronts for execution on a shader core is provided. The processor includes a compute pipeline configured to issue wavefronts to the shader core for execution, a hardware queue descriptor associated with the compute pipeline, and the shader core. The shader core is configured to execute work for the compute pipeline corresponding to a first memory queue descriptor executed using data for the first memory queue descriptor that is loaded into a first hardware queue descriptor. The processor is configured to detect a context switch condition, and, responsive to the context switch condition, perform a context switch operation including loading data for a second memory queue descriptor into the first hardware queue descriptor. The shader core is configured to execute work corresponding to the second memory queue descriptor that is loaded into the first hardware queue descriptor.Type: ApplicationFiled: July 19, 2018Publication date: November 8, 2018Applicant: Advanced Micro Devices, Inc.Inventors: Robert Scott Hartog, Mark Leather, Michael Mantor, Rex McCrary, Sebastien Nussbaum, Philip J. Rogers, Ralph Clay Taylor, Thomas Woller
-
Patent number: 10120430Abstract: A system and method for managing operating modes within a semiconductor chip for optimal power and performance while meeting a reliability target are described. A semiconductor chip includes a functional unit and a corresponding reliability monitor. The functional unit provides actual usage values to the reliability monitor. The reliability monitor determines expected usage values based on a reliability target and the age of the semiconductor chip. The reliability monitor compares the actual usage values and the expected usage values. The result of this comparison is used to increase or decrease current operational parameters.Type: GrantFiled: September 7, 2016Date of Patent: November 6, 2018Assignees: Advanced Micro Devices, Inc., ATI Technologies ULCInventors: Stephen V. Kosonocky, Thomas Burd, Adam Clark, Larry D. Hewitt, John Vincent Faricelli, John P. Petry