Patents Assigned to ADVANCED MICRO DEVICES (AMD)
-
Patent number: 10585847Abstract: A method and apparatus for scheduling instructions of a shader program for a graphics processing unit (GPU) with a fixed number of registers. The method and apparatus include computing, via a processing unit (PU), a liveness-based register usage across all basic blocks in the shader program, computing, via the PU, the range of numbers of waves of a plurality of registers for the shader program, assessing the impact of available post-register allocation optimizations, computing, via the PU, the scoring data based on number of waves of the plurality of registers, and computing, via the PU, the number of waves for execution for the plurality of registers.Type: GrantFiled: February 4, 2019Date of Patent: March 10, 2020Assignee: ADVANCED MICRO DEVICES, INC.Inventors: Robert A. Gottlieb, Christopher L. Reeve, Michael John Bedy
-
Patent number: 10585642Abstract: A system and method for managing data in a ring buffer is disclosed. The system includes a legacy ring buffer functioning as an on-chip ring buffer, a supplemental buffer for storing data in the ring buffer, a preload ring buffer that is on-chip and capable of receiving preload data from the supplemental buffer, a write controller that determines where to write data that is write requested by a write client of the ring buffer, and a read controller that controls a return of data to a read client pursuant to a read request to the ring buffer.Type: GrantFiled: October 15, 2018Date of Patent: March 10, 2020Assignee: ADVANCED MICRO DEVICES, INC.Inventors: XuHong Xiong, Pingping Shao, ZhongXiang Luo, ChenBin Wang
-
Patent number: 10585826Abstract: The described embodiments include a computing device with multiple interrupt processors for processing interrupts. In the described embodiments, each of the multiple processors is classified as one or more processor types based on factors such as features and functionality of the processor, an operating environment of the processor, the characteristics of some or all of the available interrupts, etc. During operation, an interrupt controller in the computing device receives an indication of an interrupt. The interrupt controller then determines a processor type for processing the interrupt. Next, the interrupt controller causes the interrupt to be processed by one of the plurality of processors that is the determined processor type.Type: GrantFiled: January 25, 2016Date of Patent: March 10, 2020Assignee: ADVANCED MICRO DEVICES, INC.Inventors: Nuwan S. Jayasena, Andrew G. Kegel
-
Patent number: 10585805Abstract: A computing device that handles address translations is described. The computing device includes a hardware table walker and a memory that stores a reverse map table and a plurality of pages of memory. The table walker is configured to use validated indicators in entries in the reverse map table to determine if page accesses are made to pages for which entries are validated. The table walker is further configured to use virtual machine permissions levels information in entries in the reverse map table determine if page accesses for specified operation types are permitted.Type: GrantFiled: February 28, 2018Date of Patent: March 10, 2020Assignee: ADVANCED MICRO DEVICES, INC.Inventors: David A. Kaplan, Jeremy W. Powell, Thomas R. Woller
-
Patent number: 10568025Abstract: An approach is provided for data processing methods wherein a PHY layer transmit operation is constructed by aggregating MAC layer data units (MPDUs) by means of a linked list. A link list of TX descriptors can be formed separately from the payloads. In order to minimize processor speed rates, the next transmission buffer is pre-fetched whenever appropriate. Interlocks are used to prevent conflict on descriptor fetches and payload fetches.Type: GrantFiled: July 13, 2015Date of Patent: February 18, 2020Assignees: ADVANCED MICRO DEVICES, INC., AMD FAR EAST LTD.Inventors: Sebastian Ahmed, Douglas A. Mammoser
-
Patent number: 10560022Abstract: An apparatus includes an integrated circuit chip with a set of circuits having two or more subsets of circuits; an external voltage regulator separate from the integrated circuit chip; two or more integrated voltage regulators on the integrated circuit chip that each provide an input voltage to a respective subset of the circuits; and a controller. The controller determines, using an integrated voltage regulator power loss model, an electrical power loss for the integrated voltage regulators for a first combination of operating points for the subsets of the circuits. The controller then determines, based on the electrical power loss, a second combination of operating points for the subsets of the circuits that includes an adjustment to an operating point for at least one of the subsets of the circuits that compensates for an electrical power loss of the corresponding integrated voltage regulator.Type: GrantFiled: June 13, 2019Date of Patent: February 11, 2020Assignee: ADVANCED MICRO DEVICES, INC.Inventors: Wei Huang, Miguel Rodriguez, Karthik Rao
-
Patent number: 10542268Abstract: A system and method for providing video compression that includes encoding using an encoding engine a YUV stream wherein Y, U and V color values are encoded in parallel and patching together the Y, U and V color streams to form a compressed YUV output stream. The encoding engine further includes encoding each color value of the YUV stream in parallel using parallel encoding engines and a control engine for controlling operation all of the encoding engines in parallel. The YUV stream has an average bits per pixel value that varies from a first value to a second value that is double the first value. The encoding engine includes encoding the YUV stream in generally the same amount of time regardless of the average bits per pixel value.Type: GrantFiled: April 19, 2017Date of Patent: January 21, 2020Assignees: ADVANCED MICRO DEVICES, INC., ATI TECHNOLOGIES ULCInventors: Haibin Li, Zhen Chen, Lei Zhang, Ji Zhou, Zhong Cai
-
Patent number: 10540280Abstract: Techniques for performing cache invalidates and write-backs in an accelerated processing device (e.g., a graphics processing device that renders three-dimensional graphics) are disclosed. The techniques involve receiving requests from a “master” (e.g., the central processing unit). The techniques involve invalidating virtual-to-physical address translations in an address translation request. The techniques include splitting up the requests based on whether the requests target virtually or physically tagged caches. Addresses for the portions of a request that target physically tagged caches are translated using invalidated virtual-to-physical address translations for speed. The split up request is processed to generate micro-transactions for individual caches targeted by the request. Micro-transactions for physically and virtually tagged caches are processed in parallel. Once all micro-transactions for a request have been processed, the unit that made the request is notified.Type: GrantFiled: December 23, 2016Date of Patent: January 21, 2020Assignees: ADVANCED MICRO DEVICES, INC., ATI TECHNOLOGIES ULCInventors: Mark Fowler, Jimshed Mirza, Anthony Asaro
-
Patent number: 10535393Abstract: An electronic device including a memory functional block having multiple ranks of memory and a memory controller functional block coupled to the memory. The memory controller includes refresh logic that detects, based on buffered memory accesses for each rank of memory of the ranks of memory, two or more ranks of memory for which a refresh is to be performed during a refresh interval. Based at least in part on one or more properties of buffered memory accesses for the two or more ranks of memory, the refresh logic determines a refresh order for performing refreshes for the two or more ranks of memory during the refresh interval. The memory controller then performs, in the refresh order, refreshes for the two or more ranks of memory during the refresh interval.Type: GrantFiled: July 21, 2018Date of Patent: January 14, 2020Assignee: ADVANCED MICRO DEVICES, INC.Inventor: Kedarnath Balakrishnan
-
Patent number: 10534498Abstract: A system and method are set forth which combine an ability to view a motion video with an ability to simultaneously access computer programs. In certain embodiments, the media system provides access to movies, music and photos in a visually appealing three dimensional environment. In certain environments, the media system presents a three dimensional navigation tool (such as a three dimensional wheel) on which thumbnails are presented. A required resource value corresponding to system resources required to present individual thumbnails is generated, followed by the generation of an available resource value corresponding to system resources available to present media associated with the selected thumbnail. The available resource value and one or more required resource values are then processed to generate a consumed resource value, which is then used to limit the number of thumbnails presented.Type: GrantFiled: November 1, 2016Date of Patent: January 14, 2020Assignee: ADVANCED MICRO DEVICES, INC.Inventors: Michael C. Gotcher, Raymond F. Dumbeck
-
Patent number: 10522193Abstract: A system, method, and computer program product are provided for a memory device system. One or more memory dies and at least one logic die are disposed in a package and communicatively coupled. The logic die comprises a processing device configurable to manage virtual memory and operate in an operating mode. The operating mode is selected from a set of operating modes comprising a slave operating mode and a host operating mode.Type: GrantFiled: September 12, 2018Date of Patent: December 31, 2019Assignee: ADVANCED MICRO DEVICES, INC.Inventors: Nuwan S. Jayasena, Gabriel H. Loh, Bradford M. Beckmann, James M. O'Connor, Lisa R. Hsu
-
Patent number: 10515173Abstract: An electronic device includes a first integrated circuit chip including a processing functional block, and a second integrated circuit chip including an input-output (IO) functional block. The IO functional block performs one or more IO processing operations on behalf of the processing functional block in the first integrated circuit chip. The first integrated circuit chip lacks at least some elements of the IO functional block, so that the processing functional block is unable to perform corresponding IO operations without the IO functional block.Type: GrantFiled: December 29, 2017Date of Patent: December 24, 2019Assignee: ADVANCED MICRO DEVICES, INC.Inventors: David A. Roberts, Dean Gonzales
-
Patent number: 10509736Abstract: An input-output (IO) memory management unit (IOMMU) uses a reverse map table (RMT) to ensure that address translations acquired from a nested page table are correct and that IO devices are permitted to access pages in a memory when performing memory accesses in a computing device. A translation lookaside buffer (TLB) flushing mechanism is used to invalidate address translation information in TLBs that are affected by changes in the RMT. A modified Address Translation Caching (ATC) mechanism may be used, in which only partial address translation information is provided to IO devices so that the RMT is checked when performing memory accesses for the IO devices using the cached address translation information.Type: GrantFiled: April 10, 2018Date of Patent: December 17, 2019Assignee: ADVANCED MICRO DEVICES, INC.Inventors: Nippon Raval, David A. Kaplan, Philip Ng
-
Patent number: 10509452Abstract: Techniques for managing power distribution amongst processors in a massively parallel computer architecture are disclosed. The techniques utilize a hierarchy that organizes the various processors of the massively parallel computer architecture. The hierarchy groups numbers of the processors at the lowest level. When processors complete tasks, the power assigned to those processors is distributed to other processors in the same group so that the performance of those processors can be increased. Hierarchical organization simplifies the calculations required for determining how and when to distribute power, because when tasks are complete and power is available for distribution, a relatively small number of processors are available for consideration to receive that power. The number of processors that are grouped together can be adjusted in real time based on performance factors to improve the trade-off between calculation speed and power distribution efficacy.Type: GrantFiled: April 26, 2017Date of Patent: December 17, 2019Assignee: ADVANCED MICRO DEVICES, INC.Inventors: Xinwei Chen, Leonardo de Paula Rosa Piga
-
Patent number: 10509596Abstract: A technique for accessing memory in an accelerated processing device coupled to stacked memory dies is provided herein. The technique includes receiving a memory access request from an execution unit and identifying whether the memory access request corresponds to memory cells of the stacked dies that are considered local to the execution unit or non-local. For local accesses, the access is made “directly”, that is, without using a bus. A control die coordinates operations for such local accesses, activating particular through-silicon-vias associated with the memory cells that include the data for the access. Non-local accesses are made via a distributed cache fabric and an interconnect bus in the control die. Various other features and details are provided below.Type: GrantFiled: December 21, 2017Date of Patent: December 17, 2019Assignee: ADVANCED MICRO DEVICES, INC.Inventors: Dmitri Yudanov, Jiasheng Chen
-
Patent number: 10510185Abstract: A technique for performing rasterization and pixel shading with decoupled resolution is provided herein. The technique involves performing rasterization as normal to generate fine rasterization data and a set of (fine) quads. The quads are accumulated into a tile buffer and coarse quads are generated from the quads in the tile buffer based on a shading rate. The shading rate determines how many pixels of the fine quads are combined to generate coarse pixels of the coarse quads. Combination of fine pixels involves generating a single coarse pixel for each such fine pixel to be combined. The positions of the coarse pixels of the coarse quads are set based on the positions of the corresponding fine pixels. The coarse quads are shaded normally and the resulting shaded coarse quads are modified based on the fine rasterization data to generate shaded fine quads.Type: GrantFiled: August 25, 2017Date of Patent: December 17, 2019Assignee: ADVANCED MICRO DEVICES, INC.Inventors: Skyler Jonathon Saleh, Christopher J. Brennan, Andrew S. Pomianowski, Ruijin Wu
-
Patent number: 10503655Abstract: The described embodiments include a computing device that caches data acquired from a main memory in a high-bandwidth memory (HBM), the computing device including channels for accessing data stored in corresponding portions of the HBM. During operation, the computing device sets each of the channels so that data blocks stored in the corresponding portions of the HBM include corresponding numbers of cache lines. Based on records of accesses of cache lines in the HBM that were acquired from pages in the main memory, the computing device sets a data block size for each of the pages, the data block size being a number of cache lines. The computing device stores, in the HBM, data blocks acquired from each of the pages in the main memory using a channel having a data block size corresponding to the data block size for each of the pages.Type: GrantFiled: July 21, 2016Date of Patent: December 10, 2019Assignee: ADVANCED MICRO DEVICES, INC.Inventors: Mitesh R. Meswani, Jee Ho Ryoo
-
Patent number: 10491916Abstract: The present disclosure is directed a system and method for exploiting camera and depth information associated with rendered video frames, such as those rendered by a server operating as part of a cloud gaming service, to more efficiently encode the rendered video frames for transmission over a network. The method and system of the present disclosure can be used in a server operating in a cloud gaming service to improve, for example, the amount of latency, downstream bandwidth, and/or computational processing power associated with playing a video game over its service. The method and system of the present disclosure can be further used in other applications where camera and depth information of a rendered or captured video frame is available.Type: GrantFiled: October 1, 2013Date of Patent: November 26, 2019Assignees: ADVANCED MICRO DEVICES, INC., ATI TECHNOLOGIES ULCInventors: Khaled Mammou, Ihab Amer, Gabor Sines, Lei Zhang, Michael Schmit, Daniel Wong
-
Patent number: 10467178Abstract: Embodiments of a peripheral component are described herein. Embodiments provide alternatives to the use of an external bridge integrated circuit (IC) architecture. For example, an embodiment multiplexes a peripheral bus such that multiple processors in one peripheral component can use one peripheral interface slot without requiring an external bridge IC. Embodiments are usable with known bus protocols.Type: GrantFiled: December 9, 2016Date of Patent: November 5, 2019Assignees: ADVANCED MICRO DEVICES, INC., ATI TECHNOLOGIES ULC.Inventors: Shahin Solki, Stephen Morein, Mark S. Grossman
-
Patent number: 10459776Abstract: Techniques for managing message transmission in a large networked computer system that includes multiple individual networked computing systems are disclosed. Message passing among the computing systems include a sending computing device transmitting a message to a receiver computing device and a receiver computing device consuming that message. A build-up of data stored in a buffer at the receiver can reduce performance. In order to reduce the potential performance degradation associated with large amounts of “waiting” data in the buffer, a sending computer system first determines whether the receiver computer system is ready to receive a message and does not transmit the message if the receiver computer system is not ready. To determine whether the receiver computer system is ready to receive a message, the receiver computer system, at the request of the sending computer system, checks a counting filter that stores indications of whether particular messages are ready.Type: GrantFiled: June 5, 2017Date of Patent: October 29, 2019Assignee: ADVANCED MICRO DEVICES, INC.Inventor: Shuai Che