Patents Assigned to Advanced Micro Devices, Inc.

Reduced bandwidth tessellation factors

Patent number: 11010862

Abstract: A graphics pipeline reduces the number of tessellation factors written to and read from a graphics memory. A hull shader stage of the graphics pipeline detects whether at least a threshold percentage of the tessellation factors for a thread group of patches are the same and, in some embodiments, whether at least the threshold percentage of the tessellation factors for a thread group of patches have a same value that either indicates that the plurality of patches are to be culled or that the plurality of patches are to be passed to a tessellator stage of the graphics pipeline.

Type: Grant

Filed: November 14, 2019

Date of Patent: May 18, 2021

Assignee: ADVANCED MICRO DEVICES, INC.

Inventors: Mangesh P. Nijasure, Tad Litwiller, Todd Martin, Nishank Pathak
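
Illustrative sketch (not taken from the patent): the threshold check described in the abstract above could look roughly like the following. The function names, the 75% default threshold, the exception map, and the use of 0.0 as the "cull" value are all assumptions made for the example.

```python
from collections import Counter


def summarize_tess_factors(factors, threshold=0.75):
    """Compress a thread group's tessellation factors when most of them agree.

    Returns (shared_value, exceptions) if at least `threshold` of the factors
    are identical, where `exceptions` maps patch index -> differing factor.
    Returns None when no value is common enough, meaning every factor would
    still have to be written out individually.
    """
    if not factors:
        return None
    shared_value, count = Counter(factors).most_common(1)[0]
    if count / len(factors) < threshold:
        return None
    exceptions = {i: f for i, f in enumerate(factors) if f != shared_value}
    return shared_value, exceptions


def action_for(shared_value, cull_value=0.0):
    """Assumption: a shared factor equal to cull_value marks patches to cull."""
    return "cull" if shared_value == cull_value else "pass to tessellator"


# Seven of eight patches share a factor, so only one exception is recorded.
group = [0.0] * 7 + [4.0]
print(summarize_tess_factors(group))
```

In this model the bandwidth saving comes from writing one shared value plus a short exception list instead of one factor per patch.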

LATENCY HIDING FOR CACHES

Publication number: 20210141740

Abstract: A technique for accessing a memory having a high latency portion and a low latency portion is provided. The technique includes detecting a promotion trigger to promote data from the high latency portion to the low latency portion, in response to the promotion trigger, copying cache lines associated with the promotion trigger from the high latency portion to the low latency portion, and in response to a read request, providing data from either or both of the high latency portion or the low latency portion, based on a state associated with data in the high latency portion and the low latency portion.

Type: Application

Filed: November 13, 2019

Publication date: May 13, 2021

Applicant: Advanced Micro Devices, Inc.

Inventors: John Kalamatianos, Apostolos Kokolis, Shrikanth Ganapathy

DYNAMIC BANKING AND BIT SEPARATION IN MEMORIES

Publication number: 20210141733

Abstract: Memories that are configurable to operate in either a banked mode or a bit-separated mode. The memories include a plurality of memory banks; multiplexing circuitry; input circuitry; and output circuitry. The input circuitry inputs at least a portion of a memory address and configuration information to the multiplexing circuitry. The multiplexing circuitry generates read data by combining a selected subset of data corresponding to the address from each of the plurality of memory banks, the subset selected based on the configuration information, if the configuration information indicates a bit-separated mode. The multiplexing circuitry generates the read data by combining data corresponding to the address from one of the memory banks, the one of the memory banks selected based on the configuration information, if the configuration information indicates a banked mode. The output circuitry outputs the generated read data from the memory.

Type: Application

Filed: November 11, 2019

Publication date: May 13, 2021

Applicant: Advanced Micro Devices, Inc.

Inventor: Russell J. Schreiber
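
Illustrative sketch (not taken from the application): a software model of the two read modes described in the abstract above. The 8-bit word width, the mode names, and the way `select` picks either a bank or a bit slice are assumptions chosen to keep the example small.

```python
WORD_BITS = 8  # assumption: each bank stores 8-bit words


def read(banks, addr, mode, select):
    """Model a read in banked or bit-separated mode.

    banks  : list of banks, each a list of WORD_BITS-wide integers
    addr   : row address applied to every bank
    mode   : "banked" or "bit_separated" (the configuration information)
    select : bank number in banked mode, slice number in bit-separated mode
    """
    if mode == "banked":
        # The whole word comes from the single selected bank.
        return banks[select][addr]
    if mode == "bit_separated":
        # Every bank contributes one slice of bits; slices are concatenated.
        slice_bits = WORD_BITS // len(banks)
        mask = (1 << slice_bits) - 1
        word = 0
        for i, bank in enumerate(banks):
            piece = (bank[addr] >> (select * slice_bits)) & mask
            word |= piece << (i * slice_bits)
        return word
    raise ValueError(f"unknown mode: {mode}")


banks = [[0b11100100], [0b00011011], [0b10101010], [0b01010101]]
print(bin(read(banks, 0, "banked", 1)))
print(bin(read(banks, 0, "bit_separated", 0)))
```

The same address and bank array serve both modes; only the multiplexing step changes, which is the point the abstract makes about configurability.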

Semiconductor chip with stacked conductor lines and air gaps

Patent number: 11004791

Abstract: Various semiconductor chip metallization layers and methods of manufacturing the same are disclosed. In one aspect, a semiconductor chip is provided that includes a substrate, plural metallization layers on the substrate, a first conductor line in one of the metallization layers and a second conductor line in the one of the metallization layers in spaced apart relation to the first conductor line, each of the first conductor line and the second conductor line has a first line portion and a second line portion stacked on the first line portion, and a dielectric layer that has a portion positioned between the first conductor line and a second line, the portion has an air gap.

Type: Grant

Filed: April 12, 2019

Date of Patent: May 11, 2021

Assignee: ADVANCED MICRO DEVICES, INC.

Inventor: Richard Schultz

Networked input/output memory management unit

Patent number: 11003588

Abstract: A networked input/output memory management unit (IOMMU) includes a plurality of IOMMUs. The networked IOMMU receives a memory access request that includes a domain physical address generated by a first address translation layer. The networked IOMMU selectively translates the domain physical address into a physical address in a system memory using one of the plurality of IOMMUs that is selected based on a type of a device that generated the memory access request. In some cases, the networked IOMMU is connected to a graphics processing unit (GPU), at least one peripheral device, and the memory. The networked IOMMU includes a command queue to receive the memory access requests, a primary IOMMU to selectively translate the domain physical address in memory access requests from the GPU, and a secondary IOMMU to translate the domain physical address in memory requests from the peripheral device.

Type: Grant

Filed: August 22, 2019

Date of Patent: May 11, 2021

Assignees: ADVANCED MICRO DEVICES, INC., ATI TECHNOLOGIES ULC

Inventors: Sonu Arora, Paul Blinzer, Philip Ng, Nippon Harshadk Raval

Automatic configuration of knobs to optimize performance of a graphics pipeline

Patent number: 11004251

Abstract: A knob has a plurality of settings that configure a graphics pipeline. A first setting is associated with a first state of the graphics pipeline. The first setting is associated with the first state based on a measure of performance of the graphics pipeline while configured according to the first setting. The graphics pipeline is configured according to the first setting in response to the first state of the graphics pipeline matching a current state of the graphics pipeline. The graphics pipeline processes graphics according to the first setting. In some cases, the first setting is associated with the first state of the graphics pipeline by dithering or toggling the knob between the settings once per frame for a predetermined number of frames. The first setting achieves better performance than other ones of the plurality of settings during the predetermined number of frames.

Type: Grant

Filed: November 27, 2018

Date of Patent: May 11, 2021

Assignee: ADVANCED MICRO DEVICES, INC.

Inventor: Christopher J. Brennan
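
Illustrative sketch (not taken from the patent): one way to model the per-frame dithering described in the abstract above. The names `calibrate` and `KnobTable`, the use of average frame time as the performance measure, and the round-robin dithering order are assumptions.

```python
def calibrate(settings, frame_time, num_frames):
    """Dither the knob once per frame and return the best-performing setting.

    frame_time(setting, frame) is a caller-supplied measurement (hypothetical
    here) returning the time taken to render that frame; lower is better.
    """
    if num_frames < 1:
        raise ValueError("num_frames must be at least 1")
    totals = {s: 0.0 for s in settings}
    counts = {s: 0 for s in settings}
    for frame in range(num_frames):
        setting = settings[frame % len(settings)]  # toggle once per frame
        totals[setting] += frame_time(setting, frame)
        counts[setting] += 1
    measured = [s for s in settings if counts[s]]
    return min(measured, key=lambda s: totals[s] / counts[s])


class KnobTable:
    """Associates pipeline states with the setting that performed best."""

    def __init__(self):
        self.best = {}

    def learn(self, state, settings, frame_time, num_frames):
        self.best[state] = calibrate(settings, frame_time, num_frames)

    def setting_for(self, current_state, default):
        # Reuse the learned setting when the current state matches a known one.
        return self.best.get(current_state, default)


table = KnobTable()
table.learn("heavy_tessellation", ["low", "high"],
            lambda s, f: {"low": 9.0, "high": 7.0}[s], num_frames=8)
print(table.setting_for("heavy_tessellation", default="low"))
```

A state that has not been calibrated falls back to the default, so calibration cost is paid once per distinct state in this model.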

Combined world-space pipeline shader stages

Patent number: 11004258

Abstract: Improvements to graphics processing pipelines are disclosed. More specifically, the vertex shader stage, which performs vertex transformations, and the hull or geometry shader stages, are combined. If tessellation is disabled and geometry shading is enabled, then the graphics processing pipeline includes a combined vertex and graphics shader stage. If tessellation is enabled, then the graphics processing pipeline includes a combined vertex and hull shader stage. If tessellation and geometry shading are both disabled, then the graphics processing pipeline does not use a combined shader stage. The combined shader stages improve efficiency by reducing the number of executing instances of shader programs and associated resources reserved.

Type: Grant

Filed: October 2, 2019

Date of Patent: May 11, 2021

Assignee: Advanced Micro Devices, Inc.

Inventors: Mangesh P. Nijasure, Randy W. Ramsey, Todd Martin

METHOD AND APPARATUS FOR DATA SCRAMBLING

Publication number: 20210132675

Abstract: A method and apparatus for scrambling and descrambling data in a computer system includes transmitting non-scrambled data from a first high speed inter chip (IP) link circuit located on a first chip to a first serializer/deserializer (SERDES) physical (PHY) circuit located on the first chip, the first high speed link IP indicating the data is not scrambled. The received non-scrambled data is scrambled by the first SERDES PHY circuit and transmitted to a second chip. The received scrambled data is descrambled by a second SERDES PHY circuit located on the second chip. The non-scrambled data is transmitted by the second SERDES PHY circuit to a second high speed link IP circuit located on the second chip to a third circuit for further processing or transmission.

Type: Application

Filed: January 15, 2021

Publication date: May 6, 2021

Applicants: ADVANCED MICRO DEVICES, INC., ATI TECHNOLOGIES ULC

Inventors: Yanfeng Wang, Michael J. Tresidder, Kevin M. Lepak, Larry David Hewitt, Noah Beck

Improving latency by performing early synchronization operations in between sets of program operations of a thread

Patent number: 10990453

Abstract: A memory fence or other similar operation is executed with reduced latency. An early fence operation is executed and acts as a hint to the processor executing the thread that executes the fence. This hint causes the processor to begin performing sub-operations for the fence earlier than if no such hint were executed. Examples of sub-operations for the fence include operations to make data written to by writes prior to the fence operation available to other threads. A resolving fence, which occurs after the early fence, performs the remaining sub-operations for the fence. By triggering some or all of the sub-operations for a memory fence that will occur in the future, the early fence operation reduces the amount of latency associated with that memory fence operation.

Type: Grant

Filed: April 12, 2018

Date of Patent: April 27, 2021

Assignee: Advanced Micro Devices, Inc.

Inventors: Amin Farmahini-Farahani, David A. Roberts, Nuwan Jayasena

Address-based filtering for load/store speculation

Patent number: 10990393

Abstract: Address-based filtering for load/store speculation includes maintaining a filtering table including table entries associated with ranges of addresses; in response to receiving an ordering check triggering transaction, querying the filtering table using a target address of the ordering check triggering transaction to determine if an instruction dependent upon the ordering check triggering transaction has previously been generated a physical address; and in response to determining that the filtering table lacks an indication that the instruction dependent upon the ordering check triggering transaction has previously been generated a physical address, bypassing a lookup operation in an ordering violation memory structure to determine whether the instruction dependent upon the ordering check triggering transaction is currently in-flight.

Type: Grant

Filed: October 21, 2019

Date of Patent: April 27, 2021

Assignee: ADVANCED MICRO DEVICES, INC.

Inventors: John Kalamatianos, Krishnan V. Ramani, Susumu Mashimo
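
Illustrative sketch (not taken from the patent): a toy model of the filtering step described in the abstract above. The 4 KiB range granularity, the class name, and the use of a plain list as a stand-in for the ordering violation memory structure are assumptions.

```python
RANGE_BITS = 12  # assumption: one filter entry covers a 4 KiB address range


class OrderingFilter:
    def __init__(self):
        self.ranges = set()   # filtering table: ranges touched by loads
        self.in_flight = []   # stand-in for the ordering violation structure
        self.lookups = 0      # how many expensive lookups were performed

    def record_load(self, phys_addr):
        """Called when a speculative load has generated its physical address."""
        self.ranges.add(phys_addr >> RANGE_BITS)
        self.in_flight.append(phys_addr)

    def ordering_check(self, target_addr):
        """Return True if an in-flight load matches the triggering address."""
        if (target_addr >> RANGE_BITS) not in self.ranges:
            return False      # no indication in the filter: bypass the lookup
        self.lookups += 1     # filter hit: the full lookup is still required
        return target_addr in self.in_flight


f = OrderingFilter()
f.record_load(0x1040)
print(f.ordering_check(0x9000), f.lookups)
print(f.ordering_check(0x1040), f.lookups)
```

The filter can produce false positives (same range, different address), which cost a lookup but never hide a real conflict in this model.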

Residency map descriptors

Patent number: 10991146

Abstract: A processor receives a request to access one or more levels of a partially resident texture (PRT) resource. The levels represent a texture at different levels of detail (LOD) and the request includes normalized coordinates indicating a location in the texture. The processor accesses a texture descriptor that includes dimensions of a first level of the levels and one or more offsets between a reference level and one or more second levels that are associated with one or more residency maps that indicate texels that are resident in the PRT resource. The processor translates the normalized coordinates to texel coordinates in the one or more residency maps based on the offset and accesses, in response to the request, the one or more residency maps based on the texel coordinates to determine whether texture data indicated by the normalized coordinates is resident in the PRT resource.

Type: Grant

Filed: December 20, 2019

Date of Patent: April 27, 2021

Assignee: ADVANCED MICRO DEVICES, INC.

Inventors: Maxim V. Kazakov, Mark Fowler

Low latency FIFO with auto sync

Patent number: 10990120

Abstract: A method operates a first-in-first-out (FIFO) buffer with a first clock, and operates one of a read pointer or a write pointer of the FIFO buffer with the first clock while operating the other one of the read pointer or write pointer with a second clock. One of a serializer fed from the FIFO buffer output, or a de-serializer feeding the FIFO buffer input, is operated with the second clock. Timing pulses indicate that the pointer operating with the second clock has reached a predetermined point in its cycle. The phase of the second clock is adjusted based on a relationship between the timing pulses and an advance period of the pointer operating with the first clock. The pointer operating with the first clock is reset to achieve a desired value for the relationship. A skew created from adjusting the phase of the second clock is corrected.

Type: Grant

Filed: June 26, 2019

Date of Patent: April 27, 2021

Assignee: Advanced Micro Devices, Inc.

Inventor: Bhuvanachandran K. Nair

HYBRID FIRST-FIT K-CHOICE INSERTIONS FOR HASH TABLES, HASH SETS, APPROXIMATE SET MEMBERSHIP DATA STRUCTURES, AND CACHES

Publication number: 20210117100

Abstract: A hybrid mechanism for operating on a data item in connection with an associative structure combines first-fit and K-choice. The hybrid mechanism leverages advantages of both approaches by choosing whether to insert, retrieve, delete, or modify a data item using either first-fit or K-choice. Based on the data item, a function of the data item, and/or other factors such as the load statistics of the associative structure, one of either first-fit or K-choice is used to improve operation on the associative structure across a variety of different load states of the associative structure.

Type: Application

Filed: October 21, 2019

Publication date: April 22, 2021

Applicant: Advanced Micro Devices, Inc.

Inventors: Alexander D. Breslow, Nuwan Jayasena
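
Illustrative sketch (not taken from the application): a small bucketed hash set that switches between first-fit and K-choice insertion. Switching on a load-factor threshold is only one of the criteria the abstract above mentions; the class name, the SHA-256 based candidate hashing, and the default parameters are assumptions.

```python
import hashlib


class HybridTable:
    def __init__(self, num_buckets=4, slots=8, k=2, switch_load=0.1):
        self.buckets = [[] for _ in range(num_buckets)]
        self.slots = slots              # capacity of each bucket
        self.k = k                      # candidate buckets per key (at most 8)
        self.switch_load = switch_load  # load factor where K-choice takes over
        self.count = 0

    def _candidates(self, key):
        # Derive k candidate bucket indices from one digest of the key.
        digest = hashlib.sha256(repr(key).encode()).digest()
        return [int.from_bytes(digest[4 * i:4 * i + 4], "little") % len(self.buckets)
                for i in range(self.k)]

    def load(self):
        return self.count / (len(self.buckets) * self.slots)

    def insert(self, key):
        """Insert key and return the policy used, or None if no room was found."""
        candidates = self._candidates(key)
        if self.load() < self.switch_load:
            policy = "first-fit"  # take the first candidate with a free slot
            target = next((b for b in candidates
                           if len(self.buckets[b]) < self.slots), None)
        else:
            policy = "k-choice"   # inspect all candidates, take the emptiest
            emptiest = min(candidates, key=lambda b: len(self.buckets[b]))
            target = emptiest if len(self.buckets[emptiest]) < self.slots else None
        if target is None:
            return None
        self.buckets[target].append(key)
        self.count += 1
        return policy

    def contains(self, key):
        return any(key in self.buckets[b] for b in self._candidates(key))


table = HybridTable()
print([table.insert(key) for key in range(6)])
```

First-fit touches fewer buckets while the table is sparse; K-choice spends extra probes to balance load once the table fills, which is the trade-off the hybrid is meant to exploit.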

SYSTEM AND METHOD FOR PROTECTING GPU MEMORY INSTRUCTIONS AGAINST FAULTS

Publication number: 20210117269

Abstract: A system and method for protecting memory instructions against faults are described. The system and method include converting the slave instructions to dummy operations, modifying memory arbiter to issue up to N master and N slave global/shared memory instructions per cycle, sending master memory requests to memory system, using slave requests for error checking, entering master requests to the GM/LM FIFO, storing slave requests in a register, and comparing the entered master requests with the stored slave requests.

Type: Application

Filed: December 7, 2020

Publication date: April 22, 2021

Applicant: Advanced Micro Devices, Inc.

Inventors: John Kalamatianos, Michael Mantor, Sudhanva Gurumurthi

COMPOSABLE NEURAL NETWORK KERNELS

Publication number: 20210117806

Abstract: A technique for manipulating a generic tensor is provided. The technique includes receiving a first request to perform a first operation on a generic tensor descriptor associated with the generic tensor, responsive to the first request, performing the first operation on the generic tensor descriptor, receiving a second request to perform a second operation on generic tensor raw data associated with the generic tensor, and responsive to the second request, performing the second operation on the generic tensor raw data, the performing the second operation including mapping a tensor coordinate specified by the second request to a memory address, the mapping including evaluating a delta function to determine an address delta value to add to a previously determined address for a previously processed tensor coordinate.

Type: Application

Filed: December 30, 2020

Publication date: April 22, 2021

Applicant: Advanced Micro Devices, Inc.

Inventors: Chao Liu, Daniel Isamu Lowell, Wen Heng Chung, Jing Zhang
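
Illustrative sketch (not taken from the application): the abstract above describes mapping a tensor coordinate to a memory address by adding a delta to a previously computed address. A minimal model, assuming a plain strided layout and hypothetical names `address_delta` and `TensorWalker`:

```python
def address_delta(strides, prev_coord, coord):
    """Address change when moving from prev_coord to coord in a strided layout."""
    return sum(s * (c - p) for s, p, c in zip(strides, prev_coord, coord))


class TensorWalker:
    """Tracks the address of the most recently visited tensor coordinate."""

    def __init__(self, strides, base=0):
        self.strides = tuple(strides)
        self.coord = (0,) * len(self.strides)
        self.address = base

    def move_to(self, coord):
        # Update the previous address by a delta instead of recomputing the
        # full dot product of coordinate and strides from scratch.
        self.address += address_delta(self.strides, self.coord, coord)
        self.coord = tuple(coord)
        return self.address


# A 2 x 3 x 4 row-major tensor has strides (12, 4, 1).
walker = TensorWalker(strides=(12, 4, 1))
print(walker.move_to((1, 2, 3)))
print(walker.move_to((0, 1, 0)))
```

Here the descriptor is just the stride tuple; descriptor-level operations such as a transpose would permute the strides without touching the raw data, which is one way to read the abstract's separation of the two kinds of request.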

Interconnect architecture for three-dimensional processing systems

Patent number: 10984838

Abstract: A processing system includes a plurality of processor cores formed in a first layer of an integrated circuit device and a plurality of partitions of memory formed in one or more second layers of the integrated circuit device. The one or more second layers are deployed in a stacked configuration with the first layer. Each of the partitions is associated with a subset of the processor cores that have overlapping footprints with the partitions. The processing system also includes first memory paths between the processor cores and their corresponding subsets of partitions. The processing system further includes second memory paths between the processor cores and the partitions.

Type: Grant

Filed: November 17, 2015

Date of Patent: April 20, 2021

Assignee: ADVANCED MICRO DEVICES, INC.

Inventors: Nuwan S. Jayasena, Yasuko Eckert

Flexible dictionary sharing for compressed caches

Patent number: 10983915

Abstract: Systems, apparatuses, and methods for implementing flexible dictionary sharing techniques for caches are disclosed. A set-associative cache includes a dictionary for each data array set. When a cache line is to be allocated in the cache, a cache controller determines to which set a base index of the cache line address maps. Then, a selector unit determines which dictionary of a group of dictionaries stored by those sets neighboring this set would achieve the most compression for the cache line. This dictionary is then selected to compress the cache line. An offset is added to the base index of the cache line to generate a full index in order to map the cache line to the set corresponding to this chosen dictionary. The compressed cache line is stored in this set with the chosen dictionary, and the offset is stored in the corresponding tag array entry.

Type: Grant

Filed: August 19, 2019

Date of Patent: April 20, 2021

Assignee: Advanced Micro Devices, Inc.

Inventors: Alexander D. Breslow, John Kalamatianos
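
Illustrative sketch (not taken from the patent): choosing among neighboring sets' dictionaries as described in the abstract above. Scoring a dictionary by how many of the line's words it already contains, a group of four neighboring sets, and a base index aligned to the group are assumptions made for the example.

```python
def choose_set(line_words, dictionaries, base_index, group_size=4):
    """Pick the neighboring set whose dictionary best compresses the line.

    dictionaries : one collection of known words per cache set
    base_index   : first set of the neighbor group (assumed group-aligned)
    Returns (offset, full_index); the offset would be kept in the tag entry.
    """
    def score(offset):
        dictionary = dictionaries[base_index + offset]
        # Stand-in compression metric: words already present in the dictionary.
        return sum(word in dictionary for word in line_words)

    best_offset = max(range(group_size), key=score)
    return best_offset, base_index + best_offset


dictionaries = [set() for _ in range(8)]
dictionaries[4] = {1}
dictionaries[5] = {1, 2, 3}
dictionaries[6] = {9}
dictionaries[7] = {2, 4}
print(choose_set([1, 2, 3, 4], dictionaries, base_index=4))
```

On a later lookup, every set in the group would need to be probed (or the stored offset consulted) because the line may live in any of them; that cost is outside this sketch.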

Interposer-based damping resistor

Patent number: 10985097

Abstract: Various resistor circuits and methods of making and using the same are disclosed. In one aspect, a method of manufacturing is provided that includes forming a resistor onboard an interposer. The resistor is adapted to dampen a capacitive network. The capacitive network has at least one capacitor positioned external to the interposer.

Type: Grant

Filed: July 30, 2018

Date of Patent: April 20, 2021

Assignees: ADVANCED MICRO DEVICES, INC., ATI TECHNOLOGIES ULC

Inventors: Fei Guo, Feng Zhu, Julius Din, Anwar Kashem, Sally Yeung

LOW LATENCY WIRELESS VIRTUAL REALITY SYSTEMS AND METHODS

Publication number: 20210112289

Abstract: Virtual Reality (VR) processing devices and methods are provided for transmitting user feedback information comprising at least one of user position information and user orientation information, receiving encoded audio-video (A/V) data, which is generated based on the transmitted user feedback information, separating the A/V data into video data and audio data corresponding to a portion of a next frame of a sequence of frames of the video data to be displayed, decoding the portion of a next frame of the video data and the corresponding audio data, providing the audio data for aural presentation and controlling the portion of the next frame of the video data to be displayed in synchronization with the corresponding audio data.

Type: Application

Filed: December 23, 2020

Publication date: April 15, 2021

Applicants: Advanced Micro Devices, Inc., ATI Technologies ULC

Inventors: Lei Zhang, Gabor Sines, Khaled Mammou, David Glen, Layla A. Mah, Rajabali M. Koduri, Bruce Montag

Methods and apparatus for optical blur modeling for improved video encoding

Patent number: 10979704

Abstract: Methods and apparatus of generating a refined reference frame for inter-frame encoding by applying blur parameters to allow encoding of image frames having blurred regions are presented herein. The methods and apparatus may identify a blurred region of an image frame by comparing the image frame with a reference frame, generate a refined reference frame by applying the blur parameter indicative of the blurred region to the reference frame, determine whether to use one of the reference frame and refined reference frame to encode the image frame, and encode the image frame using the refined reference frame when determined to use the refined reference frame.

Type: Grant

Filed: May 4, 2015

Date of Patent: April 13, 2021

Assignees: Advanced Micro Devices, Inc., ATI Technologies ULC

Inventors: Ihab M. A. Amer, Khaled Mammou, Vladyslav S. Zakharchenko, Dmytro U. Elperin
