Patents Assigned to Advanced Micros Devices, Inc.
-
Patent number: 11200106Abstract: A data processing system includes a memory channel, a memory coupled to the memory channel, and a data processor. The data processor is coupled to the memory channel and accesses the memory over the memory channel using a packet structure defining a plurality of commands and having corresponding address bits, data bits, and user bits. The data processor communicates with the memory over the memory channel using a first type of error code. In response to a write access request, the data processor calculates a different, second type of error code and appends each bit of the second type of error code as a corresponding one of the user bits. The memory stores the user bits in the memory in response to a write command, and transfers the user bits to the data processor in a read response packet in response to a read command.Type: GrantFiled: December 6, 2019Date of Patent: December 14, 2021Assignee: Advanced Micro Devices, Inc.Inventors: Kedarnath Balakrishnan, James R. Magro, Kevin Michael Lepak, Vilas Sridharan
-
Patent number: 11200724Abstract: A texture processor based ray tracing accelerator method and system are described. The system includes a shader, texture processor (TP) and cache, which are interconnected. The TP includes a texture address unit (TA), a texture cache processor (TCP), a filter pipeline unit and a ray intersection engine. The shader sends a texture instruction which contains ray data and a pointer to a bounded volume hierarchy (BVH) node to the TA. The TCP uses an address provided by the TA to fetch BVH node data from the cache. The ray intersection engine performs ray-BVH node type intersection testing using the ray data and the BVH node data. The intersection testing results and indications for BVH traversal are returned to the shader via a texture data return path. The shader reviews the intersection results and the indications to decide how to traverse to the next BVH node.Type: GrantFiled: December 22, 2017Date of Patent: December 14, 2021Assignee: Advanced Micro Devices, Inc.Inventors: Skyler Jonathon Saleh, Maxim V. Kazakov, Vineet Goel
-
Patent number: 11201104Abstract: A thermal management system includes an integrated circuit having an active side including a control circuit and a backside including a first set of electrodes distributed across the backside. The thermal management system includes a heat exchanger having a surface including a second set of electrodes. The thermal management system includes a thermal interface material including thermally conductive particles suspended in a fluid. The thermal interface material is disposed between the backside of the integrated circuit and the surface of the heat exchanger. The control circuit is configured to apply an electric field to the thermal interface material using a first electrode of the first set of electrodes and a second electrode of the second set of electrodes to excite at least some of the thermally conductive particles between the first electrode and the second electrode.Type: GrantFiled: December 30, 2019Date of Patent: December 14, 2021Assignee: Advanced Micro Devices, Inc.Inventors: Andrew J. McNamara, Swagata P. Kalve, Christopher M. Jaggers
-
Publication number: 20210383527Abstract: A technique for generating a trained discriminator is provided. The technique includes applying one or more of a glitched image or an unglitched image to a discriminator; receiving classification output from the discriminator; adjusting weights of the discriminator to improve classification accuracy of the discriminator; applying noise to a generator; receiving an output image from the generator; applying the output image to the discriminator to obtain a classification; and adjusting weights of one of the discriminator or the generator to improve ability of the generator to reduce classification accuracy of the discriminator, based on the classification.Type: ApplicationFiled: September 23, 2020Publication date: December 9, 2021Applicants: Advanced Micro Devices, Inc., ATI Technologies ULCInventors: Nicholas Malaya, Max Kiehn
-
Publication number: 20210383528Abstract: A technique for detecting a glitch in an image is provided. The technique includes providing an image to a plurality of individual classifiers to generate a plurality of individual classifier outputs and providing the plurality of individual classifier outputs to an ensemble classifier to generate a glitch classification.Type: ApplicationFiled: September 23, 2020Publication date: December 9, 2021Applicants: Advanced Micro Devices, Inc., ATI Technologies ULCInventors: Nicholas Malaya, Max Kiehn, Stanislav Ivashkevich
-
Publication number: 20210382661Abstract: A memory controller selects from among a plurality of memory access commands including volatile memory reads, volatile memory writes, non-volatile memory reads, and non-volatile memory writes. The selected memory access commands are transmitted to a heterogenous memory channel coupled to a non-volatile memory and a volatile memory. The non-volatile read commands that are transmitted are stored in a non-volatile command queue (NV queue). A ready response is received from the non-volatile memory indicating that responsive data is available for an associated one of the non-volatile read commands. In response to receiving the ready response, a send command is transmitted for commanding the non-volatile memory to send the responsive data.Type: ApplicationFiled: August 23, 2021Publication date: December 9, 2021Applicant: Advanced Micro Devices, Inc.Inventors: James R. Magro, Kedarnath Balakrishnan
-
Patent number: 11196657Abstract: A system for automatically discovering fabric topology includes at least one or more processing units, one or more memory devices, a security processor, and a communication fabric with an unknown topology coupled to the processing unit(s), memory device(s), and security processor. The security processor queries each component of the fabric to retrieve various attributes associated with the component. The security processor utilizes the retrieved attributes to create a network graph of the topology of the components within the fabric. The security processor generates routing tables from the network graph and programs the routing tables into the fabric components. Then, the fabric components utilize the routing tables to determine how to route incoming packets.Type: GrantFiled: December 21, 2017Date of Patent: December 7, 2021Assignee: Advanced Micro Devices, Inc.Inventors: Vydhyanathan Kalyanasundharam, Eric Christopher Morton, Alan Dodson Smith, Joe G. Cruz
-
Patent number: 11195326Abstract: Described herein are techniques for improving the effectiveness of depth culling. In a first technique, a binner is used to sort primitives into depth bins. Each depth bin covers a range of depths. The binner transmits the depth bins to the screen space pipeline for processing in near-to-far order. Processing the near bins first results in the depth buffer being updated, allowing fragments for the primitives in the farther bins to be culled more aggressively than if the depth binning did not occur. In a second technique, a buffer is used to initiate two-pass processing through the screen space pipeline. In the first pass, primitives are sent down to update the depth block and are then culled. The fragments are processed normally in the second pass, with the benefit of the updated depth values.Type: GrantFiled: September 21, 2018Date of Patent: December 7, 2021Assignee: Advanced Micro Devices, Inc.Inventors: Ruijin Wu, Young In Yeo, Sagar S. Bhandare, Vineet Goel, Martin G. Sarov, Christopher J. Brennan
-
Patent number: 11194382Abstract: A processing system includes a memory controller that preemptively exits a dynamic random access (DRAM) integrated circuit rank from a low power mode such as power down mode based on a predicted time when the memory controller will receive a request to access the DRAM rank. The memory controller tracks how long after a DRAM rank enters the low power mode before a request to access the DRAM rank is received by the memory controller. Based on a history of the timing of access requests, the memory controller predicts for each DRAM rank a predicted time reflecting how long after entering low power mode a request to access each DRAM rank is expected to be received. The memory controller speculatively exits the DRAM rank from the low power mode based on the predicted time and prior to receiving a request to access the DRAM IC rank.Type: GrantFiled: October 16, 2018Date of Patent: December 7, 2021Assignee: ADVANCED MICRO DEVICES, INC.Inventor: Kedarnath Balakrishnan
-
Patent number: 11194740Abstract: Described is a method and apparatus for application migration between a dockable device and a docking station in a seamless manner. The dockable device includes a processor and the docking station includes a high-performance processor. The method includes determining a docking state of a dockable device while at least an application is running. Application migration from the dockable device to a docking station is initiated when the dockable device is moving to a docked state. Application migration from the docking station to the dockable device is initiated when the dockable device is moving to an undocked state. The application continues to run during the application migration from the dockable device to the docking station or during the application migration from the docking station to the dockable device.Type: GrantFiled: December 6, 2019Date of Patent: December 7, 2021Assignee: Advanced Micro Devices, Inc.Inventors: Jonathan Lawrence Campbell, Yuping Shen
-
Patent number: 11194634Abstract: In some examples, thermal aware optimization logic determines a characteristic (e.g., a workload or type) of a wavefront (e.g., multiple threads). For example, the characteristic indicates whether the wavefront is compute intensive, memory intensive, mixed, and/or another type of wavefront. The thermal aware optimization logic determines temperature information for one or more compute units (CUs) in one or more processing cores. The temperature information includes predictive thermal information indicating expected temperatures corresponding to the one or more CUs and historical thermal information indicating current or past thermal temperatures of at least a portion of a graphics processing unit (GPU). The logic selects the one or more compute units to process the plurality of threads based on the determined characteristic and the temperature information. The logic provides instructions to the selected subset of the plurality of CUs to execute the wavefront.Type: GrantFiled: December 14, 2018Date of Patent: December 7, 2021Assignee: Advanced Micro Devices, Inc.Inventors: Karthik Rao, Shomit N. Das, Xudong An, Wei Huang
-
Patent number: 11194583Abstract: Speculative execution using a page-level tracked load order queue includes: determining that a first load instruction targets a determined memory region; and in response to the first load instruction targeting the determined memory region, adding an entry to a page-level tracked load order queue instead of a load order queue, where the entry indicates a page address of a target of the first load instruction.Type: GrantFiled: October 21, 2019Date of Patent: December 7, 2021Assignee: ADVANCED MICRO DEVICES, INC.Inventor: Krishnan V. Ramani
-
Publication number: 20210374607Abstract: A device is disclosed. The device includes a machine learning die including a memory and one or more machine learning accelerators; and a processing core die stacked with the machine learning die, the processing core die being configured to execute shader programs for controlling operations on the machine learning die, wherein the memory is configurable as either or both of a cache and a directly accessible memory.Type: ApplicationFiled: December 21, 2020Publication date: December 2, 2021Applicant: Advanced Micro Devices, Inc.Inventors: Maxim V. Kazakov, Swapnil P. Sakharshete, Milind N. Nemlekar, Vineet Goel
-
Publication number: 20210377552Abstract: A system configured to perform scalable video encoding is provided. The system includes a memory; and a processing unit, wherein the processing unit is configured to: receive inter-layer data and a current picture, wherein the current picture has a base layer; upsample the inter-layer data to generate residual data and reconstruction data, wherein the inter-layer data includes a base mode flag; and encode the current picture to an enhanced layer using the upsampled inter-layer data based on a block type of the base layer and the base mode flag.Type: ApplicationFiled: August 16, 2021Publication date: December 2, 2021Applicants: Advanced Micro Devices, Inc., ATI Technologies ULCInventors: Lei Zhang, Ji Zhou, Zhen Chen, Min Yu
-
Publication number: 20210374006Abstract: A memory controller interfaces with a dynamic random access memory (DRAM). The memory controller selectively places memory commands in a memory interface queue, and transmits the commands from the memory interface queue to a memory channel connected to at least one dynamic random access memory (DRAM). The transmitted commands are stored in a replay queue. A number of activate commands to a memory region of the DRAM is counted. Based on this count, a refresh control circuit signals that an urgent refresh command should be sent to the memory region. In response to detecting a designated type of error, a recovery sequence initiates to re-transmit memory commands from the replay queue. Designated error conditions can cause the recovery sequence to restart. If an urgent refresh command is pending when such a restart occurs, the recovery sequence is interrupted to allow the urgent refresh command to be sent.Type: ApplicationFiled: May 29, 2020Publication date: December 2, 2021Applicant: Advanced Micro Devices, Inc.Inventors: Jing Wang, James R. Magro, Kedarnath Balakrishnan
-
Publication number: 20210373957Abstract: Techniques for scheduling operations for a task graph on a processing device are provided. The techniques include receiving a task graph that specifies one or more passes, one or more resources, and one or more directed edges between passes and resources; identifying independent passes and dependent passes of the task graph; based on performance criteria of the processing device, scheduling commands to execute the passes; and transmitting scheduled commands to the processing device for execution as scheduled.Type: ApplicationFiled: May 29, 2020Publication date: December 2, 2021Applicants: Advanced Micro Devices, Inc., ATI Technologies ULCInventors: Steven J. Tovey, Zhuo Chen, David Ronald Oldcorn
-
Publication number: 20210373892Abstract: Techniques for generating a task graph for workload scheduling based on a task graph specification program are provided. The techniques include executing control flow instructions of the task graph specification program to traverse the task graph specification program; generating pass nodes of the task graph based on pass instructions of the task graph specification program; generating resource nodes and directed edges based on resource declarations of the task graph specification program; and outputting the task graph specification program to a command scheduler for scheduling.Type: ApplicationFiled: May 29, 2020Publication date: December 2, 2021Applicants: Advanced Micro Devices, Inc., ATI Technologies ULCInventors: Steven J. Tovey, Zhuo Chen, David Ronald Oldcorn
-
Patent number: 11188640Abstract: A method includes establishing an isolated execution environment for executing a platform firmware operating mode subroutine in a platform firmware operating mode. In response to receiving an interrupt, the platform firmware operating mode subroutine is executed in the isolated execution environment. In response to detecting an attempted access of a hardware resource resulting from execution of the platform firmware operating mode subroutine, the attempted access is blocked when the attempted access violates a security policy.Type: GrantFiled: August 23, 2018Date of Patent: November 30, 2021Assignee: Advanced Micro Devices, Inc.Inventors: Jeremy W Powell, David A Kaplan
-
Patent number: 11189540Abstract: Various semiconductor chip devices with stacked chips are disclosed. In one aspect, a semiconductor chip device is provided. The semiconductor chip device includes a first semiconductor chip that has a floor plan with a high heat producing area and a low heat producing area. At least one second semiconductor chip is stacked on the low heat producing area. The semiconductor chip device also includes means for transferring heat from the high heat producing area.Type: GrantFiled: September 6, 2019Date of Patent: November 30, 2021Assignee: ADVANCED MICRO DEVICES, INC.Inventors: John Wuu, Samuel Naffziger, Patrick J. Shyvers, Milind S. Bhagavat, Kaushik Mysore, Brett P. Wilkerson
-
Patent number: 11188406Abstract: Detecting execution hazards in offloaded operations is disclosed. A second offload operation is compared to a first offload operation that precedes the second offload operation. It is determined whether the second offload operation creates an execution hazard on an offload target device based on the comparison of the second offload operation to the first offload operation. If the execution hazard is detected, an error handling operation may be performed. In some examples, the offload operations are processing-in-memory operations.Type: GrantFiled: March 31, 2021Date of Patent: November 30, 2021Assignee: ADVANCED MICRO DEVICES, INC.Inventors: Johnathan Alsop, Shaizeen Aga