Patents Assigned to Advanced Micros Devices, Inc.
-
Publication number: 20190037240Abstract: A processing device is provided which includes memory configured to store data and a processor. The processor is configured to receive a plurality of panoramic video images representing views around a point in a three dimensional (3D) space and warp the plurality of panoramic video images, using a panoramic format, into a plurality of formatted warped images. The processor is also configured to store, in the memory, the plurality of formatted warped images and perform a motion search around each co-located pixel block of a reference panoramic frame by limiting the motion searches in a vertical direction around the co-located pixel blocks.Type: ApplicationFiled: July 27, 2017Publication date: January 31, 2019Applicant: Advanced Micro Devices, Inc.Inventor: Michael L. Schmit
-
Publication number: 20190037097Abstract: A computer vision processing device is provided which comprises memory configured to store data and a processor. The processor is configured to store captured image data in a first buffer and acquire access to the captured image data in the first buffer when the captured image data is available for processing. The processor is also configured to execute a first group of operations in a processing pipeline, each of which processes the captured image data accessed from the first buffer and return the first buffer for storing next captured image data when a last operation of the first group of operations executes.Type: ApplicationFiled: July 28, 2017Publication date: January 31, 2019Applicant: Advanced Micro Devices, Inc.Inventors: Radhakrishna Giduthuri, Michael L. Schmit
-
Publication number: 20190033939Abstract: A data processing system includes a power manager for providing a power event depth signal in response to a power event request signal. A plurality of real-time clients is coupled to the power manager. Each real-time client includes a client buffer that has a plurality of entries for storing data. The real-time client also includes a register for storing a watermark threshold for the client buffer, as well as logic for providing an allow signal when a number of valid entries in the client buffer exceeds the watermark threshold. A power management state machine is coupled to each of the plurality of real-time clients. The power management state machine provides a power event start signal in response to all of the plurality of real-time clients providing respective allow signals.Type: ApplicationFiled: July 28, 2017Publication date: January 31, 2019Applicant: Advanced Micro Devices, Inc.Inventors: Sonu Arora, Alexander Branover, Benjamin Tsien
-
Publication number: 20190034251Abstract: Described herein are a method and apparatus for memory vulnerability prediction. A memory vulnerability predictor predicts the reliability of a memory region when it is first accessed, based on past program history. The memory vulnerability predictor uses a table to store reliability predictions and predicts reliability needs of a new memory region. A memory management module uses the reliability information to make decisions, (such as to guide memory placement policies in a heterogeneous memory system).Type: ApplicationFiled: July 28, 2017Publication date: January 31, 2019Applicant: Advanced Micro Devices, Inc.Inventors: Vilas Sridharan, David A. Roberts
-
Publication number: 20190034151Abstract: A technique for implementing synchronization monitors on an accelerated processing device (“APD”) is provided. Work on an APD includes workgroups that include one or more wavefronts. All wavefronts of a workgroup execute on a single compute unit. A monitor is a synchronization construct that allows workgroups to stall until a particular condition is met. Responsive to all wavefronts of a workgroup executing a wait instruction, the monitor coordinator records the workgroup in an “entry queue.” The workgroup begins saving its state to a general APD memory and, when such saving is complete, the monitor coordinator moves the workgroup to a “condition queue.” When the condition specified by the wait instruction is met, the monitor coordinator moves the workgroup to a “ready queue,” and, when sufficient resources are available on a compute unit, the APD schedules the ready workgroup for execution on a compute unit.Type: ApplicationFiled: July 27, 2017Publication date: January 31, 2019Applicant: Advanced Micro Devices, Inc.Inventors: Alexandru Dutu, Bradford M. Beckmann
-
Patent number: 10191873Abstract: A method of and device for transferring data is provided. The method includes determining a difference between a data segment that was transferred last relative to each of one or more data segments available to be transferred next. In some embodiments, for so long as no data segment available to be sent has been waiting too long, the data segment chosen to be sent next is the data segment having the smallest difference relative to the data segment transferred last. The chosen data segment is then transmitted as the next data segment transferred.Type: GrantFiled: December 20, 2012Date of Patent: January 29, 2019Assignee: Advanced Micro Devices, Inc.Inventor: Greg Sadowski
-
Publication number: 20190028725Abstract: A system and method for scalable video coding that includes base layer having lower resolution encoding, enhanced layer having higher resolution encoding and the data transferring between two layers. The system and method provides several methods to reduce bandwidth of inter-layer transfers while at the same time reducing memory requirements. Due to less memory access, the system clock frequency can be lowered so that system power consumption is lowered as well. The system avoids having prediction data from base layer to enhanced layer to be up-sampled for matching resolution in the enhanced layer as transferring up-sampled data can impose a big burden on memory bandwidth.Type: ApplicationFiled: September 10, 2018Publication date: January 24, 2019Applicants: Advanced Micro Devices, Inc., ATI Technologies ULCInventors: Lei Zhang, Ji Zhou, Zhen Chen, Min Yu
-
Patent number: 10185498Abstract: A memory system includes a write buffer, a main memory having a higher latency than the write buffer, and a memory controller. In response to a write request indicating first data for storing at a write address in the main memory, the memory controller adds a new write entry in the write buffer, where the new write entry includes the write address and the first data, and updates a pointer of a previous write entry in the write buffer to point to the new write entry. In response to a write-back instruction, the memory controller traverses a plurality of write entries stored in the write buffer, and writes into the main memory second data of the previous write entry and the first data of the new write entry.Type: GrantFiled: June 16, 2016Date of Patent: January 22, 2019Assignee: Advanced Micro Devices, Inc.Inventor: David A. Roberts
-
Patent number: 10185604Abstract: Methods and apparatus of interleaving two or more workloads are presented herein. The methods and apparatus may comprise a schedule controller and a coprocessor. The schedule controller is operative to utilize the first storage unit to manage context stored therein that allows for the coprocessor to interleave the two or more workloads that can be directly supported by the first storage unit. The coprocessor includes a dedicated first storage unit and an engine.Type: GrantFiled: October 31, 2014Date of Patent: January 22, 2019Assignee: Advanced Micro Devices, Inc.Inventor: Winthrop Wu
-
Patent number: 10186510Abstract: A system and method for creating a layout for a vertical gate all around standard cell are described. Metal gate is placed all around two vertical nanowire sheets formed on a silicon substrate. A gate contact is formed on the metal gate between the two vertical nanowire sheets. Gate extension metal (GEM) is placed above the metal gate at least on the gate contact. A via for a gate is formed at a location on the GEM where a local interconnect layer is available to be used for routing a gate connection. Local metal layers are placed for connecting local routes and power connections.Type: GrantFiled: June 26, 2017Date of Patent: January 22, 2019Assignee: Advanced Micro Devices, Inc.Inventor: Richard T. Schultz
-
Publication number: 20190018664Abstract: Methods of compiling source code are provided. A method includes identifying a first array of structures (AOS), having a plurality of array elements, each array element being a structure with a plurality of fields, and performing structure peeling on the first AOS to convert a data layout of the first AOS to an array of structure of arrays (AOSOA) including a plurality of memory blocks of uniform block size. At least one of the plurality of memory blocks is allocated for each field of the plurality of fields. The method further includes allocating a number of complete memory blocks to accommodate all of the plurality of array elements of the AOS.Type: ApplicationFiled: July 14, 2017Publication date: January 17, 2019Applicant: Advanced Micro Devices, Inc.Inventors: Suresh Mani, Dibyendu Das, Shivarama Rao, Ashutosh Nema
-
Publication number: 20190018699Abstract: A technique for recovering from a hang in a virtualized accelerated processing device (“APD”) is provided. In the virtualization scheme, different virtual machines are assigned different “time-slices” in which to use the APD. When a time-slice expires, the APD stops operations for a current VM and starts operations for another VM. To stop operations on the APD, a virtualization scheduler sends a request to idle the APD. The APD responds by completing work and idling. If one or more portions of the APD do not complete this idling process before a timeout expires, then a hang occurs. In response to the hang, the virtualization scheduler informs the hypervisor that a hang has occurred. The hypervisor performs a function level reset on the APD and informs the VM that the hang has occurred. The VM responds by stopping command issue to the APD and re-initializing the APD for the function.Type: ApplicationFiled: July 28, 2017Publication date: January 17, 2019Applicants: Advanced Micro Devices, Inc., ATI Technologies ULCInventors: Anthony Asaro, Yinan Jiang, Andy Sung, Ahmed M. Abdelkhalek, Xiaowei Wang, Sidney D. Fortes
-
Patent number: 10180826Abstract: A compiler generates transfer functions for blocks of a program during compilation of the program. The transfer functions estimate bit widths of variables in the blocks based on numbers of bits needed to carry out at least one instruction in the blocks and whether the variables are live in the blocks. For example, a transfer function may return a number indicating how many bits of a variable are needed to execute a current instruction as a function of the number of bits of the variable used by the program in subsequent instructions. Numbers of bits to represent the variables in the compiled program based on the transfer functions.Type: GrantFiled: October 22, 2015Date of Patent: January 15, 2019Assignee: Advanced Micro Devices, Inc.Inventors: Prakash Sathyanath Raghavendra, Dibyendu Das, Arun Rangasamy
-
Patent number: 10180789Abstract: Systems, apparatuses, and methods for implementing software control of state sets are disclosed. In one embodiment, a processor includes at least an execution unit and a plurality of state registers. The processor is configured to detect a command to allocate a first state set for storing a first state, wherein the command is generated by software, and wherein the first state specifies values for the plurality of state registers. The command is executed on the execution unit while the processor is in a second state, wherein the second state is different from the first state. The first state set of the processor is allocated with the first state responsive to executing the command on the execution unit. The processor is configured to allocate the first state set for the first state prior to the processor entering the first state.Type: GrantFiled: January 26, 2017Date of Patent: January 15, 2019Assignee: Advanced Micro Devices, Inc.Inventors: Rex Eldon McCrary, Michael J. Mantor, Alexander Fuad Ashkar, Harry J. Wise
-
Publication number: 20190013051Abstract: A system, method, and computer program product are provided for a memory device system. One or more memory dies and at least one logic die are disposed in a package and communicatively coupled. The logic die comprises a processing device configurable to manage virtual memory and operate in an operating mode. The operating mode is selected from a set of operating modes comprising a slave operating mode and a host operating mode.Type: ApplicationFiled: September 12, 2018Publication date: January 10, 2019Applicant: ADVANCED MICRO DEVICES, INC.Inventors: Nuwan S. Jayasena, Gabriel H. Loh, Bradford M. Beckmann, James M. O'Connor, Lisa R. Hsu
-
Patent number: 10176122Abstract: A processor employs a hardware encryption module in the memory access path between an input/out device and memory to cryptographically isolate secure information. In some embodiments, the encryption module is located at a memory controller of the processor, and each memory access request provided to the memory controller includes VM tag value identifying the source of the memory access request. The VM tag is determined based on a requestor ID identifying the source of the memory access request. The encryption module performs encryption (for write accesses) or decryption (for read accesses) of the data associated with the memory access based on an encryption key associated with the VM tag.Type: GrantFiled: October 19, 2016Date of Patent: January 8, 2019Assignees: Advanced Micro Devices, Inc., ATI Technologies ULCInventors: David Kaplan, Maggie Chan, Philip Ng
-
Publication number: 20190005377Abstract: Training devices and methods for training an artificial neural network (ANN). The training device includes processing circuitry configured to transmit training data for the ANN and parameters for the ANN to an inference device. The processing circuitry is also configured to receive inference data, based on the training data and the parameters, from the inference device. The processing circuitry is also configured to receive inference timing information, based on the training data and the parameters, from the inference device. The processing circuitry is also configured to calculate a difference between the calculated inference data and expected inference data.Type: ApplicationFiled: June 30, 2017Publication date: January 3, 2019Applicant: Advanced Micro Devices, Inc.Inventor: Nicholas Malaya
-
Publication number: 20190004839Abstract: A technique for efficient time-division of resources in a virtualized accelerated processing device (“APD”) is provided. In a virtualization scheme implemented on the APD, different virtual machines are assigned different “time-slices” in which to use the APD. When a time-slice expires, the APD performs a virtualization context switch by stopping operations for a current virtual machine (“VM”) and starting operations for another VM. Typically, each VM is assigned a fixed length of time, after which a virtualization context switch is performed. This fixed length of time can lead to inefficiencies. Therefore, in some situations, in response to a VM having no more work to perform on the APD and the APD being idle, a virtualization context switch is performed “early.” This virtualization context switch is “early” in the sense that the virtualization context switch is performed before the fixed length of time for the time-slice expires.Type: ApplicationFiled: June 29, 2017Publication date: January 3, 2019Applicants: Advanced Micro Devices, Inc., ATI Technologies ULCInventors: Gongxian Jeffrey Cheng, Louis Regniere, Anthony Asaro
-
Patent number: 10169244Abstract: The described embodiments perform a method for handling memory accesses by virtual machines in a computing device. The described embodiments include a reverse map table (RMT) and a separate guest accessed pages table (GAPT) for each virtual machine. The RMT has a plurality of entries, each entry including information for identifying a virtual machine that is permitted to access an associated page of data in a memory. Each GAPT has a record of pages being accessed by a corresponding virtual machine. During operation, a table walker receives a request from a given virtual machine to translate a guest physical address to a system physical address. The table walker checks at least one of the RMT and a corresponding GAPT to determine whether the given virtual machine has access to a corresponding page. If not, the table walker terminates the translating. Otherwise, the table walker completes the translating.Type: GrantFiled: July 29, 2016Date of Patent: January 1, 2019Assignee: ADVANCED MICRO DEVICES, INC.Inventors: David A. Kaplan, Jeremy W. Powell, Thomas R. Woller
-
Patent number: 10169906Abstract: A system, method and a computer program product are provided for hybrid rendering with deferred primitive batch binning. A primitive batch is generated from a sequence of primitives. Initial bin intercepts are identified for primitives in the primitive batch. A bin for processing is identified. The bin corresponds to a region of a screen space. Pixels of the primitives intercepting the identified bin are processed. Next bin intercepts are identified while the primitives intercepting the identified bin are processed.Type: GrantFiled: March 29, 2013Date of Patent: January 1, 2019Assignees: Advanced Micro Devices, Inc., ATI Technologies ULCInventors: Michael Mantor, Laurent Lefebvre, Mark Fowler, Timothy Kelley, Mikko Alho, Mika Tuomi, Kiia Kallio, Patrick Klas Rudolf Buss, Jari Antero Komppa, Kaj Tuomi