Abstract: An internal bus bridge architecture and method is described. Embodiments include a system with multiple bus endpoints coupled to a bus root via a host bus bridge that is internal to at least one bus endpoint. In addition, the bus endpoints are directly coupled to each other. Embodiments are usable with known bus protocols.
Abstract: By controlling the flow rate of one or more gaseous components of an etch ambient during the formation of metal lines and vias on the basis of feedback measurement data from critical dimensions, process variations may be reduced, thereby enhancing performance and reliability of the respective metallization structure.
Abstract: An approach and a method for efficient execution of nested map-reduce framework workloads to take advantage of the combined execution of central processing units (CPUs) and graphics processing units (GPUs) and lower latency of data access in accelerated processing units (APUs) is described. In embodiments, metrics are generated to determine whether a map or reduce function is more efficiently processed on a CPU or a GPU. A first metric is based on ratio of a number of branch instructions to a number of non-branch instructions, and a second metric is based on the comparison of execution times on each of the CPU and the GPU. Selecting execution of map and reduce functions based on the first and second metrics result in accelerated computations. Some embodiments include scheduling pipelined executions of functions on the CPU and functions on the GPU concurrently to achieve power-efficient nested map reduce framework execution.
Type:
Application
Filed:
May 9, 2013
Publication date:
November 13, 2014
Applicant:
Advanced Micro Devices, Inc.
Inventors:
Patryk KAMINSKI, Mauricio Breternitz, Gary R. Frost, Christophe Harle
Abstract: A method, computer program product, and system is described that determines the correctness of using memory operations in a computing device with heterogeneous computer components. Embodiments include an optimizer based on the characteristics of a Sequential Consistency for Heterogeneous-Race-Free (SC for HRF) model that analyzes a program and determines the correctness of the ordering of events in the program. HRF models include combinations of the properties: scope order, scope inclusion, and scope transitivity. The optimizer can determine when a program is heterogeneous-race-free in accordance with an SC for HRF memory consistency model . For example, the optimizer can analyze a portion of program code, respect the properties of the SC for HRF model, and determine whether a value produced by a store memory event will be a candidate for a value observed by a load memory event. In addition, the optimizer can determine whether reordering of events is possible.
Type:
Application
Filed:
May 12, 2014
Publication date:
November 13, 2014
Applicant:
Advanced Micro Devices, Inc.
Inventors:
Derek R. HOWER, Mark D. Hill, David Wood, Steven K. Reinhardt, Benedict R. Gaster, Blake A. Hechtman, Bradford M. Beckmann
Abstract: Embodiments are described for a method for processing textures for a mesh comprising quadrilateral polygons for real-time rendering of an object or model in a graphics processing unit (GPU), comprising associating an independent texture map with each face of the mesh to produce a plurality of face textures, packing the plurality of face textures into a single texture atlas, wherein the atlas is divided into a plurality of blocks based on a resolution of the face textures, adding a border to the texture map for each face comprising additional texels including at least border texels from an adjacent face texture map, and performing linear interpolation of a trilinear filtering operation on the face textures to resolve resolution discrepancies caused when crossing an edge of a polygon.
Abstract: The described embodiments comprise an embedded management controller for managing a server on a sled device. The embedded management controller on the sled device comprises a processing mechanism; a plurality of internal interfaces coupled to the processing mechanism, each internal interface coupled to a corresponding interface of the server; and an external interface coupled to the processing mechanism, the external interface configured to be coupled to a system management controller separate from the sled device. In these embodiments, the processing mechanism is configured to communicate with the server using the internal interfaces, and is configured to selectively communicate information based on communications with the server to the system management controller using the external interface and commands to the server based on communications received from the system management controller using the external interface.
Type:
Application
Filed:
May 13, 2013
Publication date:
November 13, 2014
Applicant:
Advanced Micro Devices, Inc.
Inventors:
Hari Ramachandran, Ravi Bingi, Ranger H. Lam
Abstract: A server system includes an array of server cells. Some or all of the server cells include a set of at least three side panels forming an enclosure and a compute component comprising a processor core. At least one side panel of each server cell is removably mechanically coupled and removably electrically coupled to a facing side panel of an adjacent server cell. The enclosure may form a triangular prism enclosure, a cuboid enclosure, a hexagonal prism enclosure, etc. The enclosure can be formed from a rigid flex printed circuit board (PCB) assembly, whereby the side panels are implemented as rigid PCB sections that are interconnected via flexible PCB sections, with the flexible PCB sections forming corners between the rigid PCB sections when the rigid-flex PCB assembly is folded into the enclosure shape. The compute component and other circuit components are disposed at the interior surfaces of the rigid PCB sections.
Abstract: A method and apparatus are described for managing a plurality of performance monitoring resources residing in a plurality of cores of a processor. A plurality of resource queues are maintained. Each resource queue corresponds to a particular one of the performance monitoring resources, and detects conflicts in use of the particular performance monitoring resource by multiple users. The detected conflicts associated with the particular performance monitoring resource are then resolved. A dynamic resource scheduler is used to resolve the detected conflicts, and is driven by an advanced programmable interrupt controller (APIC) timer residing in a particular core of the processor to provide each item, in an items list of a resource queue associated with the particular performance monitoring resource, an equal opportunity to use the particular performance monitoring resource for a predetermined period of time.
Abstract: A semiconductor device includes a semiconductor substrate and at least one integrated circuit formed on a frontside of the semiconductor substrate. A shielding layer is formed on a backside of the semiconductor substrate. The shielding layer includes one or more elements having a high thermal neutron absorption cross section.
Abstract: Disclosed herein methods, apparatuses, and systems for performing graphics processing. In this regard, a processing unit includes a tessellation module and a connectivity module. The tessellation module is configured to sequentially tessellate portions of a geometric shape to provide a series of tessellation points for the geometric shape. The connectivity module is configured to connect one or more groups of the tessellation points into one or more primitives in an order in which the series of tessellation points is provided.
Type:
Grant
Filed:
February 18, 2010
Date of Patent:
November 11, 2014
Assignee:
Advanced Micro Devices, Inc.
Inventors:
Vineet Goel, Jason David Carroll, Brian Buchner, Mangesh Nijasure
Abstract: Methods, apparatus, and fabrication techniques relating to management of noise arising from capacitance in a clock tree of an integrated circuit. In some embodiments, the methods comprise receiving a signal to adjust a clock having a first rate to a second rate; and ramping, in response to receiving the signal, the clock from the first rate to the second rate, wherein the ramping comprises changing the frequency of the clock to at least one third rate between the first and second rates.
Abstract: Methods, apparatus, and fabrication techniques relating to improved propagation of fuse data through an integrated circuit device during scan shift reset. In some embodiments, the methods comprise loading a first value of at least one fuse bit to an integrated circuit device, during a time period when a clock signal having a first frequency is provided to at least one component of the integrated circuit device; disabling a scan shift after the loading of the first value; inactivating the clock signal after the loading of the first value; propagating the first value of the at least one fuse bit to the at least one component of the integrated circuit device; and reactivating the clock signal after the propagation of the first value.
Abstract: The present invention provides a method and apparatus for saving and restoring soft repair information. One embodiment of the method includes storing soft repair information for one or more cache arrays implemented in a processor core in a memory element outside of the processor core in response to determining that a voltage supply to the processor core is to be disconnected.
Type:
Grant
Filed:
August 24, 2010
Date of Patent:
November 11, 2014
Assignee:
Advanced Micro Devices, Inc.
Inventors:
Bill K. Kwan, Atchyuth K. Gorti, Norm Hack, David Kaplan
Abstract: The present disclosure relates to a method, system, and apparatus for configuring a computing system, such as a cloud computing system. A method includes, based on user selections received via a user interface, configuring a cluster of nodes by selecting the cluster of nodes from a plurality of available nodes, selecting a workload container module from a plurality of available workload container modules for operation on each node of the selected cluster of nodes, and selecting a workload for execution with the workload container on the cluster of nodes. Each node of the cluster of nodes includes at least one processing device and memory, and the cluster of nodes is operative to share processing of a workload.
Type:
Grant
Filed:
August 7, 2012
Date of Patent:
November 11, 2014
Assignee:
Advanced Micro Devices, Inc.
Inventors:
Mauricio Breternitz, Keith A. Lowery, Patryk Kaminski, Anton Chernoff
Abstract: During a replacement gate approach, the inverse tapering of the opening obtained after removal of the polysilicon material may be reduced by depositing a spacer layer and forming corresponding spacer elements on inner sidewalls of the opening. Consequently, the metal-containing gate electrode material and the high-k dielectric material may be deposited with enhanced reliability.
Type:
Grant
Filed:
May 20, 2013
Date of Patent:
November 11, 2014
Assignee:
Advanced Micro Devices, Inc.
Inventors:
Kai Frohberg, Uwe Griebenow, Katrin Reiche, Heike Berthold
Abstract: An interface couples a plurality of compute units to a power management controller. The interface conveys a power report for the plurality of compute units to the power management controller. The power management controller receives the power report, determines a power action for the plurality of compute units based at least in part on the power report, and transmits a message specifying the power action through the interface. The power action is performed.
Type:
Application
Filed:
May 1, 2013
Publication date:
November 6, 2014
Applicant:
Advanced Micro Devices, Inc.
Inventors:
Alexander Branover, Steven Kommrusch, Marvin Denman, Maurice Steinman
Abstract: A phase locked loop (PLL) system includes a PLL and a calibration circuit. The PLL has a reference clock input, a voltage controlled oscillator (VCO) clock output, and a feedback clock output. The calibration circuit provides a reference clock signal to the reference clock input of the PLL, induces first and second phase disturbances between the reference clock signal and a feedback clock signal, measures respective first and second zero crossing times of a phase error between the reference clock signal and the feedback clock signal, and estimates a bandwidth of the PLL in response to an average of the first and second zero crossing times.
Type:
Application
Filed:
May 6, 2013
Publication date:
November 6, 2014
Applicant:
Advanced Micro Devices, Inc.
Inventors:
Meei-Ling Chiang, Boon-Aik Ang, Dennis Fischette, Jr.
Abstract: An integrated circuit device includes a memory cell coupled to a supply voltage line to receive a supply voltage and a voltage control circuit operable to reduce a magnitude of the supply voltage prior to a write cycle to the memory cell. The voltage control circuit includes a first capacitor that is selectively coupled between a supply voltage line and a first reference supply voltage line of the integrated circuit device in anticipation of a write cycle to the memory cell.
Abstract: The described embodiments include a networking subsystem in a second computing device that is configured to receive a task message from a first computing device. Based on the task message, the networking subsystem updates an entry in a task queue with task information from the task message. A processing subsystem in the second computing device subsequently retrieves the task information from the task queue and performs the corresponding task. In these embodiments, the networking subsystem processes the task message (e.g., stores the task information in the task queue) without causing the processing subsystem to perform operations for processing the task message.
Type:
Application
Filed:
January 26, 2014
Publication date:
November 6, 2014
Applicant:
Advanced Micro Devices, Inc.
Inventors:
Steven K. Reinhardt, Michael L. Chu, Vinod Tipparaju, Walter B. Benton
Abstract: The described embodiments include a program code testing system that determines the vulnerability of multi-threaded program code to soft errors. For multi-threaded program code, two to more threads from the program code may access shared architectural structures while the program code is being executed. The program code testing system determines accesses of architectural structures made by the two or more threads of the multi-threaded program code and uses the determined accesses to determine a time for which the program code is exposed to soft errors. From this time, the program code testing system determines a vulnerability of the program code to soft errors.
Type:
Application
Filed:
April 30, 2014
Publication date:
November 6, 2014
Applicant:
Advanced Micro Devices, Inc.
Inventors:
Vilas Sridharan, Mark E. Wilkening, Sudhanva Gurumurthi