Abstract: Existing multiprocessor computing systems often have insufficient memory coherency and, consequently, are unable to efficiently utilize separate memory systems. Specifically, a CPU cannot effectively write to a block of memory and then have a GPU access that memory unless there is explicit synchronization. In addition, because the GPU is forced to statically split memory locations between itself and the CPU, existing multiprocessor computing systems are unable to efficiently utilize the separate memory systems. Embodiments described herein overcome these deficiencies by receiving a notification within the GPU that the CPU has finished processing data that is stored in coherent memory, and invalidating data in the CPU caches that the GPU has finished processing from the coherent memory. Embodiments described herein also include dynamically partitioning a GPU memory into coherent memory and local memory through use of a probe filter.
Type:
Application
Filed:
August 31, 2012
Publication date:
October 3, 2013
Applicants:
ATI Technologies ULC, Advanced Micro Devices, Inc.
Inventors:
Anthony ASARO, Kevin NORMOYLE, Mark HUMMEL
Abstract: The present system enables receiving a request from an I/O device to translate a virtual address to a physical address to access the page in system memory. One or more memory attributes of the page defining a cacheability characteristic of the page is identified. A response including the physical address and the cacheability characteristic of the page is sent to the I/O device.
Type:
Application
Filed:
March 30, 2012
Publication date:
October 3, 2013
Applicants:
ATI Technologies ULC, Advanced Micro Devices, Inc.
Inventors:
Andrew KEGEL, Mark Hummel, Anthony Asaro
Abstract: Provided is a method of permitting the reordering of a visibility order of operations in a computer arrangement configured for permitting a first processor and a second processor threads to access a shared memory. The method includes receiving in a program order, a first and a second operation in a first thread and permitting the reordering of the visibility order for the operations in the shared memory based on the class of each operation. The visibility order determines the visibility in the shared memory, by a second thread, of stored results from the execution of the first and second operations.
Type:
Application
Filed:
August 17, 2012
Publication date:
October 3, 2013
Applicants:
Advanced Micro Devices, Inc., ATI Technologies ULC
Inventors:
Anthony Asaro, Kevin Normoyle, Mark Hummel
Abstract: Embodiments of the present invention provide a method of a first processor using a memory resource associated with a second processor. The method includes receiving a memory instruction from a first processor process, wherein the memory instruction refers to a shared memory address (SMA) that maps to a second processor memory. The method also includes mapping the SMA to the second processor memory, wherein the mapping produces a mapping result and providing the mapping result to the first processor.
Type:
Application
Filed:
August 17, 2012
Publication date:
October 3, 2013
Applicants:
Advanced Micro Devices, Inc., ATI Technologies ULC
Inventors:
Anthony Asaro, Kevin Normoyle, Mark Hummel
Abstract: A method and system for allocating memory to a memory operation executed by a processor in a computer arrangement having a first processor configured for unified operation with a second processor. The method includes receiving a memory operation from a processor and mapping the memory operation to one of a plurality of memory heaps. The mapping produces a mapping result. The method also includes providing the mapping result to the processor.
Type:
Application
Filed:
December 21, 2012
Publication date:
October 3, 2013
Applicants:
ATI Technologies ULC, Advanced Micro Devices, Inc.
Inventors:
Anthony Asaro, Kevin Normoyle, Mark Hummel
Abstract: A method of reducing the formation of electrically resistive scale on a an article comprising a silicon-containing ferritic stainless subjected to oxidizing conditions in service includes, prior to placing the article in service, subjecting the article to conditions under which silica, which includes silicon derived from the steel, forms on a surface of the steel. Optionally, at least a portion of the silica is removed from the surface to placing the article in service. A ferritic stainless steel alloy having a reduced tendency to form silica on at least a surface thereof also is provided. The steel includes a near-surface region that has been depleted of silicon relative to a remainder of the steel.
Abstract: Described herein are methods and related apparatus for the allocation of computing resources to perform computing tasks. The methods described herein may be used to allocate computing tasks to many different types of computing resources, such as processor cores, individual computers, and virtual machines. Characteristics of the available computing resources, as well as other aspects of the computing environment, are modeled in a multidimensional coordinate system. Each coordinate point in the coordinate system corresponds to a unique combination of attributes of the computing resources/computing environment, and each coordinate point is associated with a weight that indicates the relative desirability of the coordinate point. To allocate a computing resource to execute a task, the weights of the coordinate points, as well as other related factors, are analyzed.
Abstract: In one aspect, there is provided a video decoder including a first write port to write uncompressed video data to a first buffer in a first format adapted based on a format required by the video decoder. The video decoder also includes a second write port to write uncompressed video data to a second buffer in a second format adapted to provide the uncompressed video data for subsequent processing external to the video decoder.
Type:
Grant
Filed:
March 23, 2007
Date of Patent:
September 17, 2013
Assignee:
ATI Technologies ULC
Inventors:
Greg Sadowksi, Wai Ki Lo, Haibo Liu, Stephen Edward Smith
Abstract: A manually actuated robotic tool changer includes a rapid coupling mechanism. The tool changer includes a master unit having a piston moveable along its axis between an unlocked position and a fully locked position, and a tool unit that is coupled to the master unit when the units are adjacent and the piston is moved to the fully locked position. A piston movement control mechanism selectively allows free axial motion of the piston between the unlocked position and a nearly locked position, when the control mechanism is actuated. The piston may be biased toward the locked position, allowing the master and tool units to be rapidly coupled by positioning the units adjacently, actuating the control mechanism to allow the piston to rapidly advance to a nearly locked position, releasing the control mechanism to restrict free axial motion of the piston, and manually advancing the piston to a fully locked position.
Abstract: An image noise filter includes a wavelet transform module and an edge based adaptive filter module. The dual tree wavelet transform module provides low frequency wavelet information and high frequency wavelet information in response to image information. The edge based adaptive filter module provides filtered high frequency wavelet information in response to the high frequency wavelet information and edge information that is based on the low frequency wavelet information.
Type:
Grant
Filed:
November 13, 2009
Date of Patent:
September 17, 2013
Assignee:
ATI Technologies ULC
Inventors:
Radu Gheorghe, Milivoje Aleksic, Sergio Goma
Abstract: A method and apparatus that partitions a single display's viewable area into at least two virtual viewable areas, and emulates the at least two virtual viewable areas as at least two emulated physical displays with an operating system such that the operating system behaves as if interfacing with at least two actual independent physical displays. The method provides the operating system with generated display identification data (such as “EDID”) for each of the emulated physical displays in response to a query from the operating system. The method and apparatus also receive notification of an interrupt (where the interrupt corresponds to the single physical display), and reports to the operating system with at least two sets of interrupt reporting information, corresponding to the at least two emulated physical displays, as if two interrupts were received. The operating system is thereby “faked” into acting as if two physical displays are in operation.
Abstract: In one example an electronic device includes a housing that includes an A/C input or DC input, and at least one circuit substrate that includes electronic circuitry, such as graphics processing circuitry that receives power based on the A/C input or DC input. The electronic device also includes a divided multi-connector element differential bus connector that is coupled to the electronic circuitry. The divided multi-connector element differential bus connector includes a single housing that connects with the circuit substrate and the connector housing includes therein a divided electronic contact configuration comprised of a first group of electrical contacts divided from an adjacent second group of mirrored electrical contacts wherein each group of electrical connects includes a row of at least lower and upper contacts.
Type:
Grant
Filed:
February 23, 2012
Date of Patent:
September 17, 2013
Assignee:
ATI Technologies ULC
Inventors:
James D. Hunkins, Lawrence J. King, Raja Koduri, III
Abstract: The present disclosure relates to a method and system for mapping cache lines to a row-based cache. In particular, a method includes, in response to a plurality of memory access requests each including an address associated with a cache line of a main memory, mapping sequentially addressed cache lines of the main memory to a row of the row-based cache. A disclosed system includes row index computation logic operative to map sequentially addressed cache lines of a main memory to a row of a row-based cache in response to a plurality of memory access requests each including an address associated with a cache line of the main memory.
Abstract: A method and device are provided for performing tile based rendering. The method and device analyze past and current commands to determine when tiles are renderable independently of other tiles. In such cases, all rendering passes are performed successively without rendering other tiles in between.
Abstract: An aspect of the present disclosure is directed to low-alloy steels exhibiting high hardness and an advantageous level of multi-hit ballistic resistance with low or no crack propagation imparting a level of ballistic performance suitable for military armor applications. Various embodiments of the steels according to the present disclosure have hardness in excess of 550 BHN and demonstrate a high level of ballistic penetration resistance relative to conventional military specifications.
Type:
Application
Filed:
April 19, 2013
Publication date:
September 12, 2013
Applicant:
ATI Properties, Inc.
Inventors:
Ronald E. Bailey, Thomas R. Parayil, Glenn J. Swiatek
Abstract: A computing device may be configured to output a digital audio stream to an audio playback system for rendering as sound over speakers. The sound may be sampled. Based at least in part on a quality of the sampled sound, the data rate of the digital audio stream may be reduced by reducing a sampling rate and/or by reducing a number of bits per sample. A reduced sampling rate may be determined based on a computed maximum sampling rate of the audio playback system, and/or a reduced number of bits per sample may be determined based on a computed maximum number of bits per sample of the audio playback system. The maximum usable sampling rate and maximum usable number of bits per sample may be determined based on an upper usable frequency within a frequency spectrum of the sampled sound.
Abstract: The systems and methods include multiple processors that each couple to receive commands and data, where the commands and/or data correspond to frames of video that include multiple pixels. Additionally, an interlink module is coupled to receive processed data corresponding to the frames from each of the multiple processors. The interlink module selects pixels of the frames from the processed data of one of the processors based on a predetermined pixel characteristic and outputs the frames that include the selected pixels.
Abstract: A method of reducing the formation of electrically resistive scale on a an article comprising a silicon-containing ferritic stainless subjected to oxidizing conditions in service includes, prior to placing the article in service, subjecting the article to conditions under which silica, which includes silicon derived from the steel, forms on a surface of the steel. Optionally, at least a portion of the silica is removed from the surface to placing the article in service. A ferritic stainless steel alloy having a reduced tendency to form silica on at least a surface thereof also is provided. The steel includes a near-surface region that has been depleted of silicon relative to a remainder of the steel.
Abstract: A method, computer program product, and system that includes a virtual function module with an emulated display timing device, a first independent resource, and a second independent resource, where the first and second independent resources signal a physical function module that a new surface has been rendered, and where the physical function module signals the virtual function module via the emulated timing device and the first and second independent resources when the rendered new surface has been displayed, copied, used, or otherwise consumed.
Abstract: Embodiments of the present invention are directed to a method and apparatus for block based image compression with multiple non-uniform block encodings. In one embodiment, an image is divided into blocks of pixels. In one embodiment the blocks are four pixels by four pixels, but other block sizes are used in other embodiments. In one embodiment, a block of pixels in the original image is compressed using two different methods to produce a first and second compressed block. Thus, each block in the original image is represented by two, typically different, compressed blocks. In one embodiment, color associated with a pixel is determined by combining the compressed information about the pixel in the first compressed block with information about the pixel in the second compressed block. In another embodiment, global information about the image is combined with the information in the first and second compressed blocks.
Type:
Grant
Filed:
September 16, 2011
Date of Patent:
August 27, 2013
Assignee:
ATI Technologies ULC
Inventors:
Konstantine Iourcha, Andrew S. C. Pomianowski, Raja Koduri