Patents by Inventor Yan Yan Tang

Yan Yan Tang has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Determining a working set of texture maps

Patent number: 9013498

Abstract: A system and method for tracking and reporting texture map levels of detail that are computed during graphics processing allows for efficient management of texture map storage. Minimum and/or maximum pre-clamped texture map levels of detail values are tracked by a graphics processor and an array stored in memory is updated to report the minimum and/or maximum values for use by an application program. The minimum and/or maximum values may be used to determine the active set of texture map levels of detail that is loaded into graphics memory.

Type: Grant

Filed: December 19, 2008

Date of Patent: April 21, 2015

Assignee: NVIDIA Corporation

Inventors: John S. Montrym, Andrew J. Tao, Henry P. Moreton, Emmett M. Kilgariff, Cass W. Everitt, Alexander L. Minkin, Eric Anderson, Yan Yan Tang, Jerome F. Duluk, Jr.
Address mapping for a parallel thread processor

Patent number: 8700877

Abstract: A method for thread address mapping in a parallel thread processor. The method includes receiving a thread address associated with a first thread in a thread group; computing an effective address based on a location of the thread address within a local window of a thread address space; computing a thread group address in an address space associated with the thread group based on the effective address and a thread identifier associated with a first thread; and computing a virtual address associated with the first thread based on the thread group address and a thread group identifier, where the virtual address is used to access a location in a memory associated with the thread address to load or store data.

Type: Grant

Filed: September 24, 2010

Date of Patent: April 15, 2014

Assignee: Nvidia Corporation

Inventors: Michael C. Shebanow, Yan Yan Tang, John R. Nickolls
Register indexed sampler for texture opcodes

Patent number: 8624910

Abstract: One embodiment of the present invention sets forth a technique for dynamically specifying a texture header and texture sampler using an index. The index corresponds to a particular register value that may be static or computed during execution of a shader program. Any texture operation instruction may specify an index value for each of the texture header and the texture sampler.

Type: Grant

Filed: August 25, 2010

Date of Patent: January 7, 2014

Assignee: Nvidia Corporation

Inventors: John Erik Lindholm, Yan Yan Tang
Reordering operands assigned to each one of read request ports concurrently accessing multibank register file to avoid bank conflict

Patent number: 8533435

Abstract: One embodiment of the present invention sets forth a technique for collecting operands specified by an instruction. As a sequence of instructions is received the operands specified by the instructions are assigned to ports, so that each one of the operands specified by a single instruction is assigned to a different port. Reading of the operands from a multi-bank register file is scheduled by selecting an operand from each one of the different ports to produce an operand read request and ensuring that two or more of the selected operands are not stored in the same bank of the multi-bank register file. The operands specified by the operand read request are read from the multi-bank register file in a single clock cycle. Each instruction is then executed as the operands specified by the instruction are read from the multi-bank register file and collected over one or more clock cycles.

Type: Grant

Filed: September 3, 2010

Date of Patent: September 10, 2013

Assignee: NVIDIA Corporation

Inventors: Xiaogang Qiu, Ming Y. Siu, Yan Yan Tang, John Erik Lindholm, Michael C. Shebanow, Stuart F. Oberman
ECC bits used as additional register file storage

Patent number: 8321761

Abstract: A memory module includes a plurality of register files. Each register file is associated with a set of error-correcting code (ECC) bits and ECC check/correct logic that can provide error-correcting functionality, if required. When error-correcting functionality is not required, ECC bits are grouped together to form additional register files, thereby providing additional storage space.

Type: Grant

Filed: September 28, 2009

Date of Patent: November 27, 2012

Assignee: NVIDIA Corporation

Inventors: Fred Gruner, Xiaogang Qiu, Yan Yan Tang
Cache interface protocol including arbitration and hints

Patent number: 8266382

Abstract: One embodiment of the present invention sets forth a technique for arbitrating requests received from one of the multiple clients of an L1 cache and for providing hints to the client to assist in arbitration. The L1 cache services multiple clients with diverse latency and bandwidth requirements and may be reconfigured to provide memory spaces for clients executing multiple parallel threads, where the memory spaces each have a different scope.

Type: Grant

Filed: December 30, 2009

Date of Patent: September 11, 2012

Assignee: NVIDIA Corporation

Inventors: Alexander L. Minkin, Steven J. Heinrich, Rajeshwaran Selvanesan, Charles McCarver, Stewart Glenn Carlton, Anjana Rajendran, Yan Yan Tang
Cache miss processing using a defer/replay mechanism

Patent number: 8266383

Abstract: One embodiment of the present invention sets forth a technique for processing cache misses resulting from a request received from one of the multiple clients of an L1 cache. The L1 cache services multiple clients with diverse latency and bandwidth requirements, including at least one client whose requests cannot be stalled. The L1 cache includes storage to buffer pending requests for caches misses. When an entry is available to store a pending request, a request causing a cache miss is accepted. When the data for a read request becomes available, the cache instructs the client to resubmit the read request to receive the data. When an entry is not available to store a pending request, a request causing a cache miss is deferred and the cache provides the client with status information that is used to determine when the request should be resubmitted.

Type: Grant

Filed: December 30, 2009

Date of Patent: September 11, 2012

Assignee: NVIDIA Corporation

Inventors: Alexander L. Minkin, Steven J. Heinrich, Rajeshwaran Selvanesan, Charles McCarver, Stewart Glenn Carlton, Ming Y. Siu, Yan Yan Tang, Robert J. Stoll
Buffering unit to support graphics processing operations

Patent number: 8139071

Abstract: An apparatus and method for buffering graphics data are described. In one embodiment, a graphics processing apparatus includes a storage unit and a reorder control unit that is connected to the storage unit. The reorder control unit is configured to coordinate storage of vertex attributes in the storage unit so as to convert the vertex attributes from an initial order to a modified order. The reorder control unit is configured to identify a subset of the vertex attributes to be stored within a common range of addresses in the storage unit, and the reorder control unit is configured to access the storage unit such that the subset of the vertex attributes is written into the storage unit substantially in parallel.

Type: Grant

Filed: November 2, 2006

Date of Patent: March 20, 2012

Assignee: Nvidia Corporation

Inventors: Andrew J. Tao, Vimal S. Parikh, Yan Yan Tang
System and method for graphics attribute packing for pixel shader usage

Patent number: 8134570

Abstract: A system, method and computer program product are provided for packing graphics attributes. In use, a plurality of graphics attributes is identified. Such graphics attributes are packed, such that the packed graphics attributes are capable of being processed utilizing a pixel shader.

Type: Grant

Filed: September 18, 2006

Date of Patent: March 13, 2012

Assignee: NVIDIA Corporation

Inventors: Jerome F. Duluk, Jr., Andrew J. Tao, Roger L. Allen, Svetoslav D. Tzvetkov, Yan Yan Tang, Elena M. Ing
Buffering unit to support graphics processing operations

Patent number: 7999817

Abstract: An apparatus and method for buffering graphics data are described. In one embodiment, a graphics processing apparatus includes a memory and a buffering unit that is connected to the memory. The buffering unit is configured to buffer vertex attributes en route to the memory. The buffering unit is configured to coalesce a subset of the vertex attributes to be stored within a common range of addresses in the memory, and the buffering unit is configured to issue a single write request to the memory on behalf of the subset of the vertex attributes.

Type: Grant

Filed: November 2, 2006

Date of Patent: August 16, 2011

Assignee: NVIDIA Corporation

Inventors: Andrew J. Tao, Vimal S. Parikh, Yan Yan Tang
Address Mapping for a Parallel Thread Processor

Publication number: 20110078689

Abstract: A method for thread address mapping in a parallel thread processor. The method includes receiving a thread address associated with a first thread in a thread group; computing an effective address based on a location of the thread address within a local window of a thread address space; computing a thread group address in an address space associated with the thread group based on the effective address and a thread identifier associated with a first thread; and computing a virtual address associated with the first thread based on the thread group address and a thread group identifier, where the virtual address is used to access a location in a memory associated with the thread address to load or store data.

Type: Application

Filed: September 24, 2010

Publication date: March 31, 2011

Inventors: Michael C. SHEBANOW, Yan Yan Tang, John R. Nickolls
REGISTER INDEXED SAMPLER FOR TEXTURE OPCODES

Publication number: 20110069076

Abstract: One embodiment of the present invention sets forth a technique for dynamically specifying a texture header and texture sampler using an index. The index corresponds to a particular register value that may be static or computed during execution of a shader program. Any texture operation instruction may specify an index value for each of the texture header and the texture sampler.

Type: Application

Filed: August 25, 2010

Publication date: March 24, 2011

Inventors: John Erik LINDHOLM, Yan Yan Tang
Unified Collector Structure for Multi-Bank Register File

Publication number: 20110072243

Abstract: One embodiment of the present invention sets forth a technique for collecting operands specified by an instruction. As a sequence of instructions is received the operands specified by the instructions are assigned to ports, so that each one of the operands specified by a single instruction is assigned to a different port. Reading of the operands from a multi-bank register file is scheduled by selecting an operand from each one of the different ports to produce an operand read request and ensuring that two or more of the selected operands are not stored in the same bank of the multi-bank register file. The operands specified by the operand read request are read from the multi-bank register file in a single clock cycle. Each instruction is then executed as the operands specified by the instruction are read from the multi-bank register file and collected over one or more clock cycles.

Type: Application

Filed: September 3, 2010

Publication date: March 24, 2011

Inventors: Xiaogang Qiu, Ming Y. Siu, Yan Yan Tang, John Erik Lindholm, Michael C. Shebanow, Stuart F. Oberman
Active block write-back from SRAM cache to DRAM

Patent number: 7027064

Abstract: An external cache management unit for use with 3D-RAM and suitable for use in a computer graphics system is described. The unit maintains and tracks the status of level one cache memory in the 3D-RAM. The unit identifies dirty blocks of cache memory and prioritizes block cleansing based on a least used algorithm. Periodic block cleansing during empty memory cycles is provided for, and may also be prompted on demand.

Type: Grant

Filed: February 28, 2002

Date of Patent: April 11, 2006

Assignee: Sun Microsystems, Inc.

Inventors: Michael G. Lavelle, Ewa M. Kubalska, Yan Yan Tang
Parallel box filtering through reuse of existing circular filter

Patent number: 6927775

Abstract: A sample filtering system and method for concurrently filtering sample data for two or more sequential pixels (in a scan-line) are disclosed. The system may include a sample cache, a control register, a read cache controller, and a sample-to-pixel calculation unit. The read cache controller reads a first set of S samples from the sample cache, and outputs a second set of S samples to the sample-to-pixel calculation unit. The second set of samples may have one or more subsets of samples, with each subset of samples selected to cover the filter region for one of the sequential pixels. The sample-to-pixel calculation unit may process each subset separately and concurrently.

Type: Grant

Filed: March 3, 2003

Date of Patent: August 9, 2005

Assignee: Sun Microsystems, Inc.

Inventors: Michael W. Schimpf, Yan Yan Tang
Multiple scan line sample filtering

Patent number: 6914609

Abstract: A system and method for generating pixels for a display device. The system may include a sample buffer for storing a plurality samples in a memory, a sample cache for caching recently accessed samples, and a sample filter unit for filtering one or more samples to generate a pixel. The generated pixels may then be stored in a frame buffer or provided to a display device. The method operates to take advantage of the common samples shared by neighboring pixels in both the x and y directions for reduced sample buffer accesses and improved performance. The method involves reading samples from the memory that correspond to pixels in a plurality of neighboring scan lines, and possibly also to multiple pixels in each of these scan lines. The samples may be stored in a cache memory and then accessed from the cache memory for filtering. The method maximizes use of the common samples shared by neighboring pixels in both the x and y directions.

Type: Grant

Filed: February 28, 2002

Date of Patent: July 5, 2005

Assignee: Sun Microsystems, Inc.

Inventors: Yan Yan Tang, Wayne Eric Burk, Philip C. Leung
Opcode to turn around a bi-directional bus

Patent number: 6895458

Abstract: A system for managing the control of a bi-directional data bus between a master unit and a slave unit. The master couples to the slave through a request opcode bus, a reply opcode bus and the data bus. If the master is in a bus driving state (with respect to the data bus) and receives a read request, the master relinquishes bus control and sends a read request through the request opcode bus. The slave unit assumes bus control and sends the requested data through the data bus. If the master is in a bus sensing state and receives a write request, the master sends a last read opcode to the slave via the request opcode bus, and waits for the slave to return a special token through the reply opcode bus. Upon receiving the special token the master unit assumes bus control and performs the write transaction.

Type: Grant

Filed: March 4, 2002

Date of Patent: May 17, 2005

Assignee: Sun Microsystems, Inc.

Inventors: Ewa M. Kubalska, Lisa Grenier, Yan Yan Tang, Elena M. Ing
Frame buffer organization and reordering

Patent number: 6833834

Abstract: A graphics system includes a frame buffer, a write address generator, and a pixel buffer. A burst of pixels received from the frame buffer may not be in display order. In one embodiment, a write address generator calculates a write address for each pixel in the burst of pixels output from the frame buffer. The write address corresponds to a relative display order within the burst for each respective pixel. Each pixel in the burst is stored to its write address in the pixel buffer. This way, the pixels in the burst are stored in display order within the pixel buffer.

Type: Grant

Filed: December 12, 2001

Date of Patent: December 21, 2004

Assignee: Sun Microsystems, Inc.

Inventors: Michael A. Wasserman, Michael G. Lavelle, David C. Kehlet, Yan Yan Tang, Ewa M. Kubalska
Graphics pixel packing for improved fill rate performance

Patent number: 6831653

Abstract: A system and method for packing pixels together to provide a increased fill rate in a frame buffer hardware in the graphics system. The graphics system may be configured to receive and rasterize graphics data at a faster cycle rate than the system's frame buffer memory fill rate. The output from the rasterization hardware may be stored in a FIFO memory that is configured to selectively shift pixels in order to improve fill rate performance. The FIFO memory may be configured to ensure that the pixels meet certain criteria in order to prevent page faults and interleave conflicts that could reduce the fill rate. The FIFO memory may also be configured to remove empty cycles that occur as a result of the pixel packing.

Type: Grant

Filed: July 31, 2001

Date of Patent: December 14, 2004

Assignee: Sun Microsystems, Inc.

Inventors: David Kehlet, Nandini Ramani, Yan Yan Tang, Roger W. Swanson
System and method for prefetching data from a frame buffer

Patent number: 6812929

Abstract: A graphics system may include a frame buffer that includes several sets of one or more memory banks and a cache. The frame buffer may load data from one of the memory banks into the cache in response to receiving a cache fill request. Each set of memory banks is accessible independently of each other set of memory banks. A frame buffer interface coupled to the frame buffer includes a plurality of cache fill request queues. Each cache fill request queue is configured to store one or more cache fill requests targeting a corresponding one of the sets of memory banks. The frame buffer interface is configured to select a cache fill request from one of the cache fill request queues that stores cache fill requests targeting a set of memory banks that is not currently being accessed and to provide the selected cache fill request to the frame buffer.

Type: Grant

Filed: March 11, 2002

Date of Patent: November 2, 2004

Assignee: Sun Microsystems, Inc.

Inventors: Michael G. Lavelle, Ewa M. Kubalska, Yan Yan Tang

1 2 next