Patents by Inventor Guofang Jiao

Guofang Jiao has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Programmable streaming processor with mixed precision instruction execution

Patent number: 8633936

Abstract: The disclosure relates to a programmable streaming processor that is capable of executing mixed-precision (e.g., full-precision, half-precision) instructions using different execution units. The various execution units are each capable of using graphics data to execute instructions at a particular precision level. An exemplary programmable shader processor includes a controller and multiple execution units. The controller is configured to receive an instruction for execution and to receive an indication of a data precision for execution of the instruction. The controller is also configured to receive a separate conversion instruction that, when executed, converts graphics data associated with the instruction to the indicated data precision. When operable, the controller selects one of the execution units based on the indicated data precision.

Type: Grant

Filed: April 21, 2008

Date of Patent: January 21, 2014

Assignee: QUALCOMM Incorporated

Inventors: Yun Du, Chun Yu, Guofang Jiao, Stephen Molloy
Shader compile system and method

Patent number: 8495602

Abstract: The present disclosure includes a shader compiler system and method. In an embodiment, a shader compiler includes a decoder to translate an instruction having a vector representation to a unified instruction representation. The shader compiler also includes an encoder to translate an instruction having a unified instruction representation to a processor executable instruction.

Type: Grant

Filed: September 28, 2007

Date of Patent: July 23, 2013

Assignee: QUALCOMM Incorporated

Inventors: Lin Chen, Guofang Jiao, Chihong Zhang, Junhong Sun
Demand based power control in a graphics processing unit

Patent number: 8458497

Abstract: Disclosed herein is power controller for use with a graphics processing unit. The power controller monitors, manages and controls power supplied to components of a pipeline of the graphics processing unit. The power controller determining whether and to what extent power is to be supplied to a pipeline component based on status information received by the power controller in connection with the pipeline component. The power controller is capable of identifying a trend using the received status information, and determining whether and to what extent power is to be supplied to a pipeline component based on the identified trend.

Type: Grant

Filed: October 11, 2007

Date of Patent: June 4, 2013

Assignee: QUALCOMM Incorporated

Inventors: Yun Du, Chun Yu, Guofang Jiao, Stephen Molloy
Graphics processing unit with deferred vertex shading

Patent number: 8436854

Abstract: Techniques are described for processing graphics images with a graphics processing unit (GPU) using deferred vertex shading. An example method includes the following: generating, within a processing pipeline of a graphics processing unit (GPU), vertex coordinates for vertices of each primitive within an image geometry, wherein the vertex coordinates comprise a location and a perspective parameter for each one of the vertices, and wherein the image geometry represents a graphics image; identifying, within the processing pipeline of the GPU, visible primitives within the image geometry based upon the vertex coordinates; and, responsive to identifying the visible primitives, generating, within the processing pipeline of the GPU, vertex attributes only for the vertices of the visible primitives in order to determine surface properties of the graphics image.

Type: Grant

Filed: September 10, 2009

Date of Patent: May 7, 2013

Assignee: QUALCOMM Incorporated

Inventors: Guofang Jiao, Yun Du, Lingjun Chen, Chun Yu
Processor with adaptive multi-shader

Patent number: 8421794

Abstract: The disclosure describes an adaptive multi-shader within a processor that uses one or more high-precision arithmetic logic units (ALUs) and low-precision ALUs to process data based on the type of the data. Upon receiving a stream of data, the adaptive multi-shader first determines the type of the data. For example, the adaptive multi-shader may determine whether the data is suitable for high-precision processing or low-precision processing. The adaptive multi-shader then processes the data using the high-precision ALUs when the data is suitable for high-precision processing, and processes the data using the high-precision ALUs and the low-precision ALUs when the data is suitable for low-precision processing. The adaptive multi-shader may substantially reduce power consumption and silicon size of the processor by implementing the low-precision ALUs while maintaining the ability to process data using high-precision processing by implementing the high-precision ALUs.

Type: Grant

Filed: March 23, 2007

Date of Patent: April 16, 2013

Assignee: QUALCOMM Incorporated

Inventors: Yun Du, Guofang Jiao, Chun Yu
TECHNIQUES FOR HANDLING DIVERGENT THREADS IN A MULTI-THREADED PROCESSING SYSTEM

Publication number: 20130061027

Abstract: This disclosure describes techniques for handling divergent thread conditions in a multi-threaded processing system. In some examples, a control flow unit may obtain a control flow instruction identified by a program counter value stored in a program counter register. The control flow instruction may include a target value indicative of a target program counter value for the control flow instruction. The control flow unit may select one of the target program counter value and a minimum resume counter value as a value to load into the program counter register. The minimum resume counter value may be indicative of a smallest resume counter value from a set of one or more resume counter values associated with one or more inactive threads. Each of the one or more resume counter values may be indicative of a program counter value at which a respective inactive thread should be activated.

Type: Application

Filed: September 7, 2011

Publication date: March 7, 2013

Applicant: QUALCOMM Incorporated

Inventors: Lin Chen, David Rigel Garcia Garcia, Andrew E. Gruber, Guofang Jiao
Supplemental cache in a graphics processing unit, and apparatus and method thereof

Patent number: 8384728

Abstract: Disclosed herein is a supplemental cache for use with a graphics processing unit. The supplemental cache can be used to supplement a vertex cache used with a graphics processing unit. The supplemental cache stores vertex values generated in assembling primitives from vertices provided to the graphics processing unit as part of an image geometry. Generated vertex values associated with a vertex determined to be shared by two or more primitives can be retrieved from the supplemental cache, so as to reduce the need to perform duplicative operations to generate vertex values for shared vertices.

Type: Grant

Filed: September 14, 2007

Date of Patent: February 26, 2013

Assignee: QUALCOMM Incorporated

Inventors: Guofang Jiao, Jian Liang, Chun Yu
System and method of mapping shader variables into physical registers

Patent number: 8379032

Abstract: The present disclosure includes system and method of mapping shader variables into physical registers. In an embodiment, a graphics processing unit (GPU) and a memory coupled to the GPU are disclosed. The memory includes a processor readable data file that has a register file portion. The register file portion has a rectangular structure including a plurality of data items. At least two of the plurality of data items corresponding to data elements of a shader program. The data elements have different data storage types.

Type: Grant

Filed: September 28, 2007

Date of Patent: February 19, 2013

Assignee: QUALCOMM Incorporated

Inventors: Lin Chen, Junhong Sun, Guofang Jiao, Chihong Zhang, Lingjun Chen
Server-based code compilation

Patent number: 8365153

Abstract: A server is disclosed that includes an interface to a data communication network, a compiler library that stores a plurality of different compilers, and compiler selection logic responsive to data received at the interface and including logic. The compiler selection logic is configured to select one of the plurality of different compilers based on an evaluation of the received data. The selected compiler generates compiled output data and the compiled output data is communicated over the data communication network to a client.

Type: Grant

Filed: October 26, 2007

Date of Patent: January 29, 2013

Assignee: QUALCOMM Incorporated

Inventors: Lingjun Chen, Guofang Jiao, Yun Du, Chun Yu
Scheme for varying packing and linking in graphics systems

Patent number: 8355028

Abstract: A wireless device which performs a first-level compiler packing process and a second-level hardware packing process on varyings. The compiler packing process packs two or more shader variables (varyings or attributes) whose sum of components equals M into a shared M-dimensional (MD) vector register. The hardware packing consecutively packs M components of the shader variables (varyings or attributes) and any remaining variables into a vertex cache or other storage medium.

Type: Grant

Filed: July 30, 2007

Date of Patent: January 15, 2013

Assignee: QUALCOMM Incorporated

Inventors: Guofang Jiao, Alexei V. Bourd, Chun Yu, Lingjun Chen, Yun Du
Graphics processors with parallel scheduling and execution of threads

Patent number: 8345053

Abstract: A graphics processor capable of parallel scheduling and execution of multiple threads, and techniques for achieving parallel scheduling and execution, are described. The graphics processor may include multiple hardware units and a scheduler. The hardware units are operable in parallel, with each hardware unit supporting a respective set of operations. The hardware units may include an ALU core, an elementary function core, a logic core, a texture sampler, a load control unit, some other hardware unit, or a combination thereof. The scheduler dispatches instructions for multiple threads to the hardware units concurrently. The graphics processor may further include an instruction cache to store instructions for threads and register banks to store data. The instruction cache and register banks may be shared by the hardware units.

Type: Grant

Filed: September 21, 2006

Date of Patent: January 1, 2013

Assignee: QUALCOMM Incorporated

Inventors: Guofang Jiao, Yun Du, Chun Yu
Fragment shader bypass in a graphics processing unit, and apparatus and method thereof

Patent number: 8325184

Abstract: Configuration information is used to make a determination to bypass fragment shading by a shader unit of a graphics processing unit, the shader unit capable of performing both vertex shading and fragment shader. Based on the determination, the shader unit performs vertex shading and bypasses fragment shading. A processing element other than the shader unit, such as a pixel blender, can be used to perform some fragment shading. Power is managed to “turn off” power to unused components in a case that fragment shading is bypassed. For example, power can be turned off to a number of arithmetic logic units, the shader unit using the reduced number of arithmetic logic unit to perform vertex shading. At least one register bank of the shader unit can be used as a FIFO buffer storing pixel attribute data for use, with texture data, to fragment shading operations by another processing element.

Type: Grant

Filed: September 14, 2007

Date of Patent: December 4, 2012

Assignee: QUALCOMM Incorporated

Inventors: Guofang Jiao, Yun Du, Chun Yu
PIXEL VALUE COMPACTION FOR GRAPHICS PROCESSING

Publication number: 20120262493

Abstract: In general, aspects of this disclosure describe example techniques for efficient usage of the fixed data rate processing of a graphics processing unit (GPU) for a variable data rate processing. For example, the GPU may be coupled to a pixel value processing unit that receives pixel values for pixels in an image processed by the GPU. The pixel value processing unit may determine whether the pixel values are for pixels that require further processing, and store the pixel values for the pixels that are required for further processing in a buffer.

Type: Application

Filed: April 13, 2011

Publication date: October 18, 2012

Inventors: Ming-Chang Tsai, Guofang Jiao
Dependent instruction thread scheduling

Patent number: 8291431

Abstract: A thread scheduler includes context units for managing the execution of threads where each context unit includes a load reference counter for maintaining a counter value indicative of a difference between a number of data requests and a number of data returns associated with the particular context unit. A context controller of the thread context unit is configured to refrain from forwarding an instruction of a thread when the counter value is nonzero and the instruction includes a data dependency indicator indicating the instruction requires data returned by a previous instruction.

Type: Grant

Filed: August 29, 2006

Date of Patent: October 16, 2012

Assignee: QUALCOMM Incorporated

Inventors: Yun Du, Guofang Jiao, Chun Yu
3-D CLIPPING IN A GRAPHICS PROCESSING UNIT

Publication number: 20120256921

Abstract: A graphics processing unit (GPU) efficiently performs 3-dimensional (3-D) clipping using processing units used for other graphics functions. The GPU includes first and second hardware units and at least one buffer. The first hardware unit performs 3-D clipping of primitives using a first processing unit used for a first graphics function, e.g., an ALU used for triangle setup, depth gradient setup, etc. The first hardware unit may perform 3-D clipping by (a) computing clip codes for each vertex of each primitive, (b) determining whether to pass, discard or clip each primitive based on the clip codes for all vertices of the primitive, and (c) clipping each primitive to be clipped against clipping planes. The second hardware unit computes attribute component values for new vertices resulting from the 3-D clipping, e.g., using an ALU used for attribute gradient setup, attribute interpolation, etc. The buffer(s) store intermediate results of the 3-D clipping.

Type: Application

Filed: June 15, 2012

Publication date: October 11, 2012

Applicant: QUALCOMM Incorporated

Inventors: Guofang Jiao, Chun Yu, Lingjun Chen, Yun Du
Discarding of vertex points during two-dimensional graphics rendering using three-dimensional graphics hardware

Patent number: 8269775

Abstract: This disclosure describes techniques for removing vertex points during two-dimensional (2D) graphics rendering using three-dimensional (3D) graphics hardware. In accordance with the described techniques one or more vertex points may be removed during 2D graphics rendering using 3D graphics hardware. For example, the techniques may remove redundant vertex points in the display coordinate space by discarding vertex points that have the substantially same positional coordinates in the display coordinate space as a previous vertex point. Alternatively or additionally, the techniques may remove excess vertex points that lie in a straight line. Removing the redundant vertex points or vertex points that lie in a straight line allow for more efficient utilization of the hardware resources of the GPU and increase the speed at which the GPU renders the image for display.

Type: Grant

Filed: December 9, 2008

Date of Patent: September 18, 2012

Assignee: QUALCOMM Incorporated

Inventors: Alexei V. Bourd, Guofang Jiao, Jay C. Yun
Efficient scissoring for graphics application

Patent number: 8269792

Abstract: Scissoring for any number of scissoring regions is performed in a sequential order by drawing one scissoring region at a time on a drawing surface and updating scissor values for pixels within each scissoring region. A scissor value for a pixel may indicate the number of scissoring regions covering the pixel and may be incremented for each scissoring region covering the pixel. A scissor value for a pixel may also be a bitmap, and a bit for a scissoring region may be set to one if the pixel is within the scissoring region. Pixels within a region of interest are passed and rendered, and pixels outside of the region are discarded. This region may be defined by a reference value, which may be set to (a) one for the union of all scissoring regions, for a scissoring UNION operation, or (b) larger than one for the intersection of multiple (e.g., all) scissoring regions, for a scissoring AND operation.

Type: Grant

Filed: November 21, 2006

Date of Patent: September 18, 2012

Assignee: QUALCOMM Incorporated

Inventors: Guofang Jiao, Angus M. Dorbie
3-D clipping in a graphics processing unit

Patent number: 8212840

Abstract: A graphics processing unit (GPU) efficiently performs 3-dimensional (3-D) clipping using processing units used for other graphics functions. The GPU includes first and second hardware units and at least one buffer. The first hardware unit performs 3-D clipping of primitives using a first processing unit used for a first graphics function, e.g., an ALU used for triangle setup, depth gradient setup, etc. The first hardware unit may perform 3-D clipping by (a) computing clip codes for each vertex of each primitive, (b) determining whether to pass, discard or clip each primitive based on the clip codes for all vertices of the primitive, and (c) clipping each primitive to be clipped against clipping planes. The second hardware unit computes attribute component values for new vertices resulting from the 3-D clipping, e.g., using an ALU used for attribute gradient setup, attribute interpolation, etc. The buffer(s) store intermediate results of the 3-D clipping.

Type: Grant

Filed: October 23, 2006

Date of Patent: July 3, 2012

Assignee: QUALCOMM Incorporated

Inventors: Guofang Jiao, Chun Yu, Lingjun Chen, Yun Du
Efficient 2-D and 3-D graphics processing

Patent number: 8203564

Abstract: Techniques for supporting both 2-D and 3-D graphics are described. A graphics processing unit (GPU) may perform 3-D graphics processing in accordance with a 3-D graphics pipeline to render 3-D images and may also perform 2-D graphics processing in accordance with a 2-D graphics pipeline to render 2-D images. Each stage of the 2-D graphics pipeline may be mapped to at least one stage of the 3-D graphics pipeline. For example, a clipping, masking and scissoring stage in 2-D graphics may be mapped to a depth test stage in 3-D graphics. Coverage values for pixels within paths in 2-D graphics may be determined using rasterization and depth test stages in 3-D graphics. A paint generation stage and an image interpolation stage in 2-D graphics may be mapped to a fragment shader stage in 3-D graphics. A blending stage in 2-D graphics may be mapped to a blending stage in 3-D graphics.

Type: Grant

Filed: February 16, 2007

Date of Patent: June 19, 2012

Assignee: QUALCOMM Incorporated

Inventors: Guofang Jiao, Angus M. Dorbie, Yun Du, Chun Yu, Jay C. Yun
Multi-media processor cache with cache line locking and unlocking

Patent number: 8200917

Abstract: The disclosure relates to techniques for locking and unlocking cache lines in a cache included within a multi-media processor that performs read-modify-write functions using batch read and write requests for data stored in either an external memory or an embedded memory. The techniques may comprise receiving a read request in a batch of read requests for data included in a section of a cache line and setting a lock bit associated with the section in response to the read request. When the lock bit is set, additional read requests in the batch of read requests are unable to access data in that section of the cache line. The lock bit may be unset in response to a write request in a batch of write requests to update the data previously read out from that section of the cache line.

Type: Grant

Filed: September 26, 2007

Date of Patent: June 12, 2012

Assignee: QUALCOMM Incorporated

Inventors: Chun Yu, Guofang Jiao, Jian Wei

prev 1 2 3 4 5 6 7 next