Patents by Inventor Derek J. Lentz

Derek J. Lentz has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

PRECISION MODULATED SHADING

Publication number: 20210358191

Abstract: A GPU is disclosed, which may include a VRS interface to provide spatial information and/or primitive-specific information. The GPU may include one or more shader cores including a control logic section to determine a shading precision value based on the spatial information and/or the primitive-specific information. The control logic section may modulate a shading precision according to the shading precision value. A method for controlling shading precision by a GPU may include providing, by a VRS interface, the spatial information and/or primitive-specific information. The method may include determining, by a control logic section, a shading precision value based on the spatial information and/or the primitive-specific information. The method may include modulating a shading precision according to the shading precision value.

Type: Application

Filed: November 20, 2020

Publication date: November 18, 2021

Inventors: Christopher P. FRASCATI, Raun M. KRISCH, Derek J. LENTZ, David C. TANNENBAUM
METHODS AND APPARATUS FOR EFFICIENT BLENDING IN A GRAPHICS PIPELINE

Publication number: 20210343052

Abstract: A method for performing a blending operation in a graphics processing unit may include multiplying a first value and a first blend factor for a component of a render target with a first multiply operation of a multiplier, multiplying a second value and a second blend factor for the component of the render target with a second multiply operation of the multiplier, and adding a first result of the first multiply operation and a second result of the second multiply operation. The method may further include bypassing the multiplier for a first blend factor of substantially zero. The method may further include, for a blend factor of substantially one: bypassing the multiplier, and providing the first value of the component of the render target as a result of the first multiply operation.

Type: Application

Filed: September 22, 2020

Publication date: November 4, 2021

Inventors: Derek J. LENTZ, David TANNENBAUM
METHODS AND APPARATUS FOR AVOIDING LOCKUP IN A GRAPHICS PIPELINE

Publication number: 20210295465

Abstract: A method for avoiding lockup in a graphics pipeline may include accumulating position information in an accumulating stage of the pipeline, passing position information from a first preceding stage of the pipeline to the accumulating stage, determining a condition in the first preceding stage, and draining accumulated position information from the accumulating stage in response to the condition in the first preceding stage. The method may further include passing position information from the first preceding stage to a second preceding stage of the pipeline, passing position information from the second preceding stage to the accumulating stage, determining a condition in the second preceding stage, and draining accumulated position information from the accumulating stage in response to the condition in the first preceding stage and the condition in the second preceding stage.

Type: Application

Filed: April 30, 2020

Publication date: September 23, 2021

Inventors: Derek J. LENTZ, Chiachi CHAO
METHODS AND APPARATUS FOR DECOUPLED SHADING TEXTURE RENDERING

Publication number: 20210295586

Abstract: A method for rendering an image in a graphics processing system may include calculating texture coverage data for an image during an image coverage pass using a rendering pipeline, generating rendered texture data during a texture rendering pass in texture space based on the texture coverage data using one or more hardware resources of the rendering pipeline, and rendering the image based on the rendered texture data using the rendering pipeline. A graphics processing system may include a rendering pipeline configured to support decoupled shading, the pipeline may include a data buffer configured to receive texture coverage data for an image, and a rasterization unit configured to read the texture coverage data from the data buffer and use the texture coverage data to limit rasterization coverage during a texture rendering pass for the image.

Type: Application

Filed: April 30, 2020

Publication date: September 23, 2021

Inventor: Derek J. LENTZ
Methods and apparatus for avoiding lockup in a graphics pipeline

Patent number: 11127109

Abstract: A method for avoiding lockup in a graphics pipeline may include accumulating position information in an accumulating stage of the pipeline, passing position information from a first preceding stage of the pipeline to the accumulating stage, determining a condition in the first preceding stage, and draining accumulated position information from the accumulating stage in response to the condition in the first preceding stage. The method may further include passing position information from the first preceding stage to a second preceding stage of the pipeline, passing position information from the second preceding stage to the accumulating stage, determining a condition in the second preceding stage, and draining accumulated position information from the accumulating stage in response to the condition in the first preceding stage and the condition in the second preceding stage.

Type: Grant

Filed: April 30, 2020

Date of Patent: September 21, 2021

Inventors: Derek J. Lentz, Chiachi Chao
Elimination of minimal use threads via quad merging

Patent number: 9972124

Abstract: Provided is a method of performing coverage merging in a shading stage of a graphics system. The method includes: performing a draw call on primitives and rasterizing the primitives into blocks of pixels; selecting the draw call for merge testing of individual blocks; performing a depth test on the individual blocks; in response to the depth test being satisfied, merging partially covered fragments of the same draw call of one of the block of pixels to form a merged block of pixels; and performing shading of the merged block of pixels on a draw call basis.

Type: Grant

Filed: June 26, 2017

Date of Patent: May 15, 2018

Assignee: Samsung Electronics Co., Ltd.

Inventors: Derek J. Lentz, Veynu T. Narasiman, Karthik Ramani
ELIMINATION OF MINIMAL USE THREADS VIA QUAD MERGING

Publication number: 20170309065

Abstract: Provided is a method of performing coverage merging in a shading stage of a graphics system. The method includes: performing a draw call on primitives and rasterizing the primitives into blocks of pixels; selecting the draw call for merge testing of individual blocks; performing a depth test on the individual blocks; in response to the depth test being satisfied, merging partially covered fragments of the same draw call of one of the block of pixels to form a merged block of pixels; and performing shading of the merged block of pixels on a draw call basis.

Type: Application

Filed: June 26, 2017

Publication date: October 26, 2017

Inventors: Derek J. Lentz, Veynu T. Narasiman, Karthik Ramani
Elimination of minimal use threads via quad merging

Patent number: 9721376

Abstract: Fragment merging is performed on a draw call basis. One application is for quad merging. Primitives of the same draw call have many common attributes, such as a graphics state, which facilitates merging of quad fragments. Partially covered quad fragments of the same draw call are considered for possible merging and at least one merge test performed. The merge test may include error tests such as a level of detail error test, interpolated depth, and an interpolation error test.

Type: Grant

Filed: March 27, 2015

Date of Patent: August 1, 2017

Assignee: Samsung Electronics Co., Ltd.

Inventors: Derek J. Lentz, Sang Oak Woo
ELIMINATION OF MINIMAL USE THREADS VIA QUAD MERGING

Publication number: 20150379764

Abstract: Fragment merging is performed on a draw call basis. One application is for quad merging. Primitives of the same draw call have many common attributes, such as a graphics state, which facilitates merging of quad fragments. Partially covered quad fragments of the same draw call are considered for possible merging and at least one merge test performed. The merge test may include error tests such as a level of detail error test, interpolated depth, and an interpolation error test.

Type: Application

Filed: March 27, 2015

Publication date: December 31, 2015

Inventors: Derek J. LENTZ, Sang Oak WOO
System and method for handling load and/or store operations in a superscalar microprocessor

Patent number: 8019975

Abstract: The present invention provides a system and method for managing load and store operations necessary for reading from and writing to memory or I/O in a superscalar RISC architecture environment. To perform this task, a load store unit is provided whose main purpose is to make load requests out of order whenever possible to get the load data back for use by an instruction execution unit as quickly as possible. A load operation can only be performed out of order if there are no address collisions and no write pendings. An address collision occurs when a read is requested at a memory location where an older instruction will be writing. Write pending refers to the case where an older instruction requests a store operation, but the store address has not yet been calculated. The data cache unit returns 8 bytes of unaligned data. The load/store unit aligns this data properly before it is returned to the instruction execution unit.

Type: Grant

Filed: April 25, 2005

Date of Patent: September 13, 2011

Assignee: Seiko-Epson Corporation

Inventors: Cheryl Senter Brashears, Johannes Wang, Le Trong Nguyen, Derek J. Lentz, Yoshiyuki Miyayama, Sanjiv Garg, Yasuaki Hagiwara, Te-Li Lau, Sze-Shun Wang, Quang H. Trang
RISC microprocessor architecture implementing multiple typed register sets

Patent number: 7941636

Abstract: Disclosed herein is an apparatus that implements multiple typed register sets, and applications thereof. The apparatus includes an execution unit and a register file. The execution unit is configured to execute instructions including one or more fields. The register file is configured to store operands defined by the one or more fields and is configured to store results of execution of the instructions in a destination defined by the one or more fields. The register file includes (i) a first register set having a register configured to store data of a single data type and (ii) a second register set having a register configured to store data of a plurality of data types.

Type: Grant

Filed: December 31, 2009

Date of Patent: May 10, 2011

Assignee: Intellectual Venture Funding LLC

Inventors: Sanjiv Garg, Derek J. Lentz, Le Trong Nguyen, Sho Long Chen
High-performance superscalar-based computer system with out-of order instruction execution and concurrent results distribution

Patent number: 7941635

Abstract: The high-performance, RISC core based microprocessor architecture includes an instruction fetch unit for fetching instruction sets from an instruction store and an execution unit that implements the concurrent execution of a plurality of instructions through a parallel array of functional units. The fetch unit generally maintains a predetermined number of instructions in an instruction buffer. The execution unit includes an instruction selection unit, coupled to the instruction buffer, for selecting instructions for execution, and a plurality of functional units for performing instruction specified functional operations. A unified instruction scheduler, within the instruction selection unit, initiates the processing of instructions through the functional units when instructions are determined to be available for execution and for which at least one of the functional units implementing a necessary computational function is available.

Type: Grant

Filed: December 19, 2006

Date of Patent: May 10, 2011

Assignee: Seiko-Epson Corporation

Inventors: Le Trong Nguyen, Derek J. Lentz, Yoshiyuki Miyayama, Sanjiv Garg, Yasuaki Hagiwara, Johannes Wang, Te-Li Lau, Sze-Shun Wang, Quang H. Trang
High-performance, superscalar-based computer system with out-of-order instruction execution

Patent number: 7739482

Abstract: A high-performance, superscalar-based computer system with out-of-order instruction execution for enhanced resource utilization and performance throughput. The computer system fetches a plurality of fixed length instructions with a specified, sequential program order (in-order). The computer system includes an instruction execution unit including a register file, a plurality of functional units, and an instruction control unit for examining the instructions and scheduling the instructions for out-of-order execution by the functional units. The register file includes a set of temporary data registers that are utilized by the instruction execution control unit to receive data results generated by the functional units. The data results of each executed instruction are stored in the temporary data registers until all prior instructions have been executed, thereby retiring the executed instruction in-order.

Type: Grant

Filed: December 21, 2006

Date of Patent: June 15, 2010

Assignee: Seiko Epson Corporation

Inventors: Le Trong Nguyen, Derek J. Lentz, Yoshiyuki Miyayama, Sanjiv Garg, Yasuaki Hagiwara, Johannes Wang, Te-Li Lau, Sze-Shun Wang, Quang H. Trang
High-performance, superscalar-based computer system with out-of-order instruction execution

Patent number: 7721070

Abstract: A high-performance, superscalar-based computer system with out-of-order instruction execution for enhanced resource utilization and performance throughput. The computer system fetches a plurality of fixed length instructions with a specified, sequential program order (in-order). The computer system includes an instruction execution unit including a register file, a plurality of functional units, and an instruction control unit for examining the instructions and scheduling the instructions for out-of-order execution by the functional units. The register file includes a set of temporary data registers that are utilized by the instruction execution control unit to receive data results generated by the functional units. The data results of each executed instruction are stored in the temporary data registers until all prior instructions have been executed, thereby retiring the executed instruction in-order.

Type: Grant

Filed: September 22, 2008

Date of Patent: May 18, 2010

Inventors: Le Trong Nguyen, Derek J. Lentz, Yoshiyuki Miyayama, Sanjiv Garg, Yasuaki Hagiwara, Johannes Wang, Te-Li Lau, Sze-Shun Wang, Quang H. Trang
RISC Microprocessor Architecture Implementing Multiple Typed Register Sets

Publication number: 20100106942

Abstract: Disclosed herein is an apparatus that implements multiple typed register sets, and applications thereof. The apparatus includes an execution unit and a register file. The execution unit is configured to execute instructions including one or more fields. The register file is configured to store operands defined by the one or more fields and is configured to store results of execution of the instructions in a destination defined by the one or more fields. The register file includes (i) a first register set having a register configured to store data of a single data type and (ii) a second register set having a register configured to store data of a plurality of data types.

Type: Application

Filed: December 31, 2009

Publication date: April 29, 2010

Inventors: Sanjiv GARG, Derek J. Lentz, Le Trong Nguyen, Sho Long Chen
RISC microprocessor architecture implementing multiple typed register sets

Patent number: 7685402

Abstract: A register system for a data processor which operates in a plurality of modes. The register system provides multiple, identical banks of register sets, the data processor controlling access such that instructions and processes need not specify any given bank. An integer register set includes first (RA[23:0]) and second (RA[31:24]) subsets, and a shadow subset (RT[31:24]). While the data processor is in a first mode, instructions access the first and second subsets. While the data processor is in a second mode, instructions may access the first subset, but any attempts to access the second subset are re-routed to the shadow subset instead, transparently to the instructions, allowing system routines to seemingly use the second subset without having to save and restore data which user routines have written to the second subset. A re-typable register set provides integer width data and floating point width data in response to integer instructions and floating point instructions, respectively.

Type: Grant

Filed: January 9, 2007

Date of Patent: March 23, 2010

Inventors: Sanjiv Garg, Derek J. Lentz, Le Trong Nguyen, Sho Long Chen
Microprocessor architecture capable of supporting multiple heterogeneous processors

Patent number: 7657712

Abstract: A memory control unit for controlling access, by one or more devices within a processor, to a memory array unit external to the processor via one or more memory ports of the processor. The memory control unit includes a switch network to transfer data between the one or more devices of the processor and the one or more memory ports of the processor. The memory control unit also includes a switch arbitration unit to arbitrate for the switch network, and a port arbitration unit to arbitrate for the one or more memory ports.

Type: Grant

Filed: August 30, 2005

Date of Patent: February 2, 2010

Assignee: Seiko Epson Corporation

Inventors: Derek J. Lentz, Yasuaki Hagiwara, Te-Li Lau, Cheng-Long Tang, Le Trong Nguyen
RISC microprocessor architecture implementing multiple typed register sets

Patent number: 7555631

Abstract: A register system for a data processor which operates in a plurality of modes. The register system provides multiple, identical banks of register sets, the data processor controlling access such that instructions and processes need not specify any given bank. An integer register set includes first (RA[23:0]) and second (RA[31:24]) subsets, and a shadow subset (RT[31:24]). While the data processor is in a first mode, instructions access the first and second subsets. While the data processor is in a second mode, instructions may access the first subset, but any attempts to access the second subset are re-routed to the shadow subset instead, transparently to the instructions, allowing system routines to seemingly use the second subset without having to save and restore data which user routines have written to the second subset. A re-typable register set provides integer width data and floating point width data in response to integer instructions and floating point instructions, respectively.

Type: Grant

Filed: January 31, 2002

Date of Patent: June 30, 2009

Inventors: Sanjiv Garg, Derek J. Lentz, Le Trong Nguyen, Sho Long Chen
High-performance superscalar-based computer system with out-of-order instruction execution and concurrent results distribution

Patent number: 7555632

Abstract: The high-performance, RISC core based microprocessor architecture includes an instruction fetch unit for fetching instruction sets from an instruction store and an execution unit that implements the concurrent execution of a plurality of instructions through a parallel array of functional units. The fetch unit generally maintains a predetermined number of instructions in an instruction buffer. The execution unit includes an instruction selection unit, coupled to the instruction buffer, for selecting instructions for execution, and a plurality of functional units for performing instruction specified functional operations. A unified instruction scheduler, within the instruction selection unit, initiates the processing of instructions through the functional units when instructions are determined to be available for execution and for which at least one of the functional units implementing a necessary computational function is available.

Type: Grant

Filed: December 27, 2005

Date of Patent: June 30, 2009

Assignee: Seiko Epson Corporation

Inventors: Le Trong Nguyen, Derek J. Lentz, Yoshiyuki Miyayama, Sanjiv Garg, Yasuaki Hagiwara, Johannes Wang, Te-Li Lau, Sze-Shun Wang, Quang H. Trang
High-performance, superscalar-based computer system with out-of-order instruction execution

Patent number: 7487333

Abstract: A high-performance, superscalar-based computer system with out-of-order instruction execution for enhanced resource utilization and performance throughput. The computer system fetches a plurality of fixed length instructions with a specified, sequential program order (in-order). The computer system includes an instruction execution unit including a register file, a plurality of functional units, and an instruction control unit for examining the instructions and scheduling the instructions for out-of-order execution by the functional units. The register file includes a set of temporary data registers that are utilized by the instruction execution control unit to receive data results generated by the functional units. The data results of each executed instruction are stored in the temporary data registers until all prior instructions have been executed, thereby retiring the executed instruction in-order.

Type: Grant

Filed: November 5, 2003

Date of Patent: February 3, 2009

Assignee: Seiko Epson Corporation

Inventors: Le-Trong Nguyen, Derek J Lentz, Yoshiyuki Miyayama, Sanjiv Garg, Yasuaki Hagiwara, Johannes Wang, Te-Li Lau, Sze-Shun Wang, Quang H Trang

1 2 3 4 5 next