Patents by Inventor Nathan Tuck

Nathan Tuck has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Memory type which is cacheable yet inaccessible by speculative instructions

Patent number: 10642744

Abstract: An improved architectural means to address processor cache attacks based on speculative execution defines a new memory type that is both cacheable and inaccessible by speculation. Speculative execution cannot access and expose a memory location that is speculatively inaccessible. Such mechanisms can disqualify certain sensitive data from being exposed through speculative execution. Data which must be protected at a performance cost may be specifically marked. If the processor is told where secrets are stored in memory and is forbidden from speculating on those memory locations, then the processor will ensure the process trying to access those memory locations is privileged to access those locations before reading and caching them. Such countermeasure is effective against attacks that use speculative execution to leak secrets from a processor cache.

Type: Grant

Filed: June 28, 2018

Date of Patent: May 5, 2020

Assignee: NVIDIA Corporation

Inventors: Darrell D. Boggs, Ross Segelken, Mike Cornaby, Nick Fortino, Shailender Chaudhry, Denis Khartikov, Alok Mooley, Nathan Tuck, Gordon Vreugdenhil
Fault detection in instruction translations

Patent number: 10324725

Abstract: The disclosure provides a method and a system for identifying and replacing code translations that generate spurious fault events. In one embodiment the method includes executing a first set and a second set of native instructions, performing a third translation of a target instruction to form a third set of native instructions in response to a determination that a fault occurrence is attributed to a first translation, wherein the third set of native instructions is not the same as the second set of native instructions, and the third set of native instructions is not the same as the first set of native instructions, and executing the third set of native instructions.

Type: Grant

Filed: March 8, 2018

Date of Patent: June 18, 2019

Assignee: Nvidia Corporation

Inventors: Nathan Tuck, David Dunn, Ross Segelken, Madhu Swarna
MEMORY TYPE WHICH IS CACHEABLE YET INACCESSIBLE BY SPECULATIVE INSTRUCTIONS

Publication number: 20190004961

Abstract: An improved architectural means to address processor cache attacks based on speculative execution defines a new memory type that is both cacheable and inaccessible by speculation. Speculative execution cannot access and expose a memory location that is speculatively inaccessible. Such mechanisms can disqualify certain sensitive data from being exposed through speculative execution. Data which must be protected at a performance cost may be specifically marked. If the processor is told where secrets are stored in memory and is forbidden from speculating on those memory locations, then the processor will ensure the process trying to access those memory locations is privileged to access those locations before reading and caching them. Such countermeasure is effective against attacks that use speculative execution to leak secrets from a processor cache.

Type: Application

Filed: June 28, 2018

Publication date: January 3, 2019

Inventors: Darrell D. BOGGS, Ross SEGELKEN, Mike CORNABY, Nick FORTINO, Shailender CHAUDHRY, Denis KHARTIKOV, Alok MOOLEY, Nathan TUCK, Gordon VREUGDENHIL
Translation address cache for a microprocessor

Patent number: 10146545

Abstract: Embodiments related to fetching instructions and alternate versions achieving the same functionality as the instructions from an instruction cache included in a microprocessor are provided. In one example, a method is provided, comprising, at an example microprocessor, fetching an instruction from an instruction cache. The example method also includes hashing an address for the instruction to determine whether an alternate version of the instruction which achieves the same functionality as the instruction exists. The example method further includes, if hashing results in a determination that such an alternate version exists, aborting fetching of the instruction and retrieving and executing the alternate version.

Type: Grant

Filed: March 13, 2012

Date of Patent: December 4, 2018

Assignee: Nvidia Corporation

Inventors: Ross Segelken, Alex Klaiber, Nathan Tuck, David Dunn
Profiling code portions to generate translations

Patent number: 10108424

Abstract: The disclosure provides a micro-processing system operable in a hardware decoder mode and in a translation mode. In the hardware decoder mode, the hardware decoder receives and decodes non-native ISA instructions into native instructions for execution in a processing pipeline. In the translation mode, native translations of non-native ISA instructions are executed in the processing pipeline without using the hardware decoder. The system includes a code portion profile stored in hardware that changes dynamically in response to use of the hardware decoder to execute portions of non-native ISA code. The code portion profile is then used to dynamically form new native translations executable in the translation mode.

Type: Grant

Filed: March 14, 2013

Date of Patent: October 23, 2018

Assignee: Nvidia Corporation

Inventors: Nathan Tuck, Alexander Klaiber, Ross Segelken, David Dunn, Ben Hertzberg, Rupert Brauch, Thomas Kistler, Guillermo J. Rozas, Madhu Swarna
FAULT DETECTION IN INSTRUCTION TRANSLATIONS

Publication number: 20180260222

Abstract: The disclosure provides a method and a system for identifying and replacing code translations that generate spurious fault events. In one embodiment the method includes executing a first set and a second set of native instructions, performing a third translation of a target instruction to form a third set of native instructions in response to a determination that a fault occurrence is attributed to a first translation, wherein the third set of native instructions is not the same as the second set of native instructions, and the third set of native instructions is not the same as the first set of native instructions, and executing the third set of native instructions.

Type: Application

Filed: March 8, 2018

Publication date: September 13, 2018

Inventors: Nathan Tuck, David Dunn, Ross Segelken, Madhu Swarna
Improving hit rate of code translation redirection table with replacement strategy based on usage history table of evicted entries

Patent number: 9880846

Abstract: In one embodiment, a micro-processing system includes a hardware structure disposed on a processor core. The hardware structure includes a plurality of entries, each of which are associated with portion of code and a translation of that code which can be executed to achieve substantially equivalent functionality. The hardware structure includes a redirection array that enables, when referenced, execution to be redirected from a portion of code to its counterpart translation. The entries enabling such redirection are maintained within or evicted from the hardware structure based on usage information for the entries.

Type: Grant

Filed: April 11, 2012

Date of Patent: January 30, 2018

Assignee: NVIDIA CORPORATION

Inventors: Nathan Tuck, Ross Segelken
Custom chaining stubs for instruction code translation

Patent number: 9384001

Abstract: A processing system includes a microprocessor, a hardware decoder arranged within the microprocessor, and a translator operatively coupled to the microprocessor. The hardware decoder is configured to decode instruction code non-native to the microprocessor for execution in the microprocessor. The translator is configured to form a translation of the instruction code in an instruction set native to the microprocessor and to connect a branch instruction in the translation to a chaining stub. The chaining stub is configured to selectively cause additional instruction code at a target address of the branch instruction to be received in the hardware decoder without causing the processing system to search for a translation of additional instruction code at the target address.

Type: Grant

Filed: August 15, 2012

Date of Patent: July 5, 2016

Assignee: NVIDIA CORPORATION

Inventors: Ben Hertzberg, Nathan Tuck
PROFILING CODE PORTIONS TO GENERATE TRANSLATIONS

Publication number: 20140281392

Abstract: The disclosure provides a micro-processing system operable in a hardware decoder mode and in a translation mode. In the hardware decoder mode, the hardware decoder receives and decodes non-native ISA instructions into native instructions for execution in a processing pipeline. In the translation mode, native translations of non-native ISA instructions are executed in the processing pipeline without using the hardware decoder. The system includes a code portion profile stored in hardware that changes dynamically in response to use of the hardware decoder to execute portions of non-native ISA code. The code portion profile is then used to dynamically form new native translations executable in the translation mode.

Type: Application

Filed: March 14, 2013

Publication date: September 18, 2014

Applicant: NVIDIA CORPORATION

Inventors: Nathan Tuck, Alexander Klaiber, Ross Segelken, David Dunn, Ben Hertzberg, Rupert Brauch, Thomas Kistler, Guillermo J. Rozas, Madhu Swarna
FAULT DETECTION IN INSTRUCTION TRANSLATIONS

Publication number: 20140189310

Abstract: In one embodiment, a method for identifying and replacing code translations that generate spurious fault events includes detecting, while executing a first native translation of target instruction set architecture (ISA) instructions, occurrence of a fault event, executing the target ISA instructions or a functionally equivalent version thereof, determining whether occurrence of the fault event is replicated while executing the target ISA instructions or the functionally equivalent version thereof, and in response to determining that the fault event is not replicated, determining whether to allow future execution of the first native translation or to prevent such future execution in favor of forming and executing one or more alternate native translations.

Type: Application

Filed: December 27, 2012

Publication date: July 3, 2014

Applicant: NVIDIA CORPORATION

Inventors: Nathan Tuck, David Dunn, Ross Segelken, Madhu Swarna
CUSTOM CHAINING STUBS FOR INSTRUCTION CODE TRANSLATION

Publication number: 20140052962

Abstract: A processing system includes a microprocessor, a hardware decoder arranged within the microprocessor, and a translator operatively coupled to the microprocessor. The hardware decoder is configured to decode instruction code non-native to the microprocessor for execution in the microprocessor. The translator is configured to form a translation of the instruction code in an instruction set native to the microprocessor and to connect a branch instruction in the translation to a chaining stub. The chaining stub is configured to selectively cause additional instruction code at a target address of the branch instruction to be received in the hardware decoder without causing the processing system to search for a translation of additional instruction code at the target address.

Type: Application

Filed: August 15, 2012

Publication date: February 20, 2014

Applicant: NVIDIA CORPORATION

Inventors: Ben Hertzberg, Nathan Tuck
ACCESSING AND MANAGING CODE TRANSLATIONS IN A MICROPROCESSOR

Publication number: 20130275684

Abstract: In one embodiment, a micro-processing system includes a hardware structure disposed on a processor core. The hardware structure includes a plurality of entries, each of which are associated with portion of code and a translation of that code which can be executed to achieve substantially equivalent functionality. The hardware structure includes a redirection array that enables, when referenced, execution to be redirected from a portion of code to its counterpart translation. The entries enabling such redirection are maintained within or evicted from the hardware structure based on usage information for the entries.

Type: Application

Filed: April 11, 2012

Publication date: October 17, 2013

Applicant: NVIDIA CORPORATION

Inventors: Nathan Tuck, Ross Segelken
TRANSLATION ADDRESS CACHE FOR A MICROPROCESSOR

Publication number: 20130246709

Abstract: Embodiments related to fetching instructions and alternate versions achieving the same functionality as the instructions from an instruction cache included in a microprocessor are provided. In one example, a method is provided, comprising, at an example microprocessor, fetching an instruction from an instruction cache. The example method also includes hashing an address for the instruction to determine whether an alternate version of the instruction which achieves the same functionality as the instruction exists. The example method further includes, if hashing results in a determination that such an alternate version exists, aborting fetching of the instruction and retrieving and executing the alternate version.

Type: Application

Filed: March 13, 2012

Publication date: September 19, 2013

Applicant: NVIDIA CORPORATION

Inventors: Ross Segelken, Alex Klaiber, Nathan Tuck, David Dunn
Vector processor

Publication number: 20070255894

Abstract: A vector processing system provides high performance vector processing using a System-On-a-Chip (SOC) implementation technique. One or more scalar processors (or cores) operate in conjunction with a vector processor, and the processors collectively share access to a plurality of memory interfaces coupled to Dynamic Random Access read/write Memories (DRAMs). In typical embodiments the vector processor operates as a slave to the scalar processors, executing computationally intensive Single Instruction Multiple Data (SIMD) codes in response to commands received from the scalar processors. The vector processor implements a vector processing Instruction Set Architecture (ISA) including machine state, instruction set, exception model, and memory model.

Type: Application

Filed: February 10, 2006

Publication date: November 1, 2007

Inventors: Richard Hessel, Nathan Tuck, Korbin Van Dyke, Chetana Keltcher
DEFERRED SHADING GRAPHICS PIPELINE PROCESSOR HAVING ADVANCED FEATURES

Publication number: 20070165035

Abstract: A deferred shading graphics pipeline processor and method are provided encompassing numerous substructures. Embodiments of the processor and method may include one or more of deferred shading, a tiled frame buffer, and multiple?stage hidden surface removal processing. In the deferred shading graphics pipeline, hidden surface removal is completed before pixel coloring is done. The pipeline processor comprises a command fetch and decode unit, a geometry unit, a mode extraction unit, a sort unit, a setup unit, a cull unit, a mode injection unit, a fragment unit, a texture unit, a Phong lighting unit, a pixel unit, and a backend unit.

Type: Application

Filed: December 19, 2006

Publication date: July 19, 2007

Applicant: Apple Computer, Inc.

Inventors: Jerome Duluk, Richard Hessel, Vaughn Arnold, Jack Benkual, Joseph Bratt, George Cuan, Stephen Dodgen, Emerson Fang, Zhaoyu Gong, Thomas Ho, Hengwei Hsu, Sidong Li, Sam Ng, Matthew Papakipos, Jason Redgrave, Sushma Trivedi, Nathan Tuck, Shun Go, Lindy Fung, Tuan Nguyen, Joseph Grass, Bo Hung, Abraham Mammen, Abbas Rashid, Albert Tsay
Method and apparatus for performing tangent space lighting and bump mapping in a deferred shading graphics processor

Patent number: 6771264

Abstract: A system and method for performing tangent space lighting in a deferred shading graphics processor (DSGP) encompasses blocks of the DSGP that preprocess data and a Phong shader that executes only after all fragments have been preprocessed. A preprocessor block receives texture maps specified in a variety of formats and converts those texture maps to a common format for use by the Phong shader. The preprocessor blocks provide the Phong shader with interpolated surface basis vectors (vs, vt, n), a vector Tb that represents in tangen/object space the texture/bump data from the texture maps, light data, material data, eye coordinates and other information used by the Phong shader to perform the lighting and bump mapping computations. The data from the preprocessor is provided for each fragment for which lighting effects need to be computed. The Phong shader computes the color of a fragment using the information provided by the preprocessor.

Type: Grant

Filed: December 17, 1999

Date of Patent: August 3, 2004

Assignee: Apple Computer, Inc.

Inventors: Jerome F. Duluk, Stephen L. Dodgen, Joseph P. Bratt, Matthew Papakipos, Nathan Tuck, Richard E. Hessel