Patents by Inventor Ross Segelken

Ross Segelken has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

FAULT DETECTION IN INSTRUCTION TRANSLATIONS

Publication number: 20140189310

Abstract: In one embodiment, a method for identifying and replacing code translations that generate spurious fault events includes detecting, while executing a first native translation of target instruction set architecture (ISA) instructions, occurrence of a fault event, executing the target ISA instructions or a functionally equivalent version thereof, determining whether occurrence of the fault event is replicated while executing the target ISA instructions or the functionally equivalent version thereof, and in response to determining that the fault event is not replicated, determining whether to allow future execution of the first native translation or to prevent such future execution in favor of forming and executing one or more alternate native translations.

Type: Application

Filed: December 27, 2012

Publication date: July 3, 2014

Applicant: NVIDIA CORPORATION

Inventors: Nathan Tuck, David Dunn, Ross Segelken, Madhu Swarna

QUEUED INSTRUCTION RE-DISPATCH AFTER RUNAHEAD

Publication number: 20140189313

Abstract: Various embodiments of microprocessors and methods of operating a microprocessor during runahead operation are disclosed herein. One example method of operating a microprocessor includes identifying a runahead-triggering event associated with a runahead-triggering instruction and, responsive to identification of the runahead-triggering event, entering runahead operation and inserting the runahead-triggering instruction along with one or more additional instructions in a queue. The example method also includes resuming non-runahead operation of the microprocessor in response to resolution of the runahead-triggering event and re-dispatching the runahead-triggering instruction along with the one or more additional instructions from the queue to the execution logic.

Type: Application

Filed: December 28, 2012

Publication date: July 3, 2014

Applicant: NVIDIA CORPORATION

Inventors: Guillermo J. Rozas, Alexander Klaiber, James van Zoeren, Paul Serris, Brad Hoyt, Sridharan Ramakrishnan, Hens Vanderschoot, Ross Segelken, Darrell D. Boggs, Magnus Ekman, Aravindh Baktha, David Dunn

INSTRUCTION CATEGORIZATION FOR RUNAHEAD OPERATION

Publication number: 20140164738

Abstract: Embodiments related to methods and devices operative, in the event that execution of an instruction produces a runahead-triggering event, to cause a microprocessor to enter into and operate in a runahead without reissuing the instruction are provided. In one example, a microprocessor is provided. The example microprocessor includes fetch logic for retrieving an instruction, scheduling logic for issuing the instruction retrieved by the fetch logic for execution, and runahead control logic. The example runahead control logic is operative, in the event that execution of the instruction as scheduled by the scheduling logic produces a runahead-triggering event, to cause the microprocessor to enter into and operate in a runahead mode without reissuing the instruction, and carry out runahead policies while the microprocessor is in the runahead mode that governs operation of the microprocessor and cause the microprocessor to operate differently than when not in the runahead mode.

Type: Application

Filed: December 7, 2012

Publication date: June 12, 2014

Applicant: NVIDIA Corporation

Inventors: Magnus Ekman, Guillermo J. Rozas, Alexander Klaiber, James van Zoeren, Paul Serris, Brad Hoyt, Sridharan Ramakrishnan, Hens Vanderschoot, Ross Segelken, Darrell D. Boggs

LAZY RUNAHEAD OPERATION FOR A MICROPROCESSOR

Publication number: 20140164736

Abstract: Embodiments related to managing lazy runahead operations at a microprocessor are disclosed. For example, an embodiment of a method for operating a microprocessor described herein includes identifying a primary condition that triggers an unresolved state of the microprocessor. The example method also includes identifying a forcing condition that compels resolution of the unresolved state. The example method also includes, in response to identification of the forcing condition, causing the microprocessor to enter a runahead mode.

Type: Application

Filed: December 7, 2012

Publication date: June 12, 2014

Applicant: NVIDIA CORPORATION

Inventors: Guillermo J. Rozas, Alexander Klaiber, James van Zoeren, Paul Serris, Brad Hoyt, Sridharan Ramakrishnan, Hens Vanderschoot, Ross Segelken, Darrell D. Boggs, Magnus Ekman

MANAGING POTENTIALLY INVALID RESULTS DURING RUNAHEAD

Publication number: 20140136891

Abstract: Embodiments related to managing potentially invalid results generated/obtained by a microprocessor during runahead are provided. In one example, a method for operating a microprocessor includes causing the microprocessor to enter runahead upon detection of a runahead event. The example method also includes, during runahead, determining that an operation associated with an instruction referencing a storage location would produce a potentially invalid result based on a value of an architectural poison bit associated with the storage location and performing a different operation in response.

Type: Application

Filed: November 14, 2012

Publication date: May 15, 2014

Applicant: NVIDIA CORPORATION

Inventors: Bruce Holmer, Guillermo J. Rozas, Alexander Klaiber, James van Zoeren, Paul Serris, Brad Hoyt, Sridharan Ramakrishnan, Hens Vanderschoot, Ross Segelken, Darrell D. Boggs, Magnus Ekman

INSTRUCTION-OPTIMIZING PROCESSOR WITH BRANCH-COUNT TABLE IN HARDWARE

Publication number: 20130311752

Abstract: A processing system comprising a microprocessor core and a translator. Within the microprocessor core is arranged a hardware decoder configured to selectively decode instructions for execution in the microprocessor core, and, a logic structure configured to track usage of the hardware decoder. The translator is operatively coupled to the logic structure and configured to selectively translate the instructions for execution in the microprocessor core, based on the usage of the hardware decoder as determined by the logic structure.

Type: Application

Filed: May 18, 2012

Publication date: November 21, 2013

Applicant: NVIDIA CORPORATION

Inventors: Rupert Brauch, Madhu Swarna, Ross Segelken, David Dunn, Ben Hertzberg

CHECKPOINTED BUFFER FOR RE-ENTRY FROM RUNAHEAD

Publication number: 20130297911

Abstract: Embodiments related to re-dispatching an instruction selected for re-execution from a buffer upon a microprocessor re-entering a particular execution location after runahead are provided. In one example, a microprocessor is provided. The example microprocessor includes fetch logic, one or more execution mechanisms for executing a retrieved instruction provided by the fetch logic, and scheduler logic for scheduling the retrieved instruction for execution. The example scheduler logic includes a buffer for storing the retrieved instruction and one or more additional instructions, the scheduler logic being configured, upon the microprocessor re-entering at a particular execution location after runahead, to re-dispatch, from the buffer, an instruction that has been previously dispatched to one of the execution mechanisms.

Type: Application

Filed: May 3, 2012

Publication date: November 7, 2013

Applicant: NVIDIA CORPORATION

Inventors: Guillermo J. Rozas, Paul Serris, Brad Hoyt, Sridharan Ramakrishnan, Hens Vanderschoot, Ross Segelken, Darrell Boggs, Magnus Ekman

BRANCH PREDICTION POWER REDUCTION

Publication number: 20130290640

Abstract: In one embodiment, a microprocessor is provided. The microprocessor includes instruction memory and a branch prediction unit. The branch prediction unit is configured to use information from the instruction memory to selectively power up the branch prediction unit from a powered-down state when fetched instruction data includes a branch instruction and maintain the branch prediction unit in the powered-down state when the fetched instruction data does not include a branch instruction in order to reduce power consumption of the microprocessor during instruction fetch operations.

Type: Application

Filed: April 27, 2012

Publication date: October 31, 2013

Applicant: NVIDIA CORPORATION

Inventors: Aneesh Aggarwal, Ross Segelken, Kevin Koschoreck, Paul Wasson

BRANCH PREDICTION POWER REDUCTION

Publication number: 20130290676

Abstract: In one embodiment, a microprocessor is provided. The microprocessor includes a branch prediction unit. The branch prediction unit is configured to track the presence of branches in instruction data that is fetched from an instruction memory after a redirection at a target of a predicted taken branch. The branch prediction unit is selectively powered up from a powered-down state when the fetched instruction data includes a branch instruction and is maintained in the powered-down state when the fetched instruction data does not include an instruction branch in order to reduce power consumption of the microprocessor during instruction fetch operations.

Type: Application

Filed: April 27, 2012

Publication date: October 31, 2013

Applicant: NVIDIA CORPORATION

Inventors: Aneesh Aggarwal, Ross Segelken, Paul Wasson

ACCESSING AND MANAGING CODE TRANSLATIONS IN A MICROPROCESSOR

Publication number: 20130275684

Abstract: In one embodiment, a micro-processing system includes a hardware structure disposed on a processor core. The hardware structure includes a plurality of entries, each of which is associated with a portion of code and a translation of that code which can be executed to achieve substantially equivalent functionality. The hardware structure includes a redirection array that enables, when referenced, execution to be redirected from a portion of code to its counterpart translation. The entries enabling such redirection are maintained within or evicted from the hardware structure based on usage information for the entries.

Type: Application

Filed: April 11, 2012

Publication date: October 17, 2013

Applicant: NVIDIA CORPORATION

Inventors: Nathan Tuck, Ross Segelken

TRANSLATION ADDRESS CACHE FOR A MICROPROCESSOR

Publication number: 20130246709

Abstract: Embodiments related to fetching instructions and alternate versions achieving the same functionality as the instructions from an instruction cache included in a microprocessor are provided. In one example, a method is provided, comprising, at an example microprocessor, fetching an instruction from an instruction cache. The example method also includes hashing an address for the instruction to determine whether an alternate version of the instruction which achieves the same functionality as the instruction exists. The example method further includes, if hashing results in a determination that such an alternate version exists, aborting fetching of the instruction and retrieving and executing the alternate version.
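
The hash-then-redirect flow in this abstract can be illustrated with a small software model. This is only a sketch under stated assumptions, not the patented hardware: the hash function, the bucket count, and the names `TranslationAddressCache`, `install`, `lookup`, and `fetch` are all hypothetical, and the model checks the hash before reading the instruction cache, whereas the abstract describes aborting a fetch that is already under way.

```python
def hash_address(address, num_buckets):
    # Hypothetical hash: drop the low two bits, then fold into the bucket range.
    return (address >> 2) % num_buckets


class TranslationAddressCache:
    """Maps instruction addresses to alternate versions (illustrative only)."""

    def __init__(self, num_buckets=64):
        self.num_buckets = num_buckets
        self.buckets = [dict() for _ in range(num_buckets)]

    def install(self, address, alternate):
        # Record that an alternate version exists for this address.
        self.buckets[hash_address(address, self.num_buckets)][address] = alternate

    def lookup(self, address):
        # Return the alternate version, or None if no entry matches.
        return self.buckets[hash_address(address, self.num_buckets)].get(address)


def fetch(address, icache, tac):
    """Return ("alternate", code) if an alternate exists, else ("original", code)."""
    alternate = tac.lookup(address)
    if alternate is not None:
        # Models "aborting fetching of the instruction" and using the alternate.
        return ("alternate", alternate)
    return ("original", icache[address])
```

For example, after `tac.install(0x1000, "native translation")`, a call to `fetch(0x1000, icache, tac)` returns the alternate version, while any address without an installed entry falls through to the instruction cache contents.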

Type: Application

Filed: March 13, 2012

Publication date: September 19, 2013

Applicant: NVIDIA CORPORATION

Inventors: Ross Segelken, Alex Klaiber, Nathan Tuck, David Dunn

INSTRUCTION CACHE POWER REDUCTION

Publication number: 20130179640

Abstract: In one embodiment, a method for controlling an instruction cache including a least-recently-used bits array, a tag array, and a data array, includes looking up, in the least-recently-used bits array, least-recently-used bits for each of a plurality of cacheline sets in the instruction cache, determining a most-recently-used way in a designated cacheline set of the plurality of cacheline sets based on the least-recently-used bits for the designated cacheline, looking up, in the tag array, tags for one or more ways in the designated cacheline set, looking up, in the data array, data stored in the most-recently-used way in the designated cacheline set, and if there is a cache hit in the most-recently-used way, retrieving the data stored in the most-recently-used way from the data array.
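
The lookup sequence in this abstract (read the least-recently-used bits, derive the most-recently-used way, check tags, read data only from that way) can be sketched in software. The model below is a rough illustration under assumptions that are not in the abstract: a two-way set with a single LRU bit, and the hypothetical names `CacheSet`, `mru_way`, and `fetch`. The abstract does not say what happens when the most-recently-used way misses, so the sketch simply returns `None` in that case.

```python
from dataclasses import dataclass, field

NUM_WAYS = 2  # assumption: two ways per set, so one LRU bit identifies the LRU way


@dataclass
class CacheSet:
    lru_way: int = 0  # the least-recently-used bit for this set
    tags: list = field(default_factory=lambda: [None] * NUM_WAYS)
    data: list = field(default_factory=lambda: [None] * NUM_WAYS)


def mru_way(cache_set):
    # With two ways, the most-recently-used way is the one that is not LRU.
    return 1 - cache_set.lru_way


def fetch(cache_set, tag):
    """Read the data array for the MRU way only; return its data on a hit."""
    way = mru_way(cache_set)
    tag_hits = [stored == tag for stored in cache_set.tags]  # tag array lookup
    mru_data = cache_set.data[way]  # data array read restricted to the MRU way
    if tag_hits[way]:
        return mru_data
    # Assumption: the non-MRU path is outside what the abstract describes.
    return None
```

The point of the sketch is that only one way of the data array is read per lookup, which is where a power saving would come from if the most-recently-used way hits often.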

Type: Application

Filed: January 9, 2012

Publication date: July 11, 2013

Applicant: NVIDIA CORPORATION

Inventors: Aneesh Aggarwal, Ross Segelken, Kevin Koschoreck

Method and apparatus to execute an instruction with a semi-fast operation in a staggered ALU

Publication number: 20060206693

Abstract: A method for executing an instruction with a semi-fast operation in a staggered ALU. The method of one embodiment comprises generating a first operation and a second operation from a micro-instruction. The first and second operations are scheduled for execution in a staggered arithmetic logic unit (ALU). The first and second operations are separated by N clock cycles. Data from the first operation is communicated to the second operation for use with execution of the second operation.

Type: Application

Filed: May 16, 2006

Publication date: September 14, 2006

Inventor: Ross Segelken

Method and apparatus to execute an instruction with a semi-fast operation in a staggered ALU

Patent number: 7047397

Abstract: A method for executing an instruction with a semi-fast operation in a staggered ALU. The method of one embodiment comprises generating a first operation and a second operation from a micro-instruction. The first and second operations are scheduled for execution in a staggered arithmetic logic unit (ALU). The first and second operations are separated by N clock cycles. Data from the first operation is communicated to the second operation for use with execution of the second operation.

Type: Grant

Filed: September 13, 2002

Date of Patent: May 16, 2006

Assignee: Intel Corporation

Inventor: Ross A. Segelken

Method and apparatus for variable length instruction parallel decoding

Publication number: 20040128479

Abstract: A method and an apparatus for decoding a variable length instruction. The method includes selecting with a first pointer one of a plurality of permutations, each permutation representing a possible location of the instruction in a portion of the datastream, calculating a possible length of the instruction for each byte in the selected permutation, and selecting the length of the instruction from one of the calculated possible lengths in the selected permutation. An example of an application includes decoding X86 instruction formats.

Type: Application

Filed: December 31, 2002

Publication date: July 1, 2004

Inventors: Venkateswara Rao Madduri, Ross A. Segelken, Bret Leslie Toll

Apparatus and method for address calculation

Patent number: 6735682

Abstract: A dual-cycle address generation unit is described to generate linear addresses. The dual-cycle address generation unit includes a first adder to add a product of an index and a scaling factor to an offset and a segment base during a first clock cycle and a second adder to add output of the first adder with a base during a second clock cycle.
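
The two-step addition in this abstract can be written out as simple arithmetic. The following is a minimal sketch, assuming 32-bit linear addresses that wrap on overflow; the function names and the `MASK32` constant are hypothetical and chosen only for illustration.

```python
MASK32 = 0xFFFFFFFF  # assumption: 32-bit linear addresses that wrap on overflow


def first_cycle(index, scale, offset, segment_base):
    # First adder: (index * scaling factor) + offset + segment base.
    return (index * scale + offset + segment_base) & MASK32


def second_cycle(partial, base):
    # Second adder: output of the first adder + base.
    return (partial + base) & MASK32


def linear_address(base, index, scale, offset, segment_base):
    partial = first_cycle(index, scale, offset, segment_base)  # first clock cycle
    return second_cycle(partial, base)                         # second clock cycle
```

With `base=0x1000`, `index=4`, `scale=8`, `offset=0x10`, and `segment_base=0x20000`, the first step yields `0x20030` and the second yields `0x21030`.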

Type: Grant

Filed: March 28, 2002

Date of Patent: May 11, 2004

Assignee: Intel Corporation

Inventors: Ross A. Segelken, Feng Chen, David J. Sager

Method and apparatus to execute an instruction with a semi-fast operation in a staggered ALU

Publication number: 20040054875

Abstract: A method for executing an instruction with a semi-fast operation in a staggered ALU. The method of one embodiment comprises generating a first operation and a second operation from a micro-instruction. The first and second operations are scheduled for execution in a staggered arithmetic logic unit (ALU). The first and second operations are separated by N clock cycles. Data from the first operation is communicated to the second operation for use with execution of the second operation.

Type: Application

Filed: September 13, 2002

Publication date: March 18, 2004

Inventor: Ross A. Segelken

Apparatus and method for address calculation

Publication number: 20030188125

Abstract: A dual-cycle address generation unit is described to generate linear addresses. The dual-cycle address generation unit includes a first adder to add a product of an index and a scaling factor to an offset and a segment base during a first clock cycle and a second adder to add output of the first adder with a base during a second clock cycle.

Type: Application

Filed: March 28, 2002

Publication date: October 2, 2003

Inventors: Ross A. Segelken, Feng Chen, David J. Sager
