Commitment Control Or Register Bypass Patents (Class 712/218)

Structure for dynamic livelock resolution with variable delay memory access queue

Patent number: 8131980

Abstract: A design structure for resolving the occurrence of livelock at the interface between the processor core and memory subsystem controller. Livelock is resolved by introducing a livelock detection mechanism (which includes livelock detection utility or logic) within the processor to detect a livelock condition and dynamically change the duration of the delay stage(s) in order to alter the “harmonic” fixed-cycle loop behavior. The livelock detection logic (LDL) counts the number of flushes a particular instruction takes or the number of times an instruction re-issues without completing. The LDL then compares that number to a preset threshold number. Based on the result of the comparison, the LDL triggers the implementation of one of two different livelock resolution processes.

Type: Grant

Filed: June 3, 2008

Date of Patent: March 6, 2012

Assignee: International Business Machines Corporation

Inventors: Ronald Hall, Michael L. Karm, Alvan W. Ng, Todd A. Venton
Tracking load store ordering hazards

Patent number: 8112604

Abstract: A method and system for processing data. In one embodiment, the method includes receiving a plurality of stores into a store queue, where each store is a result from a processor, and where the plurality of stores are destined for at least one memory address. The method also includes marking a most recent store of the plurality of stores for each unique memory address, comparing a load request against the store queue, and identifying only the most recent store for each unique memory address for the purpose of handling load-hit-store ordering hazards.

Type: Grant

Filed: December 17, 2007

Date of Patent: February 7, 2012

Assignee: International Business Machines Corporation

Inventor: Eric F. Robinson
Microprocessor with fused store address/store data microinstruction

Patent number: 8090931

Abstract: A microprocessor includes an instruction translator that translates PUSHF, POP, and MOVSB x86 macroinstructions into multiple microinstructions that include a fused store microinstruction. For PUSHF, first and second microinstructions moves the x86 EFLAGS register into and mask off bits in a temporary register, and the fused store microinstruction stores it to a memory location. For POP, a first microinstruction loads a first memory location value into a temporary register and the fused store microinstruction stores it to the second memory location. For MOVSB, the first microinstruction loads a first memory location operand into a temporary register and the fused store microinstruction stores it to a second memory location. A reorder buffer receives the fused store microinstruction into exactly one entry.

Type: Grant

Filed: September 18, 2008

Date of Patent: January 3, 2012

Assignee: VIA Technologies, Inc.

Inventors: Gerard M. Col, G. Glenn Henry, Rodney E. Hooker, Terry Parks
MECHANISM FOR IRREVOCABLE TRANSACTIONS

Publication number: 20110320776

Abstract: A method and apparatus for designating and handling irrevocable transactions is herein described. In response to detecting an irrevocable event, such as an I/O operation, a user-defined irrevocable designation, and a dynamic failure profile, a transaction is designated as irrevocable. In response to designating a transaction as irrevocable, Single Owner Read Locks (SORLs) are acquired for previous and subsequent reads in the irrevocably designated transaction to ensure the transaction is able to complete without modification to locations read from, while permitting remote resources to load from those locations to continue execution.

Type: Application

Filed: September 13, 2011

Publication date: December 29, 2011

Inventors: Adam Welc, Bratin Saha, Ali-Reza Adl-Tabatabai
Pipelined processing

Patent number: 8082422

Abstract: The invention includes receiving a first instruction in an in-order execution processing pipeline; starting execution of the first instruction; determining a first set of internal operation bits indicating a prospective value of control bits upon complete execution of the first instruction; determining whether the first instruction is a committed instruction; receiving a second instruction in the in-order execution processing pipeline before execution of the first instruction completes; determining a second set of internal operation bits based on: a) the first set of internal operation bits if the first instruction is a committed instruction; or b) a set of internal operation bits of a last committed instruction if the first instruction is not a committed instruction; and starting execution of the second instruction in the in-order execution processing pipeline before execution of the first instruction completes using the second internal operation bits. Numerous other aspects are provided.

Type: Grant

Filed: September 2, 2004

Date of Patent: December 20, 2011

Assignee: International Business Machines Corporation

Inventor: Stephen J. Schwinn
Generating a flush vector from a first execution unit directly to every other execution unit of a plurality of execution units in order to block all register updates

Patent number: 8082423

Abstract: A method and apparatus are provided for detecting and handling an instruction flush in a microprocessor system. A flush mechanism is provided that is distributed across all of the execution units in a data processing system. The flush mechanism does not require a central collection point to re-distribute the flush signals to the execution units. Each unit generates a flush vector to all other execution units which is used to block register updates for the flushed instructions.

Type: Grant

Filed: August 11, 2005

Date of Patent: December 20, 2011

Assignee: International Business Machines Corporation

Inventors: Christopher Michael Abernathy, Kurt Alan Feiste, David Scott Ray, David Shippy, Albert James Van Norstrand, Jr.
Memory mapped register file

Patent number: 8078828

Abstract: A method and apparatus for operating a memory mapped register file. The method includes: receiving a source index input having a length of T?1 bits, the source index input identifying one of a plurality of unbanked registers; receiving a processor mode input to identify one of P processor modes, where P is greater than two; generating an encoded address having a length of T bits based on the source index input and the processor mode input; and identifying one of the plurality of unbanked registers associated with one of the P processor modes using the encoded address.

Type: Grant

Filed: January 31, 2011

Date of Patent: December 13, 2011

Assignee: Marvell International Ltd.

Inventors: Hong-Yi Chen, Henry Hin Kwong Fan
Device and method for processing instructions based on masked register group size information

Patent number: 8078845

Abstract: A method and a device for processing instructions based on register group size information includes a pipelined processor, an instruction memory unit and a register file, whereas the pipelined processor includes a write-back unit and an execution unit. The device is characterized by including a controller that is adapted to receive a first register group size information and a first register identification information that define a first group of source registers associated with a first instruction; and to determine an execution related operation of the first instruction in response to the first register group size information, the first register identification information, a second register group size information and a second register identification information. The second register group size information and the second register identification information define a second group of target registers associated with a second instruction.

Type: Grant

Filed: December 16, 2005

Date of Patent: December 13, 2011

Assignee: Freescale Semiconductor, Inc.

Inventors: Noam Sheffer, Shlomit Dorani, Evgeni Ginzburg
Using register rename maps to facilitate precise exception semantics

Patent number: 8078854

Abstract: One embodiment of the present invention provides a system that facilitates precise exception semantics. The system includes a processor that uses register rename maps to support out-of-order execution, where the register rename maps track mappings between native architectural registers and physical registers for a program executing on the processor. These register rename maps include: 1) a working rename map that maps architectural registers associated with a decoded instruction to corresponding physical registers; 2) a retire rename map that tracks and preserves a set of physical registers that are associated with retired instructions; and 3) a checkpoint rename map that stores a mapping between a set of architectural registers and a set of physical registers for a preceding checkpoint in the program. When the program signals an exception, the processor uses the checkpoint rename map to roll back program execution to the preceding checkpoint.

Type: Grant

Filed: December 12, 2008

Date of Patent: December 13, 2011

Assignee: Oracle America, Inc.

Inventors: Christopher A. Vick, Gregory M. Wright
Variable length pipeline processor architecture

Patent number: 8074056

Abstract: In one implementation, a pipeline processor is provided having a base architecture that includes one or more decoders operable to decode program instructions and generate one or more decoded instructions, and one or more execution units operable to execute the one or more decoded instructions. Each execution unit includes one or more execution pipeline stages. The pipeline processor architecture further includes one or more additional co-processor pipelines. The one or more decoders of the base architecture are operable to recognize one or more instructions to be processed by a given co-processor pipeline and pass the one or more recognized instructions to the given co-processor pipeline for decoding and execution.

Type: Grant

Filed: March 1, 2005

Date of Patent: December 6, 2011

Assignee: Marvell International Ltd.

Inventors: Hong-Yi Chen, Jensen Tjeng
PIPELINE PROCESSOR

Publication number: 20110276788

Abstract: A bypass circuit is provided in a pipeline processor. A pipeline register is provided between an instruction execution stage and a write-back stage. The pipeline register stores a data validity flag and a WRITE control flag to control writing data into a general purpose register unit. The data retained in the pipeline register is allowed to be written back into the general purpose register unit when the WRITE control flag indicates “valid”. The pipeline register continues to retain the retained data even after the writing of the retained data into the general purpose register unit. The first pipeline register supplies the retained data to the second stage through the bypass circuit at the time of executing a subsequent instruction having data dependency on a preceding instruction.

Type: Application

Filed: July 21, 2011

Publication date: November 10, 2011

Applicant: KABUSHIKI KAISHA TOSHIBA

Inventor: Jun TANABE
Trace based deallocation of entries in a versioning cache circuit

Patent number: 8051247

Abstract: A circuit for tracking memory operations with trace-based execution is disclosed. Each trace includes a sequence of operations that includes zero or more of the memory operations. The memory operations being executed form a set of active memory operations that have a predefined program order among them. At least some of the active memory operations access the memory in an execution order that is different from the program order. Checkpoint entries are associated with each trace. Each entry refers to a checkpoint location. Executing one of the active memory operations updates a checkpoint location. During the operation of the circuit, none of the operations of a given trace has any effect on the execution unit's architectural state prior to committing that trace. Each trace becomes eligible for commitment after all operations in the trace complete executing. After the trace is committed, all of the checkpoint entries associated with the trace are invalidated.

Type: Grant

Filed: February 13, 2008

Date of Patent: November 1, 2011

Assignee: Oracle America, Inc.

Inventors: John Gregory Favor, Paul G. Chan, Graham Ricketson Murphy, Joseph Byron Rowlands
Result path sharing between a plurality of execution units within a processor

Patent number: 8051275

Abstract: A processor 2 includes an execution cluster 10 having multiple execution units 14, 16, 18, 20. The execution units 14, 16, 18, 20 share result buses 22, 24. Issue circuitry 12 within the execution cluster 10 determines future availability of a result bus 22, 24 for an instruction to be issued (or recently issued) using a known cycle count for that instruction. The availability is tracked for each result bus using a mask register 32 storing a mask value within which each bit position indicates the availability or non-availability of that result bus at a particular processing cycle in the future. The mask value is left shifted each processing cycle.

Type: Grant

Filed: June 1, 2009

Date of Patent: November 1, 2011

Assignee: ARM Limited

Inventors: David James Williamson, Conrado Blasco Allué
Memory ordering queue/versioning cache circuit

Patent number: 8024522

Abstract: A processor includes a circuit for tracking memory operations with trace-based execution. Each trace includes a sequence of operations that includes zero or more of the memory operations. The memory operations being executed form a set of active memory operations that have a predefined program order among them. At least some of the active memory operations access the memory in an execution order that is different from the program order. During the operation of the circuit, none of the operations of a given trace has any effect on the execution unit's architectural state prior to committing that trace. Each trace becomes eligible for commitment after all operations in the trace complete executing. The circuit also includes a sub-circuit that holds memory operation ordering information corresponding to the active memory operations. The sub-circuit detects violations of ordering constraints.

Type: Grant

Filed: February 13, 2008

Date of Patent: September 20, 2011

Assignee: Oracle America, Inc.

Inventors: John Gregory Favor, Paul G. Chan, Graham Ricketson Murphy, Joseph Byron Rowlands
Setting a flag bit to defer event handling to a safe point in an instruction stream

Patent number: 8019983

Abstract: Methods and systems thereof for exception handling are described. An event to be handled is identified during execution of a code sequence. A bit is set to indicate that handling of the event is to be deferred. An exception corresponding to the event is generated if the bit is set.

Type: Grant

Filed: December 18, 2007

Date of Patent: September 13, 2011

Inventors: Guillermo J. Rozas, Alexander Klaiber
Pipeline processor with write control and validity flags for controlling write-back of execution result data stored in pipeline buffer register

Patent number: 8019974

Abstract: A bypass circuit is provided in a pipeline processor. A pipeline register is provided between an instruction execution stage and a write-back stage. The pipeline register stores a data validity flag and a WRITE control flag to control writing data into a general purpose register unit. The data retained in the pipeline register is allowed to be written back into the general purpose register unit when the WRITE control flag indicates “valid”. The pipeline register continues to retain the retained data even after the writing of the retained data into the general purpose register unit. The first pipeline register supplies the retained data to the second stage through the bypass circuit at the time of executing a subsequent instruction having data dependency on a preceding instruction.

Type: Grant

Filed: January 12, 2009

Date of Patent: September 13, 2011

Assignee: Kabushiki Kaisha Toshiba

Inventor: Jun Tanabe
Checking for a memory ordering violation after a speculative cache write

Patent number: 8019944

Abstract: An embodiment of the present invention includes a circuit for tracking memory operations with trace-based execution. Each trace includes a sequence of operations that includes zero or more of the memory operations. The memory operations being executed form a set of active memory operations that have a predefined program order among them and corresponding ordering constraints. At least some of the active memory operations access the memory in an execution order that is different from the program order. Checkpoint entries are associated with each trace. Violations of the ordering constraints may be signaled too late to prevent an update of the cached data associated with the memory operations. A sub-circuit detects this condition and invalidates the checkpoint locations indicated by the checkpoint entries associated with the trace experiencing the violation and all younger traces.

Type: Grant

Filed: February 13, 2008

Date of Patent: September 13, 2011

Assignee: Oracle America, Inc.

Inventors: John Gregory Favor, Paul G. Chan, Graham Ricketson Murphy, Joseph Byron Rowlands
Rolling back a speculative update of a non-modifiable cache line

Patent number: 8010745

Abstract: An embodiment of the present invention includes a circuit for tracking memory operations with trace-based execution. Each trace includes a sequence of operations that includes zero or more of the memory operations. The memory operations being executed form a set of active memory operations that have a predefined program order among them and corresponding ordering constraints. At least some of the active memory operations access the memory in an execution order that is different from the program order. Checkpoint entries are associated with each trace. When a memory operation attempts to update a cache line that may not be updated, the circuit attempts to upgrade the cache line. If this fails, a rollback request is generated that indicates the trace involved. The checkpoint locations associated with the indicated trace are overwritten along with those locations associated with all younger traces.

Type: Grant

Filed: February 13, 2008

Date of Patent: August 30, 2011

Assignee: Oracle America, Inc.

Inventors: John Gregory Favor, Paul G. Chan, Graham Ricketson Murphy, Joseph Byron Rowlands
Processing unit

Patent number: 8001362

Abstract: A processing unit includes a plurality of thread execution units each provided with a performance analysis circuit for measuring various types of events resulting from execution of instructions and a commit stack entry unit for controlling the completion of executed instructions and each executing a thread having a plurality of instructions, a commit scope register for storing instructions of completion candidates stored in each commit stack entry unit by execution by each thread execution unit and performing processing for completion of instructions included in the thread, and a thread selecting means for sending commit events of the instructions to a performance analysis circuit provided in each thread execution unit corresponding to the instructions when performing commit processing for instructions stored in the commit scope register.

Type: Grant

Filed: December 8, 2009

Date of Patent: August 16, 2011

Assignee: Fujitsu Limited

Inventors: Atsushi Fusejima, Takashi Suzuki, Toshio Yoshida, Yasunobu Akizuki
Multiport execution target delay queue FIFO array

Patent number: 7996655

Abstract: One embodiment provides a method of forwarding data in a processor. The method generally includes providing at least one cascaded delayed execution pipeline unit having at least a first pipeline and a second pipeline for executing first and second instructions in a common issue group, wherein the second pipeline executes the second instruction in a delayed manner relative to the execution of the first instruction in the first pipeline, storing results generated by an execution unit of the first pipeline in a first-in first-out (FIFO) storage target delay queue, determining if the target delay queue contains source data for executing the second instruction, and if the target delay queue contains source data for the second instruction, forwarding the source data for the second instruction from the target delay queue to an execution unit of the second pipeline.

Type: Grant

Filed: April 22, 2008

Date of Patent: August 9, 2011

Assignee: International Business Machines Corporation

Inventor: David A. Luick
Processor and method for synchronous load multiple fetching sequence and pipeline stage result tracking to facilitate early address generation interlock bypass

Patent number: 7987343

Abstract: A pipelined processor including an architecture for address generation interlocking, the processor including: an instruction grouping unit to detect a read-after-write dependency and to resolve instruction interdependency; an instruction dispatch unit (IDU) including address generation interlock (AGI) and operand fetching logic for dispatching an instruction to at least one of a load store unit and an execution unit; wherein the load store unit is configured with access to a data cache and to return fetched data to the execution unit; wherein the execution unit is configured to write data into a general purpose register bank; and wherein the architecture provides support for bypassing of results of a load multiple instruction for address generation while such instruction is executing in the execution unit before the general purpose register bank is written. A method and a computer system are also provided.

Type: Grant

Filed: March 19, 2008

Date of Patent: July 26, 2011

Assignee: International Business Machines Corporation

Inventors: Khary J. Alexander, Fadi Y. Busaba, Vimal M. Kapadia, Chung-Lung Kevin Shum
PROCESSING BYPASS DIRECTORY TRACKING SYSTEM AND METHOD

Publication number: 20110179256

Abstract: A processing bypass directory system and method are disclosed. In one embodiment, a bypass directory tracking process includes setting bits in a bypass directory when a corresponding architectural register is written. The bits are selectively cleared in the bypass directory each cycle. The configuration of the bits is utilized to determine which stage of a bypass path processing information is at.

Type: Application

Filed: March 28, 2011

Publication date: July 21, 2011

Inventors: Alexander Klaiber, Guillermo Rozas
Design structure for single hot forward interconnect scheme for delayed execution pipelines

Patent number: 7984272

Abstract: A design structure embodied in a machine readable storage medium for designing, manufacturing, and/or testing a design for forwarding data in a processor is provided. The design structure includes a processor. The processor includes at least one cascaded delayed execution pipeline unit having a first and second pipeline, wherein the second pipeline is configured to execute instructions in a common issue group in a delayed manner relative to the first pipeline, and circuitry. The circuitry is configured to determine if a first instruction being executed in the first pipeline modifies data in a data register which is accessed by a second instruction being executed in the second pipeline, and if the first instruction being executed in the first pipeline modifies data in the data register which is accessed by the second instruction being executed in the second pipeline, forward the modified data from the first pipeline to the second pipeline.

Type: Grant

Filed: March 21, 2008

Date of Patent: July 19, 2011

Assignee: International Business Machines Corporation

Inventor: David Arnold Luick
Apparatuses and programs for implementing a forwarding function

Patent number: 7975128

Abstract: The processor according to the present invention is a processor having a forwarding function and includes an attribute information holding unit (141) that holds attribute information regarding inhibition of writing to a register and a register write inhibition circuit (126) that holds, when forwarding is performed, the writing of the data forwarded according to attribute information. The attribute information holding unit (141) holds the attribute information by relating the attribute information to at least one register. Alternatively, the attribute information holding unit is a part of plural pipeline buffers and passes the attribute information along with the data to be forwarded, to a pipeline buffer in a subsequent stage.

Type: Grant

Filed: October 16, 2006

Date of Patent: July 5, 2011

Assignee: Panasonic Corporation

Inventors: Shin-ichiro Fukai, Makoto Kawamura
System and method for retiring approximately simultaneously a group of instructions in a superscalar microprocessor

Patent number: 7958337

Abstract: An apparatus and method for executing instructions having a program order. The apparatus comprising a temporary buffer, tag assignment logic, a plurality of functional units, a plurality of data paths, a register array, a retirement control block, and a superscalar instruction retirement unit. The temporary buffer includes a plurality of temporary buffer locations to store result data for executed instructions, wherein the temporary buffer locations are arranged in a plurality of groups of temporary buffer locations. The tag assignment logic is configured to concurrently assign a tag to each instruction in a first set of instructions, wherein the tags are assigned such that the respective tag assigned to each of the instructions in the first set of instructions identifies a different one of the temporary buffer locations in a first one of the groups of temporary buffer locations.

Type: Grant

Filed: February 26, 2009

Date of Patent: June 7, 2011

Assignee: Seiko Epson Corporation

Inventors: Johannes Wang, Sanjiv Garg, Trevor Deosaran
High-performance superscalar-based computer system with out-of order instruction execution and concurrent results distribution

Patent number: 7941635

Abstract: The high-performance, RISC core based microprocessor architecture includes an instruction fetch unit for fetching instruction sets from an instruction store and an execution unit that implements the concurrent execution of a plurality of instructions through a parallel array of functional units. The fetch unit generally maintains a predetermined number of instructions in an instruction buffer. The execution unit includes an instruction selection unit, coupled to the instruction buffer, for selecting instructions for execution, and a plurality of functional units for performing instruction specified functional operations. A unified instruction scheduler, within the instruction selection unit, initiates the processing of instructions through the functional units when instructions are determined to be available for execution and for which at least one of the functional units implementing a necessary computational function is available.

Type: Grant

Filed: December 19, 2006

Date of Patent: May 10, 2011

Assignee: Seiko-Epson Corporation

Inventors: Le Trong Nguyen, Derek J. Lentz, Yoshiyuki Miyayama, Sanjiv Garg, Yasuaki Hagiwara, Johannes Wang, Te-Li Lau, Sze-Shun Wang, Quang H. Trang
Processing bypass directory tracking system and method

Patent number: 7937566

Abstract: A processing bypass directory system and method are disclosed. In one embodiment, a bypass directory tracking process includes setting bits in a bypass directory when a corresponding architectural register is written. The configuration of the bits is utilized to determine which stage of a bypass path processing information is at.

Type: Grant

Filed: January 13, 2009

Date of Patent: May 3, 2011

Inventors: Alexander Klaiber, Guillermo Rozas
Method and system for data speculation on multicore systems

Patent number: 7937565

Abstract: The method and system for data speculation of multicore systems are disclosed. In one embodiment, a method includes dynamically determining whether a current speculative load instruction and an associated store instruction have same memory addresses in an application thread in compiled code running on a main core using a dynamic helper thread running on a idle core substantially before encountering the current speculative load instruction. The instruction sequence associated with the current speculative load instruction is then edited by the dynamic helper thread based on the outcome of the determination so that the current speculative load instruction becomes a non-speculative load instruction.

Type: Grant

Filed: February 21, 2008

Date of Patent: May 3, 2011

Assignee: Hewlett-Packard Development Company, L.P.

Inventors: Sandya Srivilliputtur Mannarswamy, Hariharan Sandanagobalane
System and method for retiring approximately simultaneously a group of instructions in a superscalar microprocessor

Patent number: 7934078

Abstract: An system and method for retiring instructions in a superscalar microprocessor which executes a program comprising a set of instructions having a predetermined program order, the retirement system for simultaneously retiring groups of instructions executed in or out of order by the microprocessor. The retirement system comprises a done block for monitoring the status of the instructions to determine which instruction or group of instructions have been executed, a retirement control block for determining whether each executed instruction is retirable, a temporary buffer for storing results of instructions executed out of program order, and a register array for storing retirable-instruction results.

Type: Grant

Filed: September 17, 2008

Date of Patent: April 26, 2011

Assignee: Seiko Epson Corporation

Inventors: Johannes Wang, Sanjiv Garg, Trevor Deosaran
Selectively powered retirement unit using a partitioned allocation array and a partitioned writeback array

Patent number: 7921280

Abstract: In one embodiment, the present invention includes a retirement unit to receive and retire executed instructions. The retirement unit may include a first array to receive information at allocation and a second array to receive information after execution. The retirement unit may further include logic to calculate an event associated with an executed instruction if information associated with the executed instruction is stored in an on-demand portion of at least one of arrays. Other embodiments are described and claimed.

Type: Grant

Filed: June 27, 2008

Date of Patent: April 5, 2011

Assignee: Intel Corporation

Inventors: Zeev Sperber, Rafi Marom, Ofer Levy
Operand and result forwarding between differently sized operands in a superscalar processor

Patent number: 7921279

Abstract: Result and operand forwarding is provided between differently sized operands in a superscalar processor by grouping a first set of instructions for operand forwarding, and grouping a second set of instructions for result forwarding, the first set of instructions comprising a first source instruction having a first operand and a first dependent instruction having a second operand, the first dependent instruction depending from the first source instruction; the second set of instructions comprising a second source instruction having a third operand and a second dependent instruction having a fourth operand, the second dependent instruction depending from the second source instruction, performing operand forwarding by forwarding the first operand, either whole or in part, as it is being read to the first dependent instruction prior to execution; performing result forwarding by forwarding a result of the second source instruction, either whole or in part, to the second dependent instruction, after execution; wher

Type: Grant

Filed: March 19, 2008

Date of Patent: April 5, 2011

Assignee: International Business Machines Corporation

Inventors: David S. Hutton, Fadi Y. Busaba, Bruce C. Giamei, Christopher A. Krygowski, Edward T. Malley, Jeffrey S. Plate, John G. Rell, Jr., Chung-Lung Kevin Shum, Timothy J. Slegel
Technique to enable store forwarding during long latency instruction execution

Patent number: 7900023

Abstract: A technique to allow independent loads to be satisfied during high-latency instruction processing. Embodiments of the invention relate to a technique in which a storage structure is used to hold store operations in program order while independent load instructions are satisfied during a time in which a high-latency instruction is being processed. After the high-latency instruction is processed, the store operations can be restored in program order without searching the storage structure.

Type: Grant

Filed: December 16, 2004

Date of Patent: March 1, 2011

Assignee: Intel Corporation

Inventors: Ravi Rajwar, Srikanth T. Srinivasan, Haitham Akkary, Amit Gandhi
Operand queue for use in a floating point unit to reduce read-after-write latency and method operation

Patent number: 7895418

Abstract: There is disclosed an operand queue for use in a floating point unit. The floating point unit comprises floating point processing units for executing floating point instructions that write operands to an external memory and for executing floating point instructions that read operands from the external memory. The floating point also comprises an operand queue for storing a plurality of operands associated with one or more operations being processed in the floating point unit. The operand queue stores a first operand being written to an external memory by a floating point write instruction executed by a first one of the plurality of floating point processing units and supplies the first operand to a floating point read instruction executed by a second one of the plurality of floating point processing units subsequent to the execution of the floating point write instruction.

Type: Grant

Filed: November 28, 2005

Date of Patent: February 22, 2011

Assignee: National Semiconductor Corporation

Inventor: Daniel W. Green
Memory mapped register file

Patent number: 7882332

Abstract: A register system for a data processing system includes an address encoder that generates an encoded address based on a processor mode identifier and a register identifier and memory comprising 2T?1 unbanked registers. The encoded address identifies one of the 2T?1 unbanked registers associated with one of the P processor modes. The encoded address comprises T bits. The register identifier identifies one of 2T?1 unbanked registers. The processor mode identifier identifies P processor modes, where T and P are integers greater than two.

Type: Grant

Filed: October 13, 2008

Date of Patent: February 1, 2011

Assignee: Marvell International Ltd.

Inventors: Hong-Yi Hubert Chen, Henry Hin Kwong Fan
Store queue architecture for a processor that supports speculative execution

Patent number: 7849290

Abstract: Embodiments of the present invention provide a system that buffers stores on a processor that supports speculative execution. The system starts by buffering a store into an entry in the store queue during a speculative execution mode. If an entry for the store does not already exist in the store queue, the system writes the store into an available entry in the store queue and updates a byte mask for the entry. Otherwise, if an entry for the store already exists in the store queue, the system merges the store into the existing entry in the store queue and updates the byte mask for the entry to include information about the newly merged store. The system then forwards the data from the store queue to subsequent dependent loads.

Type: Grant

Filed: July 9, 2007

Date of Patent: December 7, 2010

Assignee: Oracle America, Inc.

Inventors: Robert E. Cypher, Shailender Chaudhry
SINGLE CYCLE DATA MOVEMENT BETWEEN GENERAL PURPOSE AND FLOATING-POINT REGISTERS

Publication number: 20100306510

Abstract: Systems and methods for providing single cycle movement of data between a floating-point register file (FRF) and a general purpose or integer register file (RF) of a microprocessor system are provided. The system may include an integer execution unit operative to execute instructions with single cycle latency, a floating-point execution unit, a working register file (WRF), an FRF, and an IRF. To achieve the single cycle movement functionality, the integer execution unit may physically own the WRF, IRF, and FRF, and may monitor and control any dependencies between them. Thus, since the integer execution unit has direct read access to both the IRF and the FRF, data may be moved between the two register files using the single cycle operation of the integer execution unit, without the need to store and load the data from memory.

Type: Application

Filed: June 2, 2009

Publication date: December 2, 2010

Applicant: Sun Microsystems, Inc.

Inventors: Christopher Olson, Robert T. Golla, Jeffrey S. Brooks
System and method for handling load and/or store operations in a superscalar microprocessor

Patent number: 7844797

Abstract: The present invention provides a system and method for managing load and store operations necessary for reading from and writing to memory or I/O in a superscalar RISC architecture environment. To perform this task, a load store unit is provided whose main purpose is to make load requests out of order whenever possible to get the load data back for use by an instruction execution unit as quickly as possible. A load operation can only be performed out of order if there are no address collisions and no write pendings. An address collision occurs when a read is requested at a memory location where an older instruction will be writing. Write pending refers to the case where an older instruction requests a store operation, but the store address has not yet been calculated. The data cache unit returns 8 bytes of unaligned data. The load/store unit aligns this data properly before it is returned to the instruction execution unit.

Type: Grant

Filed: May 6, 2009

Date of Patent: November 30, 2010

Assignee: Seiko Epson Corporation

Inventors: Cheryl D. Senter, Johannes Wang
Method for renaming a large number of registers in a data processing system using a background channel

Patent number: 7844800

Abstract: A processor 2 utilising register renaming executes program instructions requiring a large number of architectural register specifiers to be renamed by dividing the renaming tasks into an initial set and a remaining set. The initial set are performed first and the results passed via a main channel 32 for further processing. The remaining set are performed in sequence with the results being passed via a background channel 34 for further processing. This technique is particularly useful for performing renaming operations for load/store multiple LDM instructions.

Type: Grant

Filed: August 21, 2007

Date of Patent: November 30, 2010

Assignee: ARM Limited

Inventors: Melanie Emanuelle Lucie Vincent, Florent Begon, Cedric Denis Robert Airaud, Norbert Bernard Eugene Lataille
MICROPROCESSOR WITH MICROINSTRUCTION-SPECIFIABLE NON-ARCHITECTURAL CONDITION CODE FLAG REGISTER

Publication number: 20100299504

Abstract: A microprocessor includes an architectural register and a non-architectural register, each having a plurality of condition code flags. A first instruction of the microarchitectural instruction set of the microprocessor instructs the microprocessor to update the plurality of condition code flags based on a result of the first instruction. The first instruction includes a field for indicating whether to update the plurality of condition code flags of the architectural or non-architectural register. A second instruction of the microarchitectural instruction set instructs the microprocessor to conditionally perform an operation based on one of the plurality of condition code flags. The second instruction includes a field for indicating whether to use the one of the plurality of condition code flags of the architectural or non-architectural register to determine whether to perform the operation.

Type: Application

Filed: May 20, 2009

Publication date: November 25, 2010

Applicant: VIA TECHNOLOGIES, INC.

Inventors: G. Glenn Henry, Terry Parks, Gerard M. Col
CABAC type encoding device and method

Patent number: 7834785

Abstract: An encoding device and method, of CABAC type, for an initial stream of binary digital information intended to generate an outgoing stream to form video images, after decoding, the method included the following steps: bit-by-bit analysis of the successive series of bits of the initial binary stream so as to deduce therefrom, for each bit, an interval representing the probability of occurrence associated with this bit, this interval being defined by its size CIR and its lower bound CIL, analysis of this interval so as to ensure, if necessary, a renormalization thereof. The renormalization is non-iterative and for each bit of the initial stream is compliant with the appended figure in which: M is the length of the sequence S of high-order bits common to CIL and CIR, N is the integer number such that CIR.2N-1<0.25?CIR.2N, BO is the number of bits waiting to be inserted.

Type: Grant

Filed: June 27, 2007

Date of Patent: November 16, 2010

Assignee: Assistance Technique et Etude de Materiels Electroniques - ATEME

Inventor: Tchi Southivong
Dynamic concurrent atomic execution

Patent number: 7836280

Abstract: Executing a set of one or more instructions atomically is disclosed. Executing includes determining whether speculatively executing the instructions is advised based at least in part on dynamic information associated with synchronization data and speculatively executing the instructions when it is determined that speculatively executing the instructions is advised.

Type: Grant

Filed: September 14, 2005

Date of Patent: November 16, 2010

Assignee: Azul Systems, Inc.

Inventors: Gil Tene, Ivan Posva, Michael A. Wolf, Daniel Dwight Grove, Tom Kraljevic
System and method of load-store forwarding

Patent number: 7822951

Abstract: A system and method for data forwarding from a store instruction to a load instruction during out-of-order execution, when the load instruction address matches against multiple older uncommitted store addresses or if the forwarding fails during the first pass due to any other reason. In a first pass, the youngest store instruction in program order of all store instructions older than a load instruction is found and an indication to the store buffer entry holding information of the youngest store instruction is recorded. In a second pass, the recorded indication is used to index the store buffer and the store bypass data is forwarded to the load instruction. Simultaneously, it is verified if no new store, younger than the previously identified store and older than the load has not been issued due to out-of-order execution.

Type: Grant

Filed: August 1, 2007

Date of Patent: October 26, 2010

Assignee: Advanced Micro Devices, Inc.

Inventors: Krishnan Ramani, Gary Lauterbach
Using a concurrent partial inspector loop with speculative parallelism

Patent number: 7823141

Abstract: A method for executing a loop in an application that includes executing iterations in a first segment of the loop by a base thread, logging memory transactions that occur during execution of iterations in the first segment by a co-inspector thread to obtain a co-inspector log, executing iterations in a second segment of the loop by a co-thread to obtain temporary results, logging memory transactions that occur during execution of iterations in the second segment to obtain a co-thread log, and comparing the co-inspector log and the co-thread log to determine whether a thread interdependency exists.

Type: Grant

Filed: September 30, 2005

Date of Patent: October 26, 2010

Assignee: Oracle America, Inc.

Inventors: Phyllis E. Gustafson, Michael H. Paleczny, Christopher A. Vick, Olaf Manczak, Jay R. Freeman, Yuguang Wu
Superscalar RISC instruction scheduling

Patent number: 7802074

Abstract: A register renaming system for out-of-order execution of a set of reduced instruction set computer instructions having addressable source and destination register fields, adapted for use in a computer having an instruction execution unit with a register file accessed by read address ports and for storing instruction operands. A data dependance check circuit is included for determining data dependencies between the instructions. A tag assignment circuit generates one or more tags to specify the location of operands, based on the data dependencies determined by the data dependance check circuit. A set of register file port multiplexers select the tags generated by the tag assignment circuit and pass the tags onto the read address ports of the register file for storing execution results.

Type: Grant

Filed: April 2, 2007

Date of Patent: September 21, 2010

Inventors: Sanjiv Garg, Kevin Ray Iadonato, Le Trong Nguyen, Johannes Wang
Implementing instruction set architectures with non-contiguous register file specifiers

Patent number: 7793081

Abstract: There are provided methods and computer program products for implementing instruction set architectures with non-contiguous register file specifiers. A method for processing instruction code includes processing a fixed-width instruction of a fixed-width instruction set using a non-contiguous register specifier of a non-contiguous register specification. The fixed-width instruction includes the non-contiguous register specifier.

Type: Grant

Filed: April 3, 2008

Date of Patent: September 7, 2010

Assignee: International Business Machines Corporation

Inventors: Michael Karl Gschwind, Robert Kevin Montoye, Brett Olsson, John-David Wellman
Method and system for performing reassociation in software loops

Patent number: 7774766

Abstract: Various embodiments of the present invention relate to methods and systems for optimizing an intermediate code in a compilation logic. The intermediate code is optimized by performing reassociation in software loops. The intermediate code includes at least one critical recurrence cycle. The performance of reassociation in software loops can reduce a critical recurrence cycle in them, which can speed up their execution. The subject method can include the determination of one or more critical recurrence cycles in a software loop. The method can also include the determination of at least one edge in a critical recurrence cycle, with respect to which reassociation can be performed, if one or more pre-determined criteria are met. The method can further include performing reassociation of a dependee and a dependent of an edge. In an embodiment, when one or more pre-determined criteria are met, the logic of the software loop is maintained after performing reassociation of the dependee and the dependent of the edge.

Type: Grant

Filed: September 29, 2005

Date of Patent: August 10, 2010

Assignee: Intel Corporation

Inventors: Kalyan Muthukumar, Daniel M Lavery
Result bypassing to override a data hazard within a superscalar processor

Patent number: 7774582

Abstract: A data processing system including multiple execution pipelines each having multiple execution stages E1, E2, E3 may have instructions issued together in parallel despite a data dependency therebetween if it is detected that the result operand value for the older instruction will be generated in an execution stage prior to an execution stage which requires that result operand value as an input operand value to the younger instruction and accordingly cross-forwarding of the operand value is possible between the execution pipelines to resolve the data dependency.

Type: Grant

Filed: May 26, 2005

Date of Patent: August 10, 2010

Assignee: ARM Limited

Inventors: David James Williamson, Glen Andrew Harris, Stephen John Hill
Processing bypass register file system and method

Patent number: 7774583

Abstract: A processing bypass register file system and method are disclosed. In one embodiment a processing bypass register file includes a rotating head pointer, and a plurality of write ports, storage cells and read ports. The write ports receive processing result information. The head pointer identifies which entries are written by the write ports. The plurality of cells store the processing result information. The read ports forward results to the processing data path, and to an architectural register file for retirement.

Type: Grant

Filed: September 29, 2006

Date of Patent: August 10, 2010

Inventors: Parag Gupta, Alexander Klaiber, James Van Zoeren
Register file

Publication number: 20100199072

Abstract: A register file comprising a plurality of register entries for storing data values for use in the execution of data processing instructions is provided, and comprises at least one write port and at least one read port, and circuitry responsive to a write request received at said at least one write port to update one of said plurality of register entries identified by an address specified by said write request with a data value specified by said write request. The register file also comprises further circuitry responsive to a received control signal to set at least a portion of a predetermined register entry to a predetermined value. In this way, certain register file updating instructions can be executed in parallel with other instructions without the need for additional full write-ports as would be required for typical dual-issue, thereby reducing area and routing complexity and cost compared with the use of an additional write-port due to the lower gate count required by the proposed further circuitry.

Type: Application

Filed: February 2, 2009

Publication date: August 5, 2010

Applicant: ARM LIMITED

Inventor: Simon John Craske
Single hot forward interconnect scheme for delayed execution pipelines

Patent number: 7769987

Abstract: A method and apparatus for forwarding data in a processor. The method includes providing at least one cascaded delayed execution pipeline unit having a first pipeline and a second pipeline, wherein the second pipeline executes instructions in a common issue group in a delayed manner relative to the first pipeline. The method further includes determining if a first instruction being executed in the first pipeline modifies data in a data register which is accessed by a second instruction being executed in the second pipeline. If the first instruction being executed in the first pipeline modifies data in the data register which is accessed by the second instruction being executed in the second pipeline, the modified data is forwarded from the first pipeline to the second pipeline.

Type: Grant

Filed: June 27, 2007

Date of Patent: August 3, 2010

Assignee: International Business Machines Corporation

Inventor: David Arnold Luick

prev 1 2 3 4 5 6 7 8 … next