Loop Execution Patents (Class 712/241)

Circuit and method for loop control

Publication number: 20070186084

Abstract: A loop control circuit of the present invention includes a program counter for sequentially indicating an address of an instruction, a LSA calculation circuit for calculating a loop start address of a loop start instruction, a LEA calculation circuit for calculating a loop end address of a loop end instruction, an interlock generation circuit for generating an interlock until a pipeline of a loop instruction is completed so as to suspend a pipeline process of the loop end instruction, and a loop end evaluation circuit for setting the program counter to the loop start address according to a result of a comparison between the program counter and the loop end address after the pipeline process of the loop instruction is completed.

Type: Application

Filed: January 31, 2007

Publication date: August 9, 2007

Inventor: Satoshi Chiba
Method, apparatus, and system for variable increment multi-index looping operations

Patent number: 7249248

Abstract: According to one embodiment of the invention, an apparatus is provided which includes a first register to hold an initial value of a first index associated with a looping instruction to be executed for a number of iterations, a second register to hold an initial value of a second index associated with the respective looping instruction, and a third register to hold data indicating non-linear variation pattern associated with the second index. For each iteration, actual increment of the first index and actual increment of the second index are set based on a target increment and the data indicating the non-linear variation pattern associated with the second index.

Type: Grant

Filed: November 25, 2002

Date of Patent: July 24, 2007

Assignee: Intel Corporation

Inventors: Bapiraiu Vinnnakota, Saleem Mohammadali, Carl A. Alberola
Method and apparatus for interrupt handling in coarse grained array

Publication number: 20070162729

Abstract: A processor including a coarse grained array including a plurality of function units and a plurality of register files, wherein a loop to be executed by the coarse grained array is split into a plurality of sub-loops, and when an interrupt request occurs while executing the sub-loop in the coarse grained array, the interrupt request is processed after the executing of the sub-loop is completed.

Type: Application

Filed: September 13, 2006

Publication date: July 12, 2007

Inventors: Soo Jung Ryu, Jeong Wook Kim, Dong-Hoon Yoo, Hee Seok Kim
Loop accelerator and data processing system having the same

Publication number: 20070157009

Abstract: Provided are a loop accelerator and a data processing system having the loop accelerator. The data processing system includes a loop accelerator which executes a loop part of a program, a processor core which processes a remaining part of the program except the loop part, and a central register file which transmits data between the processor core and the loop accelerator. The loop accelerator includes a plurality of processing elements (PEs) each of which performs an operation on each word to execute the program, a configuration memory which stores configuration bits indicating operations, states, etc. of the PEs, and a plurality of context memories, installed in a column or row direction of the PEs, which transmit the configuration bits along a direction toward which the PEs are arrayed. Thus, a connection structure between the configuration memory and the PEs can be simplified to easily modify a structure of the loop accelerator so as to extend the loop accelerator.

Type: Application

Filed: September 5, 2006

Publication date: July 5, 2007

Inventors: Soo-jung Ryu, Jeong-wook Kim, Suk-jin Kim, Hong-seok Kim, Jun-jin Kong
Apparatus and method for optimizing loop buffer in reconfigurable processor

Publication number: 20070150710

Abstract: A reconfigurable processor comprising a configuration memory for storing a configuration bit for at least one loop configuration; a valid information memory for storing bit information indicating whether an operation in a loop is a delay operation; and at least one processing unit for determining whether an operation in a next cycle is the delay operation by referring to the bit information transmitted from the valid information memory, and selectively performing a change and an implementation of a configuration according to the configuration bit from the configuration memory based on the determined results.

Type: Application

Filed: September 25, 2006

Publication date: June 28, 2007

Inventors: Soo Jung Ryu, Jeong Wook Kim, Suk Jin Kim, Hong-Seok Kim
Debugging aid parallel execution of a plurality of iterations with source lists display corresponding to each iteration

Patent number: 7237229

Abstract: This invention makes debugging more efficient when an object program is intended for processing a loop made up of n groups of iteration-forming instructions. Instructions in the secondary assembler program each has a combination of a line number “;lx” and an iteration identifier “;ix” attached thereto. The line number “;lx” specifies a source code from which the instruction is generated, and the iteration identifier “;ix” specifies an iteration to which the instruction belongs. When the user sets a breakpoint at an instruction, displayed in the windows are (a) a source code for generating the instruction at the breakpoint and (b) another source code for generating another instruction that belongs to a different group of iteration-forming instructions than the breakpoint instruction.

Type: Grant

Filed: November 4, 2002

Date of Patent: June 26, 2007

Assignee: Matsushita Electric Industrial Co., Ltd.

Inventors: Hajime Ogawa, Kiyohiko Sumida, Shuichi Takayama, Katsuhiro Okuno, Taketo Heishi
Pipelined processor method and circuit with interleaving of iterative operations

Patent number: 7206927

Abstract: A method of executing an instruction stream in a pipelined execution unit of depth, p, comprises loading the instruction stream; detecting an iteration of an instruction in the loaded instruction stream; interleaving p steams of instances of the instruction in the pipeline; detecting an end of the iteration; and combining results obtained from the p streams after all programmed iterations have completed. A computational circuit comprises a register which can hold a value representing both an operand and result of an iterative operation; a multiplexer having a first input connected to receive the operand from the register, a second input connected to a source of an identify value for the iterative operation, and an output; and an operator circuit having an input connected to receive a value from the multiplexer output, and an output connected to return thee result to the register.

Type: Grant

Filed: November 19, 2002

Date of Patent: April 17, 2007

Assignee: Analog Devices, Inc.

Inventor: Abhijit Giri
Processor and pipeline reconfiguration control method

Patent number: 7194610

Abstract: A reconfigurable processor calculates execution times of configuration for executing pipeline processing from hardware configuration information, and fixes a clock cycle until processing ends. A counter compares the fixed clock cycle with the actual number of elapsed clocks, and, when the number of elapsed clocks equals the clock cycle, it is determined that pipeline processing has ended, and a configuration controller is notified of this.

Type: Grant

Filed: February 23, 2005

Date of Patent: March 20, 2007

Assignee: Fujitsu Limited

Inventors: Shiro Uriu, Mitsuharu Wakayoshi, Tetsuo Kawano, Hiroshi Furukawa, Ichiro Kasama, Kazuaki Imafuku, Toshiaki Suzuki
Repeat function for processing of repetitive instruction streams

Patent number: 7178013

Abstract: A REPEAT instruction for repeated execution of an associated instruction (INSTR). Once a program counter stores the address for the instruction to be repeated, it remains unchanged until the associated instruction (INSTR) has been executed the number of times indicated by a COUNT value in a preloaded register, or alternatively, by the REPEAT instruction itself. In this manner, the present invention reduces the number of instruction fetches required to repeatedly execute the associated instruction (INSTR). Consequently, there is a significant improvement in the efficiency of the program code execution.

Type: Grant

Filed: June 30, 2000

Date of Patent: February 13, 2007

Assignee: Cisco Technology, Inc.

Inventor: Kenneth W. Batcher
Run-time parallelization of loops in computer programs by access patterns

Patent number: 7171544

Abstract: Parallelization of loops is performed for loops having indirect loop index variables and embedded conditional statements in the loop body. Loops having any finite number of array variables in the loop body, and any finite number of indirect loop index variables can be parallelized. There are two particular limitations of the described techniques: (i) that there are no cross-iteration dependencies in the loop other than through the indirect loop index variables; and (ii) that the loop index variables (either direct or indirect) are not redefined in the loop body.

Type: Grant

Filed: December 15, 2003

Date of Patent: January 30, 2007

Assignee: International Business Machines Corporation

Inventor: Rajendra K. Bera
Thread switch upon spin loop detection by threshold count of spin lock reading load instruction

Patent number: 7165254

Abstract: The present invention relates to a processor system. The processor system is made up of a multithread control unit for selectively making switching among said threads to be executed in an arithmetic unit, a loop predicting unit for predicting a loop of an instruction string on the basis of a processing history of a branch instruction in the thread, and a loop detecting unit for, when the loop predicting unit predicts the loop, detecting the loop on the basis of an instruction. When the loop detecting unit detects the loop, the multithread control unit making the switching from the thread, which is in execution in the arithmetic unit, to a different thread. This prevents a wait condition stemming from the loop from interfering with the execution of other threads without retouching software.

Type: Grant

Filed: November 18, 2004

Date of Patent: January 16, 2007

Assignee: Fujitsu Limited

Inventor: Toshio Yoshida
Methods and apparatus for multi-processing execution of computer instructions

Patent number: 7162620

Abstract: A multi-processing computer architecture and a method of operating the same are provided. The multi-processing architecture provides a main processor and multiple sub-processors cascaded together to efficiently execute loop operations. The main processor executes operations outside of a loop and controls the loop. The multiple sub-processors are operably interconnected, and are each assigned by the main processor to a given loop iteration. Each sub-processor is operable to receive one or more sub-instructions sequentially, operate on each sub-instruction and propagate the sub-instruction to a subsequent sub-processor.

Type: Grant

Filed: July 24, 2002

Date of Patent: January 9, 2007

Assignee: Sony Computer Entertainment Inc.

Inventor: Hidetaka Magoshi
Zero-overhead loop operation in microprocessor having instruction buffer

Patent number: 7159103

Abstract: A loop instruction, at least one target instruction, and an associated trigger address are cached during loop entry. During each loop iteration, the processor predicts whether the loop will be taken or not-taken in a subsequent iteration. When pre-fetch of the cached loop instruction is subsequently detected (i.e., by comparing the trigger address with the current program counter value), the loop taken/not-taken prediction is used to fetch either loop body instructions (when predicted taken) or fall-through instructions (when predicted not-taken). The cached loop instruction is then executed and the loop taken/not-taken prediction is verified using a dedicated loop execution circuit while a penultimate loop body instruction is executed in the processor execution stage (pipeline). When a previous loop taken prediction is verified, the cached target instruction is executed, and then the fetched loop body instructions are executed.

Type: Grant

Filed: March 24, 2003

Date of Patent: January 2, 2007

Assignee: Infineon Technologies AG

Inventors: Sagheer Ahmad, Matthias Knoth, Roger D. Arnold
Parallel computation processor, parallel computation control method and program thereof

Patent number: 7136989

Abstract: A parallel computation processor being capable of high-speed loop operation. When instruction decoders decode the VLOOP instruction, which triggers loop operation, an instruction buffer starts storing normal instructions. The instruction buffer dispatches a VLIW instruction composed of n pieces of normal instructions to execution units each time n pieces of instructions are stored therein. The execution units concurrently execute the instructions. After all instructions comprised in a loop have been stored in the buffer and once dispatched as VLIW instructions to be executed, the loop is executed repeatedly.

Type: Grant

Filed: September 26, 2002

Date of Patent: November 14, 2006

Assignee: NEC Corporation

Inventor: Daiji Ishii
Method and apparatus for a stew-based loop predictor

Patent number: 7136992

Abstract: A method and apparatus for a loop predictor for predicting the end of a loop is disclosed. In one embodiment, the loop predictor may have a predict counter to hold a predict count representing the expected number of times that a predictor stew value will repeat during the execution of a given loop. The loop predictor may also have one or more running counters to hold a count of the times that the stew value has repeated during the execution of the present loop. When the counter values match the predictor may issue a prediction that the loop will end.

Type: Grant

Filed: December 17, 2003

Date of Patent: November 14, 2006

Assignee: Intel Corporation

Inventors: Subramaniam Maiyuran, Peter J. Smith, Stephan Jourdan
Method and apparatus for loop detection utilizing multiple loop counters and a branch promotion scheme

Patent number: 7130991

Abstract: A method and apparatus for loop detection for improved branch prediction accuracy. In one embodiment, the method may comprise executing a branch instruction, updating a plurality of event counts corresponding to the branch instruction in response to its executing, determining a loop behavior status corresponding to the branch instruction in response to the event count updating, and promoting the branch instruction to a loop branch prediction type in response to the determination of loop behavior status.

Type: Grant

Filed: October 9, 2003

Date of Patent: October 31, 2006

Assignee: Advanced Micro Devices, Inc.

Inventor: Michael G. Butler
Unrolling loops with partial hot traces

Patent number: 7120907

Abstract: Methods and apparatus are disclosed for improved loop unrolling by a compiler. A large class of loops exists for which effective loop unrolling has not previously been performed because they are too large to be completely unrolled, but which do not have a single hot trace that covers an entire loop iteration. The present invention recognizes such loops that have partial hot traces identified using profile data. A set of instructions which constitute a proper superset of the hot trace and a proper subset of the entire loop, and which forms a complete loop iteration is identified. This set of instructions can then be unrolled without unrolling the entire loop.

Type: Grant

Filed: August 28, 2003

Date of Patent: October 10, 2006

Assignee: International Business Machines Corporation

Inventors: Robert Ralph Roediger, William Jon Schmidt, Peter Jerome Steinmetz
Interprocedural dead store elimination

Patent number: 7100156

Abstract: A system for optimizing computer code generation by carrying out interprocedural dead store elimination. The system carries out a top down traversal of a call graph in an intermediate representation of the code being compiled. Live on exit (LOE) sets are defined for variables at call points for functions in the code being compiled. Bit vectors representing the LOE sets for call points for functions are stored in an LOE table indexed or hashed by call graph edges. For each function definition reached in the call graph traversal, a LOE set for the function itself is generated by taking the union of the LOE call point sets. The entries in the LOE table for the LOE call point sets are then removed. The LOE set for each function is used to determine if variables that are the subject of a store operation in a function may be subject to a dead store elimination optimization.

Type: Grant

Filed: September 27, 2001

Date of Patent: August 29, 2006

Assignee: International Business Machines Corporation

Inventor: Roch Georges Archambault
Method and apparatus for caching short program loops within an instruction FIFO

Patent number: 7093112

Abstract: A method and an apparatus for reading instruction codes and for forwarding the corresponding instructions to the instruction unit that are distinguished, in particular, by buffer-storing the instructions of program loops that do not exceed the storage capacity of the instruction FIFO, in full, in the instruction FIFO. During a “loop mode”, such program loops are then executed only from the instruction FIFO itself and the instructions do not need to be repeatedly reloaded from the memory. This loop mode is then maintained until the jump instruction at the end of the program loop finally refers to an instruction other than that at the start of the loop.

Type: Grant

Filed: September 17, 2001

Date of Patent: August 15, 2006

Assignee: Infineon Technologies AG

Inventors: Steffen Sonnekalb, Jürgen Birkhäuser
Efficient instruction prefetch mechanism employing selective validity of cached instructions for digital signal processor and method of operation thereof

Patent number: 7085916

Abstract: For use in a processor having an external memory interface, an instruction prefetch mechanism, a method of prefetching instructions and a digital signal processor incorporating the mechanism or the method. In one embodiment, the mechanism includes: (1) a branch predictor that predicts whether a branch is to be taken, (2) prefetch circuitry, coupled to the branch predictor, that prefetches instructions associated with the branch via the external memory interface if the branch is taken and prefetches sequential instructions via the external memory interface if the branch is not taken and (3) a loop recognizer, coupled to the prefetch circuitry, that determines whether a loop is present in fetched instructions and reinstates a validity of instructions in the loop and prevents the prefetch circuitry from prefetching instructions outside of the loop until the loop completes execution.

Type: Grant

Filed: October 26, 2001

Date of Patent: August 1, 2006

Assignee: LSI Logic Corporation

Inventor: Hung T. Nguyen
Loop control circuit and loop control method

Patent number: 7080239

Abstract: A loop control circuit and a loop control method that allow control on multiplexed loop operations to be executed with less overhead are provided. A loop control circuit comprises a means for address storage that stores in memory the address of a loop instruction at a second or subsequent stage in multiplexed loops or the address of the instruction immediately preceding the loop instruction when the loop instruction is executed for the first time, a means for loop instruction recurrence prediction that predicts a recurrence of the loop instruction at the second or subsequent stage by comparing the address of the loop instruction or the address of the instruction immediately preceding the loop instruction stored in memory with a value at a program counter and a means for loop instruction skipping that skips the loop instruction if it is predicted that the loop instruction is to occur next.

Type: Grant

Filed: October 30, 2002

Date of Patent: July 18, 2006

Assignee: Oki Electric Industry, Co., Ltd.

Inventor: Teruaki Uehara
Data processing apparatus

Patent number: 7080240

Abstract: A data processor apparatus having first and second instruction storages for holding a series of instructions to be executed reiteratively. The number of instructions is calculated based on the difference between a start address and an end address respectively stored in a start address register and an end address register. When the number of instructions constituting the series of instructions is less than the capacity of the second instruction storage, the series of instructions is read from the second instruction storage and executed. When the number of instructions is greater than the capacity of the second instruction storage, the series of instructions is read from the first instruction storage.

Type: Grant

Filed: November 25, 2002

Date of Patent: July 18, 2006

Assignee: Hitachi, Ltd.

Inventors: Mitsuru Hiraki, Atsushi Kiuchi, Kesami Hagiwara
Hardware loops and pipeline system using advanced generation of loop parameters

Patent number: 7065636

Abstract: In one embodiment, a programmable processor is adapted to support hardware loops. The processor may include hardware such as a first set of registers, a second set of registers, a first pipeline, and a second pipeline. Furthermore, the processor may include a control unit adapted to efficiently implement the hardware when performing a hardware loop.

Type: Grant

Filed: December 20, 2000

Date of Patent: June 20, 2006

Assignees: Intel Corporation, Analog Devices, Inc.

Inventors: Ryo Inoue, Ravi P. Singh, Charles P. Roth, Gregory A. Overkamp
Enabling tracing of a repeat instruction

Patent number: 7062684

Abstract: A method and system for enabling tracing of a repeat instruction are provided. A repeat instruction is executed within a processor. In response to detecting a repeat instruction flag set during a last execution of the repeat instruction, an interrupt is initiated within the processor. The processor enables reading a count of executions for the repeat instruction from a storage unit within the processor by a trace program or external hardware during the interrupt.

Type: Grant

Filed: December 19, 2002

Date of Patent: June 13, 2006

Assignee: International Business Machines Corporation

Inventors: Jimmie Earl DeWitt, Jr., Frank Eliot Levine, Robert John Urquhart
Method and system for scheduling software pipelined loops

Patent number: 7058938

Abstract: A method and a system for scheduling a software pipelined loop with indirect loads. The system may include a data structure in communication with a processor and a memory. The processor may determine a condition associated with a potential for saturation of the data structure. Accordingly, the processor may provide a number of instructions associated with the software pipelined loop from the memory to a queue of the data structure prior to processing of the instructions by the processor based on the condition associated with a potential for saturation of the data structure.

Type: Grant

Filed: March 5, 2003

Date of Patent: June 6, 2006

Assignee: Intel Corporation

Inventor: Lynd M. Stringer
Microprocessor apparatus and method for accelerating execution of repeat string instructions

Patent number: 7039793

Abstract: A method and apparatus are provided for processing repeat string instructions with increased efficiency in a processor pipeline. Rather than explicitly generating an initial count register setup micro instruction each time a repeat (REP) prefix in encountered, the processor includes a shadow ECX register operating in parallel with an architectural ECX count register. This enables the contents of the architectural ECX register, which are also stored in the shadow ECX register, to be immediately transferred to an internal count register from the shadow ECX register upon the first iteration of a repeat string micro code sequence.

Type: Grant

Filed: October 21, 2002

Date of Patent: May 2, 2006

Assignee: IP-First, LLC

Inventors: Gerard M. Col, G. Glenn Henry, Terry Parks
System and method for performing matrix operations

Patent number: 7028168

Abstract: A system for performing matrix operations utilizes a processor, memory, and a matrix operation manager. The processor has a memory cache. The memory is external to the processor and stores first and second matrices. The matrix operation manager is configured to mathematically combine the first matrix with the scond matrix utilizing a hoisted matrix algorithm for hoisting values of the first matrix, and the hoisted matrix algorithm has an outer loop and an inner loop that is performed to completion for each iteration of the outer loop. The matrix operation manager, for each iteration of the outer loop, is configured to load to the cache and to write to a contiguous portion of the memory, before performing the inner loop, values from the first matrix that are to be combined, via performance of the inner loop, with values from the second matrix.

Type: Grant

Filed: December 5, 2002

Date of Patent: April 11, 2006

Assignee: Hewlett-Packard Development Company, L.P.

Inventor: Kevin R. Wadleigh
Signal processor, prefetch instruction method and prefetch instruction program

Patent number: 7020749

Abstract: A signal processor including a processor having a cache memory and a process execution unit executing a process by use of information temporarily stored in the cache memory and an external memory provided external to the processor. In the signal processor, the process execution unit automatically returns to a start point of a loop-type data at an end of the loop-type data and sequentially reads out the loop-type data from the external memory to the cache memory.

Type: Grant

Filed: June 4, 2003

Date of Patent: March 28, 2006

Assignee: Kabushiki Kaisha Toshiba

Inventor: Tetsuhiko Azuma
Method and system for processing a loop of instructions

Patent number: 7020769

Abstract: An information handling system processes a loop of instructions. In response to detecting processing of a particular instruction during a pass through the loop, the system initiates a fetch of an initial instruction that is programmed at a start of the loop, and stores an identification of a different instruction that is programmed between the initial instruction and the particular instruction. According to the stored identification, in response to detecting processing of the different instruction during an additional pass through the loop, the system initiates an additional fetch of the initial instruction.

Type: Grant

Filed: September 30, 2003

Date of Patent: March 28, 2006

Assignee: StarCore, LLC

Inventor: Allen Bruce Goodrich
Data processor speeding up repeat processing by inhibiting remaining instructions after a break in a repeat block

Patent number: 7010677

Abstract: A comparator 172 compares the value held in an RPT_B register 171 with the address of the instruction which is held in an IA register 181 and is to be fetched next, and outputs coincidence information indicating whether these value coincide with each other. Based on the coincidence information, a control part 112 generates hardware-wise a control signal for switching an instruction processing sequence to the next instruction of a repeat block in the last repeat processing of the repeat block.

Type: Grant

Filed: August 6, 2001

Date of Patent: March 7, 2006

Assignee: Renesas Technology Corp.

Inventor: Masahito Matsuo
Last iteration loop branch prediction upon counter threshold and resolution upon counter one

Patent number: 7010676

Abstract: An embodiment of the invention is a processor for processing loop branch instructions. The processor includes an instruction unit for fetching and decoding instructions including at least one loop branch instruction. A branch prediction unit predicts target instructions to be fetched and decoded by the instruction unit in response to the loop branch instruction. An execution unit executes instructions from the instruction unit and maintains a counter indicating an iteration of a loop. The execution unit includes detection logic for detecting when the counter equals a threshold and notifies the branch prediction unit when the counter equals the threshold.

Type: Grant

Filed: May 12, 2003

Date of Patent: March 7, 2006

Assignee: International Business Machines Corporation

Inventors: Fadi Y. Busaba, Klaus J. Getzlaff, Christopher A. Krygowski, Timothy J. Slegel
Method for memory optimization in a digital signal processor

Patent number: 6990571

Abstract: According to one embodiment, a processing element is disclosed. The processing element includes an instruction buffer, a first most often (MO) buffer coupled to the instruction buffer and an execution unit coupled to the instruction buffer and the first MO buffer. The execution unit is adaptable to execute instructions stored within the first MO buffer based upon a first predetermined profile.

Type: Grant

Filed: April 25, 2001

Date of Patent: January 24, 2006

Assignee: Intel Corporation

Inventor: David K. Vavro
Processor with a computer repeat instruction

Patent number: 6990570

Abstract: A processing engine, such as a digital signal processor, includes an execution mechanism, a repeat count register and a repeat count index register. The execution mechanism is operable for a repeat instruction to initialize the repeat count index register with the content of the repeat count register, and to modify the content of the repeat count register. The repeat instruction comprises two parts, the first of which initializes the repeat count index register and initiates repeat of a subsequent instruction, and the second part of which modifies the content of the repeat count register.

Type: Grant

Filed: October 1, 1999

Date of Patent: January 24, 2006

Assignee: Texas Instruments Incorporated

Inventors: Yves Masse, Gilbert Laurenti, Alain Boyadjian
Method of an address trace cache storing loop control information to conserve trace cache area

Patent number: 6988190

Abstract: A branch prediction method using an address trace is described. An address trace corresponding to an executed instruction is stored itself with a decoded form. After appointing a start address and an end address of a repeated routine, current routine iteration count and total number of iterations are compared with each other, confirming the end of the routine and storing address information of the next routine. Therefore, access information of the repeated routine can be stored using a small amount of trace cache.

Type: Grant

Filed: November 15, 2000

Date of Patent: January 17, 2006

Assignee: Samsung Electronics, Co., Ltd.

Inventor: Sung-Bae Park
Method and apparatus for efficient code generation for modulo scheduled uncounted loops

Patent number: 6986131

Abstract: A method of efficient code generation for modulo scheduled uncounted loops includes: assigning a given stage predicate to each instruction in each stage, including assigning a given stage predicate to each instruction in each speculative stage; and using the stage predicate to conditionally enable or disable the execution of an instruction during the prologue and epilogue execution.

Type: Grant

Filed: June 18, 2002

Date of Patent: January 10, 2006

Assignee: Hewlett-Packard Development Company, L.P.

Inventors: Carol L. Thompson, Uma Srinivasan, Richard E. Hank, Dale Morris
Repeat block with zero cycle overhead nesting

Patent number: 6986028

Abstract: A digital system is provided with means and methods for executing an instruction type wherein context information that pertains to that type instruction is automatically saved and restored during execution of the instruction type.

Type: Grant

Filed: July 17, 2002

Date of Patent: January 10, 2006

Assignee: Texas Instruments Incorporated

Inventors: Peter N. Ehlig, Alexander Tessarolo
Repeat instruction with interrupt

Patent number: 6976158

Abstract: A processor for processing an interruptible repeat instruction is provided. The repeat instruction may include an immediate operand specifying a loop count value corresponding to the number of times that the loop is to be repeated. Alternatively, the repeat instruction may include an address of a register which holds the loop count value. The instruction immediately following the repeat instruction is the target instruction for repetition. The processing includes repeating execution of the target instruction according to the loop count value in a low processor cycle overhead manner. The processing may also include handling interrupts during repeat instruction processing in a low-overhead manner during the initial call of the interrupt service routine as well as upon returning from the interrupt service routine.

Type: Grant

Filed: June 1, 2001

Date of Patent: December 13, 2005

Assignee: Microchip Technology Incorporated

Inventors: Michael Catherwood, Joseph W. Triece
Microsequencer with nested loop counters

Patent number: 6968448

Abstract: A pattern generator includes an address generator, an address topology generator, and a data topology generator. The address generator is adapted to provide a first address having a plurality of address bits. The address topology generator includes a first plurality of programmable logic gates. Each programmable logic gate of the first plurality is coupled to receive at least a subset of the plurality of address bits. The first plurality of programmable logic gates generate a second address having a plurality of modified address bits. The data topology generator is adapted to receive at least a subset of the plurality of modified address bits and generate write data based on the subset of modified address bits. A method for generating a pattern includes generating a first address having a plurality of address bits. A second address having a plurality of modified address bits is generated. The second address is a programmable combination of subsets of the address bits.

Type: Grant

Filed: July 24, 2001

Date of Patent: November 22, 2005

Assignee: Micron Technology, Inc.

Inventors: Charles K. Snodgrass, Bruce A. Dickey
Multiple execution of instruction loops within a processor without accessing program memory

Patent number: 6959379

Abstract: A method of executing loops in a computer system is described. The computer system has a sequence of instructions held in program memory and a prefetch buffer which holds instructions fetched from the memory ready for supply to a decoder of the computer system. If the size of the loop to be executed is such that it can by holly contained within the prefetch buffer, this is detected and a lock is put on the prefetch buffer to retain the loop within it while the loop is executed a requisite number of times. This thus allows power to be saved and reduces the overhead on the memory access buffers. According to another aspect, loops can be “skipped” by holding a value of zero in the loop counter register.

Type: Grant

Filed: May 2, 2000

Date of Patent: October 25, 2005

Assignee: STMicroelectronics S.A.

Inventors: Laurent Wojcieszak, Andrew Cofler
Hardware supported software pipelined loop prologue optimization

Patent number: 6954927

Abstract: A method for optimizing a software pipelineable loop in a software code is provided. The loop comprises one or more pipelined stages and one or more loop operations. The method comprises evaluating an initiation interval time (IN) for a pipelined stage of the loop. A loop operation time latency (Tld) and a number of loop operations (Np) from the pipelined stages to peel based on IN and Tld is then determined. The loop operation is peeled Np times and copied before the loop in the software code. A vector of registers is allocated and the results of the peeled loop operations and a result of an original loop operation is assigned to the vector of registers. Memory addresses for the results of the peeled loop operations and original loop operation are also assigned.

Type: Grant

Filed: October 4, 2001

Date of Patent: October 11, 2005

Assignee: Elbrus International

Inventor: Alexander Y. Ostanevich
Loop instruction processing using loop buffer in a data processing device having a coprocessor

Patent number: 6950929

Abstract: A data processing device having a central processing unit for fetching instructions from a program memory, decoding the instructions and sending a signal to a coprocessor if a coprocessor-type instruction is decoded; a coprocessor for decoding the coprocessor-type instructions upon receipt of the signal; and a loop buffer for receiving from the program memory instructions within a loop and storing the instructions within the loop when the coprocessor decodes a loop operation from the coprocessor-type instructions, wherein the instructions within the loop are retrieved from the loop buffer for execution in a subsequent iteration of the loop, and wherein a disable signal is sent to the program memory for inhibiting access of the program memory while the instructions within the loop are retrieved from the loop buffer.

Type: Grant

Filed: May 24, 2001

Date of Patent: September 27, 2005

Assignee: Samsung Electronics Co., Ltd.

Inventors: Seung Jae Chung, Yong Chun Kim
Pop-compare micro instruction for repeat string operations

Patent number: 6931517

Abstract: A microprocessor apparatus is provided for performing a pop-compare operation. The microprocessor apparatus includes paired operation translation logic, load logic, and execution logic. The paired operation translation logic receives a macro instruction that prescribes the pop-compare operation, and generates a pop-compare micro instruction. The pop-compare micro instruction directs pipeline stages in a microprocessor to perform the pop-compare operation. The load logic is coupled to the paired operation translation logic. The load logic receives the pop-compare micro instruction, and retrieves a first operand from an address in memory, where the address is specified by contents of a register. The register is prescribed by the pop-compare micro instruction. The execution logic is coupled to the load logic. The execution logic receives the first operand, and compares the first operand to a second operand.

Type: Grant

Filed: October 22, 2002

Date of Patent: August 16, 2005

Assignee: IP-First, LLC

Inventors: Gerard M. Col, G. Glenn Henry, Terry Parks
Register adjustment based on adjustment values determined at multiple stages within a pipeline of a processor

Patent number: 6920547

Abstract: Register adjustment is performed based on adjustment values determined at multiple stages within a pipeline of a processor. In one embodiment, a programmable processor is adapted to include a speculative count register. The speculative count register may be loaded with data associated with an instruction before the instruction commits. However, if the instruction is terminated before it commits, the speculative count register may be adjusted. A set of counters may monitor the difference between the speculative count register and its architectural counterpart.

Type: Grant

Filed: December 20, 2000

Date of Patent: July 19, 2005

Assignees: Intel Corporation, Analog Devices, Inc.

Inventors: Charles P. Roth, Ravi P. Singh, Gregory A. Overkamp
Mechanism to avoid explicit prologs in software pipelined do-while loops

Patent number: 6912709

Abstract: The present invention provides a mechanism that facilitates speculative execution of instructions within software-pipelined loops. In accordance with one embodiment of the invention, a software-pipelined loop is initialized with a speculative instruction deactivated. At least one initiation interval of the software-pipelined loop is executed, and the speculative instruction is activated. Subsequent initiation intervals of the software-pipelined loop are then executed.

Type: Grant

Filed: December 29, 2000

Date of Patent: June 28, 2005

Assignee: Intel Corporation

Inventors: David A Helder, Kalyan Muthukumar
Hardware loops

Patent number: 6898693

Abstract: In one embodiment, a programmable processor is adapted to include loop hardware to increase processing speed without significantly increasing power consumption. During a first pass through a loop, a first subset of a sequence of instructions may be loaded into the loop hardware. Then, during subsequent passes through the loop the first subset may be issued from the loop hardware while a second subset is retrieved from a memory device. In this manner, the second subset may be issued with no additional penalty after the first subset has been issued.

Type: Grant

Filed: November 2, 2000

Date of Patent: May 24, 2005

Assignees: Intel Corporation, Analog Devices, Inc.

Inventors: Ravi P. Singh, Charles P. Roth, Gregory A. Overkamp
Data processing device

Patent number: 6865666

Abstract: A data processing device having a PC controlling part for executing an operation of branch which has a first register for holding a result of decoding in an instruction decode unit, a register for holding a description indicating an execution condition of the operation (a value of field for designating condition), and a register for holding the description indicating a time for executing the operation (an address value of PC), wherein the execution condition is started when a value held in the register is in agreement with a PC value in accordance with the description of the register; and if the condition is satisfied, the PC controlling part executes the operation based on a content held in the register, whereby it is possible to delay the time for judging the execution condition during this delay, to thereby increase a degree of freedom in scheduling instructions such that the branch instruction is positioned prior to the operation instruction for determining the execution condition in the program.

Type: Grant

Filed: September 3, 1998

Date of Patent: March 8, 2005

Assignee: Renesas Technology Corp.

Inventors: Toyohiko Yoshida, Hideyuki Fujii
Method of processing a repeat block efficiently in a processor wherein the repeat count is not decremented in a specific case to prevent error in execution

Patent number: 6845443

Abstract: The invention presents a method for processing data in a processor having a processor core for processing a command in a pipeline form. The method comprises the steps of setting a repeat count value; executing a repeat block command to set a repeat ending address and a repeat starting address; comparing the repeat ending address and a memory address of the second command to identify whether they are identical to each other; checking whether the first command is a command for nonlinearly changing an executing order of a program if the repeat ending address and the memory address of the second command are identical to each other; and storing the memory address of the second command while not reducing the repeat count value if the first command is a command for nonlinearly changing the executing order of the program.

Type: Grant

Filed: December 29, 2000

Date of Patent: January 18, 2005

Assignee: Hynix Semiconductor Inc

Inventor: Seong Ae Jin
Microprocessor with branch-decrement instruction that provides a target and conditionally modifies a test register if the register meets a condition

Patent number: 6834338

Abstract: A data processing system is provided with a digital signal processor which has an instruction for conditionally branching based on the contents of a specified test register. Each time a branch is taken, the register is decremented as a side effect of executing the branch instruction. In addition, a predicate register is specified by the instruction. A branch occurs only if both registers meet specified conditions.

Type: Grant

Filed: October 31, 2000

Date of Patent: December 21, 2004

Assignee: Texas Instruments Incorporated

Inventors: David Hoyle, Timothy D. Anderson
Data speculation within modulo scheduled loops

Patent number: 6832370

Abstract: Optimizing compiler performance by applying data speculation within modulo scheduled loops to achieve a higher degree of instruction-level parallelism. The compiler locates a schedule for specifying an order of execution of the instructions and allocates rotating registers for the instruction execution. Based upon the schedule and the register allocation, the compiler determines an initiation interval specifying a number of instruction issue cycles between initiation of successive iterations related to the scheduling of the instructions.

Type: Grant

Filed: May 9, 2000

Date of Patent: December 14, 2004

Assignee: Hewlett-Packard Development, L.P.

Inventors: Uma Srinivasan, Kevin Nomura, Dz-ching Ju
Branch target cache and method for efficiently obtaining target path instructions for tight program loops

Patent number: 6829702

Abstract: A processor that efficiently obtains target path instructions in the presence of tight program loops includes at least one execution unit for executing instructions and instruction sequencing logic that supplies instructions to the at least one execution unit for execution. The instruction sequencing logic includes an instruction fetch buffer and a branch prediction unit including a branch target cache. In response to prediction of a branch instruction as taken, the branch target cache causes multiple copies of a target instruction group to be loaded into the instruction fetch buffer under the assumption that the branch instruction is a member of the target instruction group. Thereafter, the branch target cache causes all but one of the multiple copies to be canceled from the instruction fetch buffer prior to dispatch if the branch instruction does not belong to the target instruction group.

Type: Grant

Filed: July 26, 2000

Date of Patent: December 7, 2004

Assignee: International Business Machines Corporation

Inventors: Thomas Leo Jeremiah, Charles Robert Moore

prev … 3 4 5 6 7 8 9 10 next