Instruction Issuing Patents (Class 712/214)

Simultaneous issuance of multiple instructions (Class 712/215)

Selecting Fixed-Point Instructions to Issue on Load-Store Unit

Publication number: 20100250901

Abstract: Issue logic identifies a simple fixed point instruction, included in a unified payload, which is ready to issue. The simple fixed point instruction is a type of instruction that is executable by both a fixed point execution unit and a load-store execution unit. In turn, the issue logic determines that the unified payload does not include a load-store instruction that is ready to issue. As a result, the issue logic issues the simple fixed point instruction to the load-store execution unit in response to determining that the simple fixed point instruction is ready to issue and determining that the unified payload does not include a load-store instruction that is ready to issue.

Type: Application

Filed: March 24, 2009

Publication date: September 30, 2010

Applicant: International Business Machines Corporation

Inventors: Christopher Michael Abernathy, James Wilson Bishop, Mary Douglass Brown, William Elton Burky, Robert Allen Cordes, Hung Qui Le, Dung Quoc Nguyen, Todd Alan Venton
System and Method of Routing Instructions

Publication number: 20100250905

Abstract: Disclosed are a method and system for reducing complexity of routing of instructions from an instruction issue queue to appropriate execution pipelines in a superscalar processor. In one or more embodiments, an instruction steering unit of the superscalar processor receives ordered instructions. The steering unit determines that a first instruction and a subsequent second instruction of the ordered instructions are non-branching instructions, and the steering unit stores the first and second instructions in two non-branching instruction issue queue entries of a shadow queue. The steering unit determines whether or not a third instruction the ordered instructions is a branch instruction, where the third instruction is subsequent to the second instruction. If the third instruction is a branch instruction, the steering unit stores the third instruction in a branch entry of the shadow queue; otherwise, the steering unit stores a no operation instruction in the branch entry of the shadow queue.

Type: Application

Filed: March 27, 2009

Publication date: September 30, 2010

Applicant: INTERNATIONAL BUISNESS MACHINES CORPORATION

Inventors: Anthony J. Bybell, Kenichi Tsuchlya
Scalable multi-threaded sequencing/synchronizing processor architecture

Patent number: 7797514

Abstract: A high performance sequencer/synchronizer controls multiple concurrent data processors and dedicated coprocessors and their interaction with multiple shared memories. This sequencer/synchronizer controls multi-threading access to shared memory.

Type: Grant

Filed: November 16, 2006

Date of Patent: September 14, 2010

Assignee: Texas Instruments Incorporated

Inventors: Shinri Inamori, Deependra Talla
METHOD AND APPARATUS FOR IMPROVED POWER MANAGEMENT OF MICROPROCESSORS BY INSTRUCTION GROUPING

Publication number: 20100228955

Abstract: A method of power gating a microprocessor having an instruction scheduling unit for receiving issued instructions from an instruction decode unit; an execution unit coupled to receive and send signals from and to the instruction scheduling unit; and a state machine located within the execution unit, the method comprises: obtaining a number of instructions per cycle being issued to the instruction scheduling unit; determining, subsequent to obtaining the number of instructions per cycle, if the number of instruction per cycle being issued to the instruction scheduling unit is less than a threshold level, and then determining if at least two of the instructions being issued to the instruction scheduling unit are independent of each other only when the instructions per cycle is less than the threshold level; determining when at least two of the instructions being issued to the instruction scheduling unit are independent of each other; and power gating the microprocessor to gate off power to idle macros with a si

Type: Application

Filed: March 4, 2009

Publication date: September 9, 2010

Applicant: International Business Machines Corporation

Inventors: Tim Niggemeier, Harry Barowski, Maarten Boersma, Gunnar Spiess
Using performance data for instruction thread direction

Patent number: 7779235

Abstract: A method for dispatching instructions in the data processing system, having in memory for storing instructions and a plurality of central processing units, where each central processing unit includes a circuit to provide data indicating internal performance, the method having steps of receiving internal performance data signals from a pool of central processing units, selecting a central processing unit according to the received internal performance data and dispatching instructions from the memory to the selected central processing unit.

Type: Grant

Filed: February 6, 2007

Date of Patent: August 17, 2010

Assignee: International Business Machines Corporation

Inventors: Deepak K. Singh, Francois Ibrahim Atallah
OUT-OF-ORDER EXECUTION MICROPROCESSOR THAT SPECULATIVELY EXECUTES DEPENDENT MEMORY ACCESS INSTRUCTIONS BY PREDICTING NO VALUE CHANGE BY OLDER INSTRUCTIONS THAT LOAD A SEGMENT REGISTER

Publication number: 20100205406

Abstract: An out-of-order execution microprocessor executes an architectural segment register-loading instruction that instructs the microprocessor to load a new value into an architectural segment register of the microprocessor. A comparator compares the new value specified by the architectural segment register-loading instruction with a current contents of the architectural segment register. A control unit causes to be re-executed using the new value all instructions in the microprocessor that used the current architectural segment register contents as a source operand and that are newer in program order than the architectural segment register-loading instruction whenever the comparator indicates the new value does not equal the current contents.

Type: Application

Filed: February 11, 2009

Publication date: August 12, 2010

Applicant: VIA Technologies, Inc..

Inventors: Rodney E. Hooker, Gerard M. Col, Terry Parks
Dual-issuance of microprocessor instructions using dual dependency matrices

Patent number: 7769984

Abstract: A dual-issue instruction is decoded to determine a plurality of LSU dependencies needed by an LSU part of the dual-issue instruction and a plurality of non-LSU dependencies needed by a non-LSU part of the dual-issue instruction. During dispatch of the dual-issue instruction by the microprocessor, the dual dependency matrices are employed as follows: a Load-Store Unit (LSU) dependency matrix is written with the plurality of LSU dependencies and a non-LSU dependency matrix is written with the plurality of non-LSU dependencies; an LSU issue valid (LSU IV) indicator is set as valid to issue; an LSU portion of the dual-issue instruction is issued once the plurality of LSU dependencies of the dual issue instruction are satisfied; a non-LSU issue valid (non-LSU IV) indicator is set as valid to issue; and a non-LSU portion of the dual-issue instruction is issued once the plurality of non-LSU dependencies of the dual issue instruction are satisfied.

Type: Grant

Filed: September 11, 2008

Date of Patent: August 3, 2010

Assignee: International Business Machines Corporation

Inventors: Gregory W. Alexander, Brian D. Barrick, Lee E. Eisen, John W. Ward, III
Universal register rename mechanism for targets of different instruction types in a microprocessor

Patent number: 7765384

Abstract: A unified register rename mechanism for targets of different instruction types is provided in a microprocessor. The universal rename mechanism renames destinations of different instruction types using a single rename structure. Thus, an instruction that is updating a floating point register (FPR) can be renamed along with an instruction that is updating a general purpose register (GPR) or vector multimedia extensions (VMX) instructions register (VR) using the same rename structure because the number of architected states for GPR is the same as the number of architected states for FPR and VR. Each destination tag (DTAG) is assigned to one destination. A floating point instruction may be assigned to a DTAG, and then a fixed point instruction may be assigned to the next DTAG and so forth. With a universal rename mechanism, significant silicon and power can be saved by having only one rename structure for all instruction types.

Type: Grant

Filed: April 18, 2007

Date of Patent: July 27, 2010

Assignee: International Business Machines Corporation

Inventors: Hung Q. Le, Dung Q. Nguyen, Balaram Sinharoy
Hardware multithreading systems with state registers having thread profiling data

Patent number: 7765547

Abstract: According to some embodiments, a multithreaded microcontroller includes a thread control unit comprising thread control hardware (logic) configured to perform a number of multithreading system calls essentially in real time, e.g. in one or a few clock cycles. System calls can include mutex lock, wait condition, and signal instructions. The thread controller includes a number of thread state, mutex, and condition variable registers used for executing the multithreading system calls. Threads can transition between several states including free, run, ready and wait. The wait state includes interrupt, condition, mutex, I-cache, and memory substates. A thread state transition controller controls thread states, while a thread instructions execution unit executes multithreading system calls and manages thread priorities to avoid priority inversion. A thread scheduler schedules threads according to their priorities.

Type: Grant

Filed: November 24, 2004

Date of Patent: July 27, 2010

Assignee: Maxim Integrated Products, Inc.

Inventors: Sorin C. Cismas, Ilie Garbacea, Kristan J. Monsen
Method for allocating registers using simulated annealing controlled instruction scheduling

Patent number: 7761691

Abstract: A method for scheduling instructions for clustered digital signal processors comprising a plurality of clusters, each cluster including at least two functional units and a first register file having a first unit, a second unit and a single set of access ports shared by the functional units comprises steps of checking whether executing one instruction needs data to be read from the first unit and the second unit of the first register file, generating a copying instruction to transfer data from the first unit to the second unit of the first register file, checking whether there is a prior operation cycle available to perform the copying instruction, scheduling the copying instruction in the prior operation cycle, and scheduling the instruction after the copying instruction.

Type: Grant

Filed: October 27, 2005

Date of Patent: July 20, 2010

Assignee: National Tsing Hua University

Inventors: Chung-Lin Tang, Yung-Chia Lin, Jenq-Kuen Lee
Method, apparatus and computer program product for dynamically selecting compiled instructions

Patent number: 7761690

Abstract: A method, apparatus, and computer program product dynamically select compiled instructions for execution. Static instructions for execution on a first execution and dynamic instructions for execution on a second execution unit are received. The throughput performance of the static instructions and the dynamic instructions is evaluated based on current states of the execution units. The static instructions or the dynamic instructions are selected for execution at runtime on the first execution unit or the second execution unit, respectively, based on the throughput performance of the instructions.

Type: Grant

Filed: July 26, 2007

Date of Patent: July 20, 2010

Assignee: International Business Machines Corporation

Inventors: Deanna J. Chou, Jesse E. Craig, John Sargis, Jr., Daneyand J. Singley, Sebastian T. Ventrone
Multiple thread in-order issue in-order completion DSP and micro-controller

Patent number: 7761688

Abstract: An in-order issue in-order completion micro-controller comprises a pipeline core comprising in succession a fetch address stage, a program access stage, a decode stage, a first execution stage, a second execution stage, a memory access stage, and a write back stage. The various stages are provided a thread ID such that alternating stages use a first thread ID, and the other stages use a second thread ID. Each stage which requires access to thread ID specific context information uses the thread ID to specify this context information.

Type: Grant

Filed: September 6, 2007

Date of Patent: July 20, 2010

Assignee: Redpine Signals, Inc.

Inventor: Heonchul Park
Leaky-bucket thread scheduler in a multithreading microprocessor

Patent number: 7752627

Abstract: A leaky-bucket style thread scheduler for scheduling concurrent execution of multiple threads in a microprocessor is provided. The execution pipeline notifies the scheduler when it has completed instructions. The scheduler maintains a virtual water level for each thread and decreases it each time the execution pipeline executes an instruction of the thread. The scheduler includes an instruction execution rate for each thread. The scheduler increases the virtual water level based on the requested rate per a predetermined number of clock cycles. The scheduler includes virtual water pressure parameters that define a set of virtual water pressure ranges over the height of the virtual water bucket. When a thread's virtual water level moves from one virtual water pressure range to the next higher range, the scheduler increases the instruction issue priority for the thread; conversely, when the level moves down, the scheduler decreases the instruction issue priority for the thread.

Type: Grant

Filed: February 4, 2005

Date of Patent: July 6, 2010

Assignee: MIPS Technologies, Inc.

Inventors: Darren M. Jones, Ryan C. Kinter, Thomas A. Petersen, Sanjay Vishin
INFORMATION HANDLING SYSTEM INCLUDING A PROCESSOR WITH A BIFURCATED ISSUE QUEUE

Publication number: 20100161942

Abstract: An information handling system includes a processor with a bifurcated unified issue queue that may perform unified issue queue VSU store instruction dependency operations. The bifurcated unified issue queue BUIQ maintains VSU store instructions in the form of internal operations data. The BUIQ includes a unified issue queue UIQ 0 and a unified issue queue UIQ 1. The BUIQ may manage a particular VSU store instruction from one UIQ to determine data dependencies and employ the other UIQ to determine address dependencies of that particular VSU store instruction. The UIQs employ a dependency matrix including a dependency array. The dependency array data maintains both data and address dependency information. The particular VSU store instruction issues to execution units such as VSUs for data dependency information and load store units (LSUs) for address dependency information. A particular VSU store instruction may execute to provide data dependency information independent of address dependency information.

Type: Application

Filed: December 22, 2008

Publication date: June 24, 2010

Applicants: International Business Machines Corporation, IBM Corporation

Inventors: James Wilson Bishop, Mary Douglass Brown, William Elton Burky, Todd Alan Venton
Method and apparatus for increasing load bandwidth

Patent number: 7739483

Abstract: A method and apparatus for dual-target register allocation is described, intended to enable the efficient mapping/renaming of registers associated with instructions within a pipelined microprocessor architecture.

Type: Grant

Filed: September 28, 2001

Date of Patent: June 15, 2010

Assignee: Intel Corporation

Inventors: Rajesh Patel, James Dundas, Adi Yoaz
DATA PROCESSING SYSTEM AND DATA PROCESSING METHOD

Publication number: 20100131740

Abstract: The workload is heavy in the development of an application program that controls the task distribution in consideration of the variety of the execution environment. In a system where the processing is distributed to SPUs serving as plural processing entities so as to execute the computer program, the data processing is broken into plural units of processing by referring to the script code in which the content of the data processing is written, and the units of processing are assigned to the plural SPUs. Then, the whole computer program is executed when the SPUs execute the assigned process.

Type: Application

Filed: January 15, 2007

Publication date: May 27, 2010

Applicant: Sony Computer Entertainment Inc.

Inventors: Yasuhiko Yokote, Hirotoshi Maegawa, Noriyuki Murata
Speculative instruction issue in a simultaneously multithreaded processor

Patent number: 7725684

Abstract: A method for optimizing throughput in a microprocessor that is capable of processing multiple threads of instructions simultaneously. Instruction issue logic is provided between the input buffers and the pipeline of the microprocessor. The instruction issue logic speculatively issues instructions from a given thread based on the probability that the required operands will be available when the instruction reaches the stage in the pipeline where they are required. Issue of an instruction is blocked if the current pipeline conditions indicate that there is a significant probability that the instruction will need to stall in a shared resource to wait for operands. Once the probability that the instruction will stall is below a certain threshold, based on current pipeline conditions, the instruction is allowed to issue.

Type: Grant

Filed: April 17, 2008

Date of Patent: May 25, 2010

Assignee: International Business Machines Corporation

Inventors: Victor R. Augsburg, Jeffrey T. Bridges, Michael S. Mcilvaine, Thomas Andrew Sartorius, Rodney W. Smith
Apparatus and method for power optimized replay via selective recirculation of instructions

Patent number: 7725683

Abstract: A method and apparatus for power optimized replay. In one embodiment, the method includes the issuance of an instruction selected from a queue. Once issued, the instruction may be enqueued within a recirculation queue if completion of the instruction is blocked by a detected blocking condition. Once enqueued, instructions contained within the recirculation queue may be reissued once a blocking condition of an instruction within the recirculation queue is satisfied. Accordingly, a power optimized replay scheme as described herein optimizes power while retaining the advantages provided by selectively replaying of blocked instructions to improve power efficiency.

Type: Grant

Filed: September 25, 2003

Date of Patent: May 25, 2010

Assignee: Marvell International Ltd.

Inventors: Sujat Jamil, Hang Nguyen, Samantha J. Edirisooriya, David E. Miner, R. Frank O'Bleness, Steven J. Tu
System and method for propagating operand availability prediction bits with instructions through a pipeline in an out-of-order processor

Patent number: 7721071

Abstract: A processor core and a method for distributive scoreboard scheduling in an out-of-order processor pipeline are described herein. In an embodiment, control logic appends operand availability bits to each instruction. The appended operand availability bits form a distributive scoreboard for each instruction. The appended operand availability bits are propagated together with the instruction through multiple stages of the processor pipeline. An instruction dispatch buffer stores the instruction and the operand availability bits. A dispatch controller determines when an instruction is to be issued. The determination is based, at least in part, on the operand availability bits stored in the instruction dispatch buffer.

Type: Grant

Filed: February 28, 2006

Date of Patent: May 18, 2010

Assignee: MIPS Technologies, Inc.

Inventor: Xing Yu Jiang
High-performance, superscalar-based computer system with out-of-order instruction execution

Patent number: 7721070

Abstract: A high-performance, superscalar-based computer system with out-of-order instruction execution for enhanced resource utilization and performance throughput. The computer system fetches a plurality of fixed length instructions with a specified, sequential program order (in-order). The computer system includes an instruction execution unit including a register file, a plurality of functional units, and an instruction control unit for examining the instructions and scheduling the instructions for out-of-order execution by the functional units. The register file includes a set of temporary data registers that are utilized by the instruction execution control unit to receive data results generated by the functional units. The data results of each executed instruction are stored in the temporary data registers until all prior instructions have been executed, thereby retiring the executed instruction in-order.

Type: Grant

Filed: September 22, 2008

Date of Patent: May 18, 2010

Inventors: Le Trong Nguyen, Derek J. Lentz, Yoshiyuki Miyayama, Sanjiv Garg, Yasuaki Hagiwara, Johannes Wang, Te-Li Lau, Sze-Shun Wang, Quang H. Trang
ACROSS-THREAD OUT-OF-ORDER INSTRUCTION DISPATCH IN A MULTITHREADED MICROPROCESSOR

Publication number: 20100122067

Abstract: Instruction dispatch in a multithreaded microprocessor such as a graphics processor is not constrained by an order among the threads. Instructions for each thread are fetched, and a dispatch circuit determines which instructions in the buffer are ready to execute. The dispatch circuit may issue any ready instruction for execution, and an instruction from one thread may be issued prior to an instruction from another thread regardless of which instruction was fetched first. If multiple functional units are available, multiple instructions can be dispatched in parallel.

Type: Application

Filed: January 20, 2010

Publication date: May 13, 2010

Applicant: NVIDIA Corporation

Inventors: John Erik Lindholm, Brett Coon, Simon S. Moy
Reduced instruction set baseboard controller

Patent number: 7715792

Abstract: In wireless communications such as in the Bluetooth communication system, an execution unit sequentially receives software instructions for execution. Prior to completing each instruction, the execution unit issues an interrupt indicating the upcoming completion of the instruction execution and awaits receipt of the next instruction. A Link Manager issues limited instructions, and a Link Controller includes a hardware execution unit for executing the limited instructions. A processing unit in the Link Manager performs remaining functions under control of a software program.

Type: Grant

Filed: April 13, 2006

Date of Patent: May 11, 2010

Assignee: Broadcom Corporation

Inventor: Joakim Linde
Interfacing processors with external memory supporting burst mode

Patent number: 7716442

Abstract: Multiple data devices (A,B,C) are interfaced via a bus arbiter (S) with an external memory (F) so as to support burst-mode access by each device (A,B,C) one or more read registers (R1,R2,R3) are provided in the memory (F), and each register (R1,R2,R3) supports burst-mode access by a corresponding device (A,B,C). The arbiter (s) selects the register to be used following the initial access burst, according to the device requiring access. Thus, the memory (F) supports multiple burst-mode accesses in parallel.

Type: Grant

Filed: September 17, 2002

Date of Patent: May 11, 2010

Assignees: MStar Semiconductor, Inc., MStar Software R&D, Ltd., MStar France SAS, MStar Semiconductor, Inc.

Inventor: Eugene Pascal Herczog
Instruction processing apparatus

Publication number: 20100106945

Abstract: The present invention includes a decode section for simultaneously holding a plurality of instructions in one thread at a time and for decoding the held instructions; an execution pipeline capable of simultaneously executing each processing represented by the respective instructions belonging to different threads and decoded by the decode section; a reservation station for receiving the instructions decoded by the decode section and holding the instructions, if the decoded instructions are of sync attribute, until executable conditions are ready and thereafter dispatching the decoded instructions to the execution pipeline; a pre-decode section for confirming by a simple decoding, prior to decoding by the decode section, whether or not the instructions are of sync attribute; and an instruction buffer for suspending issuance to the decode section and holding the instructions subsequent to an instruction of sync attribute.

Type: Application

Filed: December 16, 2009

Publication date: April 29, 2010

Applicant: FUJITSU LIMITED

Inventor: Toshio Yoshida
Instruction issue control within a multi-threaded in-order superscalar processor

Patent number: 7707390

Abstract: A multi-threaded in-order superscalar processor 2 is described having a fetch stage 8 within which thread interleaving circuitry 36 interleaves instructions taken from different program threads to form an interleaved stream of instructions which is then decoded and subject to issue. Hint generation circuitry 62 within the fetch stage 8 adds hint data to the threads indicating that parallel issue of an associated instruction is permitted with one of more other instructions.

Type: Grant

Filed: April 25, 2007

Date of Patent: April 27, 2010

Assignee: ARM Limited

Inventors: Emre Özer, Vladimir Vasekin, Stuart David Biles
Multi-Execution Unit Processing Unit with Instruction Blocking Sequencer Logic

Publication number: 20100100712

Abstract: A processing unit includes multiple execution units and sequencer logic that is disposed downstream of instruction buffer logic, and that is responsive to a sequencer instruction present in an instruction stream. In response to such an instruction, the sequencer logic issues a plurality of instructions associated with a long latency operation to one execution unit, while blocking instructions from the instruction buffer logic from being issued to that execution unit. In addition, the blocking of instructions from being issued to the execution unit does not affect the issuance of instructions to any other execution unit, and as such, other instructions from the instruction buffer logic are still capable of being issued to and executed by other execution units even while the sequencer logic is issuing the plurality of instructions associated with the long latency operation.

Type: Application

Filed: October 16, 2008

Publication date: April 22, 2010

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Eric Oliver Mejdrich, Adam James Muff, Matthew Ray Tubbs
Reordering application code to improve processing performance

Patent number: 7698534

Abstract: A method of reordering a sequence of code for processing by a target data processor in order to reduce an execution time for said code on said target data processor is disclosed. The method comprises the steps of: in response to a request to execute said sequence of code, loading said sequence of code into a volatile data store associated with said target data processor; analyzing said sequence of code in relation to properties of said target data processor; identifying interlocks within said sequence of code when executing on said target data processor, in which a portion of code would be stalled while waiting for an earlier portion to complete; reordering said sequence of code to remove at least some of said interlocks; and executing said reordered sequence of code; wherein said steps of analyzing, identifying, reordering and executing are performed by said target data processor.

Type: Grant

Filed: February 21, 2007

Date of Patent: April 13, 2010

Assignee: ARM Limited

Inventors: Rodolph Gérard Jacques Ascanio Jean-Denis Perfetta, Graham Peter Wilkinson
VERY-LONG INSTRUCTION WORD ARCHITECTURE WITH MULTIPLE PROCESSING UNITS

Publication number: 20100082947

Abstract: A processor may include a plurality of processing units for processing instructions, where each processing unit is associated with a discrete instruction queue. Data is read from a data queue selected by each instruction, and a sequencer manages distribution of instructions to the plurality of discrete instruction queues.

Type: Application

Filed: September 8, 2009

Publication date: April 1, 2010

Inventors: Matthias Tramm, Manfred Stadler, Christian Hitz
Credit-based activity regulation within a microprocessor based on an accumulative credit system

Patent number: 7689844

Abstract: A technique to control power consumption within a microprocessor. More particularly, embodiments of the invention relate to a technique to control power and performance within one or more microprocessors by enforcing a credit-based instruction execution rate algorithm.

Type: Grant

Filed: December 27, 2007

Date of Patent: March 30, 2010

Assignee: Intel Corporation

Inventor: George Chrysos
System and Method for Issuing Load-Dependent Instructions in an Issue Queue in a Processing Unit of a Data Processing System

Publication number: 20100077181

Abstract: A system and method for issuing load-dependent instructions in an issue queue in a processing unit. A load miss queue is provided. The load miss queue comprises a physical address field, an issue queue position field, a valid identifier field, a source identifier field, and a data type field. A load instruction that misses a first level cache is dispatched, and both the physical address field and the data type field are set. A load-dependent instruction is identified. In response to indentifying the load-dependent instruction, each of the issue queue position field, valid identifier field, and source identifier field are set. If the issue queue position field refers to a flushed instruction, the valid identifier field is cleared. The load instruction is recycled, and a value of the valid identifier field is determined. The load-dependent instruction is then selected for issue on a next processing cycle independent of an age of the load-dependent instruction.

Type: Application

Filed: September 23, 2008

Publication date: March 25, 2010

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Christopher M. Abernathy, Mary D. Brown, William E. Burky, Todd A. Venton
Multithreading instruction scheduler employing thread group priorities

Patent number: 7681014

Abstract: An instruction dispatching apparatus in a multi threading microprocessor that concurrently executes N threads each in one of G groups each having one of P priorities. G round-robin vectors each have N bits corresponding to the threads, each being a 1-bit left-rotated and subsequently sign-extended version of an N-bit vector with a single bit true of the last thread selected for dispatching in the group. Each of N G-input muxes receive a corresponding one of the N bits of each of the round-robin vectors and selects for output one of the inputs specified by the corresponding thread's group. Selection logic selects for dispatching one of the N instructions corresponding to the thread whose dispatch value is greater than or equal to any of the N threads left thereof. Each dispatch value comprises a least-significant bit of the corresponding mux output, a most-significant dispatchable instruction bit, and middle thread group priority bits.

Type: Grant

Filed: July 27, 2005

Date of Patent: March 16, 2010

Assignee: MIPS Technologies, Inc.

Inventors: Michael Gottlieb Jensen, Ryan C. Kinter
Scheduling control within a data processing system

Publication number: 20100064287

Abstract: A processor 2 is responsive to a stream of program instructions to issue program instructions under control of scheduling circuitry 6 to respective execution units 24 for execution. The execution units 24 can include error detecting circuitry 32 for detecting a change in an output signal which occurs after the output signal has latched and during an error detecting period following the latching of the output signal. The scheduling circuitry 6 is arranged so as to suppress issue of program instructions to an execution unit 24 having such error detecting circuitry 32 on consecutive processing cycles.

Type: Application

Filed: July 21, 2009

Publication date: March 11, 2010

Applicant: ARM Limited

Inventors: David Michael Bull, Emre Ozer, Shidhartha Das
Replay Reduction for Power Saving

Publication number: 20100064120

Abstract: In one embodiment, a processor comprises a scheduler configured to issue a first instruction operation to be executed and an execution core coupled to the scheduler. Configured to execute the first instruction operation, the execution core comprises a plurality of replay sources configured to cause a replay of the first instruction operation responsive to detecting at least one of a plurality of replay cases. The scheduler is configured to inhibit issuance of the first instruction operation subsequent to the replay for a subset of the plurality of replay cases. The scheduler is coupled to receive an acknowledgement indication corresponding to each of the plurality of replay cases in the subset, and is configured to inhibit issuance of the first instruction operation until the acknowledgement indication is asserted that corresponds to an identified replay case of the subset.

Type: Application

Filed: November 17, 2009

Publication date: March 11, 2010

Inventors: Po-Yung Chang, Wei-Han Lien, Jesse Pan, Ramesh Gunna, Tse-Yu Yeh, James B. Keller
METHOD AND SYSTEM FOR PACKET ENCRYPTION

Publication number: 20100064116

Abstract: A data processor and a method for processing data is disclosed. The processor has an input port for receiving packets of data to be processed. A master controller acts to analyse the packets and to provide a header including a list of processes to perform on the packet of data and an ordering thereof. The master controller is programmed with process related data relating to the overall processing function of the processor. The header is appended to the packet of data. The packet with the appended header information is stored within a buffer. A buffer controller acts to determine for each packet stored within the buffer based on the header within the packet a next processor to process the packet. The controller then provides the packet to the determined processor for processing. The processed packet is returned with some indication that the processing is done. For example, the process may be deleted from the list of processes.

Type: Application

Filed: November 16, 2009

Publication date: March 11, 2010

Applicant: Mosaid Technologies Incorporated

Inventors: Arthur John Low, Stephen J. Davis
Single bit control of threads in a multithreaded multicore processor

Patent number: 7676655

Abstract: A method and mechanism for controlling threads in a multithreaded multicore processor. A processor includes multiple cores, each of which are capable of executing multiple threads. A control register which is shared by each of the cores is utilized to control the status of the threads in the processing system. In one embodiment, the shared register includes a single bit for each thread in the processor. Depending upon the value written to a bit of the shared register, one of three results may occur with respect to a thread which corresponds to the bit. In one embodiment, writing a “0” to a bit of the shared register will cause a corresponding thread to be Parked. Writing a “1” to a bit of the shared register will cause a corresponding thread to either be UnParked or be Reset. Whether writing a “1” to a bit of the register causes the corresponding thread to be UnParked or Reset depends upon a state of the processor.

Type: Grant

Filed: June 30, 2004

Date of Patent: March 9, 2010

Assignee: Sun Microsystems, Inc.

Inventor: Paul J. Jordan
Method and apparatus for back to back issue of dependent instructions in an out of order issue queue

Patent number: 7669038

Abstract: A method is provided for evaluating two or more instructions in an out of order issue queue during a particular cycle of the queue, to select an instruction for issue during the next following cycle. If an instruction was previously designated to issue during the particular cycle, one or more instructions in the queue are evaluated to determine if any of them are dependent on the designated instruction. For the evaluation, each instruction placed into the queue is accompanied by corresponding logic elements that provide destination to source compares for the instruction. In an embodiment comprising a method, the oldest ready instruction in the queue during a particular cycle is identified.

Type: Grant

Filed: May 2, 2008

Date of Patent: February 23, 2010

Assignee: International Business Machines Corporation

Inventors: William Elton Burky, Raymond Cheung Yeung
Self-checking code for tamper-resistance based on code overlapping

Patent number: 7664937

Abstract: Apparatus and methods for implementing software protection using code overlapping are disclosed. In one implementation, a combination block comprising a first sub-block of instructions with one or more interspersed obfuscation instructions is received. The obfuscation instructions interspersed among sequentially executable instructions of the first sub-block of instructions can include instructions from other sub-blocks as well as control instructions configured to guide a processor to execute all of the instructions in first sub-block of instructions in sequence. The obfuscation instructions are replaced with one or more replacement instructions. The replacement instructions can be of a same bit-length as the replaced obfuscation instructions. Moreover, the replacement instructions can include integrity checks configured to check for tampering with instructions and/or runtime program state in the first sub-block and/or the combination block.

Type: Grant

Filed: March 1, 2007

Date of Patent: February 16, 2010

Assignee: Microsoft Corporation

Inventors: Matthias Jacob, Mariusz H. Jakubowski
System and method for translating non-native instructions to native instructions for processing on a host processor

Patent number: 7664935

Abstract: A system and method for extracting complex, variable length computer instructions from a stream of complex instructions each subdivided into a variable number of instructions bytes, and aligning instruction bytes of individual ones of the complex instructions. The system receives a portion of the stream of complex instructions and extracts a first set of instruction bytes starting with the first instruction bytes, using an extract shifter. The set of instruction bytes are then passed to an align latch where they are aligned and output to a next instruction detector. The next instruction detector determines the end of the first instruction based on said set of instruction bytes. An extract shifter is used to extract and provide the next set of instruction bytes to an align shifter which aligns and outputs the next instruction. The process is then repeated for the remaining instruction bytes in the stream of complex instructions.

Type: Grant

Filed: March 11, 2008

Date of Patent: February 16, 2010

Inventors: Brett Coon, Yoshiyuki Miyayama, Le Trong Nguyen, Johannes Wang
Data processing apparatus with parallel operating functional units

Patent number: 7664929

Abstract: A program of instruction words is executed with a VLIW data processing apparatus. The apparatus comprises a plurality of functional units capable of executing a plurality of instructions from each instruction word in parallel. The instructions from each of at least some of the instruction words are fetched from respective memory units in parallel, addressed with an instruction address that is common for the functional units. Translation of the instruction address into a physical address can be modified for one or more particular ones of the memory units. Modification is controlled by modification update instructions in the program. Thus, it can be selected dependent on program execution which instructions from the memory units will be combined into the instruction word in response to the instruction address.

Type: Grant

Filed: September 17, 2003

Date of Patent: February 16, 2010

Assignee: Koninklijke Philips Electronics N.V.

Inventors: Carlos Antonio Alba Pinto, Ramanathan Sethuraman, Srinivasan Balakrishnan, Harm Johannes Antonius Maria Peters, Rafael Peset Llopis
Prioritizing thread selection partly based on stall likelihood providing status information of instruction operand register usage at pipeline stages

Patent number: 7664936

Abstract: An apparatus for scheduling dispatch of instructions among a plurality of threads being concurrently executed in a multithreading processor is provided. The apparatus includes an instruction decoder that generate register usage information for an instruction from each of the threads, a priority generator that generates a priority for each instruction based on the register usage information and state information of instructions currently executing in an execution pipeline, and selection logic that dispatches at least one instruction from at least one thread based on the priority of the instructions. The priority indicates the likelihood the instruction will execute in the execution pipeline without stalling. For example, an instruction may have a high priority if it has little or no register dependencies or its data is known to be available; or may have a low priority if it has strong register dependencies or is an uncacheable or synchronized storage space load instruction.

Type: Grant

Filed: February 4, 2005

Date of Patent: February 16, 2010

Assignee: MIPS Technologies, Inc.

Inventors: Michael Gottlieb Jensen, Darren M. Jones, Ryan C. Kinter, Sanjay Vishin
Multithreading instruction scheduler employing thread group priorities

Patent number: 7660969

Abstract: A concurrent instruction dispatch apparatus includes a group indicator for each of a plurality of threads that indicates which one of a plurality of groups of the threads the thread belongs to. A group priority indicator for each group indicates an instruction dispatch priority relative to the other groups. Selection logic selects a thread for dispatching an instruction thereof based on the group and group priority indicators. A bifurcated scheduler includes first scheduler logic that issues instructions of the threads to an execution unit, second scheduler logic that enforces a thread scheduling policy, and an interface. A group indicator indicates which group each thread belongs to, a priority for each group, and execution information for each thread. The first scheduler logic issues the instructions based on the group priorities and group indicators, and the second scheduler logic updates the group indicators based on the instruction execution information.

Type: Grant

Filed: January 5, 2007

Date of Patent: February 9, 2010

Assignee: MIPS Technologies, Inc.

Inventors: Michael Gottlieb Jensen, Ryan C. Kinter
Structure and method for achieving very large lookahead instruction window via non-sequential instruction fetch and issue

Patent number: 7650485

Abstract: A multithreading processor achieves a very large lookahead instruction window by allowing non-sequential fetch and processing of the dynamic instruction stream. A speculative thread is spawned at a specified point in the dynamic instruction stream and the instructions subsequent to the specified point are speculatively executed so that these instructions are fetched and issued out of sequential order. Very minimal modifications to existing processor design of a multithreading processor are required to achieve the very large lookahead instruction window. The modifications include changes to the control logic of the issue unit, only three additional bits in the register scoreboard.

Type: Grant

Filed: April 10, 2007

Date of Patent: January 19, 2010

Assignee: Sun Microsystems, Inc.

Inventor: Yuan C. Chou
MEMORY THROUGHPUT INCREASE VIA FINE GRANULARITY OF PRECHARGE MANAGEMENT

Publication number: 20090327660

Abstract: Methods and apparatus to improve throughput in memory devices are described. In one embodiment, memory throughput is increased via fine granularity of precharge management. In an embodiment, three separate precharge timings may be used, e.g., optimized per memory bank, per memory bank group, and/or per a memory device. Other embodiments are also disclosed and claimed.

Type: Application

Filed: June 30, 2008

Publication date: December 31, 2009

Inventors: Kuljit S. Bains, John B. Halbert
Dynamic field patchable microarchitecture

Patent number: 7640418

Abstract: A microprocessor memory architecture including a read-only memory (ROM) with programmed microcode and a random access memory (RAM) capable of storing microcode and one or more data bits used for the selection of corresponding ROM or RAM microcode for execution. A multiplexer receives input signals from both the ROM microcode and RAM microcode, and a control signal which is one or more RAM data bits is used to select from the RAM or ROM microcode inputs for further execution by the microprocessor.

Type: Grant

Filed: August 9, 2004

Date of Patent: December 29, 2009

Assignee: Broadcom Corporation

Inventors: Sherman Lee, Vivian Y. Chou, John H. Lin
VOLTAGE DROOP MITIGATION THROUGH INSTRUCTION ISSUE THROTTLING

Publication number: 20090300329

Abstract: A system and method for providing a digital real-time voltage droop detection and subsequent voltage droop reduction. A scheduler within a reservation station may store a weight value for each instruction corresponding to node capacitance switching activity for the instruction derived from pre-silicon power modeling analysis. For instructions picked with available source data, the corresponding weight values are summed together to produce a local current consumption value and this value is summed with any existing global current consumption values from corresponding schedulers of other processor cores yielding an activity event. The activity event is stored. Hashing functions within the scheduler are used to determine both a recent and an old activity average using the calculated activity event and stored older activity events.

Type: Application

Filed: May 27, 2008

Publication date: December 3, 2009

Inventors: Samuel D. Naffziger, Michael Gerard Butler
Data processing device and electronic equipment using pipeline control

Patent number: 7620803

Abstract: A data processing device is provided using pipeline architecture to reduce a time loss due to a branch without causing an increase in circuit scale. The data processing device uses pipeline control. The data processing device includes an instruction queue in which a plurality of instruction codes can be fetched, a fetch address operation circuit which calculates a fetch address, a fetch circuit which fetches an instruction code based on the fetch address, and a branch information setting circuit which decodes a branch setting instruction, stores a branch address in a branch address storage register, and stores a branch target address in a branch target address storage register. The fetch address operation circuit compares either a previous fetch address or an expected next fetch address with a value stored in the branch address storage register, and determines a next fetch address to be output, based on the comparison result.

Type: Grant

Filed: June 20, 2003

Date of Patent: November 17, 2009

Assignee: Seiko Epson Corporation

Inventor: Makoto Kudo
Preferential Dispatching Of Computer Program Instructions

Publication number: 20090282221

Abstract: A computer processor that includes a plurality of execution pipelines, each execution pipeline including a configuration of one or more execution units of the processor, each execution pipeline characterized by an execution pipeline type, each execution pipeline type determined according to the types of computer program instructions executed in each execution pipeline; a plurality of hardware threads of execution, each hardware thread including computer program instructions characterized by instruction types, each hardware thread including sequences of instructions of a same instruction type, the sequences interspersed with instructions of other types; and an instruction dispatcher capable of dispatching instructions preferentially during a predefined preference period from a preferred hardware thread to a particular execution pipeline in dependence upon whether the preferred hardware thread presents a sequence of instructions, ready for execution from the preferred hardware thread, of a type for execution in

Type: Application

Filed: May 9, 2008

Publication date: November 12, 2009

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Timothy H. Heil, Brian L. Koehler, Eric O. Mejdrich
Interfacing external thread prioritizing policy enforcing logic with customer modifiable register to processor internal scheduler

Patent number: 7613904

Abstract: A bifurcated instruction scheduler for dispatching instructions of multiple threads concurrently executing in a multithreading processor is provided. The scheduler includes a first portion within a reusable core that is not customizable by a customer, a second portion outside the core that is customizable, and an interface coupling the second portion to the core. The second portion implements a thread scheduling policy that may be customized to the customer's particular application. The first portion may be scheduling policy-agnostic and issues instructions of the threads each clock cycle to execution units based on the scheduling policy communicated by the second portion. The second portion communicates the scheduling policy via a priority for each of the threads. When the core commits an instruction for execution, the core communicates to the second portion which thread the committed instruction is in to enable the second portion to update the priorities in response thereto.

Type: Grant

Filed: February 4, 2005

Date of Patent: November 3, 2009

Assignee: MIPS Technologies, Inc.

Inventors: Darren M. Jones, Ryan C. Kinter, Kevin D. Kissell, Thomas A. Petersen
Apparatus For Storing Instructions In A Multithreading Microprocessor

Publication number: 20090271592

Abstract: A circuit for selecting one of N requesters in a round-robin fashion is disclosed. The circuit 1-bit left rotatively increments a first addend by a second addend to generate a sum that is ANDed with the inverse of the first addend to generate a 1-hot vector indicating which of the requestors is selected next. The first addend is an N-bit vector where each bit is false if the corresponding requester is requesting access to a shared resource. The second addend is a 1-hot vector indicating the last selected requestor. A multithreading microprocessor dispatch scheduler employs the circuit for N concurrent threads each thread having one of P priorities. The dispatch scheduler generates P N-bit 1-hot round-robin bit vectors, and each thread's priority is used to select the appropriate round-robin bit from P vectors for combination with the thread's priority and an issuable bit to create a dispatch level used to select a thread for instruction dispatching.

Type: Application

Filed: April 23, 2009

Publication date: October 29, 2009

Applicant: MIPS Technologies, Inc.

Inventor: Michael Gottlieb Jensen
Dynamic allocation of a buffer across multiple clients in multi-threaded processor without performing a complete flush of data associated with allocation

Patent number: 7603544

Abstract: A method may include distributing ranges of addresses in a memory among a first set of functions in a first pipeline. The first set of the functions in the first pipeline may operate on data using the ranges of addresses. Different ranges of addresses in the memory may be redistributed among a second set of functions in a second pipeline without waiting for the first set of functions to be flushed of data.

Type: Grant

Filed: September 12, 2005

Date of Patent: October 13, 2009

Assignee: Intel Corporation

Inventor: Thomas A. Piazza

prev … 5 6 7 8 9 10 11 12 13 … next