Patents Examined by William Partridge

Apparatus and method for dynamic control of the number of simultaneously executing tasks based on throughput

Patent number: 8555290

Abstract: A task count controller, a task count control method, and a computer program capable of dynamically controlling the number of tasks that can be processed in parallel simultaneously without increasing computational load are provided. When a plurality of tasks are to be executed simultaneously in parallel processing, the number of tasks that can be executed simultaneously is controlled. The tasks to be executed simultaneously are added in units of a predetermined number of tasks and the throughput in one unit of work is measured for each task every time the tasks are added. The total sum of the measured throughputs is calculated, and it is determined whether the calculated total sum of throughputs is more than the total sum of throughputs immediately before the predetermined number of tasks are added.

Type: Grant

Filed: March 16, 2009

Date of Patent: October 8, 2013

Assignee: International Business Machines Corporation

Inventors: Fumitaka Uruma, Yoshiko Yaegashi
Adaptive integrated circuitry with heterogeneous and reconfigurable matrices of diverse and adaptive computational units having fixed, application specific computational elements

Patent number: 8533431

Abstract: The present invention concerns a new category of integrated circuitry and a new methodology for adaptive or reconfigurable computing. The preferred IC embodiment includes a plurality of heterogeneous computational elements coupled to an interconnection network. The plurality of heterogeneous computational elements include corresponding computational elements having fixed and differing architectures, such as fixed architectures for different functions such as memory, addition, multiplication, complex multiplication, subtraction, configuration, reconfiguration, control, input, output, and field programmability. In response to configuration information, the interconnection network is operative in real-time to configure and reconfigure the plurality of heterogeneous computational elements for a plurality of different functional modes, including linear algorithmic operations, non-linear algorithmic operations, finite state machine operations, memory operations, and bit-level manipulations.

Type: Grant

Filed: October 15, 2008

Date of Patent: September 10, 2013

Assignee: Altera Corporation

Inventors: Paul L. Master, Eugene Hogenauer, Walter James Scheuermann
Dynamic merging of pipeline stages in an execution pipeline to reduce power consumption

Patent number: 8291201

Abstract: A pipelined execution unit incorporates one or more low power modes that reduce power consumption by dynamically merging pipeline stages in an execution pipeline together with one another. In particular, the execution logic in successive pipeline stages in an execution pipeline may be dynamically merged together by setting one or more latches that are intermediate to such pipeline stages to a transparent state such that the output of the pipeline stage preceding such latches is passed to the subsequent pipeline stage during the same clock cycle so that both such pipeline stages effectively perform steps for the same instruction during each clock cycle. Then, with the selected pipeline stages merged, the power consumption of the execution pipeline can be reduced (e.g., by reducing the clock frequency and/or operating voltage of the execution pipeline), often with minimal adverse impact on performance.

Type: Grant

Filed: May 22, 2008

Date of Patent: October 16, 2012

Assignee: International Business Machines Corporation

Inventors: Stephen Joseph Schwinn, Matthew Ray Tubbs, Charles David Wait
Managing assignment of partition services to virtual input/output adapters

Patent number: 8281306

Abstract: Automated management of partition service assignment to a virtual input/output (VIO) adapter is provided. Responsive to creation of a new partition service in a data processing system, a partition priority number is determined for the new partition service, and, for each VIO adapter, the partition priority numbers of the partition services currently assigned to that VIO adapter are summed. For a VIO adapter with a lowest sum of partition priority numbers, logic determines whether assigning the new partition service to that VIO adapter results in its summed partition priority number being above a predefined threshold, and the new partition service is assigned to a VIO adapter based, at least in part, on whether assigning the new partition service to the VIO adapter with the lowest sum of partition priority numbers results in that VIO adapter's summed partition priority number exceeding the predefined threshold.

Type: Grant

Filed: March 13, 2009

Date of Patent: October 2, 2012

Assignee: International Business Machines Corporation

Inventors: Bryan M. Logan, Kyle A. Lucke, Amartey S. Pearson, Steven E. Royer
Intentionally delaying execution of a copy instruction to achieve simultaneous execution with a subsequent, non-adjacent write instruction

Patent number: 8271766

Abstract: An information processing device including registers (105) for holding data and an operation device (102) for executing arithmetic and logic operations on input/output data held in the register. The information processing device can issue an inter-register copy instruction for instructing data held in one register to be copied to another register. The information processing device further includes a copy information holding device (113) for reserving for execution of a data copy operation by the inter-register copy instruction from a control unit (108) so as to execute the actual copy operation simultaneously with the succeeding instruction to hide the execution time of the copy operation. Thus, in the inter-register copy instruction execution phase, a reservation for a data copy operation is stored in the copy information holding device so that the execution phase is completed without performing the actual data copy operation.

Type: Grant

Filed: May 18, 2006

Date of Patent: September 18, 2012

Assignee: NEC Corporation

Inventor: Noritaka Hoshi
Digital data processing apparatus having hardware multithreading support including cache line limiting mechanism for special class threads

Patent number: 8250347

Abstract: Asymmetric hardware support for a special class of threads is provided. Preferably, the special class threads are high-priority, I/O bound threads. Preferably, a multithreaded processor contains N sets of registers for supporting concurrent execution of N threads. At least one of the register sets is dedicated for use by a special class of threads, and can not be used by other threads even if idle. The special class of threads can fill only the a limited portion of the cache memory, in order to reduce flushing of the cache which might otherwise occur.

Type: Grant

Filed: October 24, 2007

Date of Patent: August 21, 2012

Assignee: International Business Machines Corporation

Inventor: David Alan Kra
Methods and apparatus for dynamically switching processor mode

Patent number: 8250348

Abstract: In a first aspect, a first processing method is provided. The first processing method includes the steps of (1) operating a processor in a first mode based on an operand size associated with a first instruction received by the processor; and (2) dynamically switching the processor operation mode from the first mode to a second mode based on a different operand size associated with a second instruction received by the processor. Numerous other aspects are provided.

Type: Grant

Filed: May 19, 2005

Date of Patent: August 21, 2012

Assignee: International Business Machines Corporation

Inventors: Gordon T. Davis, Jeffrey H. Derby
Apparatus, system and method for configuration of adaptive integrated circuitry having heterogeneous computational elements

Patent number: 8225073

Abstract: The present invention concerns configuration of a new category of integrated circuitry for adaptive or reconfigurable computing. The preferred adaptive computing engine (ACE) IC includes a plurality of heterogeneous computational elements coupled to an interconnection network. The plurality of heterogeneous computational elements include corresponding computational elements having fixed and differing architectures, such as fixed architectures for different functions such as memory, addition, multiplication, complex multiplication, subtraction, configuration, reconfiguration, control, input, output, and field programmability.

Type: Grant

Filed: March 6, 2009

Date of Patent: July 17, 2012

Assignee: QST Holdings LLC

Inventors: Paul L. Master, Stephen J. Smith, John Watson
Processing long-latency instructions in a pipelined processor

Patent number: 8214624

Abstract: There are provided a method and a processor for processing a thread. The thread includes a plurality of sequential instructions. The plurality of sequential instructions include some short-latency instructions and some long-latency instructions and at least one hazard instruction. The hazard instruction requires one or more preceding instructions to be processed before the hazard instruction is processed. The method includes the steps of: a) before processing each long-latency instruction, incrementing by one, a counter associated with the thread; b) after each long-latency instruction has been processed, decrementing by one, the counter associated with the thread; c) before processing each hazard instruction, checking the value of the counter associated with the thread, and i) if the counter value is zero, processing the hazard instruction, or ii) if the counter value is non-zero, pausing processing of the hazard instruction until a later time.

Type: Grant

Filed: May 23, 2007

Date of Patent: July 3, 2012

Assignee: Imagination Technologies Limited

Inventors: Morrie Berglas, Yoong Chert Foo
Reduction operations in a synchronous parallel thread processing system with disabled execution threads

Patent number: 8200940

Abstract: A system and method for successfully performing reduction operations in a multi-threaded SIMD (single-instruction multiple-data) system while one or more threads are disabled allows for the reduction operations to be performed without a performance penalty compared with performing the same operation with all of the threads enabled. The source data for each intermediate computation of the reduction operation is remapped by a configurable crossbar as needed to avoid using invalid data from the disabled threads. The remapping function is transparent to the user and enables correct execution of order invariant reduction operations and order dependent prefix-reduction operations.

Type: Grant

Filed: June 30, 2008

Date of Patent: June 12, 2012

Assignee: NVIDIA Corporation

Inventor: John Erik Lindholm
Determining real time stateful business application processing in an otherwise stateless service-oriented architecture

Patent number: 8196145

Abstract: Methods, apparatus, and products for deterministic real time stateful business application processing in an otherwise stateless service-oriented architecture (‘SOA’), the SOA including SOA services with each SOA service carrying out a processing step of the business application, each SOA service is a real time process executable on a real time operating system of a generally programmable computer and business application processing according to embodiments of the present invention includes: configuring each service of the SOA to record state information describing the state of the service upon completion of a processing step in the business application and provide the state information to a subsequent service, the state information including real time processing information; and executing the business application in the SOA in real time, including sending requests for data processing among the services, each such request comprising a specification of the state of the executing business application.

Type: Grant

Filed: June 26, 2008

Date of Patent: June 5, 2012

Assignee: International Business Machines Corporation

Inventors: Landon C. Miller, Siljan H. Simpson
Method and system for purging pattern history tables as a function of global accuracy in a state machine-based filtered gshare branch predictor

Patent number: 8195926

Abstract: A method, system and computer product for purging pattern history tables as a function of global accuracy in a state machine-based filter gshare branch predictor. An exemplary embodiment includes a method including storing a plurality of encountered branch instructions in the branch history table, indexing the branch history table by a branch instruction address, modifying an entry of the branch history table, indexing the pattern history table, selecting at least one of a branch history entry and a pattern history table entry as a prediction for the branch instruction, wherein the pattern history table entry is selected as the prediction for the branch instruction in response to the branch history entry being in a state specifying to use the pattern history table entry, comparing a pattern history table accuracy to an accuracy threshold, and in response to the pattern history table accuracy falling below the accuracy threshold, purging the PHT.

Type: Grant

Filed: February 15, 2008

Date of Patent: June 5, 2012

Assignee: International Business Machines Corporation

Inventors: James J. Bonanno, Brian R. Prasky
Apparatus and method for heterogeneous chip multiprocessors via resource allocation and restriction

Patent number: 8190863

Abstract: A method and apparatus for heterogeneous chip multiprocessors (CMP) via resource restriction. In one embodiment, the method includes the accessing of a resource utilization register to identify a resource utilization policy. Once accessed, a processor controller ensures that the processor core utilizes a shared resource in a manner specified by the resource utilization policy. In one embodiment, each processor core within a CMP includes an instruction issue throttle resource utilization register, an instruction fetch throttle resource utilization register and other like ways of restricting its utilization of shared resources within a minimum and maximum utilization level. In one embodiment, resource restriction provides a flexible manner for allocating current and power resources to processor cores of a CMP that can be controlled by hardware or software. Other embodiments are described and claimed.

Type: Grant

Filed: July 2, 2004

Date of Patent: May 29, 2012

Assignee: Intel Corporation

Inventors: Tryggve Fossum, George Chrysos, Todd A. Dutton
Millicode assist instructions for millicode store access exception checking

Patent number: 8176301

Abstract: Millicode store access checking instructions are provided via an operand access control register (OACR) including a test modifier indicator, which is communicatively coupled to an instruction unit subsystem, the instruction unit subsystem for fetching and decoding instructions. The instructions include a millicode instruction with an operand defining an address to check for a store access exception. In addition, an execution unit for executing the millicode instruction performs a method. The method includes receiving the millicode instruction from the instruction unit subsystem, testing for the store access exception at the address as if the test modifier is set absent an update to the OACR, and outputting a result of the testing for the store access exception.

Type: Grant

Filed: February 15, 2008

Date of Patent: May 8, 2012

Assignee: International Business Machines Corporation

Inventors: Mark S. Farrell, Bruce C. Giamei, Chung-Lung Kevin Shum
Store misaligned vector with permute

Patent number: 8161271

Abstract: Embodiments of the invention provide logic within the store data path between a processor and a memory array. The logic may be configured to misalign vector data as it is stored to memory. By misaligning vector data as it is stored to memory, memory bandwidth may be maximized while processing bandwidth required to store vector data misaligned is minimized. Furthermore, embodiments of the invention provide logic within the load data path which allows vector data which is stored misaligned to be aligned as it is loaded into a vector register. By aligning misaligned vector data as it is loaded into a vector register, memory bandwidth may be maximized while processing bandwidth required to align misaligned vector data may be minimized.

Type: Grant

Filed: July 11, 2007

Date of Patent: April 17, 2012

Assignee: International Business Machines Corporation

Inventors: David Arnold Luick, Eric Oliver Mejdrich, Adam James Muff
Executing a gather operation on a parallel computer

Patent number: 8140826

Abstract: Methods, apparatus, and computer program products are disclosed for executing a gather operation on a parallel computer according to embodiments of the present invention. Embodiments include configuring, by the logical root, a result buffer or the logical root, the result buffer having positions, each position corresponding to a ranked node in the operational group and for storing contribution data gathered from that ranked node.

Type: Grant

Filed: May 29, 2007

Date of Patent: March 20, 2012

Assignee: International Business Machines Corporation

Inventors: Charles J. Archer, Joseph D. Ratterman
Tracking effective addresses in an out-of-order processor

Patent number: 8131976

Abstract: Mechanisms, in a data processing system, are provided for tracking effective addresses through a processor pipeline of the data processing system. The mechanisms comprise logic for fetching an instruction from an instruction cache and associating, by an effective address table logic in the data processing system, an entry in an effective address table (EAT) data structure with the fetched instruction. The mechanisms further comprise logic for associating an effective address tag (eatag) with the fetched instruction, the eatag comprising a base eatag that points to the entry in the EAT and an eatag offset. Moreover, the mechanisms comprise logic for processing the instruction through the processor pipeline by processing the eatag.

Type: Grant

Filed: April 13, 2009

Date of Patent: March 6, 2012

Assignee: International Business Machines Corporation

Inventors: Richard W. Doing, Susan E. Eisen, David S. Levitan, Kevin N. Magill, Brian R. Mestan, Balaram Sinharoy, Benjamin W. Stolt, Jeffrey R. Summers, Albert J. Van Norstrand, Jr.
Structure for dynamically adjusting pipelined data paths for improved power management

Patent number: 8086832

Abstract: A design structure embodied in a machine readable, non-transitory storage medium used in a design process includes a system for dynamically varying the pipeline depth of a computing device. The system includes a state machine that determines an optimum length of a pipeline architecture based on a processing function to be performed. A pipeline sequence controller, responsive to the state machine, varies the depth of the pipeline based on the optimum length. A plurality of clock splitter elements, each associated with a corresponding plurality of latch stages in the pipeline architecture, are coupled to the pipeline sequence controller and adapted to operate in a functional mode, one or more clock gating modes, and a pass-through flush mode. For each of the clock splitter elements operating in the pass-through flush mode, data is passed through the associated latch stage without oscillation of clock signals associated therewith.

Type: Grant

Filed: October 9, 2007

Date of Patent: December 27, 2011

Assignee: International Business Machines Corporation

Inventors: Susan K. Lichtensteiger, Pascal A. Nsame, Sebastian T. Ventrone
Method and apparatus for coherent device initialization and access

Patent number: 8082418

Abstract: A method and apparatus for enabling usage of an accelerator device in a processor socket is herein described. A set of inter-processor messages is utilized to initialize a configuration/memory space of the accelerator device. As an example, a first set of inter-processor interrupts (IPIs) is sent to indicate a base address of a memory space and a second set of IPIs is sent to indicate a size of the memory space. Furthermore, similar methods and apparatus' are herein described for dynamic reconfiguration of an accelerator device in a processor socket.

Type: Grant

Filed: December 17, 2007

Date of Patent: December 20, 2011

Assignee: Intel Corporation

Inventors: Paul M. Stillwell, Jr., Nagabhushan Chitlur, Dennis Bradford, Linda Rankin
Methods and system for resolving simultaneous predicted branch instructions

Patent number: 8082428

Abstract: A method of resolving simultaneous branch predictions prior to validation of the predicted branch instruction is disclosed. The method includes processing two or more predicted branch instructions, with each predicted branch instruction having a predicted state and a corrected state. The method further includes selecting one of the corrected states. Should one of the predicted branch instructions be mispredicted, the selected corrected state is used to direct future instruction fetches.

Type: Grant

Filed: September 29, 2009

Date of Patent: December 20, 2011

Assignee: QUALCOMM Incorporated

Inventors: Rodney Wayne Smith, Brian Michael Stempel, James Norris Dieffenderfer, Thomas Andrew Sartorius

1 2 next