Including Scheduling Instructions Patents (Class 717/161)

Prefetching using future branch path information derived from branch prediction

Patent number: 7441110

Abstract: A mechanism is described that predicts the usefulness of a prefetching instruction during the instruction's decode cycle. Prefetching instructions that are predicted as useful (prefetch useful data) are sent to an execution unit of the processor for execution, while instructions that are predicted as not useful are discarded. The prediction regarding the usefulness of a prefetching instructions is performed utilizing a branch prediction mask contained in the branch history mechanism. This mask is compared to information contained in the prefetching instruction that records the branch path between the prefetching instruction and actual use of the data. Both instructions and data can be prefetched using this mechanism.

Type: Grant

Filed: December 10, 1999

Date of Patent: October 21, 2008

Assignee: International Business Machines Corporation

Inventors: Thomas R. Puzak, Allan M. Hartstein, Mark Charney, Daniel A. Prener, Peter H. Oden
Transient shared computer resource and settings change bubble for computer programs

Patent number: 7434211

Abstract: Described is a mechanism that preserves the state of computer system shared resources and/or settings, and ensures that changes thereto are reverted when an application exits. A shared resource change bubble logically surrounds application code that causes system resource and/or setting data to change. The bubble preserves existing data before it gets changed, and restores the data when the application program code exits. In one implementation, the bubble is implemented as a library loaded by the application. In an alternative implementation, the bubble is run in a separate process, whereby the bubble can restore changed data even if the application program crashes. In another implementation, a bubble is automatically run for any application that the user has specified needs a bubble. Multiple settings and states may be preserved in a bubble for multiple applications, allowing changes to be undone and reapplied per application, e.g., whenever focus changes.

Type: Grant

Filed: February 16, 2005

Date of Patent: October 7, 2008

Assignee: Microsoft Corporation

Inventors: Roger H. Wynn, Michael S. Bernstein, Kamesh Chander Tumsi Dayakar
Efficient Data Reorganization to Satisfy Data Alignment Constraints

Publication number: 20080201699

Abstract: Vectorizing misaligned references in compiled code for SIMD architectures that support only aligned loads and stores is presented. In the framework presented herein, a loop is first simdized as if the memory unit imposes no alignment constraints. The compiler then inserts data reorganization operations to satisfy the actual alignment requirement of the hardware. Finally, the code generation algorithm generates SIMD codes based on the data reorganization graph, addressing realistic issues such as runtime alignments, unknown loop bounds, residue iteration counts, and multiple statements with arbitrary alignment combinations. Beyond generating a valid simdization, a preferred embodiment further improves the quality of the generated codes. Four stream-shift placement policies are disclosed, which minimize the number of data reorganization generated by the alignment handling.

Type: Application

Filed: April 23, 2008

Publication date: August 21, 2008

Inventors: Alexandre E. Eichenberger, John Kevin Patrick O'Brien, Peng Wu
Reordering application code to improve processing performance

Publication number: 20080201698

Abstract: A method of reordering a sequence of code for processing by a target data processor in order to reduce an execution time for said code on said target data processor is disclosed. The method comprises the steps of: in response to a request to execute said sequence of code, loading said sequence of code into a volatile data store associated with said target data processor; analyzing said sequence of code in relation to properties of said target data processor; identifying interlocks within said sequence of code when executing on said target data processor, in which a portion of code would be stalled while waiting for an earlier portion to complete; reordering said sequence of code to remove at least some of said interlocks; and executing said reordered sequence of code; wherein said steps of analyzing, identifying, reordering and executing are performed by said target data processor.

Type: Application

Filed: February 21, 2007

Publication date: August 21, 2008

Inventors: Rodolph Gerard Jacques Ascanio Jean-Denis Perfetta, Graham Peter Wilkinson
Runtime quality verification of execution units

Patent number: 7415700

Abstract: One embodiment disclosed relates to a method of compiling a program to be executed on a target microprocessor with multiple execution units of a same type. The method includes selecting one of the execution units for testing and scheduling the parallel execution of program code and diagnostics code. The diagnostic code is scheduled to be executed on the selected execution unit. The program code is scheduled to be executed on remaining execution units of the same type.

Type: Grant

Filed: October 14, 2003

Date of Patent: August 19, 2008

Assignee: Hewlett-Packard Development Company, L.P.

Inventors: Ken Gary Pomaranski, Andrew Harvey Barr, Dale John Shidla
Compiling computer programs to exploit parallelism without exceeding available processing resources

Patent number: 7401329

Abstract: A compilation technique for computer programs forms a data flow graph of vertices which are analysed to form clusters C for parallel execution where those clusters are added to up to the point at which arbitrary selection between further vertices C, D to be added must be made. This data flow graph with these small clusters is then scheduled such that the clusters do not overlap with other clusters or with vertices outside of clusters. This starting point scheduled data flow graph is then subject to iterative processing whereby a window of timestamps is analysed to see if a candidate cluster formed by the parallel execution of the vertices within that window will result in faster execution whilst avoiding exceeding architectural constraints, such as register occupancy. If the rescheduled vertices do improve performance without exceeding architectural constraints, then this new schedule is adopted and the following vertices are subject to an adjustment in their timestamps to account for this.

Type: Grant

Filed: April 25, 2005

Date of Patent: July 15, 2008

Assignee: ARM Limited

Inventor: Bert De Rijck
Framework for efficient code generation using loop peeling for SIMD loop code with multiple misaligned statements

Patent number: 7395531

Abstract: A system and method is provided for vectorizing misaligned references in compiled code for SIMD architectures that support only aligned loads and stores. In this framework, a loop is first simdized as if the memory unit imposes no alignment constraints. The compiler then inserts data reorganization operations to satisfy the actual alignment requirements of the hardware. Finally, the code generation algorithm generates SIMD codes based on the data reorganization graph, addressing realistic issues such as runtime alignments, unknown loop bounds, residual iteration counts, and multiple statements with arbitrary alignment combinations. Loop peeling is used to reduce the computational overhead associated with misaligned data. A loop prologue and epilogue are peeled from individual iterations in the simdized loop, and vector-splicing instructions are applied to the peeled iterations, while the steady-state loop body incurs no additional computational overhead.

Type: Grant

Filed: August 16, 2004

Date of Patent: July 1, 2008

Assignee: International Business Machines Corporation

Inventors: Alexandre E. Eichenberger, Kai-Ting Amy Wang, Peng Wu
Process for running programs on processors and corresponding processor system

Patent number: 7395532

Abstract: Programs having a given instruction-set architecture are executed on a multiprocessor system comprising a plurality of processors, for example of a VLIW type, each of said processors being able to execute, at each processing cycle, a respective maximum number of instructions. The instructions are compiled as instruction words of given length executable on a first processor. At least some of the instruction words of given length are converted into modified-instruction words executable on a second processor. The operation of modifying comprises in turn at least one operation chosen in the group consisting of: splitting the instruction words into modified-instruction words; and entering no-operation instructions in the modified-instruction words.

Type: Grant

Filed: July 1, 2003

Date of Patent: July 1, 2008

Assignee: STMicroelectronics S.r.l.

Inventors: Antonio Maria Borneo, Fabrizio Simone Rovati, Danilo Pietro Pau
Methods and apparatus to dynamically insert prefetch instructions based on compiler and garbage collector analysis

Patent number: 7389385

Abstract: Methods and apparatus to insert prefetch instructions based on garbage collector analysis and compiler analysis are disclosed. In an example method, one or more batches of samples associated with cache misses from a performance monitoring unit in a processor system are received. One or more samples from the one or more batches of samples based on delinquent information are selected. A performance impact indicator associated with the one or more samples is generated. Based on the performance indicator, at least one of a garbage collector analysis and a compiler analysis is initiated to identify one or more delinquent paths. Based on the at least one of the garbage collector analysis and the compiler analysis, one or more prefetch points to insert prefetch instructions are identified.

Type: Grant

Filed: December 19, 2003

Date of Patent: June 17, 2008

Assignee: Intel Corporation

Inventors: Mauricio J. Serrano, Sreenivas Subramoney, Richard L. Hudson, Ali-Reza Adl-Tabatabai
Compiler apparatus and method of optimizing a source program by reducing a hamming distance between two instructions

Patent number: 7386844

Abstract: A compiler apparatus is capable of generating instruction sequences causing a processor to operate with lower power consumption. The compiler apparatus translates a source program into a machine language program for a processor including execution units which can execute instructions in parallel, and including instruction issue units which issue the instructions executed, respectively, by the execution units. The compiler apparatus includes a parser unit operable to parse the source program, an intermediate code conversion unit operable to convert the parsed source program into intermediate codes, an optimization unit operable to optimize the intermediate codes to reduce a hamming distance between instructions from the same instruction issue unit in consecutive instruction cycles, and includes a code generation unit operable to convert the optimized intermediate codes into machine language instructions.

Type: Grant

Filed: January 21, 2004

Date of Patent: June 10, 2008

Assignee: Matsushita Electric Industrial Co., Ltd.

Inventors: Taketo Heishi, Hajime Ogawa, Takenobu Tani, Yukihiro Sasagawa
Compiler device, method, program and recording medium

Patent number: 7383544

Abstract: Compiler device optimizes a program by changing an order of executing instructions.

Type: Grant

Filed: March 4, 2004

Date of Patent: June 3, 2008

Assignee: International Business Machines Corporation

Inventors: Motohiro Kawahito, Hideaki Komatsu
Method for increasing the speed of speculative execution

Patent number: 7337439

Abstract: A method for increasing the speed of execution by a processor including the steps of selecting a sequence of instructions to optimize, optimizing the sequence of instructions, creating a duplicate of instructions from the sequence of instructions which has been selected to optimize, executing the optimized sequence of instructions, and responding to an error during the execution of the optimized sequence of instructions by rolling back to the duplicate of instructions from the sequence of instructions.

Type: Grant

Filed: June 18, 2003

Date of Patent: February 26, 2008

Assignee: Transmeta Corporation

Inventors: Richard Johnson, Guillermo Rozas
METHODS AND APPARATUS TO OPTIMIZE COMPUTER INSTRUCTIONS

Publication number: 20080040711

Abstract: Methods and apparatus to optimize computer instructions are disclosed. An example method includes receiving a set of computer instructions, determining a first location of a first computer instruction that indicates the end of a critical section in the set of computer instructions, and modifying the execution order of the set of computer instructions to cause the first computer instruction to be executed earlier than the first location. In an example implementation, the disclosed methods and apparatus may be used to optimize the performance of computer instructions executing on multi-processing computer systems.

Type: Application

Filed: October 2, 2006

Publication date: February 14, 2008

Inventors: Xiaofeng Guo, Jinquan Dai, Long Li
Methods and apparatus for reducing memory latency in a software application

Patent number: 7328433

Abstract: Methods and apparatus for reducing memory latency in a software application are disclosed. A disclosed system uses one or more helper threads to prefetch variables for a main thread to reduce performance bottlenecks due to memory latency and/or a cache miss. A performance analysis tool is used to profile the software application's resource usage and identifies areas in the software application experiencing performance bottlenecks. Compiler-runtime instructions are generated into the software application to create and manage the helper thread. The helper thread prefetches data in the identified areas of the software application experiencing performance bottlenecks. A counting mechanism is inserted into the helper thread and a counting mechanism is inserted into the main thread to coordinate the execution of the helper thread with the main thread and to help ensure the prefetched data is not removed from the cache before the main thread is able to take advantage of the prefetched data.

Type: Grant

Filed: October 2, 2003

Date of Patent: February 5, 2008

Assignee: Intel Corporation

Inventors: Xinmin Tian, Shih-wei Liao, Hong Wang, Milind Girkar, John Shen, Perry Wang, Grant Haab, Gerolf Hoflehner, Daniel Lavery, Hideki Saito, Sanjiv Shah, Dongkeun Kim
System, method, and apparatus for spilling and filling rotating registers in software-pipelined loops

Patent number: 7316012

Abstract: An efficient method for software-pipelining (SWP) of loops to translate programs, from higher level languages into equivalent object or machine language code for execution on a computer. In one example embodiment, this is accomplished by spilling and filling multiple computed values, in a register, that are live across multiple stages in a software-pipelined loop, using multiple rotating stack memory locations to reduce compiler-time of SWP, and complexity of the implemented SWP.

Type: Grant

Filed: September 29, 2003

Date of Patent: January 1, 2008

Assignee: Intel Corporation

Inventor: Kalyan Muthukumar
Ordering of high use program code segments using simulated annealing

Patent number: 7308683

Abstract: An apparatus, program product and method utilize a heuristic-based algorithm such as simulated annealing to order program code segments in a computer memory to provide improved computer performance in terms of memory access, e.g., by minimizing cache misses or other memory-related performance penalties that may be present in a multi-level memory architecture. Program code is ordered in a computer memory by selecting an ordering from among a plurality of orderings for a plurality of program code segments using a heuristic algorithm, and ordering the plurality of program code segments in a memory of a computer using the selected ordering.

Type: Grant

Filed: October 30, 2003

Date of Patent: December 11, 2007

Assignee: International Business Machines Corporation

Inventor: Ross E. Johnson
Control flow based compression of execution traces

Patent number: 7308681

Abstract: A method and apparatus for creating a compressed trace for a program, wherein events are compressed separately to provide improved compression and tracing. A sequence of events for a program is selected, and a sequence of values is then determined for each of the selected events occurring during an execution of the program. Each sequence of values is then compressed to generate a compressed sequence of values for each event. These values are then ordered in accordance with information stored in selected events (such as for example, branch events), where the ordered values correspond to the trace.

Type: Grant

Filed: October 28, 2003

Date of Patent: December 11, 2007

Assignee: International Business Machines Corporation

Inventors: Kattamuri Ekanadham, Pratap Pattnaik, Simone Sbaraglia, Luiz A. DeRose
Data repacking for memory accesses

Patent number: 7302680

Abstract: A method and apparatus are provided for repacking of memory data. For at least one embodiment, data for a plurality of store instructions in a source code program is loaded from memory into the appropriate sub-location of a proxy storage location. The packed data is then written with a single instruction from the proxy storage location into contiguous memory locations.

Type: Grant

Filed: November 4, 2002

Date of Patent: November 27, 2007

Assignee: Intel Corporation

Inventors: Jean-Francois C. Collard, Kalyan Muthukumar
Data processing apparatus and associated method

Patent number: 7240345

Abstract: In accordance with a plurality of processing requests, a SAM chip generates IC card entity data including job execution order data showing an order of execution of a plurality of jobs forming processing in accordance with a processing request and status data showing a state of progress of execution of said plurality of jobs for each of said processing requests. Further, the SAM chip selects one entity data from said plurality of entity data, selects and executes the job to be executed next based on the status data and processing order data of said selected entity data, and updates the status data in accordance with execution of said job.

Type: Grant

Filed: February 15, 2002

Date of Patent: July 3, 2007

Assignee: Sony Corporation

Inventors: Masahiro Sueyoshi, Fumio Kubono, Kei Tateno
Method for processing data using external and internal identifiers to provide reliable access to the data even when reconfigurations of the data occur, and associated system

Patent number: 7234135

Abstract: A method for processing data in a data processing system (1) which includes a number of data processing units (2), and operator and observation units (4), which are interconnected by way of a data transmission unit (6), whereby a respective data processing program (8) with corresponding function modules (FB1 to FBn) and data modules (DB 1 to DBn) is implemented in the data processing units (2). At least one identifier (K, KE, KI) for the function modules and data modules is stored in a respective conversion table (10) belonging to each data processing unit (2). When an external data processing unit (2) accesses a data module (DB1 to DBn) and/or a function module (FB1 to FBn) of another data processing unit (2), a conversion table (10) mediates by using an external identifier (KE) that characterizes the access and verifies whether a matching internal identifier (KI) is stored for the particular external identifier (KE).

Type: Grant

Filed: January 2, 2004

Date of Patent: June 19, 2007

Assignee: Siemens Aktiengesellschaft

Inventors: Wilhelm Bollhoefer, Michael Dreher, Juergen Laforsch
Method and apparatus for selecting references for prefetching in an optimizing compiler

Patent number: 7234136

Abstract: One embodiment of the present invention provides a system that generates code to perform anticipatory prefetching for data references. During operation, the system receives code to be executed on a computer system. Next, the system analyzes the code to identify data references to be prefetched. This analysis can involve: using a two-phase marking process in which blocks that are certain to execute are considered before other blocks; and analyzing complex array subscripts. Next, the system inserts prefetch instructions into the code in advance of the identified data references. This insertion can involve: dealing with non-constant or unknown stride values; moving prefetch instructions into preceding basic blocks; and issuing multiple prefetches for the same data reference.

Type: Grant

Filed: November 2, 2001

Date of Patent: June 19, 2007

Assignee: Sun Microsystems, Inc.

Inventors: Partha P Tirumalai, Spiros Kalogeropulos, Mahadevan Rajagopalan, Yonghong Song, Vikram Rao
Apparatus and method for distribution of work on a doubly linked list among processing threads

Patent number: 7213244

Abstract: An apparatus and method for distributing work on a doubly linked list to a plurality of worker threads are provided. With the apparatus and method, an initial thread obtains the list lock for the linked list and inserts a marker element at the beginning of the linked list. Elements in the linked list that are before the marker element in the linked list are considered work that has been assigned or work that has been completed. Elements of the linked list that are after the marker element in the linked list are work to be done. The initial thread spawns worker threads to perform the work on the linked list and passes the address of the marker element to each of the worker threads. Each worker thread then operates independently to perform work on the linked list based on the current position of the marker element in the linked list.

Type: Grant

Filed: March 13, 2003

Date of Patent: May 1, 2007

Assignee: International Business Machines Corporation

Inventors: Joon Chang, Jean-Philippe Sugarbroad
System and method for optimizing operations via dataflow analysis

Patent number: 7185327

Abstract: A method for modifying serial dependencies in a procedure includes a step of building a graph representation of the procedure. The graph representation has an origin as well as a unique position, relative to the origin, for each memory operation in the procedure. Each memory operation in the representation is designated with a location type. Each of these location types are based on one or more characteristics of the corresponding memory operation that are sufficient to notify the compiler that the memory operation accesses a distinct or disjoint memory location. Memory operations having the same location type as subsequent memory operations are identified. When the graph representation does not include additional memory operations of the same location type between pairs of such memory operations, the subsequent memory operation is moved to a position in the intermediate representation that is closer to the origin.

Type: Grant

Filed: January 9, 2001

Date of Patent: February 27, 2007

Assignee: Hewlett-Packard Development Company, L.P.

Inventor: Daniel J. Scales
Methods and apparatus for indirect VLIW memory allocation

Patent number: 7181730

Abstract: Techniques and a set of heuristics are described to perform allocation of the special instruction memory where indirect very long instruction words (VLIW's) are stored for the ManArray family of multiprocessor digital signal processors (DSP). This approach substantially reduces the cost of pre-initializing the contents of VLIWs.

Type: Grant

Filed: June 21, 2001

Date of Patent: February 20, 2007

Assignee: Altera Corporation

Inventors: Nikos P. Pitsianis, Benjamin Strautin, Sanjay Banerjee, Gerald G. Pechanek
Method and apparatus for simultaneous optimization of code targeting multiple machines

Patent number: 7140010

Abstract: Method and apparatus for simultaneous optimization of the compiler to generate codes that may be compatible and acceptable for two or more different processors without potentially sacrificing the performance on any processors is provided. In particular, the rules of instructions scheduling for the machines of interest of different processors are abstracted. From the abstractions, a hypothetical machine is generated that is the restrictive or constraining set of the actual machines modeled in the abstraction step. After generating the hypothetical machine, the restricted hypothetical machine is targeted rather than the actual machines modeled in the first step. Thereafter, conflicts, if any are resolved by modeling the performance impact and selecting the less damaging choice.

Type: Grant

Filed: March 30, 2001

Date of Patent: November 21, 2006

Assignee: Sun Microsystems, Inc.

Inventors: Partha P. Tirumalai, Mahadevan Rajagopalan
Code conversion method and apparatus

Patent number: 7114151

Abstract: Interlocked floating-point instructions are detected, and a register address referring to and assigning an operand in the interlocked instructions is changed to an odd-number address not assigned as any operation at the time of compiling. Next, an instruction not in any register-dependency relation with the interlocked instructions is detected, and the detected instruction is inserted between instructions interlocked with each other. Thus a program can be executed with an improved efficiency.

Type: Grant

Filed: June 5, 2003

Date of Patent: September 26, 2006

Assignee: Sony Corporation

Inventor: Tetsuya Okada
Information system and data access method

Patent number: 7107291

Abstract: A data access method in an information system including a plurality of data utilization systems connected to a network N1, and a plurality of data provision systems connected to a network, wherein a data utilization system transmits a request for utilizing data in a data provision system to another data utilization system P2 through the network N1, the data utilization system, upon receipt of the data utilization request, transmits a processing execution request corresponding to the data utilization request to the data provision system through the network N2, the data provision system, upon receipt of the execution request, executes processing corresponding to this execution request and transmits necessary data to the data provision system through the network, and the data provision system receives the data and stores the same.

Type: Grant

Filed: February 1, 2005

Date of Patent: September 12, 2006

Assignee: Hitachi, Ltd.

Inventors: Norifumi Nishikawa, Shoichi Minami, Takanobu Otani, Yasuharu Namba, Hirotaka Mizuno
Optimization apparatus, complier program, optimization method and recording medium

Patent number: 7103882

Abstract: An optimization apparatus (compiler program, method and recording medium) for changing the order of execution of instructions in a program to be optimized includes an exception occasion instruction detection section which detects a first instruction having a possibility to cause an exception, an assurance instruction detection section which detects a second instruction executed prior to the first instruction, the second instruction assuring that no exception of the first instruction occurs, and an execution order control section which changes the position of the first instruction in execution order so that the first instruction is executed before a conditional branch instruction for selectively executing the first instruction and after the second instruction.

Type: Grant

Filed: May 29, 2003

Date of Patent: September 5, 2006

Assignee: International Business Machines Corporation

Inventor: Motohiro Kawahito
Methods and apparatus to avoid dynamic micro-architectural penalties in an in-order processor

Patent number: 7100157

Abstract: Methods and apparatus to avoid dynamic micro-architectural penalties in an in-order processor are disclosed. In an example, a compiler inserts decision code into the object code to thereby cause the in-order processor to operate like an out of order processor to operate with respect to at least some memory access instructions. The decision code determines at run time of the object code whether a load instruction is a likely cache hit. Based on that determination, the decision code causes the in-order processor to execute a use instruction at a first time if the load instruction is a likely cache hit, and to execute a copy of the use instruction at a second, later time if the load instruction is not a likely cache hit.

Type: Grant

Filed: September 24, 2002

Date of Patent: August 29, 2006

Assignee: Intel Corporation

Inventor: Jean-Francois C. Collard
Detection of reduction variables in an assignment statement

Patent number: 7089545

Abstract: This invention relates to a method, system and program product to detect reduction variables in assignment statements in the source code for enabling the parallel execution of program loops. The reduction variables found using the method herein described can be tagged to the respective loops and passed to a compiler through compiler directives for parallelizing the reduction operation, along with the information about each variable's respective associative operator.

Type: Grant

Filed: December 17, 2002

Date of Patent: August 8, 2006

Assignee: International Business Machines Corporation

Inventor: Rajendra K. Bera
Multi-thread execution method and parallel processor system

Patent number: 7082601

Abstract: In a parallel processor system for executing a plurality of threads in parallel to each other by a plurality of thread execution units, the respective thread execution units allow for forking of a slave thread from an individual thread execution unit into another arbitrary thread execution unit. The respective thread execution units are managed in three states, a free state where fork is possible, a busy state where a thread is being executed, and a term state where a thread being terminated and yet to be settled exists. At the time of forking of a new thread, when there exists no thread execution unit at the free state, a thread that the thread execution unit at the term state has is merged into its immediately succeeding slave thread to bring the thread execution unit in question to the free state and conduct forking of a new thread.

Type: Grant

Filed: July 17, 2002

Date of Patent: July 25, 2006

Assignee: NEC Corporation

Inventors: Taku Ohsawa, Atsufumi Shibayama, Satoshi Matsushita
Function unit based finite state automata data structure, transitions and methods for making the same

Patent number: 7082602

Abstract: We disclose a function unit based finite state automata data structure for use in computer program compilers. According to an aspect of an embodiment, the data structure comprises a function unit vector, having no more used bits than there are issue ports for any particular microprocessor, and a plurality of valid template assignments for each function unit vector. In a preferred embodiment, the template assignments are constructed so as to account for dispersal rules associated with the particular microprocessor. Further, the template assignments can be sorted according to priority data.

Type: Grant

Filed: April 12, 2002

Date of Patent: July 25, 2006

Assignee: Intel Corporation

Inventors: Chen Fu, Dong-Yuan Chen, Chengyong Wu, Dz-Ching Ju
Processor for executing instructions in units that are unrelated to the units in which instructions are read, and a compiler, an optimization apparatus, an assembler, a linker, a debugger and a disassembler for such processor

Patent number: 7080367

Abstract: When a branch instruction is decoded by the instruction decoders 409a˜409c, the upper 29 bits of the PC relative value included in the branch instruction are sent to the upper PC calculator 411 and the lower 3 bits are sent to the lower PC calculator 405. The lower PC calculator 405 adds the lower 3 bits of the PC relative value and the lower 3 bits of the present lower PC 404 and sends the result to the lower PC 404 as the updated lower PC. The upper PC calculator 411 adds the upper 29 bits of the PC relative value, the upper 29 bits of the present upper PC 403, and a carry that may be received from the lower PC calculator 405, and sends the result to the upper PC 403 as the updated upper PC.

Type: Grant

Filed: October 19, 2001

Date of Patent: July 18, 2006

Assignee: Matsushita Electric Industrial Co., Ltd.

Inventors: Shuichi Takayama, Hajime Ogawa, Kenichi Kawaguchi, Nobuo Higaki, Kensuke Odani, Tetsuya Tanaka, Shinya Miyaji, Taketo Heishi
Run-time parallelization of loops in computer programs with static irregular memory access patterns

Patent number: 7076777

Abstract: Run-time parallelization of loops with static irregular read-write memory access patterns is performed across multiple arrays. More than one element from each array can be read or written during each iteration of the loop.

Type: Grant

Filed: August 7, 2002

Date of Patent: July 11, 2006

Assignee: International Business Machines Corporation

Inventor: Radhakrishnan Srinivasan
Compiler device with branch instruction inserting unit

Patent number: 7073169

Abstract: A compiler device includes a conditional-executable-instruction generation unit and a branch instruction insertion unit. The conditional-executable-instruction generation unit generates a conditional executable instruction that is executed when a condition that the conditional executable instruction refers to is satisfied. In the case where there is a section containing a non-executive condition under which no instruction is executed in one cycle or a plurality of cycles in series, the branch instruction insertion unit inserts a conditional branch instruction that refers to the non-executive condition and instructs to branch to a cycle immediately after a last cycle of the section, to after an instruction of a cycle immediately before a start of the section.

Type: Grant

Filed: June 17, 2002

Date of Patent: July 4, 2006

Assignee: Matsushita Electric Industrial Co., Ltd.

Inventors: Hajime Ogawa, Shuichi Takayama, Taketo Heishi, Nobuo Higaki
Method and apparatus for controlling multiple resources using thermal related parameters

Patent number: 7069189

Abstract: In some embodiments of the present invention, a method and system are provided in a multiple resource environment for relieving a thermal condition by applying cooling techniques or throttling to lower priority threads.

Type: Grant

Filed: December 31, 2002

Date of Patent: June 27, 2006

Assignee: Intel Corporation

Inventor: Efraim Rotem
Method and system for scheduling software pipelined loops

Patent number: 7058938

Abstract: A method and a system for scheduling a software pipelined loop with indirect loads. The system may include a data structure in communication with a processor and a memory. The processor may determine a condition associated with a potential for saturation of the data structure. Accordingly, the processor may provide a number of instructions associated with the software pipelined loop from the memory to a queue of the data structure prior to processing of the instructions by the processor based on the condition associated with a potential for saturation of the data structure.

Type: Grant

Filed: March 5, 2003

Date of Patent: June 6, 2006

Assignee: Intel Corporation

Inventor: Lynd M. Stringer
Methods and systems for integrated scheduling and resource management for a compiler

Patent number: 7058937

Abstract: A compiler comprising an integrated instruction scheduler and resource management system is provided. According to an aspect of an embodiment, the resource management system includes a function unit based finite state automata system. Instructions to be compiled are modeled through the function unit based finite state automata system based on their function unit usage, before they are emitted as compiled computer code. We also disclose a function unit based finite state automata data structure and computer implemented methods for making the same.

Type: Grant

Filed: April 12, 2002

Date of Patent: June 6, 2006

Assignee: Intel Corporation

Inventors: Chen Fu, Dong-Yuan Chen, Chengyong Wu, Dz-Ching Ju
Method and system for optimizing the use of processors when compiling a program

Patent number: 7055144

Abstract: A method and system for optimizing the use of a plurality of processors when compiling a program in a computer system is disclosed. The method and system comprises providing a list of directories and a list of processors. The method and system further includes determining when a directory is available. The method and system includes assigning a directory to a next available processor in an ordered manner to allow the next available processor to compile at least one file within the directory. Finally, the method and system includes ensuring that the maximum number of directories can be processed by assigning a processor thereto. Through the use of the method and system in accordance with the present invention, compile cycle time for large programs is significantly reduced. Accordingly, the dependencies are updated simultaneously with the code changes, thereby allowing for the compiling of the large program with minimal dependency violations.

Type: Grant

Filed: July 12, 2001

Date of Patent: May 30, 2006

Assignee: International Business Machines Corporation

Inventor: Matthew Edward Aubertine
Method and apparatus for prioritizing software tests

Patent number: 7028290

Abstract: A computer system and method is provided for prioritizing software tests. Software tests are prioritized based on coverage indicators for the software tests and an indication of impacted areas of the software. Each of the coverage indicators indicates which portions of the software are executed for a respective one of the software tests. The portions of the software include at least one of a plurality of blocks. The indication of impacted areas of the software indicates ones of the plurality of blocks that are modified or new with respect to a previous version of the software. A prioritized list of software tests is output. The prioritized list includes at least one test sequence.

Type: Grant

Filed: April 29, 2002

Date of Patent: April 11, 2006

Assignee: Microsoft Corporation

Inventors: Amitabh Srivastava, Jayaraman Thiagarajan
Specifying an invariant property (range of addresses) in the annotation in source code of the computer program

Patent number: 7013460

Abstract: Method and apparatus for verifying at runtime an invariant property of a data structure. In various example embodiments, code that verifies whether a runtime value of the data structure is consistent with the invariant property is automatically generated in response to an annotation of the data structure in the source code. In executing the program, the runtime value of the data structure is compared to the invariant property in the automatically generated code. If the runtime property is inconsistent with the invariant property, the program branches to exception handler code.

Type: Grant

Filed: May 15, 2001

Date of Patent: March 14, 2006

Assignee: Hewlett-Packard Development Company, L.P.

Inventors: Carol L. Thompson, Jeff Littfin
Method and apparatus for multi-versioning loops to facilitate modulo scheduling

Patent number: 6993757

Abstract: One embodiment of the present invention provides a system that facilitates multi-versioning loops to facilitate modulo scheduling. Upon receiving a computer program, the system analyzes the code to locate loops within the program. When a loop is located, the system examines the loop termination condition to determine if it is based on a “not-equal-to” condition that makes it hard to determine beforehand whether the loop will terminate. If the loop termination condition is based on a “not-equal-to” condition, the system creates multiple versions of the loop, at least one of which will terminate and can be modulo scheduled, and at least one of which might be an infinite loop and consequently cannot be modulo scheduled.

Type: Grant

Filed: May 6, 2002

Date of Patent: January 31, 2006

Assignee: Sun Microsystems, Inc.

Inventor: Mahadevan Rajagopalan
Method of transforming variable loops into constant loops

Patent number: 6988266

Abstract: A system and method for processing a variable looping statement into a constant looping statement to enable loop unrolling. A lower bound and an upper bound of the loop index within the variable looping statement are determined. A constant looping statement is then formed using the lower bound and upper bound to define a range over which the loop index varies within the constant looping statement. The constant looping statement further includes a conditional statement that reflects conditions in the initial expression and/or the exit expression of the variable looping statement. The conditional statement controls execution of the body of the generated constant looping statement, which includes the body from the original variable looping statement. Loop unrolling may then be performed on the generated constant looping statement.

Type: Grant

Filed: May 8, 2001

Date of Patent: January 17, 2006

Assignee: Sun Microsystems, Inc.

Inventors: William K. Lam, David S. Allison
Circuits, systems and methods for performing branch predictions by selectively accessing bimodal and fetch-based history tables

Patent number: 6976157

Abstract: Branch prediction circuitry including a bimodal branch history table, a fetch-based branch history table and a selector table is provided. The local branch history table includes a plurality of entries each for storing a prediction value and accessed by selected bits of a branch address. The fetch-based branch history table included a plurality of entries for storing a prediction value and accessed by a pointer generated from selected bits of the branch address and bits from a history register. The selector table includes a plurality of entries each for storing a selection bit and accessed by a pointer generated from selected bits from the branch address and bits from the history register, each selector bit is used for selecting between a prediction value accessed from the local history table and a prediction value accessed from the fetch-based history table.

Type: Grant

Filed: November 4, 1999

Date of Patent: December 13, 2005

Assignee: International Business Machines Corporation

Inventor: Balaram Sinharoy
Generating hardware interfaces for designs specified in a high level language

Patent number: 6952817

Abstract: A method of processing a general-purpose, high level language program to determine a hardware representation of the program can include compiling the general-purpose, high level language program to generate a language independent model (105, 110, and 115). The language independent model can be scheduled such that each component is activated when both control and valid data arrive at the component (120). An interface structure specifying a hardware interface through which devices external to the language independent model interact with a physical implementation of the language independent model can be defined and included in the language independent model (200, 300, 400).

Type: Grant

Filed: December 4, 2002

Date of Patent: October 4, 2005

Assignee: Xilinx, Inc.

Inventors: Jonathan C. Harris, Stephen G. Edwards, James E. Jensen, Andreas B. Kollegger, Ian D. Miller, Christopher R. S. Schanck
Small memory footprint system and method for separating applications within a single virtual machine

Patent number: 6938247

Abstract: A system and method for isolating the execution of a plurality of applications. The applications may utilize or share one or more “original” classes. Only one copy of each original class is maintained, regardless of how many applications utilize it. Static fields are extracted from the original classes. A separate copy of the static fields is created for each of the utilizing applications. A static field class which includes instance fields corresponding to the static fields may be created, wherein each instance of the static field class corresponds to one of the utilizing applications. Access methods for the one or more static fields may be created, wherein the access methods are operable to access the corresponding separate copy of the static fields based upon the identity of the utilizing application. A single access methods class may be created for each original class, wherein the single access methods class includes the access methods for accessing the extracted fields from the original class.

Type: Grant

Filed: May 19, 2003

Date of Patent: August 30, 2005

Assignee: Sun Microsystems, Inc.

Inventor: Grzegorz J. Czajkowski
Method and apparatus for controlling multiple resources using thermal related parameters

Patent number: 6931360

Abstract: In some embodiments of the present invention, a method and system are provided in a multiple resource environment for relieving a thermal condition by applying cooling techniques or throttling to lower priority threads.

Type: Grant

Filed: December 31, 2002

Date of Patent: August 16, 2005

Assignee: Intel Corporation

Inventor: Efraim Rotem
Debugger impact reduction through breakpoint motion

Patent number: 6922826

Abstract: A first type of debugger impact reduction includes removing, from within a loop, an initial conditional breakpoint (“ICB”); extracting a first Boolean expression (“BE_1”) therefrom; setting a special conditional breakpoint (“SCB”) including the BE_1; and reestablishing the ICB if the SCB is satisfied. Optionally, the first type may further include extracting, from code within the loop, a second Boolean expression (“BE_2”); disjunctively including its complement in the SCB; and setting a reset breakpoint at loop exit program positions to remove reset breakpoints and/or the ICB. A second type includes setting the SCB with the complement of BE_1; and removing the ICB when the SCB is satisfied. Optionally, the second type may further include conjunctively including the BE_2 in the SCB; and setting a reset breakpoint to remove reset breakpoints and/or reestablish the ICB. The above may be embodied in a method, a program debugger and an article of manufacture.

Type: Grant

Filed: July 3, 2001

Date of Patent: July 26, 2005

Assignee: International Business Machines Corporation

Inventors: Cary Lee Bates, William Jon Schmidt
System and method for transforming object code

Patent number: 6922824

Abstract: A method comprising: converting bytecodes into a graph of jop objects to track where jump operations pointed before modification of the bytecodes; adjusting constant pool references from local to global numbers based on the graph; and combining the bytecodes into a bundle.

Type: Grant

Filed: May 2, 2001

Date of Patent: July 26, 2005

Assignee: Danger, Inc.

Inventor: Brian Swetland
System and method for scheduling instructions to maximize outstanding prefetches and loads

Patent number: 6918111

Abstract: The present invention discloses a method and device for ordering memory operation instructions in an optimizing compiler. for a processor that can potentially enter a stall state if a memory queue is full. The method uses a dependency graph coupled with one or more memory queues. The dependency graph is used to show the dependency relationships between instructions in a program being compiled. After creating the dependency graph, the ready nodes are identified. Dependency graph nodes that correspond to memory operations may have the effect of adding an element to the memory queue or removing one or more elements from the memory queue. The ideal situation is to keep the memory queue as full as possible without exceeding the maximum desirable number of elements, by scheduling memory operations to maximize the parallelism of memory operations while avoiding stalls on the target processor.

Type: Grant

Filed: October 3, 2000

Date of Patent: July 12, 2005

Assignee: Sun Microsystems, Inc.

Inventors: Peter C. Damron, Nicolai Kosche

prev 1 2 3 4 5 6 7 next