Patents by Inventor Alexandre Eichenberger

Alexandre Eichenberger has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Framework for Integrated Intra- and Inter-Loop Aggregation of Contiguous Memory Accesses for SIMD Vectorization

Publication number: 20080010634

Abstract: A method, computer program product, and information handling system for generating loop code to execute on Single-Instruction Multiple-Datapath (SIMD) architectures, where the loop contains multiple non-stride-one memory accesses that operate over a contiguous stream of memory is disclosed. A preferred embodiment identifies groups of isomorphic statements within a loop body where the isomorphic statements operate over a contiguous stream of memory over the iteration of the loop. Those identified statements are then converted into virtual-length vector operations. Next, the hardware's available vector length is used to determine a number of virtual-length vectors to aggregate into a single vector operation for each iteration of the loop. Finally, the aggregated, vectorized loop code is converted into SIMD operations.
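Illustration (not part of the filing): the aggregation idea can be sketched in Python with plain lists standing in for memory. The loop shape, the vector length V = 4, and every function name below are assumptions chosen for the example; the abstract does not specify them.

```python
# Sketch: aggregate two isomorphic non-stride-one statements into one
# stride-one vector operation. All names here are hypothetical.

V = 4  # assumed hardware vector length, in elements


def scalar_loop(b, c, n):
    """Original loop: two isomorphic statements, each with stride 2."""
    a = [0] * (2 * n)
    for i in range(n):
        a[2 * i] = b[2 * i] + c[2 * i]
        a[2 * i + 1] = b[2 * i + 1] + c[2 * i + 1]
    return a


def aggregated_loop(b, c, n):
    """Same computation after aggregation.

    Together the two statements touch a[2i] and a[2i+1], a contiguous
    stream, so each iteration forms a virtual vector of length 2.
    V // 2 virtual vectors are packed into one hardware-length operation.
    """
    virtual_len = 2
    per_vector = V // virtual_len  # loop iterations covered per vector op
    total = virtual_len * n
    a = [0] * total
    main = (n // per_vector) * per_vector * virtual_len
    for base in range(0, main, V):
        # One simulated SIMD add over V contiguous elements.
        a[base:base + V] = [x + y for x, y in zip(b[base:base + V], c[base:base + V])]
    for j in range(main, total):  # scalar remainder
        a[j] = b[j] + c[j]
    return a
```

Both functions return the same list; the second one simply issues one V-wide operation for every V // 2 original iterations.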

Type: Application

Filed: September 17, 2007

Publication date: January 10, 2008

Inventors: Alexandre Eichenberger, Kai-Ting Wang, Peng Wu

Method for improving processing of relatively aligned memory references for increased reuse opportunities

Publication number: 20070226453

Abstract: Computer implemented method, system and computer program product for aligning vectors to be processed by SIMD code. A pair of vectors to be aligned at runtime and having a known relative alignment at compile time is identified. A modified second memory reference is generated by modifying an address of the second memory reference to be in a same congruence class as the first memory reference, wherein the congruence class is mod V and wherein V is SIMD byte width. A first SIMD load located at the modified second memory reference and a next adjacent SIMD load located at a third memory reference corresponding to the modified second memory reference address plus V are loaded, and the first SIMD load and the next adjacent SIMD load are concatenated to generate a resultant vector of length 2V.
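Illustration (not part of the filing): a minimal Python sketch of the address adjustment and the 2V concatenation. It assumes V = 16 bytes, assumes the first reference is already V-aligned (a simplification of the runtime case in the abstract), and adds a final selection step that the abstract does not describe. All names are hypothetical.

```python
V = 16  # assumed SIMD width in bytes


def aligned_load(memory, addr):
    """Model a SIMD load that only accepts V-byte-aligned addresses."""
    assert addr % V == 0, "SIMD loads must be V-byte aligned"
    return memory[addr:addr + V]


def load_second_reference(memory, addr1, addr2):
    """Fetch the V bytes at addr2 using only aligned loads.

    Simplifying assumption of this sketch: addr1 is V-aligned.
    """
    d = (addr2 - addr1) % V   # relative alignment, known at compile time
    modified = addr2 - d      # same congruence class as addr1 (mod V)
    first = aligned_load(memory, modified)
    adjacent = aligned_load(memory, modified + V)
    combined = first + adjacent  # resultant vector of length 2V
    # Assumed follow-up step: select the V bytes that start at offset d.
    return combined[d:d + V]
```

With memory = list(range(64)), addr1 = 16 and addr2 = 37, the relative alignment is 5, the modified address is 32, and the function returns the same bytes as memory[37:53].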

Type: Application

Filed: March 23, 2006

Publication date: September 27, 2007

Inventors: Alexandre Eichenberger, Rohini Nair, Kai-Ting Wang, Peng Wu, Peng Zhao

Efficient generation of SIMD code in presence of multi-threading and other false sharing conditions and in machines having memory protection support

Publication number: 20070226723

Abstract: A computer implemented method, system and computer program product for automatically generating SIMD code, particularly in the presence of multi-threading and other false sharing conditions, and in machines having a segmented/virtual page memory protection system. The method begins by analyzing data to be accessed by a targeted loop including at least one statement, where each statement has at least one memory reference, to determine if memory accesses are safe. If memory accesses are safe, the targeted loop is simdized. If not safe, it is determined if a scheme can be applied in which safety need not be guaranteed. If such a scheme can be applied, the scheme is applied and the targeted loop is simdized according to the scheme. If such a scheme cannot be applied, it is determined if padding is appropriate. If padding is appropriate, the data is padded and the targeted loop is simdized.
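Illustration (not part of the filing): the decision sequence in the abstract can be written as a short Python function. The function name, the boolean inputs, and the returned labels are hypothetical, and the final fallback is an assumption because the abstract does not say what happens when padding is not appropriate.

```python
def plan_simdization(accesses_safe, unguarded_scheme_applies, padding_appropriate):
    """Return the action chosen for one targeted loop."""
    if accesses_safe:
        return "simdize"
    if unguarded_scheme_applies:
        # A scheme in which safety need not be guaranteed.
        return "apply scheme, then simdize"
    if padding_appropriate:
        return "pad data, then simdize"
    # Not covered by the abstract; leaving the loop scalar is an
    # assumption of this sketch.
    return "leave scalar"
```

The checks run in the order the abstract lists them, so a loop whose accesses are already safe is never padded.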

Type: Application

Filed: February 21, 2006

Publication date: September 27, 2007

Inventors: Alexandre Eichenberger, Kai-Ting Wang, Peng Wu, Peng Zhao

Method to analyze and reduce number of data reordering operations in SIMD code

Publication number: 20070192762

Abstract: A method for analyzing data reordering operations in Single Instruction Multiple Data (SIMD) source code and generating executable code therefrom is provided. Input is received. One or more data reordering operations in the input are identified and each data reordering operation in the input is abstracted into a corresponding virtual shuffle operation so that each virtual shuffle operation forms part of an expression tree. One or more virtual shuffle trees are collapsed by combining virtual shuffle operations within at least one of the one or more virtual shuffle trees to form one or more combined virtual shuffle operations, wherein each virtual shuffle tree is a subtree of the expression tree that only contains virtual shuffle operations. Then code is generated for the one or more combined virtual shuffle operations.
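Illustration (not part of the filing): collapsing shuffles amounts to composing their index patterns. The Python sketch below assumes the simplest case, a chain of single-input shuffles, and uses hypothetical function names; a real virtual shuffle tree may have several inputs.

```python
def shuffle(vec, pattern):
    """Apply one shuffle: output lane k takes input lane pattern[k]."""
    return [vec[i] for i in pattern]


def combine(inner, outer):
    """Pattern equivalent to shuffle(shuffle(v, inner), outer)."""
    return [inner[i] for i in outer]


def collapse(patterns):
    """Collapse a chain of shuffles, applied first to last, into one pattern."""
    combined = patterns[0]
    for pattern in patterns[1:]:
        combined = combine(combined, pattern)
    return combined
```

For example, reversing four lanes with [3, 2, 1, 0] and then applying [1, 1, 2, 3] collapses to the single pattern [2, 2, 1, 0], so one shuffle is generated instead of two.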

Type: Application

Filed: January 26, 2006

Publication date: August 16, 2007

Inventors: Alexandre Eichenberger, Kai-Ting Wang, Peng Wu, Peng Zhao

Apparatus and method for optimizing scalar code executed on a SIMD engine by alignment of SIMD slots

Publication number: 20070174825

Abstract: An apparatus and method for optimizing scalar code executed on a single instruction multiple data (SIMD) engine is provided that aligns the slots of SIMD registers. With the apparatus and method, a compiler is provided that parses source code and, for each statement in the program, generates an expression tree. The compiler inspects all storage inputs to scalar operations in the expression tree to determine their alignment in the SIMD registers. This alignment is propagated up the expression tree from the leaves. When the alignments of two operands in the expression tree are the same, the resulting alignment is the shared value. When the alignments of two operands in the expression tree are different, one operand is shifted. For shifted operands, a shift operation is inserted in the expression tree. The executable code is then generated for the expression tree and shifts are inserted where indicated.
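Illustration (not part of the filing): a small Python sketch of propagating slot alignment up an expression tree and inserting shifts where operands disagree. The node classes are hypothetical, and always shifting the right operand is an assumption; the abstract only says that one operand is shifted.

```python
from dataclasses import dataclass
from typing import Union


@dataclass
class Leaf:
    name: str
    slot: int  # SIMD register slot in which the scalar value resides


@dataclass
class Op:
    op: str
    left: "Node"
    right: "Node"


@dataclass
class Shift:
    operand: "Node"
    to_slot: int


Node = Union[Leaf, Op, Shift]


def align(node):
    """Return (rewritten_tree, slot), propagating slots up from the leaves."""
    if isinstance(node, Leaf):
        return node, node.slot
    left, left_slot = align(node.left)
    right, right_slot = align(node.right)
    if left_slot != right_slot:
        # Assumption: shift the right operand into the left operand's slot.
        right = Shift(right, left_slot)
    return Op(node.op, left, right), left_slot


def count_shifts(node):
    """Count the shift operations present in a rewritten tree."""
    if isinstance(node, Leaf):
        return 0
    if isinstance(node, Shift):
        return 1 + count_shifts(node.operand)
    return count_shifts(node.left) + count_shifts(node.right)
```

For (x * y) + z with x and y in slot 0 and z in slot 2, the multiply needs no shift, and a single shift is inserted on z before the add.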

Type: Application

Filed: January 25, 2006

Publication date: July 26, 2007

Inventors: Alexandre Eichenberger, John Kevin O'Brien

Method and system for versioning codes based on relative alignment for single instruction multiple data units

Publication number: 20070169058

Abstract: A method and system for generating efficient versioned codes for single instruction multiple data units whose memory systems have alignment constraints. The system creates multiple versions of codes based on relative alignments of the data streams involved in the computation. The system also analyzes characteristics of relative alignments (e.g. compile-time or runtime) to determine whether code versioning is beneficial based on a cost model.

Type: Application

Filed: January 17, 2006

Publication date: July 19, 2007

Inventors: Alexandre Eichenberger, Amy Wang, Peng Wu, Peng Zhao

Method and system for data-driven runtime alignment operation

Publication number: 20070011441

Abstract: A method for processing instructions and data in a processor includes steps of: preparing an input stream of data for processing in a data path in response to a first set of instructions specifying a dynamic parameter; and processing the input stream of data in the same data path in response to a second set of instructions. A common portion of a dataflow is used for preparing the input stream of data for processing in response to a first set of instructions under the control of a dynamic parameter specified by an instruction of the first set of instructions, and for operand data routing based on the instruction specification of a second set of instructions during the processing of the input stream in response to the second set of instructions.

Type: Application

Filed: July 8, 2005

Publication date: January 11, 2007

Inventors: Alexandre Eichenberger, Michael Gschwind, Valentina Salapura, Peng Wu

Framework for integrated intra- and inter-loop aggregation of contiguous memory accesses for SIMD vectorization

Publication number: 20050283775

Abstract: A method, computer program product, and information handling system for generating loop code to execute on Single-Instruction Multiple-Datapath (SIMD) architectures, where the loop contains multiple non-stride-one memory accesses that operate over a contiguous stream of memory is disclosed. A preferred embodiment identifies groups of isomorphic statements within a loop body where the isomorphic statements operate over a contiguous stream of memory over the iteration of the loop. Those identified statements are then converted into virtual-length vector operations. Next, the hardware's available vector length is used to determine a number of virtual-length vectors to aggregate into a single vector operation for each iteration of the loop. Finally, the aggregated, vectorized loop code is converted into SIMD operations.

Type: Application

Filed: August 16, 2004

Publication date: December 22, 2005

Applicant: International Business Machines Corporation

Inventors: Alexandre Eichenberger, Kai-Ting Wang, Peng Wu

Framework for efficient code generation using loop peeling for SIMD loop code with multiple misaligned statements

Publication number: 20050283773

Abstract: A system and method is provided for vectorizing misaligned references in compiled code for SIMD architectures that support only aligned loads and stores. In this framework, a loop is first simdized as if the memory unit imposes no alignment constraints. The compiler then inserts data reorganization operations to satisfy the actual alignment requirements of the hardware. Finally, the code generation algorithm generates SIMD codes based on the data reorganization graph, addressing realistic issues such as runtime alignments, unknown loop bounds, residual iteration counts, and multiple statements with arbitrary alignment combinations. Loop peeling is used to reduce the computational overhead associated with misaligned data. A loop prologue and epilogue are peeled from individual iterations in the simdized loop, and vector-splicing instructions are applied to the peeled iterations, while the steady-state loop body incurs no additional computational overhead.
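Illustration (not part of the filing): the split into prologue, steady-state body, and epilogue can be sketched as index arithmetic in Python. The function name and parameters are hypothetical, and the sketch only counts iterations; it does not model the vector-splicing instructions that the abstract applies to the peeled iterations.

```python
def peel(n, offset, v):
    """Split n loop iterations into (prologue, steady_state_vectors, epilogue).

    offset: elements by which the stream starts past a vector boundary
    v:      vector length in elements
    """
    prologue = min(n, (v - offset) % v)  # iterations until the stream is aligned
    remaining = n - prologue
    return prologue, remaining // v, remaining % v
```

For 20 iterations, an offset of 3, and 4-element vectors, this yields a 1-iteration prologue, 4 full vector iterations, and a 3-iteration epilogue.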

Type: Application

Filed: August 16, 2004

Publication date: December 22, 2005

Applicant: International Business Machines Corporation

Inventors: Alexandre Eichenberger, Kai-Ting Wang, Peng Wu

System and method for SIMD code generation in the presence of optimized misaligned data reorganization

Publication number: 20050283774

Abstract: A method, computer program product, and information handling system for generating loop code to execute on Single-Instruction Multiple-Datapath (SIMD) architectures, where the loop operates on datatypes having different lengths, is disclosed. Further, a preferred embodiment of the present invention includes a novel technique to efficiently realign or shift arbitrary streams to an arbitrary offset, regardless of whether the alignments or offsets are known at compile time or not. This technique enables the application of advanced alignment optimizations to runtime alignment. This allows sequential loop code operating on datatypes of disparate length to be transformed (“simdized”) into optimized SIMD code through a fully automated process.

Type: Application

Filed: August 16, 2004

Publication date: December 22, 2005

Applicant: International Business Machines Corporation

Inventors: Alexandre Eichenberger, Kai-Ting Wang, Peng Wu

System and method for efficient data reorganization to satisfy data alignment constraints

Publication number: 20050283769

Abstract: A system and method is provided for vectorizing misaligned references in compiled code for SIMD architectures that support only aligned loads and stores. In the framework presented herein, a loop is first simdized as if the memory unit imposes no alignment constraints. The compiler then inserts data reorganization operations to satisfy the actual alignment requirement of the hardware. Finally, the code generation algorithm generates SIMD codes based on the data reorganization graph, addressing realistic issues such as runtime alignments, unknown loop bounds, residue iteration counts, and multiple statements with arbitrary alignment combinations. Beyond generating a valid simdization, a preferred embodiment further improves the quality of the generated codes. Four stream-shift placement policies are disclosed, which minimize the number of data reorganization operations generated by the alignment handling.
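Illustration (not part of the filing): where stream shifts are placed changes how many are needed. The Python sketch below compares two illustrative placements for a statement such as a[i+3] = b[i+1] + c[i+2] with 4-element vectors. These two placements and their function names are assumptions for the example; the abstract does not name or define the four disclosed policies.

```python
def shifts_via_zero(load_offsets, store_offset):
    """Shift every misaligned load stream to offset 0, then shift the
    result stream from offset 0 to the store offset."""
    load_shifts = sum(1 for offset in load_offsets if offset != 0)
    store_shift = 1 if store_offset != 0 else 0
    return load_shifts + store_shift


def shifts_direct(load_offsets, store_offset):
    """Shift each load stream directly to the store offset."""
    return sum(1 for offset in load_offsets if offset != store_offset)
```

For load offsets 1 and 2 and store offset 3, the first placement needs three shifts and the second needs two, which is the kind of difference a placement policy aims to exploit.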

Type: Application

Filed: June 7, 2004

Publication date: December 22, 2005

Applicant: International Business Machines Corporation

Inventors: Alexandre Eichenberger, John O'Brien, Peng Wu

Framework for generating mixed-mode operations in loop-level simdization

Publication number: 20050273769

Abstract: A method, computer program product, and information handling system for generating mixed-mode operations in the compilation of program code for processors having vector or SIMD processing units is disclosed. In a preferred embodiment of the present invention, program instructions making up the body of a loop are abstracted into virtual vector instructions. These virtual vector instructions are treated, for initial code optimization purposes, as vector instructions (i.e., instructions written for the vector unit). The virtual vector instructions are eventually expanded into native code for the target processor, at which time a determination is made for each virtual vector instruction as to whether to expand the virtual vector instruction into native vector instructions, into native scalar instructions, into calls to pre-defined library functions, or into a combination of these.
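Illustration (not part of the filing): the late expansion of a virtual vector instruction can be sketched in Python as a choice among three lowerings. The operation sets, the instruction mnemonics, the library routine name, and the order of preference are all hypothetical.

```python
NATIVE_VECTOR_OPS = {"add", "mul"}           # assumed to exist on the vector unit
LIBRARY_ROUTINES = {"sqrt": "vec_sqrt_lib"}  # hypothetical library entry point


def expand(op, lanes=4):
    """Expand one virtual vector instruction into target instructions."""
    if op in NATIVE_VECTOR_OPS:
        return [f"v{op}"]                     # native vector instruction
    if op in LIBRARY_ROUTINES:
        return [f"call {LIBRARY_ROUTINES[op]}"]  # pre-defined library function
    # Otherwise fall back to one native scalar instruction per lane.
    return [f"s{op} lane{i}" for i in range(lanes)]
```

Here expand("add") produces a single vector instruction, while expand("div", lanes=2) produces two scalar instructions, so one loop body can mix both modes.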

Type: Application

Filed: August 16, 2004

Publication date: December 8, 2005

Applicant: International Business Machines Corporation

Inventors: Alexandre Eichenberger, Kai-Ting Wang, Peng Wu

System and method for SIMD code generation for loops with mixed data lengths

Publication number: 20050273770

Abstract: A method, computer program product, and information handling system for generating loop code to execute on Single-Instruction Multiple-Datapath (SIMD) architectures, where the loop operates on datatypes having different lengths, is disclosed. Further, a preferred embodiment of the present invention includes a novel technique to efficiently realign or shift arbitrary streams to an arbitrary offset, regardless of whether the alignments or offsets are known at compile time or not. This technique enables the application of advanced alignment optimizations to runtime alignment. Length conversion operations, for packing and unpacking data values, are included in the alignment handling framework. These operations are formally defined in terms of standard SIMD instructions that are readily available on various SIMD platforms. This allows sequential loop code operating on datatypes of disparate length to be transformed (“simdized”) into optimized SIMD code through a fully automated process.
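Illustration (not part of the filing): length conversion changes how many elements fit in one register, so one vector of narrow elements becomes two vectors of wide elements and vice versa. The Python sketch below models lanes as list entries; the function names and the 16-bit narrow width are assumptions, and the abstract's formal definitions are not reproduced.

```python
def unpack(vec):
    """One vector of 2k narrow lanes -> two vectors of k wide lanes."""
    half = len(vec) // 2
    return vec[:half], vec[half:]


def pack(low, high, bits=16):
    """Two vectors of k wide lanes -> one vector of 2k narrow lanes.

    Each value is truncated to `bits` bits, as a narrowing store would do.
    """
    mask = (1 << bits) - 1
    return [value & mask for value in low + high]
```

A loop that reads 16-bit values and writes 32-bit values would apply unpack once per input vector and then process two output vectors; the reverse direction uses pack.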

Type: Application

Filed: August 16, 2004

Publication date: December 8, 2005

Applicant: International Business Machines Corporation

Inventors: Alexandre Eichenberger, Kai-Ting Wang, Peng Wu

Method and apparatus for eliminating the need for register assignment, allocation, spilling and re-filling

Publication number: 20050132172

Abstract: A method and apparatus is provided to manage data in computer registers in a program, making more computer registers available to one or more programmers utilizing a name level instruction. The method and apparatus disclosed herein presents a way of reducing the overhead of register management, by introducing a concept of a name level for each of the named architected registers in a processor. The method provides a programmer with a larger register name-space while not increasing the size of the instruction word in the processor instruction-set architecture. It also provides for the facilitation of architectural features which overload the architected register namespace and ease the overhead of register management. This provides for the addition of more computer registers without changing the instruction format of the computer.

Type: Application

Filed: December 12, 2003

Publication date: June 16, 2005

Inventors: Alexandre Eichenberger, Erik Altman, Sumedh Sathaye, John-David Wellman