Patents by Inventor Gerald George Pechanek

Gerald George Pechanek has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Methods and Apparatus for Attaching Application Specific Functions Within an Array Processor

Publication number: 20110153998

Abstract: A multi-node video signal processor (VSPN) is described that tightly couples multiple multi-cycle state machines (hardware assist units) to each processor and each memory in each node of an N node scalable array processor. VSPN memory hardware assist instructions are used to initiate multi-cycle state machine functions, to pass parameters to the multi-cycle state machines, to fetch operands from a node's memory, and to control the transfer of results from the multi-cycle state machines.

Type: Application

Filed: March 1, 2011

Publication date: June 23, 2011

Applicant: Altera Corporation

Inventors: Gerald George Pechanek, Mihailo M. Stojancic
Methods and apparatus storing expanded width instructions in a VLIW memory for deferred execution

Patent number: 7962723

Abstract: Techniques are described for decoupling fetching of an instruction stored in a main program memory from earliest execution of the instruction. An indirect execution method and program instructions to support such execution are addressed. In addition, an improved indirect deferred execution processor (DXP) VLIW architecture is described which supports a scalable array of memory centric processor elements that do not require local load and store units.

Type: Grant

Filed: July 9, 2009

Date of Patent: June 14, 2011

Inventors: Gerald George Pechanek, Stamatis Vassiliadis
Efficient complex multiplication and fast fourier transform (FFT) implementation on the ManArray architecture

Patent number: 7962719

Abstract: Efficient computation of complex multiplication results and very efficient fast Fourier transforms (FFTs) are provided. A parallel array VLIW digital signal processor is employed along with specialized complex multiplication instructions and communication operations between the processing elements which are overlapped with computation to provide very high performance operation. Successive iterations of a loop of tightly packed VLIWs are used allowing the complex multiplication pipeline hardware to be efficiently used. In addition, efficient techniques for supporting combined multiply accumulate operations are described.

Type: Grant

Filed: August 7, 2008

Date of Patent: June 14, 2011

Inventors: Nikos P. Pitsianis, Gerald George Pechanek, Ricardo Rodriguez
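
As a rough illustration of the arithmetic behind the complex multiplication and FFT abstract above, the sketch below shows a complex multiply built from four real multiplies and the radix-2 butterfly that uses it. The tuple representation and the function names `cmul` and `butterfly` are assumptions made for this example; they are not the ManArray instructions or data formats.

```python
# Complex values are (real, imag) tuples; this representation is an
# assumption for illustration, not the processor's data format.

def cmul(x, y):
    """Complex multiply using four real multiplies and two add/subtracts."""
    xr, xi = x
    yr, yi = y
    return (xr * yr - xi * yi, xr * yi + xi * yr)

def butterfly(a, b, w):
    """Radix-2 FFT butterfly: returns (a + w*b, a - w*b)."""
    tr, ti = cmul(w, b)
    return (a[0] + tr, a[1] + ti), (a[0] - tr, a[1] - ti)

print(cmul((1, 2), (3, 4)))  # (-5, 10)
top, bottom = butterfly((1, 0), (0, 1), (0, -1))
print(top, bottom)  # (2, 0) (0, 0)
```

An FFT repeats this butterfly across stages, which is why a dedicated complex multiply and overlapped data movement between processing elements matter for throughput.
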
Methods and apparatus for automated generation of abbreviated instruction set and configurable processor architecture

Patent number: 7953955

Abstract: A systematic approach to architecture and design of the instruction fetch mechanisms and instruction set architectures in embedded processors is described. This systematic approach allows a relaxing of certain restrictions normally imposed by a fixed-size instruction set architecture (ISA) on design and development of an embedded system. The approach also guarantees highly efficient usage of the available instruction storage which is only bounded by the actual information contents of an application or its entropy. The result of this efficiency increase is a general reduction of the storage requirements, or a compression, of the instruction segment of the original application. An additional feature of this system is the full decoupling of the ISA from the core architecture. This decoupling allows usage of a variable length encoding for any size of the ISA without impacting the physical instruction memory organization or layout and branching mechanism as well as tuning of the execution core to the application.

Type: Grant

Filed: December 14, 2010

Date of Patent: May 31, 2011

Assignee: Altera Corporation

Inventors: Sergei Yurievich Larin, Gerald George Pechanek, Thomas M. Conte
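
The abstract above says instruction storage is bounded by the information content, or entropy, of the application. A minimal sketch of that idea follows, assuming a simple zeroth-order model in which each instruction is an independent symbol. The function names and the sample program are hypothetical, and the patent's own encoding method is not reproduced here.

```python
import math
from collections import Counter

def entropy_bits_per_symbol(symbols):
    """Shannon entropy of the symbol frequencies, in bits per symbol."""
    counts = Counter(symbols)
    total = len(symbols)
    return -sum((n / total) * math.log2(n / total) for n in counts.values())

def storage_lower_bound_bits(symbols):
    """Lower bound on total bits for a symbol-by-symbol code under this model."""
    return entropy_bits_per_symbol(symbols) * len(symbols)

# Hypothetical instruction stream, used only to exercise the functions.
program = ["add", "add", "load", "add", "store", "load", "add", "add"]
print(entropy_bits_per_symbol(program))
print(storage_lower_bound_bits(program))
```

A stream dominated by a few frequent instructions has low entropy, so a variable-length encoding can store it in fewer bits than a fixed-size encoding would.
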
Methods and apparatus for address translation functions

Patent number: 7945760

Abstract: Techniques are described for efficient reordering of data and performing data exchanges within a register file or memory, or in general, any device storing data that is accessible through a set of addressable locations. In one technique, an address translator is placed in the path of all or a selected set of address busses to a storage device to provide a programmable translating of the storage device addresses. An effect of this translation is that the data stored in one pattern may be accessed and stored in another pattern or accessed, processed and stored in another pattern. The address translation operation may be carried out in a single cycle, does not involve the physical movement of data in swap operations, allows data to effectively be ordered more efficiently for algorithmic processing and therefore saves power. Address translation functions are shown to be useful for vector operations and a new type of storage unit using built in address translation functions is presented.

Type: Grant

Filed: April 1, 2004

Date of Patent: May 17, 2011

Assignee: Altera Corporation

Inventors: Edwin Franklin Barry, Gerald George Pechanek
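
To make the address translation abstract above concrete, here is a minimal sketch of a storage unit whose addresses pass through a programmable translation function, so data stored in one pattern can be read in another without being moved. The class `TranslatedStore` and the function `transpose_4x4` are hypothetical names for illustration, not the patented design.

```python
class TranslatedStore:
    """Storage whose addresses pass through a programmable translator."""

    def __init__(self, data, translate=None):
        self.data = list(data)
        # Identity translation by default.
        self.translate = translate or (lambda addr: addr)

    def read(self, addr):
        return self.data[self.translate(addr)]

    def write(self, addr, value):
        self.data[self.translate(addr)] = value

def transpose_4x4(addr):
    """Map a row-major 4x4 address to the address of its transposed element."""
    row, col = divmod(addr, 4)
    return col * 4 + row

store = TranslatedStore(range(16))   # a 4x4 matrix stored row-major
store.translate = transpose_4x4      # reprogram the translator
print([store.read(a) for a in range(4)])  # [0, 4, 8, 12]
```

The stored data never changes order; only the translator is reprogrammed, which is the sense in which no physical data movement is involved.
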
Methods and apparatus for dynamic instruction controlled reconfigurable register file

Patent number: 7941648

Abstract: A scalable reconfigurable register file (SRRF) containing multiple register files, read and write multiplexer complexes, and a control unit operating in response to instructions is described. Multiple address configurations of the register files are supported by each instruction and different configurations are operable simultaneously during a single instruction execution. For example, with separate files of the size 32×32, supported configurations of 128×32 bits, 64×64 bits and 32×128 bits can be in operation each cycle. Single width, double width, quad width operands are optimally supported without increasing the register file size and without increasing the number of register file read or write ports.

Type: Grant

Filed: June 3, 2008

Date of Patent: May 10, 2011

Assignee: Altera Corporation

Inventors: Gerald George Pechanek, Edward A. Wolff
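
The configurations in the register file abstract above all hold the same 4096 bits, which suggests four 32×32 files; the count of four is inferred from the arithmetic, not stated in the abstract. The sketch below shows one possible way a register index could map onto (file, row) pairs for each operand width. The mapping and the name `locate` are assumptions for illustration only.

```python
FILES = 4        # assumed number of 32x32 files (4 * 32 * 32 = 4096 bits)
ROWS = 32        # rows per file
FILE_WIDTH = 32  # bits per row

def locate(width_bits, reg):
    """Return the (file, row) pairs that hold register `reg` at this width."""
    files_per_reg = width_bits // FILE_WIDTH
    if width_bits % FILE_WIDTH or files_per_reg not in (1, 2, 4):
        raise ValueError("width must be 32, 64 or 128 bits")
    num_regs = FILES * ROWS // files_per_reg   # 128, 64 or 32 registers
    if not 0 <= reg < num_regs:
        raise IndexError("register index out of range for this width")
    group, row = divmod(reg, ROWS)
    first = group * files_per_reg
    return [(first + k, row) for k in range(files_per_reg)]

print(locate(32, 33))   # [(1, 1)]
print(locate(64, 33))   # [(2, 1), (3, 1)]
print(locate(128, 5))   # [(0, 5), (1, 5), (2, 5), (3, 5)]
```

Under this assumed mapping a wider operand reads the same row from several files at once, so the number of ports on each individual file does not grow.
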
METHODS AND APPARATUS FOR AUTOMATED GENERATION OF ABBREVIATED INSTRUCTION SET AND CONFIGURABLE PROCESSOR ARCHITECTURE

Publication number: 20110083001

Abstract: A systematic approach to architecture and design of the instruction fetch mechanisms and instruction set architectures in embedded processors is described. This systematic approach allows a relaxing of certain restrictions normally imposed by a fixed-size instruction set architecture (ISA) on design and development of an embedded system. The approach also guarantees highly efficient usage of the available instruction storage which is only bounded by the actual information contents of an application or its entropy. The result of this efficiency increase is a general reduction of the storage requirements, or a compression, of the instruction segment of the original application. An additional feature of this system is the full decoupling of the ISA from the core architecture. This decoupling allows usage of a variable length encoding for any size of the ISA without impacting the physical instruction memory organization or layout and branching mechanism as well as tuning of the execution core to the application.

Type: Application

Filed: December 14, 2010

Publication date: April 7, 2011

Applicant: Altera Corporation

Inventors: Sergei Yurievich Larin, Gerald George Pechanek, Thomas M. Conte
Methods and apparatus for efficiently sharing memory and processing in a multi-processor

Publication number: 20110072237

Abstract: A shared memory network for communicating between processors using store and load instructions is described. A new processor architecture which may be used with the shared memory network is also described that uses arithmetic/logic instructions that do not specify any source operand addresses or target operand addresses. The source operands and target operands for arithmetic/logic execution units are provided by independent load instruction operations and independent store instruction operations.

Type: Application

Filed: November 27, 2010

Publication date: March 24, 2011

Inventor: Gerald George Pechanek
Interconnection network and method of construction thereof for efficiently sharing memory and processing in a multi-processor wherein connections are made according to adjacency of nodes in a dimension

Patent number: 7886128

Abstract: A shared memory network for communicating between processors using store and load instructions is described. A new processor architecture which may be used with the shared memory network is also described that uses arithmetic/logic instructions that do not specify any source operand addresses or target operand addresses. The source operands and target operands for arithmetic/logic execution units are provided by independent load instruction operations and independent store instruction operations.

Type: Grant

Filed: June 3, 2009

Date of Patent: February 8, 2011

Inventor: Gerald George Pechanek
Methods and apparatus for automated generation of abbreviated instruction set and configurable processor architecture

Patent number: 7865692

Abstract: A systematic approach to architecture and design of the instruction fetch mechanisms and instruction set architectures in embedded processors is described. This systematic approach allows a relaxing of certain restrictions normally imposed by a fixed-size instruction set architecture (ISA) on design and development of an embedded system. The approach also guarantees highly efficient usage of the available instruction storage which is only bounded by the actual information contents of an application or its entropy. The result of this efficiency increase is a general reduction of the storage requirements, or a compression, of the instruction segment of the original application. An additional feature of this system is the full decoupling of the ISA from the core architecture. This decoupling allows usage of a variable length encoding for any size of the ISA without impacting the physical instruction memory organization or layout and branching mechanism as well as tuning of the execution core to the application.

Type: Grant

Filed: January 26, 2006

Date of Patent: January 4, 2011

Assignee: Altera Corp.

Inventors: Sergei Yurievich Larin, Gerald George Pechanek, Thomas M. Conte
Methods and Apparatus for Adapting Pipeline Stage Latency Based on Instruction Type

Publication number: 20100318775

Abstract: Processor pipeline controlling techniques are described which take advantage of the variation in critical path lengths of different instructions to achieve increased performance. By examining a processor's instruction set and execution unit implementation's critical timing paths, instructions are classified into speed classes. Based on these speed classes, one pipeline is presented where hold signals are used to dynamically control the pipeline based on the instruction class in execution. An alternative pipeline supporting multiple classes of instructions is presented where the pipeline clocking is dynamically changed as a result of decoded instruction class signals. A single pass synthesis methodology for multi-class execution stage logic is also described. For dynamic class variable pipeline processors, the mix of instructions can have a great effect on processor performance and power utilization since both can vary by the program mix of instruction classes.

Type: Application

Filed: August 24, 2010

Publication date: December 16, 2010

Applicant: Altera Corporation

Inventors: Edwin Franklin Barry, Gerald George Pechanek, Patrick R. Marchand
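
The abstract above notes that the instruction mix affects performance when pipeline timing depends on instruction class. A deliberately simplified model is sketched below: each instruction occupies the execute stage for a number of fast-clock cycles given by its class, compared with a pipeline timed for the slowest class. The class names, cycle counts and function names are hypothetical, and pipeline overlap is ignored.

```python
# Hypothetical speed classes: execute-stage occupancy in fast-clock cycles.
CLASS_CYCLES = {"fast": 1, "slow": 2}

def cycles_with_holds(classes):
    """Each instruction holds the stage only as long as its class needs."""
    return sum(CLASS_CYCLES[c] for c in classes)

def cycles_worst_case(classes):
    """Every instruction is given the time of the slowest class."""
    return len(classes) * max(CLASS_CYCLES.values())

program = ["fast", "fast", "slow", "fast"]
print(cycles_with_holds(program))   # 5
print(cycles_worst_case(program))   # 8
```

In this model the benefit shrinks as the share of slow-class instructions grows, which matches the abstract's point that results vary with the program's mix of instruction classes.
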
Methods and apparatus for scalable array processor interrupt detection and response

Patent number: 7853779

Abstract: Hardware and software techniques for interrupt detection and response in a scalable pipelined array processor environment are described. Utilizing these techniques, a sequential program execution model with interrupts can be maintained in a highly parallel scalable pipelined array processor containing multiple processing elements and distributed memories and register files. When an interrupt occurs, interface signals are provided to all PEs to support independent interrupt operations in each PE dependent upon the local PE instruction sequence prior to the interrupt. Processing element exception interrupts are supported and low latency interrupt processing is also provided for embedded systems where real time signal processing is required. Further, a hierarchical interrupt structure is used allowing a generalized debug approach using debug interrupts and a dynamic debug monitor mechanism.

Type: Grant

Filed: May 14, 2008

Date of Patent: December 14, 2010

Assignee: Altera Corporation

Inventors: Edwin Franklin Barry, Patrick R. Marchand, Gerald George Pechanek, Larry D. Larsen
Methods and apparatus for power control in a scalable array of processor elements

Patent number: 7836317

Abstract: Low power architecture features and techniques are provided in a scalable array indirect VLIW processor. These features and techniques include power control of a reconfigurable register file, conditional power control of multi-cycle operations and indirect VLIW utilization, and power control of VLIW-based vector processing using the ManArray register file indexing mechanism. These techniques are applicable to all processing elements (PEs) and the array controller sequence processor (SP) to provide substantial power savings.

Type: Grant

Filed: January 29, 2008

Date of Patent: November 16, 2010

Assignee: Altera Corp.

Inventors: Patrick R. Marchand, Gerald George Pechanek, Edward A. Wolff
Methods and apparatus for adapting pipeline stage latency based on instruction type

Patent number: 7809932

Abstract: Processor pipeline controlling techniques are described which take advantage of the variation in critical path lengths of different instructions to achieve increased performance. By examining a processor's instruction set and execution unit implementation's critical timing paths, instructions are classified into speed classes. Based on these speed classes, one pipeline is presented where hold signals are used to dynamically control the pipeline based on the instruction class in execution. An alternative pipeline supporting multiple classes of instructions is presented where the pipeline clocking is dynamically changed as a result of decoded instruction class signals. A single pass synthesis methodology for multi-class execution stage logic is also described. For dynamic class variable pipeline processors, the mix of instructions can have a great effect on processor performance and power utilization since both can vary by the program mix of instruction classes.

Type: Grant

Filed: March 22, 2004

Date of Patent: October 5, 2010

Assignee: Altera Corporation

Inventors: Edwin Franklin Barry, Gerald George Pechanek, Patrick R. Marchand
Methods and apparatus for independent processor node operations in a SIMD array processor

Patent number: 7730280

Abstract: A control processor is used for fetching and distributing single instruction multiple data (SIMD) instructions to a plurality of processing elements (PEs). One of the SIMD instructions is a thread start (Tstart) instruction, which causes the control processor to pause its instruction fetching. A local PE instruction memory (PE Imem) is associated with each PE and contains local PE instructions for execution on the local PE. Local PE Imem fetch, decode, and execute logic are associated with each PE. Instruction path selection logic in each PE is used to select between control processor distributed instructions and local PE instructions fetched from the local PE Imem. Each PE is also initialized to receive control processor distributed instructions. In addition, local hold generation logic is associated with each PE. A PE receiving a Tstart instruction causes the instruction path selection logic to switch to fetch local PE Imem instructions.

Type: Grant

Filed: April 18, 2007

Date of Patent: June 1, 2010

Assignee: Vicore Technologies, Inc.

Inventors: Gerald George Pechanek, Edwin Franklin Barry, Mihailo M. Stojancic
Methods and apparatus for extracting bits of a source register based on a mask and right justifying the bits into a target register

Patent number: 7685408

Abstract: Techniques are described for performing a bit rake instruction in a programmable processor. The bit rake instruction extracts an arbitrary pattern of bits from a source register, based on a mask provided in another register, and packs and right justifies the bits into a target register. The bit rake instruction allows any set of bits from the source register to be packed together.

Type: Grant

Filed: September 29, 2008

Date of Patent: March 23, 2010

Assignee: Altera Corporation

Inventors: Edward A. Wolff, Peter R. Molnar, Ayman Elezabi, Gerald George Pechanek
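
The bit rake operation in the abstract above can be sketched in software as follows. This is a minimal model, assuming a 32-bit register width and that selected bits are packed in order from the least significant mask bit upward; the function name `bit_rake` and those details are assumptions, not the patented hardware implementation.

```python
def bit_rake(source, mask, width=32):
    """Extract the bits of `source` selected by `mask` and right justify them."""
    result = 0
    out_pos = 0  # next free bit position in the result
    for bit in range(width):
        if (mask >> bit) & 1:
            result |= ((source >> bit) & 1) << out_pos
            out_pos += 1
    return result

# The mask selects bit positions 2, 3, 6 and 7 of the source.
print(bin(bit_rake(0b10110100, 0b11001100)))  # 0b1001
```

The loop visits one bit per iteration, whereas the instruction described in the abstract performs the extraction and packing as a single operation.
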
Methods and apparatus for efficient complex long multiplication and covariance matrix implementation

Patent number: 7680873

Abstract: Efficient computation of complex long multiplication results and an efficient calculation of a covariance matrix are described. A parallel array VLIW digital signal processor is employed along with specialized complex long multiplication instructions and communication operations between the processing elements which are overlapped with computation to provide very high performance operation. Successive iterations of a loop of tightly packed VLIWs may be used allowing the complex multiplication pipeline hardware to be efficiently used.

Type: Grant

Filed: May 22, 2006

Date of Patent: March 16, 2010

Assignee: Altera Corporation

Inventors: Gerald George Pechanek, Ricardo Rodriguez, Matthew Plonski, David Strube, Kevin Coopman
Processor organized in clusters of processing elements and cluster interconnections by a clustering process

Patent number: 7631165

Abstract: An array processor includes processing elements (00, 01, 02, 03, 10, 11, 12, 13, 20, 21, 23, 30, 31, 32, 33) arranged in clusters (e.g., 44, 46, 48, 50) to form a rectangular array (40). Inter-cluster communication paths (88) are mutually exclusive. Due to the mutual exclusivity of the data paths, communications between the processing elements of each cluster may be combined in a single inter-cluster path, thus eliminating half the wiring required for the path. The length of the longest communication path is not directly determined by the overall dimension of the array, as in conventional torus arrays. Rather, the longest communications path is limited by the inter-cluster spacing. Transpose elements of an N×N torus may be combined in clusters and communicate with one another through intra-cluster communications paths. Transpose operation latency is eliminated in this approach. Each PE may have a single transmit port (35) and a single receive port (37).

Type: Grant

Filed: March 7, 2007

Date of Patent: December 8, 2009

Assignee: Altera Corp.

Inventors: Gerald George Pechanek, Charles W. Kurak, Jr.
Register file indexing methods and apparatus for providing indirect control of register addressing in a VLIW processor

Patent number: RE41012

Abstract: A double indirect method of accessing a block of data in a register file is used to allow efficient implementations without the use of specialized vector processing hardware. In addition, the automatic modification of the register addressing is not tied to a single vector instruction nor to repeat or loop instructions. Rather, the technique, termed register file indexing (RFI) allows full programmer flexibility in control of the block data operational facility and provides the capability to mix non-RFI instructions with RFI instructions. The block-data operation facility is embedded in the iVLIW ManArray architecture allowing its generalized use across the instruction set architecture without specialized vector instructions or being limited in use only with repeat or loop instructions.

Type: Grant

Filed: June 3, 2004

Date of Patent: November 24, 2009

Assignee: Altera Corporation

Inventors: Edwin Franklin Barry, Gerald George Pechanek, Patrick R. Marchand
Methods and apparatus for efficient synchronous MIMD operations with IVLIW PE-TO-PE communication

Patent number: RE41703

Abstract: A SIMD machine employing a plurality of parallel processing elements (PEs) is described in which communications hazards are eliminated in an efficient manner. An indirect Very Long Instruction Word instruction memory (VIM) is employed along with execute and delimiter instructions. A masking mechanism may be employed to control which PEs have their VIMs loaded. Further, a receive model of operation is preferably employed. In one aspect, each PE operates to control a switch that selects from which PE it receives. The present invention addresses a better machine organization for execution of parallel algorithms that reduces hardware cost and complexity while maintaining the best characteristics of both SIMD and MIMD machines and minimizing communication latency. This invention brings a level of MIMD computational autonomy to SIMD indirect Very Long Instruction Word (iVLIW) processing elements while maintaining the single thread of control used in the SIMD machine organization.

Type: Grant

Filed: June 21, 2004

Date of Patent: September 14, 2010

Assignee: Altera Corp.

Inventors: Gerald George Pechanek, Thomas L. Drabenstott, Juan Guillermo Revilla, David Strube, Grayson Morris
