Array Processor Element Interconnection Patents (Class 712/11)
  • Patent number: 7581081
    Abstract: A system for processing applications includes processor nodes and links interconnecting the processor nodes. Each node includes a processing element, a software extensible device, and a communication interface. The processing element executes at least one of the applications. The software extensible device provides additional instructions to a set of standard instructions for the processing element. The communication interface communicates with other processor nodes.
    Type: Grant
    Filed: December 31, 2003
    Date of Patent: August 25, 2009
    Assignee: Stretch, Inc.
    Inventors: Ricardo E. Gonzalez, Albert R. Wang, Gareld Howard Banta
  • Patent number: 7577820
    Abstract: An integrated circuit comprises a plurality of tiles. Each tile comprises a processor including a storage module, wherein the processor is configured to process multiple streams of instructions, a switch including switching circuitry to forward data received over data paths from other tiles to the processor and to switches of other tiles, and to forward data received from the processor to switches of other tiles, and coupling circuitry configured to couple data resulting from processing an instruction from at least one of the streams of instructions to the storage module and to the switch.
    Type: Grant
    Filed: April 14, 2006
    Date of Patent: August 18, 2009
    Assignee: Tilera Corporation
    Inventors: David Wentzlaff, Anant Agarwal
  • Publication number: 20090193225
    Abstract: A processing architecture and methods therein for building application specific array processing utilizing a sequential data bus for control and data propagation. The methods of array processing provided by the architecture allows for numerical analysis of large numerical data such as simulation, image processing, computer modeling or other numerical functions. The architecture is unlimited in scalability and facilitates mixed mode processing of idealized, analytical and real data, in conjunction with real time input and output.
    Type: Application
    Filed: January 21, 2009
    Publication date: July 30, 2009
    Applicant: GRAY AREA TECHNOLOGIES, INC.
    Inventor: Jerrold Lee Gray
  • Patent number: 7565287
    Abstract: Techniques for implementing vocoders in parallel digital signal processors are described. A preferred approach is implemented in conjunction with the BOPS® Manifold Array (ManArray™) processing architecture so that in an array of N parallel processing elements, N channels of voice communication are processed in parallel. Techniques for forcing vocoder processing of one data-frame to take the same number of cycles are described. Improved throughput and lower clock rates can be achieved.
    Type: Grant
    Filed: December 20, 2005
    Date of Patent: July 21, 2009
    Assignee: Altera Corporation
    Inventors: Ali Soheil Sadri, Navin Jaffer, Anissim A. Silivra, Bin Huang, Matthew Plonski
  • Publication number: 20090158007
    Abstract: A system for implementing waveform processing in a software defined radio (SDR) includes a scaleable array processor having a plurality of micro-engines (MEs) interconnected by a two dimensional topology. Each micro-engine includes multiple FIFOs for interconnecting to each other in the two dimensional topology. One micro-engine communicates with another adjacent micro-engine by way of the respective FIFOs. The micro-engines are dedicated to predetermined algorithms. The two dimensional topology includes an array of N×M micro-engines interconnected by the multiple FIFOs. The N×M are integer numbers of rows and columns, respectively, in the array of micro-engines. The micro-engines are dedicated to baseband processing of data for RF transmission or RF reception.
    Type: Application
    Filed: December 12, 2007
    Publication date: June 18, 2009
    Applicant: ITT MANUFACTURING ENTERPRISES, INC.
    Inventor: Patrick A. McCabe
  • Publication number: 20090146691
    Abstract: A logic cell array having a number of logic cells and a segmented bus system for logic cell communication, the bus system including different segment lines having shorter and longer segments for connecting two points in order to be able to minimize the number of bus elements traversed between separate communication start and end points.
    Type: Application
    Filed: February 13, 2009
    Publication date: June 11, 2009
    Inventors: Martin VORBACH, Frank MAY, Dirk REICHARDT, Frank LIER, Gerd EHLERS, Armin NUCKEL, Volker BAUMGARTE, Prashant RAO, Jens OERTEL
  • Publication number: 20090144522
    Abstract: A data processing device comprising a multidimensional array of coarse grained logic elements processing data and operating at a first clock rate and communicating with one another and/or other elements via busses and/or communication lines operated at a second clock rate is disclosed, wherein the first clock rate is higher than the second and wherein the coarse grained logic elements comprise storage means for storing data needed to be processed.
    Type: Application
    Filed: October 24, 2008
    Publication date: June 4, 2009
    Inventors: Martin Vorbach, Alexander Thomas
  • Patent number: 7539845
    Abstract: An integrated circuit comprises a plurality of tiles. Each tile comprises a processor, and a switch including switching circuitry to forward data received over data paths from other tiles to the processor and to switches of other tiles, and to forward data received from the processor to switches of other tiles. The integrated circuit further comprises an interface coupled to a plurality of the tiles to transfer data between one or more switches of the tiles and one or more switches of tiles in an externally coupled integrated circuit.
    Type: Grant
    Filed: April 14, 2006
    Date of Patent: May 26, 2009
    Assignee: Tilera Corporation
    Inventors: David Wentzlaff, Carl G. Ramey, Anant Agarwal
  • Patent number: 7526631
    Abstract: A processor book designed to support both commercial workloads and technical workloads based on a dynamic or static mechanism of reconfiguring the external wiring interconnect. The processor book is configured as a building block for commercial workload processing systems with external connector buses (ECBs). The processor book is also provided with routing logic to enable to ECBs to be utilized for either book-to-book routing or routing within the same processor book. A table specific wiring scheme is provided for coupling the ECBs running off the chips of one MCM to the chips of the second MCM on the processor book so that the chips of the first MCM are connected directly to the chips of a second MCM that is logically furthest away and vice versa. Once the wiring of the ECBs are completed according to the wiring scheme, the operational and functional characteristics reflect those of a processor book configured for technical workloads.
    Type: Grant
    Filed: April 28, 2003
    Date of Patent: April 28, 2009
    Assignee: International Business Machines Corporation
    Inventors: Ravi Kumar Arimilli, Vicente Enrique Chung, Jody Bern Joyner, Jerry Don Lewis
  • Publication number: 20090094436
    Abstract: The invention provides an ultra-scalable supercomputer based on MPU architecture in achieving the well-balanced performance of hundreds of TFLOPS or PFLOPS range in applications. The supercomputer system design includes the interconnect topology and its corresponding routing strategies, the communication subsystem design and implementation, the software and hardware schematic implementations. The supercomputer comprises a plurality of processing nodes powering the parallel processing and Axon nodes connecting computing nodes while implementing the external interconnections. The interconnect topology can be based on MPU architecture and the communication routing logic as required by switching logics is implemented in the FPGA chips while some modular designs for accelerating particular traffic patterns from applications and meliorating the communication overhead are able to be deployed as well.
    Type: Application
    Filed: July 25, 2008
    Publication date: April 9, 2009
    Inventors: Yuefan Deng, Alexander Korobka, Peng Zhang
  • Patent number: 7515899
    Abstract: Additional computing power is captured using the idle processing power of mobile phones incorporated into a grid computing system, wherein the system is capable of pushing projects out to available mobile phones for processing during idle operation times. To further efficiently utilize the unused processing cycles of mobile phones, a unique protocol is utilized to coordinate processing tasks which makes use of existing short messages techniques to communicate projects. The unique protocol is combination of bootstrapping using standard compression techniques along with an adaptive compression scheme.
    Type: Grant
    Filed: April 23, 2008
    Date of Patent: April 7, 2009
    Assignee: International Business Machines Corporation
    Inventors: Hollie Carr, Peter Mattison, Christopher E. Sharp
  • Patent number: 7508776
    Abstract: A controlling method and device for data transmission including the steps of providing a system bus for connecting a first transmission channel and a second transmission channel with a command processor, adjusting a transmitting direction of the system bus according to a transmitting direction of the second transmission channel, and proceeding the data processing procedures of the second transmission channel, wherein parts of data processing procedures of the first transmission channel will last during a interval between the system bus adjusting the transmitting direction and the data processing procedures of the second transmission channel start on. The present invention ensure that the independence between every data caching and processing reduces the times of flushing the cached data from the data transmission channel and re-seeking through the source, shortening the transmission time, increasing facileness and improving the efficiency of the data transmission.
    Type: Grant
    Filed: April 15, 2004
    Date of Patent: March 24, 2009
    Assignee: Tian Holdings, LLC
    Inventors: Kevin Lin, Morris Chu
  • Patent number: 7503046
    Abstract: A method of determining an interleave pattern for n lots of A and y lots of B, when n plus y equals a power of two such that the expression 2z?n may be used to represent the value of y, includes generating a key including the reverse bit order of a serially indexed count from 0 to 2z. An interleave pattern can be generated from the key in which all values less than n are replace by A and all other values are replaced by B. The key can be used to generate a table that contains all possible combinations of values of A and B. The table can then be stored such that an interleave pattern can be automatically selected based on either the number of lots of A or the number of lots of B.
    Type: Grant
    Filed: October 20, 2003
    Date of Patent: March 10, 2009
    Assignee: Micron Technology, Inc.
    Inventor: Mark Beaumont
  • Patent number: 7502844
    Abstract: A monitoring system provided according to an aspect of the present invention enables a user to specify one or more resource elements as a group, and compute an Abnormality Index that represents the deviation from a baseline operation of these selected resource elements. In an embodiment, the Abnormality Index is computed as a single number. The user may include any desired attributes of each resource element to be considered in determining deviations. The Abnormality Index represents the number of deviations, the seriousness of deviation (e.g., more serious if deviation relates to unavailability of the resource element), and the extent of deviation (i.e., whether the deviation is based on expected operation for hourly, daily or weekly duration).
    Type: Grant
    Filed: July 29, 2005
    Date of Patent: March 10, 2009
    Assignee: BMC Software
    Inventors: Atul Garg, Joe Scarpelli, Dasari Subramanyeswara Rao, Girish Narasimha Raghavan, Bopana Ganapathy, Anil Kumar Sondekoppa Hanumappa
  • Publication number: 20090055624
    Abstract: The present invention relates to the control of an array of processing elements in a parallel processor using row and column select lines. For each column in the array, a column select line connects to all of the processing elements in the column. For each row in the array, a row select line connecting to all of the processing elements in the row. A processing element in the array may be selected by activation of its row and column select lines.
    Type: Application
    Filed: October 22, 2008
    Publication date: February 26, 2009
    Applicant: Micron Technology, Inc.
    Inventor: Graham Kirsch
  • Patent number: 7477608
    Abstract: There is provided a method for routing packets on a linear of N processors connected in a nearest neighbor configuration. The method includes the step of, for each end processor of the array, connecting unused outputs to corresponding unused inputs. For each axis required to directly route a packet from a source to a destination processor, the following steps are performed. It is determined whether a result of directly sending a packet from an initial processor to a target processor is less than or greater than N/2 moves, respectively. The initial processor is the source processor in the first axis, and the target processor is the destination processor in the last axis. The packet is directly sent from the initial processor to the target processor, when the result is less than N/2 moves. The packet is indirectly sent so as to wrap around each end processor, when the result is greater than N/2 moves.
    Type: Grant
    Filed: July 21, 2005
    Date of Patent: January 13, 2009
    Assignee: International Business Machines Corporation
    Inventors: Monty M. Denneau, Peter H. Hochschild, Richard A. Swetz, Henry S. Warren, Jr.
  • Patent number: 7472051
    Abstract: A microcontroller comprising a central processing unit and a further fault processing unit suitable for performing validation of operations of said central processing unit. The further fault processing unit is external and different with respect to said central processing unit and said further fault processing unit comprises at least a module for performing validation of operations of said central processing unit and one or more modules suitable for performing validation of operations of other functional parts of said microcontroller. Validation of operations of said central processing unit is performed by using one or more of the following fault tolerance techniques: data shadowing; code&flow signature; data processing legality check; addressing legality check; ALU concurrent integrity checking; concurrent mode/interrupt check.
    Type: Grant
    Filed: July 9, 2004
    Date of Patent: December 30, 2008
    Assignee: Yogitech Spa
    Inventors: Riccardo Mariani, Silvano Motto, Monia Chiavacci
  • Patent number: 7471643
    Abstract: A heterogeneous array includes clusters of processing elements. The clusters include a combination of ALUs and multiplexers linked by direct connections and various general-purpose routing networks. The multiplexers are controlled by the ALUs in the same cluster, or alternatively by ALUs in other clusters, via a dedicated multiplexer control network. Components of applications configured onto the array are selectively implemented in either multiplexers or ALUs, as determined by the relative efficiency of implementing the component in one or the other type of processing element, and by the relative availability of the processing element types. Multiplexer control signals are generated from combinations of ALU status signals, and optionally routed to control multiplexers in different clusters.
    Type: Grant
    Filed: July 1, 2002
    Date of Patent: December 30, 2008
    Assignee: Panasonic Corporation
    Inventor: Anthony I. Stansfield
  • Patent number: 7461236
    Abstract: An integrated circuit includes a plurality of tiles. Each tile comprises a processor; and a switch including switching circuitry to forward data over data paths from other tiles to the processor and to switches of other tiles according to a switch instruction indicating an input port to which each of multiple output ports of the switch is to be coupled. The switch is able to operate in a first mode in which successive input data arriving at the switch are forwarded according to a different switch instruction, and a second mode in which successive input data arriving at the switch are forwarded according to the same switch instruction.
    Type: Grant
    Filed: December 21, 2005
    Date of Patent: December 2, 2008
    Assignee: Tilera Corporation
    Inventor: David Wentzlaff
  • Patent number: 7461234
    Abstract: A heterogeneous array includes clusters of processing elements. The clusters include a combination of ALUs and multiplexers linked by direct connections and various general-purpose routing networks. The multiplexers are controlled by the ALUs in the same cluster, or alternatively by ALUs in other clusters, via a special purpose routing network. Components of applications configured onto the array are selectively implemented in either multiplexers or ALUs, as determined by the relative efficiency of implementing the component in one or the other type of processing element, and by the relative availability of the processing element types. Multiplexer control signals are generated from combinations of ALU status signals, and optionally routed to control multiplexers in different clusters.
    Type: Grant
    Filed: May 16, 2005
    Date of Patent: December 2, 2008
    Assignee: Panasonic Corporation
    Inventors: Nicholas John Charles Ray, Andrea Olgiati, Anthony I. Stansfield, Alan D Marshall
  • Patent number: 7454593
    Abstract: The present invention relates to the control of an array of processing elements in a parallel processor using row and column select lines. For each column in the array, a column select line connects to all of the processing elements in the column. For each row in the array, a row select line connecting to all of the processing elements in the row. A processing element in the array may be selected by activation of its row and column select lines. The processing elements are connected to adjacent processing elements by respective segments of a row bus for each row and by respective segments of a column bus for each column. Each row of the array includes a respective column edge register coupled to a processing element at one end of the respective row and to a processing element at the other end of the respective row.
    Type: Grant
    Filed: April 11, 2003
    Date of Patent: November 18, 2008
    Assignee: Micron Technology, Inc.
    Inventor: Graham Kirsch
  • Patent number: 7451292
    Abstract: Quantum gaps exist between an origin and a destination that heretofore have prevented reliably utilizing the advantages of quantum computing. To predict the outcome of instructions with precision, the input data, preferably a qubit, is collapsed to a point value within the quantum gap based on a software instruction. After collapse the input data is restructured at the destination, wherein dynamics of restructuring are governed by a plurality of gap factors as follows: computational self-awareness; computational decision logic; computational processing logic; computational and network protocol and logic exchange; computational and network components, logic and processes; provides the basis for excitability of the Gap junction and its ability to transmit electronic and optical impulses, integrates them properly, and depends on feedback loop logic; computational and network component and system interoperability; and embodiment substrate and network computational physical topology.
    Type: Grant
    Filed: August 8, 2003
    Date of Patent: November 11, 2008
    Inventor: Thomas J Routt
  • Patent number: 7444276
    Abstract: A logic simulation processor stores in a shift register intermediate values generated during the logic simulation. The simulation processor includes multiple processor units and an interconnect system that communicatively couples the processor units to each other. Each of the processor units includes a processor element configurable to simulate at least a logic gate, and a shift register associated with the processor element. The shift register includes multiple entries to store the intermediate values, and is coupled to receive the output of the processor element. Each of the processor units further includes one or more multiplexers for selecting one of the entries of the shift register as outputs to be coupled to the interconnect system. Each of the processor units may further include a local memory for storing data from, and loading the data to, the simulation processor.
    Type: Grant
    Filed: September 28, 2005
    Date of Patent: October 28, 2008
    Assignee: Liga Systems, Inc.
    Inventors: William Watt, Henry T. Verheyen
  • Publication number: 20080244221
    Abstract: Embodiments of apparatuses, methods, and systems for exposing system topology to an execution environment are disclosed. In one embodiment, an apparatus includes execution cores and resources on a single integrated circuit, and topology logic. The topology logic is to populate a data structure with information regarding a relationship between the execution cores and the resources.
    Type: Application
    Filed: March 30, 2007
    Publication date: October 2, 2008
    Inventors: Donald K. Newell, Jaideep Moses, Ravishankar Iyer, Rameshkumar G. Illikkal, Srihari Makineni
  • Publication number: 20080209163
    Abstract: A processor book designed to support both commercial workloads and technical workloads based on a dynamic or static mechanism of reconfiguring the external wiring interconnect. The processor book is configured as a building block for commercial workload processing systems with external connector buses (ECBs). The processor book is also provided with routing logic to enable to ECBs to be utilized for either book-to-book routing or routing within the same processor book. A table specific wiring scheme is provided for coupling the ECBs running off the chips of one MCM to the chips of the second MCM on the processor book so that the chips of the first MCM are connected directly to the chips of a second MCM that is logically furthest away and vice versa. Once the wiring of the ECBs are completed according to the wiring scheme, the operational and functional characteristics reflect those of a processor book configured for technical workloads.
    Type: Application
    Filed: May 9, 2008
    Publication date: August 28, 2008
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Ravi Kumar Arimilli, Vicente Enrique Chung, Jody Bern Joyner, Jerry Don Lewis
  • Patent number: 7418536
    Abstract: A processor for use in a router, the processor having a systolic array pipeline for processing data packets to determine to which output port of the router the data packet should be routed. In one embodiment, the systolic array pipeline includes a plurality of programmable functional units and register files arranged sequentially as stages, for processing packet contexts (which contain the packet's destination address) to perform operations, under programmatic control, to determine the destination port of the router for the packet. A single stage of the systolic array may contain a register file and one or more functional units such as adders, shifters, logical units, etc., for performing, in one example, very long instruction word (vliw) operations. The processor may also include a forwarding table memory, on-chip, for storing routing information, and a cross bar selectively connecting the stages of the systolic array with the forwarding table memory.
    Type: Grant
    Filed: January 4, 2006
    Date of Patent: August 26, 2008
    Assignee: Cisco Technology, Inc.
    Inventors: Arthur Tung-Tak Leung, Anthony Li, William Lynch, Sharad Mehrotra
  • Publication number: 20080195842
    Abstract: Occurrence and propagation of glitches caused by changing the path layout are suppressed, thereby reducing the power consumption. An array-type processor comprises a plurality of processor elements and can change the path layout relating to data transmission/reception between the processor elements depending on clock cycle. Each processor element comprises a layout information memory 11 that stores a layout information indicating signal relating to the layout of the paths, a delay adjusting circuit 12 that adjusts the timing of a layout information indicating signal Pin outputted from the layout information memory 11 at every clock cycle, and a wiring connection circuit 13 that changes a path to at least one of the other processor elements (PE) or function unit(s) (a register file unit 14 and an arithmetic logic unit 15) based on a layout information indicating signal Pout whose timing has been adjusted.
    Type: Application
    Filed: February 13, 2008
    Publication date: August 14, 2008
    Applicant: NEC ELECTRONICS CORPORATION
    Inventors: Yoshitaka Izawa, Yoshikazu Yabe
  • Patent number: 7406582
    Abstract: A method for communicating between nodes of a plurality of nodes is disclosed. Each node includes a plurality of processors and an interconnect chipset. The method issues a request for data from a processor in a first node and passes the request for data to other nodes through an expansion port (or scalability port). The method also starts an access of a memory in response to the request for data and snoops a processor cache of each processor in each node. The method accordingly identifies the location of the data in either the processor cache or memory in the node having the processor issuing the request or in a processor cache or memory of another node. A method for requesting data between two directly coupled nodes in a router system is also disclosed. A method for requesting data between three or more nodes in an interconnect system is also disclosed. A method for resolving crossing cases in an interconnect system is also disclosed.
    Type: Grant
    Filed: August 13, 2004
    Date of Patent: July 29, 2008
    Assignee: Fujitsu Limited
    Inventors: James C Wilson, Wolf-Dietrich Weber
  • Patent number: 7406075
    Abstract: A small cost-effective crossbar switch is provided. A switch circuit is disposed in each of a plurality of nodes which are cascade connected with each other in a plurality of stages. Each switch circuit receives from a node of a previous stage a designated address to specify directly or indirectly the relative position in which a target switch circuit is present, determines whether the designated address represents a specific value “0”. When it is determined that the specific value is represented, each switch circuit allows data output to a node-out line, decrements the received designated address by “1” to generate a new designated address, and supplies this new designated address to a node of the subsequent stage.
    Type: Grant
    Filed: March 22, 2004
    Date of Patent: July 29, 2008
    Assignee: Sony Computer Entertainment Inc.
    Inventors: Junichi Naoi, Tomohiro Ohto
  • Publication number: 20080176750
    Abstract: An analog processor, for example a quantum processor may include a plurality of elongated qubits that are disposed with respect to one another such that each qubit may selectively be directly coupled to each of the other qubits via a single coupling device. Such may provide a fully interconnected topology.
    Type: Application
    Filed: January 11, 2008
    Publication date: July 24, 2008
    Inventors: Geordie Rose, Paul Bunyk, Michael D. Coury, William Macready, Vicky Choi
  • Publication number: 20080162872
    Abstract: A data processing system includes a first processing node and a second processing node. The first processing node includes a plurality of first processing units coupled to each other for communication, and the second processing node includes a plurality of second processing units coupled to each other for communication. Each of the plurality of first processing units is coupled to a respective one of the plurality of second processing units in the second processing node by a respective one of a plurality of point-to-point links.
    Type: Application
    Filed: December 19, 2007
    Publication date: July 3, 2008
    Inventors: VICENTE E. CHUNG, Benjiman L. Goodman, Praveen S. Reddy, William J. Starke
  • Publication number: 20080148009
    Abstract: A processing system comprising processors and the dynamically configurable communication elements coupled together in an interspersed arrangement. The processors each comprise at least one arithmetic logic unit, an instruction processing unit, and a plurality of processor ports. The dynamically configurable communication elements each comprise a plurality of communication ports, a first memory, and a routing engine. For each of the processors, the plurality of processor ports is configured for coupling to a first subset of the plurality of dynamically configurable communication elements. For each of the dynamically configurable communication elements, the plurality of communication ports comprises a first subset of communication ports configured for coupling to a subset of the plurality of processors and a second subset of communication ports configured for coupling to a second subset of the plurality of dynamically configurable communication elements.
    Type: Application
    Filed: February 8, 2008
    Publication date: June 19, 2008
    Inventors: Michael B. Doerr, William H. Hallidy, David A. Gibson, Craig M. Chase
  • Publication number: 20080148120
    Abstract: An integrated chip architecture is provided which allows for efficiently testing multiple cores included in the integrated chip architecture and storing corresponding diagnosis data which include an indication of the failure-causing test data and the corresponding test analysis data. Embodiments are provided which enable that the test time and the number of required Input/Output test pins is nearly independent from the number of cores included in the multicore chip. The presented embodiments provide a multicore chip architecture which allows for providing input data to the multiple cores in parallel for simultaneously testing the multiple cores, and analyzing the resulting multiple test outputs on chip. As a result of this analysis embodiments may store on chip an indication for those cores that have not successfully passed the test, together with respective diagnosis data.
    Type: Application
    Filed: April 23, 2007
    Publication date: June 19, 2008
    Inventor: Markus Seuring
  • Patent number: 7386689
    Abstract: A method and apparatus for connecting the processor array of an MPP array to a memory such that data conversion by software is not necessary, and the data can be directly stored in either a normal mode or vertical mode in the memory is disclosed. A connection circuit is provided in which multiple PEs share their connections to multiple data bits in the memory array. Each PE is associated with a plurality of memory buffer registers, which stores data read from (or to be written to) one or two memory data bits. In horizontal (normal) mode connection the memory bits are selected so that all the bits of a given byte are stored in the same PE, i.e., each set of buffer registers associated with a respective PE contains one byte as seen by an external device. In vertical (bit serial) mode, each set of buffer registers contains the successive bits at successive locations in the memory corresponding to that PEs position in the memory word.
    Type: Grant
    Filed: May 4, 2005
    Date of Patent: June 10, 2008
    Assignee: Micron Technology, Inc.
    Inventor: Graham Kirsch
  • Publication number: 20080133879
    Abstract: Provided is a single instruction multiple data (SIMD) parallel processor including a plurality of processing units connected to one another. Each processing unit includes: an instruction register; an instruction decoder; a register files selection circuit; and register files. The SIMD parallel processor can selectively control data of register files required for any one of SIMD, single instruction single data (SISD), row, and column operations in response to an instruction. Since each of the SIMD, SISD, row, and column operations can be effectively performed according to the type of application, the SIMD parallel processor has excellent utility, efficiency, and flexibility.
    Type: Application
    Filed: October 1, 2007
    Publication date: June 5, 2008
    Inventors: Yil Suk Yang, Tae Moon Roh, Dae Woo Lee, Jong Dae Kim, Chun Gi Lyuh
  • Patent number: 7383242
    Abstract: A method and apparatus for item processing is disclosed which provides a stand alone clearing solution having an imaged enable environment for item processing and balancing. The present invention incorporates user-definable balancing control rules, balancing control levels and reconciliation algorithms for facilitating item processing. The present invention includes a data store (14) stored in memory accessible by a capture module (34), a configuration module (38), a balancing module (40), an administration module (46), a publishing module (42) and portal module (48) which provides parallel access to the data store (14) during item processing.
    Type: Grant
    Filed: February 28, 2001
    Date of Patent: June 3, 2008
    Assignee: Alogent Corporation
    Inventors: Brian R. Geisel, William Randlett, Amar Verma, Jeffrey T. Chinander, Kavil S. Subramanian, Ranjee Gangadhar
  • Patent number: 7379418
    Abstract: A method of ensuring system serialization in a multiprocessor multi-nodal environment is used to force all processors in a multiprocessor environment to temporarily suspend operations while one processor changes the system state. Architected designs where latencies between nodes are made known and predictable greatly simplify the task of coordinating quiesce responses within the system. When latencies are not fixed and topologies such as open or closed bus architectures are be used a more dynamic approach is required to ensure system serialization. Adaptive quiesce logic on each node's SCE can dynamically identify the role of the node within the system and automatically configure itself to guarantee that no enabled processor within the entire system receives a quiesce indication before all processors have reached the stopped state. This is also true for systems where nodes are being concurrently added or removed during system operation. Bus states process quiesce requests.
    Type: Grant
    Filed: May 12, 2003
    Date of Patent: May 27, 2008
    Assignee: International Business Machines Corporation
    Inventors: Steven A. Korb, Pak-kin Mak
  • Publication number: 20080109635
    Abstract: General purpose array processing techniques including processing methods and apparatus. Processors may include parallel processing paths designed with reusable computational components such as multipliers, multiplexers, and ALUs. Flow of data through the paths and operations performed may be controlled based on opcodes. Processors may be shared, scalable, and configured to perform matrix operations. In particular, such operation may be useful for physical sections of MIMO-OFDM communication systems.
    Type: Application
    Filed: April 27, 2007
    Publication date: May 8, 2008
    Applicant: QUALCOMM INCORPORATED
    Inventor: Garret Webster Shih
  • Publication number: 20080104367
    Abstract: A system and method for enabling high-speed, low-latency global collective communications among interconnected processing nodes. The global collective network optimally enables collective reduction operations to be performed during parallel algorithm operations executing in a computer structure having a plurality of the interconnected processing nodes. Router devices ate included that interconnect the nodes of the network via links to facilitate performance of low-latency global processing operations at nodes of the virtual network and class structures. The global collective network may be configured to provide global barrier and interrupt functionality in asynchronous or synchronized manner. When implemented in a massively-parallel supercomputing structure, the global collective network is physically and logically partitionable according to needs of a processing algorithm.
    Type: Application
    Filed: July 18, 2005
    Publication date: May 1, 2008
    Inventors: Matthias A. Blumrich, Paul W. Coteus, Dong Chen, Alan Gara, Mark E. Giampapa, Philip Heidelberger, Dirk Hoenicke, Todd E. Takken, Burkhard D. Steinmacher-Burow, Pavlos M. Vranas
  • Patent number: 7356669
    Abstract: A split protocol transmission method for transmitting data and a communication thread identifier for said data along a communication path from a source functional unit (SFU) to a destination functional unit (DFU) via zero or more intermediate functional units (IFU) is described. A data consuming functional unit (CFU) and a data producing functional unit (PFU) in the communication path directly communicate to each other by means of a handshake procedure wherein the data consuming functional unit (CFU) indicates a communication thread identifier (TID) to the data producing functional unit. The data producing functional unit provides data related to said communication thread identifier to said data consuming functional unit. Likewise a data processing system using this method is described.
    Type: Grant
    Filed: May 4, 2004
    Date of Patent: April 8, 2008
    Assignee: Koninklijke Philips Electronics N.V.
    Inventors: Andrei Radulescu, Kees Gerard Willem Goossens
  • Publication number: 20080052490
    Abstract: A sea of computational resources includes a number of computational resources, each of which is a member of one or more nearest neighbor pairings. Each nearest neighbor pairing has an upstream neighbor and a downstream neighbor, and each nearest neighbor pairing transfers data between the upstream neighbor and the downstream neighbor using a nearest neighbor protocol. Generally, atomic units of work are selectively passed from the highest upstream computational resource, which can be accessed by a gateway device or the like, to one or more downstream computational resources, one of which eventually performs the work (for example, data processing, etc.) and then passes the computational result from that work upstream. The atomic units of work can be configured and/or formatted as request packets that can utilize a signature word as a work unit identifier. The computational results can likewise be configured and/or formatted as response packets that also utilize the signature word as a work product identifier.
    Type: Application
    Filed: August 28, 2006
    Publication date: February 28, 2008
    Applicant: Tableau, LLC
    Inventor: Robert C. Botchek
  • Patent number: 7337249
    Abstract: A general bus system is provided which combines a number of internal lines and leads them as a bundle to the terminals. The bus system control is predefined and does not require any influence by the programmer. Any number of memories, peripherals or other units can be connected to the bus system (for cascading).
    Type: Grant
    Filed: June 20, 2007
    Date of Patent: February 26, 2008
    Assignee: Pact XPP Technologies AG
    Inventors: Martin Vorbach, Robert Münch
  • Patent number: 7315934
    Abstract: A data processor has sixteen processing elements that each include a register file and an arithmetic logic unit. A network unit connects between the register files of the processing elements and the arithmetic logic units of the processing elements. The network unit has a selector for simultaneously performing a plurality of data transfers which are each made from a register file of one processing element to an operation unit of another processing element. With the provision of this selector that can perform such simultaneous data transfers, the processing efficiency of the processing elements can be maintained even if a change occurs in operand assignments and the like.
    Type: Grant
    Filed: February 28, 2003
    Date of Patent: January 1, 2008
    Assignee: Matsushita Electric Industrial Co., Ltd.
    Inventors: Hiroyuki Morishita, Atsushi Ito, Satoshi Takashima, Hideshi Nishida, Kozo Kimura, Tokuzo Kiyohara, Akira Miyoshi, Hiroshi Kadota
  • Publication number: 20070294508
    Abstract: A method of generating pseudo-random numbers on a parallel processing system comprises generating a plurality of sub-streams of pseudo-random numbers, wherein the sub-streams are generated in parallel by one or more co-processors, and providing the plurality of sub-streams to respective processing elements, wherein the respective processing elements employ the plurality of sub-streams to execute an application.
    Type: Application
    Filed: March 9, 2007
    Publication date: December 20, 2007
    Inventors: Myles A. Sussman, William Y. Crutchfield, Matthew N. Papakipos
  • Patent number: 7308558
    Abstract: The data interconnect and routing mechanism reduces data communication latency, supports dynamic route determination based upon processor activity level/traffic, and implements an architecture that supports scalable improvements in communication frequencies. In one application, a data processing system includes first and second processing books, each including at least first and second processing units. Each of the first and second processing units has a respective first output data bus. The first output data bus of the first processing unit is coupled to the second processing unit, and the first output data bus of the second processing unit is coupled to the first processing unit. At least the first processing unit of the first processing book and the second processing unit of the second processing book each have a respective second output data bus.
    Type: Grant
    Filed: January 7, 2004
    Date of Patent: December 11, 2007
    Assignee: International Business Machines Corporation
    Inventors: Ravi Kumar Arimilli, Jerry Don Lewis, Vicente Enrique Chung, Jody Bern Joyner
  • Patent number: 7302548
    Abstract: A method for communicating in a multi-processor environment is provided that includes generating a bit at an originating processor associated with a message to be communicated to a destination processor. The bit is positioned in a send register associated with the originating processor and transposed from the send register of the originating processor to a receive register of the destination processor. An interrupt signal is then generated in response to the bit being transposed.
    Type: Grant
    Filed: June 18, 2002
    Date of Patent: November 27, 2007
    Assignee: Cisco Technology, Inc.
    Inventors: John W. Mitten, William R. Lee, Trevor S. Garner, Robert L. King
  • Patent number: 7299339
    Abstract: A field programmable gate array includes a virtual bus interface that receives a control word from a host processor over a standard I/O bus. A configurable very long instruction word (VLIW) controller receives the control word via virtual bus interface signals mapped from the virtual bus interface. A reconfigurable communication and control fabric controls the data paths and programming modes of single instruction-multiple data (SIMD) processing element cells. The configurable VLIW controller has an interface with the reconfigurable communication and control fabric. SIMD processing element cells are controlled by the configurable VLIW controller through the reconfigurable communication and control fabric via the interface.
    Type: Grant
    Filed: August 30, 2004
    Date of Patent: November 20, 2007
    Assignee: The Boeing Company
    Inventor: Tirumale K. Ramesh
  • Publication number: 20070239963
    Abstract: A multiprocessor system is provided, comprising a baseboard, for arranging peripheral equipments; and a plurality of processor modules, each equipped with a processor and a board-to-board connector; wherein the plurality of processor modules are stacked up, with board-to-board connectors being electrically connected between the processor modules and between the processor modules and the baseboard; the processors communicate with the peripheral equipments in accordance with a specific bus specification; and the operations of the plurality of processor modules are coordinated by routes provided between the processor modules and between the processor modules and the baseboard.
    Type: Application
    Filed: November 27, 2006
    Publication date: October 11, 2007
    Applicant: Shenzhen Mindray Bio-Medical Electronics Co., Ltd.
    Inventors: Li Yao, Wei Chen
  • Patent number: 7275145
    Abstract: According to some embodiments, a processing element includes (i) a next neighbor register to receive information directly from a previous processing element in a series of processing elements, and (ii) a previous neighbor register to receive information directly from a next processing element in the series.
    Type: Grant
    Filed: December 24, 2003
    Date of Patent: September 25, 2007
    Assignee: Intel Corporation
    Inventors: Sridhar Lakshmanamurthy, Prashant Chandra, Wilson Y. Liao, Jeen-Yuan Miin, Pun Yim, Chen-Chi Kuo, Jaroslaw J. Sydir
  • Patent number: 7263597
    Abstract: The present invention provides a method and apparatus for improving transmission of control information within a network device and between multiple connected network devices. Specifically, a control path is included within a network device that is independent of the data path and dedicates control path resources to each distributed processor within the network device. Dedicating resources insures that each processor has sufficient bandwidth on the control plane to transmit control information at high frequencies. This may prevent starvation of data transmissions during periods of high control information transfers and may also reduce the likelihood or further spreading of control information storms when one or more network devices in a network experiences a failure.
    Type: Grant
    Filed: April 19, 2001
    Date of Patent: August 28, 2007
    Assignee: CIENA Corporation
    Inventors: Peter B Everdell, Chris R Noel, Brian Branscomb, Nicholas A Langrind