Data Flow Array Processor Patents (Class 712/18)

Low latency offloading of collectives over a switch

Patent number: 12353320

Abstract: A device includes a plurality of processing elements (PEs). A symmetric memory is allocated in each of the plurality of PEs. The device includes a switch connected to the plurality of PEs. The switch is to: receive, from a first processing element (PE) of the plurality of PEs, a message that includes a buffer offset, compute, based on the buffer offset, a first memory address of a first buffer in a first symmetric memory of the first PE and a second memory address of a second buffer in a second symmetric memory of a second PE of the plurality of PEs, and initiate, based on the first memory address and the second memory address, a direct memory access operation to access the first buffer and the second buffer.

Type: Grant

Filed: August 31, 2023

Date of Patent: July 8, 2025

Assignees: ATI Technologies ULC

Inventors: Kishore Punniyamurthy, Richard David Sodke, Furkan Eris, Sergey Blagodurov, Bradford Michael Beckmann, Brandon Keith Potter, Khaled Hamidouche
Computer system and a computer device

Patent number: 12235785

Abstract: A computer system is provided. The computer system includes a hub board, a common bus, and a plurality of Sibling boards. The hub board has an I/O controller hub, which includes a main communication chipset. The plurality of Sibling boards is coupled to the hub board by the common bus. Each of the Sibling boards includes a memory and at least one CPU. The memory is operative to host a Sibling operating system. The CPU is coupled to the memory. The Southbridge type chipset which resides in the hub board is shared amongst the plurality of Sibling boards. At least one of the plurality of Sibling boards functions as a master processing unit of the system. Sibling boards offer processing flexibility through the means of how they are configured in the system.

Type: Grant

Filed: September 21, 2023

Date of Patent: February 25, 2025

Inventor: Jonathan Glickman
System having a hybrid threading processor, a hybrid threading fabric having configurable computing elements, and a hybrid interconnection network

Patent number: 12204363

Abstract: Representative apparatus, method, and system embodiments are disclosed for configurable computing. In a representative embodiment, a system includes an interconnection network, a processor, a host interface, and a configurable circuit cluster. The configurable circuit cluster may include a plurality of configurable circuits arranged in an array; an asynchronous packet network and a synchronous network coupled to each configurable circuit of the array; and a memory interface circuit and a dispatch interface circuit coupled to the asynchronous packet network and to the interconnection network. Each configurable circuit includes instruction or configuration memories for selection of a current data path configuration, a master synchronous network input, and a data path configuration for a next configurable circuit.

Type: Grant

Filed: January 15, 2024

Date of Patent: January 21, 2025

Assignee: Micron Technology, Inc.

Inventor: Tony M. Brewer
External nonvolatile memory with additional functionality

Patent number: 12175118

Abstract: An external nonvolatile memory device that includes a rewritable nonvolatile memory and a CMOS interface is disclosed. The interface includes a clock signal which is input to the external nonvolatile memory device. This clock signal is multiplied by an integer to create a memory serdes clock which is used to clock outgoing data. The memory serdes clock is also used to create a clock that is used to clock the incoming data from the main processing device. The external nonvolatile memory device also includes an encryption/decryption block that encrypts data read from the nonvolatile memory before it is transmitted over the interface, and decrypts data received from the interface before storing it in the nonvolatile memory. The encryption/decryption block may utilize a stream cipher.

Type: Grant

Filed: March 22, 2022

Date of Patent: December 24, 2024

Assignee: Silicon Laboratories Inc.

Inventors: Thomas Saroshan David, Aslam Rafi, Joshua Norem, Adrianus Josephus Bink, Daniel Cooley
Intelligent packet switch

Patent number: 12148032

Abstract: Various techniques are disclosed for offloading the processing of data packets. For example, the offloaded processing can be resident in an intelligent switch, such as an intelligent switch upstream or downstream from an electronic trading platform. The data packets processed through the intelligent switch can be unreliable datagram protocol (UDP) data packets from multiple feeds of financial market data, and the offloaded processing can include line arbitration, gap detection, and/or gap mitigation of redundant copies of UDP data packets from one or more financial market data feeds.

Type: Grant

Filed: September 6, 2022

Date of Patent: November 19, 2024

Assignee: Exegy Incorporated

Inventors: Scott Parsons, David E. Taylor, Ronald S. Indeck
Method for verifying drone included in industrial internet of things system, by using petri-net modeling

Patent number: 11914720

Abstract: A method for verifying a drone included in an industrial Internet of Things (IIoT) system, using a petri-net modeling is disclosed. In an embodiment, the method includes a step of modeling the IIoT system as a hierarchical petri-net (modeling step); and a step of verifying whether the drone has security vulnerability on the basis of the hierarchical petri-net model (verification step), wherein the verification step can determine that a drone has security vulnerability when at least one of a plurality of determination factors provided as places to the hierarchical petri-net model determines that the drone is operating abnormally.

Type: Grant

Filed: May 9, 2019

Date of Patent: February 27, 2024

Assignee: SOONCHUNHYANG UNIVERSITY INDUSTRY ACADEMY COOPERATION FOUNDATION

Inventors: Il Sun You, Vishal Sharma, Gaurav Choudhary, Yong Ho Ko
System having a hybrid threading processor, a hybrid threading fabric having configurable computing elements, and a hybrid interconnection network

Patent number: 11880687

Abstract: Representative apparatus, method, and system embodiments are disclosed for configurable computing. In a representative embodiment, a system includes an interconnection network, a processor, a host interface, and a configurable circuit cluster. The configurable circuit cluster may include a plurality of configurable circuits arranged in an array; an asynchronous packet network and a synchronous network coupled to each configurable circuit of the array; and a memory interface circuit and a dispatch interface circuit coupled to the asynchronous packet network and to the interconnection network. Each configurable circuit includes instruction or configuration memories for selection of a current data path configuration, a master synchronous network input, and a data path configuration for a next configurable circuit.

Type: Grant

Filed: January 26, 2023

Date of Patent: January 23, 2024

Assignee: Micron Technology, Inc.

Inventor: Tony M. Brewer
Terrestrial based high speed data communications mesh network

Patent number: 11876595

Abstract: A network for providing high speed data communications may include multiple terrestrial transmission stations that are located within overlapping communications range and a mobile receiver station. The terrestrial transmission stations provide a continuous and uninterrupted high speed data communications link with the mobile receiver station employing a wireless radio access network protocol.

Type: Grant

Filed: August 12, 2022

Date of Patent: January 16, 2024

Assignee: SMARTSKY NETWORKS LLC

Inventor: Donald Alcorn
Stateless system to enable data breach notification

Patent number: 11843619

Abstract: A method including determining, by the infrastructure device, a breach database including breach information indicating breached data that is compromised due to a data breach; calculating, by the infrastructure device, a hash of the breached data; calculating, by the user device, a hash of private data; transmitting, by the user device, the hash of the private data to the infrastructure device; comparing, by the infrastructure device, the hash of the private data with the hash of the breached data; and transmitting, by the infrastructure device to the user device based at least in part on a result of the comparison, a notification indicating whether the private data is breached due to the data breach is disclosed. Various other aspects are contemplated.

Type: Grant

Filed: October 7, 2022

Date of Patent: December 12, 2023

Assignee: UAB 360 IT

Inventor: Edvinas Tamosiunas
Execution engine for executing single assignment programs with affine dependencies

Patent number: 11681650

Abstract: The execution engine is a new organization for a digital data processing apparatus, suitable for highly parallel execution of structured fine-grain parallel computations. The execution engine includes a memory for storing data and a domain flow program, a controller for requesting the domain flow program from the memory, and further for translating the program into programming information, a processor fabric for processing the domain flow programming information and a crossbar for sending tokens and the programming information to the processor fabric.

Type: Grant

Filed: May 28, 2021

Date of Patent: June 20, 2023

Assignee: Stillwater Supercomputing, Inc.

Inventor: Erwinus Theodorus Leonardus Omtzigt
Methods and systems for searching

Patent number: 11586674

Abstract: This invention relates to methods and systems for searching. It is particularly applicable to methods of searching which enable efficient identification of compatible portfolios. Embodiments of the invention propose methods of searching which address the huge search space issue associated with identifying compatible portfolios. In particular, embodiments of the invention start their search operations simultaneously from both sides by both trying to form valid portfolios from candidate products until a valid solution is found and trying to find conflicts from the defined compatibility rules until a conflict is found which leads to the conclusion that no valid solution exists. A conclusion from either process will stop the whole searching process which can significantly reduce blind and unnecessary searching in the whole search space.

Type: Grant

Filed: December 28, 2016

Date of Patent: February 21, 2023

Assignees: Khalifa University of Science and Technology, British Telecommunications plc, Emirates Telecommunications Corporation

Inventors: Di Wang, Ivan Boyd, Ahmad Al-Rubaie, Jason Ng
System having a hybrid threading processor, a hybrid threading fabric having configurable computing elements, and a hybrid interconnection network

Patent number: 11579887

Abstract: Representative apparatus, method, and system embodiments are disclosed for configurable computing. In a representative embodiment, a system includes an interconnection network, a processor, a host interface, and a configurable circuit cluster. The configurable circuit cluster may include a plurality of configurable circuits arranged in an array; an asynchronous packet network and a synchronous network coupled to each configurable circuit of the array; and a memory interface circuit and a dispatch interface circuit coupled to the asynchronous packet network and to the interconnection network. Each configurable circuit includes instruction or configuration memories for selection of a current data path configuration, a master synchronous network input, and a data path configuration for a next configurable circuit.

Type: Grant

Filed: July 10, 2021

Date of Patent: February 14, 2023

Assignee: Micron Technology, Inc.

Inventor: Tony M. Brewer
Intelligent switch for processing financial market data

Patent number: 11436672

Abstract: Various techniques are disclosed for offloading the processing of data packets. For example, incoming data packets can be processed through an offload processor to generate a new stream of outgoing data packets that organize data from the data packets in a manner different than the incoming data packets. Furthermore, in an exemplary embodiment, the offloaded processing can be resident in an intelligent switch, such as an intelligent switch upstream or downstream from an electronic trading platform.

Type: Grant

Filed: March 3, 2014

Date of Patent: September 6, 2022

Assignee: Exegy Incorporated

Inventors: Scott Parsons, David E. Taylor, Ronald S. Indeck
Computing efficient cross channel operations in parallel computing machines using systolic arrays

Patent number: 11182337

Abstract: An apparatus to facilitate computing efficient cross channel operations in parallel computing machines using systolic arrays is disclosed. The apparatus includes a plurality of registers and one or more processing elements communicably coupled to the plurality of registers. The one or more processing elements include a systolic array circuit to perform cross-channel operations on source data received from a single source register of the plurality of registers, the systolic array circuit modified to receive inputs from the single source register and route elements of the single source register to multiple channels in the systolic array circuit.

Type: Grant

Filed: June 12, 2020

Date of Patent: November 23, 2021

Assignee: INTEL CORPORATION

Inventors: Subramaniam Maiyuran, Jorge Parra, Supratim Pal, Chandra Gurram
Digital signal processing array using integrated processing elements

Patent number: 11080053

Abstract: Techniques and mechanisms described herein include a signal processor implemented as an overlay on a field-programmable gate array (FPGA) device that utilizes special purpose, hardened intellectual property (IP) modules such as memory blocks and digital signal processing (DSP) cores. A Processing Element (PE) is built from one or more DSP cores connected to additional logic. Interconnected as an array, the PEs may operate in a computational model such as Single Instruction-Multiple Thread (SIMT). A software hierarchy is described that transforms the SIMT array into an effective signal processor.

Type: Grant

Filed: March 18, 2020

Date of Patent: August 3, 2021

Assignee: Nextera Video, Inc.

Inventors: John E. Deame, Steven Kaufmann, Liviu Voicu
Execution engine for executing single assignment programs with affine dependencies

Patent number: 11030146

Abstract: The execution engine is a new organization for a digital data processing apparatus, suitable for highly parallel execution of structured fine-grain parallel computations. The execution engine includes a memory for storing data and a domain flow program, a controller for requesting the domain flow program from the memory, and further for translating the program into programming information, a processor fabric for processing the domain flow programming information and a crossbar for sending tokens and the programming information to the processor fabric.

Type: Grant

Filed: April 1, 2019

Date of Patent: June 8, 2021

Assignee: Stillwater Supercomputing, Inc.

Inventor: Erwinus Theodorus Leonardus Omtzigt
Execution engine for executing single assignment programs with affine dependencies

Patent number: 10289606

Abstract: The execution engine is a new organization for a digital data processing apparatus, suitable for highly parallel execution of structured fine-grain parallel computations. The execution engine includes a memory for storing data and a domain flow program, a controller for requesting the domain flow program from the memory, and further for translating the program into programming information, a processor fabric for processing the domain flow programming information and a crossbar for sending tokens and the programming information to the processor fabric.

Type: Grant

Filed: August 22, 2017

Date of Patent: May 14, 2019

Assignee: Stillwater Supercomputing, Inc.

Inventor: Erwinus Theodorus Leonardus Omtzigt
Generating and executing a control flow

Patent number: 10061590

Abstract: The present disclosure provide apparatuses and methods related to generating and executing a control flow. An example apparatus can include a first device configured to generate control flow instructions, and a second device including an array of memory cells, an execution unit to execute the control flow instructions, and a controller configured to control an execution of the control flow instructions on data stored in the array.

Type: Grant

Filed: December 28, 2015

Date of Patent: August 28, 2018

Assignee: Micron Technology, Inc.

Inventors: Kyle B. Wheeler, Richard C. Murphy, Troy A. Manning, Dean A. Klein
Generating and executing a control flow

Patent number: 10025593

Abstract: The present disclosure provide apparatuses and methods related to generating and executing a control flow. An example apparatus can include a first device configured to generate control flow instructions, and a second device including an array of memory cells, an execution unit to execute the control flow instructions, and a controller configured to control an execution of the control flow instructions on data stored in the array.

Type: Grant

Filed: December 28, 2015

Date of Patent: July 17, 2018

Assignee: Micron Technology, Inc.

Inventors: Kyle B. Wheeler, Richard C. Murphy, Troy A. Manning, Dean A. Klein
Instruction and logic for a matrix scheduler

Patent number: 9851976

Abstract: A processor includes a core and a scheduler. The scheduler includes first and second dependency matrices and a ready determination unit. The scheduler also includes logic to queue a first parent operation, a second parent operation, and a child operation that includes first and second sources dependent on the first and second parent operations. The scheduler also includes logic to store physical addresses of the first and second sources of the child operation respectively in the first and second dependency matrices. Further, the scheduler includes logic to perform a tag comparisons between the respective physical addresses of the destinations of the first and second parent operations respectively with the respective physical address of the first and second sources of the child operation. In addition, the ready determination unit includes logic to determine that the child operation is ready for dispatch based on the tag comparisons.

Type: Grant

Filed: December 23, 2014

Date of Patent: December 26, 2017

Assignee: Intel Corporation

Inventors: Wing Shek Wong, James E. Phillips
System and method for compiling machine-executable code generated from a sequentially ordered plurality of processor instructions

Patent number: 9477470

Abstract: A method and system are provided for deriving a resultant software program from an originating software program having overlapping branches, wherein the resultant software project has either no overlapping branches or fewer overlapping branches than the originating software program. A preferred embodiment of the invented method generates a resultant software program that has no overlapping branches. The resultant software is more easily converted into programming reconfigurable logic than the originating software program. Separate and individually applicable aspects of the invented method are used to eliminate all four possible states of two overlapping branches, i.e., forward branch overlapping forward branch, back branch overlapping back branch, and each of the two possible and distinguishable states of forward branch and back branch overlap. One or more elements of each aspect of the invention may be performed by one or more computers or processors, or by means of a computer or a communications network.

Type: Grant

Filed: October 7, 2014

Date of Patent: October 25, 2016

Inventor: Robert Keith Mykland
Maintaining secure data isolated from non-secure access when switching between domains

Patent number: 9477834

Abstract: A data processing apparatus including circuitry for performing data processing, a plurality of registers; and a data store including regions having different secure levels, at least one secure region (for storing sensitive data accessible by the data processing circuitry operating in the secure domain and not accessible by the data processing circuitry operating in a less secure domain) and a less secure region (for storing less secure data). The circuitry is configured to determine which stack to store data to, or load data from, in response to the storage location of the program code being executed. In response to program code calling a function to be executed, the function code being stored in a second region, the second region having a different secure level to the first region, the data processing circuitry is configured to determine which of the first and second region have a lower secure level.

Type: Grant

Filed: February 8, 2012

Date of Patent: October 25, 2016

Assignee: ARM Limited

Inventors: Thomas Christopher Grocutt, Richard Roy Grisenthwaite
Data processor chip with flexible bus system

Patent number: 9256575

Abstract: A data processor chip having a two-dimensional array of arithmetic logic units and memory where the arithmetic logic units are in communication with memory units in one dimension and with other arithmetic units in a second.

Type: Grant

Filed: May 21, 2015

Date of Patent: February 9, 2016

Assignee: PACT XPP TECHNOLOGIES AG

Inventors: Martin Vorbach, Frank May, Dirk Reichardt, Frank Lier, Gerd Ehlers, Armin Nückel, Volker Baumgarte, Prashant Rao, Jens Oertel
Devices, systems, and methods to synchronize parallel processing of a single data stream

Patent number: 9164945

Abstract: Disclosed are methods and devices, among which is a system that includes one or more pattern-recognition processors, such as in a pattern-recognition cluster. The pattern-recognition processors may be activated to perform a search of a data stream individually using a chip select or in parallel using a universal select signal. In this manner, the plurality of pattern-recognition processors may be enabled concurrently for synchronized processing of the data stream.

Type: Grant

Filed: December 1, 2008

Date of Patent: October 20, 2015

Assignee: Micron Technology, Inc.

Inventor: Harold B Noyes
Handheld imaging device with multi-core image processor integrating common bus interface and dedicated image sensor interface

Patent number: 9148530

Abstract: A handheld imaging device includes an image sensor for sensing an image; a multi-core processor for processing the sensed image; and a program memory provided external to the multi-core processor, and communicating therewith via a communication bus. The multi-core processor includes a bus interface for interfacing with the communication bus, and further includes an image sensor interface for interfacing with the image sensor separately from the communication bus and the bus interface. The multi-core processor includes a plurality of parallel processing units connected by a crossbar switch to form the multi-core.

Type: Grant

Filed: September 15, 2012

Date of Patent: September 29, 2015

Assignee: Google Inc.

Inventor: Kia Silverbrook
Handheld imaging device with multi-core image processor integrating common bus interface and dedicated image sensor interface

Patent number: 9113008

Abstract: A handheld imaging device includes an image sensor for sensing an image; a multi-core processor for processing the sensed image; and a program memory provided external to the multi-core processor, and communicating therewith via a communication bus. The multi-core processor includes a bus interface for interfacing with the communication bus, and further includes an image sensor interface for interfacing with the image sensor separately from the communication bus and the bus interface. The multi-core processor includes a plurality of parallel processing units connected by a crossbar switch to form the multi-core.

Type: Grant

Filed: September 15, 2012

Date of Patent: August 18, 2015

Assignee: Google Inc.

Inventor: Kia Silverbrook
Handheld imaging device with multi-core image processor integrating common bus interface and dedicated image sensor interface

Patent number: 9060081

Abstract: A handheld imaging device includes an image sensor for sensing an image; a multi-core processor for processing the sensed image; and a program memory provided external to the multi-core processor, and communicating therewith via a communication bus. The multi-core processor includes a bus interface for interfacing with the communication bus, and further includes an image sensor interface for interfacing with the image sensor separately from the communication bus and the bus interface. The multi-core processor includes a plurality of parallel processing units connected by a crossbar switch to form the multi-core.

Type: Grant

Filed: September 15, 2012

Date of Patent: June 16, 2015

Assignee: Google Inc.

Inventor: Kia Silverbrook
Hum generation circuitry

Patent number: 8952727

Abstract: Systems and methods for clock generation and distribution are disclosed. Embodiments include arrangements of synchronization signals implemented using a mesh circuit. The mesh circuit is comprised of a plurality of null convention logic (NCL) gates organized into rings. Each ring shares at least one NCL gate with an adjacent ring. The rings are configured in such a way that each ring in the mesh operates synchronously with the other rings in the mesh.

Type: Grant

Filed: August 19, 2013

Date of Patent: February 10, 2015

Assignee: Wave Semiconductor, Inc.

Inventors: Scott E Johnston, Karl Michael Fant
Processor architecture with switch matrices for transferring data along buses

Patent number: 8904148

Abstract: There is described a processor architecture, comprising: a plurality of first bus pairs, each first bus pair including a respective first bus running in a first direction (for example, left to right) and a respective second bus running in a second direction opposite to the first direction (for example right to left); a plurality of second bus pairs, each second bus pair including a respective third bus running in a third direction (for example downwards) and a respective fourth bus running in a fourth direction opposite to the third direction (for example upwards), the third and fourth buses intersecting the first and second buses; a plurality of switch matrices, each switch matrix located at an intersection of a first and a second pair of buses; a plurality of elements arranged in an array, each element being arranged to receive data from a respective first or second bus, and transfer data to a respective first or second bus.

Type: Grant

Filed: July 5, 2011

Date of Patent: December 2, 2014

Assignee: Intel Corporation

Inventors: Anthony Peter John Claydon, Anne Patricia Claydon
Method and apparatus for a general-purpose, multiple-core system for implementing stream-based computations

Patent number: 8843928

Abstract: A method and system of efficient use and programming of a multi-processing core device. The system includes a programming construct that is based on stream-domain code. A programmable core based computing device is disclosed. The computing device includes a plurality of processing cores coupled to each other. A memory stores stream-domain code including a stream defining a stream destination module and a stream source module. The stream source module places data values in the stream and the stream conveys data values from the stream source module to the stream destination module. A runtime system detects when the data values are available to the stream destination module and schedules the stream destination module for execution on one of the plurality of processing cores.

Type: Grant

Filed: January 21, 2011

Date of Patent: September 23, 2014

Assignee: QST Holdings, LLC

Inventors: Paul Master, Frederick Furtek
Execution engine for executing single assignment programs with affine dependencies

Patent number: 8688956

Abstract: The execution engine is a new organization for a digital data processing apparatus for highly parallel execution of structured fine-grain parallel computations. The execution engine includes a memory for storing data and a domain flow program, a controller for requesting the domain flow program from the memory, and further for translating the program into programming information, a processor fabric for processing the domain flow programming information and a crossbar for sending tokens and the programming information to the processor fabric.

Type: Grant

Filed: May 18, 2009

Date of Patent: April 1, 2014

Assignee: Stillwater Supercomputing, Inc.

Inventor: Erwinus Theodorus Leonardus Omtzigt
SPLIT EMBEDDED DRAM PROCESSOR

Publication number: 20140032878

Abstract: A processing architecture includes a first CPU core portion coupled to a second embedded dynamic random access memory (DRAM) portion. These architectural components jointly implement a single processor and instruction set. Advantageously, the embedded logic on the DRAM chip implements the memory intensive processing tasks, thus reducing the amount of traffic that needs to be bussed back and forth between the CPU core and the embedded DRAM chips. The embedded DRAM logic monitors and manipulates the instruction stream into the CPU core. The architecture of the instruction set, data paths, addressing, control, caching, and interfaces are developed to allow the system to operate using a standard programming model. Specialized video and graphics processing systems are developed. Also, an extended very long instruction word (VLIW) architecture implemented as a primary VLIW processor coupled to an embedded DRAM VLIW extension processor efficiently deals with memory intensive tasks.

Type: Application

Filed: July 15, 2013

Publication date: January 30, 2014

Inventor: Eric M. Dowling
Packet draining from a scheduling hierarchy in a traffic manager of a network processor

Patent number: 8638805

Abstract: Described embodiments provide for restructuring a scheduling hierarchy of a network processor having a plurality of processing modules and a shared memory. The scheduling hierarchy schedules packets for transmission. The network processor generates tasks corresponding to each received packet associated with a data flow. A traffic manager receives tasks provided by one of the processing modules and determines a queue of the scheduling hierarchy corresponding to the task. The queue has a parent scheduler at each of one or more next levels of the scheduling hierarchy up to a root scheduler, forming a branch of the hierarchy. The traffic manager determines if the queue and one or more of the parent schedulers of the branch should be restructured. If so, the traffic manager drops subsequently received tasks for the branch, drains all tasks of the branch, and removes the corresponding nodes of the branch from the scheduling hierarchy.

Type: Grant

Filed: September 30, 2011

Date of Patent: January 28, 2014

Assignee: LSI Corporation

Inventors: Balakrishnan Sundararaman, Shashank Nemawarkar, David Sonnier, Shailendra Aulakh, Allen Vestal
Coupling data for interrupt processing in a parallel processing environment

Patent number: 8190855

Abstract: An integrated circuit comprises a plurality of tiles. Each tile comprises a processor, and a switch including switching circuitry to forward data received over data paths from other tiles to the processor and to switches of other tiles, and to forward data received from the processor to switches of other tiles. The integrated circuit further comprises one or more interface modules including circuitry to transfer data to and from a device external to the tiles; and a sub-port routing network including circuitry to route data between a port of a switch and a plurality of sub-ports coupled to one or more interface modules.

Type: Grant

Filed: February 25, 2008

Date of Patent: May 29, 2012

Assignee: Tilera Corporation

Inventors: Carl G. Ramey, David Wentzlaff, Anant Agarwal
Data processing device and method

Patent number: 8145881

Abstract: A data processing device comprising a multidimensional array of coarse grained logic elements processing data and operating at a first clock rate and communicating with one another and/or other elements via busses and/or communication lines operated at a second clock rate is disclosed, wherein the first clock rate is higher than the second and wherein the coarse grained logic elements comprise storage means for storing data needed to be processed.

Type: Grant

Filed: October 24, 2008

Date of Patent: March 27, 2012

Inventors: Martin Vorbach, Alexander Thomas
Processor architectures for enhanced computational capability and low latency

Patent number: 8108653

Abstract: A processor includes a compute array comprising a first plurality of compute engines serially connected along a data flow path such that data flows between successive compute engines at successive times. The first plurality of compute engines includes an initial compute engine and a final compute engine. The data flow path includes a recirculation path connecting the final compute engine to the initial compute engine with no compute engine therebetween.

Type: Grant

Filed: February 5, 2010

Date of Patent: January 31, 2012

Assignee: Analog Devices, Inc.

Inventors: Boris Lerner, Douglas Garde
Multi-channel timing recovery system

Patent number: 8094768

Abstract: The present invention discloses a novel multi-channel timing recovery scheme that utilizes a shared CORDIC to accurately compute the phase for each tone. Then a hardware-based linear combiner module is used to reconstruct the best phase estimate from multiple phase measurements. The firmware monitors the noise variance for the pilot tones and determines the corresponding weight for each tone to ensure that the minimum phase jitter noise is achieved through the linear combiner. Then a hardware-based second-order timing recovery control loop generates the frequency reference signal for VCXO or DCXO. A single sequentially controlled multiplier is used for all multiplications in the control loop.

Type: Grant

Filed: December 21, 2006

Date of Patent: January 10, 2012

Assignee: Triductor Technology (Suzhou) Inc.

Inventor: Yaolong Tan
Computing nodes for executing groups of instructions

Patent number: 8055881

Abstract: A computation node according to various embodiments of the invention includes at least one input port capable of being coupled to at least one first other 5 computation node, a first store coupled to the input port(s) to store input data, a second store to receive and store instructions, an instruction wakeup unit to match the input data to the instructions, at least one execution unit to execute the instructions, using the input data to produce output data, and at least one output port capable of being coupled to at least one second other computation node. The node may also include a router to direct the output data from the output port(s) to the second other node. A system according to various embodiments of the invention includes and external instruction sequencer to fetch a group of instructions, and one or more interconnected, preselected computational nodes.

Type: Grant

Filed: June 10, 2008

Date of Patent: November 8, 2011

Assignee: Board of Regents, University of Texas System

Inventors: Douglas C. Burger, Stephen W. Keckler, Karthikevan Sankaralingam, Ramadass Nagarajan
Processor architecture with switch matrices for transferring data along buses

Patent number: 7996652

Abstract: A processor architecture includes a plurality of elements arranged in an array of rows and columns and a plurality of first and second bus pairs with the first pair being located between different adjacent rows of the array and having first and second buses running in opposite directions and the second bus pair being located between different adjacent columns of the array and having third and fourth buses running in opposite directions and intersecting the first and second buses. A plurality of switch matrices located at an intersection of one of the first bus pairs and one of the second bus pairs includes inputs and outputs for first, second, third and fourth buses and switch elements for switchably connecting the inputs and outputs.

Type: Grant

Filed: February 21, 2008

Date of Patent: August 9, 2011

Inventors: Anthony Peter John Claydon, Anne Patricia Claydon
CROSS FLOW PARALLEL PROCESSING METHOD AND SYSTEM

Publication number: 20110113218

Abstract: Provided is a cross flow parallel processing method and system that may process multiple data flows and increase a parallel processing rate in a multi-processor that processes multiple cross data flows.

Type: Application

Filed: October 18, 2010

Publication date: May 12, 2011

Applicant: Electronics and Telecommunications Research Institute

Inventors: Jung Hee LEE, Bhum Cheol LEE, Tae Sik CHEUNG
MULTILAYER PARALLEL PROCESSING APPARATUS AND METHOD

Publication number: 20110107059

Abstract: A multilayer parallel processing apparatus. The multilayer parallel processing apparatus includes two or more hierarchical parallel processing units, each configured to process flow data corresponding to a hierarchy that is allocated thereto in response to inputting pieces of flow data configured with two or more hierarchies, and a common database configured to be accessed by the two or more hierarchical parallel processing units and store processing results of each of the hierarchical parallel processing units.

Type: Application

Filed: November 1, 2010

Publication date: May 5, 2011

Applicant: Electronics and Telecommunications Research Institute

Inventors: Sang Yoon OH, Bhum-Cheol LEE, Jung-Hee LEE, Dong-Myoung BAEK, Seung-Woo LEE
RUNTIME CONFIGURABLE ARITHMETIC AND LOGIC CELL

Publication number: 20110010523

Abstract: A cascadable arithmetic and logic unit (ALU) which is configurable in function and interconnection. No decoding of commands is needed during execution of the algorithm. The ALU can be reconfigured at run time without any effect on surrounding ALUs, processing units or data streams. The volume of configuration data is very small, which has positive effects on the space required and the configuration speed. Broadcasting is supported through the internal bus systems in order to distribute large volumes of data rapidly and efficiently. The ALU is equipped with a power-saving mode to shut down power consumption completely. There is also a clock rate divider which makes it possible to operate the ALU at a slower clock rate. Special mechanisms are available for feedback on the internal states to the external controllers.

Type: Application

Filed: July 27, 2010

Publication date: January 13, 2011

Inventors: Martin VORBACH, Robert Münch
Data processing device and method

Patent number: 7844796

Abstract: A data processing device comprising a multidimensional array of coarse grained logic elements processing data and operating at a first clock rate and communicating with one another and/or other elements via busses and/or communication lines operated at a second clock rate is disclosed, wherein the first clock rate is higher than the second and wherein the coarse grained logic elements comprise storage means for storing data needed to be processed.

Type: Grant

Filed: August 30, 2004

Date of Patent: November 30, 2010

Inventors: Martin Vorbach, Alexander Thomas
Communicating signals between semiconductor chips using round-robin-coupled micropipelines

Patent number: 7831810

Abstract: Embodiments of the present invention provide a system for transferring data between a receiver chip and a transmitter chip. The system includes a set of data path circuits in the transmitter chip and a set of data path circuits in the receiver chip coupled to a shared data channel. In addition, the system includes a set of asynchronous control circuits for controlling corresponding data path circuits in the transmitter chip and receiver chip. Upon detecting the transition of a control signal for an asynchronous control circuit in the transmitter chip, the asynchronous control circuit is configured to enable a transfer of data from the corresponding data path circuit in the transmitter chip across the data channel to a corresponding data path circuit in the receiver chip, and generate a control signal to cause a next asynchronous control circuit to commence the transfer of a data signal.

Type: Grant

Filed: October 2, 2007

Date of Patent: November 9, 2010

Assignee: Oracle America, Inc.

Inventor: Scott M. Fairbanks
DATA PROCESSING DEVICE AND METHOD

Publication number: 20100241823

Abstract: A data processing device comprising a multidimensional array of coarse grained logic elements processing data and operating at a first clock rate and communicating with one another and/or other elements via busses and/or communication lines operated at a second clock rate is disclosed, wherein the first clock rate is higher than the second and wherein the coarse grained logic elements comprise storage means for storing data needed to be processed.

Type: Application

Filed: June 1, 2010

Publication date: September 23, 2010

Inventors: Martin VORBACH, Alexander Thomas
Register access protocol in a multihreaded multi-core processor

Patent number: 7747771

Abstract: A method and mechanism for managing access to a plurality of registers in a processing device are contemplated. A processing device includes multiple nodes coupled to a ring bus, each of which include one or more registers which may be accessed by processes executing within the device. Also coupled to the ring bus is a ring control unit which is configured to initiate transactions targeted to nodes on the ring bus. Each of the nodes are configured receive and process bus transaction with a fixed latency whether or not the first transaction is targeted to the receiving node. The ring control unit is configured to periodically convey idle transactions on the ring bus in order to allow nodes responding to indeterminate transactions to gain access to the bus.

Type: Grant

Filed: June 30, 2004

Date of Patent: June 29, 2010

Assignee: Oracle America, Inc.

Inventors: Manish Shah, Robert T. Golla, Mark A. Luttrell, Gregory F. Grohoski
Processor having a dedicated hash unit integrated within

Patent number: 7743235

Abstract: A parallel hardware-based multithreaded processor is described. The processor includes a general purpose processor that coordinates system functions and a plurality of microengines that support multiple hardware threads or contexts. The processor also includes a memory control system that has a first memory controller that sorts memory references based on whether the memory references are directed to an even bank or an odd bank of memory and a second memory controller that optimizes memory references based upon whether the memory references are read references or write references. Instructions for switching and branching based on executing contexts are also disclosed.

Type: Grant

Filed: June 6, 2007

Date of Patent: June 22, 2010

Assignee: Intel Corporation

Inventors: Gilbert Wolrich, Matthew Adiletta, William R. Wheeler
Stream processor and information processing apparatus

Patent number: 7680962

Abstract: An array type processor comprises a data path unit to execute processing, and a state management unit to control the state of the data path unit in accordance with a command that specifies processing on the data. An input DMA circuit reads from a memory information and data to be processed including a command corresponding to the data. The input DMA circuit first transfers the command to the state management unit, and then transfers the data to be processed to the data path unit.

Type: Grant

Filed: December 21, 2005

Date of Patent: March 16, 2010

Assignee: NEC Electronics Corporation

Inventors: Kenichiro Anjo, Katsumi Togawa, Ryoko Sasaki, Taro Fujii, Masato Motomura
Resource sharing in multiple parallel pipelines

Patent number: 7653804

Abstract: A signal processing network and method for generating code for such a signal processing network are described. Pipeline blocks are each coupled to receive control signaling and associated information signaling from a scheduler. Each of the pipeline blocks respectively includes an allocation unit, a pipeline, and section controllers. The allocation unit is configured to provide a lock signal and sequence information to the section controllers in each of the pipeline blocks. The section controllers are configured to maintain in order inter-pipeline execution of the sequence responsive to the sequence information and the lock signal.

Type: Grant

Filed: January 26, 2006

Date of Patent: January 26, 2010

Assignee: Xilinx, Inc.

Inventors: Thomas A. Lenart, Jorn W. Janneck
Processing in pipelined computing units with data line and circuit configuration rule signal line

Patent number: 7653805

Abstract: A semiconductor device for performing data processing by performing a plurality of computations in cycles includes a pipeline formed by connecting a plurality of computing units in series, each of the computing units including: a data line for receiving data; a control line for receiving a rule signal; a circuit information control unit configured to store, before data processing, several circuit information items, and to output a first one of the several circuit information items according to the rule signal received via the control line in a first cycle of the data processing; a processing element configured to construct an execution circuit according to the first circuit information item, to perform a computation using data from the data line, and to output a computation result; a data register for storing the computation result, and for outputting the computation result in a second cycle; and a control register for storing the rule signal and for outputting the rule signal in the second cycle.

Type: Grant

Filed: March 23, 2007

Date of Patent: January 26, 2010

Assignee: Kabushiki Kaisha Toshiba

Inventors: Takashi Yoshikawa, Shigehiro Asano, Yutaka Yamada

1 2 3 next