Patents Assigned to Graphcore Limited

Method of testing a stacked integrated circuit device

Patent number: 12032021

Abstract: A method for testing a stacked integrated circuit device comprising a first die and a second die, the method comprising: sending from testing logic of the first die, first testing control signals to first testing apparatus on the first die; in response to the first testing control signals, the first testing apparatus running a first one or more tests for testing functional logic or memory of the first die; sending from the testing logic of the first die, second testing control signals to the second die via through silicon vias formed in a substrate of the first die; and in dependence upon the second testing control signals from the first die, running a second one or more tests for testing the stacked integrated circuit device.

Type: Grant

Filed: September 22, 2022

Date of Patent: July 9, 2024

Assignee: Graphcore Limited

Inventors: Stephen Felix, Phillip Horsfield
Processor repair

Patent number: 12019527

Abstract: A processor comprises a plurality of processing units, wherein there is a fixed transmission time for transmitting a message from a sending processing unit to a receiving processing unit, based on the physical positions of the sending and receiving processing units in the processor. The processing units are arranged in a column, and the fixed transmission time depends on the position of a processing circuit in the column. An exchange fabric is provided for exchanging messages between sending and receiving processing units, the columns being arranged with respect to the exchange fabric such that the fixed transmission time depends on the distances of the processing circuits with respect to the exchange fabric.

Type: Grant

Filed: September 10, 2021

Date of Patent: June 25, 2024

Assignee: Graphcore Limited

Inventor: Stephen Felix
Processing device using variable stride pattern

Patent number: 12013781

Abstract: For certain applications, parts of the application data held in memory of a processing device (e.g. that are produced as a result of operations performed by the execution unit) are arranged in regular repeating patterns in the memory, and therefore, the execution unit may set up a suitable striding pattern for use by a send engine. The send engine accesses the memory at locations in accordance with the configured striding pattern so as to access a plurality of items of data that are arranged together in a regular pattern. In a similar manner as done for sends, the execution may set up a striding pattern for use by a receive engine. The receive engine, upon receiving a plurality of items of data, causes those items of data to be stored at locations in the memory, as determined in accordance with the configured striding pattern.

Type: Grant

Filed: March 1, 2022

Date of Patent: June 18, 2024

Assignee: Graphcore Limited

Inventors: Sam Chesney, Alan Graham Alexander, Richard Luke Southwell Osborne, Edward Andrews
Controlling a processor clock

Patent number: 12001263

Abstract: There is disclosed a method of controlling the frequency of a clock signal in a processor. The method selects a first clock generator to provide a processor clock signal for executing an application. If a threshold event is detected, a second clock generator is selected. The method reduces the frequency of a clock signal generated by the first clock generator while a processor clock signal is being provided for execution of an application from the second clock generator. The second clock generator generates a clock at a lower speed than the first clock generator. After a predetermined time, the first clock generator is reselected to provide the processor clock signal. The threshold detection is repeated until an optimum clock frequency is discovered.

Type: Grant

Filed: April 5, 2023

Date of Patent: June 4, 2024

Assignee: Graphcore Limited

Inventors: Stephen Felix, Mrudula Gore
Use of multiple different variants of floating point number formats in floating point operations on a per-operand basis

Patent number: 11966740

Abstract: A processor comprising: a register file comprising a group of operand registers for holding data values, each operand register being a fixed number of bits in length for holding a respective data value of that length; and processing logic comprising floating point logic for performing floating point operations on data values in the register file, the floating point logic is configured to process the fixed number of bits in the respective data value according to a floating point format comprising a set of mantissa bits and a set of exponent bits. The processing logic is operable to select between a plurality of different variants of the floating point format, at least some of the variants having a different size sets of mantissa bits and exponent bits relative to one another.

Type: Grant

Filed: August 10, 2021

Date of Patent: April 23, 2024

Assignee: Graphcore Limited

Inventors: Mrudula Gore, Alan Alexander
Allocating variables to computer memory

Patent number: 11762641

Abstract: A method of allocating variables to computer memory includes determining at compile time when each of the plurality of variables is live in a memory region and allocating a memory region to each variable wherein at least some variables are allocated at compile time to overlapping memory regions to be stored in those memory regions at runtime at non-overlapping times.

Type: Grant

Filed: December 3, 2020

Date of Patent: September 19, 2023

Assignee: Graphcore Limited

Inventors: Godfrey Da Costa, Timothy David Hutt
Gateway to gateway synchronisation

Patent number: 11740946

Abstract: A gateway in a computing system for interfacing a host with a subsystem for acting as a work accelerator to the host, the gateway having: an accelerator interface for enabling the transfer of batches of data to the subsystem at pre-compiled data exchange synchronisation points attained by the subsystem; a data connection interface for receiving data to be processed from storage; and a gateway interface for connection to a third gateway. The gateway is configured to store a number of credits indicating at least one of: the availability of data for transfer to the subsystem at a pre-compiled data exchange synchronisation point; and the availability of storage for receiving data from the subsystem at a pre-compiled data exchange synchronisation point. The gateway uses these credits to control whether or not synchronisation barrier is passed by transmitting synchronisation requests upstream to the third gateway or simply acknowledging the requests received.

Type: Grant

Filed: December 28, 2018

Date of Patent: August 29, 2023

Assignee: Graphcore Limited

Inventors: Ola Tørudbakken, Daniel John Pelham Wilkinson, Brian Manula, Harald Høeg
Handling exceptions in a multi-tile processing arrangement

Patent number: 11645081

Abstract: A multitile processing system has an execution unit on each tile, and an interconnect which conducts communications between the tiles according to a bulk synchronous parallel scheme. Each tile performs an on-tile compute phase followed by an intertile exchange phase, where the exchange phase is held back until all tiles in a particular group have completed the compute phase. On completion of the compute phase, each tile generates a synchronisation request and pauses an issue of instructions until it receives a synchronisation acknowledgement. If a tile attains an excepted state, it raises an exception signal and pauses instruction issue until the excepted state has been resolved. However, tiles which are not in the excepted state can continue to perform their on-tile computer phase, and will issue their own synchronisation request in their own normal time frame.

Type: Grant

Filed: May 22, 2019

Date of Patent: May 9, 2023

Assignee: Graphcore Limited

Inventors: Alan Graham Alexander, Matthew David Fyles
Data through gateway

Patent number: 11615038

Abstract: A gateway for use in a computing system to interface a host with the subsystem for acting as a work accelerator to the host, the gateway having: an accelerator interface for connection to the subsystem to enable transfer of batches of data between the subsystem and the gateway; a data connection interface for connection to external storage for exchanging data between the gateway and storage; a gateway interface for connection to at least one second gateway; a memory interface connected to a local memory associated with the gateway; and a streaming engine for controlling the streaming of batches of data into and out of the gateway in response to pre-compiled data exchange synchronisation points attained by the subsystem, wherein the streaming of batches of data are selectively via at least one of the accelerator interface, data connection interface, gateway interface and memory interface.

Type: Grant

Filed: December 28, 2018

Date of Patent: March 28, 2023

Assignee: Graphcore Limited

Inventors: Ola Tørudbakken, Brian Manula, Harald Høeg
Repeat instruction for loading and/or executing code in a claimable repeat cache a specified number of times

Patent number: 11567768

Abstract: A processor is disclosed including: a barrel-threaded execution unit for executing concurrent threads, and a repeat cache shared between the concurrent threads. The processor's instruction set includes a repeat instruction which takes a repeat count operand. When the repeat cache is not claimed and the repeat instruction is executed in a first thread, a portion of code is cached from the first thread into the repeat cache, the state of the repeat cache is changed to record it as claimed, and the cached code is executed a number of times. When the repeat instruction is then executed in a further thread, then the already-cached portion of code is again executed a respective number of times, each time from the repeat cache. For each of the first and further instructions, the repeat count operand in the respective instruction specifies the number of times to execute the cached code.

Type: Grant

Filed: February 15, 2019

Date of Patent: January 31, 2023

Assignee: Graphcore Limited

Inventors: Alan Graham Alexander, Simon Christian Knowles, Mrudula Chidambar Gore, Jonathan Louis Ferguson
Gateway fabric ports

Patent number: 11477050

Abstract: A gateway for interfacing a host with a subsystem for acting as a work accelerator to the host. The gateway enables the transfer of batches of data to the subsystem at precompiled data exchange synchronisation points. The gateway acts to route data between accelerators which are connected in a scaled system of multiple gateways and accelerators using a global address space set up at compile time of an application to run on the computer system.

Type: Grant

Filed: December 28, 2018

Date of Patent: October 18, 2022

Assignee: Graphcore Limited

Inventors: Ola Tørudbakken, Daniel John Pelham Wilkinson, Brian Manula
Load-store instruction for performing multiple loads, a store, and strided increment of multiple addresses

Patent number: 11467833

Abstract: A processor having an instruction set including a load-store instruction having operands specifying, from amongst the registers in at least one register file, a respective destination of each of two load operations, a respective source of a store operation, and a pair of address registers arranged to hold three memory addresses, the three memory addresses being a respective load address for each of the two load operations and a respective store address for the store operation. The load-store instruction further includes three stride operands each specifying a respective stride value for each of the two load addresses and one store address, wherein at least some possible values of each stride operand specify the respective stride value by specifying one of a plurality of fields within a stride register in one of the one or more register files, each field holding a different stride value.

Type: Grant

Filed: February 15, 2019

Date of Patent: October 11, 2022

Assignee: Graphcore Limited

Inventors: Alan Graham Alexander, Simon Christian Knowles, Mrudula Chidambar Gore
Handling exceptions in a multi-tile processing arrangement

Patent number: 11449338

Abstract: A multi-tile processing system has a plurality of tiles each having an execution unit, and an interconnect operable to conduct communications between a group of the tiles according to a bulk synchronous parallel scheme. The execution unit is operable to execute instructions of an instruction set which has a synchronisation instruction for execution by each tile upon completion of its compute phase. The execution of the synchronisation instruction depends on the state of an exception enable flag. In one state, the synchronisation instruction causes the execution unit to send the synchronisation request to hardware logic in the interconnect. In another state of the exception enable flag the synchronisation instruction does not send the synchronisation request, but sets an exception events status to permit interrogation access to the tile. A corresponding method of controlling the debug states of the processing system is provided.

Type: Grant

Filed: April 26, 2019

Date of Patent: September 20, 2022

Assignee: Graphcore Limited

Inventors: Alan Graham Alexander, Matthew David Fyles
Method of debugging a processor that executes vertices of an application, each vertex being assigned to a programming thread of the processor

Patent number: 11416258

Abstract: A method for debugging a processor which is executing vertices of a software application is described. Each vertex is assigned to a programming thread of the processor. The processor has debug hardware for raising exceptions in certain break conditions. The method comprises inspecting a vertex identifier, comparing the vertex identifier and raising an instruction exception event for the programming thread if the vertex identifier assigned to the thread matches the vertex break identifier in the debug hardware. Exceptions are raised based on identified vertices, rather than just individual instructions or instruction addresses.

Type: Grant

Filed: May 22, 2019

Date of Patent: August 16, 2022

Assignee: Graphcore Limited

Inventors: Alan Graham Alexander, Richard Luke Southwell Osborne, Matthew David Fyles
Execution unit for evaluating functions using newton raphson iterations

Patent number: 11340868

Abstract: An execution unit for a processor, the execution unit comprising: a look up table having a plurality of entries, each of the plurality of entries comprising an initial estimate for a result of an operation; a preparatory circuit configured to search the look up table using an index value dependent upon the operand to locate an entry comprising a first initial estimate for a result of the operation; a plurality of processing circuits comprising at least one multiplier circuit; and control circuitry configured to provide the first initial estimate to the at least one multiplier circuit of the plurality of processing circuits so as perform processing, by the plurality of processing units, of the first initial estimate to generate the function result, said processing comprising applying one or more Newton Raphson iterations to the first initial estimate.

Type: Grant

Filed: April 26, 2019

Date of Patent: May 24, 2022

Assignee: Graphcore Limited

Inventors: Jonathan Mangnall, Stephen Felix
Function approximation

Patent number: 11328015

Abstract: A function approximation system is disclosed for determining output floating point values of functions calculated using floating point numbers. Complex functions have different shapes in different subsets of their input domain, making them difficult to predict for different values of the input variable. The function approximation system comprises an execution unit configured to determine corresponding values of a given function given a floating point input to the function; a plurality of look up tables for each function type; a correction table of values which determines if corrections to the output value are required; and a table selector for finding an appropriate table for a given function.

Type: Grant

Filed: April 26, 2019

Date of Patent: May 10, 2022

Assignee: Graphcore Limited

Inventors: Jonathan Mangnall, Stephen Felix
Instruction set

Patent number: 11321272

Abstract: The invention relates to a computer program comprising a sequence of instructions for execution on a processing unit having instruction storage for holding the computer program, an execution unit for executing the computer program and data storage for holding data, the computer program comprising one or more computer executable instruction which, when executed, implements: a send function which causes a data packet destined for a recipient processing unit to be transmitted on a set of connection wires connected to the processing unit, the data packet having no destination identifier but being transmitted at a predetermined transmit time; and a switch control function which causes the processing unit to control switching circuitry to connect a set of connection wires of the processing unit to a switching fabric to receive a data packet at a predetermined receive time.

Type: Grant

Filed: February 1, 2018

Date of Patent: May 3, 2022

Assignee: Graphcore Limited

Inventors: Simon Christian Knowles, Daniel John Pelham Wilkinson, Richard Luke Southwell Osborne, Alan Graham Alexander, Stephen Felix, Jonathan Mangnall, David Lacey
Pseudo-random number generator

Patent number: 11294635

Abstract: A pseudo random number generator implemented in hardware. The pseudo random number generator comprises a state post processing circuit for processing two state values to produce a random number. The circuit having a first combinatorial logic comprising a XOR or XNOR gate configured to process a first pair of bits from the state values, a second combinatorial logic comprising an OR or AND gate configured to process a second pair of bits from the state value, and third combinatorial logic comprising an OR or AND gate configured or process a third pair of bits from the state value. The circuit has fourth combinatorial logic configured to process the outputs of the first three set of combinatorial logic so as to provide a result bit of the random number. The fourth combinatorial logic comprises an AND or OR gate and a XOR or XNOR gate.

Type: Grant

Filed: April 26, 2019

Date of Patent: April 5, 2022

Assignee: Graphcore Limited

Inventors: Stephen Felix, James William Hanlon
Data exchange pathways between pairs of processing units in columns in a computer

Patent number: 11269806

Abstract: A time deterministic computer is architected so that exchange code compiled for one set of tiles, e.g., a column, can be reused on other sets.

Type: Grant

Filed: May 22, 2019

Date of Patent: March 8, 2022

Assignee: Graphcore Limited

Inventors: Stephen Felix, Simon Christian Knowles
Checkpointing

Patent number: 11263081

Abstract: A system comprising: a first subsystem comprising at least one first processor, and a second subsystem comprising one or more second processors. A first program is arranged to run on the at least one first processor, the first program being configured to send data from the first subsystem to the second subsystem. A second program is arranged to run on the one more second processors, the second program being configured to operate on the data content from the first subsystem. The first program is configured to set a checkpoint at successive points in time. At each checkpoint it records in memory of the first subsystem i) a program state of the second program, comprising a state of one or more registers on each of the second processors at the time of the checkpoint, and ii) a copy of the data content sent to the second subsystem since the respective checkpoint.

Type: Grant

Filed: May 22, 2019

Date of Patent: March 1, 2022

Assignee: Graphcore Limited

Inventors: David Lacey, Daniel John Pelham Wilkinson

1 2 3 4 5 next