Patents by Inventor Michial Allen Gunter
Michial Allen Gunter has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 12147793Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for sharding dataflow graphs for a device having multiple synchronous tiles. One of the methods includes receiving a representation of a dataflow graph comprising a plurality of nodes that each represent respective matrix operations to be performed by a device having a plurality synchronous tiles. Candidate allocations of respective portions of the dataflow graph to each tile of the plurality of synchronous tiles are evaluated according to one or more resource constraints of the device. One of the candidate allocations is selected based on evaluating each candidate allocation.Type: GrantFiled: August 20, 2020Date of Patent: November 19, 2024Assignee: Google LLCInventors: Reiner Pope, Herman Schmit, Michial Allen Gunter
-
Patent number: 12124783Abstract: A method of configuring an integrated circuit including multiple hardware tiles, includes: establishing a data forwarding path through the multiple hardware tiles by configuring each hardware tile, except for a last hardware tile, of the multiple hardware tiles to be in a data forwarding state, in which configuring each hardware tile, except for the last hardware tile, to be in a forwarding state includes installing a respective forwarding state counter specifying a corresponding predefined length of time that the hardware tile is in the data forwarding state; supplying, along the data forwarding path, each hardware tile of the plurality of hardware tiles with a respective program data packet comprising program data for the hardware tile; and installing, for each hardware tile of the multiple hardware tiles, the respective program data.Type: GrantFiled: August 20, 2020Date of Patent: October 22, 2024Assignee: Google LLCInventors: Michial Allen Gunter, Reiner Pope, Pavel Krajcevski, Clifford Biffle
-
Patent number: 12057834Abstract: A tile including circuitry for use with machine learning models, the tile including: a first computational array of cells, in which the computational array of cells is a sub-array of a larger second computational array of cells; local memory coupled to the first computational array of cells; and multiple controllable bus lines, in which a first subset of the multiple controllable bus lines include multiple general purpose controllable bus lines couplable to the local memory.Type: GrantFiled: September 19, 2022Date of Patent: August 6, 2024Assignee: Google LLCInventors: Michial Allen Gunter, Charles Henry Leichner, IV, Tammo Spalink
-
Patent number: 12032511Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for determining, for each pair of adjacent chips in a plurality of chips connected in a series-ring arrangement of a semiconductor device, a corresponding loop latency for round trip data transmissions between the pair of chips. Identifying, from among the loop latencies, a maximum loop latency. Determining a ring latency for a data transmission originating from a chip of the plurality chips to be transmitted around the series-ring arrangement and back to the chip. Comparing half of the maximum loop latency to one N-th of the ring latency, where N is the number of chips in the plurality of chips, and storing the greater value as an inter-chip latency of the semiconductor device, the inter-chip latency representing an operational characteristic of the semiconductor device.Type: GrantFiled: June 21, 2022Date of Patent: July 9, 2024Assignee: Google LLCInventors: Michial Allen Gunter, Denis Baylor, Clifford Biffle, Charles Ross
-
Patent number: 11797074Abstract: Aspects of the disclosure include methods, systems, and apparatus, including computer-readable storage media for multi-mode integrated circuits with balanced energy consumption. A method includes determining, by one or more processors and based at least on a maximum energy threshold for planned multi-mode system having one or more processing units, a respective number of operations that can be performed per clock cycle by the processing units for each operating mode. The system is configured to consume the same amount of energy per clock cycle in each operating mode, but perform more operations in operating modes corresponding to operations performed on smaller bit-width operands.Type: GrantFiled: May 25, 2021Date of Patent: October 24, 2023Assignee: Google LLCInventors: Reiner Alwyn Pope, Michial Allen Gunter, Lukasz Lew
-
Publication number: 20230237007Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for determining, for each pair of adjacent chips in a plurality of chips connected in a series-ring arrangement of a semiconductor device, a corresponding loop latency for round trip data transmissions between the pair of chips. Identifying, from among the loop latencies, a maximum loop latency. Determining a ring latency for a data transmission originating from a chip of the plurality chips to be transmitted around the series-ring arrangement and back to the chip. Comparing half of the maximum loop latency to one N-th of the ring latency, where N is the number of chips in the plurality of chips, and storing the greater value as an inter-chip latency of the semiconductor device, the inter-chip latency representing an operational characteristic of the semiconductor device.Type: ApplicationFiled: March 30, 2023Publication date: July 27, 2023Inventors: Michial Allen Gunter, Denis Baylor, Clifford Biffle, Charles Ross
-
Publication number: 20230195836Abstract: Methods, systems, and apparatus, including computer-readable media, are described for implementing a one-dimensional computational unit in an integrated circuit for a machine-learning (ML) hardware accelerator.Type: ApplicationFiled: December 16, 2022Publication date: June 22, 2023Inventors: Reiner Pope, Charles Ross, Michial Allen Gunter, Wren Romano
-
Patent number: 11652484Abstract: An application specific integrated circuit (ASIC) chip includes: a systolic array of cells; and multiple controllable bus lines configured to convey data among the systolic array of cells, in which the systolic array of cells is arranged in multiple tiles, each tile of the multiple tiles including 1) a corresponding sub array of cells of the systolic array of cells, 2) a corresponding subset of controllable bus lines of the multiple controllable bus lines, and 3) memory coupled to the subarray of cells.Type: GrantFiled: August 9, 2021Date of Patent: May 16, 2023Assignee: Google LLCInventors: Michial Allen Gunter, Charles Henry Leichner, IV, Tammo Spalink
-
Publication number: 20230010315Abstract: A tile including circuitry for use with machine learning models, the tile including: a first computational array of cells, in which the computational array of cells is a sub-array of a larger second computational array of cells; local memory coupled to the first computational array of cells; and multiple controllable bus lines, in which a first subset of the multiple controllable bus lines include multiple general purpose controllable bus lines couplable to the local memory.Type: ApplicationFiled: September 19, 2022Publication date: January 12, 2023Inventors: Michial Allen Gunter, Charles Henry Leichner, IV, Tammo Spalink
-
Publication number: 20220413721Abstract: A method includes: receiving control data at a first data selector of a plurality of data selectors, in which the control data comprises (i) a configuration registry address specifying a location in a configuration state registry and (ii) configuration data specifying a circuit configuration state of a circuit element of a computational circuit; transferring the control data, from the first data selector, to an entry in a trigger table registry; responsive to a first trigger event occurring, transferring the configuration data to the location in the configuration state registry specified by the configuration registry address; and updating a state of the circuit element based on the configuration data.Type: ApplicationFiled: June 28, 2022Publication date: December 29, 2022Inventors: Michial Allen Gunter, Reiner Pope, Brian Foley, Charles Henry Leichner, IV
-
Publication number: 20220391347Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for determining, for each pair of adjacent chips in a plurality of chips connected in a series-ring arrangement of a semiconductor device, a corresponding loop latency for round trip data transmissions between the pair of chips. Identifying, from among the loop latencies, a maximum loop latency. Determining a ring latency for a data transmission originating from a chip of the plurality chips to be transmitted around the series-ring arrangement and back to the chip. Comparing half of the maximum loop latency to one N-th of the ring latency, where N is the number of chips in the plurality of chips, and storing the greater value as an inter-chip latency of the semiconductor device, the inter-chip latency representing an operational characteristic of the semiconductor device.Type: ApplicationFiled: June 21, 2022Publication date: December 8, 2022Inventors: Michial Allen Gunter, Denis Baylor, Clifford Biffle, Charles Ross
-
Publication number: 20220382360Abstract: Aspects of the disclosure include methods, systems, and apparatus, including computer-readable storage media for multi-mode integrated circuits with balanced energy consumption. A method includes determining, by one or more processors and based at least on a maximum energy threshold for planned multi-mode system having one or more processing units, a respective number of operations that can be performed per clock cycle by the processing units for each operating mode. The system is configured to consume the same amount of energy per clock cycle in each operating mode, but perform more operations in operating modes corresponding to operations performed on smaller bit-width operands.Type: ApplicationFiled: May 25, 2021Publication date: December 1, 2022Inventors: Reiner Alwyn Pope, Michial Allen Gunter, Lukasz Lew
-
Publication number: 20220326988Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for obtaining a first schedule, for a first hardware block of an integrated circuit device, where the first schedule identifies a first set of operations to be performed by the first hardware block. Obtaining a second schedule for a second hardware block of the integrated circuit device, where the second schedule identifies a second set of operations to be performed by the second hardware block and where operations of the second schedule are coordinated with operations of the first schedule such that the first schedule triggers the first hardware block to send data to the second block at a first pre-scheduled value of a counter, and the second schedule triggers the second hardware block to accept the data at an input at a second pre-scheduled value of the counter that is after the first pre-scheduled value.Type: ApplicationFiled: August 14, 2020Publication date: October 13, 2022Inventors: Michial Allen Gunter, Charles Henry Leichner, IV
-
Publication number: 20220318638Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for scheduling operations to reduce propagation latency between tiles of an accelerator. One of the methods includes receiving a request to generate a schedule for a first layer of a program to be executed by an accelerator configured to perform matrix operations at least partially in parallel, wherein the program defines a plurality of layers including the first layer, each layer of the program defining matrix operations to be performed using a respective matrix of values. A plurality of initial blocks of the schedule are assigned according to an initial assignment direction. The assignment direction is switched starting at a particular cycle so that blocks processed after the selected particular cycle are processed along a different second dimension of the first matrix. All remaining unassigned blocks are then assigned according to the switched assignment direction.Type: ApplicationFiled: August 20, 2020Publication date: October 6, 2022Inventors: Reiner Pope, Michial Allen Gunter
-
Publication number: 20220300450Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for sharding dataflow graphs for a device having multiple synchronous tiles. One of the methods includes receiving a representation of a dataflow graph comprising a plurality of nodes that each represent respective matrix operations to be performed by a device having a plurality synchronous tiles. Candidate allocations of respective portions of the dataflow graph to each tile of the plurality of synchronous tiles are evaluated according to one or more resource constraints of the device. One of the candidate allocations is selected based on evaluating each candidate allocation.Type: ApplicationFiled: August 20, 2020Publication date: September 22, 2022Inventors: Reiner Pope, Herman Schmit, Michial Allen Gunter
-
Patent number: 11451229Abstract: A tile including circuitry for use with machine learning models, the tile including: a first computational array of cells, in which the computational array of cells is a sub-array of a larger second computational array of cells; local memory coupled to the first computational array of cells; and multiple controllable bus lines, in which a first subset of the multiple controllable bus lines include multiple general purpose controllable bus lines couplable to the local memory.Type: GrantFiled: December 28, 2020Date of Patent: September 20, 2022Assignee: Google LLCInventors: Michial Allen Gunter, Charles Henry Leichner, IV, Tammo Spalink
-
Publication number: 20220277125Abstract: A method of configuring an integrated circuit including multiple hardware tiles, includes: establishing a data forwarding path through the multiple hardware tiles by configuring each hardware tile, except for a last hardware tile, of the multiple hardware tiles to be in a data forwarding state, in which configuring each hardware tile, except for the last hardware tile, to be in a forwarding state includes installing a respective forwarding state counter specifying a corresponding predefined length of time that the hardware tile is in the data forwarding state; supplying, along the data forwarding path, each hardware tile of the plurality of hardware tiles with a respective program data packet comprising program data for the hardware tile; and installing, for each hardware tile of the multiple hardware tiles, the respective program data.Type: ApplicationFiled: August 20, 2020Publication date: September 1, 2022Inventors: Michial Allen Gunter, Reiner Pope, Pavel Krajcevski, Clifford Biffle
-
Patent number: 11372801Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for determining, for each pair of adjacent chips in a plurality of chips connected in a series-ring arrangement of a semiconductor device, a corresponding loop latency for round trip data transmissions between the pair of chips. Identifying, from among the loop latencies, a maximum loop latency. Determining a ring latency for a data transmission originating from a chip of the plurality chips to be transmitted around the series-ring arrangement and back to the chip. Comparing half of the maximum loop latency to one N-th of the ring latency, where N is the number of chips in the plurality of chips, and storing the greater value as an inter-chip latency of the semiconductor device, the inter-chip latency representing an operational characteristic of the semiconductor device.Type: GrantFiled: June 11, 2021Date of Patent: June 28, 2022Assignee: Google LLCInventors: Michial Allen Gunter, Denis Baylor, Clifford Biffle, Charles Ross
-
Publication number: 20210303506Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for determining, for each pair of adjacent chips in a plurality of chips connected in a series-ring arrangement of a semiconductor device, a corresponding loop latency for round trip data transmissions between the pair of chips. Identifying, from among the loop latencies, a maximum loop latency. Determining a ring latency for a data transmission originating from a chip of the plurality chips to be transmitted around the series-ring arrangement and back to the chip. Comparing half of the maximum loop latency to one N-th of the ring latency, where N is the number of chips in the plurality of chips, and storing the greater value as an inter-chip latency of the semiconductor device, the inter-chip latency representing an operational characteristic of the semiconductor device.Type: ApplicationFiled: June 11, 2021Publication date: September 30, 2021Inventors: Michial Allen Gunter, Denis Baylor, Clifford Biffle, Charles Ross
-
Patent number: 11088694Abstract: An application specific integrated circuit (ASIC) chip includes: a systolic array of cells; and multiple controllable bus lines configured to convey data among the systolic array of cells, in which the systolic array of cells is arranged in multiple tiles, each tile of the multiple tiles including 1) a corresponding subarray of cells of the systolic array of cells, 2) a corresponding subset of controllable bus lines of the multiple controllable bus lines, and 3) memory coupled to the subarray of cells.Type: GrantFiled: March 23, 2020Date of Patent: August 10, 2021Assignee: X Development LLCInventors: Michial Allen Gunter, Charles Henry Leichner, IV, Tammo Spalink