Patents by Inventor Kenneth Duong
Kenneth Duong has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Publication number: 20250252299
Abstract: Some embodiments provide a three-dimensional (3D) circuit structure that has two or more vertically stacked bonded layers with a machine-trained network on at least one bonded layer. Each bonded layer can be an IC die or an IC wafer in some embodiments, with different embodiments encompassing different combinations of wafers and dies for the different bonded layers. The machine-trained network in some embodiments includes several stages of machine-trained processing nodes with routing fabric that supplies the outputs of earlier-stage nodes to drive the inputs of later-stage nodes. In some embodiments, the machine-trained network is a neural network and the processing nodes are neurons of the neural network. In some embodiments, one or more parameters associated with each processing node (e.g., each neuron) are defined through machine-trained processes that define the values of these parameters in order to allow the machine-trained network (e.g., …)
Type: Application
Filed: February 7, 2025
Publication date: August 7, 2025
Inventors: Steven L. Teig, Kenneth Duong
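The stacked structure described here amounts to a staged feedforward network split across bonded layers. Below is a minimal Python sketch of that data flow; all class and function names are hypothetical, since the patent describes hardware rather than software:

```python
import numpy as np

# Hypothetical software model of the hardware in the abstract: each bonded
# layer (die or wafer) hosts some stages of processing nodes, and the
# routing fabric is modeled as plain data flow between stages.
class Stage:
    def __init__(self, weights, biases):
        self.weights = weights   # machine-trained parameters of the nodes
        self.biases = biases

    def forward(self, x):
        # Each processing node (neuron) applies its trained parameters.
        return np.maximum(0.0, self.weights @ x + self.biases)

def run_stack(bonded_layers, x):
    # Outputs of earlier-stage nodes drive the inputs of later-stage nodes,
    # in stacking order across the bonded layers.
    for stages in bonded_layers:      # one list of stages per bonded layer
        for stage in stages:
            x = stage.forward(x)
    return x
```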
-
Publication number: 20250238484
Abstract: Some embodiments provide an IC for implementing a machine-trained network with multiple layers. The IC includes a set of circuits to compute a dot product of (i) a first number of input values computed by other circuits of the IC and (ii) a set of predefined weight values, several of which are zero, with a weight value for each of the input values. The set of circuits includes (i) a dot product computation circuit to compute the dot product based on a second number of inputs and (ii) for each input value, at least two sets of wires for providing the input value to at least two of the dot product computation circuit inputs. The second number is less than the first number. Each input value with a corresponding weight value that is not equal to zero is provided to a different one of the dot product computation circuit inputs.
Type: Application
Filed: April 8, 2025
Publication date: July 24, 2025
Inventors: Kenneth Duong, Jung Ko, Steven L. Teig
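Because many of the predefined weights are zero, only the inputs with nonzero weights need to reach the dot product hardware; the redundant wires determine which circuit input each one drives. A rough Python analogue of that selection (illustrative only; the patent claims a hardware circuit, and the function name is made up):

```python
# With most weights fixed at zero after training, a dot product circuit with
# fewer inputs than there are input values suffices: route each input whose
# weight is nonzero to its own circuit input and multiply only those.
def sparse_dot_product(inputs, weights, num_circuit_inputs):
    routed = [(x, w) for x, w in zip(inputs, weights) if w != 0]
    assert len(routed) <= num_circuit_inputs, "too many nonzero weights to route"
    return sum(x * w for x, w in routed)

# 8 input values, but only 4 nonzero weights -> a 4-input circuit suffices.
xs = [3, 1, 4, 1, 5, 9, 2, 6]
ws = [0, 2, 0, 0, 1, 0, 3, 1]
print(sparse_dot_product(xs, ws, num_circuit_inputs=4))  # 2 + 5 + 6 + 6 = 19
```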
-
Patent number: 12299068
Abstract: Some embodiments provide an IC for implementing a machine-trained network with multiple layers. The IC includes a set of circuits to compute a dot product of (i) a first number of input values computed by other circuits of the IC and (ii) a set of predefined weight values, several of which are zero, with a weight value for each of the input values. The set of circuits includes (i) a dot product computation circuit to compute the dot product based on a second number of inputs and (ii) for each input value, at least two sets of wires for providing the input value to at least two of the dot product computation circuit inputs. The second number is less than the first number. Each input value with a corresponding weight value that is not equal to zero is provided to a different one of the dot product computation circuit inputs.
Type: Grant
Filed: October 27, 2023
Date of Patent: May 13, 2025
Assignee: Amazon Technologies, Inc.
Inventors: Kenneth Duong, Jung Ko, Steven L. Teig
-
Patent number: 12265905
Abstract: Some embodiments provide a method for a circuit that executes a neural network including multiple nodes. The method loads a set of weight values for a node into a set of weight value buffers, a first set of bits of each input value of a set of input values for the node into a first set of input value buffers, and a second set of bits of each of the input values into a second set of input value buffers. The method computes a first dot product of the weight values and the first set of bits of each input value and a second dot product of the weight values and the second set of bits of each input value. The method shifts the second dot product by a particular number of bits and adds the first dot product to the bit-shifted second dot product to compute a dot product for the node.
Type: Grant
Filed: November 9, 2022
Date of Patent: April 1, 2025
Assignee: Amazon Technologies, Inc.
Inventors: Jung Ko, Kenneth Duong, Steven L. Teig
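The recombination step is ordinary positional arithmetic: if each input is split into low and high bit groups, the full dot product equals the low-bit dot product plus the high-bit dot product shifted left by the split width. A small Python check of that identity (the 4-bit split is an arbitrary choice for illustration):

```python
# Split each input into two bit groups, take two smaller dot products with
# the same weights, then shift-and-add to recover the full dot product.
def split_dot_product(inputs, weights, low_bits=4):
    mask = (1 << low_bits) - 1
    low  = [x & mask      for x in inputs]   # first set of bits
    high = [x >> low_bits for x in inputs]   # second set of bits
    dp_low  = sum(x * w for x, w in zip(low,  weights))
    dp_high = sum(x * w for x, w in zip(high, weights))
    # Shift the second dot product back into place and add the first.
    return dp_low + (dp_high << low_bits)

xs = [200, 17, 90]
ws = [3, 5, 7]
assert split_dot_product(xs, ws) == sum(x * w for x, w in zip(xs, ws))
```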
-
Publication number: 20250103341
Abstract: Some embodiments provide a neural network inference circuit (NNIC) for executing a neural network that includes multiple computation nodes at multiple layers. The NNIC includes multiple core circuits including memories for storing input values for the computation nodes. The NNIC includes a set of post-processing circuits for computing output values of the computation nodes. The output values for a first layer are for storage in the core circuits as input values for a second layer. The NNIC includes an output bus that connects the post-processing circuits to the core circuits. The output bus is for (i) receiving a set of output values from the post-processing circuits, (ii) transporting the output values of the set to the core circuits based on configuration data specifying a core circuit at which each of the output values is to be stored, and (iii) aligning the output values for storage in the core circuits.
Type: Application
Filed: September 4, 2024
Publication date: March 27, 2025
Inventors: Kenneth Duong, Jung Ko, Steven L. Teig
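In software terms, the output bus behaves like a configurable scatter: configuration data names the core at which each output value is stored, so the values land aligned as inputs for the next layer. A toy Python model of that transport (all names hypothetical):

```python
# Configuration data maps each output value's index to a destination core;
# the "bus" gathers post-processing outputs and scatters them so each core's
# memory holds the values it needs as next-layer inputs.
def transport_outputs(outputs, dest_core_for, core_memories):
    for idx, value in enumerate(outputs):
        core = dest_core_for[idx]           # from configuration data
        core_memories[core].append(value)   # aligned store at that core

core_mems = {0: [], 1: []}
transport_outputs([10, 20, 30, 40], {0: 0, 1: 1, 2: 0, 3: 1}, core_mems)
print(core_mems)  # {0: [10, 30], 1: [20, 40]}
```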
-
Patent number: 12248869
Abstract: Some embodiments provide a three-dimensional (3D) circuit structure that has two or more vertically stacked bonded layers with a machine-trained network on at least one bonded layer. Each bonded layer can be an IC die or an IC wafer in some embodiments, with different embodiments encompassing different combinations of wafers and dies for the different bonded layers. The machine-trained network in some embodiments includes several stages of machine-trained processing nodes with routing fabric that supplies the outputs of earlier-stage nodes to drive the inputs of later-stage nodes. In some embodiments, the machine-trained network is a neural network and the processing nodes are neurons of the neural network. In some embodiments, one or more parameters associated with each processing node (e.g., each neuron) are defined through machine-trained processes that define the values of these parameters in order to allow the machine-trained network (e.g., …)
Type: Grant
Filed: September 19, 2023
Date of Patent: March 11, 2025
Assignee: Adeia Semiconductor Inc.
Inventors: Steven L. Teig, Kenneth Duong
-
Patent number: 12217160
Abstract: Some embodiments provide a method that receives a specification of a neural network for execution by an integrated circuit. The integrated circuit includes a neural network inference circuit for executing the neural network to generate an output based on an input, an input processing circuit for providing the input to the neural network inference circuit, a microprocessor circuit for controlling the neural network inference circuit and the input processing circuit, and a unified memory accessible by the microprocessor circuit, the neural network inference circuit, and the input processing circuit. The method determines usage of the unified memory by the neural network inference circuit while executing the neural network. Based on the determined usage by the neural network inference circuit, the method allocates portions of the unified memory to the microprocessor circuit and input processing circuit.
Type: Grant
Filed: May 3, 2021
Date of Patent: February 4, 2025
Assignee: Amazon Technologies, Inc.
Inventors: Jung Ko, Kenneth Duong, Steven L. Teig, Won Rhee
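The allocation idea can be paraphrased as: size the inference circuit's share of the unified memory from its measured peak usage, then divide the remainder between the microprocessor and the input processing circuit. A hedged Python sketch under that reading (the even split between the two remaining clients is my assumption, not something the abstract specifies):

```python
# Determine the inference circuit's peak unified-memory usage across the
# network's layers, reserve that much, and allocate the rest to the
# microprocessor and input processing circuits.
def allocate_unified_memory(total_bytes, per_layer_usage, cpu_fraction=0.5):
    nnic_bytes = max(per_layer_usage)            # peak inference-circuit need
    remaining  = total_bytes - nnic_bytes
    assert remaining >= 0, "network does not fit in unified memory"
    cpu_bytes   = int(remaining * cpu_fraction)  # split ratio is an assumption
    input_bytes = remaining - cpu_bytes
    return {"nnic": nnic_bytes, "cpu": cpu_bytes, "input": input_bytes}

print(allocate_unified_memory(2**20, per_layer_usage=[300_000, 512_000, 420_000]))
```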
-
Patent number: 12190230
Abstract: Some embodiments provide a neural network inference circuit (NNIC) for executing a neural network that includes multiple computation nodes at multiple layers. The NNIC includes a set of clusters of core computation circuits and a channel, connecting the core computation circuits, that includes separate segments corresponding to each of the clusters. The NNIC includes a fabric controller circuit, a cluster controller circuit for each of the clusters, and a core controller circuit for each of the core computation circuits. The fabric controller circuit receives high-level neural network instructions from a microprocessor and parses the high-level neural network instructions.
Type: Grant
Filed: November 7, 2022
Date of Patent: January 7, 2025
Assignee: Amazon Technologies, Inc.
Inventors: Kenneth Duong, Jung Ko, Steven L. Teig
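The control hierarchy amounts to staged fan-out of instructions: the fabric controller parses each high-level instruction into per-cluster work, and cluster controllers break that down per core. A schematic Python sketch of that fan-out (the instruction fields and names are invented for illustration):

```python
# Fabric controller: parse one high-level instruction from the
# microprocessor into a sub-instruction for each cluster.
def fabric_parse(instruction, num_clusters):
    return [{"cluster": c, **instruction} for c in range(num_clusters)]

# Cluster controller: break a cluster's sub-instruction down per core.
def cluster_parse(sub_instruction, cores_in_cluster):
    return [{"core": core, **sub_instruction} for core in cores_in_cluster]

for sub in fabric_parse({"op": "conv_layer", "layer": 3}, num_clusters=2):
    print(cluster_parse(sub, cores_in_cluster=[0, 1]))
```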
-
Patent number: 12165043
Abstract: Some embodiments provide a neural network inference circuit for executing a neural network that includes multiple layers of computation nodes. At least a subset of the layers include non-convolutional layers. The neural network inference circuit includes multiple cores with memories that store input values for the layers. The cores are grouped into multiple clusters. For each cluster, the neural network inference circuit includes a set of processing circuits for receiving input values from the cores of the cluster and executing the computation nodes of the non-convolutional layers.
Type: Grant
Filed: October 8, 2023
Date of Patent: December 10, 2024
Assignee: Amazon Technologies, Inc.
Inventors: Jung Ko, Kenneth Duong, Steven L. Teig
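As a rough software analogue of this organization, each cluster's processing circuits read values from that cluster's own cores and execute a non-convolutional node on them; the max-pooling choice below is only one example of such a node:

```python
# Cores hold the layer's input values; a cluster's processing circuits
# gather values from the cluster's cores and run a non-convolutional
# computation node (here, max pooling) on them.
def max_pool_cluster(cluster_cores, pool_size):
    values = [v for core in cluster_cores for v in core]
    return [max(values[i:i + pool_size])
            for i in range(0, len(values), pool_size)]

cores = [[1, 5], [3, 2]]                      # two cores in one cluster
print(max_pool_cluster(cores, pool_size=2))   # [5, 3]
```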
-
Patent number: 12159214
Abstract: Some embodiments provide a method for executing a neural network. The method writes a first input to a first set of physical memory banks in a unified memory shared by an input processing circuit and a neural network inference circuit that executes the neural network. While the neural network inference circuit is executing the network a first time to generate a first output for the first input, the method writes a second input to a second set of physical memory banks in the unified memory. The neural network inference circuit executes a same set of instructions to read the first input from the first set of memory banks in order to execute the network the first time and to read the second input from the second set of memory banks in order to execute the network a second time to generate a second output for the second input.
Type: Grant
Filed: May 3, 2021
Date of Patent: December 3, 2024
Assignee: Perceive Corporation
Inventors: Jung Ko, Kenneth Duong, Steven L. Teig, Won Rhee
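This is classic double buffering at the level of physical memory banks: two bank sets alternate roles as write target and read source, so the inference circuit's instruction stream never changes. A minimal Python sketch of the ping-pong scheme (class and method names are hypothetical):

```python
# Two sets of physical banks back the same logical input region, so the
# same instructions can execute the network on input N while input N+1
# is being written to the other bank set.
class PingPongInputMemory:
    def __init__(self):
        self.banks = [[], []]   # two sets of physical memory banks
        self.write_set = 0      # where the input circuit writes next

    def write_input(self, data):
        self.banks[self.write_set] = list(data)
        self.write_set ^= 1     # flip roles for the next input

    def read_for_execution(self):
        # The inference circuit reads the set filled last; the bank-set
        # swap is invisible to its (unchanged) instruction stream.
        return self.banks[self.write_set ^ 1]

mem = PingPongInputMemory()
mem.write_input([1, 2, 3])         # input 1 -> bank set 0
print(mem.read_for_execution())    # network executes on input 1
mem.write_input([4, 5, 6])         # input 2 -> bank set 1, while executing
```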
-
Publication number: 20240361824
Abstract: For a neural network inference circuit that executes a neural network including multiple computation nodes at multiple layers for which data is stored in a plurality of memory banks, some embodiments provide a method for dynamically putting memory banks into a sleep mode of operation to conserve power. The method tracks the accesses to individual memory banks and, if a certain number of clock cycles elapse with no access to a particular memory bank, sends a signal to the memory bank indicating that it should operate in a sleep mode. Circuit components involved in dynamic memory sleep, in some embodiments, include a core RAM pipeline, a core RAM sleep controller, a set of core RAM bank select decoders, and a set of core RAM memory bank wrappers.
Type: Application
Filed: March 4, 2024
Publication date: October 31, 2024
Inventors: Jung Ko, Kenneth Duong, Steven L. Teig
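The tracking logic reduces to a per-bank idle counter that resets on access and triggers a sleep signal past a threshold. A toy Python model of that behavior (the names are illustrative, not the actual core RAM sleep controller interface):

```python
# Per-bank idle counters advance every clock cycle and reset on access;
# once a counter passes the threshold, that bank is signaled to sleep.
class BankSleepController:
    def __init__(self, num_banks, idle_threshold):
        self.idle = [0] * num_banks
        self.asleep = [False] * num_banks
        self.threshold = idle_threshold

    def tick(self, accessed_banks):
        for bank in range(len(self.idle)):
            if bank in accessed_banks:
                self.idle[bank] = 0
                self.asleep[bank] = False   # wake on access
            else:
                self.idle[bank] += 1
                if self.idle[bank] >= self.threshold:
                    self.asleep[bank] = True

ctrl = BankSleepController(num_banks=4, idle_threshold=3)
for cycle in range(5):
    ctrl.tick(accessed_banks={0})    # only bank 0 is touched
print(ctrl.asleep)                   # [False, True, True, True]
```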
-
Patent number: 12118463
Abstract: Some embodiments provide a method for a neural network inference circuit that executes a neural network including multiple computation nodes at multiple layers. Each computation node of a set of the computation nodes includes a dot product of input values and weight values. The method reads a set of encoded weight data for a set of weight values from a memory of the neural network inference circuit. The method decodes the encoded weight data to generate decoded weight data for the set of weight values. The method stores the decoded weight data in a buffer. The method uses the decoded weight data to execute a set of computation nodes. Each computation node of the set of computation nodes includes a dot product between the set of weight values and a different set of input values.
Type: Grant
Filed: December 14, 2021
Date of Patent: October 15, 2024
Assignee: Perceive Corporation
Inventors: Kenneth Duong, Jung Ko, Steven L. Teig
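The point of decoding into a buffer is amortization: the encoded weights are read and decoded once, and the decoded values then serve every dot product that reuses them (e.g., one filter applied at many input positions). The abstract does not specify the encoding, so the run-length scheme below is only a stand-in:

```python
# Decode encoded weight data once into a buffer, then reuse the decoded
# weights for multiple dot products over different input sets.
def decode_weights(encoded):
    decoded = []
    for value, run_length in encoded:   # assumed (value, count) pairs
        decoded.extend([value] * run_length)
    return decoded

weight_buffer = decode_weights([(0, 3), (2, 1), (-1, 2)])  # -> [0,0,0,2,-1,-1]
for inputs in ([1, 2, 3, 4, 5, 6], [6, 5, 4, 3, 2, 1]):
    print(sum(w * x for w, x in zip(weight_buffer, inputs)))
```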
-
Patent number: 12093696
Abstract: Some embodiments provide a neural network inference circuit (NNIC) for executing a neural network that includes multiple computation nodes at multiple layers. The NNIC includes multiple core circuits including memories for storing input values for the computation nodes. The NNIC includes a set of post-processing circuits for computing output values of the computation nodes. The output values for a first layer are for storage in the core circuits as input values for a second layer. The NNIC includes an output bus that connects the post-processing circuits to the core circuits. The output bus is for (i) receiving a set of output values from the post-processing circuits, (ii) transporting the output values of the set to the core circuits based on configuration data specifying a core circuit at which each of the output values is to be stored, and (iii) aligning the output values for storage in the core circuits.
Type: Grant
Filed: August 9, 2019
Date of Patent: September 17, 2024
Assignee: Perceive Corporation
Inventors: Kenneth Duong, Jung Ko, Steven L. Teig
-
Publication number: 20240152743
Abstract: Some embodiments provide a three-dimensional (3D) circuit structure that has two or more vertically stacked bonded layers with a machine-trained network on at least one bonded layer. Each bonded layer can be an IC die or an IC wafer in some embodiments, with different embodiments encompassing different combinations of wafers and dies for the different bonded layers. The machine-trained network in some embodiments includes several stages of machine-trained processing nodes with routing fabric that supplies the outputs of earlier-stage nodes to drive the inputs of later-stage nodes. In some embodiments, the machine-trained network is a neural network and the processing nodes are neurons of the neural network. In some embodiments, one or more parameters associated with each processing node (e.g., each neuron) are defined through machine-trained processes that define the values of these parameters in order to allow the machine-trained network (e.g., …)
Type: Application
Filed: September 19, 2023
Publication date: May 9, 2024
Inventors: Steven L. Teig, Kenneth Duong
-
Patent number: 11921561
Abstract: For a neural network inference circuit that executes a neural network including multiple computation nodes at multiple layers for which data is stored in a plurality of memory banks, some embodiments provide a method for dynamically putting memory banks into a sleep mode of operation to conserve power. The method tracks the accesses to individual memory banks and, if a certain number of clock cycles elapse with no access to a particular memory bank, sends a signal to the memory bank indicating that it should operate in a sleep mode. Circuit components involved in dynamic memory sleep, in some embodiments, include a core RAM pipeline, a core RAM sleep controller, a set of core RAM bank select decoders, and a set of core RAM memory bank wrappers.
Type: Grant
Filed: May 27, 2022
Date of Patent: March 5, 2024
Assignee: Perceive Corporation
Inventors: Jung Ko, Kenneth Duong, Steven L. Teig
-
Publication number: 20240070225
Abstract: Some embodiments provide an IC for implementing a machine-trained network with multiple layers. The IC includes a set of circuits to compute a dot product of (i) a first number of input values computed by other circuits of the IC and (ii) a set of predefined weight values, several of which are zero, with a weight value for each of the input values. The set of circuits includes (i) a dot product computation circuit to compute the dot product based on a second number of inputs and (ii) for each input value, at least two sets of wires for providing the input value to at least two of the dot product computation circuit inputs. The second number is less than the first number. Each input value with a corresponding weight value that is not equal to zero is provided to a different one of the dot product computation circuit inputs.
Type: Application
Filed: October 27, 2023
Publication date: February 29, 2024
Inventors: Kenneth Duong, Jung Ko, Steven L. Teig
-
Publication number: 20240062054
Abstract: Some embodiments provide a method for a neural network inference circuit that executes a neural network. The method loads a first set of inputs into an input buffer and computes a first dot product between the first set of inputs and a set of weights. The method shifts the first set of inputs in the buffer while loading a second set of inputs into the buffer such that a first subset of the first set of inputs is removed from the buffer, a second subset of the first set of inputs is moved to new locations in the buffer, and a second set of inputs is loaded into locations in the buffer vacated by the shifting. The method computes a second dot product between (i) the second set of inputs and the second subset of the first set of inputs and (ii) the set of weights.
Type: Application
Filed: October 27, 2023
Publication date: February 22, 2024
Inventors: Kenneth Duong, Jung Ko, Steven L. Teig
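The shift pattern matches a strided sliding window: the oldest inputs fall out of the buffer, the overlapping inputs slide to new locations, and only the new positions are loaded, so overlapping values need not be reloaded. A small Python sketch with an assumed window of 4 and stride of 2:

```python
# Shift the input buffer for the next window position: drop the oldest
# `stride` inputs, keep the overlap, append the freshly loaded inputs.
def shift_and_load(buffer, new_inputs, stride):
    kept = buffer[stride:]    # second subset: moved to new locations
    return kept + new_inputs  # fresh inputs fill the vacated slots

weights = [1, -1, 2, 0]
window  = [5, 6, 7, 8]                               # first set of inputs
dp1 = sum(w * x for w, x in zip(weights, window))    # first dot product
window = shift_and_load(window, [9, 10], stride=2)   # -> [7, 8, 9, 10]
dp2 = sum(w * x for w, x in zip(weights, window))    # second dot product
print(dp1, dp2)  # 13 17
```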
-
Publication number: 20240046081
Abstract: Some embodiments provide a neural network inference circuit for executing a neural network that includes multiple layers of computation nodes. At least a subset of the layers include non-convolutional layers. The neural network inference circuit includes multiple cores with memories that store input values for the layers. The cores are grouped into multiple clusters. For each cluster, the neural network inference circuit includes a set of processing circuits for receiving input values from the cores of the cluster and executing the computation nodes of the non-convolutional layers.
Type: Application
Filed: October 8, 2023
Publication date: February 8, 2024
Inventors: Jung Ko, Kenneth Duong, Steven L. Teig
-
Patent number: 11886979
Abstract: Some embodiments provide a method for a neural network inference circuit that executes a neural network. The method loads a first set of inputs into an input buffer and computes a first dot product between the first set of inputs and a set of weights. The method shifts the first set of inputs in the buffer while loading a second set of inputs into the buffer such that a first subset of the first set of inputs is removed from the buffer, a second subset of the first set of inputs is moved to new locations in the buffer, and a second set of inputs is loaded into locations in the buffer vacated by the shifting. The method computes a second dot product between (i) the second set of inputs and the second subset of the first set of inputs and (ii) the set of weights.
Type: Grant
Filed: March 15, 2019
Date of Patent: January 30, 2024
Assignee: Perceive Corporation
Inventors: Kenneth Duong, Jung Ko, Steven L. Teig
-
Patent number: 11809515
Abstract: Some embodiments provide an IC for implementing a machine-trained network with multiple layers. The IC includes a set of circuits to compute a dot product of (i) a first number of input values computed by other circuits of the IC and (ii) a set of predefined weight values, several of which are zero, with a weight value for each of the input values. The set of circuits includes (i) a dot product computation circuit to compute the dot product based on a second number of inputs and (ii) for each input value, at least two sets of wires for providing the input value to at least two of the dot product computation circuit inputs. The second number is less than the first number. Each input value with a corresponding weight value that is not equal to zero is provided to a different one of the dot product computation circuit inputs.
Type: Grant
Filed: May 10, 2021
Date of Patent: November 7, 2023
Assignee: Perceive Corporation
Inventors: Kenneth Duong, Jung Ko, Steven L. Teig