Patents Examined by Tan V. Mai

Temperature-based adjustments for in-memory matrix multiplication

Patent number: 11507641

Abstract: Techniques for performing in-memory matrix multiplication, taking into account temperature variations in the memory, are disclosed. In one example, the matrix multiplication memory uses ohmic multiplication and current summing to perform the dot products involved in matrix multiplication. One downside to this analog form of multiplication is that temperature affects the accuracy of the results. Thus techniques are provided herein to compensate for the effects of temperature increases on the accuracy of in-memory matrix multiplications. According to the techniques, portions of input matrices are classified as effective or ineffective. Effective portions are mapped to low temperature regions of the in-memory matrix multiplier and ineffective portions are mapped to high temperature regions of the in-memory matrix multiplier. The matrix multiplication is then performed.

Type: Grant

Filed: May 31, 2019

Date of Patent: November 22, 2022

Assignee: Advanced Micro Devices, Inc.

Inventors: Majed Valad Beigi, Amin Farmahini-Farahani, Sudhanva Gurumurthi
Full adder, chip and computing device

Patent number: 11507347

Abstract: Full adder, a chip and a computing device are disclosed. A full adder includes: a plurality of primary logic cells and at least one secondary logic cell, wherein an output terminal of each primary logic cell is at least connected to an input terminal of a first secondary logic cell in the at least one secondary logic cell. The plurality of primary logic cells includes: a first primary logic cell, a second primary logic cell and a third primary logic cell respectively configured to generate a first intermediate signal, a second intermediate signal and a carry-related signal based on a first input signal, a second input signal and a carry input signal input to the full adder. Furthermore, the first secondary logic cell is configured to generate a sum output signal of the full adder based on the first intermediate signal, the second intermediate signal and the carry-related signal.

Type: Grant

Filed: May 14, 2021

Date of Patent: November 22, 2022

Assignee: SHENZHEN MICROBT ELECTRONICS TECHNOLOGY CO., LTD.

Inventors: Zhijun Fan, Weixin Kong, Dong Yu, Zuoxing Yang
Multiple output fusion for operations performed in a multi-dimensional array of processing units

Patent number: 11500959

Abstract: Methods, systems, and apparatus, including instructions encoded on storage media, for performing reduction of gradient vectors and similarly structured data that are generated in parallel, for example, on nodes organized in a mesh or torus topology defined by connections in at least two dimension between the nodes. The methods provide parallel computation and communication between nodes in the topology.

Type: Grant

Filed: August 16, 2019

Date of Patent: November 15, 2022

Assignee: Google LLC

Inventors: David Alexander Majnemer, Blake Alan Hechtman
Low latency matrix multiply unit

Patent number: 11500961

Abstract: Methods, systems, and apparatus for a matrix multiply unit implemented as a systolic array of cells are disclosed. The matrix multiply unit may include cells arranged in columns of the systolic array. Two chains of weight shift registers per column of the systolic array are in the matrix multiply unit. Each weight shift register is connected to only one chain and each cell is connected to only one weight shift register. A weight matrix register per cell is configured to store a weight input received from a weight shift register. A multiply unit is coupled to the weight matrix register and configured to multiply the weight input of the weight matrix register with a vector data input in order to obtain a multiplication result.

Type: Grant

Filed: March 26, 2020

Date of Patent: November 15, 2022

Assignee: Google LLC

Inventors: Andrew Everett Phelps, Norman Paul Jouppi
Emulating fine-grained sparsity in a systolic array

Patent number: 11500962

Abstract: To take advantage of the architecture of a systolic array tailored to perform sparse matrix multiplications, a weight matrix can be converted into a set of constrained fine-grained sparse weight matrices. The conversion process may include receiving a request to perform a matrix multiplication operation with a weight matrix, and determining that the weight matrix satisfies a sparsity condition to convert the weight matrix into a set of constrained fine-grained sparse weight matrices. The weight matrix can then be converted into a set of constrained fine-grained sparse weight matrices. Computer instructions can then be generated for an integrated circuit device to perform the requested matrix multiplication operation as a set of sparse matrix multiplication operations using the set of constrained fine-grained sparse weight matrices.

Type: Grant

Filed: June 30, 2020

Date of Patent: November 15, 2022

Assignee: Amazon Technologies, Inc.

Inventors: Paul Gilbert Meyer, Thiam Khean Hah, Randy Renfu Huang, Ron Diamant, Vignesh Vivekraja
Set operations using multi-core processing unit

Patent number: 11494463

Abstract: Performing set operations using sparse matrix operations offered by a multi-core processing unit (such as a graphics processing unit). The set operation is converted into operand matrices, and sparse matrix operations, foregoing the use of hash tables. The input set is converted into a matrix, a matrix operation corresponding to the set operation is identified, and one or more operands of the set operation are also represented within a matrix. The matrix operation is then performed on these matrices to obtain an output matrix, which is then converted to an output set.

Type: Grant

Filed: April 14, 2020

Date of Patent: November 8, 2022

Assignee: Microsoft Technology Licensing, LLC

Inventor: Ritwik Das
Analog vector-matrix multiplication by capacitive elements with resistive state storage

Patent number: 11494464

Abstract: An array circuit includes a plurality of vector-matrix multiplication (VMM) elements arranged in rows and columns. The VMM elements are configured to collectively perform multiplication of an input vector by a programmed input matrix to generate a plurality of output values that are representative of a result matrix that is the result of multiplication of the input vector and the input matrix. The VMM elements store states of the input matrix. Input voltages to the array are representative of elements of the input vector. A VMM element draws charge from a column read line based upon charging of a capacitor in the VMM. An integrator circuit connected to the column read line outputs a voltage that is indicative of a total charge drawn from the column read line by elements connected to the read line, which voltage is further indicative of an element of a result matrix.

Type: Grant

Filed: September 24, 2020

Date of Patent: November 8, 2022

Assignee: National Technology & Engineering Solutions of Sandia, LLC

Inventors: Sapan Agarwal, Matthew Marinella
Classifying business summaries against a hierarchical industry classification structure using supervised machine learning

Patent number: 11487991

Abstract: A classification system is provided for classifying text-based business summaries, referred to herein as “summaries,” against a hierarchical industry classification structure. The classification system includes a word-based sub classifier that uses a neural network to generate a vector space for each summary in a training set, where each summary in the training set is known to correspond to a particular industry classification in the hierarchical industry classification structure. Weight values in the hidden layer of a neural network used by the word-based sub classifier are changed to improve the predictive capabilities of the neural network in the business summary classification context. Embodiments include increasing representation in the training set for underrepresented parent industry classifications and attributes of the hierarchical industry classification structure, such as distances between industry classifications and whether industry classifications are in the same subgraph.

Type: Grant

Filed: September 4, 2019

Date of Patent: November 1, 2022

Assignee: THE DUN AND BRADSTREET CORPORATION

Inventor: Nikita Zhiltsov
Digital filter with programmable impulse response for direct amplitude modulation at radio frequency

Patent number: 11481224

Abstract: A digital filter according to the disclosure includes a processing circuit having a memory and a number of parallel processing circuits. The parallel processing circuits perform a convolution operations based on input data and function data that is accessed from the memory. The filter further includes a serializer for serializing data that is received from the processing circuits. A clock generator circuit provides a first clock signal to the processing circuit and a second clock signal to the serializer. The frequency of the second clock signal is greater than that of the first clock signal.

Type: Grant

Filed: August 30, 2019

Date of Patent: October 25, 2022

Assignee: Apple Inc.

Inventors: Tao Mai, Robert G. Lorenz, Joachim S. Hammerschmidt, Utku Seckin
Integer matrix multiplication engine using pipelining

Patent number: 11481472

Abstract: Techniques for data manipulation using integer matrix multiplication using pipelining are disclosed. A first integer matrix with dimensions m×k and a second integer matrix with dimensions k×n are obtained for matrix multiplication within a processor. The first and second integer matrices employ a two's complement variable radix point data representation. The first and second integer matrices are distilled into (j×j) submatrices. A first variable radix point format and an initial value for an accumulator register are configured dynamically. A first variable radix point format is configured dynamically for the first integer matrix and a second variable radix point format is configured dynamically for the second integer matrix. Multiply-accumulate operations are executed in a pipelined fashion on the (j×j) submatrices of the first integer matrix and the second integer matrix, where a third variable radix point format is configured for the result.

Type: Grant

Filed: July 30, 2020

Date of Patent: October 25, 2022

Inventor: David John Simpson
Mapping convolution to a matrix processor unit

Patent number: 11481471

Abstract: A system comprises a matrix processor unit that includes a first type of register, a group of a second type of registers, and a plurality of calculation units. The first type of register is configured to concurrently store values from different rows of a first matrix. At least a portion of the first type of register is logically divided into groups of elements, and each of the groups corresponds to a different row of the first matrix. Each of the second type of registers is configured to concurrently store values from a plurality of different rows of a second matrix. Each of the calculation units corresponds to one of the second type of registers and is configured to at least in part determine a corresponding element in a result matrix of convoluting the second matrix with the first matrix.

Type: Grant

Filed: August 16, 2019

Date of Patent: October 25, 2022

Assignee: Meta Platforms, Inc.

Inventors: Krishnakumar Nair, Abdulkadir Utku Diril, Dheevatsa Mudigere, Olivia Wu, Ehsan Khish Ardestani Zadeh, Yuchen Hao
Secret table reference system, method, secret calculation apparatus and program

Patent number: 11480991

Abstract: A secure table reference system includes a first combining part 11n for generating [v?] of v? ? Fm+nt in which d and v are combined, a difference calculation part 12n for generating [r?] of r? that has a difference between a certain element of r and an element before the certain element as an element corresponding to the certain element, a second combining part 13n for generating [r?] of r? ? Fm+nt in which r? and an m-dimensional zero are combined, a permutation calculation part 14n for generating {{?}} of a permutation ? that stably sorts v? in ascending order, a permutation application part 15n for generating [s] of s: =?(r?) obtained by applying the permutation ? to r?, a vector generation part 16n for generating [s?] of a prefix-sum s? of s, an inverse permutation application part for generating [s?] of s? obtained by applying an inverse permutation ??1 of the permutation ? to s?, and an output part 17n for generating [x] of x ? Fm consisting of (nt+1)th and subsequent elements of s?.

Type: Grant

Filed: March 11, 2019

Date of Patent: October 25, 2022

Assignee: NIPPON TELEGRAPH AND TELEPHONE CORPORATION

Inventors: Dai Ikarashi, Koki Hamada
Device and method for convolution operation

Patent number: 11475100

Abstract: In accordance with an aspect of the present disclosure, there is provided a convolution operation method.

Type: Grant

Filed: September 28, 2020

Date of Patent: October 18, 2022

Assignee: SAPEON KOREA INC.

Inventor: Jeongho Han
Sorting networks using unary processing

Patent number: 11475288

Abstract: Various implementations of sorting networks are described that utilize time-encoded data signals having encoded values. In some examples, an electrical circuit device includes a sorting network configured to receive a plurality of time-encoded signals. Each time-encoded signal of the plurality of time-encoded signals encodes a data value based on a duty cycle of the respective time-encoded signal or based on a proportion of data bits in the respective time-encoded signal that are high relative to the total data bits in the respective time-encoded signal. The sorting network is also configured to sort the plurality of time-encoded signals based on the encoded data values of the plurality of time-encoded signals.

Type: Grant

Filed: November 5, 2019

Date of Patent: October 18, 2022

Assignee: Regents of the University of Minnesota

Inventors: Mohammadhassan Najafi, David J. Lilja, Marcus Riedel, Kiarash Bazargan
Tensor compression

Patent number: 11461625

Abstract: Lossy tensor compression and decompression circuits compress and decompress tensor elements based on the values of neighboring tensor elements. The lossy compression circuit scales each decompressed tensor element of a tile by a scaling factor that is based on the maximum value that can be represented by the number of bits used to represent a compressed tensor element, and the greatest value and least value of the tensor elements of the tile. The lossy decompression circuit performs the inverse of the lossy compression. The compression circuit and decompression circuit have parallel multiplier circuits and parallel adder circuits to perform the lossy compression and lossy decompression, respectively.

Type: Grant

Filed: May 22, 2020

Date of Patent: October 4, 2022

Assignee: XILINX, INC.

Inventors: Michael Wu, Christopher H. Dick
Methods and systems of implementing positive and negative neurons in a neural array-based flash memory

Patent number: 11461621

Abstract: In one aspect, A method for computing neural network computation includes the step of, providing plurality of neurons, coupled with a plurality of inputs, through a plurality of synapses. Each neuron output is given by an equation ?(Xi*Yi)+b. Xi*Yi comprises the ith synapse of the neuron. Xi comprises a set of Xi input vectors. Each Xi input vector is translated into an equivalent electrical signal for an ith corresponding synapse of the plurality of neurons, Yi comprises a set of Yi weight vectors, wherein each Yi weight vector comprises a parameter for the ith corresponding synapse of the plurality of neurons. Each synapse is a sub-system and the sub-system comprises a negative vector neural circuit, a positive vector neural circuit, and a set of four non-volatile memory weight cells for computation. The method includes the step of identifying the input vector x as a positive input vector or a negative input vector.

Type: Grant

Filed: June 25, 2019

Date of Patent: October 4, 2022

Inventors: Vishal Sarin, Purackal Mammen Mammen, Taber Smith
Digital RRAM-based convolutional block

Patent number: 11450385

Abstract: One embodiment provides a resistive random-access memory (RRAM) based convolutional block including a complementary pair of RRAMs having a first RRAM and a second RRAM, a programming circuit coupled to the complementary pair of RRAMs, and a XNOR sense amplifier circuit coupled to the complementary pair of RRAMs. The programming circuit is configured to receive a kernel bit from a kernel matrix, program the first RRAM to at least one selected from a group consisting of a low resistive state (LRS) and a high resistive state (HRS) based on the kernel bit, and program the second RRAM to other of the LRS and the HRS. The XNOR sense amplifier circuit is configured to receive an input bit from an input matrix, perform a XNOR operation between the input bit and the kernel bit read from the complementary pair of RRAMs, and output a XNOR output based on the XNOR operation.

Type: Grant

Filed: September 20, 2019

Date of Patent: September 20, 2022

Assignee: UNIVERSITY OF UTAH RESEARCH FOUNDATION

Inventors: Pierre-Emmanuel Gaillardon, Edouard Giacomin, Joao Vieira
Methods and apparatus for performing video processing matrix operations within a memory array

Patent number: 11449577

Abstract: Methods and apparatus for performing video processing matrix operations within a memory fabric. Various embodiments of the present disclosure are directed to converting a memory array into a matrix fabric for discrete cosine transform (DCT) matrix transformations and performing DCT matrix operations therein. Exemplary embodiments described herein perform DCT matrix-matrix multiplication operations within a memory device that includes a matrix fabric and matrix multiplication unit (MMU). In one embodiment, matrix-matrix multiplication operations are obtained using separate matrix-vector products. In one exemplary embodiment, the matrix fabric uses a “crossbar” construction of resistive elements. Each resistive element stores a level of impedance that represents the corresponding matrix coefficient value. The crossbar connectivity can be driven with an electrical signal representing the input vector as an analog voltage.

Type: Grant

Filed: November 20, 2019

Date of Patent: September 20, 2022

Assignee: Micron Technology, Inc.

Inventor: Fa-Long Luo
Product-sum operation device, neuromorphic device, and method for determining malfunction in product-sum operation device

Patent number: 11442695

Abstract: A product-sum operation device includes a product operator, a sum operator, and a malfunction determiner. The product operator includes a plurality of product operation elements (10AA) to (10AC), and each of the plurality of product operation elements (10AA) to (10AC) is a resistance change element. The sum operator includes an output detector that detects the sum of outputs from the plurality of product operation elements (10AA) to (10AC). The malfunction determiner determines that a malfunction has occurred when the sum detected by the output detector exceeds a specified value. The specified value is a value equal to or greater than a maximum value of the sum that can be detected by the output detector when the plurality of product operation elements (10AA) to (10AC) all operate normally.

Type: Grant

Filed: December 12, 2018

Date of Patent: September 13, 2022

Assignee: TDK CORPORATION

Inventor: Tatsuo Shibata
Jitter insertion system for waveform generation

Patent number: 11422584

Abstract: A test and measurement instrument for generating an analog waveform, including an interpolator configured to receive a digital signal and output interpolated samples of the digital signal at a sample rate, a filter modulation controller configured to output first filter coefficients at a first time and second filter coefficients at a second time, a convolver configured to generate a convolved signal by convolving the interpolated samples of the digital signal and the first filter coefficients and convolving the interpolated samples of the digital signal and the second filter coefficients; and a digital-to-analog converter configured to convert the convolved signal to an analog signal based on a fixed, constant clock signal.

Type: Grant

Filed: November 7, 2019

Date of Patent: August 23, 2022

Assignee: Tektronix, Inc.

Inventor: John J. Pickerd

prev … 3 4 5 6 7 8 9 10 11 … next