Patents by Inventor Malte Rasch

Malte Rasch has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Kernel sets normalization with capacitor charge sharing

Patent number: 11574694

Abstract: A method for multiple copies of a set of multi-kernel set operations in a hardware accelerated neural network includes a word line for receiving a pixel value of an input image. A bit line communicates a modified pixel value. An analog memory cell including a first capacitor stores a first kernel weight of a first kernel in one of a plurality of kernel sets such that the pixel value is operated on by the first kernel weight to produce the modified pixel value. A charge connection connects the first capacitor to at least a second capacitor storing a second kernel weight of a related kernel of a second one of the plurality of kernel sets such that charge is shared between the first capacitor and at least the second capacitor to normalize the first kernel weight and the second kernel weight.

Type: Grant

Filed: October 11, 2018

Date of Patent: February 7, 2023

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Effendi Leobandung, Tayfun Gokmen, Xiao Sun, Yulong Li, Malte Rasch
Multi-kernel configuration for convolutional neural networks

Patent number: 11556763

Abstract: Methods and systems of implementing a convolutional neural network are described. In an example, a structure may receive input signals and distribute the input signals to a plurality of unit cells. The structure may include a plurality of multi-kernel modules that may include a respective set of unit cells. A unit cell may correspond to an element of a kernel being implemented in the convolutional neural network and may include a storage component configured to store a weight of a corresponding element of the kernel. A first pass gate of the unit cell may be activated to pass a stored weight of the unit cell to a plurality of operation circuits in the corresponding unit cell, such that the stored weight may be applied to the input signals. The structure may generate a set of outputs based on the application of the stored weights to the input signals.

Type: Grant

Filed: February 19, 2019

Date of Patent: January 17, 2023

Assignee: International Business Machines Corporation

Inventors: Effendi Leobandung, Malte Rasch, Xiao Sun, Yulong Li, Zhibin Ren
Auto weight scaling for RPUs

Patent number: 11556770

Abstract: Techniques for auto weight scaling a bounded weight range of RPU devices with the size of the array during ANN training are provided. In one aspect, a method of ANN training includes: initializing weight values winit in the array to a random value, wherein the array represents a weight matrix W with m rows and n columns; calculating a scaling factor ? based on a size of the weight matrix W; providing digital inputs x to the array; dividing the digital inputs x by a noise and bound management factor ? to obtain adjusted digital inputs x?; performing a matrix-vector multiplication of the adjusted digital inputs x? with the array to obtain digital outputs y?; multiplying the digital outputs y? by the noise and bound management factor ?; and multiplying the digital outputs y? by the scaling factor ? to provide digital outputs y of the array.

Type: Grant

Filed: May 31, 2019

Date of Patent: January 17, 2023

Assignee: International Business Machines Corporation

Inventors: Malte Rasch, Tayfun Gokmen
Resistive processing unit cell having multiple weight update and read circuits for parallel processing of data using shared weight value

Patent number: 11537863

Abstract: A resistive processing unit cell includes a weight storage device to store a weight value of the resistive processing unit cell, and multiple circuit blocks. Each circuit block includes a weight update circuit coupled to dedicated update control lines, and a weight read circuit coupled to dedicated read control lines. The circuit blocks are configured to operate in parallel to (i) perform separate weight read operations in which each read circuit generates a read current based on a stored weight value, and outputs the read current on the dedicated read control lines of the read circuit, and (ii) perform separate weight update operations in which each update circuit receives respective update control signals on the dedicated update control lines, generates update currents based on the respective update control signals, and applies the update current to the weight storage device to adjust the weight value based on the update current.

Type: Grant

Filed: September 12, 2019

Date of Patent: December 27, 2022

Assignee: International Business Machines Corporation

Inventors: Effendi Leobandung, Zhibin Ren, Malte Rasch
Acceleration of convolutional neural networks on analog arrays

Patent number: 11443176

Abstract: Mechanisms are provided for acceleration of convolutional neural networks on analog arrays. Input ports receive image signals from frames in an input image. Input memory arrays store the image signals received from the input ports into a respective input memory location to create a plurality of image sub-regions in input memory arrays. A distributor associated each of a set of analog array tiles in an analog array to a part of image sub-regions of the input memory arrays, so that one or more of a set of analog memory components is associated with the image signals in a distribution order to create a respective output signal. An assembler stores each of the respective output signals into one of a set of memory outputs in an output order that is determined by the distribution order.

Type: Grant

Filed: March 22, 2019

Date of Patent: September 13, 2022

Assignee: International Business Machines Corporation

Inventors: Malte Rasch, Tayfun Gokmen, Mattia Rigotti, Wilfried Haensch
Noise and signal management for RPU array

Patent number: 11361218

Abstract: Advanced noise and signal management techniques for RPU arrays during ANN training are provided. In one aspect of the invention, a method for ANN training includes: providing an array of RPU devices with pre-normalizers and post-normalizers; computing and pre-normalizing a mean and standard deviation of all elements of an input vector x to the array that belong to the set group of each of the pre-normalizers; and computing and post-normalizing the mean ? and the standard deviation ? of all elements of an output vector y that belong to the set group of each of the post-normalizers.

Type: Grant

Filed: May 31, 2019

Date of Patent: June 14, 2022

Assignee: International Business Machines Corporation

Inventors: Malte Rasch, Tayfun Gokmen
Integrator voltage shifting for improved performance in softmax operation

Patent number: 11200297

Abstract: An apparatus and method are provided for saturation prevention of a current integrator in a Resistive Processing Unit-based (RPU-based) accelerator. The apparatus includes a set of hardware switches. The apparatus further includes a voltage generator, operatively coupled between an input terminal and an output terminal of the current integrator, reducing a magnitude of an output voltage at the output terminal of the current integrator during a current integration operation by selectively applying a non-zero initial voltage to the current integrator prior to the current integration operation, responsive to an operating state of the set of hardware switches.

Type: Grant

Filed: June 12, 2019

Date of Patent: December 14, 2021

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Seyoung Kim, Tayfun Gokmen, Malte Rasch
RESISTIVE PROCESSING UNIT CELL HAVING MULTIPLE WEIGHT UPDATE AND READ CIRCUITS FOR PARALLEL PROCESSING OF DATA USING SHARED WEIGHT VALUE

Publication number: 20210081775

Abstract: A resistive processing unit cell includes a weight storage device to store a weight value of the resistive processing unit cell, and multiple circuit blocks. Each circuit block includes a weight update circuit coupled to dedicated update control lines, and a weight read circuit coupled to dedicated read control lines. The circuit blocks are configured to operate in parallel to (i) perform separate weight read operations in which each read circuit generates a read current based on a stored weight value, and outputs the read current on the dedicated read control lines of the read circuit, and (ii) perform separate weight update operations in which each update circuit receives respective update control signals on the dedicated update control lines, generates update currents based on the respective update control signals, and applies the update current to the weight storage device to adjust the weight value based on the update current.

Type: Application

Filed: September 12, 2019

Publication date: March 18, 2021

Inventors: Effendi Leobandung, Zhibin Ren, Malte Rasch
INTEGRATOR VOLTAGE SHIFTING FOR IMPROVED PERFORMANCE IN SOFTMAX OPERATION

Publication number: 20200394252

Abstract: An apparatus and method are provided for saturation prevention of a current integrator in a Resistive Processing Unit-based (RPU-based) accelerator. The apparatus includes a set of hardware switches. The apparatus further includes a voltage generator, operatively coupled between an input terminal and an output terminal of the current integrator, reducing a magnitude of an output voltage at the output terminal of the current integrator during a current integration operation by selectively applying a non-zero initial voltage to the current integrator prior to the current integration operation, responsive to an operating state of the set of hardware switches.

Type: Application

Filed: June 12, 2019

Publication date: December 17, 2020

Inventors: Seyoung Kim, Tayfun Gokmen, Malte Rasch
Auto Weight Scaling for RPUs

Publication number: 20200380349

Abstract: Techniques for auto weight scaling a bounded weight range of RPU devices with the size of the array during ANN training are provided. In one aspect, a method of ANN training includes: initializing weight values winit in the array to a random value, wherein the array represents a weight matrix W with m rows and n columns; calculating a scaling factor ? based on a size of the weight matrix W; providing digital inputs x to the array; dividing the digital inputs x by a noise and bound management factor ? to obtain adjusted digital inputs x?; performing a matrix-vector multiplication of the adjusted digital inputs x? with the array to obtain digital outputs y?; multiplying the digital outputs y? by the noise and bound management factor ?; and multiplying the digital outputs y? by the scaling factor ? to provide digital outputs y of the array.

Type: Application

Filed: May 31, 2019

Publication date: December 3, 2020

Inventors: Malte Rasch, Tayfun Gokmen
Noise and Signal Management for RPU Array

Publication number: 20200380348

Abstract: Advanced noise and signal management techniques for RPU arrays during ANN training are provided. In one aspect of the invention, a method for ANN training includes: providing an array of RPU devices with pre-normalizers and post-normalizers; computing and pre-normalizing a mean and standard deviation of all elements of an input vector x to the array that belong to the set group of each of the pre-normalizers; and computing and post-normalizing the mean p and the standard deviation a of all elements of an output vector y that belong to the set group of each of the post-normalizers.

Type: Application

Filed: May 31, 2019

Publication date: December 3, 2020

Inventors: Malte Rasch, Tayfun Gokmen
Alignment techniques to match symmetry point as zero-weight point in analog crosspoint arrays

Patent number: 10831860

Abstract: Zero-shifting techniques in analog crosspoint arrays are provided. In one aspect, an analog array-based vector-matrix multiplication includes: a weight array connected to a reference array, each including a crossbar array having a set of conductive row wires and a set of conductive column wires intersecting the set of conductive row wires, and optimizable crosspoint devices at intersections of the set of conductive column wires and the set of conductive row wires. A method for analog array-based vector-matrix computing is also provided that includes: applying repeated voltage pulses to the crosspoint devices in the weight array until all of the crosspoint devices in the weight array converge to their own symmetry point; and copying conductance values for each crosspoint device from the weight array to the reference array.

Type: Grant

Filed: October 11, 2018

Date of Patent: November 10, 2020

Assignee: International Business Machines Corporation

Inventors: Seyoung Kim, Hyungjun Kim, Tayfun Gokmen, Malte Rasch
MULTI-KERNEL CONFIGURATION FOR CONVOLUTIONAL NEURAL NETWORKS

Publication number: 20200265298

Abstract: Methods and systems of implementing a convolutional neural network are described. In an example, a structure may receive input signals and distribute the input signals to a plurality of unit cells. The structure may include a plurality of multi-kernel modules that may include a respective set of unit cells. A unit cell may correspond to an element of a kernel being implemented in the convolutional neural network and may include a storage component configured to store a weight of a corresponding element of the kernel. A first pass gate of the unit cell may be activated to pass a stored weight of the unit cell to a plurality of operation circuits in the corresponding unit cell, such that the stored weight may be applied to the input signals. The structure may generate a set of outputs based on the application of the stored weights to the input signals.

Type: Application

Filed: February 19, 2019

Publication date: August 20, 2020

Inventors: Effendi Leobandung, Malte Rasch, Xiao Sun, Yulong Li, Zhibin Ren
KERNEL SETS NORMALIZATION WITH CAPACITOR CHARGE SHARING

Publication number: 20200118638

Abstract: A method for multiple copies of a set of multi-kernel set operations in a hardware accelerated neural network includes a word line for receiving a pixel value of an input image. A bit line communicates a modified pixel value. An analog memory cell including a first capacitor stores a first kernel weight of a first kernel in one of a plurality of kernel sets such that the pixel value is operated on by the first kernel weight to produce the modified pixel value. A charge connection connects the first capacitor to at least a second capacitor storing a second kernel weight of a related kernel of a second one of the plurality of kernel sets such that charge is shared between the first capacitor and at least the second capacitor to normalize the first kernel weight and the second kernel weight.

Type: Application

Filed: October 11, 2018

Publication date: April 16, 2020

Inventors: Effendi Leobandung, Tayfun Gokmen, Xiao Sun, Yulong Li, Malte Rasch
Alignment Techniques to Match Symmetry Point as Zero-Weight Point in Analog Crosspoint Arrays

Publication number: 20200117699

Abstract: Zero-shifting techniques in analog crosspoint arrays are provided. In one aspect, an analog array-based vector-matrix multiplication includes: a weight array connected to a reference array, each including a crossbar array having a set of conductive row wires and a set of conductive column wires intersecting the set of conductive row wires, and optimizable crosspoint devices at intersections of the set of conductive column wires and the set of conductive row wires. A method for analog array-based vector-matrix computing is also provided that includes: applying repeated voltage pulses to the crosspoint devices in the weight array until all of the crosspoint devices in the weight array converge to their own symmetry point; and copying conductance values for each crosspoint device from the weight array to the reference array.

Type: Application

Filed: October 11, 2018

Publication date: April 16, 2020

Inventors: Seyoung Kim, Hyungjun Kim, Tayfun Gokmen, Malte Rasch
Acceleration of Convolutional Neural Networks on Analog Arrays

Publication number: 20190354847

Abstract: Mechanisms are provided for acceleration of convolutional neural networks on analog arrays. Input ports receive image signals from frames in an input image. Input memory arrays store the image signals received from the input ports into a respective input memory location to create a plurality of image sub-regions in input memory arrays. A distributor associated each of a set of analog array tiles in an analog array to a part of image sub-regions of the input memory arrays, so that one or more of a set of analog memory components is associated with the image signals in a distribution order to create a respective output signal. An assembler stores each of the respective output signals into one of a set of memory outputs in an output order that is determined by the distribution order.

Type: Application

Filed: March 22, 2019

Publication date: November 21, 2019

Inventors: Malte Rasch, Tayfun Gokmen, Mattia Rigotti, Wilfried Haensch