Patents by Inventor Jungwook CHOI

Jungwook CHOI has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

ELECTRONIC DEVICE FOR FINE-TUNING A MACHINE LEARNING MODEL AND METHOD OF OPERATING THE ELECTRONIC DEVICE

Publication number: 20250238665

Abstract: An electronic device for fine-tuning a machine learning model and a method of operating the electronic device are provided. The electronic device includes at least one processor and a memory configured to store instructions executable by the at least one processor. When at least some of the instructions are executed by the at least one processor, the at least some of the instructions executed control the electronic device to determine a final weight of a current layer of a neural network by quantizing an addition result of combining a quantized base weight in low precision to an adapter weight in high precision, generate a product result based on the final weight and an activation input of the current layer, and transmit the multiplication result to a next layer of the neural network.

Type: Application

Filed: August 20, 2024

Publication date: July 24, 2025

Inventors: Jaehyung Ahn, Jungwook Choi, Minsoo Kim, Sihwa Lee
Neural network circuitry having floating point format with asymmetric range

Patent number: 12217158

Abstract: An apparatus includes circuitry for a neural network that is configured to perform forward propagation neural network operations on floating point numbers having a first n-bit floating point format. The first n-bit floating point format has a configuration consisting of a sign bit, m exponent bits and p mantissa bits where m is greater than p. The circuitry is further configured to perform backward propagation neural network operations on floating point numbers having a second n-bit floating point format that is different than the first n-bit floating point format. The second n-bit floating point format has a configuration consisting of a sign bit, q exponent bits and r mantissa bits where q is greater than m and r is less than p.

Type: Grant

Filed: September 3, 2019

Date of Patent: February 4, 2025

Assignee: International Business Machines Corporation

Inventors: Xiao Sun, Jungwook Choi, Naigang Wang, Chia-Yu Chen, Kailash Gopalakrishnan
Machine learning hardware having reduced precision parameter components for efficient parameter update

Patent number: 12175359

Abstract: An apparatus for training and inferencing a neural network includes circuitry that is configured to generate a first weight having a first format including a first number of bits based at least in part on a second weight having a second format including a second number of bits and a residual having a third format including a third number of bits. The second number of bits and the third number of bits are each less than the first number of bits. The circuitry is further configured to update the second weight based at least in part on the first weight and to update the residual based at least in part on the updated second weight and the first weight. The circuitry is further configured to update the first weight based at least in part on the updated second weight and the updated residual.

Type: Grant

Filed: September 3, 2019

Date of Patent: December 24, 2024

Assignee: International Business Machines Corporation

Inventors: Xiao Sun, Jungwook Choi, Naigang Wang, Chia-Yu Chen, Kailash Gopalakrishnan
Method to map convolutional layers of deep neural network on a plurality of processing elements with SIMD execution units, private memories, and connected as a 2D systolic processor array

Patent number: 12141513

Abstract: A method for improving performance of a predefined Deep Neural Network (DNN) convolution processing on a computing device includes inputting parameters, as input data into a processor on a computer that formalizes a design space exploration of a convolution mapping, on a predefined computer architecture that will execute the predefined convolution processing. The parameters are predefined as guided by a specification for the predefined convolution processing to be implemented by the convolution mapping and by a microarchitectural specification for the processor that will execute the predefined convolution processing. The processor calculates performance metrics for executing the predefined convolution processing on the computing device, as functions of the predefined parameters, as proxy estimates of performance of different possible design choices to implement the predefined convolution processing.

Type: Grant

Filed: October 31, 2018

Date of Patent: November 12, 2024

Assignee: International Business Machines Corporation

Inventors: Chia-Yu Chen, Jungwook Choi, Kailash Gopalakrishnan, Vijayalakshmi Srinivasan, Swagath Venkataramani, Jintao Zhang
METHOD AND APPARATUS FOR SPARSE INPUT-OUTPUT INDEX GENERATION OF SPARSE CONVOLUTION

Publication number: 20240338419

Abstract: A method of convolution operation based sparse data using artificial neural network comprises: a step of extracting index information, location information about a valid data where actual data exists in an input data; a step of generating first location information including computable row information where actual operations are performed in a kernel based on a path along which the kernel moves to perform a convolution operation on the input data and the index information; a step of generating second location information including computable column information where an actual operation is performed in the kernel based on the first location information, the index information, and the kernel size; a step of generating an operation rule for each point of the valid data and convolution output data based on the index information, and the first and second location information; and a step of performing the convolution operation based on the operation rule.

Type: Application

Filed: June 17, 2024

Publication date: October 10, 2024

Inventors: Minjae Lee, Janghwan Lee, Jun Won Choi, Jungwook Choi
Low precision deep neural network enabled by compensation instructions

Patent number: 12056594

Abstract: A compensated deep neural network (compensated-DNN) is provided. A first vector having a set of components and a second vector having a set of corresponding components are received. A component of the first vector includes a first quantized value and a first compensation instruction, and a corresponding component of the second vector includes a second quantized value and a second compensation instruction. The first quantized value is multiplied with the second quantized value to compute a raw product value. The raw product value is compensated for a quantization error according to the first and second compensation instructions to produce a compensated product value. The compensated product value is added into an accumulated value for the dot product. The accumulated value is converted into an output vector of the dot product. The output vector includes an output quantized value and an output compensation instruction.

Type: Grant

Filed: June 27, 2018

Date of Patent: August 6, 2024

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Swagath Venkataramani, Shubham Jain, Vijayalakshmi Srinivasan, Jungwook Choi, Leland Chang
METHOD AND APPARATUS WITH TRAINING OF BATCH NORM PARAMETER

Publication number: 20240152753

Abstract: Disclosed is a processor implemented method that includes calculating a quantization error for each channel of a neural network using activation data output from a first layer of the neural network and a quantization scale of a second layer connected to the first layer, calculating a final loss using a regularization loss term determined based on the quantization error for each channel, and updating a batch norm parameter of the first layer in a direction to decrease the final loss.

Type: Application

Filed: October 27, 2023

Publication date: May 9, 2024

Applicants: SAMSUNG ELECTRONICS CO., LTD., IUCF-HYU(Industry-University Cooperation Foundation Hanyang University)

Inventors: Jungwook CHOI, Seongmin PARK
Compression of fully connected / recurrent layers of deep network(s) through enforcing spatial locality to weight matrices and effecting frequency compression

Patent number: 11977974

Abstract: A system, having a memory that stores computer executable components, and a processor that executes the computer executable components, reduces data size in connection with training a neural network by exploiting spatial locality to weight matrices and effecting frequency transformation and compression. A receiving component receives neural network data in the form of a compressed frequency-domain weight matrix. A segmentation component segments the initial weight matrix into original sub-components, wherein respective original sub-components have spatial weights. A sampling component applies a generalized weight distribution to the respective original sub-components to generate respective normalized sub-components. A transform component applies a transform to the respective normalized sub-components.

Type: Grant

Filed: November 30, 2017

Date of Patent: May 7, 2024

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Chia-Yu Chen, Jungwook Choi, Kailash Gopalakrishnan, Suyog Gupta, Pritish Narayanan
COMPUTER SYSTEMS FOR COMPRESSING TRANSFORMER MODELS AND QUANTIZATION TRAINING METHODS THEREOF

Publication number: 20240028888

Abstract: A method for quantization learning by a model quantizer that is operating in a computer system and compressing a transformer model. The method may include generating a student model through quantization of the transformer model, performing a first quantization learning by inserting a self-attention map of a teacher model into a self-attention map of the student model, and performing a second quantization learning using a knowledge distillation method so that the self-attention map of the student model follows the self-attention map of the teacher model.

Type: Application

Filed: January 26, 2023

Publication date: January 25, 2024

Applicants: SAMSUNG ELECTRONICS CO., LTD., IUCF-HYU (Industry-University Cooperation Foundation Hanyang University)

Inventors: Yongsuk Kwon, Jungwook Choi, Minsoo Kim, Seongmin Park
APPARATUS AND METHOD WITH NEURAL NETWORK OPERATION

Publication number: 20230306242

Abstract: An apparatus and method with neural network operation are provided. A computing apparatus includes one or more processors, storage hardware storing instructions configured to, when executed by the one or more processors, cause the one or more processors to: extract calibration data from training data that is for training a main neural network, based on the calibration data, generate a look up table (LUT) for performing a non-linear function of the main neural network through an auxiliary network corresponding to a layer of the main neural network, and update a parameter of the LUT based on an output of the non-linear function and based on an output of the auxiliary network.

Type: Application

Filed: February 21, 2023

Publication date: September 28, 2023

Applicants: SAMSUNG ELECTRONICS CO., LTD., IUCF-HYU(Industry-University Cooperation Foundation Hanyang University)

Inventors: Jungwook CHOI, Seongmin PARK
METHOD AND APPARATUS FOR NEURAL NETWORK OPERATION

Publication number: 20230118505

Abstract: A neural network operation apparatus may include a receiver configured to receive input data to perform the neural network operation and a quantized Look Up Table (LUT) corresponding to a non-linear function comprised in the neural network operation, and a processor configured to perform scale-up on the input data based on a scale factor, to extract a quantized LUT parameter from the quantized LUT based on scaled-up input data, and to generate an operation result by performing a neural network operation based on the quantized LUT parameter.

Type: Application

Filed: August 12, 2022

Publication date: April 20, 2023

Applicants: Samsung Electronics Co., Ltd., IUCF-HYU (Industry-University Cooperation Foundation Hanyang University)

Inventors: Donghyun Lee, Joonsang Yu, Junki Park, Jungwook Choi
Hybrid floating point representation for deep learning acceleration

Patent number: 11620105

Abstract: In an embodiment, a method includes configuring a specialized circuit for floating point computations using numbers represented by a hybrid format, wherein the hybrid format includes a first format and a second format. In the embodiment, the method includes operating the further configured specialized circuit to store an approximation of a numeric value in the first format during a forward pass for training a deep learning network. In the embodiment, the method includes operating the further configured specialized circuit to store an approximation of a second numeric value in the second format during a backward pass for training the deep learning network.

Type: Grant

Filed: December 21, 2020

Date of Patent: April 4, 2023

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Naigang Wang, Jungwook Choi, Kailash Gopalakrishnan, Ankur Agrawal, Silvia Melitta Mueller
Reusing an operand received from a first-in-first-out (FIFO) buffer according to an operand specifier value specified in a predefined field of an instruction

Patent number: 11620132

Abstract: Various embodiments are provided reusing an operand in an instruction set architecture (ISA) by one or more processors in a computing system. An instruction may specify that an operand register for a selected operand retain operand data used by a previous instruction. The operand data in the operand register may be reused by the instruction.

Type: Grant

Filed: May 8, 2019

Date of Patent: April 4, 2023

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Bruce Fleischer, Sunil Shukla, Vijayalakshmi Srinivasan, Jungwook Choi
Formation failure resilient neuromorphic device

Patent number: 11610101

Abstract: A neuromorphic device includes a plurality of first control lines, a plurality of second control lines and a matrix of resistive processing unit cells. Each resistive processing unit cell is electrically connected with one of the first control lines and one of the second control lines. A given resistive processing unit cell includes a first resistive device and a second resistive device. The first resistive device is a positively weighted resistive device and the second resistive device is a negatively weighted resistive device.

Type: Grant

Filed: August 30, 2019

Date of Patent: March 21, 2023

Assignee: International Business Machines Corporation

Inventors: Youngseok Kim, Jungwook Choi, Seyoung Kim, Chun-Chen Yeh
Mixed precision capable hardware for tuning a machine learning model

Patent number: 11604647

Abstract: An apparatus includes a memory and a processor coupled to the memory. The processor includes first and second sets of arithmetic units having first and second precision for floating-point computations, the second precision being lower than the first precision. The processor is configured to obtain a machine learning model trained in the first precision, to utilize the second set of arithmetic units to perform inference on input data, to utilize the first set of arithmetic units to generate feedback for updating parameters of the second set of arithmetic units based on the inference performed on the input data by the second set of arithmetic units, to tune parameters of the second set of arithmetic units based at least in part on the feedback generated by the first set of arithmetic units, and to utilize the second set of arithmetic units with the tuned parameters to generate inference results.

Type: Grant

Filed: September 3, 2019

Date of Patent: March 14, 2023

Assignee: International Business Machines Corporation

Inventors: Xiao Sun, Chia-Yu Chen, Naigang Wang, Jungwook Choi, Kailash Gopalakrishnan
Statistics-aware weight quantization

Patent number: 11551077

Abstract: Techniques for statistics-aware weight quantization are presented. To facilitate reducing the bit precision of weights, for a set of weights, a quantizer management component can estimate a quantization scale value to apply to a weight as a linear or non-linear function of the mean of a square of a weight value of the weight and the mean of an absolute value of the weight value, wherein the quantization scale value is determined to have a smaller quantization error than all, or at least almost all, other quantization errors associated with other quantization scale values. A quantizer component applies the quantization scale value to symmetrically and/or uniformly quantize weights of a layer of the set of weights to generate quantized weights, the weights being quantized using rounding. The respective quantized weights can be used to facilitate training and inference of a deep learning system.

Type: Grant

Filed: June 13, 2018

Date of Patent: January 10, 2023

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Zhuo Wang, Jungwook Choi, Kailash Gopalakrishnan, Pierce I-Jen Chuang
System-aware selective quantization for performance optimized distributed deep learning

Patent number: 11551054

Abstract: A convolutional neural network includes a front layer, a back layer, and a plurality of other layers that are connected between the front layer and the back layer. One of the other layers is a transition layer. A first precision is assigned to activations of neurons from the front layer back to the transition layer and a second precision is assigned to activations of the neurons from the transition layer back to the back layer. A third precision is assigned to weights of inputs to neurons from the front layer back to the transition layer and a fourth precision is assigned to weights of inputs to the neurons from the transition layer back to the back layer. In some embodiments the layers forward of the transition layer have a different convolutional kernel than the layers rearward of the transition layer.

Type: Grant

Filed: August 27, 2019

Date of Patent: January 10, 2023

Assignee: International Business Machines Corporation

Inventors: Jungwook Choi, Swagath Venkataramani, Vijayalakshmi Srinivasan, Kailash Gopalakrishnan
Dynamically resizing minibatch in neural network execution

Patent number: 11354573

Abstract: A minibatch in a neural network execution may be dynamically resized based on on-chip memory. For example, a size of the minibatch is configured such that the minibatch fits within on-chip memory. The size of the minibatch may be resized for a sequence of layers in the neural network execution. A next layer's execution can commence responsive to the resized minibatch being completed in a previous layer without having to wait for all of the minibatch to be completed in the previous layer.

Type: Grant

Filed: March 25, 2019

Date of Patent: June 7, 2022

Assignee: International Business Machines Corporation

Inventors: Swagath Venkataramani, Vijayalakshmi Srinivasan, Jungwook Choi
Reduced precision based programmable and SIMD dataflow architecture

Patent number: 11347517

Abstract: A reduced precision based programmable and single instruction multiple data (SIMD) dataflow architecture includes reduced precision execution units with a majority of the execution units operating at reduced precision and a minority of the execution units are capable of operating at higher precision. The execution units operate in parallel within a programmable execution element to share instruction fetch, decode, and issue pipelines and operate on the same instruction in lock-step to minimize instruction-related overhead.

Type: Grant

Filed: June 20, 2019

Date of Patent: May 31, 2022

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Kailash Gopalakrishnan, Sunil Shukla, Jungwook Choi, Silvia Mueller, Bruce Fleischer, Vijayalakshmi Srinivasan, Ankur Agrawal, Jinwook Oh
Robust gradient weight compression schemes for deep learning applications

Patent number: 11295208

Abstract: Embodiments of the present invention provide a computer-implemented method for adaptive residual gradient compression for training of a deep learning neural network (DNN). The method includes obtaining, by a first learner, a current gradient vector for a neural network layer of the DNN, in which the current gradient vector includes gradient weights of parameters of the neural network layer that are calculated from a mini-batch of training data. A current residue vector is generated that includes residual gradient weights for the mini-batch. A compressed current residue vector is generated based on dividing the residual gradient weights of the current residue vector into a plurality of bins of a uniform size and quantizing a subset of the residual gradient weights of one or more bins of the plurality of bins. The compressed current residue vector is then transmitted to a second learner of the plurality of learners or to a parameter server.

Type: Grant

Filed: December 4, 2017

Date of Patent: April 5, 2022

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Ankur Agrawal, Daniel Brand, Chia-Yu Chen, Jungwook Choi, Kailash Gopalakrishnan

1 2 3 4 next