Patents by Inventor Sanchari Sen

Sanchari Sen has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

DEEP LEARNING OPTIMIZATION THROUGH ZERO TILE MANIPULATION

Publication number: 20250053803

Abstract: Processing zero weights within a data structure when performing a multiply and accumulate operation in a deep learning network as the result is itself a zero. Avoiding this step may save time and reduce power consumption in the training and operation of deep learning networks. An approach to zero-tile manipulation may be presented herein. An approach to permute and pack weighted data structures into zero-tile data structures may be presented. The zero-tiles may be configured in a structure which is optimized for the architecture of a parallel processing unit. The zero tile data structures may comprise vectors which instruct a the components in processing element to operate in a manner which prevents the element from expending energy when processing the zero tiles. An apparatus may also be presented in the immediate disclosure which can be configured to accept a zero-tile data structure.

Type: Application

Filed: August 11, 2023

Publication date: February 13, 2025

Inventors: Monodeep Kar, Subhankar Pal, Alper Buyuktosunoglu, Sanchari Sen
GENERATING PROGRAM BINARIES FOR A TRANSFORMER TO PROCESS SEQUENCE CONCATENATIONS HAVING DIFFERENT INPUT LENGTHS

Publication number: 20250004736

Abstract: Provided are a computer program product, system, and method for generating program binaries for a transformer to process sequence concatenations having different input lengths. A sequence concatenation length is selected of a sequence concatenation of concatenated tokens from inputs to process in the accelerator. A determination is made of combinations of input lengths. A representation of a transformer is processed to generate program binaries for each combination of input lengths of the combinations. The program binaries, for a given combination of input lengths of the combinations, program the accelerator to decompose tokens of inputs in the sequence concatenation according to the input lengths in the given combination and compute attention scores between tokens within each input of the inputs in the sequence concatenation. The program binaries, for a combination of input lengths, are executed to process a sequence concatenation of received inputs to generate outputs.

Type: Application

Filed: June 29, 2023

Publication date: January 2, 2025

Inventors: Swagath Venkataramani, Sanchari Sen, Amrit Nagarajan, Vijayalakshmi Srinivasan, Sarada Krithivasan
REUSING WEIGHTS AND BIASES IN AN ARTIFICIAL INTELLIGENCE ACCELERATOR FOR A NEURAL NETWORK FOR DIFFERENT MINIBATCH SIZES OF INFERENCES

Publication number: 20250005326

Abstract: Provided are a computer program product, system, and method for reusing weights and biases in an artificial intelligence accelerator for a neural network for different minibatch sizes of inferences. A minibatch size is selected of inference jobs batched to process in the accelerator. A representation of a neural network is processed to determine a set of weights and biases for the selected minibatch size to load into the core. The set of weights and biases is loaded into the core for use by the array of processing elements in the core of the accelerator. The weights and the biases are reused in the processing elements for the neural network, loaded for the selected minibatch size, to apply to minibatches of inferences having minibatch sizes less than the selected minibatch size.

Type: Application

Filed: June 29, 2023

Publication date: January 2, 2025

Inventors: Swagath Venkataramani, Marcel Schaal, Sanchari Sen, Amrit Nagarajan, Vijayalakshmi Srinivasan, Shyam Ramji
TESTING A SEMICONDUCTOR DEVICE USING X-RAYS

Publication number: 20240103072

Abstract: Embodiments described herein may be related to apparatuses, processes, systems, and/or techniques for using x-rays to alter or observe circuits within a semiconductor device before, during or after a test of the semiconductor device. Other embodiments may be described and/or claimed.

Type: Application

Filed: September 23, 2022

Publication date: March 28, 2024

Inventors: Patrick PARDY, Kimberlee CELIO, Sanchari SEN, May Ling OH, Shuai ZHAO, Joshua W. KEVEK, Evgeny Gregory NISENBOIM, Amir RAVEH, Boris SIMKHOVICH, Charles A. PETERSON, Kevin JOHNSON, Martin Eric Gostasson VON HAARTMAN, Eli ABU AYOB, Xianghong TONG
Exploiting fine-grained structured weight sparsity in systolic arrays

Patent number: 11941111

Abstract: Indices of non-zero weights may be stored in an index register file included within each of a plurality of processor elements in a systolic array. Non-zero weights may be stored in a register file associated with the index register file. Input values (e.g., dense input values) corresponding to a single block in a data structure may be sent to the plurality of processor elements. Those of the input values corresponding to the indices of non-zero weights in the index register file may be selected for performing multiply-accumulate (“MAC”) operation based on sending the plurality of input values to one or more of the plurality of processor elements. The indices of the plurality of non-zero weight are stored in an index data stick. The values of the plurality of non-zero weights are stored in a value data stick.

Type: Grant

Filed: July 31, 2021

Date of Patent: March 26, 2024

Assignee: International Business Machines Corporation

Inventors: Sanchari Sen, Swagath Venkataramani, Vijayalakshmi Srinivasan, Kailash Gopalakrishnan, Sunil K. Shukla
STICKIFICATION USING ANYWHERE PADDING TO ACCELERATE DATA MANIPULATION

Publication number: 20240028899

Abstract: Embodiments are provided for efficient realization of memory-bound operations in a computing system by a processor. Data may be read from and written to a memory at a granular level using a stickification operation. One or more regions of activation and weight tensor data on the memory may be annotated by coupling the stickification operation with padding.

Type: Application

Filed: July 25, 2022

Publication date: January 25, 2024

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Swagath VENKATARAMANI, Vijayalakshmi SRINIVASAN, Shubham JAIN, Sarada KRITHIVASAN, Sanchari SEN
Sparse systolic array design

Patent number: 11669489

Abstract: A systolic array can be configured to skip distributed operands that have zero-values, resulting in improved resource efficiency. A skip module is introduced to receive operands from memory, identify whether they have a zero value or not, and, if they are nonzero, generate an operand vector including an index before sending the operand vector to a processing element.

Type: Grant

Filed: September 30, 2021

Date of Patent: June 6, 2023

Assignee: International Business Machines Corporation

Inventors: Swagath Venkataramani, Sanchari Sen, Vijayalakshmi Srinivasan, Ankur Agrawal, Sunil K Shukla, Bruce Fleischer, Kailash Gopalakrishnan
Family of lossy sparse load SIMD instructions

Patent number: 11663001

Abstract: Systems, apparatuses, and methods for implementing a family of lossy sparse load single instruction, multiple data (SIMD) instructions are disclosed. A lossy sparse load unit (LSLU) loads a plurality of values from one or more input vector operands and determines how many non-zero values are included in one or more input vector operands of a given instruction. If the one or more input vector operands have less than a threshold number of non-zero values, then the LSLU causes an instruction for processing the one or more input vector operands to be skipped. In this case, the processing of the instruction of the one or more input vector operands is deemed to be redundant. If the one or more input vector operands have greater than or equal to the threshold number of non-zero values, then the LSLU causes an instruction for processing the input vector operand(s) to be executed.

Type: Grant

Filed: November 19, 2018

Date of Patent: May 30, 2023

Assignee: Advanced Micro Devices, Inc.

Inventors: Sanchari Sen, Derrick Allen Aguren, Joseph Lee Greathouse
SPARSE SYSTOLIC ARRAY DESIGN

Publication number: 20230109301

Abstract: A systolic array can be configured to skip distributed operands that have zero-values, resulting in improved resource efficiency. A skip module is introduced to receive operands from memory, identify whether they have a zero value or not, and, if they are nonzero, generate an operand vector including an index before sending the operand vector to a processing element.

Type: Application

Filed: September 30, 2021

Publication date: April 6, 2023

Inventors: Swagath Venkataramani, Sanchari Sen, Vijayalakshmi Srinivasan, Ankur Agrawal, Sunil K Shukla, Bruce Fleischer, Kailash Gopalakrishnan
EXPLOITING FINE-GRAINED STRUCTURED WEIGHT SPARSITY IN SYSTOLIC ARRAYS

Publication number: 20230030287

Abstract: Indices of non-zero weights may be stored in an index register file included within each of a plurality of processor elements in a systolic array. Non-zero weights may be stored in a register file associated with the index register file. Input values (e.g., dense input values) corresponding to a single block in a data structure may be sent to the plurality of processor elements. Those of the input values corresponding to the indices of non-zero weights in the index register file may be selected for performing multiply-accumulate (“MAC”) operation based on sending the plurality of input values to one or more of the plurality of processor elements. The indices of the plurality of non-zero weight are stored in an index data stick. The values of the plurality of non-zero weights are stored in a value data stick.

Type: Application

Filed: July 31, 2021

Publication date: February 2, 2023

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Sanchari SEN, Swagath VENKATARAMANI, Vijayalakshmi SRINIVASAN, Kailash GOPALAKRISHNAN, Sunil K. SHUKLA
FAMILY OF LOSSY SPARSE LOAD SIMD INSTRUCTIONS

Publication number: 20200159529

Abstract: Systems, apparatuses, and methods for implementing a family of lossy sparse load single instruction, multiple data (SIMD) instructions are disclosed. A lossy sparse load unit (LSLU) loads a plurality of values from one or more input vector operands and determines how many non-zero values are included in one or more input vector operands of a given instruction. If the one or more input vector operands have less than a threshold number of non-zero values, then the LSLU causes an instruction for processing the one or more input vector operands to be skipped. In this case, the processing of the instruction of the one or more input vector operands is deemed to be redundant. If the one or more input vector operands have greater than or equal to the threshold number of non-zero values, then the LSLU causes an instruction for processing the input vector operand(s) to be executed.

Type: Application

Filed: November 19, 2018

Publication date: May 21, 2020

Inventors: Sanchari Sen, Derrick Allen Aguren, Joseph Lee Greathouse