Patents by Inventor Anamitra Roy Choudhury

Anamitra Roy Choudhury has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

PARALLEL AND DISTRIBUTED PROCESSING OF PROPOSITIONAL LOGICAL NEURAL NETWORKS

Publication number: 20230385599

Abstract: An embodiment may include a processor that identifies a plurality of weights from the propositional logical neural network. The embodiment may convert the plurality of weights into a sparse matrix. The embodiment may convert a training set into a plurality of bound vectors. The embodiment may update the sparse matrix using a graphical processing unit (GPU). The embodiment may compute a loss parameter and based on determining the loss function is below threshold, update the plurality of weights of the propositional neural network.

Type: Application

Filed: May 26, 2022

Publication date: November 30, 2023

Inventors: Venkatesan Thirumalai Chakaravarthy, Anamitra Roy Choudhury, Naweed Aghmad Khan, Francois Pierre Luus, Yogish Sabharwal
Accelerating inference of transformer-based models

Patent number: 11763082

Abstract: Methods, systems, and computer program products for accelerating inference of transformer-based models are provided herein. A computer-implemented method includes obtaining a machine learning model comprising a plurality of transformer blocks, a task, and a natural language dataset; generating a compressed version of the machine learning model based on the task and the natural language dataset, wherein the generating comprises: obtaining at least one set of tokens, wherein each token in the set corresponds to one of the items in the natural language dataset, identifying and removing one or more redundant output activations of different ones of the plurality of transformer blocks for the at least one set of tokens, and adding one or more input activations corresponding to the one or more removed output activations into the machine learning model at subsequent ones of the plurality of the transformer blocks; and outputting the compressed version of the machine learning model to at least one user.

Type: Grant

Filed: July 12, 2021

Date of Patent: September 19, 2023

Assignee: International Business Machines Corporation

Inventors: Saurabh Goyal, Anamitra Roy Choudhury, Saurabh Manish Raje, Venkatesan T. Chakaravarthy, Yogish Sabharwal, Ashish Verma
MULTI-OBJECTIVE OPTIMIZATION OF MACHINE LEARNING PIPELINES

Publication number: 20230281464

Abstract: A system, program product, and method for performing multi-objective automated machine learning. The method includes selecting two or more objectives from a plurality of objectives to be optimized and injecting data and the objectives into a first machine learning (ML) pipeline. The first ML pipeline includes one or more data transformation stages in communication with a modeling stage. The method also includes executing, subject to the injecting, optimization of the two or objectives. Such executing includes selecting a respective algorithm for each of the data transformation stages and the modeling stage. Each respective algorithm is associated with a first set of respective hyperparameters. The executing also includes generating a plurality of second ML pipelines.

Type: Application

Filed: March 4, 2022

Publication date: September 7, 2023

Inventors: Vaibhav Saxena, Anamitra Roy Choudhury, Aswin Kannan
MULTI-OBJECTIVE MACHINE LEARNING WITH MODEL AND HYPERPARAMETER OPTIMIZATION FUSION

Publication number: 20230069913

Abstract: Techniques for utilizing model and hyperparameter optimization for multi-objective machine learning are disclosed. In one example, a method comprises the following steps. One of a plurality of hyperparameter optimization operations and a plurality of model parameter optimization operations are performed to generate a first solution set. The other of the plurality of hyperparameter optimization operations and the plurality of model parameter optimization operations are performed to generate a second solution set. At least a portion of the first solution set and at least a portion of the second solution set are combined to generate a third solution set.

Type: Application

Filed: September 9, 2021

Publication date: March 9, 2023

Inventors: Aswin Kannan, Vaibhav Saxena, Anamitra Roy Choudhury, Yogish Sabharwal, Parikshit Ram, Ashish Verma, Saurabh Manish Raje
Model training with variable batch sizing and gradient checkpoint segments

Patent number: 11586932

Abstract: A computer-implemented machine learning model training method and resulting machine learning model. One embodiment of the method may comprise receiving at a computer memory training data; and training on a computer processor a machine learning model on the received training data using a plurality of batch sizes to produce a trained processor. The training may include calculating a plurality of activations during a forward pass of the training and discarding at least some of the calculated plurality of activations after the forward pass of the training.

Type: Grant

Filed: March 10, 2020

Date of Patent: February 21, 2023

Assignee: International Business Machines Corporation

Inventors: Saurabh Goyal, Anamitra Roy Choudhury, Yogish Sabharwal, Ashish Verma
ACCELERATING INFERENCE OF TRANSFORMER-BASED MODELS

Publication number: 20230015895

Abstract: Methods, systems, and computer program products for accelerating inference of transformer-based models are provided herein. A computer-implemented method includes obtaining a machine learning model comprising a plurality of transformer blocks, a task, and a natural language dataset; generating a compressed version of the machine learning model based on the task and the natural language dataset, wherein the generating comprises: obtaining at least one set of tokens, wherein each token in the set corresponds to one of the items in the natural language dataset, identifying and removing one or more redundant output activations of different ones of the plurality of transformer blocks for the at least one set of tokens, and adding one or more input activations corresponding to the one or more removed output activations into the machine learning model at subsequent ones of the plurality of the transformer blocks; and outputting the compressed version of the machine learning model to at least one user.

Type: Application

Filed: July 12, 2021

Publication date: January 19, 2023

Inventors: Saurabh Goyal, Anamitra Roy Choudhury, Saurabh Manish Raje, Venkatesan T. Chakaravarthy, Yogish Sabharwal, Ashish Verma
ACCELERATING INFERENCE OF NEURAL NETWORK MODELS VIA DYNAMIC EARLY EXITS

Publication number: 20220358358

Abstract: Methods, systems, and computer program products for accelerating inference of neural network models via dynamic early exits are provided herein. A computer-implemented method includes determining a plurality of candidate exit points of a neural network model; obtaining a plurality of outputs of the neural network model for data samples in a target dataset, wherein the plurality of outputs comprises early outputs of the neural network model from the plurality of candidate exit points and regular outputs of the neural network model; and a set of one or more exit points from the plurality of candidate exits points that are dependent on the target dataset based at least in part on the plurality of outputs.

Type: Application

Filed: May 4, 2021

Publication date: November 10, 2022

Inventors: Saurabh Manish Raje, Saurabh Goyal, Anamitra Roy Choudhury, Yogish Sabharwal, Ashish Verma
INPUT ORDERING NEURAL NETWORK DECOMPOSITION

Publication number: 20220092423

Abstract: One or more computer processors decompose a weight matrix associated with a neural network utilizing a permutation dependent decomposition. The one or more computer processors regenerate a recovered matrix utilizing the decomposed weight matrix. The one or more computer processors reduce an error between the decomposed weight matrix and regenerated recovered matrix.

Type: Application

Filed: September 21, 2020

Publication date: March 24, 2022

Inventors: Venkatesan T. Chakaravarthy, Anamitra Roy Choudhury, Saurabh Goyal, Saurabh Manish Raje, Yogish Sabharwal, ASHISH VERMA
MODEL TRAINING WITH VARIABLE BATCH SIZING AND GRADIENT CHECKPOINT SEGMENTS

Publication number: 20210287094

Abstract: A computer-implemented machine learning model training method and resulting machine learning model. One embodiment of the method may comprise receiving at a computer memory training data; and training on a computer processor a machine learning model on the received training data using a plurality of batch sizes to produce a trained processor. The training may include calculating a plurality of activations during a forward pass of the training and discarding at least some of the calculated plurality of activations after the forward pass of the training.

Type: Application

Filed: March 10, 2020

Publication date: September 16, 2021

Inventors: Saurabh Goyal, Anamitra Roy Choudhury, Yogish Sabharwal, Ashish Verma
Dataset Dependent Low Rank Decomposition Of Neural Networks

Publication number: 20200410336

Abstract: Methods, systems, and computer program products for dataset dependent low rank decomposition of neural networks are provided herein. A computer-implemented method includes obtaining a target dataset and a trained model of a neural network; providing at least a portion of the target dataset to the trained model; determining relevance of each of one or more of filters of the neural network and channels of the neural network to the target dataset based on the provided portion, wherein the one or more of the filters and the channels correspond to at least one layer of the neural network; and compressing the trained model of the neural network based at least in part on the determined relevancies.

Type: Application

Filed: June 26, 2019

Publication date: December 31, 2020

Inventors: Anamitra Roy Choudhury, Saurabh Goyal, Vivek Sharma, Venkatesan T. Chakaravarthy, Yogish Sabharwal, Ashish Verma
Dynamic Batch Sizing for Inferencing of Deep Neural Networks in Resource-Constrained Environments

Publication number: 20200125926

Abstract: Methods, systems, and computer program products for dynamic batch sizing for inferencing of deep neural networks in resource-constrained environments are provided herein. A computer-implemented method includes obtaining, as input for inferencing of one or more deep neural networks, (i) an inferencing model and (ii) one or more resource constraints; computing, based at least in part on the obtained input, a set of statistics pertaining to resource utilization for each of multiple layers in the one or more deep neural networks; determining, based at least in part on (i) the obtained input and (ii) the computed set of statistics, multiple batch sizes to be used for inferencing the multiple layers of the one or more deep neural networks; and outputting, to at least one user, the determined batch sizes to be used for inferencing the multiple layers of the one or more deep neural networks.

Type: Application

Filed: October 23, 2018

Publication date: April 23, 2020

Inventors: Anamitra Roy Choudhury, Saurabh Goyal, Yogish Sabharwal, Ashish Verma, Dharma Teja Vooturi
System and method for condition monitoring of redox flow batteries using data analytics

Patent number: 10396375

Abstract: One embodiment provides a method for predicting maintenance of a redox flow battery, the method including: receiving, from a plurality of sensors, data regarding characteristics of the redox flow battery; weighting, using a processor, each of the characteristics to form an estimated state parameter for the redox flow battery; and determining, using the processor, a maintenance action for the redox flow battery using the estimated state parameter. Other aspects are described and claimed.

Type: Grant

Filed: March 24, 2016

Date of Patent: August 27, 2019

Assignee: INTERNATIONAL BUSNIESS MACHINES CORPORATION

Inventors: Anamitra Roy Choudhury, Sampath Dechu, Pratyush Kumar
SYSTEM AND METHOD FOR CONDITION MONITORING OF REDOX FLOW BATTERIES USING DATA ANALYTICS

Publication number: 20170279140

Abstract: One embodiment provides a method for predicting maintenance of a redox flow battery, the method including: receiving, from a plurality of sensors, data regarding characteristics of the redox flow battery; weighting, using a processor, each of the characteristics to form an estimated state parameter for the redox flow battery; and determining, using the processor, a maintenance action for the redox flow battery using the estimated state parameter. Other aspects are described and claimed.

Type: Application

Filed: March 24, 2016

Publication date: September 28, 2017

Inventors: Anamitra Roy Choudhury, Sampath Dechu, Pratyush Kumar