Patents by Inventor Parijat Dube

Parijat Dube has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Training transfer-focused models for deep learning

Patent number: 11853877

Abstract: Whether to train a new neural network model can be determined based on similarity estimates between a sample data set and a plurality of source data sets associated with a plurality of prior-trained neural network models. A cluster among the plurality of prior-trained neural network models can be determined. A set of training data based on the cluster can be determined. The new neural network model can be trained based on the set of training data.

Type: Grant

Filed: April 2, 2019

Date of Patent: December 26, 2023

Assignee: International Business Machines Corporation

Inventors: Patrick Watson, Bishwaranjan Bhattacharjee, Siyu Huo, Noel Christopher Codella, Brian Michael Belgodere, Parijat Dube, Michael Robert Glass, John Ronald Kender, Matthew Leon Hill
AUTOMATED DATA LABELING USING A GEOMETRIC APPROACH

Publication number: 20230326186

Abstract: An automated data labeling method, system, and computer program product that includes composing a semantically-named anchor vector derived from a source dataset into a sequence that defines a location description for target data items based on a generalization of distances into Cayley-Menger content and outputting a label for a target data item based on the location description.

Type: Application

Filed: March 28, 2022

Publication date: October 12, 2023

Inventors: Parijat Dube, John Ronald Kender, Bishwaranjan Bhattacharjee, Brian Michael Belgodere
Runtime estimation for machine learning tasks

Patent number: 11727309

Abstract: Techniques for estimating runtimes of one or more machine learning tasks are provided. For example, one or more embodiments described herein can regard a system that can comprise a memory that stores computer executable components. The system can also comprise a processor, operably coupled to the memory, and that can execute the computer executable components stored in the memory. The computer executable components can comprise an extraction component that can extract a parameter from a machine learning task. The parameter can define a performance characteristic of the machine learning task. Also, the computer executable components can comprise a model component that can generate a model based on the parameter. Further, the computer executable components can comprise an estimation component that can generate an estimated runtime of the machine learning task based on the model.

Type: Grant

Filed: October 28, 2021

Date of Patent: August 15, 2023

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Parijat Dube, Gauri Joshi, Priya Ashok Nagpurkar, Stefania Costache, Diana Jeanne Arroyo, Zehra Noman Sura
Scalable model serving

Patent number: 11604961

Abstract: A neural network models fragmenting method, system, and computer program product include recursively factoring out common prefixes of models, constructing a hierarchy of decomposed model fragments based on the factoring, and grouping the constructed hierarchy for deployment.

Type: Grant

Filed: April 30, 2019

Date of Patent: March 14, 2023

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Vinod Muthusamy, Parijat Dube, Kaoutar El Maghraoui, Falk Pollok
Data driven mixed precision learning for neural networks

Patent number: 11568235

Abstract: Embodiments for implementing mixed precision learning for neural networks by a processor. A neural network may be replicated into a plurality of replicated instances and each of the plurality of replicated instances differ in precision used for representing and determining parameters of the neural network. Data instances may be routed to one or more of the plurality of replicated instances for processing according to a data pre-processing operation.

Type: Grant

Filed: November 19, 2018

Date of Patent: January 31, 2023

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Zehra Sura, Parijat Dube, Bishwaranjan Bhattacharjee, Tong Chen
Performance based switching of a model training process

Patent number: 11551145

Abstract: Systems, computer-implemented methods, and computer program products that can facilitate switching a model training process from a ground truth training phase to an adversarial training phase based on performance of a model trained in the ground truth training phase are provided. According to an embodiment, a system can comprise a memory that stores computer executable components and a processor that executes the computer executable components stored in the memory. The computer executable components can comprise an analysis component that identifies a performance condition of a model trained in a model training process. The computer executable components can further comprise a trainer component that switches the model training process from a ground truth training process to an adversarial training process based on the identified performance condition.

Type: Grant

Filed: February 5, 2020

Date of Patent: January 10, 2023

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Sidharth Gupta, Parijat Dube, Ashish Verma
RUNTIME ESTIMATION FOR MACHINE LEARNING TASKS

Publication number: 20220051142

Abstract: Techniques for estimating runtimes of one or more machine learning tasks are provided. For example, one or more embodiments described herein can regard a system that can comprise a memory that stores computer executable components. The system can also comprise a processor, operably coupled to the memory, and that can execute the computer executable components stored in the memory. The computer executable components can comprise an extraction component that can extract a parameter from a machine learning task. The parameter can define a performance characteristic of the machine learning task. Also, the computer executable components can comprise a model component that can generate a model based on the parameter. Further, the computer executable components can comprise an estimation component that can generate an estimated runtime of the machine learning task based on the model.

Type: Application

Filed: October 28, 2021

Publication date: February 17, 2022

Inventors: Parijat Dube, Gauri Joshi, Priya Ashok Nagpurkar, Stefania Costache, Diana Jeanne Arroyo, Zehra Noman Sura
Runtime estimation for machine learning tasks

Patent number: 11200512

Abstract: Techniques for estimating runtimes of one or more machine learning tasks are provided. For example, one or more embodiments described herein can regard a system that can comprise a memory that stores computer executable components. The system can also comprise a processor, operably coupled to the memory, and that can execute the computer executable components stored in the memory. The computer executable components can comprise an extraction component that can extract a parameter from a machine learning task. The parameter can define a performance characteristic of the machine learning task. Also, the computer executable components can comprise a model component that can generate a model based on the parameter. Further, the computer executable components can comprise an estimation component that can generate an estimated runtime of the machine learning task based on the model.

Type: Grant

Filed: February 21, 2018

Date of Patent: December 14, 2021

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Parijat Dube, Gauri Joshi, Priya Ashok Nagpurkar, Stefania Costache, Diana Jeanne Arroyo, Zehra Noman Sura
Adaptive learning rate schedule in distributed stochastic gradient descent

Patent number: 11182689

Abstract: A method for performing machine learning includes assigning processing jobs to a plurality of model learners, using a central parameter server. The processing jobs includes solving gradients based on a current set of parameters. As the results from the processing job are returned, the set of parameters is iterated. A degree of staleness of the solving of the second gradient is determined based on a difference between the set of parameters when the jobs are assigned and the set of parameters when the jobs are returned. The learning rates used to iterate the parameters based on the solved gradients are proportional to the determined degrees of staleness.

Type: Grant

Filed: March 28, 2018

Date of Patent: November 23, 2021

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Parijat Dube, Sanghamitra Dutta, Gauri Joshi, Priya A. Nagpurkar
Generating and augmenting transfer learning datasets with pseudo-labeled images

Patent number: 11151410

Abstract: A computer-implemented method for data labeling is provided. The computer-implemented method assigns pseudo-labels to unlabeled examples of data using a similarity metric on an embedding space to produce pseudo-labeled examples. A curriculum learning model is trained using the pseudo-labeled examples. The curriculum learning model trained with the pseudo-labeled examples is employed in in a fine-tuning task to enhance classification accuracy of the data.

Type: Grant

Filed: September 7, 2018

Date of Patent: October 19, 2021

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Patrick Watson, Bishwaranjan Bhattacharjee, Siyu Huo, Noel C. Codella, Brian M. Belgodere, Parijat Dube, Michael R. Glass, John R. Kender, Matthew L. Hill
PERFORMANCE BASED SWITCHING OF A MODEL TRAINING PROCESS

Publication number: 20210241169

Abstract: Systems, computer-implemented methods, and computer program products that can facilitate switching a model training process from a ground truth training phase to an adversarial training phase based on performance of a model trained in the ground truth training phase are provided. According to an embodiment, a system can comprise a memory that stores computer executable components and a processor that executes the computer executable components stored in the memory. The computer executable components can comprise an analysis component that identifies a performance condition of a model trained in a model training process. The computer executable components can further comprise a trainer component that switches the model training process from a ground truth training process to an adversarial training process based on the identified performance condition.

Type: Application

Filed: February 5, 2020

Publication date: August 5, 2021

Inventors: Sidharth Gupta, Parijat Dube, Ashish Verma
AUTOMATED FINE-TUNING OF A PRE-TRAINED NEURAL NETWORK FOR TRANSFER LEARNING

Publication number: 20210174191

Abstract: In an embodiment, a method for fine-tuning a pre-trained neural network for transfer learning, the method comprising obtaining a first target feature vector from a first layer of a pre-trained neural network responsive to a first target data element of a target dataset passing therethrough, obtaining a first source feature vector associated with the first layer of the pre-trained neural network, calculating a first divergence value for the first layer of the pre-trained neural network based at least in part on the first target feature vector and the first source feature vector, and setting a learning rate for the first layer of the pre-trained neural network based at least in part on the first divergence value.

Type: Application

Filed: December 5, 2019

Publication date: June 10, 2021

Applicant: International Business Machines Corporation

Inventors: Parijat Dube, Bishwaranjan Bhattacharjee, Patrick Watson, John Ronald Kender
Adjusting simulation times for cost simulation analysis of transportation lane proposals based on space and time granularities

Patent number: 10929805

Abstract: A method and system of simulating a cost of shipment are provided. A request for quote (RFQ) is received by a computing device from a shipper having a plurality of trade lanes. A revenue generated from each trade lane is estimated and ranked based on the estimated revenue. An original time limit is assigned to each trade lane. A trade lane with a highest ranking that has not yet been selected is selected. For the selected trade lane, graphs for space and time granularity analysis are generated. Space and time granularities that maximize accuracy within the assigned time limit based on the generated graphs are calculated. Cost simulation analysis is performed using the calculated space and time granularities. Upon determining that there are trade lanes not yet selected, there is a return to the act of selecting a trade lane.

Type: Grant

Filed: August 1, 2018

Date of Patent: February 23, 2021

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Joao Goncalves, Parijat Dube
SCALABLE MODEL SERVING

Publication number: 20200349413

Abstract: A neural network models fragmenting method, system, and computer program product include recursively factoring out common prefixes of models, constructing a hierarchy of decomposed model fragments based on the factoring, and grouping the constructed hierarchy for deployment.

Type: Application

Filed: April 30, 2019

Publication date: November 5, 2020

Inventors: Vinod Muthusamy, Parijat Dube, Kaoutar El Maghraoui, Falk Pollok
TRAINING TRANSFER-FOCUSED MODELS FOR DEEP LEARNING

Publication number: 20200320379

Abstract: Whether to train a new neural network model can be determined based on similarity estimates between a sample data set and a plurality of source data sets associated with a plurality of prior-trained neural network models. A cluster among the plurality of prior-trained neural network models can be determined. A set of training data based on the cluster can be determined. The new neural network model can be trained based on the set of training data.

Type: Application

Filed: April 2, 2019

Publication date: October 8, 2020

Inventors: Patrick Watson, Bishwaranjan Bhattacharjee, Siyu Huo, Noel Christopher Codella, Brian Michael Belgodere, Parijat Dube, Michael Robert Glass, John Ronald Kender, Matthew Leon Hill
DATA DRIVEN MIXED PRECISION LEARNING FOR NEURAL NETWORKS

Publication number: 20200160169

Abstract: Embodiments for implementing mixed precision learning for neural networks by a processor. A neural network may be replicated into a plurality of replicated instances and each of the plurality of replicated instances differ in precision used for representing and determining parameters of the neural network. Data instances may be routed to one or more of the plurality of replicated instances for processing according to a data pre-processing operation.

Type: Application

Filed: November 19, 2018

Publication date: May 21, 2020

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Zehra SURA, Parijat DUBE, Bishwaranjan BHATTACHARJEE, Tong CHEN
GENERATING AND AUGMENTING TRANSFER LEARNING DATASETS WITH PSEUDO-LABELED IMAGES

Publication number: 20200082210

Abstract: A computer-implemented method for data labeling is provided. The computer-implemented method assigns pseudo-labels to unlabeled examples of data using a similarity metric on an embedding space to produce pseudo-labeled examples. A curriculum learning model is trained using the pseudo-labeled examples. The curriculum learning model trained with the pseudo-labeled examples is employed in in a fine-tuning task to enhance classification accuracy of the data.

Type: Application

Filed: September 7, 2018

Publication date: March 12, 2020

Inventors: Patrick Watson, Bishwaranjan Bhattacharjee, Siyu Huo, Noel C. Codella, Brian M. Belgodere, Parijat Dube, Michael R. Glass, John R. Kender, Matthew L. Hill
Computational Efficiency in Providing a Price Quotation for a Transportation Service

Publication number: 20200042938

Abstract: A method and system of simulating a cost of shipment are provided. A request for quote (RFQ) is received by a computing device from a shipper having a plurality of trade lanes. A revenue generated from each trade lane is estimated and ranked based on the estimated revenue. An original time limit is assigned to each trade lane. A trade lane with a highest ranking that has not yet been selected is selected. For the selected trade lane, graphs for space and time granularity analysis are generated. Space and time granularities that maximize accuracy within the assigned time limit based on the generated graphs are calculated. Cost simulation analysis is performed using the calculated space and time granularities. Upon determining that there are trade lanes not yet selected, there is a return to the act of selecting a trade lane.

Type: Application

Filed: August 1, 2018

Publication date: February 6, 2020

Inventors: Joao P. Goncalves, Parijat Dube
IDENTIFYING TRANSFER MODELS FOR MACHINE LEARNING TASKS

Publication number: 20190354850

Abstract: Techniques regarding autonomously facilitating the selection of one or more transfer models to enhance the performance of one or more machine learning tasks are provided. For example, one or more embodiments described herein can comprise a system, which can comprise a memory that can store computer executable components. The system can also comprise a processor, operably coupled to the memory, and that can execute the computer executable components stored in the memory. The computer executable components can comprise an assessment component that can assess a similarity metric between a source data set and a sample data set from a target machine learning task. The computer executable components can also comprise an identification component that can identify a pre-trained neural network model associated with the source data set based on the similarity metric to perform the target machine learning task.

Type: Application

Filed: May 17, 2018

Publication date: November 21, 2019

Inventors: Patrick Watson, Bishwaranjan Bhattacharjee, Noel Christopher Codella, Brian Michael Belgodere, Parijat Dube, Michael Robert Glass, John Ronald Kender, Siyu Huo, Matthew Leon Hill
ADAPTIVE LEARNING RATE SCHEDULE IN DISTRIBUTED STOCHASTIC GRADIENT DESCENT

Publication number: 20190303787

Abstract: A method for performing machine learning includes assigning processing jobs to a plurality of model learners, using a central parameter server. The processing jobs includes solving gradients based on a current set of parameters. As the results from the processing job are returned, the set of parameters is iterated. A degree of staleness of the solving of the second gradient is determined based on a difference between the set of parameters when the jobs are assigned and the set of parameters when the jobs are returned. The learning rates used to iterate the parameters based on the solved gradients are proportional to the determined degrees of staleness.

Type: Application

Filed: March 28, 2018

Publication date: October 3, 2019

Inventors: PARIJAT DUBE, Sanghamitra Dutta, Gauri Joshi, Priya A. Nagpurkar

1 2 3 4 5 … next