Patents by Inventor Amit Dhurandhar

Amit Dhurandhar has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Simple models using confidence profiles

Patent number: 11972344

Abstract: A method, system, and computer program product, including generating, using a linear probe, confidence scores through flattened intermediate representations and theoretically-justified weighting of samples during a training of the simple model using the confidence scores of the intermediate representations.

Type: Grant

Filed: November 28, 2018

Date of Patent: April 30, 2024

Assignee: International Business Machines Corporation

Inventors: Amit Dhurandhar, Karthikeyan Shanmugam, Ronny Luss, Peder Andreas Olsen
GENERATING LOCALLY INVARIANT EXPLANATIONS FOR MACHINE LEARNING

Publication number: 20240135239

Abstract: Techniques for generating explanations for machine learning (ML) are disclosed. These techniques include identifying an ML model, an output from the ML model, and a plurality of constraints, and generating a plurality of neighborhoods relating to the ML model based on the plurality of constraints. The techniques further include generating a predictor for each of the plurality of neighborhoods using the ML model and the plurality of constraints, constructing a combined predictor based on combining each of the respective predictors for the plurality of neighborhoods, and creating one or more explanations relating to the ML model and the output from the ML model using the combined predictor.

Type: Application

Filed: October 19, 2022

Publication date: April 25, 2024

Inventors: Amit Dhurandhar, Karthikeyan Natesan Ramamurthy, Kartik Ahuja, Vijay Arya
SUFFICIENCY ASSESSMENT OF MACHINE LEARNING MODELS THROUGH MAXIMUM DEVIATION

Publication number: 20240095575

Abstract: Techniques regarding determining sufficiency of one or more machine learning models are provided. For example, one or more embodiments described herein can comprise a system, which can comprise a memory that can store computer executable components. The system can also comprise a processor, operably coupled to the memory, and that can execute the computer executable components stored in memory. The computer executable components can comprise a measurement component that measures maximum deviation of a supervised learning model from a reference model over a certification set and an analysis component that determines sufficiency of the supervised learning model based at least in part on the maximum deviation.

Type: Application

Filed: September 13, 2022

Publication date: March 21, 2024

Inventors: Dennis Wei, Rahul Nair, Amit Dhurandhar, Kush Raj Varshney, Elizabeth Daly, Moninder Singh, Michael Hind
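
Illustrative sketch (not taken from the filing): the measurement described in the abstract of publication 20240095575 can be pictured as finding the largest gap between a model and a reference model over a certification set, then comparing that gap with a tolerance. The function names, the absolute-difference metric, the finite list of certification points, and the `tolerance` threshold are all assumptions made for this example.

```python
from typing import Callable, Iterable, Sequence, Tuple

Point = Sequence[float]
Model = Callable[[Point], float]


def max_deviation(model: Model, reference: Model,
                  certification_set: Iterable[Point]) -> Tuple[float, Point]:
    """Return the largest |model(x) - reference(x)| and the point attaining it."""
    worst_gap, worst_point = float("-inf"), None
    for x in certification_set:
        gap = abs(model(x) - reference(x))
        if gap > worst_gap:
            worst_gap, worst_point = gap, x
    if worst_point is None:
        raise ValueError("certification set is empty")
    return worst_gap, worst_point


def is_sufficient(model: Model, reference: Model,
                  certification_set: Iterable[Point], tolerance: float) -> bool:
    """Assumed decision rule: sufficient when the maximum deviation is within tolerance."""
    gap, _ = max_deviation(model, reference, certification_set)
    return gap <= tolerance


# Hypothetical toy models: the candidate adds a term the reference lacks.
def reference(x: Point) -> float:
    return 2.0 * x[0] + 1.0


def candidate(x: Point) -> float:
    return 2.0 * x[0] + 1.0 + 0.5 * x[1]


grid = [(0.0, 0.0), (1.0, 0.5), (2.0, 1.0)]
gap, where = max_deviation(candidate, reference, grid)
```

On this toy grid the largest gap occurs at the last point, where the extra term contributes most. A real certification set would typically be a region of the input space rather than a short list, which is an aspect this sketch does not attempt to cover.
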
Initializing optimization solvers

Patent number: 11915131

Abstract: An approach to improve the efficiency of solving problem instances by utilizing a machine learning model to solve a sequential optimization problem. Embodiments of the present invention receive a sequential optimization problem for solving and utilize a random initialization to solve a first instance of the sequential optimization problem. Embodiments of the present invention learn, by a computing device, a machine learning model based on a previously stored solution to the first instance of the sequential optimization problem. Additionally, embodiments of the present invention generate, by the machine learning model, one or more subsequent approximate solutions to the sequential optimization problem; and output, by a user interface on the computing device, the one or more subsequent approximate solutions to the sequential optimization problem.

Type: Grant

Filed: November 23, 2020

Date of Patent: February 27, 2024

Assignee: International Business Machines Corporation

Inventors: Kartik Ahuja, Amit Dhurandhar, Karthikeyan Shanmugam, Kush Raj Varshney
MULTIPLE STAGE KNOWLEDGE TRANSFER

Publication number: 20230419103

Abstract: An input model can be received, along with a set of requirements. The set of requirements may describe an output model to be trained. The output model can then be trained. The training of the output model can be based on the input model and based further on at least one intermediate model.

Type: Application

Filed: June 27, 2022

Publication date: December 28, 2023

Inventors: Amit Dhurandhar, Tejaswini Pedapati
UNDERSTANDING REINFORCEMENT LEARNING POLICIES BY IDENTIFYING STRATEGIC STATES

Publication number: 20230419097

Abstract: One or more computer processors compute a maximum likelihood path matrix comprising a respective shortest path between each state in a set of states associated with a model trained with a deep reinforcement learning policy. The one or more computer processors generate explanations for the deep reinforcement learning policy based on one or more identified meta-states for each state in the set of states and corresponding selected strategic states utilizing the computed maximum likelihood path matrix.

Type: Application

Filed: June 22, 2022

Publication date: December 28, 2023

Inventors: Ronny Luss, Amit Dhurandhar, Miao Liu
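
Illustrative sketch (not taken from the filing): one way to picture the "maximum likelihood path matrix" in the abstract of publication 20230419097 is as an all-pairs shortest-path computation in which each transition costs the negative log of its probability, so the cheapest path is the most probable one. The use of a known transition-probability matrix, the Floyd-Warshall algorithm, and the function name `max_likelihood_paths` are assumptions made for this example.

```python
import math
from typing import List


def max_likelihood_paths(transition: List[List[float]]) -> List[List[float]]:
    """Return a matrix whose (i, j) entry is the probability of the most likely path i -> j."""
    n = len(transition)
    # Edge cost is -log(probability); impossible transitions cost infinity.
    cost = [[0.0 if i == j else (-math.log(p) if p > 0 else math.inf)
             for j, p in enumerate(row)]
            for i, row in enumerate(transition)]
    # Floyd-Warshall: allow each state k in turn as an intermediate stop.
    for k in range(n):
        for i in range(n):
            for j in range(n):
                via_k = cost[i][k] + cost[k][j]
                if via_k < cost[i][j]:
                    cost[i][j] = via_k
    # Convert summed costs back to path probabilities.
    return [[math.exp(-c) for c in row] for row in cost]


# Hypothetical three-state transition matrix (row = current state).
transition = [
    [0.0, 0.9, 0.1],
    [0.0, 0.0, 0.8],
    [1.0, 0.0, 0.0],
]
best = max_likelihood_paths(transition)
```

In this toy matrix the direct transition from state 0 to state 2 has probability 0.1, while the detour through state 1 has probability 0.9 × 0.8, so the detour is the path recorded in `best[0][2]`. How meta-states and strategic states are then selected from such a matrix is not covered by this sketch.
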
SYSTEM AND METHOD FOR GENERATING CONTRASTIVE EXPLANATIONS FOR TEXT GUIDED BY ATTRIBUTES

Publication number: 20230409832

Abstract: A method, computer program product, and system are provided to generate perturbed text. A processor receives a string of text from a user. A processor determines one or more classifications for at least one word in the string of text by a classification model. A processor determines a plurality of perturbations of the at least one word based on the one or more classifications, where the plurality of perturbations do not share the same one or more classifications as the at least one word in the string of text. A processor selects a perturbation of the string of text based on (i) an edit distance between the string of text and the plurality of perturbations, and (ii) a fluency metric for each of the plurality of perturbations. A processor provides the perturbation of the string of text to the user.

Type: Application

Filed: June 16, 2022

Publication date: December 21, 2023

Inventors: Saneem Ahmed Chemmengath, Amar Prakash Azad, Ronny Luss, Amit Dhurandhar
INTERPRETABLE NEURAL NETWORK ARCHITECTURE USING CONTINUED FRACTIONS

Publication number: 20230401438

Abstract: A method, a neural network, and a computer program product are provided that provide training of neural networks with continued fractions architectures. The method includes receiving, as input to a neural network, input data and training the input data through a plurality of continued fractions layers of the neural network to generate output data. The input data is provided to each of the continued fractions layers as well as output data from a previous layer. The method further includes outputting, from the neural network, the output data. Each continued fractions layer of the continued fractions layers is configured to calculate one or more linear functions of its respective input and to generate an output that is used as the input for a subsequent continued fractions layer.

Type: Application

Filed: June 9, 2022

Publication date: December 14, 2023

Inventors: Isha Puri, Amit Dhurandhar, Tejaswini Pedapati, Karthikeyan Shanmugam, Dennis Wei, Kush Raj Varshney
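
Illustrative sketch (not taken from the filing): the layer structure described in the abstract of publication 20230401438 resembles evaluating a continued fraction a_0(x) + 1/(a_1(x) + 1/(a_2(x) + ...)), where each a_k is a linear function of the original input and each step also consumes the previous step's output. The single-ladder layout, the fixed (untrained) weights, the helper names, and the `eps` guard on the reciprocal are assumptions made for this example.

```python
from typing import List, Sequence, Tuple

Linear = Tuple[Sequence[float], float]  # (weights, bias) of one linear function


def linear(params: Linear, x: Sequence[float]) -> float:
    """Compute weights . x + bias."""
    weights, bias = params
    return sum(w * xi for w, xi in zip(weights, x)) + bias


def safe_reciprocal(z: float, eps: float = 1e-6) -> float:
    """Return 1/z with |z| kept at least eps (assumed guard against division by zero)."""
    if abs(z) < eps:
        z = eps if z >= 0 else -eps
    return 1.0 / z


def continued_fraction_ladder(layers: List[Linear], x: Sequence[float]) -> float:
    """Evaluate a_0(x) + 1/(a_1(x) + 1/(... + 1/a_d(x))), starting from the deepest layer."""
    if not layers:
        raise ValueError("at least one layer is required")
    out = linear(layers[-1], x)
    for params in reversed(layers[:-1]):
        # Each step sees the raw input x and the previous step's output.
        out = linear(params, x) + safe_reciprocal(out)
    return out


# Hypothetical weights: a_0(x) = x[0], a_1(x) = x[1], a_2(x) = 2.
layers = [([1.0, 0.0], 0.0), ([0.0, 1.0], 0.0), ([0.0, 0.0], 2.0)]
value = continued_fraction_ladder(layers, [3.0, 4.0])
```

With these weights the computation is 3 + 1/(4 + 1/2). Training the weights, and combining several such ladders, is outside the scope of this sketch.
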
PROVIDING AI EXPLANATIONS BASED ON TASK CONTEXT

Publication number: 20230289632

Abstract: A method, computer program, and computer system are provided for providing artificial intelligence explanations. An explanation request corresponding to an output or a behavior of an artificial intelligence system is received from a user. A context or user profile associated with the user is identified. A plurality of explanation methods corresponding to the artificial intelligence system is accessed. Each explanation method provides an independent explanation for the output or the behavior of the artificial intelligence system and is rated based on a set of explanation evaluation criteria corresponding to the context or user profile. An explanation method having a highest rating is selected from among the plurality of explanation methods, and an explanation of the output or the behavior of the artificial intelligence system corresponding to the selected explanation method is provided to the user.

Type: Application

Filed: March 11, 2022

Publication date: September 14, 2023

Inventors: Vera Liao, Yunfeng Zhang, Jorge Andres Moros Ortiz, Amit Dhurandhar, Ronny Luss
Contrastive explanations for images with monotonic attribute functions

Patent number: 11640532

Abstract: In an embodiment, a method for generating contrastive information for a classifier prediction comprises receiving image data representative of an input image, using a deep learning classifier model to predict a first classification for the input image, evaluating the input image using a plurality of classifier functions corresponding to respective high-level features to identify one or more of the high-level features absent from the input image, and identifying, from among the high-level features absent from the input image, a pertinent-negative feature that, if added to the input image, will result in the deep learning classifier model predicting a second classification for the modified input image, the second classification being different from the first classification. In an embodiment, the method includes creating a pertinent-positive image that is a modified version of the input image that has the first classification and fewer than all superpixels of the input image.

Type: Grant

Filed: December 3, 2021

Date of Patent: May 2, 2023

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Ronny Luss, Pin-Yu Chen, Amit Dhurandhar, Prasanna Sattigeri, Karthikeyan Shanmugam
Leveraging simple model predictions for enhancing computational performance

Patent number: 11586917

Abstract: A computer-implemented method, system, and non-transitory computer-readable storage medium for enhancing performance of a first model. The first model is trained with a training data set. A second model receives the training data set associated with the first model. The second model provides the first model with a hardness value associated with prediction of each data point of the training data set. The first model determines a confidence value regarding predicting each data point based on the training data set, and determines a ratio of the hardness value of a prediction of each data point by the second model with respect to the confidence value of the first model. The first model is retrained with a re-weighted training data set when the determined ratio is lower than a threshold value.

Type: Grant

Filed: April 29, 2020

Date of Patent: February 21, 2023

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Amit Dhurandhar, Karthikeyan Shanmugam, Ronny Luss
POST-HOC LOCAL EXPLANATIONS OF BLACK BOX SIMILARITY MODELS

Publication number: 20220391631

Abstract: Define a similarity measure between first and second points in a data space by operation of a machine learning model. Generate interpretable representations of the first and second points. Generate an interpretable local description of the similarity measure by approximating the similarity measure as a distance between the interpretable representations of the first and second points. The distance between the interpretable representations incorporates a matrix. Learn values for the matrix through optimizing a loss function evaluated on perturbations of the first and second points. Explain a value of the similarity measure between the first and second points using elements of the matrix. Assess the explanation of the value of the similarity measure using a rubric. In response to the assessment of the explanation of the value of the similarity measure, modify the machine learning model. Deploy the modified machine learning model.

Type: Application

Filed: July 21, 2021

Publication date: December 8, 2022

Inventors: Zaid Bin Tariq, Karthikeyan Natesan Ramamurthy, Dennis Wei, Amit Dhurandhar
Model agnostic contrastive explanations for structured data

Patent number: 11507787

Abstract: A method, system, and computer program product, including generating a contrastive explanation for a decision of a classifier trained on structured data, highlighting an important feature that justifies the decision, and determining a minimal set of new values for features that alter the decision.

Type: Grant

Filed: December 12, 2018

Date of Patent: November 22, 2022

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Amit Dhurandhar, Pin-Yu Chen, Karthikeyan Shanmugam, Tejaswini Pedapati, Avinash Balakrishnan, Ruchir Puri
Diagnosing anomalies detected by black-box machine learning models

Patent number: 11487650

Abstract: A computer-implemented method, a computer program product, and a computer system for diagnosing anomalies detected by a black-box machine learning model. A computer determines a local variance of a test sample in a test dataset, where the local variance represents uncertainty of a prediction by the black-box machine learning model. The computer initializes optimal compensations for the test sample, where the optimal compensations are optimal perturbations to test sample values of respective components of a multivariate input variable. The computer determines local gradients for the test sample. Based on the local variance and the local gradients, the computer updates the optimal compensations until convergences of the optimal compensations are reached. Using the optimal compensations, the computer diagnoses the anomalies detected by the black-box machine learning model.

Type: Grant

Filed: May 22, 2020

Date of Patent: November 1, 2022

Assignee: International Business Machines Corporation

Inventors: Tsuyoshi Ide, Amit Dhurandhar, Jiri Navratil, Naoki Abe, Moninder Singh
PATH-SUFFICIENT EXPLANATIONS FOR MODEL UNDERSTANDING

Publication number: 20220188666

Abstract: An approach to generate a path for minimally sufficient explanations for improving model understanding. Data is received from a user. The data is iteratively processed to generate minimally sufficient explanations based on the input data and the input of a subsequent explanation determination is constrained to the output of a prior explanation determination.

Type: Application

Filed: December 15, 2020

Publication date: June 16, 2022

Inventors: Ronny Luss, Amit Dhurandhar
LEARNING ROBUST PREDICTORS USING GAME THEORY

Publication number: 20220180254

Abstract: A method, computer system, and a computer program product for invariant risk minimization games is provided. The present invention may include defining a plurality of environment-specific classifiers corresponding to a plurality of environments. The present invention may also include constructing an ensemble classifier associated with the plurality of environment-specific classifiers. The present invention may further include initiating a game including a plurality of players corresponding to the plurality of environments. The present invention may also include calculating a Nash equilibrium of the initiated game. The present invention may further include determining an ensemble predictor based on the calculated Nash equilibrium. The present invention may include deploying the determined ensemble predictor associated with the calculated Nash equilibrium to make predictions in a new environment.

Type: Application

Filed: December 8, 2020

Publication date: June 9, 2022

Inventors: Kartik Ahuja, Karthikeyan Shanmugam, Kush Raj Varshney, Amit Dhurandhar
INITIALIZING OPTIMIZATION SOLVERS

Publication number: 20220164644

Abstract: An approach to improve the efficiency of solving problem instances by utilizing a machine learning model to solve a sequential optimization problem. Embodiments of the present invention receive a sequential optimization problem for solving and utilize a random initialization to solve a first instance of the sequential optimization problem. Embodiments of the present invention learn, by a computing device, a machine learning model based on a previously stored solution to the first instance of the sequential optimization problem. Additionally, embodiments of the present invention generate, by the machine learning model, one or more subsequent approximate solutions to the sequential optimization problem; and output, by a user interface on the computing device, the one or more subsequent approximate solutions to the sequential optimization problem.

Type: Application

Filed: November 23, 2020

Publication date: May 26, 2022

Inventors: Kartik Ahuja, Amit Dhurandhar, Karthikeyan Shanmugam, Kush Raj Varshney
CONTRASTIVE EXPLANATIONS FOR IMAGES WITH MONOTONIC ATTRIBUTE FUNCTIONS

Publication number: 20220092360

Abstract: In an embodiment, a method for generating contrastive information for a classifier prediction comprises receiving image data representative of an input image, using a deep learning classifier model to predict a first classification for the input image, evaluating the input image using a plurality of classifier functions corresponding to respective high-level features to identify one or more of the high-level features absent from the input image, and identifying, from among the high-level features absent from the input image, a pertinent-negative feature that, if added to the input image, will result in the deep learning classifier model predicting a second classification for the modified input image, the second classification being different from the first classification. In an embodiment, the method includes creating a pertinent-positive image that is a modified version of the input image that has the first classification and fewer than all superpixels of the input image.

Type: Application

Filed: December 3, 2021

Publication date: March 24, 2022

Applicant: International Business Machines Corporation

Inventors: Ronny Luss, Pin-Yu Chen, Amit Dhurandhar, Prasanna Sattigeri, Karthikeyan Shanmugam
Contrastive explanations for images with monotonic attribute functions

Patent number: 11222242

Abstract: In an embodiment, a method for generating contrastive information for a classifier prediction comprises receiving image data representative of an input image, using a deep learning classifier model to predict a first classification for the input image, evaluating the input image using a plurality of classifier functions corresponding to respective high-level features to identify one or more of the high-level features absent from the input image, and identifying, from among the high-level features absent from the input image, a pertinent-negative feature that, if added to the input image, will result in the deep learning classifier model predicting a second classification for the modified input image, the second classification being different from the first classification. In an embodiment, the method includes creating a pertinent-positive image that is a modified version of the input image that has the first classification and fewer than all superpixels of the input image.

Type: Grant

Filed: August 23, 2019

Date of Patent: January 11, 2022

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Ronny Luss, Pin-Yu Chen, Amit Dhurandhar, Prasanna Sattigeri, Karthikeyan Shanmugam
DIAGNOSING ANOMALIES DETECTED BY BLACK-BOX MACHINE LEARNING MODELS

Publication number: 20210365358

Abstract: A computer-implemented method, a computer program product, and a computer system for diagnosing anomalies detected by a black-box machine learning model. A computer determines a local variance of a test sample in a test dataset, where the local variance represents uncertainty of a prediction by the black-box machine learning model. The computer initializes optimal compensations for the test sample, where the optimal compensations are optimal perturbations to test sample values of respective components of a multivariate input variable. The computer determines local gradients for the test sample. Based on the local variance and the local gradients, the computer updates the optimal compensations until convergences of the optimal compensations are reached. Using the optimal compensations, the computer diagnoses the anomalies detected by the black-box machine learning model.

Type: Application

Filed: May 22, 2020

Publication date: November 25, 2021

Inventors: Tsuyoshi Ide, Amit Dhurandhar, Jiri Navratil, Naoki Abe, Moninder Singh
