Patents by Inventor Diptikalyan Saha

Diptikalyan Saha has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

GENERATING SURROGATE PROGRAMS USING ACTIVE LEARNING

Publication number: 20240094995

Abstract: A method of providing a surrogate program for a program endpoint includes: obtaining, by a processor set, a set of plural input/output pairs generated using the program endpoint; generating, by the processor set, transformations based on the input/output pairs; generating, by the processor set, a model that classifies inputs of the input/output pairs to ones of the transformations based on parameters of one or more strings of the inputs; receiving, by the processor set, a new input; selecting, by the processor set and using the model, one of the transformations based on parameters of one or more strings of the new input; and generating, by the processor set, a new output by applying the selected one of the transformations to the new input.

Type: Application

Filed: September 20, 2022

Publication date: March 21, 2024

Inventors: Swagatam Haldar, Devika Sondhi, Diptikalyan Saha
Scalable identification of duplicate datasets in heterogeneous datasets

Patent number: 11886385

Abstract: An embodiment for identifying and sorting duplicate datasets within a large pool of heterogeneous datasets may include received a plurality of heterogeneous datasets. The embodiment may automatically compare schema information and metadata within each of the received plurality of heterogeneous datasets to generate name-based similarity scores for each dataset. The embodiment may also automatically compare data distribution information within each of the received plurality of heterogeneous datasets to generate a plurality of data distribution similarity scores for each heterogeneous dataset. The embodiment may further include automatically calculating an overall distance metric using the name-based similarity scores and plurality of data distribution similarity scores. The embodiment may also include based on the calculate overall distance metric, automatically generating distance graphs that identifying clusters of similar datasets and illustrate inferred lineage for the clusters of similar datasets.

Type: Grant

Filed: June 2, 2022

Date of Patent: January 30, 2024

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Praduemn K. Goyal, Sandeep Hans, Samiulla Zakir Hussain Shaikh, Diptikalyan Saha
Data quality-based computations for KPIs derived from time-series data

Patent number: 11860727

Abstract: A system, computer program product, and method are presented for providing replacement data for data in a time series data stream that has issues indicative of errors, where the data issues and the replacement data are related to one or more KPIs. The method includes determining one or more predicted replacement values for potentially erroneous data instances in the time series data stream. The method further includes resolving the potentially erroneous data instances with one predicted replacement value of the one or more predicted replacement values in the time series data stream.

Type: Grant

Filed: March 29, 2022

Date of Patent: January 2, 2024

Assignee: International Business Machines Corporation

Inventors: Vitobha Munigala, Diptikalyan Saha, Sattwati Kundu, Geetha Adinarayan
GLOBAL CONTEXT EXPLAINERS FOR ARTIFICIAL INTELLIGENCE (AI) SYSTEMS USING MULTIVARIATE TIMESERIES DATA

Publication number: 20230419137

Abstract: Provided are techniques for global context explainers for Artificial Intelligence systems using multivariate timeseries data. Predictions for multivariate timeseries data are received. Feature importance weights are generated from the predictions using a feature-based local explainer, where each of the feature importance weights is associated with a time period and a corresponding data source of timeseries data of the multivariate timeseries data. A dataset is generated using the feature importance weights, where the dataset includes, for each time period and the corresponding data source, a label indicating whether the feature importance weight is one of positive and negative. One or more global explanations are generated using the dataset and a directly interpretable rule-based explainer, where the one or more global explanations indicate how the predictions change at particular times in the multivariate timeseries data based on values from the corresponding data source.

Type: Application

Filed: June 24, 2022

Publication date: December 28, 2023

Inventors: Vijay ARYA, Diptikalyan SAHA, Amaresh RAJASEKHARAN, Shengrong TANG
SCALABLE IDENTIFICATION OF DUPLICATE DATASETS IN HETEROGENEOUS DATASETS

Publication number: 20230394011

Abstract: An embodiment for identifying and sorting duplicate datasets within a large pool of heterogeneous datasets may include received a plurality of heterogeneous datasets. The embodiment may automatically compare schema information and metadata within each of the received plurality of heterogeneous datasets to generate name-based similarity scores for each dataset. The embodiment may also automatically compare data distribution information within each of the received plurality of heterogeneous datasets to generate a plurality of data distribution similarity scores for each heterogeneous dataset. The embodiment may further include automatically calculating an overall distance metric using the name-based similarity scores and plurality of data distribution similarity scores. The embodiment may also include based on the calculate overall distance metric, automatically generating distance graphs that identifying clusters of similar datasets and illustrate inferred lineage for the clusters of similar datasets.

Type: Application

Filed: June 2, 2022

Publication date: December 7, 2023

Inventors: Praduemn K. Goyal, Sandeep Hans, Samiulla Zakir Hussain Shaikh, Diptikalyan Saha
Path-coverage directed black box API testing

Patent number: 11768758

Abstract: Methods, systems, and computer program products for path-coverage directed black box application programming interface (API) testing are provided herein. A computer-implemented method includes determining constraints based on inputs and corresponding outputs of an API in a production environment; generating initial test inputs based at least in part on the constraints; creating a program dependency graph based on trace sequences and request-response data obtained in response to providing the initial test inputs to an endpoint of the API; enhancing the program dependency graph by generating additional test inputs directed to one or more paths of the dependency graph; identifying, based on the enhanced program dependency graph, at least a portion of the API that is not covered by an existing test suite; and using the enhanced program dependency graph to generate new test cases for the test suite based on the identifying.

Type: Grant

Filed: October 6, 2021

Date of Patent: September 26, 2023

Assignee: International Business Machines Corporation

Inventors: Diptikalyan Saha, Devika Sondhi, Eitan Daniel Farchi
Automatically modifying responses from generative models using artificial intelligence techniques

Patent number: 11741296

Abstract: Methods, systems, and computer program products for automatically modifying responses from generative models using artificial intelligence techniques are provided herein. A computer-implemented method includes obtaining data pertaining to at least one conversation involving at least one automated conversation exchange software program and at least one user; identifying, among words proposed by the at least one automated conversation exchange software program in connection with the at least one conversation, words qualifying as belonging to one or more predetermined categories by processing the obtained data using artificial intelligence techniques; determining, by processing the identified words and at least one word-based data source, one or more alternate words; modifying at least a portion of the proposed words by replacing at least a portion of the identified words with at least a portion of the one or more alternate words; and performing at least one automated action based on the modifying.

Type: Grant

Filed: February 18, 2021

Date of Patent: August 29, 2023

Assignee: International Business Machines Corporation

Inventors: Nishtha Madaan, Naveen Panwar, Deepak Vijaykeerthy, Pranay Kumar Lohia, Diptikalyan Saha
Post-hoc improvement of instance-level and group-level prediction metrics

Patent number: 11734585

Abstract: A post-processing method, system, and computer program product for post-hoc improvement of instance-level and group-level prediction metrics, including training a bias detector that learns to detect a sample that has an individual bias greater than a predetermined individual bias threshold value with constraints on a group bias, applying the bias detector on a run-time sample to select a biased sample in the run-time sample having a bias greater than the predetermined individual bias threshold bias value, and suggesting a de-biased prediction for the biased sample.

Type: Grant

Filed: December 10, 2018

Date of Patent: August 22, 2023

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Manish Bhide, Pranay Lohia, Karthikeyan Natesan Ramamurthy, Ruchir Puri, Diptikalyan Saha, Kush Raj Varshney
Content modification using natural language processing to include features of interest to various groups

Patent number: 11714963

Abstract: According to one embodiment of the present invention, a system for modifying content associated with an item comprises at least one processor. Features of interest of the item to a plurality of different groups are determined based on user comments produced by members of the plurality of different groups. The members within each group have a common characteristic. The features of interest to each group within the content associated with the item are identified, and the content associated with the item is modified by balancing the features of interest to the plurality of different groups within the content associated with the item. Embodiments of the present invention further include a method and computer program product for modifying content associated with an item in substantially the same manner described above.

Type: Grant

Filed: March 13, 2020

Date of Patent: August 1, 2023

Assignee: International Business Machines Corporation

Inventors: Seema Nagar, Kuntal Dey, Nishtha Madaan, Manish Anand Bhide, Sameep Mehta, Diptikalyan Saha
POST-HOC IMPROVEMENT OF INSTANCE-LEVEL AND GROUP-LEVEL PREDICTION METRICS

Publication number: 20230229943

Abstract: A post-processing method, system, and computer program product for post-hoc improvement of instance-level and group-level prediction metrics, including training a bias detector on a payload data that learns to detect a sample in a customer model that has an individual bias greater than a predetermined individual bias threshold value with constraints on a group bias, suggesting, in the run-time, a de-biased prediction based on the selected biased sample by a de-biasing procedure, and an arbiter decides based on user feedback whether to use the de-biased prediction or an original prediction made prior to the de-biasing procedure from the customer model which is then used as an output.

Type: Application

Filed: March 23, 2023

Publication date: July 20, 2023

Inventors: Manish Bhide, Pranay Lohia, Karthikeyan Natesan Ramamurthy, Ruchir Puri, Diptikalyan Saha, Kush Raj Varshney
AUTOMATICALLY TESTING ARTIFICIAL INTELLIGENCE MODELS IN CONNECTION WITH ENTERPRISE-RELATED PROPERTIES

Publication number: 20230168994

Abstract: Methods, systems, and computer program products for automatically testing AI models in connection with enterprise-related properties are provided herein.

Type: Application

Filed: November 29, 2021

Publication date: June 1, 2023

Inventors: Sandeep Hans, Diptikalyan Saha, Aniya Aggarwal
Automated testing of dialog systems

Patent number: 11645470

Abstract: Methods, systems and computer program products for automated testing of dialog systems are provided herein. A computer-implemented method includes receiving information pertaining to a given conversation workspace of an automated dialog system and identifying test case inputs to the automated dialog system, the test case inputs comprising user input for the given conversation workspace that has portions thereof modified and which the automated dialog system maps to a different intent and/or a different entity relative to the user input. The method further includes generating human-interpretable explanations of mappings of portions of the test case inputs to the different intent and/or entity, generating suggestions for modifying intents, entities and dialog flows of the given conversation workspace such that the test case inputs map to the same intent and/or the same entity as their corresponding user input, and outputting the suggestions and the human-interpretable explanations to a user.

Type: Grant

Filed: December 29, 2020

Date of Patent: May 9, 2023

Assignee: International Business Machines Corporation

Inventors: Arpan Losalka, Diptikalyan Saha
RULE GENERATION FOR MACHINE-LEARNING MODEL DISCRIMINATORY REGIONS

Publication number: 20230125963

Abstract: One embodiment provides a method, including: providing, from the central server to a machine-learning model, a training set of samples having known values for at least one target protected attribute, wherein the training set includes a first set of samples having a first value for the at least one target protected attribute and a second set of samples having a second value for the at least one target protected attribute; receiving, at the central server from the machine-learning model, an output classification for each of the samples within the training set of samples; and generating, at the central server using the output classification, a set of rules delineating a region within the machine-learning model as discriminatory, wherein the region includes a classification region where the machine-learning model classifies received samples differently based upon a value of the at least one protected attribute.

Type: Application

Filed: October 25, 2021

Publication date: April 27, 2023

Inventors: Diptikalyan Saha, Swagatam Haldar, Swastik Haldar
User explanation guided machine learning

Patent number: 11636331

Abstract: Methods, systems, and computer program products for active explanation guided learning are provided herein. A computer-implemented method includes identifying a subset of training examples, from a set of training examples, based on at least one of (i) an uncertainty metric computed for each one of the training examples and (ii) an influence metric computed for each one of the training examples; outputting said subset of training examples to a user; obtaining, from the user, a user explanation for each training example in said subset of training examples, wherein each of the user explanations identifies at least one part of the corresponding training example; and training a machine learning model based at least in part on the user explanations, wherein said training comprises prioritizing the identified parts of the training examples in the subset.

Type: Grant

Filed: July 9, 2019

Date of Patent: April 25, 2023

Assignee: International Business Machines Corporation

Inventors: Deepak Vijaykeerthy, Philips George John, Diptikalyan Saha
Determining data representative of bias within a model

Patent number: 11636386

Abstract: Methods, systems, and computer program products for determining data representative of bias within a model are provided herein. A computer-implemented method includes obtaining a first dataset on which a model was trained, wherein the first dataset contains protected attributes, and a second dataset on which the model was trained, wherein the protected attributes have been removed from the second dataset; identifying, for each of the one or more protected attributes in the first dataset, one or more attributes in the second dataset correlated therewith; determining bias among at least a portion of the identified correlated attributes; and outputting, to at least one user, identifying information pertaining to the one or more instances of bias.

Type: Grant

Filed: November 21, 2019

Date of Patent: April 25, 2023

Assignee: International Business Machines Corporation

Inventors: Pranay Kumar Lohia, Diptikalyan Saha, Manish Anand Bhide, Sameep Mehta
Path-Coverage Directed Black Box API Testing

Publication number: 20230106929

Abstract: Methods, systems, and computer program products for path-coverage directed black box application programming interface (API) testing are provided herein. A computer-implemented method includes determining constraints based on inputs and corresponding outputs of an API in a production environment; generating initial test inputs based at least in part on the constraints; creating a program dependency graph based on trace sequences and request-response data obtained in response to providing the initial test inputs to an endpoint of the API; enhancing the program dependency graph by generating additional test inputs directed to one or more paths of the dependency graph; identifying, based on the enhanced program dependency graph, at least a portion of the API that is not covered by an existing test suite; and using the enhanced program dependency graph to generate new test cases for the test suite based on the identifying.

Type: Application

Filed: October 6, 2021

Publication date: April 6, 2023

Inventors: Diptikalyan Saha, Devika Sondhi, Eitan Daniel Farchi
REALISTIC COUNTERFACTUAL EXPLANATION OF MACHINE LEARNING PREDICTIONS

Publication number: 20230025731

Abstract: A computer-implemented method comprising, automatically: analyzing a machine learning dataset which comprises multiple datapoints, to deduce constraints on features of the datapoints; generating a first set of CSP (Constraint Satisfaction Problem) rules expressing the constraints; based on a machine learning model which was trained on the dataset, generating a second set of CSP rules that define one or more perturbation candidates among the features of one of the datapoints; formulating a CSP based on the first and second sets of CSP rules; solving the formulated CSP using a solver; and using the solution of the CSP as a counterfactual explanation of a prediction made by the machine learning model with respect to the one datapoint.

Type: Application

Filed: July 19, 2021

Publication date: January 26, 2023

Inventors: Michael Vinov, Oleg Blinder, Diptikalyan Saha, Sandeep Hans, Aniya Aggarwal, Omer Yehuda Boehm, Eyal Bin
Testing bias checkers

Patent number: 11556747

Abstract: One embodiment provides a method, including: receiving a dataset and a model corresponding to a bias checker, wherein the bias checker detects bias within both the dataset and the model, based upon a bias checking algorithm and a bias checking policy, wherein the dataset comprises a plurality of attributes; testing the bias checking algorithm of the bias checker by (i) generating test cases that modify the dataset by introducing bias therein and (ii) running the bias checker against the modified dataset; testing the bias checking policy of the bias checker by generating a plurality of test cases and running the bias checker against the plurality of test cases; and providing a notification to a user regarding whether the bias checker failed to indicate bias for one or more of the plurality of attributes.

Type: Grant

Filed: March 26, 2019

Date of Patent: January 17, 2023

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Kuntal Dey, Diptikalyan Saha, Deepak Vijaykeerthy, Pranay Kumar Lohia
Generating data migration plan for in-place encryption of data

Patent number: 11537724

Abstract: Methods, systems, and computer program products for generating a data migration plan for in-place encryption of data are provided herein. A computer-implemented method includes receiving, from a user, a request to generate a migration plan for performing in-place encryption of data within a database, wherein the migration plan indicates periods of time in which portions of the data are to be encrypted; determining a set of constraints for performing the in-place encryption; generating the migration plan based at least in part on the set of constraints; and performing the in-place encryption of the data in accordance with the migration plan such that only a single copy of each of the portions is maintained during the in-place encryption, wherein the single copy comprises one of a plaintext copy of data corresponding to the portion, and an encrypted copy of data corresponding to the portion.

Type: Grant

Filed: March 26, 2019

Date of Patent: December 27, 2022

Assignee: International Business Machines Corporation

Inventors: Akshar Kaul, Diptikalyan Saha, Gagandeep Singh, Manish Kesarwani
Generating explanations for context aware sequence-to-sequence models

Patent number: 11521065

Abstract: Methods, systems, and computer program products for generating explanations for a semantic parser are provided herein. A computer-implemented method includes providing to a generative model (i) at least one query and (ii) a context of at least one dataset applicable to the at least one query, wherein the generative model generates a plurality of perturbations for the at least one input query based on the context; providing the plurality of perturbations as inputs to a context aware sequence-to-sequence model, thereby obtaining a plurality of outputs; and generating, for (i) an additional query provided as input to the context aware sequence-to-sequence model and (ii) a context applicable to the additional query, an explanation indicative of one or more parts of the additional query that contributes to an output corresponding to the additional query, based at least in part on the plurality of outputs corresponding to the perturbations.

Type: Grant

Filed: February 6, 2020

Date of Patent: December 6, 2022

Assignee: International Business Machines Corporation

Inventors: Rachamalla Anirudh Reddy, Pranay Kumar Lohia, Samiulla Zakir Hussain Shaikh, Diptikalyan Saha, Sameep Mehta

1 2 3 4 5 next