Patents by Inventor Diptikalyan Saha

Diptikalyan Saha has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

LOCALIZATION-BASED TEST GENERATION FOR INDIVIDUAL FAIRNESS TESTING OF ARTIFICIAL INTELLIGENCE MODELS

Publication number: 20220343179

Abstract: Methods, systems, and computer program products for localization-based test generation for individual fairness testing of AI models are provided herein. A computer-implemented method includes obtaining at least one artificial intelligence model and training data related to the at least one artificial intelligence model; identifying one or more boundary regions associated with the at least one artificial intelligence model based at least in part on results of processing at least a portion of the training data using the at least one artificial model; generating, in accordance with at least one of the one or more identified boundary regions, one or more synthetic data points for inclusion with the training data; and executing one or more fairness tests on the at least one artificial intelligence model using at least a portion of the one or more generated synthetic data points and at least a portion of the training data.

Type: Application

Filed: April 26, 2021

Publication date: October 27, 2022

Inventors: Diptikalyan Saha, Aniya Aggarwal, Sandeep Hans
DETECTING CONTEXTUAL BIAS IN TEXT

Publication number: 20220335217

Abstract: Methods, systems, and computer program products for detecting contextual bias in text are provided herein. A computer-implemented method includes identifying, by a machine learning network, a protected attribute in one or more data samples; processing the identified data samples using a first sub-network of the machine learning network, wherein the first sub-network is configured to determine a plurality of contexts of the protected attribute across the identified data samples; determining an impact of each of the plurality of contexts on a second sub-network of the machine learning network, wherein the second sub-network of the machine learning network is configured to classify a given data sample into one of a plurality of classes; and adjusting the second sub-network of the machine learning to account for the impact of at least one of the plurality of contexts on the second sub-network.

Type: Application

Filed: April 19, 2021

Publication date: October 20, 2022

Inventors: Naveen Panwar, Nishtha Madaan, Deepak Vijaykeerthy, Pranay Kumar Lohia, Diptikalyan Saha
Bias source identification and de-biasing of a dataset

Patent number: 11475331

Abstract: A source of bias identification (SoBI) tool is provided that identifies sources of bias in a dataset. A bias detection operation is performed on results of a computer model, based on an input dataset, to generate groupings of values for a protected attribute corresponding to a detected bias in the operation of the computer model. The SoBI tool generates a plurality of sub-groups for each grouping of values. Each sub-group comprises an individual value, or a sub-range, for the protected attribute. The SoBI tool analyzes each of the sub-groups in the plurality of sub-groups, based on at least one source of bias identification criterion, to identify one or more sources of bias in the input dataset. The SoBI tool outputs a bias notification to an authorized computing device specifying the one or more sources of bias in the input dataset.

Type: Grant

Filed: June 25, 2020

Date of Patent: October 18, 2022

Assignee: International Business Machines Corporation

Inventors: Manish Anand Bhide, Pranay Kumar Lohia, Diptikalyan Saha, Madhavi Katari
Trustworthiness of artificial intelligence models in presence of anomalous data

Patent number: 11455554

Abstract: Methods, systems, and computer program products for improving trustworthiness of artificial intelligence models in presence of anomalous data are provided herein. A method includes obtaining a machine learning model and a set of training data; determining one or more anomalous data points in said set of training data; for a given one of said anomalous data points, identifying attributes that decrease confidence with respect to at least one output of said machine learning model; determining that a root cause of said decreased confidence corresponds to one of: a class imbalance issue related to said at least one attribute, a confused class issue related to said at least one attribute, a low density issue related to said at least one attribute, and an adversarial issue related to said at least one attribute; and performing step(s) to improve said confidence based at least in part on said determined root cause.

Type: Grant

Filed: November 25, 2019

Date of Patent: September 27, 2022

Assignee: International Business Machines Corporation

Inventors: Pranay Kumar Lohia, Diptikalyan Saha, Aniya Aggarwal, Gagandeep Singh, Rema Ananthanarayanan, Samiulla Zakir Hussain Shaikh, Sandeep Hans
AUTOMATICALLY MODIFYING RESPONSES FROM GENERATIVE MODELS USING ARTIFICIAL INTELLIGENCE TECHNIQUES

Publication number: 20220261535

Abstract: Methods, systems, and computer program products for automatically modifying responses from generative models using artificial intelligence techniques are provided herein. A computer-implemented method includes obtaining data pertaining to at least one conversation involving at least one automated conversation exchange software program and at least one user; identifying, among words proposed by the at least one automated conversation exchange software program in connection with the at least one conversation, words qualifying as belonging to one or more predetermined categories by processing the obtained data using artificial intelligence techniques; determining, by processing the identified words and at least one word-based data source, one or more alternate words; modifying at least a portion of the proposed words by replacing at least a portion of the identified words with at least a portion of the one or more alternate words; and performing at least one automated action based on the modifying.

Type: Application

Filed: February 18, 2021

Publication date: August 18, 2022

Inventors: Nishtha Madaan, Naveen Panwar, Deepak Vijaykeerthy, Pranay Kumar Lohia, Diptikalyan Saha
PRIORITY-BASED, ACCURACY-CONTROLLED INDIVIDUAL FAIRNESS OF UNSTRUCTURED TEXT

Publication number: 20220237415

Abstract: Methods, systems, and computer program products for priority-based, accuracy-controlled individual fairness of unstructured text are provided herein. A method includes identifying one or more samples in a set of data used to train a machine learning model having at least one attribute; generating counterfactual samples for each of the one or more identified samples; calculating scores for the one or more identified samples based at least in part on output of the machine learning model with respect to the counterfactual samples, wherein the scores indicate a relative level of bias between the one or more identified samples corresponding to the at least one attribute; creating an enhanced set of data at least in part by supplementing at least a portion of the identified samples with the corresponding counterfactual samples based on the calculated scores; and training the machine learning model using the enhanced set of data.

Type: Application

Filed: January 28, 2021

Publication date: July 28, 2022

Inventors: Pranay Kumar Lohia, Deepak Vijaykeerthy, Diptikalyan Saha, Nishtha Madaan, Naveen Panwar
DATA QUALITY-BASED COMPUTATIONS FOR KPIS DERIVED FROM TIME-SERIES DATA

Publication number: 20220237074

Abstract: A system, computer program product, and method are presented for providing replacement data for data in a time series data stream that has issues indicative of errors, where the data issues and the replacement data are related to one or more KPIs. The method includes determining one or more predicted replacement values for potentially erroneous data instances in the time series data stream. The method further includes resolving the potentially erroneous data instances with one predicted replacement value of the one or more predicted replacement values in the time series data stream.

Type: Application

Filed: March 29, 2022

Publication date: July 28, 2022

Inventors: Vitobha Munigala, Diptikalyan Saha, Sattwati Kundu, Geetha Adinarayan
Automated test case generation for deep neural networks and other model-based artificial intelligence systems

Patent number: 11379347

Abstract: Methods, systems and computer program products for automated test case generation are provided herein. A computer-implemented method includes selecting sample input data as a test case for a system under test, executing the test case on the system under test to obtain a result, and applying the result to a local explainer function to obtain at least a portion of a corresponding decision tree. The method further includes determining at least one path constraint from the decision tree, solving the path constraint to obtain a solution, and generating at least one other test case for the system under test based at least in part on the solution of the path constraint. The steps of the method are illustratively repeated in each of one or more additional iterations until at least one designated stopping criterion is met. The resulting test cases form a test suite for testing of a deep neural network (DNN) or other system.

Type: Grant

Filed: December 28, 2020

Date of Patent: July 5, 2022

Assignee: International Business Machines Corporation

Inventors: Diptikalyan Saha, Aniya Aggarwal, Pranay Lohia, Kuntal Dey
Domain aware explainable anomaly and drift detection for multi-variate raw data using a constraint repository

Patent number: 11321304

Abstract: Methods, systems, and computer program products for domain aware explainable anomaly and drift detection for multi-variate raw data using a constraint repository are provided herein. A computer-implemented method includes obtaining a set of data and information indicative of a domain of said set of data; obtaining constraints from a domain-indexed constraint repository based on said set of data and said information, wherein the domain-indexed constraint repository comprises a knowledge graph having a plurality of nodes, wherein each node comprises an attribute associated with at least one of a plurality of domains and constraints corresponding to the attribute; detecting anomalies in said set of data based on whether portions of said set of data violate said retrieved constraints; generating an explanation corresponding to each of the anomalies that describe the attributes corresponding to the violated constraints; and outputting an indication of the anomalies and the corresponding explanation.

Type: Grant

Filed: September 27, 2019

Date of Patent: May 3, 2022

Assignee: International Business Machines Corporation

Inventors: Sandeep Hans, Samiulla Zakir Hussain Shaikh, Rema Ananthanarayanan, Diptikalyan Saha, Aniya Aggarwal, Gagandeep Singh, Pranay Kumar Lohia, Manish Anand Bhide, Sameep Mehta
Data quality-based confidence computations for KPIs derived from time-series data

Patent number: 11314584

Abstract: A system, computer program product, and method are presented for providing confidence values for replacement data for data that has issues indicative of errors, where the data issues, the replacement data, and confidence values are related to one or more KPIs. The method includes identifying one or more potentially erroneous data instances and determining one or more predicted replacement values for the potentially erroneous data instances. The method further includes determining a confidence value for each predicted replacement value and resolving the one or more potentially erroneous data instances with one predicted replacement value of the one or more predicted replacement values. The method also includes generating an explanatory basis for the resolution of the one or more potentially erroneous data instances.

Type: Grant

Filed: November 25, 2020

Date of Patent: April 26, 2022

Assignee: International Business Machines Corporation

Inventors: Vitobha Munigala, Diptikalyan Saha, Sattwati Kundu, Geetha Adinarayan
Determining model-related bias associated with training data

Patent number: 11302096

Abstract: Methods, systems, and computer program products for determining model-related bias associated with training data are provided herein. A computer-implemented method includes obtaining, via execution of a first model, class designations attributed to data points used to train the first model; identifying any of the data points associated with an inaccurate class designation and/or a low-confidence class designation; training a second model using the data points from the dataset, but excluding the identified data points; determining bias related to at least a portion of those data points used to train the second model by: modifying one or more of the data points used to train the second model; executing the first model using the modified data points; and identifying a change to one or more class designations attributed to the modified data points as compared to before the modifying; and outputting identifying information pertaining to the determined bias.

Type: Grant

Filed: November 21, 2019

Date of Patent: April 12, 2022

Assignee: International Business Machines Corporation

Inventors: Pranay Kumar Lohia, Diptikalyan Saha, Manish Anand Bhide, Sameep Mehta
Domain query execution using user-provided definition

Patent number: 11294907

Abstract: One embodiment provides a method, including: receiving a query from a user; identifying that a desired definition of the at least one term is unknown, by determining that the at least one term does not map to a term having a known definition; receiving the definition of the at least one term from the user; adding the definition to a domain grammar comprising (i) domain-specific terminology and (ii) definitions corresponding to the terms within the domain grammar, wherein the adding comprises (a) extracting expressions from the requested definition and (b) adding, for the at least one term, the expressions into a structured format within the domain grammar; combining (iii) the requested definition and (iv) terms from the parsed query having previously known definitions into a complete query; and providing a response to the query by executing the complete query on a knowledge store.

Type: Grant

Filed: March 5, 2020

Date of Patent: April 5, 2022

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Jaydeep Sen, Ashish Mittal, Diptikalyan Saha, Karthik Sankaranarayanan
EXPLANATION FOR TIME SERIES FORECASTING MODELS

Publication number: 20220083897

Abstract: A method, system, and computer program product for explaining predictions made by black box time series models. The method may include identifying a black box time series model. The method may also include predicting one or more time instances using the black box time series model. The method may also include selecting a predicted time instance from the predicted data. The method may also include receiving training data for the black box time series model. The method may also include generating a set of white box time series models similar to the black box time series model. The method may also include selecting a preferred white box time series model. The method may also include analyzing behavior of the preferred white box time series model. The method may also include generating an explanation illustrating why the black box time series model forecasted the predicted time instance.

Type: Application

Filed: September 11, 2020

Publication date: March 17, 2022

Inventors: Diptikalyan Saha, Philips George John, Vitobha Munigala
Bias Source Identification and De-Biasing of a Dataset

Publication number: 20210406712

Abstract: A source of bias identification (SoBI) tool is provided that identifies sources of bias in a dataset. A bias detection operation is performed on results of a computer model, based on an input dataset, to generate groupings of values for a protected attribute corresponding to a detected bias in the operation of the computer model. The SoBI tool generates a plurality of sub-groups for each grouping of values. Each sub-group comprises an individual value, or a sub-range, for the protected attribute. The SoBI tool analyzes each of the sub-groups in the plurality of sub-groups, based on at least one source of bias identification criterion, to identify one or more sources of bias in the input dataset. The SoBI tool outputs a bias notification to an authorized computing device specifying the one or more sources of bias in the input dataset.

Type: Application

Filed: June 25, 2020

Publication date: December 30, 2021

Inventors: Manish Anand Bhide, Pranay Kumar Lohia, Diptikalyan Saha, Madhavi Katari
Natural language interface databases

Patent number: 11200222

Abstract: Embodiments are disclosed for correcting a natural language interface database (NLIDB) system. The techniques include receiving feedback indicating that an answer provided in response to a question for an NLIDB system is inaccurate. The techniques further include finding an ontology element for a datastore of the NLIDB system that matches to the feedback. The techniques also include selecting candidate annotations for the NLIDB system based on the ontology element and a data type of the ontology element. Additionally, the techniques include generating a question-answer (QA) pair for each of the candidate annotations. Further, the techniques include adding one of the candidate annotations to annotations for a natural language query (NLQ) engine of the NLIDB system based on a client verification of the QA pair.

Type: Grant

Filed: April 24, 2019

Date of Patent: December 14, 2021

Assignee: International Business Machines Corporation

Inventors: Jaydeep Sen, Diptikalyan Saha, Karthik Sankaranarayanan, Ashish Mittal, Manasa Jammi
Automatic semantic analysis and comparison of chatbot capabilities

Patent number: 11163960

Abstract: Techniques for the automatic semantic analysis and comparison of chatbot capabilities are disclosed. A first chatbot specification associated with a first chatbot is obtained that includes a first plurality of characteristics arranged in a plurality of categories. A second chatbot specification associated with a second chatbot is obtained that includes a second plurality of characteristics arranged in the plurality of categories. One or more differences between the first plurality of characteristics and the second plurality of characteristics for each of the plurality of categories are identified based at least in part on the first plurality of characteristics and the second plurality of characteristics. A natural language expression corresponding to the identified one or more differences is generated and presented to a user via a graphical user interface.

Type: Grant

Filed: April 18, 2019

Date of Patent: November 2, 2021

Assignee: International Business Machines Corporation

Inventors: Diptikalyan Saha, Rema Ananthanarayanan
CONTENT MODIFICATION USING NATURAL LANGUAGE PROCESSING TO INCLUDE FEATURES OF INTEREST TO VARIOUS GROUPS

Publication number: 20210286945

Abstract: According to one embodiment of the present invention, a system for modifying content associated with an item comprises at least one processor. Features of interest of the item to a plurality of different groups are determined based on user comments produced by members of the plurality of different groups. The members within each group have a common characteristic. The features of interest to each group within the content associated with the item are identified, and the content associated with the item is modified by balancing the features of interest to the plurality of different groups within the content associated with the item. Embodiments of the present invention further include a method and computer program product for modifying content associated with an item in substantially the same manner described above.

Type: Application

Filed: March 13, 2020

Publication date: September 16, 2021

Inventors: Seema Nagar, Kuntal Dey, Nishtha Madaan, Manish Anand Bhide, Sameep Mehta, Diptikalyan Saha
DOMAIN QUERY EXECUTION USING USER-PROVIDED DEFINITION

Publication number: 20210279243

Abstract: One embodiment provides a method, including: receiving a query from a user; identifying that a desired definition of the at least one term is unknown, by determining that the at least one term does not map to a term having a known definition; receiving the definition of the at least one term from the user; adding the definition to a domain grammar comprising (i) domain-specific terminology and (ii) definitions corresponding to the terms within the domain grammar, wherein the adding comprises (a) extracting expressions from the requested definition and (b) adding, for the at least one term, the expressions into a structured format within the domain grammar; combining (iii) the requested definition and (iv) terms from the parsed query having previously known definitions into a complete query; and providing a response to the query by executing the complete query on a knowledge store.

Type: Application

Filed: March 5, 2020

Publication date: September 9, 2021

Inventors: Jaydeep Sen, Ashish Mittal, Diptikalyan Saha, Karthik Sankaranarayanan
EXPLAINING ACCURACY DRIFT IN PRODUCTION DATA

Publication number: 20210279607

Abstract: A computer-implemented method according to one embodiment includes identifying an occurrence of accuracy drift by a trained model; identifying data associated with the accuracy drift, utilizing a drift detection model (DDM) constructed for the trained model; applying the data associated with the accuracy drift to a decision tree to determine a feature space and specific subset of the data causing the accuracy drift; analyzing a distribution of features within the feature space for the specific subset of the data causing the accuracy drift to determine specific features of the data causing the accuracy drift; and returning the specific features of the data causing the accuracy drift.

Type: Application

Filed: March 9, 2020

Publication date: September 9, 2021

Inventors: Manish Anand Bhide, Pranay Kumar Lohia, Diptikalyan Saha, Madhavi Katari
GENERATING EXPLANATIONS FOR CONTEXT AWARE SEQUENCE-TO-SEQUENCE MODELS

Publication number: 20210248455

Abstract: Methods, systems, and computer program products for generating explanations for a semantic parser are provided herein. A computer-implemented method includes providing to a generative model (i) at least one query and (ii) a context of at least one dataset applicable to the at least one query, wherein the generative model generates a plurality of perturbations for the at least one input query based on the context; providing the plurality of perturbations as inputs to a context aware sequence-to-sequence model, thereby obtaining a plurality of outputs; and generating, for (i) an additional query provided as input to the context aware sequence-to-sequence model and (ii) a context applicable to the additional query, an explanation indicative of one or more parts of the additional query that contributes to an output corresponding to the additional query, based at least in part on the plurality of outputs corresponding to the perturbations.

Type: Application

Filed: February 6, 2020

Publication date: August 12, 2021

Inventors: Rachamalla Anirudh Reddy, Pranay Kumar Lohia, Samiulla Zakir Hussain Shaikh, Diptikalyan Saha, Sameep Mehta

prev 1 2 3 4 5 next