Patents by Inventor Manish Anand Bhide

Manish Anand Bhide has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Determining Data Representative of Bias Within a Model

Publication number: 20210158102

Abstract: Methods, systems, and computer program products for determining data representative of bias within a model are provided herein. A computer-implemented method includes obtaining a first dataset on which a model was trained, wherein the first dataset contains protected attributes, and a second dataset on which the model was trained, wherein the protected attributes have been removed from the second dataset; identifying, for each of the one or more protected attributes in the first dataset, one or more attributes in the second dataset correlated therewith; determining bias among at least a portion of the identified correlated attributes; and outputting, to at least one user, identifying information pertaining to the one or more instances of bias.

Type: Application

Filed: November 21, 2019

Publication date: May 27, 2021

Inventors: Pranay Kumar Lohia, Diptikalyan Saha, Manish Anand Bhide, Sameep Mehta
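
The step in the abstract above of finding remaining attributes that correlate with a protected attribute can be illustrated with a small sketch. This is not the claimed method: the Pearson measure, the `threshold` value, and the column names (`zip_group`, `tenure`) are all assumptions chosen for illustration.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length numeric sequences."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return 0.0  # a constant column carries no correlation signal
    return cov / sqrt(var_x * var_y)


def correlated_attributes(protected, candidates, threshold=0.8):
    """Names of candidate columns whose absolute correlation with the
    protected column meets the (hypothetical) threshold."""
    return sorted(
        name
        for name, values in candidates.items()
        if abs(pearson(protected, values)) >= threshold
    )


# Toy data: 'zip_group' tracks the protected attribute, 'tenure' does not.
protected_gender = [0, 0, 1, 1, 0, 1]
remaining = {
    "zip_group": [0, 0, 1, 1, 0, 1],
    "tenure": [3, 7, 3, 7, 5, 5],
}
proxies = correlated_attributes(protected_gender, remaining)
```

Columns returned in `proxies` would then be the candidates to examine for bias, since they can stand in for the removed protected attribute.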

Determining Model-Related Bias Associated with Training Data

Publication number: 20210158076

Abstract: Methods, systems, and computer program products for determining model-related bias associated with training data are provided herein. A computer-implemented method includes obtaining, via execution of a first model, class designations attributed to data points used to train the first model; identifying any of the data points associated with an inaccurate class designation and/or a low-confidence class designation; training a second model using the data points from the dataset, but excluding the identified data points; determining bias related to at least a portion of those data points used to train the second model by: modifying one or more of the data points used to train the second model; executing the first model using the modified data points; and identifying a change to one or more class designations attributed to the modified data points as compared to before the modifying; and outputting identifying information pertaining to the determined bias.

Type: Application

Filed: November 21, 2019

Publication date: May 27, 2021

Inventors: Pranay Kumar Lohia, Diptikalyan Saha, Manish Anand Bhide, Sameep Mehta
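
The modify-then-re-execute test described in the abstract above can be sketched as follows. It is only an illustration of the general idea of perturbing a data point and watching for a changed class designation. The `toy_model` classifier, the `group` attribute, and the score cut-off are hypothetical and do not come from the application.

```python
def flipped_predictions(model, rows, attribute, alternatives):
    """Return (row, new_value, before, after) for every perturbation of
    `attribute` that changes the model's class designation."""
    flips = []
    for row in rows:
        before = model(row)
        for value in alternatives:
            if value == row[attribute]:
                continue  # not a modification
            modified = {**row, attribute: value}
            after = model(modified)
            if after != before:
                flips.append((row, value, before, after))
    return flips


def toy_model(row):
    # Hypothetical stand-in classifier that (wrongly) leans on 'group'.
    bonus = 10 if row["group"] == "A" else 0
    return "approve" if row["score"] + bonus >= 60 else "reject"


rows = [
    {"score": 55, "group": "A"},
    {"score": 70, "group": "B"},
    {"score": 52, "group": "B"},
]
flips = flipped_predictions(toy_model, rows, "group", ["A", "B"])
```

Each entry in `flips` records a data point whose outcome depends on the perturbed attribute alone, which is the kind of evidence the abstract refers to as a change in class designation.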

DOMAIN AWARE EXPLAINABLE ANOMALY AND DRIFT DETECTION FOR MULTI-VARIATE RAW DATA USING A CONSTRAINT REPOSITORY

Publication number: 20210097052

Abstract: Methods, systems, and computer program products for domain aware explainable anomaly and drift detection for multi-variate raw data using a constraint repository are provided herein. A computer-implemented method includes obtaining a set of data and information indicative of a domain of said set of data; obtaining constraints from a domain-indexed constraint repository based on said set of data and said information, wherein the domain-indexed constraint repository comprises a knowledge graph having a plurality of nodes, wherein each node comprises an attribute associated with at least one of a plurality of domains and constraints corresponding to the attribute; detecting anomalies in said set of data based on whether portions of said set of data violate said retrieved constraints; generating an explanation corresponding to each of the anomalies that describe the attributes corresponding to the violated constraints; and outputting an indication of the anomalies and the corresponding explanation.

Type: Application

Filed: September 27, 2019

Publication date: April 1, 2021

Inventors: Sandeep Hans, Samiulla Zakir Hussain Shaikh, Rema Ananthanarayanan, Diptikalyan Saha, Aniya Aggarwal, Gagandeep Singh, Pranay Kumar Lohia, Manish Anand Bhide, Sameep Mehta
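
A rough sketch of constraint-based anomaly detection with explanations, in the spirit of the abstract above. The abstract describes a knowledge graph; this sketch substitutes a flat dictionary keyed by domain, and the `healthcare` domain, attribute names, and value ranges are invented for illustration.

```python
# Hypothetical domain-indexed repository:
# domain -> attribute -> (human-readable description, check function)
CONSTRAINTS = {
    "healthcare": {
        "age": ("age must be between 0 and 120", lambda v: 0 <= v <= 120),
        "heart_rate": (
            "heart_rate must be between 20 and 250",
            lambda v: 20 <= v <= 250,
        ),
    },
}


def detect_anomalies(records, domain):
    """Return (record index, attribute, explanation) for each violated constraint."""
    constraints = CONSTRAINTS.get(domain, {})
    findings = []
    for index, record in enumerate(records):
        for attribute, (description, check) in constraints.items():
            if attribute in record and not check(record[attribute]):
                explanation = (
                    f"record {index}: {attribute}={record[attribute]!r} "
                    f"violates '{description}'"
                )
                findings.append((index, attribute, explanation))
    return findings


records = [
    {"age": 34, "heart_rate": 72},
    {"age": 140, "heart_rate": 72},
]
findings = detect_anomalies(records, "healthcare")
```

Keeping the description next to each check is what lets every reported anomaly carry an explanation that names the violated attribute.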

TRAINING ARTIFICIAL INTELLIGENCE MODELS USING ACTIVE LEARNING

Publication number: 20210035014

Abstract: Aspects of the present invention provide an approach for reducing bias in active learning. In an embodiment, a data point is selected from a training dataset for a current training iteration while monitoring for data bias at each addition of data to a virtual training dataset. In addition, a machine learning model is examined for bias after adding the selected data point to the virtual training dataset. When data bias and/or model bias is detected, the data point is considered for potential label modification. The selected data point is modified and, if the raw value of the modified data point is within a predefined tolerance and within a bin of a desired class, the modified data point having a label of the target class is retained. Otherwise, it can be discarded.

Type: Application

Filed: July 31, 2019

Publication date: February 4, 2021

Inventors: Kuntal Dey, Sameep Mehta, Manish Anand Bhide

AUTOMATED FEEDBACK-BASED APPLICATION OPTIMIZATION

Publication number: 20210004311

Abstract: Approaches presented herein enable optimization of a developing application to a user base. More specifically, application-centric data is gathered during a cultivation phase of the developing application. Substantially concurrently with the cultivation phase of the developing application, the application-centric data is analyzed according to static code of the developing application, a testing of the developing application, or a user experience (UX) design of the developing application. A machine learning model is applied to the analyzed application-centric data. This machine learning model is trained on historic application feedback data from applications available to the user base. Based on the machine learning model, a recommended change to optimize the developing application to the user base is generated.

Type: Application

Filed: July 2, 2019

Publication date: January 7, 2021

Inventors: Manish Anand Bhide, Vijay Kumar Ananthapur Bache, Srinivas Chebolu, Jhilam Bera

AUTOMATICALLY RANK AND ROUTE DATA QUALITY REMEDIATION TASKS

Publication number: 20200401565

Abstract: In an approach for automatically ranking and routing data quality remediation tasks, a processor analyzes a data set ingested by a repository to produce a set of data quality problems. A processor computes a score for each data quality problem of the set of data quality problems. A processor identifies a route to send each data quality problem of the set of data quality problems. A processor exports each data quality problem according to the score and the route.

Type: Application

Filed: June 20, 2019

Publication date: December 24, 2020

Inventors: Yannick Saillet, Namit Kabra, Manish Anand Bhide
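
The score, route, and export steps in the abstract above could look roughly like this. The scoring formula (severity weight times affected rows), the `SEVERITY` weights, and the `ROUTES` team names are assumptions made for this sketch. The abstract does not say how scores or routes are computed.

```python
# Hypothetical severity weights and routing table.
SEVERITY = {"missing_value": 3, "format_violation": 2, "duplicate": 1}
ROUTES = {"customer": "crm-stewards", "orders": "sales-ops"}
DEFAULT_ROUTE = "data-governance"


def rank_and_route(problems):
    """Attach a score and a route to each problem, highest score first."""
    tasks = []
    for problem in problems:
        score = SEVERITY.get(problem["kind"], 1) * problem["affected_rows"]
        route = ROUTES.get(problem["table"], DEFAULT_ROUTE)
        tasks.append({**problem, "score": score, "route": route})
    return sorted(tasks, key=lambda task: task["score"], reverse=True)


problems = [
    {"kind": "duplicate", "table": "customer", "affected_rows": 500},
    {"kind": "missing_value", "table": "orders", "affected_rows": 200},
    {"kind": "format_violation", "table": "inventory", "affected_rows": 100},
]
tasks = rank_and_route(problems)
```

The sorted `tasks` list is the export order: each task carries both its priority and the team it would be sent to.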

MODEL QUALITY AND RELATED MODELS USING PROVENANCE DATA

Publication number: 20200372398

Abstract: A method, computer system, and a computer program product for utilizing provenance data to improve machine learning is provided. Embodiments of the present invention may include collecting provenance data. Embodiments of the present invention may include identifying model quality improvements based on the collected provenance data. Embodiments of the present invention may include identifying related models based on the collected provenance data. Embodiments of the present invention may include recommending model quality improvements to a user.

Type: Application

Filed: May 22, 2019

Publication date: November 26, 2020

Inventors: Samiulla Zakir Hussain Shaikh, Himanshu Gupta, Rajmohan Chandrahasan, Sameep Mehta, Manish Anand Bhide

AUTOMATIC SUMMARIZATION WITH BIAS MINIMIZATION

Publication number: 20200372056

Abstract: A processor may receive a record. The record may include one or more segments of text. The processor may tag each segment of text with an indicator. The indicator may denote a specific instance of bias in each of a respective segment of text. The processor may automatically generate a summary of the record. The summary of the record may include a set of segments of text. The set of segments of text may have a different overall bias than the record. The processor may display the summary of the record to a user.

Type: Application

Filed: May 23, 2019

Publication date: November 26, 2020

Inventors: Manish Anand Bhide, Kuntal Dey, Nishtha Madaan, Seema Nagar, Sameep Mehta

AUTOMATIC SUMMARIZATION WITH BIAS MINIMIZATION

Publication number: 20200372101

Abstract: A processor may receive a record. The record may include one or more segments of text. The processor may automatically generate a first summary of the record. The processor may determine an overall bias of the first summary. The overall bias of the first summary may be identified from one or more instances of bias in the first summary. The processor may generate a second summary of the record. The second summary of the record may include an indicator of the overall bias of the first summary. The indicator may include a description of a type of overall bias of the first summary and a numerical value of the overall bias of the first summary. The processor may determine an overall bias of the second summary. The processor may display the second summary of the record to a user.

Type: Application

Filed: May 23, 2019

Publication date: November 26, 2020

Inventors: Manish Anand Bhide, Kuntal Dey, Nishtha Madaan, Seema Nagar, Sameep Mehta

RELATIONSHIP DISCOVERY

Publication number: 20200356580

Abstract: Relationship discovery can include receiving at a first mobile device a pair of ultrasonic signals conveyed at different frequencies from a second mobile device. The pair of ultrasonic signals can convey, respectively, a second user's contact information in an encrypted form and a key indicator. A contact number can be selected from a first user's contact list electronically stored on the first mobile device. The contact number can be selected based on the key indicator. A mutual contact can be identified in response to decrypting the second user's contact information using the contact number as a decryption key.

Type: Application

Filed: May 7, 2019

Publication date: November 12, 2020

Inventors: Saravanan Sadacharam, Manish Anand Bhide, Vijay Ekambaram, Vijay Kumar Ananthapur Bache

COMMENT-BASED ARTICLE AUGMENTATION

Publication number: 20200302006

Abstract: An article is automatically augmented. The article and one or more comments are received. Comment elements are extracted from the one or more comments, and article elements are extracted from the article. Alignment scores are generated for comment-article pairs based on the extracted comment and article elements. Further, it is determined that at least one comment-article pair has an alignment score at or above a threshold alignment score. At least one augmentation feature is then generated.

Type: Application

Filed: July 15, 2019

Publication date: September 24, 2020

Inventors: Manish Anand Bhide, Nishtha Madaan, Seema Nagar, Sameep Mehta, Kuntal Dey
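
One simple way to picture the alignment scores mentioned in the abstract above is token overlap between a comment and an article passage. The abstract does not specify how elements are extracted or how scores are computed. The Jaccard measure, the word-level "elements", and the `threshold` below are assumptions for illustration only.

```python
import re


def elements(text):
    """Lower-cased word tokens, used here as a stand-in for extracted elements."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def alignment_score(comment, passage):
    """Jaccard overlap between the elements of a comment and a passage."""
    c, p = elements(comment), elements(passage)
    if not c or not p:
        return 0.0
    return len(c & p) / len(c | p)


def aligned_pairs(comments, passages, threshold=0.3):
    """(comment index, passage index, score) for pairs at or above the threshold."""
    pairs = []
    for ci, comment in enumerate(comments):
        for pi, passage in enumerate(passages):
            score = alignment_score(comment, passage)
            if score >= threshold:
                pairs.append((ci, pi, score))
    return pairs


passages = [
    "The council approved the new bridge budget.",
    "Local schools reopen next week.",
]
comments = [
    "The bridge budget was approved too quickly.",
    "Great photo!",
]
pairs = aligned_pairs(comments, passages)
```

Pairs that clear the threshold are the ones for which an augmentation feature would be generated. A production system would likely use a richer similarity measure than word overlap.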

COMMENT-BASED ARTICLE AUGMENTATION

Publication number: 20200302005

Abstract: An article is automatically augmented. The article and one or more comments are received. Comment elements are extracted from the one or more comments, and article elements are extracted from the article. Alignment scores are generated for comment-article pairs based on the extracted comment and article elements. Further, it is determined that at least one comment-article pair has an alignment score at or above a threshold alignment score. At least one augmentation feature is then generated.

Type: Application

Filed: March 22, 2019

Publication date: September 24, 2020

Inventors: Manish Anand Bhide, Nishtha Madaan, Seema Nagar, Sameep Mehta, Kuntal Dey

Data warehouse data model adapters

Patent number: 9542469

Abstract: In the context of data administration in enterprises, an effective manner of providing a central data warehouse is presented, particularly via employing a tool that helps by analyzing existing data and reports from different business units. In accordance with at least one embodiment of the invention, such a tool analyzes the data model of an enterprise and proposes alternatives for building a new data warehouse. The tool, in accordance with at least one embodiment of the invention, models the problem of identifying fact/dimension attributes of a warehouse model as a graph cut on a Dependency Analysis Graph (DAG). The DAG is built using existing data models and the report generation scripts. The tool also uses the DAG for generation of ETL (Extract, Transform, Load) scripts that can be used to populate the newly proposed data warehouse from data present in the existing schemas.

Type: Grant

Filed: August 25, 2010

Date of Patent: January 10, 2017

Assignee: International Business Machines Corporation

Inventors: Vishal Singh Batra, Manish Anand Bhide, Mukesh Kumar Mohania, Sumit Negi

Finding partition boundaries for parallel processing of markup language documents

Patent number: 9477651

Abstract: A method, a computer program product and a system identify partition locations within an extensible markup language (XML) document without parsing so as to process portions of said document in parallel. The XML document includes sections required to remain continuous. The document is scanned for continuous sections without parsing, and boundaries of the initial partitions are adjusted to reside outside the continuous sections to determine resulting partitions for the document. The resulting partitions may be processed in parallel to provide the document information for storage.

Type: Grant

Filed: September 29, 2010

Date of Patent: October 25, 2016

Assignee: International Business Machines Corporation

Inventors: Manoj K. Agarwal, Amir Bar-Or, Manish Anand Bhide, Sebastian Ertel, Sriram K. Padmanabhan
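
The boundary adjustment described in the abstract above can be sketched with a plain text scan. This sketch assumes that CDATA sections and comments are the "continuous sections", which the abstract does not state, and it ignores other concerns such as keeping individual tags whole. The function name and the equal-size initial split are likewise illustrative choices.

```python
import re

# Assumption: CDATA sections and comments are the spans that must stay whole.
CONTINUOUS = re.compile(r"<!\[CDATA\[.*?\]\]>|<!--.*?-->", re.DOTALL)


def partition_boundaries(text, parts):
    """Split `text` into roughly equal chunks, moving any boundary that
    falls inside a continuous section to the end of that section."""
    spans = [match.span() for match in CONTINUOUS.finditer(text)]
    boundaries = []
    for i in range(1, parts):
        pos = len(text) * i // parts  # initial, evenly spaced cut
        for start, end in spans:
            if start < pos < end:
                pos = end  # push the cut past the protected span
                break
        # Skip cuts that collapse onto an earlier cut or the end of the text.
        if pos < len(text) and (not boundaries or pos > boundaries[-1]):
            boundaries.append(pos)
    return boundaries


doc = "<r><a>1</a><![CDATA[keep <b> together]]><c>2</c></r>"
cuts = partition_boundaries(doc, 2)
chunks = [doc[i:j] for i, j in zip([0] + cuts, cuts + [len(doc)])]
```

The scan uses a regular expression rather than an XML parser, mirroring the "without parsing" aspect: the midpoint cut lands inside the CDATA section and is moved to just after its closing `]]>`.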

Star and snowflake schemas in extract, transform, load processes

Patent number: 9298787

Abstract: A computer-implemented method, computer program product and a system for supporting star and snowflake data schemas for use with an Extract, Transform, Load (ETL) process, comprising selecting a data source comprising dimensional data, where the dimensional data comprises at least one source table comprising at least one source column, importing a data model for the dimensional data into a data integration system, analyzing the imported data model to select a star or snowflake target data schema comprising target dimensions and target facts, generating a meta-model representation by mapping at least one source table or source column to each target fact and target dimension, automatically converting the meta-model representation into one or more ETL jobs, and executing the ETL jobs to extract the dimensional data from the data source and loading the dimensional data into the selected target data schema in a target data system.

Type: Grant

Filed: November 9, 2011

Date of Patent: March 29, 2016

Assignee: International Business Machines Corporation

Inventors: Manish Anand Bhide, Srinivas Kiran Mittapalli, Sriram Padmanabhan

Slowly changing dimension attributes in extract, transform, load processes

Patent number: 9031902

Abstract: A computer-implemented method, computer program product and a system for identifying and handling slowly changing dimension (SCD) attributes for use with an Extract, Transform, Load (ETL) process, comprising importing a data model for dimensional data into a data integration system, where the dimensional data comprises a plurality of attributes, identifying via a data discovery analyzer one or more attributes in the data model as SCD attributes, importing the identified SCD attributes into the data integration system, selecting a data source comprising dimensional data, automatically generating an ETL job for the dimensional data utilizing the imported SCD attributes, and executing the automatically generated ETL to extract the dimensional data from the data source and loading the dimensional data into the imported SCD attributes in a target data system.

Type: Grant

Filed: November 10, 2011

Date of Patent: May 12, 2015

Assignee: International Business Machines Corporation

Inventors: Manish Anand Bhide, Srinivas Kiran Mittapalli, Sriram K. Padmanabhan

Smarter business intelligence systems

Patent number: 8688606

Abstract: An embodiment of the invention provides a method and system for analyzing a plurality of reports. More specifically, a change detection module predicts results of future reports based on past reports and identifies a first report that deviates from its predicted results. A dependency analysis module connected to the change detection module identifies at least one report sharing a dependency with the first report by performing a dependency analysis and/or a usage analysis. The dependency analysis labels the first report and at least one second report as sharing a dependency if the second report deviates from its predicted results. The usage analysis labels the first report and at least one report analyzed by an analyst as sharing a dependency if the report analyzed by the analyst is analyzed in response to the identification of the first report.

Type: Grant

Filed: January 24, 2011

Date of Patent: April 1, 2014

Assignee: International Business Machines Corporation

Inventors: Sumit Negi, Manish Anand Bhide, Vishal Singh Batra, Govind Kothari
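
The change detection step in the abstract above, predicting a report's result from past reports and flagging a deviation, can be pictured with a very simple baseline. The mean-based prediction, the `k` standard-deviation tolerance, and the report names are assumptions for this sketch. The patent does not disclose its prediction model in the abstract.

```python
from statistics import mean, stdev


def deviating_reports(history, latest, k=3.0):
    """Names of reports whose latest value is more than k standard
    deviations away from the mean of their past values."""
    flagged = []
    for name, past in history.items():
        predicted = mean(past)          # naive prediction from past reports
        tolerance = k * stdev(past)     # allowed deviation from the prediction
        if abs(latest[name] - predicted) > tolerance:
            flagged.append(name)
    return flagged


history = {
    "weekly_sales": [100, 102, 98, 101, 99],
    "returns": [10, 12, 11, 9, 10],
}
latest = {"weekly_sales": 140, "returns": 11}
flagged = deviating_reports(history, latest)
```

A report flagged this way would play the role of the "first report", after which other reports that deviate at the same time could be labelled as sharing a dependency with it.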

Smarter business intelligence systems

Patent number: 8682825

Abstract: An embodiment of the invention provides a method and system for analyzing a plurality of reports. More specifically, a change detection module predicts results of future reports based on past reports and identifies a first report that deviates from its predicted results. A dependency analysis module connected to the change detection module identifies at least one report sharing a dependency with the first report by performing a dependency analysis and/or a usage analysis. The dependency analysis labels the first report and at least one second report as sharing a dependency if the second report deviates from its predicted results. The usage analysis labels the first report and at least one report analyzed by an analyst as sharing a dependency if the report analyzed by the analyst is analyzed in response to the identification of the first report.

Type: Grant

Filed: July 21, 2012

Date of Patent: March 25, 2014

Assignee: International Business Machines Corporation

Inventors: Sumit Negi, Manish Anand Bhide, Vishal Singh Batra, Govind Kothari

Methods and arrangements for employing descriptors for agent-customer interactions

Patent number: 8589384

Abstract: Methods and arrangements for employing descriptors for agent-customer interactions are disclosed. Filtering the pooled records based on one or more predetermined criteria is done such that analyzing the filtered records and comparing one interaction between an agent and a customer with another interaction between an agent and a customer may occur.

Type: Grant

Filed: August 25, 2010

Date of Patent: November 19, 2013

Assignee: International Business Machines Corporation

Inventors: Manish Anand Bhide, Om Dadaji Deshmukh, Ashish Verma

Automatic enforcement of obligations according to a data-handling policy

Patent number: 8561126

Abstract: Methods, systems and computer program products for automatically enforcing obligations in accordance with a data-handling policy are disclosed. Requests by users for accessing data stored in a data repository are intercepted. A determination is made whether any obligations apply to each data item requested in accordance with the data handling policy. The determination may relate to whether rules having associated obligations identified in the data-handling policy apply to data items requested by a user. The obligations are automatically executed at an appropriate time after access of the data. Association of a data item requested by the user with an obligation may be recorded and tracked to determine the appropriate time for executing the obligation.

Type: Grant

Filed: December 29, 2004

Date of Patent: October 15, 2013

Assignee: International Business Machines Corporation

Inventors: Rema Ananthanarayanan, Mukesh K Mohania, Ajay Kumar Gupta, Calvin Stacy Powers, Sachindra Joshi, Manish Anand Bhide
