Patents by Inventor Manish Anand Bhide

Manish Anand Bhide has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20210158102
    Abstract: Methods, systems, and computer program products for determining data representative of bias within a model are provided herein. A computer-implemented method includes obtaining a first dataset on which a model was trained, wherein the first dataset contains protected attributes, and a second dataset on which the model was trained, wherein the protected attributes have been removed from the second dataset; identifying, for each of the one or more protected attributes in the first dataset, one or more attributes in the second dataset correlated therewith; determining bias among at least a portion of the identified correlated attributes; and outputting, to at least one user, identifying information pertaining to the one or more instances of bias.
    Type: Application
    Filed: November 21, 2019
    Publication date: May 27, 2021
    Inventors: Pranay Kumar Lohia, Diptikalyan Saha, Manish Anand Bhide, Sameep Mehta
  • Publication number: 20210158076
    Abstract: Methods, systems, and computer program products for determining model-related bias associated with training data are provided herein. A computer-implemented method includes obtaining, via execution of a first model, class designations attributed to data points used to train the first model; identifying any of the data points associated with an inaccurate class designation and/or a low-confidence class designation; training a second model using the data points from the dataset, but excluding the identified data points; determining bias related to at least a portion of those data points used to train the second model by: modifying one or more of the data points used to train the second model; executing the first model using the modified data points; and identifying a change to one or more class designations attributed to the modified data points as compared to before the modifying; and outputting identifying information pertaining to the determined bias.
    Type: Application
    Filed: November 21, 2019
    Publication date: May 27, 2021
    Inventors: Pranay Kumar Lohia, Diptikalyan Saha, Manish Anand Bhide, Sameep Mehta
  • Publication number: 20210097052
    Abstract: Methods, systems, and computer program products for domain aware explainable anomaly and drift detection for multi-variate raw data using a constraint repository are provided herein. A computer-implemented method includes obtaining a set of data and information indicative of a domain of said set of data; obtaining constraints from a domain-indexed constraint repository based on said set of data and said information, wherein the domain-indexed constraint repository comprises a knowledge graph having a plurality of nodes, wherein each node comprises an attribute associated with at least one of a plurality of domains and constraints corresponding to the attribute; detecting anomalies in said set of data based on whether portions of said set of data violate said retrieved constraints; generating an explanation corresponding to each of the anomalies that describe the attributes corresponding to the violated constraints; and outputting an indication of the anomalies and the corresponding explanation.
    Type: Application
    Filed: September 27, 2019
    Publication date: April 1, 2021
    Inventors: Sandeep Hans, Samiulla Zakir Hussain Shaikh, Rema Ananthanarayanan, Diptikalyan Saha, Aniya Aggarwal, Gagandeep Singh, Pranay Kumar Lohia, Manish Anand Bhide, Sameep Mehta
  • Publication number: 20210035014
    Abstract: Aspects of the present invention provide an approach for reducing bias in active learning. In an embodiment, a data point is selected from a training dataset for a current training iteration while monitoring for data bias at each addition of data to a virtual training dataset. In addition, a machine learning model is examined for bias after adding the selected data point to the virtual training dataset. When data bias and/or model bias is detected, the data point is considered for potential label modification. The selected data point is modified and, if the raw value of the modified data point is within a predefined tolerance and within a bin of a desired class, the modified data point having a label of the target class is retained. Otherwise, it can be discarded.
    Type: Application
    Filed: July 31, 2019
    Publication date: February 4, 2021
    Inventors: Kuntal Dey, Sameep Mehta, Manish Anand Bhide
  • Publication number: 20210004311
    Abstract: Approaches presented herein enable optimization of a developing application to a user base. More specifically, application-centric data is gathered during a cultivation phase of the developing application. Substantially concurrently with the cultivation phase of the developing application, the application-centric data is analyzed according to static code of the developing application, a testing of the developing application, or a user experience (UX) design of the developing application. A machine learning model is applied to the analyzed application-centric data. This machine learning model is trained on historic application feedback data from applications available to the user base. Based on the machine learning model, a recommended change to optimize the developing application to the user base is generated.
    Type: Application
    Filed: July 2, 2019
    Publication date: January 7, 2021
    Inventors: Manish Anand Bhide, Vijay Kumar Ananthapur Bache, Srinivas Chebolu, Jhilam Bera
  • Publication number: 20200401565
    Abstract: In an approach for automatically ranking and routing data quality remediation tasks, a processor analyzes a data set ingested by a repository to produce a set of data quality problems. A processor computes a score for each data quality problem of the set of data quality problems. A processor identifies a route to send each data quality problem of the set of data quality problems. A processor exports each data quality problem according to the score and the route.
    Type: Application
    Filed: June 20, 2019
    Publication date: December 24, 2020
    Inventors: Yannick Saillet, Namit Kabra, Manish Anand Bhide
  • Publication number: 20200372398
    Abstract: A method, computer system, and a computer program product for utilizing provenance data to improve machine learning is provided. Embodiments of the present invention may include collecting provenance data. Embodiments of the present invention may include identifying model quality improvements based on the collected provenance data. Embodiments of the present invention may include identifying related models based on the collected provenance data. Embodiments of the present invention may include recommending model quality improvements to a user.
    Type: Application
    Filed: May 22, 2019
    Publication date: November 26, 2020
    Inventors: Samiulla Zakir Hussain Shaikh, HIMANSHU GUPTA, Rajmohan Chandrahasan, Sameep Mehta, Manish Anand Bhide
  • Publication number: 20200372056
    Abstract: A processor may receive a record. The record may include one or more segments of text. The processor may tag each segment of text with an indicator. The indicator may denote a specific instance of bias in each of a respective segment of text. The processor may automatically generate a summary of the record. The summary of the record may include a set of segments of text. The set of segments of text may have a different overall bias than the record. The processor may display the summary of the record to a user.
    Type: Application
    Filed: May 23, 2019
    Publication date: November 26, 2020
    Inventors: Manish Anand Bhide, Kuntal Dey, Nishtha Madaan, Seema Nagar, Sameep Mehta
  • Publication number: 20200372101
    Abstract: A processor may receive a record. The record may include one or more segments of text. The processor may automatically generate a first summary of the record. The processor may determine an overall bias of the first summary. The overall bias of the first summary may be identified from one or more instances of bias in the first summary. The processor may generate a second summary of the record. The second summary of the record may include an indicator of the overall bias of the first summary. The indicator may include a description of a type of overall bias of the first summary and a numerical value of the overall bias of the first summary. The processor may determine an overall bias of the second summary. The processor may display the second summary of the record to a user.
    Type: Application
    Filed: May 23, 2019
    Publication date: November 26, 2020
    Inventors: Manish Anand Bhide, Kuntal Dey, Nishtha Madaan, Seema Nagar, Sameep Mehta
  • Publication number: 20200356580
    Abstract: Relationship discovery can include receiving at a first mobile device a pair of ultrasonic signals conveyed at different frequencies from a second mobile device. The pair of ultrasonic signals can convey, respectively, a second user's contact information in an encrypted form and a key indicator. A contact number can be selected from a first user's contact list electronically stored on the first mobile device. The contact number can be selected based on the key indicator. A mutual contact can be identified in response to decrypting the second user's contact information using the contact number as a decryption key.
    Type: Application
    Filed: May 7, 2019
    Publication date: November 12, 2020
    Inventors: Saravanan Sadacharam, Manish Anand Bhide, Vijay Ekambaram, Vijay Kumar Ananthapur Bache
  • Publication number: 20200302006
    Abstract: An article is automatically augmented. The article and one or more comments are received. Comment elements are extracted from the one or more comments, and article elements are extracted from the article. Alignment scores are generated for comment-article pairs based on the extracted comment and article elements. Further, it is determined that at least one comment-article pair has an alignment score at or above a threshold alignment score. At least one augmentation feature is then generated.
    Type: Application
    Filed: July 15, 2019
    Publication date: September 24, 2020
    Inventors: Manish Anand Bhide, Nishtha Madaan, Seema Nagar, Sameep Mehta, Kuntal Dey
  • Publication number: 20200302005
    Abstract: An article is automatically augmented. The article and one or more comments are received. Comment elements are extracted from the one or more comments, and article elements are extracted from the article. Alignment scores are generated for comment-article pairs based on the extracted comment and article elements. Further, it is determined that at least one comment-article pair has an alignment score at or above a threshold alignment score. At least one augmentation feature is then generated.
    Type: Application
    Filed: March 22, 2019
    Publication date: September 24, 2020
    Inventors: Manish Anand Bhide, Nishtha Madaan, Seema Nagar, Sameep Mehta, Kuntal Dey
  • Patent number: 9542469
    Abstract: In the context of data administration in enterprises, an effective manner of providing a central data warehouse, particularly via employing a tool that helps by analyzing existing data and reports from different business units. In accordance with at least one embodiment of the invention, such a tool analyzes the data model of an enterprise and proposes alternatives for building a new data warehouse. The tool, in accordance with at least one embodiment of the invention, models the problem of identifying fact/dimension attributes of a warehouse model as a graph cut on a Dependency Analysis Graph (DAG). The DAG is built using existing data models and the report generation scripts. The tool also uses the DAG for generation of ETL (Extract, Transform Load) scripts that can be used to populate the newly proposed data warehouse from data present in the existing schemas.
    Type: Grant
    Filed: August 25, 2010
    Date of Patent: January 10, 2017
    Assignee: International Business Machines Corporation
    Inventors: Vishal Singh Batra, Manish Anand Bhide, Mukesh Kumar Mohania, Sumit Negi
  • Patent number: 9477651
    Abstract: A method, a computer program product and a system identify partition locations within an extended markup language (XML) document without parsing so as to process portions of said document in parallel. The XML document includes sections required to remain continuous. The document is scanned for continuous sections without parsing, and boundaries of the initial partitions are adjusted to reside outside the continuous sections to determine resulting partitions for the document. The resulting partitions may be processed in parallel to provide the document information for storage.
    Type: Grant
    Filed: September 29, 2010
    Date of Patent: October 25, 2016
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Manoj K. Agarwal, Amir Bar-Or, Manish Anand Bhide, Sebastian Ertel, Sriram K. Padmanabhan
  • Patent number: 9298787
    Abstract: A computer-implemented method, computer program product and a system for supporting star and snowflake data schemas for use with an Extract, Transform, Load (ETL) process, comprising selecting a data source comprising dimensional data, where the dimensional data comprises at least one source table comprising at least one source column, importing a data model for the dimensional data into a data integration system, analyzing the imported data model to select a star or snowflake target data schema comprising target dimensions and target facts, generating a meta-model representation by mapping at least one source table or source column to each target fact and target dimension, automatically converting the meta-model representation into one or more ETL jobs, and executing the ETL jobs to extract the dimensional data from the data source and loading the dimensional data into the selected target data schema in a target data system.
    Type: Grant
    Filed: November 9, 2011
    Date of Patent: March 29, 2016
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Manish Anand Bhide, Srinivas Kiran Mittapalli, Sriram Padmanabhan
  • Patent number: 9031902
    Abstract: A computer-implemented method, computer program product and a system for identifying and handling slowly changing dimension (SCD) attributes for use with an Extract, Transform, Load (ETL) process, comprising importing a data model for dimensional data into a data integration system, where the dimensional data comprises a plurality of attributes, identifying via a data discovery analyzer one or more attributes in the data model as SCD attributes, importing the identified SCD attributes into the data integration system, selecting a data source comprising dimensional data, automatically generating an ETL job for the dimensional data utilizing the imported SCD attributes, and executing the automatically generated ETL to extract the dimensional data from the data source and loading the dimensional data into the imported SCD attributes in a target data system.
    Type: Grant
    Filed: November 10, 2011
    Date of Patent: May 12, 2015
    Assignee: International Business Machines Corporation
    Inventors: Manish Anand Bhide, Srinivas Kiran Mittapalli, Sriram K. Padmanabhan
  • Patent number: 8688606
    Abstract: An embodiment of the invention provides a method and system for analyzing a plurality of reports. More specifically, a change detection module predicts results of future reports based on past reports and identifies a first report that deviates from its predicted results. A dependency analysis module connected to the change detection module at least one report sharing a dependency with the first report by performing a dependency analysis and/or a usage analysis. The dependency analysis labels the first report and at least one second report as sharing a dependency if the second report deviates from its predicted results. The usage analysis labeling the first report and at least one report analyzed by an analyst as sharing a dependency if the report analyzed by the analyst is analyzed in response to the identification of the first report.
    Type: Grant
    Filed: January 24, 2011
    Date of Patent: April 1, 2014
    Assignee: International Business Machines Corporation
    Inventors: Sumit Negi, Manish Anand Bhide, Vishal Singh Batra, Govind Kothari
  • Patent number: 8682825
    Abstract: An embodiment of the invention provides a method and system for analyzing a plurality of reports. More specifically, a change detection module predicts results of future reports based on past reports and identifies a first report that deviates from its predicted results. A dependency analysis module connected to the change detection module at least one report sharing a dependency with the first report by performing a dependency analysis and/or a usage analysis. The dependency analysis labels the first report and at least one second report as sharing a dependency if the second report deviates from its predicted results. The usage analysis labeling the first report and at least one report analyzed by an analyst as sharing a dependency if the report analyzed by the analyst is analyzed in response to the identification of the first report.
    Type: Grant
    Filed: July 21, 2012
    Date of Patent: March 25, 2014
    Assignee: International Business Machines Corporation
    Inventors: Sumit Negi, Manish Anand Bhide, Vishal Singh Batra, Govind Kothari
  • Patent number: 8589384
    Abstract: Methods and arrangements for employing descriptors for agent-customer interactions are disclosed. Filtering the pooled records based on one or more predetermined criteria is done such that analyzing the filtered records and comparing one interaction between an agent and a customer with another interaction between an agent and a customer may occur.
    Type: Grant
    Filed: August 25, 2010
    Date of Patent: November 19, 2013
    Assignee: International Business Machines Corporation
    Inventors: Manish Anand Bhide, Om Dadaji Deshmukh, Ashish Verma
  • Patent number: 8561126
    Abstract: Methods, systems and computer program products for automatically enforcing obligations in accordance with a data-handling policy are disclosed. Requests by users for accessing data stored in a data repository are intercepted. A determination is made whether any obligations apply to each data item requested in accordance with the data handling policy. The determination may relate to whether rules having associated obligations identified in the data-handling policy apply to data items requested by a user. The obligations are automatically executed at an appropriate time after access of the data. Association of a data item requested by the user with an obligation may be recorded and tracked to determine the appropriate time for executing the obligation.
    Type: Grant
    Filed: December 29, 2004
    Date of Patent: October 15, 2013
    Assignee: International Business Machines Corporation
    Inventors: Rema Ananthanarayanan, Mukesh K Mohania, Ajay Kumar Gupta, Calvin Stacy Powers, Sachindra Joshi, Manish Anand Bhide