Patents by Inventor Sameep Mehta

Sameep Mehta has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20230169070
    Abstract: A computer implemented method, computer system, and computer program product for transforming mapped data fields of enterprise applications. A number of processor units receiving a matching from a source data field to a target data field. The set of processor units receiving a number of annotated examples of transformations from a source format to a target format. Based on the annotated examples, the set of processor units autogenerating a query language expression for transforming data items from the source format to the target format.
    Type: Application
    Filed: November 29, 2021
    Publication date: June 1, 2023
    Inventors: Ramkumar Ramalingam, Nagarjuna Surabathina, Thanmayi Mruthyunjaya, Nitin Gupta, Pranay Kumar Lohia, Shanmukha Chaitanya Guttula, Hima Patel, Sameep Mehta, Matu Agarwal, Mudit Mehrotra
  • Publication number: 20230169050
    Abstract: Techniques for automatically determining different data types found in databases are disclosed. In one example, a computer implemented method comprises receiving a portion of identifying information for one or more components of a database, and generating one or more descriptions for the one or more components based at least in part on the portion of the identifying information for the one or more components. The one or more descriptions are inputted to one or more machine learning models, and, using the one or more machine learning models, one or more data types associated with the one or more components are predicted. The prediction is based at least in part on the one or more descriptions.
    Type: Application
    Filed: November 29, 2021
    Publication date: June 1, 2023
    Inventors: Rajmohan Chandrahasan, Ankush Gupta, Venkata Nagaraju Pavuluri, Arvind Agarwal, Sameep Mehta
  • Publication number: 20230153537
    Abstract: An apparatus is disclosed which includes at least one processing device comprising a processor coupled to a memory. The at least one processing device, when executing program code, is configured to: extract one or more entities identified in a plurality of data artifacts based at least in part on one or more datasets, extract one or more entities identified in a plurality of code artifacts based at least in part on the one or more datasets, extract one or more entities identified in a plurality of user interface artifacts based at least in part on the one or more datasets, generate a set of dependency graphs each based at least in part on one or more relationships among the respective extracted one or more entities, and perform one or more of a lexical analysis and a semantic analysis on the set of dependency graphs to identify a data domain of the one or more datasets.
    Type: Application
    Filed: November 18, 2021
    Publication date: May 18, 2023
    Inventors: Malolan Chetlur, Arvind Agarwal, Subhendu Dey, Sameep Mehta, Sandipan Sarkar
  • Publication number: 20230135407
    Abstract: An embodiment establishes a designated attribute value as a semantic criterion for grouping records in a bucket, identifies a first set of records having attribute values that satisfy the semantic criterion, and adds the first set of records to the bucket. The embodiment detects that the first set of records represent a first series of events that occurred in succession at respective times. The embodiment derives a temporal attribute value representative of a time pattern formed by the times of the first series of events and designates the temporal attribute value as a temporal criterion for grouping records in the bucket. The embodiment identifies a second set of records that represent a second series of events and satisfy the temporal criterion and adds the second set of records to the bucket based at least in part on the second set of records satisfying the temporal criterion.
    Type: Application
    Filed: November 3, 2021
    Publication date: May 4, 2023
    Applicant: International Business Machines Corporation
    Inventors: Avirup Saha, Balaji Ganesan, Shettigar Parkala Srinivas, Sumit Bhatia, Sameep Mehta, Soma Shekar Naganna
  • Publication number: 20230128548
    Abstract: One embodiment provides a method, including: receiving, at a central server, data from each of a plurality of data sources, the plurality of data sources being within a plurality of data storage locations, wherein the central server includes a validation dataset having a plurality of annotated datapoints; computing, at the central server, an influential score for each of the plurality of data sources based upon the data provided to the central server from each of the plurality of data sources, wherein an influential score of a data source identifies an influence of the data source in accurately predicting annotations of the validation dataset; selecting, at the central server and based upon the influential score of the plurality of data sources, a subset of the plurality of data sources; and generating, at the central server, the training dataset utilizing the data of the data sources included within the subset.
    Type: Application
    Filed: October 25, 2021
    Publication date: April 27, 2023
    Inventors: Ruhi Sharma Mittal, Ramasuri Narayanam, Lokesh Nagalapatti, Sameep Mehta
  • Patent number: 11636386
    Abstract: Methods, systems, and computer program products for determining data representative of bias within a model are provided herein. A computer-implemented method includes obtaining a first dataset on which a model was trained, wherein the first dataset contains protected attributes, and a second dataset on which the model was trained, wherein the protected attributes have been removed from the second dataset; identifying, for each of the one or more protected attributes in the first dataset, one or more attributes in the second dataset correlated therewith; determining bias among at least a portion of the identified correlated attributes; and outputting, to at least one user, identifying information pertaining to the one or more instances of bias.
    Type: Grant
    Filed: November 21, 2019
    Date of Patent: April 25, 2023
    Assignee: International Business Machines Corporation
    Inventors: Pranay Kumar Lohia, Diptikalyan Saha, Manish Anand Bhide, Sameep Mehta
  • Patent number: 11630833
    Abstract: One embodiment provides a computer implemented method, including: receiving, from a user, a natural language query for data contained within at least one data repository; identifying at least one concept from the natural language query, wherein the at least one concept includes an entity and an intent; identifying a plurality of datasets satisfying the natural language query by querying the at least one data repository utilizing the at least one concept; ranking the dataset based on relevance to the query; generating an extract-transform-load script that extracts, transforms, and loads a dataset selected by the user from the plurality of datasets; and retrieving data included in the dataset utilizing the extract-transform-load script, wherein the retrieving includes returning the data to the user.
    Type: Grant
    Filed: October 29, 2020
    Date of Patent: April 18, 2023
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Manish Kesarwani, Sumit Bhatia, Sameep Mehta
  • Publication number: 20230021563
    Abstract: Methods, systems, and computer program products for federated data standardization using data privacy techniques are provided herein. A computer-implemented method includes obtaining multiple datasets from multiple clients in accordance with one or more data privacy techniques; determining one or more similar data columns across at least a portion of the multiple datasets; generating one or more column labels for the one or more similar data columns; standardizing at least a portion of data within the one or more similar data columns by processing the one or more generated column labels using at least one federated learning technique; and performing one or more automated actions based at least in part on results of the standardizing of the at least a portion of data within the one or more similar data columns.
    Type: Application
    Filed: July 23, 2021
    Publication date: January 26, 2023
    Inventors: Ramasuri Narayanam, Hima Patel, Sameep Mehta
  • Patent number: 11551102
    Abstract: One embodiment provides a method, including: receiving a target unstructured document for determining whether the target unstructured document comprises biased information; identifying an objective of the target unstructured document by extracting, from the target unstructured document, (i) entities and (ii) relationships between the entities; creating a structured knowledge base, wherein the creating comprises (i) creating an entry in the structured knowledge base corresponding to the target unstructured document, (ii) identifying other unstructured documents having a similarity to the target unstructured document, and (iii) generating an entry in the structured knowledge base corresponding to each of the other unstructured documents; applying a bias detection technique on the structured knowledge base; and providing an indication of whether the target unstructured document comprises bias.
    Type: Grant
    Filed: April 15, 2019
    Date of Patent: January 10, 2023
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Pranay Kumar Lohia, Rajmohan Chandrahasan, Himanshu Gupta, Samiulla Zakir Hussain Shaikh, Sameep Mehta, Atul Kumar
  • Patent number: 11544566
    Abstract: A method, computer system, and a computer program product for generating deep learning model insights using provenance data is provided. Embodiments of the present invention may include collecting provenance data. Embodiments of the present invention may include generating model insights based on the collected provenance data. Embodiments of the present invention may include generating a training model based on the generated model insights. Embodiments of the present invention may include reducing the training model size. Embodiments of the present invention may include creating a final trained model.
    Type: Grant
    Filed: June 3, 2019
    Date of Patent: January 3, 2023
    Assignee: International Business Machines Corporation
    Inventors: Nitin Gupta, Himanshu Gupta, Rajmohan Chandrahasan, Sameep Mehta, Pranay Kumar Lohia
  • Patent number: 11521065
    Abstract: Methods, systems, and computer program products for generating explanations for a semantic parser are provided herein. A computer-implemented method includes providing to a generative model (i) at least one query and (ii) a context of at least one dataset applicable to the at least one query, wherein the generative model generates a plurality of perturbations for the at least one input query based on the context; providing the plurality of perturbations as inputs to a context aware sequence-to-sequence model, thereby obtaining a plurality of outputs; and generating, for (i) an additional query provided as input to the context aware sequence-to-sequence model and (ii) a context applicable to the additional query, an explanation indicative of one or more parts of the additional query that contributes to an output corresponding to the additional query, based at least in part on the plurality of outputs corresponding to the perturbations.
    Type: Grant
    Filed: February 6, 2020
    Date of Patent: December 6, 2022
    Assignee: International Business Machines Corporation
    Inventors: Rachamalla Anirudh Reddy, Pranay Kumar Lohia, Samiulla Zakir Hussain Shaikh, Diptikalyan Saha, Sameep Mehta
  • Patent number: 11520986
    Abstract: Aspects of the present disclosure relate to neural-based ontology generation and refinement. A set of input data can be received. A set of entities can be extracted from the set of input data using a named-entity recognition (NER) process, each entity having a corresponding label, the corresponding labels making up a label set. The label set can be compared to concepts in a set of reference ontologies. Labels that match to concepts in the set of reference ontologies can be selected as a candidate concept set. Relations associated with the candidate concepts within the set of reference ontologies can be identified as a candidate relation set. An ontology can then be generated using the candidate concept set and candidate relation set.
    Type: Grant
    Filed: July 24, 2020
    Date of Patent: December 6, 2022
    Assignee: International Business Machines Corporation
    Inventors: Balaji Ganesan, Riddhiman Dasgupta, Akshay Parekh, Hima Patel, Berthold Reinwald, Sameep Mehta
  • Patent number: 11501176
    Abstract: A method, a system, and a computer program product are provided for analyzing an instructional video. Video data of an instructional video is analyzed to form multiple units of work. Each unit of work is a respective grouping of video frames of the instructional video based on a respective logical combination of activities associated therewith. Each unit of work is analyzed to produce a respective action graph of activities included in the unit of work, the respective action graph indicating interdependencies among the activities included therein. Interdependencies among activities across the units of work are determined to form a critical path graph. A received query is processed to provide troubleshooting assistance with respect to the instructional video based on the units of work, the action graphs, the critical path graph, and a knowledge base including information related to a subject matter of the instructional video.
    Type: Grant
    Filed: December 14, 2018
    Date of Patent: November 15, 2022
    Assignee: International Business Machines Corporation
    Inventors: Abhishek Mitra, Nitin Gupta, Shashank Mujumdar, Sameep Mehta
  • Patent number: 11501191
    Abstract: Asset recommendation for a particular input dataset is provided. Candidate data analysis assets having a corresponding relatedness score associated with the particular input dataset greater than a defined relatedness score threshold value are selected. Those candidate data analysis assets having a corresponding relatedness score greater than the defined relatedness score threshold value are ranked by score. Those candidate data analysis assets having a corresponding relatedness score greater than the defined relatedness score threshold value are listed by rank from highest to lowest. A justification for each candidate data analysis asset is inserted in the ranked list of candidate data analysis assets. The ranked list of candidate data analysis assets along with each respective justification is outputted on a display device.
    Type: Grant
    Filed: September 21, 2018
    Date of Patent: November 15, 2022
    Assignee: International Business Machines Corporation
    Inventors: Samiulla Shaikh, Sameep Mehta, Manish Bhide, William B. Lobig
  • Patent number: 11494802
    Abstract: A service receives a persuasion-based input comprising a text and one or more marketing objectives to persuade a desired response. The service evaluates persuasion values of text segments of the text and persuasion transition values consecutively between respective persuasion values of the persuasion values across the text segments. The service generates a desired curve of persuasion factors across the text segments according to the one or more marketing objectives. The service recommends one or more replacement words to replace one or more selected words in text to move a deviation between the persuasion values and transition values in comparison to the desired curve of persuasion factors.
    Type: Grant
    Filed: January 14, 2020
    Date of Patent: November 8, 2022
    Assignee: International Business Machines Corporation
    Inventors: Abhishek Shah, Ananya Aniruddha Poddar, Inkit Padhi, Nishtha Madaan, Sameep Mehta, Kuntal Dey
  • Publication number: 20220342869
    Abstract: Methods, systems, and computer program products for identifying anomalous transformations using lineage data are provided herein. A computer-implemented method includes generating a set of column profiles for a corresponding set of columns within one or more datasets based at least in part on lineage data and glossary data, wherein the lineage data comprises information related to transformations performed on each column in the set by a computing platform, and wherein the glossary data comprises information related to one or more terms assigned to one or more of the columns; obtaining information related to a new transformation involving at least one column in the set of columns; comparing the new transformation to the set of column profiles to determine whether the new transformation is anomalous; and in response to determining the new transformation is anomalous, outputting an alert to a user of the computing platform.
    Type: Application
    Filed: April 26, 2021
    Publication date: October 27, 2022
    Inventors: Rajmohan Chandrahasan, Himanshu Gupta, Sameep Mehta, Emma Rose Tucker, Andrzej Jan Wrobel
  • Patent number: 11483154
    Abstract: A method for blockchain certification of artificial intelligence factsheets that includes receiving by a computing device, an artificial intelligence model. The computing device generates an artificial intelligence factsheet based upon logic of the artificial intelligence model. The computing device generates a blockchain link for a blockchain. The blockchain link certifies the artificial intelligence factsheet. The computing device transmits the blockchain link certifying the artificial intelligence factsheet to other computing devices.
    Type: Grant
    Filed: February 19, 2020
    Date of Patent: October 25, 2022
    Assignee: International Business Machines Corporation
    Inventors: Kalapriya Kannan, Pranay Kumar Lohia, Samuel Hoffman, Kush Raj Varshney, Sameep Mehta
  • Patent number: 11475020
    Abstract: One embodiment provides a method, including: receiving, from a user, a dataset for encryption before its storage at a data storage location, wherein the dataset comprises a plurality of portions; identifying (i) attributes of the dataset and (ii) dataset dependencies; generating a recommendation for an encryption scheme to be used for the dataset, wherein the generating comprises (i) generating, based upon the attributes and the dataset dependencies, a recommendation of an encryption scheme for each portion of the dataset and (ii) identifying, based upon the dataset dependencies, a key label for each portion of the dataset, wherein the key label identified for a portion of the dataset that is dependent on another portion of the dataset is the same as the key label identified for said another portion of the dataset; and providing, to the user, (i) the generated recommendation and (ii) a description identifying reasons for the generated recommendation.
    Type: Grant
    Filed: June 6, 2019
    Date of Patent: October 18, 2022
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Manish Kesarwani, Akshar Kaul, Gagandeep Singh, Sameep Mehta, Hong Min, James Willis Pickel
  • Publication number: 20220261597
    Abstract: Embodiments are disclosed for a method. The method includes receiving an annotation set for a machine learning model. The annotation set includes multiple data points relevant to a task for the machine learning model. The method also includes determining total weights corresponding to the data points. The total weights are determined based on multiple ordering constraints indicating multiple data classes and corresponding weights. The corresponding weights represent a relative priority of the data classes with respect to each other. The method further includes generating an ordered annotation set from the annotation set. The ordered annotation set includes the data points in a sequence based on the determined total weights.
    Type: Application
    Filed: February 15, 2021
    Publication date: August 18, 2022
    Inventors: Naveen Panwar, Anush Sankaran, Kuntal Dey, Hima Patel, Sameep Mehta
  • Publication number: 20220237477
    Abstract: Methods, systems, and computer program products for factchecking artificial intelligence models using blockchain are provided herein. A computer-implemented method includes obtaining at least one artificial intelligence model and at least one set of data related to the at least one artificial intelligence model; determining a set of characteristics based at least in part on the at least one artificial intelligence model and the at least one set of data; selecting one of a plurality of networks based at least in part on a target deployment of the at least one artificial intelligence model to verify the set of characteristics; generating a report based at least in part on verifying the set of characteristics using the selected network, wherein the report establishes a threshold level of trust for the at least one artificial intelligence model; and storing the report on a blockchain.
    Type: Application
    Filed: January 22, 2021
    Publication date: July 28, 2022
    Inventors: Srikanth Govindaraj Tamilselvam, Sai Koti Reddy Danda, Senthil Kumar Kumarasamy Mani, Kalapriya Kannan, Sameep Mehta