Patents by Inventor Sameep Mehta
Sameep Mehta has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Publication number: 20250138920Abstract: Various systems and methods are presented regarding implementing one or more capabilities into a dialog flow occurring at an automated interface (e.g., a chatbot). A capability can be invoked at the interface in accordance with a user's requirements, e.g., the capability is a function to review data. An application programming interface (API) can be generated from the capability, wherein the API has features, parameters, metadata, etc., generated based on those of the capability. The API can be incorporated into a dialog, wherein the dialog can be subsequently presented on the interface (e.g., as part of a dialog flow). Interaction between the user and the dialog can cause the capability to be executed. Based upon the API features, etc., the API can be incorporated into a dialog, for example, by cloning a dialog, appending a dialog with the API, replacing a pre-existing API with the API in a dialog, and suchlike.Type: ApplicationFiled: October 26, 2023Publication date: May 1, 2025Inventors: Manish Kesarwani, Ankush Gupta, Arvind Agarwal, Binayak Dutta, Soujanya Soni, Sameep Mehta
-
Patent number: 12288044Abstract: A computer implemented method creates microservices for an application. A number of processor units clusters programs and data structures for the application using runtime metadata to form groups of the programs and data structures. The runtime metadata is obtained from running the application. The number of processor units creates a design for the microservices for the application using the groups of the programs and the data structures.Type: GrantFiled: January 26, 2023Date of Patent: April 29, 2025Assignee: International Business Machines CorporationInventors: Akshar Kaul, Himanshu Gupta, Sameep Mehta, Srikanth Govindaraj Tamilselvam, Amith Singhee, Vaibhav Sudhakar Dantale, Ravi Vishnu Israni
-
Patent number: 12277507Abstract: Methods, systems, and computer program products for factchecking artificial intelligence models using blockchain are provided herein. A computer-implemented method includes obtaining at least one artificial intelligence model and at least one set of data related to the at least one artificial intelligence model; determining a set of characteristics based at least in part on the at least one artificial intelligence model and the at least one set of data; selecting one of a plurality of networks based at least in part on a target deployment of the at least one artificial intelligence model to verify the set of characteristics; generating a report based at least in part on verifying the set of characteristics using the selected network, wherein the report establishes a threshold level of trust for the at least one artificial intelligence model; and storing the report on a blockchain.Type: GrantFiled: January 22, 2021Date of Patent: April 15, 2025Assignee: International Business Machines CorporationInventors: Srikanth Govindaraj Tamilselvam, Sai Koti Reddy Danda, Senthil Kumar Kumarasamy Mani, Kalapriya Kannan, Sameep Mehta
-
Publication number: 20250110970Abstract: A processor set is configured to receive tabular data records and generate a plurality of clusters, associated with specific real-world entities, within the received tabular data records, wherein each cluster is associated with a specific real-world entity. The processor set may further identify informative features within a first cluster and mask a subset of the informative features. Based on the masked subset of informative features and using self-supervision techniques, the processor set may train a tabular foundation model.Type: ApplicationFiled: September 29, 2023Publication date: April 3, 2025Inventors: Balaji Ganesan, Avirup Saha, Muhammed Abdul Majeed Ameen, Soma Shekar Naganna, Sameep Mehta
-
Patent number: 12248810Abstract: The method performs at the orchestration interface at which update information, including changes to tasks of a workflow, is received from a task manager system (TMS), where the workflow includes a set of tasks, inputs to the tasks, and outputs from the tasks. The inputs and outputs determine runtime dependencies between the tasks. Based on the update information received, the orchestration interface populates a topology of nodes and edges as a directed acyclic graph (DAG) that maps nodes to tasks and edges to runtime dependencies between tasks, based on node inputs and outputs. The orchestration interface instructs the execution of the tasks and handling dependencies by interacting with a task execution system (TES) and by traversing the DAG, the orchestration interface identifies tasks that depend on completed tasks as per the runtime dependencies and instructs the TES to execute the dependent tasks identified.Type: GrantFiled: June 15, 2022Date of Patent: March 11, 2025Assignee: International Business Machines CorporationInventors: Anton Zorin, Manish Kesarwani, Niels Dominic Pardon, Ritesh Kumar Gupta, Sameep Mehta
-
Publication number: 20250077537Abstract: A method, system, and computer program product are configured to: receive a data transfer request to transfer a dataset stored on a source cloud to a destination cloud; determine a target view of the data transfer request based on one or more policies; determine, using lineage metadata, a first portion of the target view exists in one or more copies of a dataset stored on the destination cloud; extract data corresponding to the first portion from the one or more copies of the dataset stored on the destination cloud; create the target view using the extracted data; and serve the data transfer request using the created target viewType: ApplicationFiled: August 31, 2023Publication date: March 6, 2025Inventors: Christopher J. Giblin, Rajmohan Chandrahasan, Himanshu GUPTA, Sameep Mehta
-
Publication number: 20250077877Abstract: A computer-implemented method for improving entity matching in a probabilistic matching engine can train a graph neural network (GNN) model on an output of a probabilistic matching engine to perform entity matching and determine counterfactual explanations for non-matches of entities. A list of data transformations can be identified by actionable recourse using the GNN model. The list of data transformations can be ranked, using the GNN model, based on computational overhead and an estimated improvement in entity matching within the probabilistic matching engine.Type: ApplicationFiled: August 29, 2023Publication date: March 6, 2025Inventors: Balaji Ganesan, Sameep Mehta, Muhammed Abdul Majeed Ameen, Abhishek Seth, Soma Shekar Naganna
-
Publication number: 20240256226Abstract: A computer implemented method creates microservices for an application. A number of processor units clusters programs and data structures for the application using runtime metadata to form groups of the programs and data structures. The runtime metadata is obtained from running the application. The number of processor units creates a design for the microservices for the application using the groups of the programs and the data structures.Type: ApplicationFiled: January 26, 2023Publication date: August 1, 2024Inventors: Akshar Kaul, Himanshu Gupta, Sameep Mehta, Srikanth Govindaraj Tamilselvam, Amith Singhee, Vaibhav Sudhakar Dantale, Ravi Vishnu Israni
-
Patent number: 12045291Abstract: Records can be matched by a graph neural network model performing entity resolution on the records, and representing each record as a respective node in a graph. Record matching explanations can be generated, each record matching explanation indicating a first set of attributes, and a first set of corresponding values, used for the matching at least two of the records. Nodes can be clustered into a plurality of clusters by aggregating the record matching explanations and, based on the record matching explanations, determining which of the records have high importance values, in the first set of values, that match. At least one cluster explanation can be generated, the cluster explanation indicating a second set of attributes, and a second set of values corresponding to the second set of attributes, used for the clustering the nodes. The record matching explanation and the cluster explanation can be output.Type: GrantFiled: November 3, 2022Date of Patent: July 23, 2024Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Muhammed Abdul Majeed Ameen, Balaji Ganesan, Avirup Saha, Abhishek Seth, Devbrat Sharma, Arvind Agarwal, Soma Shekar Naganna, Sameep Mehta
-
Publication number: 20240202573Abstract: A method, computer program product, and computer system for transforming sets of source data having different formats into respective sets of target data having a same format. N source patterns are determined and respectively describe N different formats in which N sets of source data items are formatted, where N?1. A target format pattern is determined and describes a target format in which a target data items are formatted. N graphs are generated and respectively describe transformations of the N source patterns to the target pattern. Each graph includes multiple transformation paths. Each transformation path transforms the source pattern to the target pattern in a manner that maps source strings in the source pattern to each target string in the target pattern. A single transformation path is selected from the multiple transformation paths resulting in N single transformation paths having been selected.Type: ApplicationFiled: December 19, 2022Publication date: June 20, 2024Inventors: Nagarjuna Surabathina, Nitin Gupta, Shramona Chakraborty, Hima Patel, Sameep Mehta, Ramkumar Ramalingam, Matu Agarwal
-
Publication number: 20240152557Abstract: Records can be matched by a graph neural network model performing entity resolution on the records, and representing each record as a respective node in a graph. Record matching explanations can be generated, each record matching explanation indicating a first set of attributes, and a first set of corresponding values, used for the matching at least two of the records. Nodes can be clustered into a plurality of clusters by aggregating the record matching explanations and, based on the record matching explanations, determining which of the records have high importance values, in the first set of values, that match. At least one cluster explanation can be generated, the cluster explanation indicating a second set of attributes, and a second set of values corresponding to the second set of attributes, used for the clustering the nodes. The record matching explanation and the cluster explanation can be output.Type: ApplicationFiled: November 3, 2022Publication date: May 9, 2024Inventors: Muhammed Abdul Majeed Ameen, Balaji Ganesan, Avirup Saha, Abhishek Seth, Devbrat Sharma, Arvind Agarwal, Soma Shekar Naganna, Sameep Mehta
-
Patent number: 11966453Abstract: Embodiments are disclosed for a method. The method includes receiving an annotation set for a machine learning model. The annotation set includes multiple data points relevant to a task for the machine learning model. The method also includes determining total weights corresponding to the data points. The total weights are determined based on multiple ordering constraints indicating multiple data classes and corresponding weights. The corresponding weights represent a relative priority of the data classes with respect to each other. The method further includes generating an ordered annotation set from the annotation set. The ordered annotation set includes the data points in a sequence based on the determined total weights.Type: GrantFiled: February 15, 2021Date of Patent: April 23, 2024Assignee: International Business Machines CorporationInventors: Naveen Panwar, Anush Sankaran, Kuntal Dey, Hima Patel, Sameep Mehta
-
Publication number: 20240086780Abstract: A method, computer program, and computer system are provided for determining similar nodes in a federated learning environment. Data corresponding to a dataset associated with a node in the federated learning environment is retrieved by the node. A frequency distribution associated with the dataset is calculated and transmitted to an aggregator. One or more frequency distributions associated with one or more other nodes in the federated learning environment are received from the aggregator. Based on the received frequency distributions associated with the one or more other nodes, a similarity between the node and a subset of the one or more other nodes is identified.Type: ApplicationFiled: September 12, 2022Publication date: March 14, 2024Inventors: Soujanya Soni, Sameep Mehta
-
Patent number: 11921861Abstract: Methods, systems, and computer program products for providing the status of model extraction in the presence of colluding users are provided herein. A computer-implemented method includes generating, for each of multiple users, a summary of user input to a machine learning model; comparing the generated summaries to boundaries of multiple feature classes within an input space of the machine learning model; computing correspondence metrics based at least in part on the comparisons; identifying, based at least in part on the computed metrics, one or more of the multiple users as candidates for extracting portions of the machine learning model in an adversarial manner; and generating and outputting an alert, based on the identified users, to an entity related to the machine learning model.Type: GrantFiled: May 21, 2018Date of Patent: March 5, 2024Assignee: International Business Machines CorporationInventors: Manish Kesarwani, Vijay Arya, Sameep Mehta
-
Publication number: 20240070350Abstract: An example operation may include one or more of identifying an external system that passes an input attribute to a process based on a workflow representation of the process, building a simulator of the external system based on attributes of the external system identified from the workflow representation, simulating future values of the input attribute to be passed to the process by the external system based on the simulator of the external system and a previous simulation run of the process performed via a workflow software application, and executing a new simulation of the process via the workflow software application based on the simulated future values of the input attribute.Type: ApplicationFiled: August 23, 2022Publication date: February 29, 2024Inventors: Rakesh Rameshrao Pimplikar, Ritwik Chaudhuri, Pranay Kumar Lohia, Ramasuri Narayanam, Sameep Mehta, Gyana Ranjan Parija
-
Publication number: 20240070519Abstract: A method, computer program, and computer system are provided for online fairness monitoring. A dataset having one or more entries with one or more protected attributes and data corresponding to a trained machine learning model is received. An entry having a maximum reward is selected based on a reward probability associated with the entry. A determination is made as to whether bias has developed in the trained machine learning model toward one or more of the one or more protected attributes based on a change to the reward probability or a distribution of reward probabilities exceeding a threshold value.Type: ApplicationFiled: August 26, 2022Publication date: February 29, 2024Inventors: Manish Kesarwani, Pranay Kumar Lohia, Ramasuri Narayanam, Rakesh Rameshrao Pimplikar, Sameep Mehta
-
Publication number: 20240045896Abstract: Mechanisms are provided for dynamic re-resolution of entities in a knowledge graph (KG) based on streaming updates. The KG and corresponding initial clusters associated with first entities are received along with a dynamic data stream having second documents referencing second entities. Clustering on the second documents based on the set of initial clusters, and document features of the second documents, is performed to provide a set of second document clusters. For second document clusters that should be modified based on entities associated with the second document cluster, a cluster modification operation is performed. Updated clusters are generated based on the clustering and modification of clusters. Entity re-resolution is dynamically performed on the entities in the KG based on the second entities associated with the updated clusters to generate an updated knowledge graph data structure.Type: ApplicationFiled: August 4, 2022Publication date: February 8, 2024Inventors: Avirup Saha, Balaji Ganesan, Soma Shekar Naganna, Sameep Mehta
-
Publication number: 20230409386Abstract: The method performs at the orchestration interface at which update information, including changes to tasks of a workflow, is received from a task manager system (TMS), where the workflow includes a set of tasks, inputs to the tasks, and outputs from the tasks. The inputs and outputs determine runtime dependencies between the tasks. Based on the update information received, the orchestration interface populates a topology of nodes and edges as a directed acyclic graph (DAG) that maps nodes to tasks and edges to runtime dependencies between tasks, based on node inputs and outputs. The orchestration interface instructs the execution of the tasks and handling dependencies by interacting with a task execution system (TES) and by traversing the DAG, the orchestration interface identifies tasks that depend on completed tasks as per the runtime dependencies and instructs the TES to execute the dependent tasks identified.Type: ApplicationFiled: June 15, 2022Publication date: December 21, 2023Inventors: Anton Zorin, Manish Kesarwani, Niels Dominic Pardon, Ritesh Kumar Gupta, Sameep Mehta
-
Patent number: 11790265Abstract: Aspects of the present invention provide an approach for reducing bias in active learning. In an embodiment, a data point is selected from a training dataset for a current training iteration while monitoring for data bias at each addition of data to a virtual training dataset. In addition, a machine learning model is examined for bias after adding the selected data point to the virtual training dataset. When data bias and/or model bias is detected, the data point is considered for potential label modification. The selected data point is modified and, if the raw value of the modified data point is within a predefined tolerance and within a bin of a desired class, the modified data point having a label of the target class is retained. Otherwise, it can be discarded.Type: GrantFiled: July 31, 2019Date of Patent: October 17, 2023Assignee: International Business Machines CorporationInventors: Kuntal Dey, Sameep Mehta, Manish Anand Bhide
-
Patent number: 11768860Abstract: An embodiment establishes a designated attribute value as a semantic criterion for grouping records in a bucket, identifies a first set of records having attribute values that satisfy the semantic criterion, and adds the first set of records to the bucket. The embodiment detects that the first set of records represent a first series of events that occurred in succession at respective times. The embodiment derives a temporal attribute value representative of a time pattern formed by the times of the first series of events and designates the temporal attribute value as a temporal criterion for grouping records in the bucket. The embodiment identifies a second set of records that represent a second series of events and satisfy the temporal criterion and adds the second set of records to the bucket based at least in part on the second set of records satisfying the temporal criterion.Type: GrantFiled: November 3, 2021Date of Patent: September 26, 2023Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Avirup Saha, Balaji Ganesan, Shettigar Parkala Srinivas, Sumit Bhatia, Sameep Mehta, Soma Shekar Naganna