Patents by Inventor Mustafa Canim

Mustafa Canim has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11989261
    Abstract: A computer answers a question using a data table. The computer receives a user question and a target table containing a target cell corresponding to a target answer for the user question, with the target cell corresponding to a target column and a target row. The computer generates a first classifier to provide column correlation values reflecting the probability that a given column is the target column. The computer generates a second classifier that provides row correlation values reflecting the probability that a given row is the target row. The computer applies the first classifier to the target table to determine a column correlation value for each column. The computer applies the second classifier to the target table to determine a row correlation value for each row. The computer suggests, as the target cell, a cell having elevated column and row correlation values relative to other target table cells.
    Type: Grant
    Filed: September 30, 2020
    Date of Patent: May 21, 2024
    Assignee: International Business Machines Corporation
    Inventors: Mustafa Canim, Michael Robert Glass, Alfio Massimiliano Gliozzo, Nicolas Rodolfo Fauceglia
  • Patent number: 11954613
    Abstract: A method, apparatus and computer program product for establishing a logical connection between an indirect utterance and a transaction is described. An indirect utterance is received from a user as an input to a conversational system. The indirect utterance is parsed to a first logical form. A first set of predicates and terms is mapped from the first logical form to a first subgraph in a knowledge graph. A second set of predicates and terms is mapped from a second logical form belonging to a transaction to a second subgraph of the knowledge graph. A best path in the knowledge graph between the first subgraph and the second subgraph is searched for while transforming the first logical form using the node and edge labels along the best path to generate an intermediate logical form. A system action is performed for a transaction if a graph structure of the intermediate logical form matches the graph structure of the logical form of the transaction above a threshold.
    Type: Grant
    Filed: February 1, 2018
    Date of Patent: April 9, 2024
    Assignee: International Business Machines Corporation
    Inventors: Mustafa Canim, Robert G. Farrell, Achille B. Fokoue-Nkoutche, John A. Gunnels, Ryan A. Musa, Vijay A. Saraswat
  • Patent number: 11915129
    Abstract: A system and a computer-implemented method for ranking tabular data entities by likelihood of comprising answers for (natural language) queries, based on multimodal descriptions of the tabular data entities, comprising separate representations, which represent different aspects of the tabular data entities. The ranking is based on joint representations, generated from the query representation and separate representations of the tabular data entities' aspects, using gated multimodal units. The computer-implemented method may be used for applications such as web searches, data aggregation, and research tasks.
    Type: Grant
    Filed: April 29, 2020
    Date of Patent: February 27, 2024
    Assignee: International Business Machines Corporation
    Inventors: Roee Shraga, Haggai Roitman, Guy Feigenblat, Mustafa Canim
  • Patent number: 11907842
    Abstract: A system comprises a memory that stores computer-executable components; and a processor, operably coupled to the memory, that executes the computer-executable components. The system includes a receiving component that receives a corpus of data; a relation extraction component that generates noisy knowledge graphs from the corpus; and a training component that acquires global representations of entities and relation by training from output of the relation extraction component.
    Type: Grant
    Filed: January 13, 2023
    Date of Patent: February 20, 2024
    Assignee: International Business Machines Corporation
    Inventors: Alfio Massimiliano Gliozzo, Sarthak Dash, Michael Robert Glass, Mustafa Canim
  • Publication number: 20230306199
    Abstract: A question answering bot that digests non-relational data tables is provided. A processor receives a question regarding a non-relational data table. A processor extracts at least one feature of the question using a natural language processing (NLP) model. A processor extracts at least one similar feature of the non-relational data table to the extracted at least one feature of the question. A processor determines at least one relevant cell in the non-relational data table based on the at least one feature of the question and the at least one similar feature of the non-relational data table. A processor provides an answer to the question, where the answer is based on the at least one relevant cell.
    Type: Application
    Filed: March 25, 2022
    Publication date: September 28, 2023
    Inventors: Vishwajeet Kumar, Jaydeep Sen, Samarth Bharadwaj, Saneem Ahmed Chemmengath, Ioannis Katsis, Mustafa Canim
  • Patent number: 11687514
    Abstract: Multimodal table encoding, including: Receiving an electronic document that contains a table. The table includes multiple rows, multiple columns, and a schema comprising column labels or row labels. The electronic document includes a description of the table which is located externally to the table. Next, operating separate machine learning encoders to separately encode the description, schema, each of the rows, and each of the columns of the table, respectively. The schema, the rows, and the columns are encoded together with end-of-column tokens and end-of-row tokens that mark an end of each column and row, respectively. Then, applying a machine learning gating mechanism to the encoded description, encoded schema, encoded rows, and encoded columns, to produce a fused encoding of the table, wherein the fused encoding is representative of both a structure of the table and a content of the table.
    Type: Grant
    Filed: July 15, 2020
    Date of Patent: June 27, 2023
    Assignee: International Business Machines Corporation
    Inventors: Roee Shraga, Haggai Roitman, Guy Feigenblat, Mustafa Canim
  • Patent number: 11681734
    Abstract: Organizing and/or aligning fragments of text that are included in a set of physical and/or digital documents so that the arrangement of the text fragments is in a readily understandable and meaningful format for a given reader. This organization and/or alignment uses a relation model of the various text fragments to correlate a meaning between and amongst the various text fragments to ultimately determine the final alignment and/or arrangement of those text fragments.
    Type: Grant
    Filed: December 9, 2020
    Date of Patent: June 20, 2023
    Assignee: International Business Machines Corporation
    Inventors: Robert G. Farrell, Mustafa Canim, Douglas Ronald Burdick
  • Publication number: 20230177335
    Abstract: A system comprises a memory that stores computer-executable components; and a processor, operably coupled to the memory, that executes the computer-executable components. The system includes a receiving component that receives a corpus of data; a relation extraction component that generates noisy knowledge graphs from the corpus; and a training component that acquires global representations of entities and relation by training from output of the relation extraction component.
    Type: Application
    Filed: January 13, 2023
    Publication date: June 8, 2023
    Inventors: Alfio Massimiliano Gliozzo, Sarthak Dash, Michael Robert Glass, Mustafa Canim
  • Patent number: 11636082
    Abstract: Ad-hoc table retrieval, including: Representing each of a plurality of tables as a multi-field text document in which: different modalities of the table are represented as separate fields, and a concatenation of all the modalities is represented as a separate, auxiliary field. Receiving a query. Executing the query on the multi-field text documents, to retrieve a list of preliminarily-ranked candidate tables out of the plurality of tables. Calculating an intrinsic table similarity score for each of the candidate tables, based on the query and the auxiliary field. Calculating an extrinsic table similarity score for each of the candidate tables, based on a cluster hypothesis of the candidate tables. Combining: the preliminary rankings, the intrinsic table similarity scores, and the extrinsic table similarity scores, to re-rank the candidate tables.
    Type: Grant
    Filed: June 23, 2020
    Date of Patent: April 25, 2023
    Assignee: International Business Machines Corporation
    Inventors: Haggai Roitman, Guy Feigenblat, Mustafa Canim, Roee Shraga
  • Patent number: 11574179
    Abstract: A system comprises a memory that stores computer-executable components; and a processor, operably coupled to the memory, that executes the computer-executable components. The system includes a receiving component that receives a corpus of data; a relation extraction component that generates noisy knowledge graphs from the corpus; and a training component that acquires global representations of entities and relation by training from output of the relation extraction component.
    Type: Grant
    Filed: January 7, 2019
    Date of Patent: February 7, 2023
    Assignee: International Business Machines Corporation
    Inventors: Alfio Massimiliano Gliozzo, Sarthak Dash, Michael Robert Glass, Mustafa Canim
  • Publication number: 20230016113
    Abstract: A processor-implemented method securely responds to a query for information from a data graph. One or more processors create an embedding for encrypted sensitive information in vertices in a data graph; and bucketize embedded encrypted sensitive information on an embedding graph, where bucketizing the embedded encrypted sensitive information clusters vertices from the graph database that have shared data graph features. The processor(s) receive a query of the data graph from a requester, where the query is for information related to the shared data graph features. The processor(s) retrieve a bucket from the embedding graph that contains the information related to the shared data graph features; and extract encrypted sensitive information from the retrieved bucket.
    Type: Application
    Filed: July 17, 2021
    Publication date: January 19, 2023
    Inventors: Mustafa Canim, Joshua F. Payne
  • Publication number: 20220391733
    Abstract: A policy violation detection computer-implemented method, system, and computer program product, includes predicting a violation of a portion of a policy by a future action and delivering an alert of the violation to a user device, where alert settings for privacy are toggled on the user device based on a user profile.
    Type: Application
    Filed: August 16, 2022
    Publication date: December 8, 2022
    Inventors: Mustafa Canim, Robert G. Farrell
  • Patent number: 11461682
    Abstract: A policy violation detection computer-implemented method, system, and computer program product, includes extracting a policy activity from a policy, the policy activity including an actor in the policy, an object of the policy, an action of the policy, and policy scope metadata, capturing a transaction by a user including metadata of the transaction, translating the transaction by the user into an actor in the transaction, an action of the transaction, and an object of the transaction, and alerting the user of a policy violation by navigating a knowledge graph is-a hierarchy to relate the actor in the transaction to the actor in the policy, the object of the transaction to an object of the policy, and the action of the transaction to an action of the policy activity.
    Type: Grant
    Filed: September 13, 2019
    Date of Patent: October 4, 2022
    Assignee: International Business Machines Corporation
    Inventors: Mustafa Canim, Robert G. Farrell
  • Publication number: 20220309107
    Abstract: Methods, systems, and computer program products for self-supervision in table question answering are provided herein. A computer-implemented method includes obtaining a table comprising a plurality of entries, wherein each entry corresponds to a particular column and particular row of the table; identifying one or more of the entries in the table that correspond to a target answer of a natural language query; generating an intermediate representation of the table comprising the rows corresponding to the identified one or more entries, wherein the intermediate representation masks each of the identified one or more entries; and generating a set of natural language question and answer pairs based on the intermediate representation.
    Type: Application
    Filed: March 29, 2021
    Publication date: September 29, 2022
    Inventors: Jaydeep Sen, Saneem Ahmed Chemmengath, Samarth Bharadwaj, Vishwajeet Kumar, Mustafa Canim
  • Publication number: 20220179896
    Abstract: Organizing and/or aligning fragments of text that are included in a set of physical and/or digital documents so that the arrangement of the text fragments is in a readily understandable and meaningful format for a given reader. This organization and/or alignment uses a relation model of the various text fragments to correlate a meaning between and amongst the various text fragments to ultimately determine the final alignment and/or arrangement of those text fragments.
    Type: Application
    Filed: December 9, 2020
    Publication date: June 9, 2022
    Inventors: Robert G. Farrell, Mustafa Canim, Douglas Ronald Burdick
  • Patent number: 11308083
    Abstract: An information processing system, a computer readable storage medium, and a computer-implemented method collect tables from a corpus of documents, convert the collected tables to a flattened table format, and organize them to be searchable by schema-less queries. A method collects tables, extracts feature values from collected table data and collected table meta-data for each collected table. A table classifier classifies each collected table as being a type of table. Based on the classifying, the collected table is converted to a flattened table including table values that are the table data and the table meta-data of the collected table. Dependencies of the data values are mapped. The flattened table and mapped dependencies are stored in a triple store searchable by schema-less queries. The table classifier learns and improves its accuracy and reliability. Dependency information is maintained among a plurality of database tables. The dependency information can be updated at variable update frequency.
    Type: Grant
    Filed: April 19, 2019
    Date of Patent: April 19, 2022
    Assignee: International Business Machines Corporation
    Inventors: Mustafa Canim, Cristina Cornelio, Arun Iyengar, Ryan A. Musa, Mariano Rodriguez Muro
  • Publication number: 20220101052
    Abstract: A computer answers a question using a data table. The computer receives a user question and a target table containing a target cell corresponding to a target answer for the user question, with the target cell corresponding to a target column and a target row. The computer generates a first classifier to provide column correlation values reflecting the probability that a given column is the target column. The computer generates a second classifier that provides row correlation values reflecting the probability that a given row is the target row. The computer applies the first classifier to the target table to determine a column correlation value for each column. The computer applies the second classifier to the target table to determine a row correlation value for each row. The computer suggests, as the target cell, a cell having elevated column and row correlation values relative to other target table cells.
    Type: Application
    Filed: September 30, 2020
    Publication date: March 31, 2022
    Inventors: Mustafa Canim, Michael Robert Glass, Alfio Massimiliano Gliozzo, Nicolas Rodolfo Fauceglia
  • Publication number: 20220043794
    Abstract: Multimodal table encoding, including: Receiving an electronic document that contains a table. The table includes multiple rows, multiple columns, and a schema comprising column labels or row labels. The electronic document includes a description of the table which is located externally to the table. Next, operating separate machine learning encoders to separately encode the description, schema, each of the rows, and each of the columns of the table, respectively. The schema, the rows, and the columns are encoded together with end-of-column tokens and end-of-row tokens that mark an end of each column and row, respectively. Then, applying a machine learning gating mechanism to the encoded description, encoded schema, encoded rows, and encoded columns, to produce a fused encoding of the table, wherein the fused encoding is representative of both a structure of the table and a content of the table.
    Type: Application
    Filed: July 15, 2020
    Publication date: February 10, 2022
    Inventors: Roee Shraga, Haggai Roitman, Guy Feigenblat, Mustafa Canim
  • Publication number: 20210397595
    Abstract: Ad-hoc table retrieval, including: Representing each of a plurality of tables as a multi-field text document in which: different modalities of the table are represented as separate fields, and a concatenation of all the modalities is represented as a separate, auxiliary field. Receiving a query. Executing the query on the multi-field text documents, to retrieve a list of preliminarily-ranked candidate tables out of the plurality of tables. Calculating an intrinsic table similarity score for each of the candidate tables, based on the query and the auxiliary field. Calculating an extrinsic table similarity score for each of the candidate tables, based on a cluster hypothesis of the candidate tables. Combining: the preliminary rankings, the intrinsic table similarity scores, and the extrinsic table similarity scores, to re-rank the candidate tables.
    Type: Application
    Filed: June 23, 2020
    Publication date: December 23, 2021
    Inventors: Haggai Roitman, Guy Feigenblat, Mustafa Canim, Roee Shraga
  • Patent number: 11194798
    Abstract: An information processing system, a computer readable storage medium, and a computer-implemented method collect tables from a corpus of documents, convert the collected tables to a flattened table format, and organize them to be searchable by schema-less queries. A method collects tables, extracts feature values from collected table data and collected table meta-data for each collected table. A table classifier classifies each collected table as being a type of table. Based on the classifying, the collected table is converted to a flattened table including table values that are the table data and the table meta-data of the collected table. Dependencies of the data values are mapped. The flattened table and mapped dependencies are stored in a triple store searchable by schema-less queries. The table classifier learns and improves its accuracy and reliability. Dependency information is maintained among a plurality of database tables. The dependency information can be updated at variable update frequency.
    Type: Grant
    Filed: April 19, 2019
    Date of Patent: December 7, 2021
    Assignee: International Business Machines Corporation
    Inventors: Mustafa Canim, Cristina Cornelio, Arun Iyengar, Ryan A. Musa, Mariano Rodriguez Muro
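
Illustrative sketch (not taken from any patent text): the abstract of patent 11989261 in the listing above describes scoring each column and each row of a table against a question, then suggesting the cell whose column and row scores are both elevated. The Python below shows one way that idea could look. Everything in it is an assumption made for illustration: the function names overlap_score and suggest_cell are hypothetical, a simple token-overlap measure stands in for the trained classifiers, and multiplying the two scores is just one possible way to combine them, since the abstract does not specify how they are combined.

```python
from typing import List, Sequence, Tuple


def overlap_score(question: str, text: str) -> float:
    """Hypothetical stand-in for a trained classifier.

    Returns the fraction of question tokens that also appear in `text`.
    """
    q_tokens = set(question.lower().split())
    t_tokens = set(text.lower().split())
    if not q_tokens:
        return 0.0
    return len(q_tokens & t_tokens) / len(q_tokens)


def suggest_cell(
    question: str, header: Sequence[str], rows: List[Sequence[str]]
) -> Tuple[int, int]:
    """Return (row_index, column_index) of the suggested answer cell."""
    # Column score: compare the question with each column label.
    col_scores = [overlap_score(question, label) for label in header]
    # Row score: compare the question with the concatenated text of each row.
    row_scores = [overlap_score(question, " ".join(row)) for row in rows]
    # Assumption: combine the two scores by multiplication and take the maximum.
    candidates = (
        (r, c) for r in range(len(rows)) for c in range(len(header))
    )
    return max(candidates, key=lambda rc: row_scores[rc[0]] * col_scores[rc[1]])


if __name__ == "__main__":
    header = ["country", "capital", "population"]
    rows = [["France", "Paris", "67"], ["Japan", "Tokyo", "125"]]
    r, c = suggest_cell("what is the capital of japan", header, rows)
    print(rows[r][c])  # prints "Tokyo"
```

In this toy table only the "capital" column and the "Japan" row share tokens with the question, so their intersection is the single cell with a non-zero combined score. A real system would replace overlap_score with learned models.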