Patents by Inventor Vadim Sheinin

Vadim Sheinin has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11966389
    Abstract: A method (and structure and computer product) of machine translation for processing input questions includes receiving, in a processor on a computer, an input question presented in a natural language. The input question is preprocessed to find one or more condition values for possible Structured Query Language (SQL) queries. One or more possible SQL queries are enumerated based on the one or more found condition values and a paraphrasing model is used to rank the enumerated SQL queries. The highest ranked SQL query is executed against a relational database to search for a response to the input question.
    Type: Grant
    Filed: February 13, 2019
    Date of Patent: April 23, 2024
    Assignee: International Business Machines Corporation
    Inventors: Vadim Sheinin, Zhiguo Wang, Lingfei Wu, Kun Xu
  • Patent number: 11907226
    Abstract: A computer-implemented method, a computer system and a computer program product create rules for a rule-based natural language interface for databases (NLIDB). The method may include receiving a natural language query from a user. The method may also include generating a first explanation for the natural language query using a deep learning model and a second explanation for the natural language query using the rule-based NLIDB and validating whether the first and second explanations correctly represent the natural language query. The method may further include identifying the database value in the first explanation in response to the first explanation correctly representing the natural language query and the second explanation not correctly representing the natural language query. Lastly, the method may include creating a rule in a table for the rule-based natural language interface for databases that associates the database value with the original word of the natural language query.
    Type: Grant
    Filed: March 21, 2022
    Date of Patent: February 20, 2024
    Assignee: International Business Machines Corporation
    Inventors: Ngoc Phuoc An Vo, Vadim Sheinin, Elahe Khorasani, Hangu Yeo
  • Patent number: 11797887
    Abstract: Techniques for mapping policy documents to regulatory documents to check for compliance between the policies and documents are provided. In one example, a computer-implemented method determining, by a system operatively coupled to a processor, an information input, a control framework, and a document from a first group consisting of a regulatory document and a policy document, wherein the information input is a corpora from a second group consisting of a domain corpora and a global corpora. The computer-implemented method can also comprise mapping, by the system, the received regulatory document or the received policy document to the control framework using a supervised machine learning technique.
    Type: Grant
    Filed: December 28, 2020
    Date of Patent: October 24, 2023
    Assignee: International Business Machines Corporation
    Inventors: Swapna Buccapatnam Tirumala, Ashish Jagmohan, Elham Khabiri, Ta-Hsin Li, Matthew Daniel Riemer, Vadim Sheinin, Aditya Vempaty
  • Publication number: 20230306022
    Abstract: An embodiment for identifying and replacing logically neutral phrases in natural language queries may include receiving a natural language query. The embodiment may also identify one or more logically neutral or non-logically neutral anchors in the natural language query. The embodiment may also identify boundaries containing one or more logically neutral phrases. The embodiment may further include detecting semantic and logical relations between verbal phrases and functional language between and adjacent to the one or more logically neutral and non-logically neutral anchors to reintroduce non-logically neutral language back into a non-logically neutral portion of the natural language query. The embodiment may also include generating a modified natural language query by automatically removing the boundaries and optionally replacing the one or more logically neutral phrases in the natural language query.
    Type: Application
    Filed: March 28, 2022
    Publication date: September 28, 2023
    Inventors: Octavian Popescu, Vadim Sheinin, Ngoc Phuoc An Vo, Elahe Khorasani, Hangu Yeo
  • Publication number: 20230297577
    Abstract: A computer-implemented method, a computer system and a computer program product create rules for a rule-based natural language interface for databases (NLIDB). The method may include receiving a natural language query from a user. The method may also include generating a first explanation for the natural language query using a deep learning model and a second explanation for the natural language query using the rule-based NLIDB and validating whether the first and second explanations correctly represent the natural language query. The method may further include identifying the database value in the first explanation in response to the first explanation correctly representing the natural language query and the second explanation not correctly representing the natural language query. Lastly, the method may include creating a rule in a table for the rule-based natural language interface for databases that associates the database value with the original word of the natural language query.
    Type: Application
    Filed: March 21, 2022
    Publication date: September 21, 2023
    Inventors: Ngoc Phuoc An Vo, Vadim Sheinin, Elahe Khorasani, Hangu Yeo
  • Publication number: 20230274087
    Abstract: A natural language processor and applicable method receives an input sentence of natural language from a natural language corpus. The input sentence comprises sentence clauses that include a conditional clause. The processor performs natural language processing (NLP), using an NLP model, on the input sentence. The processing comprises using a set of rules determining the sentence clauses and which of the sentence clause is the conditional clause, determining one or more logical connections between the sentence clauses, and determining a role of the sentence clauses based upon the one or more identified logical connections. The sentence clauses are tagged to produce a labeled sentence that is output to an entity that is one or more of a storage device, a network interface, a storage device, and an input of a further language processor or application.
    Type: Application
    Filed: February 28, 2022
    Publication date: August 31, 2023
    Inventors: Octavian Popescu, Irene Lizeth Manotas Gutiérrez, Vadim Sheinin, Ngoc Phuoc An Vo, Algimantas Cerniauskas
  • Patent number: 11693855
    Abstract: Methods, systems and computer readable media are provided for automatically creating a semantic model of a relational database for processing natural language queries. A computing device automatically extracts relational database metadata. The computing device prompts a user to enter textual labels for columns of the extracted metadata. The computing device automatically generates a schema annotation file based upon the relational database metadata and the textual labels for the columns. A natural language query is processed for the relational database using the schema annotation file.
    Type: Grant
    Filed: December 20, 2019
    Date of Patent: July 4, 2023
    Assignee: International Business Machines Corporation
    Inventors: Elahe Khorasani, Hangu Yeo, Octavian Popescu, Vadim Sheinin
  • Publication number: 20230036196
    Abstract: An embodiment of the present invention is a prime representation data structure in a computer architecture. The prime representation data structure has a plurality of records where each record contains a prime representation and where the prime representation is a product of two or more selected prime factors. Each of the selected prime factor associated with an n-gram of a domain representation of a domain string. The domain representation of the domain string is a domain string of ordered, contiguous domain characters. The n-gram being a subset of n number of the ordered, contiguous domain characters in the domain string. The computer architecture performs string searching and includes one or more central processing units (CPUs) with one or more operating systems, one or more input/output device interfaces, one or more memories, and one or more input/output devices.
    Type: Application
    Filed: July 27, 2021
    Publication date: February 2, 2023
    Inventors: Octavian Popescu, Vadim Sheinin, Bijan Davari, Gheorghe Almasi
  • Publication number: 20220366147
    Abstract: A method of authoring a conversation service for a chatbot and a database includes receiving, from a user, a selection of a database, and connecting an authoring service of the chatbot to a table in the database; outputting, from the authoring service to the user, a question requesting a description of a subject matter of the table; receiving the description of the subject matter of the table; outputting to the user a question requesting an identification of a key column of the table that contains values that represent the subject matter of the table; receiving the identification of the key column of the table; and translating, by a natural language query service, the description of the subject matter of the table and the key column of the table into the conversation service, wherein the conversation service includes SQL statements suitable for querying the database table.
    Type: Application
    Filed: May 17, 2021
    Publication date: November 17, 2022
    Inventors: Tin Kam Ho, Vadim Sheinin, Elahe Khorasani
  • Publication number: 20220318523
    Abstract: A computer system extracts clauses using machine translation. An input sentence in a source language is translated into a translated sentence in a target language using a trained machine translation model, wherein the trained machine translation model inserts a grammatical indicator into a position of the translated sentence that identifies a dependent clause. The input sentence and the translated sentence are aligned to determine a position in the input sentence that corresponds to the position of the grammatical indicator in the translated sentence. The dependent clause is extracted, in the source language, from the input sentence based on the determined position in the input sentence. Embodiments of the present invention further include a method and program product for clause extraction using machine translation in substantially the same manner described above.
    Type: Application
    Filed: March 31, 2021
    Publication date: October 6, 2022
    Inventors: Vadim Sheinin, Octavian Popescu, Ngoc Phuoc An Vo, Irene Lizeth Manotas Gutiérrez
  • Patent number: 11334721
    Abstract: A corpus pattern paraphrasing method, system, and non-transitory computer readable medium, include aligning slots of patterns for verbal phrases based on syntactical and lexical features along with calculated synonyms to predict paraphrases that are not previously stored in a corpus of sentences in a database.
    Type: Grant
    Filed: July 31, 2019
    Date of Patent: May 17, 2022
    Assignee: International Business Machines Corporation
    Inventors: Octavian Popescu, Vadim Sheinin
  • Patent number: 11138383
    Abstract: Methods, systems, and computer program products for extracting meaning representation from text are provided herein.
    Type: Grant
    Filed: August 21, 2019
    Date of Patent: October 5, 2021
    Assignee: International Business Machines Corporation
    Inventor: Vadim Sheinin
  • Publication number: 20210191936
    Abstract: Methods, systems and computer readable media are provided for automatically creating a semantic model of a relational database for processing natural language queries. A computing device automatically extracts relational database metadata. The computing device prompts a user to enter textual labels for columns of the extracted metadata. The computing device automatically generates a schema annotation file based upon the relational database metadata and the textual labels for the columns. A natural language query is processed for the relational database using the schema annotation file.
    Type: Application
    Filed: December 20, 2019
    Publication date: June 24, 2021
    Inventors: Elahe Khorasani, Hangu Yeo, Octavian Popescu, Vadim Sheinin
  • Publication number: 20210117794
    Abstract: Techniques for mapping policy documents to regulatory documents to check for compliance between the policies and documents are provided. In one example, a computer-implemented method determining, by a system operatively coupled to a processor, an information input, a control framework, and a document from a first group consisting of a regulatory document and a policy document, wherein the information input is a corpora from a second group consisting of a domain corpora and a global corpora. The computer-implemented method can also comprise mapping, by the system, the received regulatory document or the received policy document to the control framework using a supervised machine learning technique.
    Type: Application
    Filed: December 28, 2020
    Publication date: April 22, 2021
    Inventors: Swapna Buccapatnam Tirumala, Ashish Jagmohan, Elham Khabiri, Ta-Hsin Li, Matthew Daniel Riemer, Vadim Sheinin, Aditya Vempaty
  • Publication number: 20210056173
    Abstract: Methods, systems, and computer program products for extracting meaning representation from text are provided herein.
    Type: Application
    Filed: August 21, 2019
    Publication date: February 25, 2021
    Inventor: Vadim Sheinin
  • Patent number: 10922621
    Abstract: Techniques for mapping policy documents to regulatory documents to check for compliance between the policies and documents are provided. In one example, a computer-implemented method determining, by a system operatively coupled to a processor, an information input, a control framework, and a document from a first group consisting of a regulatory document and a policy document, wherein the information input is a corpora from a second group consisting of a domain corpora and a global corpora. The computer-implemented method can also comprise mapping, by the system, the received regulatory document or the received policy document to the control framework using a supervised machine learning technique.
    Type: Grant
    Filed: November 11, 2016
    Date of Patent: February 16, 2021
    Assignee: International Business Machines Corporation
    Inventors: Swapna Buccapatnam Tirumala, Ashish Jagmohan, Elham Khabiri, Ta-Hsin Li, Matthew Daniel Riemer, Vadim Sheinin, Aditya Vempaty
  • Patent number: 10902937
    Abstract: There is provided an apparatus and a processor-implemented method. The method includes aligning a reference genome with a plurality of DNA sequences. Each of the plurality of DNA sequences has a respective plurality of bases. The method further includes classifying and sorting the plurality of read sequences based on respective numbers of mismatched bases within the plurality of read sequences to obtain a plurality of re-arranged DNA sequences. The method also includes building a histogram based on respective positions of mismatched bases within the plurality of re-arranged DNA sequences. The method additionally includes coding at least some of the plurality of re-arranged DNA sequences based on the histogram.
    Type: Grant
    Filed: February 12, 2014
    Date of Patent: January 26, 2021
    Assignee: International Business Machines Corporation
    Inventors: Vadim Sheinin, Hangu Yeo
  • Patent number: 10902208
    Abstract: A semantic parsing method using a graph-to-sequence model, system, and computer program product include generating a syntactic graph for a sentence, generating node embeddings for each node based on other nodes the each node is connected to in the syntactic graph, generating a graph embedding over the node embeddings, performing attention-based recurrent neural network (RNN) decoding of the graph embedding and the node embeddings, and providing a logical translation of the sentence based on the decoding.
    Type: Grant
    Filed: September 28, 2018
    Date of Patent: January 26, 2021
    Assignee: International Business Machines Corporation
    Inventors: Kun Xu, Lingfei Wu, Zhiguo Wang, Vadim Sheinin
  • Publication number: 20200311057
    Abstract: A method, system and apparatus of processing queries, including inputting a query as query data, generating paraphrases from the query data, and normalizing the generated paraphrases according to predefined annotations of a schema.
    Type: Application
    Filed: March 28, 2019
    Publication date: October 1, 2020
    Inventors: Hangu Yeo, Octavian Popescu, Elahe Khorasani, Vadim Sheinin
  • Publication number: 20200257679
    Abstract: A method (and structure and computer product) of machine translation for processing input questions includes receiving, in a processor on a computer, an input question presented in a natural language. The input question is preprocessed to find one or more condition values for possible Structured Query Language (SQL) queries. One or more possible SQL queries are enumerated based on the one or more found condition values and a paraphrasing model is used to rank the enumerated SQL queries. The highest ranked SQL query is executed against a relational database to search for a response to the input question.
    Type: Application
    Filed: February 13, 2019
    Publication date: August 13, 2020
    Inventors: Vadim Sheinin, Zhiguo Wang, Lingfei Wu, Kun Xu
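
The question-to-SQL abstract above (patent 11966389, publication 20200257679) describes four steps: find condition values in the question, enumerate candidate SQL queries, rank them with a paraphrasing model, and execute the top-ranked query. The following is only a minimal sketch of that flow, not the patented implementation. The table, the column names, the templated paraphrase, and every function name are hypothetical, and a simple token-overlap score stands in for the paraphrasing model, whose details the abstract does not give.

```python
import re
import sqlite3


def build_demo_db():
    """Create a small in-memory table (hypothetical data for illustration)."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE employees (name TEXT, city TEXT, salary INTEGER)")
    conn.executemany(
        "INSERT INTO employees VALUES (?, ?, ?)",
        [("Alice", "Boston", 5000), ("Bob", "Austin", 4000)],
    )
    return conn


def find_condition_values(question, conn, table, columns):
    """Preprocessing: collect (column, value) pairs whose text value occurs in the question."""
    q = question.lower()
    found = []
    for col in columns:  # identifiers are trusted here; this is a sketch only
        for (value,) in conn.execute(f"SELECT DISTINCT {col} FROM {table}"):
            if isinstance(value, str) and value.lower() in q:
                found.append((col, value))
    return found


def enumerate_queries(table, columns, conditions):
    """Enumerate one candidate query per column that is not used as a condition."""
    where = " AND ".join(f"{c} = ?" for c, _ in conditions)
    params = [v for _, v in conditions]
    used = {c for c, _ in conditions}
    candidates = []
    for col in columns:
        if col in used:
            continue
        sql = f"SELECT {col} FROM {table}" + (f" WHERE {where}" if where else "")
        # Templated English rendering of the query, compared against the question.
        paraphrase = f"what is the {col}" + "".join(
            f" where {c} is {v}" for c, v in conditions
        )
        candidates.append((sql, params, paraphrase))
    return candidates


def paraphrase_score(question, paraphrase):
    """Stand-in for the paraphrasing model: Jaccard overlap of word sets."""
    a = set(re.findall(r"[a-z0-9]+", question.lower()))
    b = set(re.findall(r"[a-z0-9]+", paraphrase.lower()))
    return len(a & b) / len(a | b) if a | b else 0.0


def answer(question, conn, table, columns):
    """Run the full flow and return the rows of the highest-ranked query."""
    conditions = find_condition_values(question, conn, table, columns)
    candidates = enumerate_queries(table, columns, conditions)
    if not candidates:
        return []
    sql, params, _ = max(candidates, key=lambda c: paraphrase_score(question, c[2]))
    return conn.execute(sql, params).fetchall()


conn = build_demo_db()
print(answer("What is the salary of Alice?", conn, "employees", ["name", "city", "salary"]))
# prints [(5000,)]
```

Condition values are passed as bound parameters rather than spliced into the SQL string, so only the trusted table and column identifiers are interpolated.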