Patents by Inventor Radu Florian

Radu Florian has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

SEMANTICS-GUIDED DOMAIN-SPECIFIC DATA AUGMENTATION FOR TEXT-TO-GRAPH PARSING

Publication number: 20250005287

Abstract: Systems and techniques that facilitate semantics-guided domain-specific data augmentation for text-to-graph parsing are provided. In various embodiments, a system can access an annotated training dataset, wherein the annotated training dataset can comprise a set of abstract meaning representation graphs respectively corresponding to a set of natural language sentences. In various aspects, the system can generate an augmented version of the annotated training dataset, based on applying semantics-guided composition operations or semantics-guided substitution operations to the set of abstract meaning representation graphs. In various instances, a lexicon legend can comprise domain-specific graphs respectively representing discrete tokens unique to a domain of the annotated training dataset.

Type: Application

Filed: June 29, 2023

Publication date: January 2, 2025

Inventors: Young-Suk Lee, SADHANA KUMARAVEL, Ramon Fernandez Astudillo, TAHIRA NASEEM, Radu Florian, Salim Roukos
COMBINING GENERATIVE ALIGNERS AND TRANSITION-BASED PARSERS

Publication number: 20240362466

Abstract: Systems, computer-implemented methods, and computer program products to facilitate reducing error propagation when combining generative aligners and transition-based aligners are provided. According to an embodiment, a system can comprise a processor that executed components stored in memory. The computer executable components comprise a generative alignment component, an error propagation component, a discriminative parser, and a stochastic oracle policy component. The error propagation component can compute a posterior distribution over one or more hard alignments of parts given a pair of the generative alignment component. The discriminative parser can be trained via the stochastic oracle policy component to reduce error propagation when combining the generative alignment component with the discriminative parser.

Type: Application

Filed: April 27, 2023

Publication date: October 31, 2024

Inventors: Ramon Fernandez Astudillo, Andrew Drozdov, Jiawei Zhou, Radu Florian, Tahira Naseem, Yoon Hyung Kim
Predictive data and model selection for transfer learning in natural language processing

Patent number: 11934922

Abstract: A computer system, product, and method are provided. The computer system includes an artificial intelligence (AI) platform operatively coupled to a processor. The AI platform includes tools in the form of a machine learning model (MLM) manager, a metric manager, and a training manager. The MLM manager accesses a plurality of pre-trained source MLMs, and inputs a plurality of data objects of a test dataset into each of the source MLMs. The test dataset includes the plurality of data objects associated with respective labels. For each source MLM, associated labels are generated from the inputted data objects and a similarity metric is calculated. The MLM manager selects a base MLM to be used for transfer learning from the plurality of source MLMs based upon the calculated similarity metric. The training manager trains the selected base MLM with a target dataset for the target domain.

Type: Grant

Filed: October 9, 2020

Date of Patent: March 19, 2024

Assignee: International Business Machines Corporation

Inventors: Parul Awasthy, Bishwaranjan Bhattacharjee, John Ronald Kender, Radu Florian, Hui Wan
Treebank synthesis for training production parsers

Patent number: 11769007

Abstract: An approach for generating synthetic treebanks to be used in training a parser in a production system is provided. A processor receives a request to generate one or more synthetic treebanks from a production system, wherein the request indicates a language for the one or more synthetic treebanks. A processor retrieves at least one corpus of text in which the requested language is present. A processor provides the at least one corpus to a transformer enhanced parser neural network model. A processor generates at least one synthetic treebank associated with a string of text from the at least one corpus of text in which the requested language is present. A processor sends the at least one synthetic treebank to the production system, wherein the production system trains a parser utilized by the production system with the at least one synthetic treebank.

Type: Grant

Filed: May 27, 2021

Date of Patent: September 26, 2023

Assignee: International Business Machines Corporation

Inventors: Yousef El-Kurdi, Radu Florian, Hiroshi Kanayama, Efsun Kayi, Laura Chiticariu, Takuya Ohko, Robert Todd Ward
Training a question-answer dialog system to avoid adversarial attacks

Patent number: 11755657

Abstract: A method, computer program product, and/or computer system generate a first adversarial statement via: (1) receiving a question and an original context for the question; (2) converting the question into a statement with a placeholder answer; (3) picking randomly an answer entity from a training text corpus; (4) replacing the placeholder answer with the randomly picked answer entity; and (5) leaving a correct question entity in the statement. The first adversarial statement is inserted into the original context to form a first adversarial context. The question and the first adversarial context as a first pair and the question and the original context as a second pair are input into a question-answer dialog system to train the question-answer dialog system.

Type: Grant

Filed: September 19, 2022

Date of Patent: September 12, 2023

Assignee: International Business Machines Corporation

Inventors: Sara Rosenthal, Avirup Sil, Mihaela Ancuta Bornea, Radu Florian
Search, question answering, and classifier construction

Patent number: 11651270

Abstract: A method and system are provided for combining models. The method includes forming, by a computer having a processor and a memory, model pairs from a model ensemble that includes a plurality of models. The method further includes comparing the model pairs based on sets of output results produced by the model pairs to provide comparison results. The method also includes constructing, by the computer, a combination model from at least one of the model pairs based on the comparison results. The comparing step is performed using user-generated set-based feedback.

Type: Grant

Filed: March 22, 2016

Date of Patent: May 16, 2023

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Vittorio Castelli, Radu Florian, Taesun Moon, Avirup Sil
Method, apparatus and computer program product for vehicle platooning

Patent number: 11636405

Abstract: A method, apparatus and computer program products are provided for generating routes for vehicles participating in a platooning plan, where the routes are influenced by one another. Methods may include receiving a first trip request including a first vehicle identification, a first trip origin, and a first trip destination; receiving a second trip request including a second vehicle identification, a second trip origin, and a second trip destination, where the first trip origin, second trip origin, first trip destination, and second trip destination are all different; generating a platooning plan that includes a first route and a second route, where the first route and the second route overlap for at least a portion of the respective route, where at least one of the first route and the second route are generated with an influence from the other of the first route and the second route.

Type: Grant

Filed: November 20, 2019

Date of Patent: April 25, 2023

Assignee: HERE GLOBAL B.V.

Inventors: Radu-Florian Atanasiu, Jeronimo Lopez-Navarro, Jiaqing Gu, Thomas Gehrsitz, Sebastian Van De Hoef
Enhancing multi-lingual embeddings for cross-lingual question-answer system

Patent number: 11574130

Abstract: A method includes receiving, by a question-answer system, a question in a first language and the question in a second language and predicting, by the question-answer system, a first answer to the question in the first language and a second answer to the question in the second language. The method also includes generating, by the question-answer system, a first vector representing the question in the first language and a second vector representing the question in the second language and adjusting the question-answer system based on the first and second answers and the first and second vectors such that when the question-answer system subsequently generates a third vector representing the question in the first language and a fourth vector representing the question in the second language, a distance between the third and fourth vectors is less than a distance between the first and second vectors.

Type: Grant

Filed: November 24, 2020

Date of Patent: February 7, 2023

Assignee: International Business Machines Corporation

Inventors: Mihaela Ancuta Bornea, Lin Pan, Sara Rosenthal, Avirup Sil, Radu Florian
ONLINE QUESTION ANSWERING, USING READING COMPREHENSION WITH AN ENSEMBLE OF MODELS

Publication number: 20230023958

Abstract: Receive a question via a graphical user interface (GUI), obtain a passage of text potentially relevant to the question, and receive, via the GUI, a selection of a number of question-answering models to be ensembled. Produce a plurality of answers to the question by running a plurality of question-answering models, consistent with the selection of the number of question-answering models to be ensembled, on the passage of text. Produce an ensembled answer by ensembling the plurality of answers according to their respective confidence scores. Display, via the GUI, the ensembled answer in context of the passage of text, with the ensembled answer visually marked in the passage of text. Optionally, repeat these steps for a second passage of text.

Type: Application

Filed: July 23, 2021

Publication date: January 26, 2023

Inventors: Anthony Ferritto, Radu Florian, James William Murdock, IV, Avirup Sil
Training a Question-Answer Dialog System to Avoid Adversarial Attacks

Publication number: 20230009893

Abstract: A method, computer program product, and/or computer system generate a first adversarial statement via: (1) receiving a question and an original context for the question; (2) converting the question into a statement with a placeholder answer; (3) picking randomly an answer entity from a training text corpus; (4) replacing the placeholder answer with the randomly picked answer entity; and (5) leaving a correct question entity in the statement. The first adversarial statement is inserted into the original context to form a first adversarial context. The question and the first adversarial context as a first pair and the question and the original context as a second pair are input into a question-answer dialog system to train the question-answer dialog system.

Type: Application

Filed: September 19, 2022

Publication date: January 12, 2023

Inventors: Sara Rosenthal, Avirup Sil, Mihaela Ancuta Bornea, Radu Florian
Performing fine-grained question type classification

Patent number: 11520762

Abstract: A computer-implemented method according to one embodiment includes converting an input question into a vector form using trained word embeddings; constructing a type similarity matrix using a predetermined ontology; and determining a score for all possible types for the input question, based on the input question in vector form and the type similarity matrix.

Type: Grant

Filed: December 13, 2019

Date of Patent: December 6, 2022

Assignee: International Business Machines Corporation

Inventors: Sarthak Dash, Gaetano Rossiello, Alfio Massimiliano Gliozzo, Robert G. Farrell, Bassem Makni, Avirup Sil, Vittorio Castelli, Radu Florian
Training a question-answer dialog sytem to avoid adversarial attacks

Patent number: 11520829

Abstract: A method, computer program product, and/or computer system protects a question-answer dialog system from being attacked by adversarial statements that incorrectly answer a question. A computing device accesses a plurality of adversarial statements that are capable of making an adversarial attack on a question-answer dialog system, which is trained to provide a correct answer to a specific type of question. The computing device utilizes the plurality of adversarial statements to train a machine learning model for the question-answer dialog system. The computing device then reinforces the trained machine learning model by bootstrapping adversarial policies that identify multiple types of adversarial statements onto the trained machine learning model. The computing device then utilizes the trained and bootstrapped machine learning model to avoid adversarial attacks when responding to questions submitted to the question-answer dialog system.

Type: Grant

Filed: October 21, 2020

Date of Patent: December 6, 2022

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Sara Rosenthal, Avirup Sil, Mihaela Ancuta Bornea, Radu Florian
TREEBANK SYNTHESIS FOR TRAINING PRODUCTION PARSERS

Publication number: 20220382972

Abstract: An approach for generating synthetic treebanks to be used in training a parser in a production system is provided. A processor receives a request to generate one or more synthetic treebanks from a production system, wherein the request indicates a language for the one or more synthetic treebanks. A processor retrieves at least one corpus of text in which the requested language is present. A processor provides the at least one corpus to a transformer enhanced parser neural network model. A processor generates at least one synthetic treebank associated with a string of text from the at least one corpus of text in which the requested language is present. A processor sends the at least one synthetic treebank to the production system, wherein the production system trains a parser utilized by the production system with the at least one synthetic treebank.

Type: Application

Filed: May 27, 2021

Publication date: December 1, 2022

Inventors: YOUSEF EL-KURDI, Radu Florian, HIROSHI KANAYAMA, Efsun Kayi, LAURA CHITICARIU, Takuya Ohko, Robert Todd Ward
Extracting Facts from Unstructured Text

Publication number: 20220207384

Abstract: A system, computer program product, and method are provided for extraction of factual data from unstructured natural language (NL) text. A detection model is applied to convert unstructured NL text in a first language to annotated NL text. The detection model identifies two or more mentions from the unstructured NL text and a logical position of the mentions. The detection model further identifies a sequential position for each of the mentions and attaches a sequential position identifier. A pattern of rules corresponding with the annotated NL text is identified and applied to the annotated NL text, and one or more facts embedded within the annotated NL text are extracted and converted into structured data.

Type: Application

Filed: December 30, 2020

Publication date: June 30, 2022

Applicant: International Business Machines Corporation

Inventors: Radu Florian, Salim Roukos, Martin Franz
Text classification using models with complementary granularity and accuracy

Patent number: 11373041

Abstract: A processor may receive a text segment. The processor may analyze the text segment at a plurality of granularity levels wherein each of the plurality of granularity levels has a comparative selection value for identifying one or more objects of interest within the text segment. The processor may select an optimized granularity level with an optimum comparative selection value. The processor may identify the one or more objects of interest within the text segment. The processor may display the one or more objects of interest to a user.

Type: Grant

Filed: September 18, 2020

Date of Patent: June 28, 2022

Assignee: International Business Machines Corporation

Inventors: Jian Ni, Radu Florian, Salim Roukos, Vittorio Castelli
ENHANCING MULTI-LINGUAL EMBEDDINGS FOR CROSS-LINGUAL QUESTION-ANSWER SYSTEM

Publication number: 20220164538

Abstract: A method includes receiving, by a question-answer system, a question in a first language and the question in a second language and predicting, by the question-answer system, a first answer to the question in the first language and a second answer to the question in the second language. The method also includes generating, by the question-answer system, a first vector representing the question in the first language and a second vector representing the question in the second language and adjusting the question-answer system based on the first and second answers and the first and second vectors such that when the question-answer system subsequently generates a third vector representing the question in the first language and a fourth vector representing the question in the second language, a distance between the third and fourth vectors is less than a distance between the first and second vectors.

Type: Application

Filed: November 24, 2020

Publication date: May 26, 2022

Inventors: Mihaela Ancuta BORNEA, Lin PAN, Sara ROSENTHAL, Avirup SIL, Radu FLORIAN
TRAINING A QUESTION-ANSWER DIALOG SYTEM TO AVOID ADVERSARIAL ATTACKS

Publication number: 20220121710

Abstract: A method, computer program product, and/or computer system protects a question-answer dialog system from being attacked by adversarial statements that incorrectly answer a question. A computing device accesses a plurality of adversarial statements that are capable of making an adversarial attack on a question-answer dialog system, which is trained to provide a correct answer to a specific type of question. The computing device utilizes the plurality of adversarial statements to train a machine learning model for the question-answer dialog system. The computing device then reinforces the trained machine learning model by bootstrapping adversarial policies that identify multiple types of adversarial statements onto the trained machine learning model. The computing device then utilizes the trained and bootstrapped machine learning model to avoid adversarial attacks when responding to questions submitted to the question-answer dialog system.

Type: Application

Filed: October 21, 2020

Publication date: April 21, 2022

Inventors: SARA ROSENTHAL, AVIRUP SIL, MIHAELA ANCUTA BORNEA, RADU FLORIAN
Predictive Data and Model Selection for Transfer Learning in Natural Language Processing

Publication number: 20220114473

Abstract: A computer system, product, and method are provided. The computer system includes an artificial intelligence (AI) platform operatively coupled to a processor. The AI platform includes tools in the form of a machine learning model (MLM) manager, a metric manager, and a training manager. The MLM manager accesses a plurality of pre-trained source MLMs, and inputs a plurality of data objects of a test dataset into each of the source MLMs. The test dataset includes the plurality of data objects associated with respective labels. For each source MLM, associated labels are generated from the inputted data objects and a similarity metric is calculated. The MLM manager selects a base MLM to be used for transfer learning from the plurality of source MLMs based upon the calculated similarity metric. The training manager trains the selected base MLM with a target dataset for the target domain.

Type: Application

Filed: October 9, 2020

Publication date: April 14, 2022

Applicant: International Business Machines Corporation

Inventors: Parul Awasthy, Bishwaranjan Bhattacharjee, John Ronald Kender, Radu Florian, Hui Wan
TEXT CLASSIFICATION USING MODELS WITH COMPLEMENTARY GRANULARITY AND ACCURACY

Publication number: 20220092262

Abstract: A processor may receive a text segment. The processor may analyze the text segment at a plurality of granularity levels wherein each of the plurality of granularity levels has a comparative selection value for identifying one or more objects of interest within the text segment. The processor may select an optimized granularity level with an optimum comparative selection value. The processor may identify the one or more objects of interest within the text segment. The processor may display the one or more objects of interest to a user.

Type: Application

Filed: September 18, 2020

Publication date: March 24, 2022

Inventors: Jian Ni, Radu Florian, Salim Roukos, Vittorio Castelli
PERFORMING FINE-GRAINED QUESTION TYPE CLASSIFICATION

Publication number: 20210182258

Abstract: A computer-implemented method according to one embodiment includes converting an input question into a vector form using trained word embeddings; constructing a type similarity matrix using a predetermined ontology; and determining a score for all possible types for the input question, based on the input question in vector form and the type similarity matrix.

Type: Application

Filed: December 13, 2019

Publication date: June 17, 2021

Inventors: Sarthak Dash, Gaetano Rossiello, Alfio Massimiliano Gliozzo, Robert G. Farrell, Bassem Makni, Avirup Sil, Vittorio Castelli, Radu Florian

1 2 3 4 next