Patents by Inventor Md Arafat Sultan

Md Arafat Sultan has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Automatically generating diverse text

Patent number: 11741371

Abstract: Embodiments relate to an artificial intelligence (AI) computer platform to incorporate synthetic data and ground truth data, and to promote diversity and accuracy in generating the synthetic data. Synthetic questions are generated by a question generator in response to semantically related ground truth passage and answer data. Each generated question is presented to an answer generator together with the semantically related ground truth passage. Each synthetic question is evaluated with respect to its diversity from previous synthetic questions generated for the same ground truth passage and answer data. Each synthetic question is also evaluated with respect to the accuracy of the answer generated by the answer generator. A reward function that captures both accuracy and diversity of each synthetic question is leveraged to selectively modify the question generator, with the selective modification(s) directed at increasing textual diversity and maintaining accuracy of the generated synthetic questions.

Type: Grant

Filed: March 20, 2020

Date of Patent: August 29, 2023

Assignee: International Business Machines Corporation

Inventors: MD Arafat Sultan, Vittorio Castelli, Shubham Chandel, Ramon Astudillo
Unsupervised hypernym induction machine learning

Patent number: 11507828

Abstract: Training a machine learning model such as a neural network, which can automatically extract a hypernym from unstructured data, is disclosed. A preliminary candidate list of hyponym-hypernym pairs can be parsed from the corpus. A preliminary super-term—sub-term glossary can be generated from the corpus, the preliminary super-term—sub-term glossary containing one or more super-term—sub-term pairs. A super-term—sub-term pair can be filtered from the preliminary super-term—sub-term glossary, responsive to detecting that the super-term—sub-term pair is not a candidate for hyponym-hypernym pair, to generate a final super-term—sub-term glossary. The preliminary candidate list of hyponym-hypernym pairs and the final super-term—sub-term glossary can be combined to generate a final list of hyponym-hypernym pairs. An artificial neural network can be trained using the final list of hyponym-hypernym pairs as a training data set, the artificial neural network trained to identify a hypernym given new text data.

Type: Grant

Filed: October 29, 2019

Date of Patent: November 22, 2022

Assignee: International Business Machines Corporation

Inventors: Md Faisal Mahbub Chowdhury, Robert G. Farrell, Nicholas Brady Garvan Monath, Michael Robert Glass, Md Arafat Sultan
MODEL PERFORMANCE THROUGH TEXT-TO-TEXT TRANSFORMATION VIA DISTANT SUPERVISION FROM TARGET AND AUXILIARY TASKS

Publication number: 20220358152

Abstract: A computer-implemented method of performing text-to-text transformation includes performing a text transformation operation on an original input text of a specific task to generate a plurality of transformed text. A task-specific performance metric that measures an operation of the specific task is applied to each one of the plurality of transformed text. Each of the plurality of transformed text are paired with the task-specific performance metric. A training dataset is updated to include each pairing of the plurality of transformed text with the task-specific metric.

Type: Application

Filed: May 10, 2021

Publication date: November 10, 2022

Inventors: MD Arafat Sultan, Efsun Kayi, Revanth Gangi Reddy, Rong Zhang, Avirup Sil, Vittorio Castelli
ANSWER SPAN CORRECTION

Publication number: 20220138559

Abstract: A method of using a computing device to improve an answer generated by a natural language question and answer system includes receiving, by a computing device, multiple questions in a natural language question and answer system. The computing device further generates multiple answers to the multiple questions. The computing device still further constructs a new training set with the generated multiple answers, where each answer is compared with a corresponding question of the multiple questions. The computing device additionally augments the new training set with one or more tokens delimiting a span of one or more of the generated multiple answers. The computing device further trains a new natural language question and answer system with the augmented new training set.

Type: Application

Filed: November 5, 2020

Publication date: May 5, 2022

Inventors: Revanth Gangi Reddy, Rong Zhang, MD ARAFAT SULTAN, Efsun Kayi, Avirup Sil, Robert Todd Ward, Vittorio Castelli
Automatically Generating Diverse Text

Publication number: 20210295172

Abstract: Embodiments relate to an artificial intelligence (AI) computer platform to incorporate synthetic data and ground truth data, and to promote diversity and accuracy in generating the synthetic data. Synthetic questions are generated by a question generator in response to semantically related ground truth passage and answer data. Each generated question is presented to an answer generator together with the semantically related ground truth passage. Each synthetic question is evaluated with respect to its diversity from previous synthetic questions generated for the same ground truth passage and answer data. Each synthetic question is also evaluated with respect to the accuracy of the answer generated by the answer generator. A reward function that captures both accuracy and diversity of each synthetic question is leveraged to selectively modify the question generator, with the selective modification(s) directed at increasing textual diversity and maintaining accuracy of the generated synthetic questions.

Type: Application

Filed: March 20, 2020

Publication date: September 23, 2021

Applicant: International Business Machines Corporation

Inventors: MD Arafat Sultan, Vittorio Castelli, Shubham Chandel, Ramon Astudillo
UNSUPERVISED HYPERNYM INDUCTION MACHINE LEARNING

Publication number: 20210125058

Abstract: Training a machine learning model such as a neural network, which can automatically extract a hypernym from unstructured data, is disclosed. A preliminary candidate list of hyponym-hypernym pairs can be parsed from the corpus. A preliminary super-term-sub-term glossary can be generated from the corpus, the preliminary super-term-sub-term glossary containing one or more super-term-sub-term pairs. A super-term-sub-term pair can be filtered from the preliminary super-term-sub-term glossary, responsive to detecting that the super-term-sub-term pair is not a candidate for hyponym-hypernym pair, to generate a final super-term-sub-term glossary. The preliminary candidate list of hyponym-hypernym pairs and the final super-term-sub-term glossary can be combined to generate a final list of hyponym-hypernym pairs. An artificial neural network can be trained using the final list of hyponym-hypernym pairs as a training data set, the artificial neural network trained to identify a hypernym given new text data.

Type: Application

Filed: October 29, 2019

Publication date: April 29, 2021

Inventors: Md Faisal Mahbub Chowdhury, Robert G. Farrell, Nicholas Brady Garvan Monath, Michael Robert Glass, Md Arafat Sultan

Automatically generating diverse text

Unsupervised hypernym induction machine learning

MODEL PERFORMANCE THROUGH TEXT-TO-TEXT TRANSFORMATION VIA DISTANT SUPERVISION FROM TARGET AND AUXILIARY TASKS

ANSWER SPAN CORRECTION

Automatically Generating Diverse Text

UNSUPERVISED HYPERNYM INDUCTION MACHINE LEARNING