Patents by Inventor Md Arafat Sultan

Md Arafat Sultan has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11741371
    Abstract: Embodiments relate to an artificial intelligence (AI) computer platform to incorporate synthetic data and ground truth data, and to promote diversity and accuracy in generating the synthetic data. Synthetic questions are generated by a question generator in response to semantically related ground truth passage and answer data. Each generated question is presented to an answer generator together with the semantically related ground truth passage. Each synthetic question is evaluated with respect to its diversity from previous synthetic questions generated for the same ground truth passage and answer data. Each synthetic question is also evaluated with respect to the accuracy of the answer generated by the answer generator. A reward function that captures both accuracy and diversity of each synthetic question is leveraged to selectively modify the question generator, with the selective modification(s) directed at increasing textual diversity and maintaining accuracy of the generated synthetic questions.
    Type: Grant
    Filed: March 20, 2020
    Date of Patent: August 29, 2023
    Assignee: International Business Machines Corporation
    Inventors: MD Arafat Sultan, Vittorio Castelli, Shubham Chandel, Ramon Astudillo
  • Patent number: 11507828
    Abstract: Training a machine learning model such as a neural network, which can automatically extract a hypernym from unstructured data, is disclosed. A preliminary candidate list of hyponym-hypernym pairs can be parsed from the corpus. A preliminary super-term—sub-term glossary can be generated from the corpus, the preliminary super-term—sub-term glossary containing one or more super-term—sub-term pairs. A super-term—sub-term pair can be filtered from the preliminary super-term—sub-term glossary, responsive to detecting that the super-term—sub-term pair is not a candidate for hyponym-hypernym pair, to generate a final super-term—sub-term glossary. The preliminary candidate list of hyponym-hypernym pairs and the final super-term—sub-term glossary can be combined to generate a final list of hyponym-hypernym pairs. An artificial neural network can be trained using the final list of hyponym-hypernym pairs as a training data set, the artificial neural network trained to identify a hypernym given new text data.
    Type: Grant
    Filed: October 29, 2019
    Date of Patent: November 22, 2022
    Assignee: International Business Machines Corporation
    Inventors: Md Faisal Mahbub Chowdhury, Robert G. Farrell, Nicholas Brady Garvan Monath, Michael Robert Glass, Md Arafat Sultan
  • Publication number: 20220358152
    Abstract: A computer-implemented method of performing text-to-text transformation includes performing a text transformation operation on an original input text of a specific task to generate a plurality of transformed text. A task-specific performance metric that measures an operation of the specific task is applied to each one of the plurality of transformed text. Each of the plurality of transformed text are paired with the task-specific performance metric. A training dataset is updated to include each pairing of the plurality of transformed text with the task-specific metric.
    Type: Application
    Filed: May 10, 2021
    Publication date: November 10, 2022
    Inventors: MD Arafat Sultan, Efsun Kayi, Revanth Gangi Reddy, Rong Zhang, Avirup Sil, Vittorio Castelli
  • Publication number: 20220138559
    Abstract: A method of using a computing device to improve an answer generated by a natural language question and answer system includes receiving, by a computing device, multiple questions in a natural language question and answer system. The computing device further generates multiple answers to the multiple questions. The computing device still further constructs a new training set with the generated multiple answers, where each answer is compared with a corresponding question of the multiple questions. The computing device additionally augments the new training set with one or more tokens delimiting a span of one or more of the generated multiple answers. The computing device further trains a new natural language question and answer system with the augmented new training set.
    Type: Application
    Filed: November 5, 2020
    Publication date: May 5, 2022
    Inventors: Revanth Gangi Reddy, Rong Zhang, MD ARAFAT SULTAN, Efsun Kayi, Avirup Sil, Robert Todd Ward, Vittorio Castelli
  • Publication number: 20210295172
    Abstract: Embodiments relate to an artificial intelligence (AI) computer platform to incorporate synthetic data and ground truth data, and to promote diversity and accuracy in generating the synthetic data. Synthetic questions are generated by a question generator in response to semantically related ground truth passage and answer data. Each generated question is presented to an answer generator together with the semantically related ground truth passage. Each synthetic question is evaluated with respect to its diversity from previous synthetic questions generated for the same ground truth passage and answer data. Each synthetic question is also evaluated with respect to the accuracy of the answer generated by the answer generator. A reward function that captures both accuracy and diversity of each synthetic question is leveraged to selectively modify the question generator, with the selective modification(s) directed at increasing textual diversity and maintaining accuracy of the generated synthetic questions.
    Type: Application
    Filed: March 20, 2020
    Publication date: September 23, 2021
    Applicant: International Business Machines Corporation
    Inventors: MD Arafat Sultan, Vittorio Castelli, Shubham Chandel, Ramon Astudillo
  • Publication number: 20210125058
    Abstract: Training a machine learning model such as a neural network, which can automatically extract a hypernym from unstructured data, is disclosed. A preliminary candidate list of hyponym-hypernym pairs can be parsed from the corpus. A preliminary super-term-sub-term glossary can be generated from the corpus, the preliminary super-term-sub-term glossary containing one or more super-term-sub-term pairs. A super-term-sub-term pair can be filtered from the preliminary super-term-sub-term glossary, responsive to detecting that the super-term-sub-term pair is not a candidate for hyponym-hypernym pair, to generate a final super-term-sub-term glossary. The preliminary candidate list of hyponym-hypernym pairs and the final super-term-sub-term glossary can be combined to generate a final list of hyponym-hypernym pairs. An artificial neural network can be trained using the final list of hyponym-hypernym pairs as a training data set, the artificial neural network trained to identify a hypernym given new text data.
    Type: Application
    Filed: October 29, 2019
    Publication date: April 29, 2021
    Inventors: Md Faisal Mahbub Chowdhury, Robert G. Farrell, Nicholas Brady Garvan Monath, Michael Robert Glass, Md Arafat Sultan