Patents by Inventor Siddharth A. Patwardhan

Siddharth A. Patwardhan has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11720611
    Abstract: Generating a textual entailment pair by a natural language processing (NLP) system. The NLP system receives two input texts, such as a question and a candidate answer. The NLP system queries a database and retrieves passages likely to include text that supports the candidate answer. The NLP system generates parse trees and performs term matching on the passages and scores them according to the matching. The NLP system detects anchor pairs in the question and in the passage and aligns subgraphs (within the parse trees) of one to the other based on matching. The NLP system identifies aligned terms in the question and the passage that are not in the aligned subgraphs. The NLP system identifies text fragments, for the question and the passage, within the non-aligned segments of their respective parse trees, that connect the aligned term to the aligned portion of the subgraph.
    Type: Grant
    Filed: May 20, 2016
    Date of Patent: August 8, 2023
    Assignee: International Business Machines Corporation
    Inventors: Branimir K. Boguraev, Jennifer Chu-Carroll, Aditya A. Kalyanpur, David J. McClosky, James W. Murdock, IV, Siddharth A. Patwardhan
  • Patent number: 11520813
    Abstract: Generating a textual entailment pair by a natural language processing (NLP) system. The NLP system receives two input texts, such as a question and a candidate answer. The NLP system queries a database and retrieves passages likely to include text that supports the candidate answer. The NLP system generates parse trees and performs term matching on the passages and scores them according to the matching. The NLP system detects anchor pairs in the question and in the passage and aligns subgraphs (within the parse trees) of one to the other based on matching. The NLP system identifies aligned terms in the question and the passage that are not in the aligned subgraphs. The NLP system identifies text fragments, for the question and the passage, within the non-aligned segments of their respective parse trees, that connect the aligned term to the aligned portion of the subgraph.
    Type: Grant
    Filed: January 4, 2016
    Date of Patent: December 6, 2022
    Assignee: International Business Machines Corporation
    Inventors: Branimir K. Boguraev, Jennifer Chu-Carroll, Aditya A. Kalyanpur, David J. McClosky, James W. Murdock, IV, Siddharth A. Patwardhan
  • Patent number: 11144718
    Abstract: In configuring a processing system with an application made up of machine learning components, where the application has been trained on a set of training data, the application is executed on the processing system using another set of training data. Outputs of the application produced from the other set of training data that concur with ground truth data are identified. The components are adapted to produce outputs of the application that concur with the ground truth data using the identified outputs of the application.
    Type: Grant
    Filed: February 28, 2017
    Date of Patent: October 12, 2021
    Assignee: International Business Machines Corporation
    Inventors: Youngja Park, Siddharth A. Patwardhan
  • Patent number: 10929615
    Abstract: A computer-implemented method includes detecting a first set and a second set of citations to a legal case in a plurality of legal documents and a first legal document distinct from the plurality of legal documents, respectively. The computer-implemented method further includes determining tones corresponding to each citation in the first and second sets of citations. The computer-implemented method further includes determining a score for each tone in the first and second sets of tones. The computer-implemented method further includes aggregating a first subset and a second subset of the first and second sets of citations, respectively. The computer-implemented method further includes generating an average score for the first and second subsets. The computer-implemented method further includes determining a degree of similarity between the first and second subsets based, in part, on a comparison of average scores. A corresponding computer program product and computer system are also disclosed.
    Type: Grant
    Filed: June 21, 2019
    Date of Patent: February 23, 2021
    Assignee: International Business Machines Corporation
    Inventors: Hernan Badenes, Rosanna S. Mannan, Siddharth A. Patwardhan
  • Patent number: 10769138
    Abstract: Provided herein are techniques for processing a context-situated inquiry to provide results satisfying the inquiry. An inquiry and its supporting context are processed using natural language processing to determine an interpretation for the inquiry and context, and the interpretation is presented to receive corrective information for the interpretation. The corrective information is applied to the interpretation and content is retrieved based on the interpretation of inquiry and context to produce candidate results for the inquiry. Supplemental information is iteratively received to generate updated candidate results.
    Type: Grant
    Filed: June 13, 2017
    Date of Patent: September 8, 2020
    Assignee: International Business Machines Corporation
    Inventors: Sugato Bagchi, Kenneth J. Barker, Branimir K. Boguraev, Mihaela A. Bornea, Adam R. Faulkner, Yanpeng Li, Siddharth A. Patwardhan, Sara Rosenthal
  • Patent number: 10699215
    Abstract: Mechanisms are provided to implement a self-training engine of a question and answer system. The self-training engine pairs an unanswered natural language question with portions of an electronic document to generate an unlabeled data set. The self-training engine trains a model based on a labeled data set comprising a finite number of question and answer pair data structures and applies the model to the unlabeled data set to identify an answer from the portions of the electronic document to the unanswered natural language question. The self-training engine modifies the labeled data set to include the identified answer and corresponding unanswered natural language question as an additional question and answer pair data structure. The self-training engine then trains the model based on the modified labeled data set.
    Type: Grant
    Filed: November 16, 2016
    Date of Patent: June 30, 2020
    Assignee: International Business Machines Corporation
    Inventors: Murthy V. Devarakonda, Siddharth A. Patwardhan, Preethi Raghavan
  • Patent number: 10621880
    Abstract: A method of generating secondary questions in a question-answer system. Missing information is identified from a corpus of data using a computerized device. The missing information comprises any information that improves confidence scores for candidate answers to a question. The computerized device automatically generates a plurality of hypotheses concerning the missing information. The computerized device automatically generates at least one secondary question based on each of the plurality of hypotheses. The hypotheses are ranked based on relative utility to determine an order in which the computerized device outputs the at least one secondary question to external sources to obtain responses.
    Type: Grant
    Filed: September 11, 2012
    Date of Patent: April 14, 2020
    Assignee: International Business Machines Corporation
    Inventors: Branimir K. Boguraev, David W. Buchanan, Jennifer Chu-Carroll, David A. Ferrucci, Aditya A. Kalyanpur, James W. Murdock, IV, Siddharth A. Patwardhan
  • Patent number: 10614725
    Abstract: A method of generating secondary questions in a question-answer system. Missing information is identified from a corpus of data using a computerized device. The missing information comprises any information that improves confidence scores for candidate answers to a question. The computerized device automatically generates a plurality of hypotheses concerning the missing information. The computerized device automatically generates at least one secondary question based on each of the plurality of hypotheses. The hypotheses are ranked based on relative utility to determine an order in which the computerized device outputs the at least one secondary question to external sources to obtain responses.
    Type: Grant
    Filed: September 11, 2012
    Date of Patent: April 7, 2020
    Assignee: International Business Machines Corporation
    Inventors: Branimir K. Boguraev, David W. Buchanan, Jennifer Chu-Carroll, David A. Ferrucci, Aditya A. Kalyanpur, James W. Murdock, IV, Siddharth A. Patwardhan
  • Patent number: 10452780
    Abstract: A computer-implemented method includes detecting a first set and a second set of citations to a legal case in a plurality of legal documents and a first legal document distinct from the plurality of legal documents, respectively. The computer-implemented method further includes determining tones corresponding to each citation in the first and second sets of citations. The computer-implemented method further includes determining a score for each tone in the first and second sets of tones. The computer-implemented method further includes aggregating a first subset and a second subset of the first and second sets of citations, respectively. The computer-implemented method further includes generating an average score for the first and second subsets. The computer-implemented method further includes determining a degree of similarity between the first and second subsets based, in part, on a comparison of average scores. A corresponding computer program product and computer system are also disclosed.
    Type: Grant
    Filed: February 15, 2017
    Date of Patent: October 22, 2019
    Assignee: International Business Machines Corporation
    Inventors: Hernan Badenes, Rosanna S. Mannan, Siddharth A. Patwardhan
  • Publication number: 20190311037
    Abstract: A computer-implemented method includes detecting a first set and a second set of citations to a legal case in a plurality of legal documents and a first legal document distinct from the plurality of legal documents, respectively. The computer-implemented method further includes determining tones corresponding to each citation in the first and second sets of citations. The computer-implemented method further includes determining a score for each tone in the first and second sets of tones. The computer-implemented method further includes aggregating a first subset and a second subset of the first and second sets of citations, respectively. The computer-implemented method further includes generating an average score for the first and second subsets. The computer-implemented method further includes determining a degree of similarity between the first and second subsets based, in part, on a comparison of average scores. A corresponding computer program product and computer system are also disclosed.
    Type: Application
    Filed: June 21, 2019
    Publication date: October 10, 2019
    Inventors: Hernan Badenes, Rosanna S. Mannan, Siddharth A. Patwardhan
  • Patent number: 10372822
    Abstract: A mechanism is provided in a computing device configured with instructions executing on a processor of the computing device to implement a timeline generation system, for automated timeline completion. The timeline generation system executing on the processor of the computing device identifies a plurality of events in documents in a corpus of information. The timeline generation system places the plurality of events in a partial timeline data structure. The timeline generation system selects an event progression from an event progression knowledge base. The timeline generation system aligns the selected event progression to the partial timeline data structure. The timeline generation system identifies a set of events missing from the partial timeline data structure. The timeline generation system maps the set of events missing from the partial timeline data structure to the partial timeline based on the selected event progression to form a completed timeline data structure.
    Type: Grant
    Filed: June 3, 2016
    Date of Patent: August 6, 2019
    Assignee: International Business Machines Corporation
    Inventors: Murthy V. Devarakonda, Siddharth A. Patwardhan, Preethi Raghavan
  • Patent number: 10373278
    Abstract: A computer-implemented method includes detecting a first and second set of citations to a plurality of legal cases cited in a plurality of legal documents and a first legal document distinct from the plurality of legal documents, respectively. The computer-implemented method further includes determining a first and second set of tones corresponding to each citation in the first and second sets of citations, respectively. The computer-implemented method further includes determining a score for each tone in the first and second sets of tones, respectively. The computer-implemented method further includes detecting a first set of annotations corresponding to the first set of citations. The computer-implemented method further includes building a training model, based, at least in part, from at least a subset of the first set of annotations. The computer-implemented method further includes determining a second set of annotations corresponding to the second set of citations.
    Type: Grant
    Filed: February 15, 2017
    Date of Patent: August 6, 2019
    Assignee: International Business Machines Corporation
    Inventors: Hernan Badenes, Rosanna S. Mannan, Siddharth A. Patwardhan
  • Publication number: 20180357272
    Abstract: Provided herein are techniques for processing a context-situated inquiry to provide results satisfying the inquiry. An inquiry and its supporting context are processed using natural language processing to determine an interpretation for the inquiry and context, and the interpretation is presented to receive corrective information for the interpretation. The corrective information is applied to the interpretation and content is retrieved based on the interpretation of inquiry and context to produce candidate results for the inquiry. Supplemental information is iteratively received to generate updated candidate results.
    Type: Application
    Filed: June 13, 2017
    Publication date: December 13, 2018
    Inventors: Sugato Bagchi, Kenneth J. Barker, Branimir K. Boguraev, Mihaela A. Bornea, Adam R. Faulkner, Yanpeng Li, Siddharth A. Patwardhan, Sara Rosenthal
  • Publication number: 20180247221
    Abstract: In configuring a processing system with an application made up of machine learning components, where the application has been trained on a set of training data, the application is executed on the processing system using another set of training data. Outputs of the application produced from the other set of training data that concur with ground truth data are identified. The components are adapted to produce outputs of the application that concur with the ground truth data using the identified outputs of the application.
    Type: Application
    Filed: February 28, 2017
    Publication date: August 30, 2018
    Inventors: Youngja Park, Siddharth A. Patwardhan
  • Publication number: 20180232358
    Abstract: A computer-implemented method includes detecting a first set and a second set of citations to a legal case in a plurality of legal documents and a first legal document distinct from the plurality of legal documents, respectively. The computer-implemented method further includes determining tones corresponding to each citation in the first and second sets of citations. The computer-implemented method further includes determining a score for each tone in the first and second sets of tones. The computer-implemented method further includes aggregating a first subset and a second subset of the first and second sets of citations, respectively. The computer-implemented method further includes generating an average score for the first and second subsets. The computer-implemented method further includes determining a degree of similarity between the first and second subsets based, in part, on a comparison of average scores. A corresponding computer program product and computer system are also disclosed.
    Type: Application
    Filed: February 15, 2017
    Publication date: August 16, 2018
    Inventors: Hernan Badenes, Rosanna S. Mannan, Siddharth A. Patwardhan
  • Publication number: 20180232827
    Abstract: A computer-implemented method includes detecting a first and second set of citations to a plurality of legal cases cited in a plurality of legal documents and a first legal document distinct from the plurality of legal documents, respectively. The computer-implemented method further includes determining a first and second set of tones corresponding to each citation in the first and second sets of citations, respectively. The computer-implemented method further includes determining a score for each tone in the first and second sets of tones, respectively. The computer-implemented method further includes detecting a first set of annotations corresponding to the first set of citations. The computer-implemented method further includes building a training model, based, at least in part, from at least a subset of the first set of annotations. The computer-implemented method further includes determining a second set of annotations corresponding to the second set of citations.
    Type: Application
    Filed: September 20, 2017
    Publication date: August 16, 2018
    Inventors: Hernan Badenes, Rosanna S. Mannan, Siddharth A. Patwardhan
  • Publication number: 20180232826
    Abstract: A computer-implemented method includes detecting a first and second set of citations to a plurality of legal cases cited in a plurality of legal documents and a first legal document distinct from the plurality of legal documents, respectively. The computer-implemented method further includes determining a first and second set of tones corresponding to each citation in the first and second sets of citations, respectively. The computer-implemented method further includes determining a score for each tone in the first and second sets of tones, respectively. The computer-implemented method further includes detecting a first set of annotations corresponding to the first set of citations. The computer-implemented method further includes building a training model, based, at least in part, from at least a subset of the first set of annotations. The computer-implemented method further includes determining a second set of annotations corresponding to the second set of citations.
    Type: Application
    Filed: February 15, 2017
    Publication date: August 16, 2018
    Inventors: Hernan Badenes, Rosanna S. Mannan, Siddharth A. Patwardhan
  • Patent number: 10042921
    Abstract: An embodiment of the invention provides a method including receiving a natural language query from a user with an interface, and generating multiple dependency parses of the natural language query with a parser device connected to the interface. The generating of the multiple dependency parses includes dividing the natural language query into multiple components, and creating a single dependency parse by connecting each component of the components with at least one other component of the components. A processor connected to the parser device applies rules to all of the multiple dependency parses to identify entities and relations in the natural language query.
    Type: Grant
    Filed: September 18, 2015
    Date of Patent: August 7, 2018
    Assignee: International Business Machines Corporation
    Inventors: Branimir K. Boguraev, Elahe Khorasani, Vadim Sheinin, Siddharth A. Patwardhan, Petros Zerfos
  • Publication number: 20180137433
    Abstract: Mechanisms are provided to implement a self-training engine of a question and answer system. The self-training engine pairs an unanswered natural language question with portions of an electronic document to generate an unlabeled data set. The self-training engine trains a model based on a labeled data set comprising a finite number of question and answer pair data structures and applies the model to the unlabeled data set to identify an answer from the portions of the electronic document to the unanswered natural language question. The self-training engine modifies the labeled data set to include the identified answer and corresponding unanswered natural language question as an additional question and answer pair data structure. The self-training engine then trains the model based on the modified labeled data set.
    Type: Application
    Filed: November 16, 2016
    Publication date: May 17, 2018
    Inventors: Murthy V. Devarakonda, Siddharth A. Patwardhan, Preethi Raghavan
  • Publication number: 20180121603
    Abstract: A contextually relevant patient information extractor is provided that receives an input question directed to medical information about a patient; analyzes a patient's electronic medical records (EMRs) to identify an initial entry in the patient's EMRs corresponding to a candidate answer to the input question; analyzes a context of the patient's EMRs based on the initial entry to identify entries in the patient's EMR that are contextually connected to the initial entry; performs question answering analysis on the initial entry and entries that are contextually connected to the initial entry to identify one or more candidate answers to the input question; and outputs a final answer to the input question based on the question answering analysis.
    Type: Application
    Filed: October 27, 2016
    Publication date: May 3, 2018
    Inventors: Murthy V. Devarakonda, Jennifer J. Liang, Siddharth A. Patwardhan, Preethi Raghavan
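
The self-training loop summarized in the abstract of patent 10699215 (and its application, 20180137433) can be illustrated with a minimal sketch. This is not the patented implementation: the token-counting "model", the scoring rule, the acceptance threshold, and all function names below are assumptions chosen only to make the train, apply, extend, retrain cycle concrete.

```python
from collections import Counter

def tokenize(text):
    # Lowercase tokens with surrounding punctuation removed (toy tokenizer).
    return [t.strip(".,?!").lower() for t in text.split() if t.strip(".,?!")]

def train(labeled):
    # Toy "model" (an assumption): counts of tokens seen in known answers.
    weights = Counter()
    for _question, answer in labeled:
        weights.update(tokenize(answer))
    return weights

def score(model, question, passage):
    # Hypothetical scoring rule: question overlap plus a small learned prior.
    question_tokens = set(tokenize(question))
    passage_tokens = tokenize(passage)
    overlap = sum(1 for t in passage_tokens if t in question_tokens)
    prior = sum(model[t] for t in passage_tokens)
    return overlap + 0.1 * prior

def self_train(labeled, unanswered, passages, threshold=1.0):
    labeled = list(labeled)
    model = train(labeled)  # train on the initial labeled set
    for question in unanswered:
        # Pair the unanswered question with document portions (unlabeled set).
        candidates = [(question, passage) for passage in passages]
        best = max(candidates, key=lambda pair: score(model, *pair))
        if score(model, *best) >= threshold:
            labeled.append(best)    # add the identified answer as a new pair
            model = train(labeled)  # retrain on the modified labeled set
    return model, labeled

seed = [("What drug treats hypertension?", "Lisinopril treats hypertension.")]
questions = ["What drug treats diabetes?"]
portions = ["Metformin treats diabetes.", "The clinic opens at nine."]
model, grown = self_train(seed, questions, portions)
print(grown[-1])
```

The threshold keeps low-scoring passages out of the labeled set; a real system would presumably use a calibrated confidence from a trained model rather than this overlap count.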