Patents by Inventor Siddharth A. Patwardhan

Siddharth A. Patwardhan has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Entailment knowledge base in natural language processing systems

Patent number: 11720611

Abstract: Generating textual entailment pair by a natural language processing (NLP) system. The NLP system receives two input texts, such as a question and a candidate answer. The NLP system queries a database and retrieves passages likely to include text that support the candidate answer. The NLP system generates parse trees and performs term matching on the passages and scores them according to the matching. The NLP system detects anchor pairs in the question and in the passage and aligns subgraphs (within the parse trees) of one to the other based on matching. The NLP system identifies aligned terms in the question and the passage that are not in the aligned subgraphs. The NLP system identifies text fragments, for the question and the passage, within the non-aligned segments of their respective parse trees, that connect the aligned term to the aligned portion of the subgraph.

Type: Grant

Filed: May 20, 2016

Date of Patent: August 8, 2023

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Branimir K. Boguraev, Jennifer Chu-Carroll, Aditya A. Kalyanpur, David J. McClosky, James W. Murdock, IV, Siddharth A. Patwardhan
Entailment knowledge base in natural language processing systems

Patent number: 11520813

Abstract: Generating textual entailment pair by a natural language processing (NLP) system. The NLP system receives two input texts, such as a question and a candidate answer. The NLP system queries a database and retrieves passages likely to include text that support the candidate answer. The NLP system generates parse trees and performs term matching on the passages and scores them according to the matching. The NLP system detects anchor pairs in the question and in the passage and aligns subgraphs (within the parse trees) of one to the other based on matching. The NLP system identifies aligned terms in the question and the passage that are not in the aligned subgraphs. The NLP system identifies text fragments, for the question and the passage, within the non-aligned segments of their respective parse trees, that connect the aligned term to the aligned portion of the subgraph.

Type: Grant

Filed: January 4, 2016

Date of Patent: December 6, 2022

Assignee: International Business Machines Corporation

Inventors: Branimir K. Boguraev, Jennifer Chu-Carroll, Aditya A. Kalyanpur, David J. McClosky, James W. Murdock, IV, Siddharth A. Patwardhan
Adaptable processing components

Patent number: 11144718

Abstract: In configuring a processing system with an application made up of machine learning components, where the application has been trained on a set of training data, the application is executed on the processing system using another set of training data. Outputs of the application produced from the other set of training data identified that concur with ground truth data are identified. The components are adapted to produce outputs of the application that concur with the ground truth data using the identified outputs of the application.

Type: Grant

Filed: February 28, 2017

Date of Patent: October 12, 2021

Assignee: International Business Machines Corporation

Inventors: Youngja Park, Siddharth A. Patwardhan
Tone analysis of legal documents

Patent number: 10929615

Abstract: A computer-implemented method includes detecting a first set and a second set of citations to a legal case in a plurality of legal documents and a first legal document distinct from the plurality of legal documents, respectively. The computer-implemented method further includes determining tones corresponding to each citation in the first and second sets of citations. The computer-implemented method further includes determining a score for each tone in the first and second sets of tones. The computer-implemented method further includes aggregating a first and subset and a second of the first and second sets of citations, respectively. The computer-implemented method further includes generating an average score for the first and second subsets. The computer-implemented method further includes determining a degree of similarity between the first and second subsets based, in part, on a comparison of average scores. A corresponding computer program product and computer system are also disclosed.

Type: Grant

Filed: June 21, 2019

Date of Patent: February 23, 2021

Assignee: International Business Machines Corporation

Inventors: Hernan Badenes, Rosanna S. Mannan, Siddharth A. Patwardhan
Processing context-based inquiries for knowledge retrieval

Patent number: 10769138

Abstract: Provided herein are techniques for processing a context-situated inquiry to provide results satisfying the inquiry. An inquiry and its supporting context are processed using natural language processing to determine an interpretation for the inquiry and context, and the interpretation is presented to receive corrective information for the interpretation. The corrective information is applied to the interpretation and content is retrieved based on the interpretation of inquiry and context to produce candidate results for the inquiry. Supplemental information is iteratively received to generate updated candidate results.

Type: Grant

Filed: June 13, 2017

Date of Patent: September 8, 2020

Assignee: International Business Machines Corporation

Inventors: Sugato Bagchi, Kenneth J. Barker, Branimir K. Boguraev, Mihaela A. Bornea, Adam R. Faulkner, Yanpeng Li, Siddharth A. Patwardhan, Sara Rosenthal
Self-training of question answering system using question profiles

Patent number: 10699215

Abstract: Mechanisms are provided to implement a self-training engine of a question and answer system. The self-training engine pairs an unanswered natural language question with portions of an electronic document to generate an unlabeled data set. The self-training engine trains a model based on a labeled data set comprising a finite number of question and answer pair data structures and applies the model to the unlabeled data set to identify an answer from the portions of the electronic document to the unanswered natural language question. The self-training engine modifies the labeled data set to include the identified answer and corresponding unanswered natural language question as an additional question and answer pair data structure. The self-training engine then trains the model based on the modified labeled data set.

Type: Grant

Filed: November 16, 2016

Date of Patent: June 30, 2020

Assignee: International Business Machines Corporation

Inventors: Murthy V. Devarakonda, Siddharth A. Patwardhan, Preethi Raghavan
Generating secondary questions in an introspective question answering system

Patent number: 10621880

Abstract: A method of generating secondary questions in a question-answer system. Missing information is identified from a corpus of data using a computerized device. The missing information comprises any information that improves confidence scores for candidate answers to a question. The computerized device automatically generates a plurality of hypotheses concerning the missing information. The computerized device automatically generates at least one secondary question based on each of the plurality of hypotheses. The hypotheses are ranked based on relative utility to determine an order in which the computerized device outputs the at least one secondary question to external sources to obtain responses.

Type: Grant

Filed: September 11, 2012

Date of Patent: April 14, 2020

Assignee: International Business Machines Corporation

Inventors: Branimir K. Boguraev, David W. Buchanan, Jennifer Chu-Carroll, David A. Ferrucci, Aditya A. Kalyanpur, James W. Murdock, IV, Siddharth A. Patwardhan
Generating secondary questions in an introspective question answering system

Patent number: 10614725

Abstract: A method of generating secondary questions in a question-answer system. Missing information is identified from a corpus of data using a computerized device. The missing information comprises any information that improves confidence scores for candidate answers to a question. The computerized device automatically generates a plurality of hypotheses concerning the missing information. The computerized device automatically generates at least one secondary question based on each of the plurality of hypotheses. The hypotheses are ranked based on relative utility to determine an order in which the computerized device outputs the at least one secondary question to external sources to obtain responses.

Type: Grant

Filed: September 11, 2012

Date of Patent: April 7, 2020

Assignee: International Business Machines Corporation

Inventors: Branimir K. Boguraev, David W. Buchanan, Jennifer Chu-Carroll, David A. Ferrucci, Aditya A. Kalyanpur, James W. Murdock, IV, Siddharth A. Patwardhan
Tone analysis of legal documents

Patent number: 10452780

Abstract: A computer-implemented method includes detecting a first set and a second set of citations to a legal case in a plurality of legal documents and a first legal document distinct from the plurality of legal documents, respectively. The computer-implemented method further includes determining tones corresponding to each citation in the first and second sets of citations. The computer-implemented method further includes determining a score for each tone in the first and second sets of tones. The computer-implemented method further includes aggregating a first and subset and a second of the first and second sets of citations, respectively. The computer-implemented method further includes generating an average score for the first and second subsets. The computer-implemented method further includes determining a degree of similarity between the first and second subsets based, in part, on a comparison of average scores. A corresponding computer program product and computer system are also disclosed.

Type: Grant

Filed: February 15, 2017

Date of Patent: October 22, 2019

Assignee: International Business Machines Corporation

Inventors: Hernan Badenes, Rosanna S. Mannan, Siddharth A. Patwardhan
TONE ANALYSIS OF LEGAL DOCUMENTS

Publication number: 20190311037

Abstract: A computer-implemented method includes detecting a first set and a second set of citations to a legal case in a plurality of legal documents and a first legal document distinct from the plurality of legal documents, respectively. The computer-implemented method further includes determining tones corresponding to each citation in the first and second sets of citations. The computer-implemented method further includes determining a score for each tone in the first and second sets of tones. The computer-implemented method further includes aggregating a first and subset and a second of the first and second sets of citations, respectively. The computer-implemented method further includes generating an average score for the first and second subsets. The computer-implemented method further includes determining a degree of similarity between the first and second subsets based, in part, on a comparison of average scores. A corresponding computer program product and computer system are also disclosed.

Type: Application

Filed: June 21, 2019

Publication date: October 10, 2019

Inventors: Hernan Badenes, Rosanna S. Mannan, Siddharth A. Patwardhan
Automated timeline completion using event progression knowledge base

Patent number: 10372822

Abstract: A mechanism is provided in a computing device configured with instructions executing on a processor of the computing device to implement a timeline generation system, for automated timeline completion. The timeline generation system executing on the processor of the computing device identifies a plurality of events in documents in a corpus of information. The timeline generation system places the plurality of events in a partial timeline data structure. The timeline generation system selects an event progression from an event progression knowledge base. The timeline generation system aligns the selected event progression to the partial timeline data structure. The timeline generation system identifies a set of events missing from the partial timeline data structure. The timeline generation system maps the set of events missing from the partial timeline data structure to the partial timeline based on the selected event progression to form a completed timeline data structure.

Type: Grant

Filed: June 3, 2016

Date of Patent: August 6, 2019

Assignee: International Business Machines Corporation

Inventors: Murthy V. Devarakonda, Siddharth A. Patwardhan, Preethi Raghavan
Annotation of legal documents with case citations

Patent number: 10373278

Abstract: A computer-implemented method includes detecting a first and second set of citations to a plurality of legal cases cited in a plurality of legal documents and a first legal document distinct from the plurality of legal documents, respectively. The computer-implemented method further includes determining a first and second set of tones corresponding to each citation in the first and second sets of citations, respectively. The computer-implemented method further includes determining a score for each tone in the first and second sets of tones, respectively. The computer-implemented method further includes detecting a first set of annotations corresponding to the first set of citations. The computer-implemented method further includes building a training model, based, at least in part, from at least a subset of the first set of annotations. The computer-implemented method further includes determining a second set of annotations corresponding to the second set of citations.

Type: Grant

Filed: February 15, 2017

Date of Patent: August 6, 2019

Assignee: International Business Machines Corporation

Inventors: Hernan Badenes, Rosanna S. Mannan, Siddharth A. Patwardhan
PROCESSING CONTEXT-BASED INQUIRIES FOR KNOWLEDGE RETRIEVAL

Publication number: 20180357272

Abstract: Provided herein are techniques for processing a context-situated inquiry to provide results satisfying the inquiry. An inquiry and its supporting context are processed using natural language processing to determine an interpretation for the inquiry and context, and the interpretation is presented to receive corrective information for the interpretation. The corrective information is applied to the interpretation and content is retrieved based on the interpretation of inquiry and context to produce candidate results for the inquiry. Supplemental information is iteratively received to generate updated candidate results.

Type: Application

Filed: June 13, 2017

Publication date: December 13, 2018

Inventors: Sugato Bagchi, Kenneth J. Barker, Branimir K. Boguraev, Mihaela A. Bornea, Adam R. Faulkner, Yanpeng Li, Siddharth A. Patwardhan, Sara Rosenthal
ADAPTABLE PROCESSING COMPONENTS

Publication number: 20180247221

Abstract: In configuring a processing system with an application made up of machine learning components, where the application has been trained on a set of training data, the application is executed on the processing system using another set of training data. Outputs of the application produced from the other set of training data identified that concur with ground truth data are identified. The components are adapted to produce outputs of the application that concur with the ground truth data using the identified outputs of the application.

Type: Application

Filed: February 28, 2017

Publication date: August 30, 2018

Inventors: Youngja Park, Siddharth A. Patwardhan
TONE ANALYSIS OF LEGAL DOCUMENTS

Publication number: 20180232358

Abstract: A computer-implemented method includes detecting a first set and a second set of citations to a legal case in a plurality of legal documents and a first legal document distinct from the plurality of legal documents, respectively. The computer-implemented method further includes determining tones corresponding to each citation in the first and second sets of citations. The computer-implemented method further includes determining a score for each tone in the first and second sets of tones. The computer-implemented method further includes aggregating a first and subset and a second of the first and second sets of citations, respectively. The computer-implemented method further includes generating an average score for the first and second subsets. The computer-implemented method further includes determining a degree of similarity between the first and second subsets based, in part, on a comparison of average scores. A corresponding computer program product and computer system are also disclosed.

Type: Application

Filed: February 15, 2017

Publication date: August 16, 2018

Inventors: Hernan Badenes, Rosanna S. Mannan, Siddharth A. Patwardhan
ANNOTATION OF LEGAL DOCUMENTS WITH CASE CITATIONS

Publication number: 20180232827

Abstract: A computer-implemented method includes detecting a first and second set of citations to a plurality of legal cases cited in a plurality of legal documents and a first legal document distinct from the plurality of legal documents, respectively. The computer-implemented method further includes determining a first and second set of tones corresponding to each citation in the first and second sets of citations, respectively. The computer-implemented method further includes determining a score for each tone in the first and second sets of tones, respectively. The computer-implemented method further includes detecting a first set of annotations corresponding to the first set of citations. The computer-implemented method further includes building a training model, based, at least in part, from at least a subset of the first set of annotations. The computer-implemented method further includes determining a second set of annotations corresponding to the second set of citations.

Type: Application

Filed: September 20, 2017

Publication date: August 16, 2018

Inventors: Hernan Badenes, Rosanna S. Mannan, Siddharth A. Patwardhan
ANNOTATION OF LEGAL DOCUMENTS WITH CASE CITATIONS

Publication number: 20180232826

Abstract: A computer-implemented method includes detecting a first and second set of citations to a plurality of legal cases cited in a plurality of legal documents and a first legal document distinct from the plurality of legal documents, respectively. The computer-implemented method further includes determining a first and second set of tones corresponding to each citation in the first and second sets of citations, respectively. The computer-implemented method further includes determining a score for each tone in the first and second sets of tones, respectively. The computer-implemented method further includes detecting a first set of annotations corresponding to the first set of citations. The computer-implemented method further includes building a training model, based, at least in part, from at least a subset of the first set of annotations. The computer-implemented method further includes determining a second set of annotations corresponding to the second set of citations.

Type: Application

Filed: February 15, 2017

Publication date: August 16, 2018

Inventors: Hernan Badenes, Rosanna S. Mannan, Siddharth A. Patwardhan
Robust and readily domain-adaptable natural language interface to databases

Patent number: 10042921

Abstract: An embodiment of the invention provides a method for including receiving a natural language query from a user with an interface, and generating multiple dependency parses of the natural language query with a parser device connected to the interface. The generating of the multiple dependency parses includes dividing the natural language query into multiple components, and creating a single dependency parse by connecting each component of the components with at least one other component of the components. A processor connected to the parser device applies rules to all of the multiple dependency parses to identify entities and relations in the natural language query.

Type: Grant

Filed: September 18, 2015

Date of Patent: August 7, 2018

Assignee: International Business Machines Corporation

Inventors: Branimir K. Boguraev, Elahe Khorasani, Vadim Sheinin, Siddharth A. Patwardhan, Petros Zerfos
Self-Training of Question Answering System Using Question Profiles

Publication number: 20180137433

Abstract: Mechanisms are provided to implement a self-training engine of a question and answer system. The self-training engine pairs an unanswered natural language question with portions of an electronic document to generate an unlabeled data set. The self-training engine trains a model based on a labeled data set comprising a finite number of question and answer pair data structures and applies the model to the unlabeled data set to identify an answer from the portions of the electronic document to the unanswered natural language question. The self-training engine modifies the labeled data set to include the identified answer and corresponding unanswered natural language question as an additional question and answer pair data structure. The self-training engine then trains the model based on the modified labeled data set.

Type: Application

Filed: November 16, 2016

Publication date: May 17, 2018

Inventors: Murthy V. Devarakonda, Siddharth A. Patwardhan, Preethi Raghavan
Identification of Related Electronic Medical Record Documents in a Question and Answer System

Publication number: 20180121603

Abstract: A contextually relevant patient information extractor is provided that receives an input question directed to medical information about a patient; analyzes a patient's electronic medical records (EMRs) to identify an initial entry in the patient's EMRs corresponding to a candidate answer to the input question; analyzes a context of the patient's EMRs based on the initial entry to identify entries in the patient's EMR that are contextually connected to the initial entry; performs question answering analysis on the initial entry and entries that are contextually connected to the initial entry to identify one or more candidate answers to the input question; and outputs a final answer to the input question based on the question answering analysis.

Type: Application

Filed: October 27, 2016

Publication date: May 3, 2018

Inventors: Murthy V. Devarakonda, Jennifer J. Liang, Siddharth A. Patwardhan, Preethi Raghavan

1 2 next