Patents by Inventor David Ferrucci
David Ferrucci has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Publication number: 20180046705Abstract: A system, method and computer program product for conducting questions and answers with deferred type evaluation based on any corpus of data. The method includes processing a query including waiting until a “Type” (i.e. a descriptor) is determined AND a candidate answer is provided. Then, a search is conducted to look (search) for evidence that the candidate answer has the required Lexical Answer Type (e.g., as determined by a matching function that can leverage a parser, a semantic interpreter and/or a simple pattern matcher). Prior to or during candidate answer evaluation, a process is provided for extracting and storing collections of entity-type pairs from semi-structured text documents. During QA processing and candidate answer scoring, a process is implemented to match the query LAT against the lexical type of each provided candidate answer and generate a score judging a degree of match.Type: ApplicationFiled: October 23, 2017Publication date: February 15, 2018Inventors: James W. Murdock, IV, James J. Fan, Christopher A. Welty, Pablo A. Duboue, David A. Ferrucci, Wlodek W. Zadrozny
-
Publication number: 20180025127Abstract: A decision-support system for medical diagnosis and treatment comprises software modules embodied on a computer readable medium, and the software modules comprise an input/output module and a question-answering module. The method receives patient case information using the input/output module, and generates a medical diagnosis or treatment query based on the patient case information and also generates a plurality of medical diagnosis or treatment answers for the query using the question-answering module. The method also calculates numerical values for multiple medical evidence dimensions from medical evidence sources for each of the answers using the question-answering module and also calculates a corresponding confidence value for each of the answers based on the numerical value of each evidence dimension using the question-answering module.Type: ApplicationFiled: September 26, 2017Publication date: January 25, 2018Applicant: International Business Machines CorporationInventors: Sugato Bagchi, David A. Ferrucci, Anthony T. Levas, Erik T. Mueller
-
Patent number: 9864818Abstract: A method, system and computer program product for generating answers to questions. In one embodiment, the method comprises receiving an input query, identifying a plurality of candidate answers to the query; and for at least one of these candidate answers, identifying at least one proof of the answer. This proof includes a series of premises, and a multitude of documents are identified that include references to the premises. A set of these documents is selected that include references to all of the premises. This set of documents is used to generate one or more scores for the one of the candidate answers. A defined procedure is applied to the candidate answers to determine a ranking for the answers, and this includes using the one or more scores for the at least one of the candidate answers in the defined procedure to determine the ranking for this one candidate answer.Type: GrantFiled: November 10, 2016Date of Patent: January 9, 2018Assignee: International Business Machines CorporationInventors: Eric W. Brown, Jennifer Chu-Carroll, David A. Ferrucci, James W. Murdock, IV
-
Publication number: 20170371861Abstract: An architecture and processes enable computer learning and developing an understanding of arbitrary natural language text through collaboration with humans in the context of joint problem solving. The architecture ingests the text and then syntactically and semantically processes the text to infer an initial understanding of the text. The initial understanding is captured in a story model of semantic and frame structures. The story model is then tested through computer generated questions that are posed to humans through interactive dialog sessions. The knowledge gleaned from the humans is used to update the story model as well as the computing system's current world model of understanding. The process is repeated for multiple stories over time, enabling the computing system to grow in knowledge and thereby understand stories of increasingly higher reading comprehension levels.Type: ApplicationFiled: June 24, 2016Publication date: December 28, 2017Inventors: Mike Barborak, David Buchanan, Greg Burnham, Jennifer Chu-Carroll, David Ferrucci, Aditya Kalyanpur, Adam Lally, Stefano Pacifico, Chang Wang
-
Patent number: 9852213Abstract: A method, system and computer program product for generating answers to questions. In one embodiment, the method comprises receiving an input query, decomposing the input query into a plurality of different subqueries, and conducting a search in one or more data sources to identify at least one candidate answer to each of the subqueries. A ranking function is applied to each of the candidate answers to determine a ranking for each of these candidate answers; and for each of the subqueries, one of the candidate answers to the subquery is selected based on this ranking. A logical synthesis component is applied to synthesize a candidate answer for the input query from the selected the candidate answers to the subqueries. In one embodiment, the procedure applied by the logical synthesis component to synthesize the candidate answer for the input query is determined from the input query.Type: GrantFiled: May 2, 2016Date of Patent: December 26, 2017Assignee: International Business Machines CorporationInventors: Eric W. Brown, Jennifer Chu-Carroll, David A. Ferrucci, Adam P. Lally, James W. Murdock, John M. Prager
-
Patent number: 9830381Abstract: A system, program product, and methodology automatically scores candidate answers to questions in a question and answer system. In the candidate answer scoring method, a processor device performs one or more of receiving one or more candidate answers associated with a query string, the candidates obtained from a data source having semi-structured content; identifying one or more documents with semi-structured content from the data source having a candidate answer; and for each identified document: extracting one or more entity structures embedded in the identified document; determining a number of the entity structures in the identified document that appear in the received input query; and, computing a score for a candidate answer in the document as a function of the number Overall system efficiency is improved by giving the correct candidate answers higher scores through leveraging context-dependent structural information such as links to other documents and embedded tags.Type: GrantFiled: September 6, 2012Date of Patent: November 28, 2017Assignee: International Business Machines CorporationInventors: James J. Fan, David A. Ferrucci
-
Patent number: 9805613Abstract: A method for providing adaptation to a question answering (QA) system having an associated plurality of trace data and a question-answer set. The method includes submitting a set of questions to the QA system; receiving back from the QA system a set of answers; comparing the set of answers to answers in the question-answer set; and generating the plurality of trace data based on comparison, and an estimate of how much more training data is needed. The generating comprises (a) for each question type, successively sample an increasing number of question-answer pairs from the question-answer set; (b) automatically train the QA system using the sampled question-answer pairs; (c) automatically compute a functional dependence of the QA system performance on the remaining questions relative to the size of the sample; and (d) extrapolate from the functional dependence a number of training question-answer pairs of each question type.Type: GrantFiled: September 15, 2015Date of Patent: October 31, 2017Assignee: International Business Machines CorporationInventors: Sugato Bagchi, David A. Ferrucci, David C. Gondek, Anthony T. Levas, Wlodek W. Zadrozny
-
Patent number: 9798800Abstract: A system, method and computer program product for conducting questions and answers with deferred type evaluation based on any corpus of data. The method includes processing a query including waiting until a “Type” (i.e. a descriptor) is determined AND a candidate answer is provided. Then, a search is conducted to look (search) for evidence that the candidate answer has the required Lexical Answer Type (e.g., as determined by a matching function that can leverage a parser, a semantic interpreter and/or a simple pattern matcher). Prior to or during candidate answer evaluation, a process is provided for extracting and storing collections of entity-type pairs from semi-structured text documents. During QA processing and candidate answer scoring, a process is implemented to match the query LAT against the lexical type of each provided candidate answer and generate a score judging a degree of match.Type: GrantFiled: September 21, 2011Date of Patent: October 24, 2017Assignee: International Business Machines CorporationInventors: Pablo A. Duboue, James J. Fan, David A. Ferrucci, James W. Murdock, IV, Christopher A. Welty, Wlodek W. Zadrozny
-
Patent number: 9703861Abstract: System and method for providing answers to questions based on any corpus of data implements a method that generates a number of candidate passages from the corpus that answer an input query, and finds the correct resulting answer by collecting supporting evidence from the multiple passages. By analyzing all retrieved passages and that passage's metadata in parallel, an output plurality of data structures is generated including candidate answers based upon the analyzing. Then, supporting passage retrieval operations are performed upon the set of candidate answers, and for each candidate answer, the data corpus is traversed to find those passages having candidate answer in addition to query terms. All candidate answers are automatically scored by a plurality of scoring modules, each producing a module score. The modules scores are processed to determine one or more query answers; and, a query response is generated based on the one or more query answers.Type: GrantFiled: May 21, 2014Date of Patent: July 11, 2017Assignee: International Business Machines CorporationInventors: Eric W. Brown, David Ferrucci, Adam Lally, Wlodek W. Zadrozny
-
Patent number: 9690861Abstract: Methods, systems, and devices provide semantically relevant information by analyzing an Electronic Medical Record (EMR) having structured data and unstructured data. In the analysis, a first set of medical concepts is identified from the unstructured data in the EMR, and a second set of medical concepts is identified from the structured data in the EMR. Relationships between medical concepts in the first set of medical concepts and the second set of medical concepts are automatically identified in a medical ontology by such methods, systems, and devices.Type: GrantFiled: July 17, 2014Date of Patent: June 27, 2017Assignee: International Business Machines CorporationInventors: Keerthana Boloor, Eric W. Brown, Murthy V. Devarakonda, David Ferrucci, John M. Prager
-
Publication number: 20170154043Abstract: A method, system and computer program product for generating answers to questions. In one embodiment, the method comprises receiving an input query, identifying a plurality of candidate answers to the query; and for at least one of these candidate answers, identifying at least one proof of the answer. This proof includes a series of premises, and a multitude of documents are identified that include references to the premises. A set of these documents is selected that include references to all of the premises. This set of documents is used to generate one or more scores for the one of the candidate answers. A defined procedure is applied to the candidate answers to determine a ranking for the answers, and this includes using the one or more scores for the at least one of the candidate answers in the defined procedure to determine the ranking for this one candidate answer.Type: ApplicationFiled: February 14, 2017Publication date: June 1, 2017Inventors: Eric W. Brown, Jennifer Chu-Carroll, David A. Ferrucci, James W. Murdock, IV
-
Patent number: 9600601Abstract: A method, system and computer program product for generating answers to questions. In one embodiment, the method comprises receiving an input query, identifying a plurality of candidate answers to the query; and for at least one of these candidate answers, identifying at least one proof of the answer. This proof includes a series of premises, and a multitude of documents are identified that include references to the premises. A set of these documents is selected that include references to all of the premises. This set of documents is used to generate one or more scores for the one of the candidate answers. A defined procedure is applied to the candidate answers to determine a ranking for the answers, and this includes using the one or more scores for the at least one of the candidate answers in the defined procedure to determine the ranking for this one candidate answer.Type: GrantFiled: September 24, 2011Date of Patent: March 21, 2017Assignee: International Business Machines CorporationInventors: Eric W. Brown, Jennifer Chu-Carroll, David A. Ferrucci, James W. Murdock, IV
-
Publication number: 20170060990Abstract: A method, system and computer program product for generating answers to questions. In one embodiment, the method comprises receiving an input query, identifying a plurality of candidate answers to the query; and for at least one of these candidate answers, identifying at least one proof of the answer. This proof includes a series of premises, and a multitude of documents are identified that include references to the premises. A set of these documents is selected that include references to all of the premises. This set of documents is used to generate one or more scores for the one of the candidate answers. A defined procedure is applied to the candidate answers to determine a ranking for the answers, and this includes using the one or more scores for the at least one of the candidate answers in the defined procedure to determine the ranking for this one candidate answer.Type: ApplicationFiled: November 10, 2016Publication date: March 2, 2017Inventors: Eric W. Brown, Jennifer Chu-Carroll, David A. Ferrucci, James W. Murdock, IV
-
Publication number: 20170053020Abstract: A method, system and computer program product for generating answers to questions. In one embodiment, the method comprises receiving an input query; conducting a search to identify candidate answers to the input query, and producing a plurality of scores for each of the candidate answers. For each of the candidate answers, one, of a plurality of candidate ranking functions, is selected. This selected ranking function is applied to the each of the candidate answers to determine a ranking for the candidate answer based on the scores for that candidate answer. One or more of the candidate answers is selected, based on the rankings for the candidate answers, as one or more answers to the input query. In an embodiment, the ranking function selection is performed using information about the question. In an embodiment, the ranking function selection is performed using information about each answer.Type: ApplicationFiled: November 8, 2016Publication date: February 23, 2017Inventors: Eric W. Brown, David Ferrucci, James W. Murdock, IV
-
Patent number: 9529845Abstract: An aspect of candidate answer generation in a question answering system includes receiving at least one document or passage responsive to a search of an information source. The search is implemented based on a query formulated from a natural language query. A further aspect includes receiving provenance information for the at least one document or passage, searching a semi-structured source of information based on the provenance information, retrieving a substructure corresponding to the at least one document or passage from the semi-structured source of information, and returning the substructure as a candidate answer to the natural language query.Type: GrantFiled: May 26, 2015Date of Patent: December 27, 2016Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Jennifer Chu-Carroll, James J. Fan, David A. Ferrucci
-
Patent number: 9508038Abstract: A computer-implemented system, method and program product generates answers to questions in an input query text string. The method includes determining, by a programmed processor unit, a lexical answer type (LAT) string associated with an input query; automatically obtaining a candidate answer string to the input query from a data corpus; mapping the query LAT string to a first type string in a structured resource; mapping the candidate answer string to a second type string in the structured resource; and determining if the first type string and the second type string are disjointed; and scoring the candidate answer string based on the determination of the types being disjointed wherein the structured resource includes a semantic database providing ontological content.Type: GrantFiled: September 6, 2012Date of Patent: November 29, 2016Assignee: International Business Machines CorporationInventors: David A. Ferrucci, Aditya Kalyanpur, James W. Murdock, IV, Christopher A. Welty, Wlodek W. Zadrozny
-
Patent number: 9507854Abstract: A method, system and computer program product for generating answers to questions. In one embodiment, the method comprises receiving an input query; conducting a search to identify candidate answers to the input query, and producing a plurality of scores for each of the candidate answers. For each of the candidate answers, one, of a plurality of candidate ranking functions, is selected. This selected ranking function is applied to the each of the candidate answers to determine a ranking for the candidate answer based on the scores for that candidate answer. One or more of the candidate answers is selected, based on the rankings for the candidate answers, as one or more answers to the input query. In an embodiment, the ranking function selection is performed using information about the question. In an embodiment, the ranking function selection is performed using information about each answer.Type: GrantFiled: August 14, 2015Date of Patent: November 29, 2016Assignee: International Business Machines CorporationInventors: Eric W. Brown, David A. Ferrucci, James W. Murdock, IV
-
Patent number: 9495481Abstract: A method, system and computer program product for generating answers to questions. In one embodiment, the method comprises receiving an input query, identifying a plurality of candidate answers to the query; and for at least one of these candidate answers, identifying at least one proof of the answer. This proof includes a series of premises, and a multitude of documents are identified that include references to the premises. A set of these documents is selected that include references to all of the premises. This set of documents is used to generate one or more scores for the one of the candidate answers. A defined procedure is applied to the candidate answers to determine a ranking for the answers, and this includes using the one or more scores for the at least one of the candidate answers in the defined procedure to determine the ranking for this one candidate answer.Type: GrantFiled: September 14, 2012Date of Patent: November 15, 2016Assignee: International Business Machines CorporationInventors: Eric W. Brown, Jennifer Chu-Carroll, David A. Ferrucci, James W. Murdock, IV
-
Patent number: 9483731Abstract: In a method of answering questions and scoring answers, a title and at least one topical field are identified for a document. A field name and field content associated with the topical field is identified, and a title-oriented document is created by combining the title, the field name, and the field content associated with the topical field. For each title-oriented document, a term in the title is matched to previously established categories to produce a title concept identifier. The topical field is synthesized to produce a field concept identifier and a field content concept identifier. A question is received. The question topic term and the question content identifier are used to identify at least one question-matching relation instance. The title concept identifier of each question-matching relation instance is identified as a candidate answer to the question. Each candidate answer and a corresponding answer score is output.Type: GrantFiled: December 11, 2012Date of Patent: November 1, 2016Assignee: International Business Machines CorporationInventors: Sugato Bagchi, David A. Ferrucci, Anthony T. Levas, Erik T. Mueller
-
Patent number: 9454603Abstract: An apparatus includes a data processing system for generating and displaying a semantic type concordance. The data processing system includes memory storing a computer program, a display to display data of a concordance generated by the program, and a processor configured to execute the computer program. The computer program includes instructions for displaying a user interface configured to enable a user to select semantic types and specify at least one text document, generating a concordance of the at least one document based on the semantic types, and displaying data of the generated concordance on the display.Type: GrantFiled: August 6, 2010Date of Patent: September 27, 2016Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Branimir K. Boguraev, Youssef Drissi, David A. Ferrucci, Paul T. Keyser, Anthony T. Levas