Patents by Inventor David Ferrucci

David Ferrucci has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20180046705
    Abstract: A system, method and computer program product for conducting questions and answers with deferred type evaluation based on any corpus of data. The method includes processing a query, which includes waiting until a “Type” (i.e., a descriptor) is determined and a candidate answer is provided. Then, a search is conducted for evidence that the candidate answer has the required Lexical Answer Type (LAT), e.g., as determined by a matching function that can leverage a parser, a semantic interpreter and/or a simple pattern matcher. Prior to or during candidate answer evaluation, a process is provided for extracting and storing collections of entity-type pairs from semi-structured text documents. During QA processing and candidate answer scoring, a process is implemented to match the query LAT against the lexical type of each provided candidate answer and generate a score judging a degree of match.
    Type: Application
    Filed: October 23, 2017
    Publication date: February 15, 2018
    Inventors: James W. Murdock, IV, James J. Fan, Christopher A. Welty, Pablo A. Duboue, David A. Ferrucci, Wlodek W. Zadrozny
  • Publication number: 20180025127
    Abstract: A decision-support system for medical diagnosis and treatment comprises software modules embodied on a computer readable medium, and the software modules comprise an input/output module and a question-answering module. The method receives patient case information using the input/output module, and generates a medical diagnosis or treatment query based on the patient case information and also generates a plurality of medical diagnosis or treatment answers for the query using the question-answering module. The method also calculates numerical values for multiple medical evidence dimensions from medical evidence sources for each of the answers using the question-answering module and also calculates a corresponding confidence value for each of the answers based on the numerical value of each evidence dimension using the question-answering module.
    Type: Application
    Filed: September 26, 2017
    Publication date: January 25, 2018
    Applicant: International Business Machines Corporation
    Inventors: Sugato Bagchi, David A. Ferrucci, Anthony T. Levas, Erik T. Mueller
  • Patent number: 9864818
    Abstract: A method, system and computer program product for generating answers to questions. In one embodiment, the method comprises receiving an input query, identifying a plurality of candidate answers to the query; and for at least one of these candidate answers, identifying at least one proof of the answer. This proof includes a series of premises, and a multitude of documents are identified that include references to the premises. A set of these documents is selected that include references to all of the premises. This set of documents is used to generate one or more scores for the one of the candidate answers. A defined procedure is applied to the candidate answers to determine a ranking for the answers, and this includes using the one or more scores for the at least one of the candidate answers in the defined procedure to determine the ranking for this one candidate answer.
    Type: Grant
    Filed: November 10, 2016
    Date of Patent: January 9, 2018
    Assignee: International Business Machines Corporation
    Inventors: Eric W. Brown, Jennifer Chu-Carroll, David A. Ferrucci, James W. Murdock, IV
  • Publication number: 20170371861
    Abstract: An architecture and processes enable computer learning and developing an understanding of arbitrary natural language text through collaboration with humans in the context of joint problem solving. The architecture ingests the text and then syntactically and semantically processes the text to infer an initial understanding of the text. The initial understanding is captured in a story model of semantic and frame structures. The story model is then tested through computer generated questions that are posed to humans through interactive dialog sessions. The knowledge gleaned from the humans is used to update the story model as well as the computing system's current world model of understanding. The process is repeated for multiple stories over time, enabling the computing system to grow in knowledge and thereby understand stories of increasingly higher reading comprehension levels.
    Type: Application
    Filed: June 24, 2016
    Publication date: December 28, 2017
    Inventors: Mike Barborak, David Buchanan, Greg Burnham, Jennifer Chu-Carroll, David Ferrucci, Aditya Kalyanpur, Adam Lally, Stefano Pacifico, Chang Wang
  • Patent number: 9852213
    Abstract: A method, system and computer program product for generating answers to questions. In one embodiment, the method comprises receiving an input query, decomposing the input query into a plurality of different subqueries, and conducting a search in one or more data sources to identify at least one candidate answer to each of the subqueries. A ranking function is applied to each of the candidate answers to determine a ranking for each of these candidate answers; and for each of the subqueries, one of the candidate answers to the subquery is selected based on this ranking. A logical synthesis component is applied to synthesize a candidate answer for the input query from the selected candidate answers to the subqueries. In one embodiment, the procedure applied by the logical synthesis component to synthesize the candidate answer for the input query is determined from the input query.
    Type: Grant
    Filed: May 2, 2016
    Date of Patent: December 26, 2017
    Assignee: International Business Machines Corporation
    Inventors: Eric W. Brown, Jennifer Chu-Carroll, David A. Ferrucci, Adam P. Lally, James W. Murdock, John M. Prager
  • Patent number: 9830381
    Abstract: A system, program product, and methodology automatically scores candidate answers to questions in a question and answer system. In the candidate answer scoring method, a processor device performs one or more of receiving one or more candidate answers associated with a query string, the candidates obtained from a data source having semi-structured content; identifying one or more documents with semi-structured content from the data source having a candidate answer; and for each identified document: extracting one or more entity structures embedded in the identified document; determining a number of the entity structures in the identified document that appear in the received input query; and, computing a score for a candidate answer in the document as a function of the number. Overall system efficiency is improved by giving the correct candidate answers higher scores through leveraging context-dependent structural information such as links to other documents and embedded tags.
    Type: Grant
    Filed: September 6, 2012
    Date of Patent: November 28, 2017
    Assignee: International Business Machines Corporation
    Inventors: James J. Fan, David A. Ferrucci
  • Patent number: 9805613
    Abstract: A method for providing adaptation to a question answering (QA) system having an associated plurality of trace data and a question-answer set. The method includes submitting a set of questions to the QA system; receiving back from the QA system a set of answers; comparing the set of answers to answers in the question-answer set; and generating the plurality of trace data based on the comparison, and an estimate of how much more training data is needed. The generating comprises (a) for each question type, successively sampling an increasing number of question-answer pairs from the question-answer set; (b) automatically training the QA system using the sampled question-answer pairs; (c) automatically computing a functional dependence of the QA system performance on the remaining questions relative to the size of the sample; and (d) extrapolating from the functional dependence a number of training question-answer pairs of each question type.
    Type: Grant
    Filed: September 15, 2015
    Date of Patent: October 31, 2017
    Assignee: International Business Machines Corporation
    Inventors: Sugato Bagchi, David A. Ferrucci, David C. Gondek, Anthony T. Levas, Wlodek W. Zadrozny
  • Patent number: 9798800
    Abstract: A system, method and computer program product for conducting questions and answers with deferred type evaluation based on any corpus of data. The method includes processing a query, which includes waiting until a “Type” (i.e., a descriptor) is determined and a candidate answer is provided. Then, a search is conducted for evidence that the candidate answer has the required Lexical Answer Type (LAT), e.g., as determined by a matching function that can leverage a parser, a semantic interpreter and/or a simple pattern matcher. Prior to or during candidate answer evaluation, a process is provided for extracting and storing collections of entity-type pairs from semi-structured text documents. During QA processing and candidate answer scoring, a process is implemented to match the query LAT against the lexical type of each provided candidate answer and generate a score judging a degree of match.
    Type: Grant
    Filed: September 21, 2011
    Date of Patent: October 24, 2017
    Assignee: International Business Machines Corporation
    Inventors: Pablo A. Duboue, James J. Fan, David A. Ferrucci, James W. Murdock, IV, Christopher A. Welty, Wlodek W. Zadrozny
  • Patent number: 9703861
    Abstract: System and method for providing answers to questions based on any corpus of data implements a method that generates a number of candidate passages from the corpus that answer an input query, and finds the correct resulting answer by collecting supporting evidence from the multiple passages. By analyzing all retrieved passages and their metadata in parallel, an output plurality of data structures is generated including candidate answers based upon the analyzing. Then, supporting passage retrieval operations are performed upon the set of candidate answers, and for each candidate answer, the data corpus is traversed to find those passages having the candidate answer in addition to query terms. All candidate answers are automatically scored by a plurality of scoring modules, each producing a module score. The module scores are processed to determine one or more query answers; and, a query response is generated based on the one or more query answers.
    Type: Grant
    Filed: May 21, 2014
    Date of Patent: July 11, 2017
    Assignee: International Business Machines Corporation
    Inventors: Eric W. Brown, David Ferrucci, Adam Lally, Wlodek W. Zadrozny
  • Patent number: 9690861
    Abstract: Methods, systems, and devices provide semantically relevant information by analyzing an Electronic Medical Record (EMR) having structured data and unstructured data. In the analysis, a first set of medical concepts is identified from the unstructured data in the EMR, and a second set of medical concepts is identified from the structured data in the EMR. Relationships between medical concepts in the first set of medical concepts and the second set of medical concepts are automatically identified in a medical ontology by such methods, systems, and devices.
    Type: Grant
    Filed: July 17, 2014
    Date of Patent: June 27, 2017
    Assignee: International Business Machines Corporation
    Inventors: Keerthana Boloor, Eric W. Brown, Murthy V. Devarakonda, David Ferrucci, John M. Prager
  • Publication number: 20170154043
    Abstract: A method, system and computer program product for generating answers to questions. In one embodiment, the method comprises receiving an input query, identifying a plurality of candidate answers to the query; and for at least one of these candidate answers, identifying at least one proof of the answer. This proof includes a series of premises, and a multitude of documents are identified that include references to the premises. A set of these documents is selected that include references to all of the premises. This set of documents is used to generate one or more scores for the one of the candidate answers. A defined procedure is applied to the candidate answers to determine a ranking for the answers, and this includes using the one or more scores for the at least one of the candidate answers in the defined procedure to determine the ranking for this one candidate answer.
    Type: Application
    Filed: February 14, 2017
    Publication date: June 1, 2017
    Inventors: Eric W. Brown, Jennifer Chu-Carroll, David A. Ferrucci, James W. Murdock, IV
  • Patent number: 9600601
    Abstract: A method, system and computer program product for generating answers to questions. In one embodiment, the method comprises receiving an input query, identifying a plurality of candidate answers to the query; and for at least one of these candidate answers, identifying at least one proof of the answer. This proof includes a series of premises, and a multitude of documents are identified that include references to the premises. A set of these documents is selected that include references to all of the premises. This set of documents is used to generate one or more scores for the one of the candidate answers. A defined procedure is applied to the candidate answers to determine a ranking for the answers, and this includes using the one or more scores for the at least one of the candidate answers in the defined procedure to determine the ranking for this one candidate answer.
    Type: Grant
    Filed: September 24, 2011
    Date of Patent: March 21, 2017
    Assignee: International Business Machines Corporation
    Inventors: Eric W. Brown, Jennifer Chu-Carroll, David A. Ferrucci, James W. Murdock, IV
  • Publication number: 20170060990
    Abstract: A method, system and computer program product for generating answers to questions. In one embodiment, the method comprises receiving an input query, identifying a plurality of candidate answers to the query; and for at least one of these candidate answers, identifying at least one proof of the answer. This proof includes a series of premises, and a multitude of documents are identified that include references to the premises. A set of these documents is selected that include references to all of the premises. This set of documents is used to generate one or more scores for the one of the candidate answers. A defined procedure is applied to the candidate answers to determine a ranking for the answers, and this includes using the one or more scores for the at least one of the candidate answers in the defined procedure to determine the ranking for this one candidate answer.
    Type: Application
    Filed: November 10, 2016
    Publication date: March 2, 2017
    Inventors: Eric W. Brown, Jennifer Chu-Carroll, David A. Ferrucci, James W. Murdock, IV
  • Publication number: 20170053020
    Abstract: A method, system and computer program product for generating answers to questions. In one embodiment, the method comprises receiving an input query; conducting a search to identify candidate answers to the input query, and producing a plurality of scores for each of the candidate answers. For each of the candidate answers, one of a plurality of candidate ranking functions is selected. This selected ranking function is applied to each of the candidate answers to determine a ranking for the candidate answer based on the scores for that candidate answer. One or more of the candidate answers is selected, based on the rankings for the candidate answers, as one or more answers to the input query. In an embodiment, the ranking function selection is performed using information about the question. In an embodiment, the ranking function selection is performed using information about each answer.
    Type: Application
    Filed: November 8, 2016
    Publication date: February 23, 2017
    Inventors: Eric W. Brown, David Ferrucci, James W. Murdock, IV
  • Patent number: 9529845
    Abstract: An aspect of candidate answer generation in a question answering system includes receiving at least one document or passage responsive to a search of an information source. The search is implemented based on a query formulated from a natural language query. A further aspect includes receiving provenance information for the at least one document or passage, searching a semi-structured source of information based on the provenance information, retrieving a substructure corresponding to the at least one document or passage from the semi-structured source of information, and returning the substructure as a candidate answer to the natural language query.
    Type: Grant
    Filed: May 26, 2015
    Date of Patent: December 27, 2016
    Assignee: International Business Machines Corporation
    Inventors: Jennifer Chu-Carroll, James J. Fan, David A. Ferrucci
  • Patent number: 9508038
    Abstract: A computer-implemented system, method and program product generates answers to questions in an input query text string. The method includes determining, by a programmed processor unit, a lexical answer type (LAT) string associated with an input query; automatically obtaining a candidate answer string to the input query from a data corpus; mapping the query LAT string to a first type string in a structured resource; mapping the candidate answer string to a second type string in the structured resource; and determining if the first type string and the second type string are disjointed; and scoring the candidate answer string based on the determination of the types being disjointed wherein the structured resource includes a semantic database providing ontological content.
    Type: Grant
    Filed: September 6, 2012
    Date of Patent: November 29, 2016
    Assignee: International Business Machines Corporation
    Inventors: David A. Ferrucci, Aditya Kalyanpur, James W. Murdock, IV, Christopher A. Welty, Wlodek W. Zadrozny
  • Patent number: 9507854
    Abstract: A method, system and computer program product for generating answers to questions. In one embodiment, the method comprises receiving an input query; conducting a search to identify candidate answers to the input query, and producing a plurality of scores for each of the candidate answers. For each of the candidate answers, one of a plurality of candidate ranking functions is selected. This selected ranking function is applied to each of the candidate answers to determine a ranking for the candidate answer based on the scores for that candidate answer. One or more of the candidate answers is selected, based on the rankings for the candidate answers, as one or more answers to the input query. In an embodiment, the ranking function selection is performed using information about the question. In an embodiment, the ranking function selection is performed using information about each answer.
    Type: Grant
    Filed: August 14, 2015
    Date of Patent: November 29, 2016
    Assignee: International Business Machines Corporation
    Inventors: Eric W. Brown, David A. Ferrucci, James W. Murdock, IV
  • Patent number: 9495481
    Abstract: A method, system and computer program product for generating answers to questions. In one embodiment, the method comprises receiving an input query, identifying a plurality of candidate answers to the query; and for at least one of these candidate answers, identifying at least one proof of the answer. This proof includes a series of premises, and a multitude of documents are identified that include references to the premises. A set of these documents is selected that include references to all of the premises. This set of documents is used to generate one or more scores for the one of the candidate answers. A defined procedure is applied to the candidate answers to determine a ranking for the answers, and this includes using the one or more scores for the at least one of the candidate answers in the defined procedure to determine the ranking for this one candidate answer.
    Type: Grant
    Filed: September 14, 2012
    Date of Patent: November 15, 2016
    Assignee: International Business Machines Corporation
    Inventors: Eric W. Brown, Jennifer Chu-Carroll, David A. Ferrucci, James W. Murdock, IV
  • Patent number: 9483731
    Abstract: In a method of answering questions and scoring answers, a title and at least one topical field are identified for a document. A field name and field content associated with the topical field are identified, and a title-oriented document is created by combining the title, the field name, and the field content associated with the topical field. For each title-oriented document, a term in the title is matched to previously established categories to produce a title concept identifier. The topical field is synthesized to produce a field concept identifier and a field content concept identifier. A question is received. The question topic term and the question content identifier are used to identify at least one question-matching relation instance. The title concept identifier of each question-matching relation instance is identified as a candidate answer to the question. Each candidate answer and a corresponding answer score are output.
    Type: Grant
    Filed: December 11, 2012
    Date of Patent: November 1, 2016
    Assignee: International Business Machines Corporation
    Inventors: Sugato Bagchi, David A. Ferrucci, Anthony T. Levas, Erik T. Mueller
  • Patent number: 9454603
    Abstract: An apparatus includes a data processing system for generating and displaying a semantic type concordance. The data processing system includes memory storing a computer program, a display to display data of a concordance generated by the program, and a processor configured to execute the computer program. The computer program includes instructions for displaying a user interface configured to enable a user to select semantic types and specify at least one text document, generating a concordance of the at least one document based on the semantic types, and displaying data of the generated concordance on the display.
    Type: Grant
    Filed: August 6, 2010
    Date of Patent: September 27, 2016
    Assignee: International Business Machines Corporation
    Inventors: Branimir K. Boguraev, Youssef Drissi, David A. Ferrucci, Paul T. Keyser, Anthony T. Levas
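
As an illustration of the semantic type concordance described in the abstract of patent 9454603 above, here is a minimal Python sketch. It is not the patented implementation, which the abstract does not detail. The TYPE_LEXICON table, the function names build_concordance and format_concordance, and the context window size are all hypothetical, chosen only to show the general idea: the user selects semantic types, and the program lists each matching term with its surrounding context, grouped by type.

```python
import re
from collections import defaultdict

# Hypothetical lexicon mapping surface terms to semantic types.
# A real system would obtain types from an annotator or ontology.
TYPE_LEXICON = {
    "aspirin": "Drug",
    "ibuprofen": "Drug",
    "headache": "Symptom",
    "fever": "Symptom",
}


def build_concordance(text, selected_types, lexicon=TYPE_LEXICON, window=3):
    """Return {semantic_type: [(left_context, keyword, right_context), ...]}.

    Only terms whose semantic type is in selected_types are included.
    """
    tokens = re.findall(r"\w+", text)
    concordance = defaultdict(list)
    for i, token in enumerate(tokens):
        sem_type = lexicon.get(token.lower())
        if sem_type in selected_types:
            left = " ".join(tokens[max(0, i - window):i])
            right = " ".join(tokens[i + 1:i + 1 + window])
            concordance[sem_type].append((left, token, right))
    return dict(concordance)


def format_concordance(concordance):
    """Render the concordance as keyword-in-context lines grouped by type."""
    lines = []
    for sem_type in sorted(concordance):
        lines.append(f"[{sem_type}]")
        for left, keyword, right in concordance[sem_type]:
            lines.append(f"{left:>30} | {keyword} | {right}")
    return "\n".join(lines)


if __name__ == "__main__":
    sample = "The patient took aspirin for a headache and later ibuprofen for fever."
    print(format_concordance(build_concordance(sample, {"Drug", "Symptom"})))
```

Grouping entries by semantic type rather than by literal keyword is what distinguishes this from an ordinary keyword concordance: "aspirin" and "ibuprofen" appear under one heading because they share the assumed type "Drug".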