Patents by Inventor Aditya A Kalyanpur

Aditya A Kalyanpur has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20150339299
    Abstract: A system and method for automatically mapping LATs and candidate answers to multiple taxonomies without a need to merge these taxonomies. The method includes using a syntactic analysis of a corpus to extract all type instances of the LAT. The extracted instances are then mapped to a given taxonomy and clustered in a set of supertypes. Each supertype receives a score based on the coverage of LAT instances in the corpus. The method includes mapping the candidate answer to the same taxonomy to determine if the candidate answer is an instance of a significant supertype. Then the score of a candidate answer is obtained by aggregating or taking a maximum of the score of the matched significant supertypes. This score evaluates the type match between the LAT and candidate answer for a taxonomy. Multiple taxonomies can be used to increase the chance of LAT and candidate answer mapping.
    Type: Application
    Filed: May 23, 2014
    Publication date: November 26, 2015
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Sugato Bagchi, Mihaela A. Bornea, James J. Fan, Aditya A. Kalyanpur, Christopher Welty
  • Patent number: 9092989
    Abstract: Methods/systems receive a question and automatically search sources of data containing passages to produce candidate answers to the question. The searching identifies passages that support each of the candidate answers based on scoring features that indicate whether the candidate answers are correct answers to the question. These methods/systems automatically create a scoring feature-specific matrix for each scoring feature. Each scoring feature-specific matrix has a score field for each different combination of text passage and question term (vector), and each score field holds a score value (vector value) indicating how each different combination of text passage and question term supports the candidate answers as being a correct answer to the question. Next, such methods/systems automatically combine multiple such vectors to produce a combined vector score for each of the candidate answers, and then rank the candidate answers based on the combined scores.
    Type: Grant
    Filed: November 30, 2012
    Date of Patent: July 28, 2015
    Assignee: International Business Machines Corporation
    Inventors: Apoorv Agarwal, Jennifer Chu-Carroll, Aditya A. Kalyanpur, Adam P. Lally, James W. Murdock, IV, Lorrie A. Tomek
  • Patent number: 9092988
    Abstract: Methods/systems receive a question and automatically search sources of data containing passages to produce candidate answers to the question. The searching identifies passages that support each of the candidate answers based on scoring features that indicate whether the candidate answers are correct answers to the question. These methods/systems automatically create a scoring feature-specific matrix for each scoring feature. Each scoring feature-specific matrix has a score field for each different combination of text passage and question term (vector), and each score field holds a score value (vector value) indicating how each different combination of text passage and question term supports the candidate answers as being a correct answer to the question. Next, such methods/systems automatically combine multiple such vectors to produce a combined vector score for each of the candidate answers, and then rank the candidate answers based on the combined scores.
    Type: Grant
    Filed: November 16, 2012
    Date of Patent: July 28, 2015
    Assignee: International Business Machines Corporation
    Inventors: Apoorv Agarwal, Jennifer Chu-Carroll, Aditya A. Kalyanpur, Adam P. Lally, James W. Murdock, IV, Lorrie A. Tomek
  • Patent number: 9037615
    Abstract: A computer-implemented method, system, and article of manufacture for querying and integrating structured and unstructured data. The method includes: receiving entity information that is extracted from a first set of unstructured data using an open domain information extraction system, wherein the entity information comprises relationship information between a first entity and a second entity of the first set of unstructured data; recognizing a pattern based on the relationship information and creating a schema for the first set of unstructured data based on the pattern; and associating an element of the created schema with (i) an entity of a second set of unstructured data or (ii) a schema element of an existing set of structured data if there is sufficient overall similarity between the created schema element and either the second unstructured data entity or the schema element of the existing structured data.
    Type: Grant
    Filed: June 11, 2012
    Date of Patent: May 19, 2015
    Assignee: International Business Machines Corporation
    Inventors: Mihaela Ancuta Bornea, Songyun Duan, James J. Fan, Achille Fokoue-Nkoutche, Alfio M. Gliozzo, Aditya Kalyanpur, Anastasios Kementsietsidis, Kavitha Srinivas, Michael J. Ward
  • Patent number: 9037452
    Abstract: Systems and methods automatically collect training data from manually created semantic relations, automatically extract rules from the training data to produce extracted rules, and automatically characterize existing semantic relations in the training data based on co-occurrence of the extracted rules in the existing semantic relations. Such systems and methods automatically construct semantic relation topics based on the characterization of the existing semantic relations, and group instances of the training data into the semantic relation topics to detect new semantic relations.
    Type: Grant
    Filed: March 16, 2012
    Date of Patent: May 19, 2015
    Assignee: AFRL/RIJ
    Inventors: James J. Fan, David Gondek, Aditya A. Kalyanpur, Chang Wang
  • Patent number: 9015031
    Abstract: In an automated Question Answer (QA) system architecture for automatic open-domain Question Answering, a system, method and computer program product for predicting the Lexical Answer Type (LAT) of a question. The approach is completely unsupervised and is based on a large-scale lexical knowledge base automatically extracted from a Web corpus. This approach for predicting the LAT can be implemented as a specific subtask of a QA process, and/or used for general purpose knowledge acquisition tasks such as frame induction from text.
    Type: Grant
    Filed: July 18, 2012
    Date of Patent: April 21, 2015
    Assignee: International Business Machines Corporation
    Inventors: David A. Ferrucci, Alfio M. Gliozzo, Aditya A. Kalyanpur
  • Patent number: 8972321
    Abstract: A system, a method and a computer program product for verifying a statement are provided. The system is configured to receive a statement. The system is configured to decompose the received statement into one or more sets of question and answer pairs. The system is configured to determine a confidence value of each answer in the one or more question and answer pair sets. The system is configured to combine the determined confidence values. The combined confidence values represent a probability that the received statement is evaluated as true.
    Type: Grant
    Filed: September 28, 2011
    Date of Patent: March 3, 2015
    Assignee: International Business Machines Corporation
    Inventors: David A. Ferrucci, David C. Gondek, Aditya A. Kalyanpur, Adam P. Lally, Siddharth Patwardhan
  • Patent number: 8959043
    Abstract: A system and a computer program product for verifying a statement are provided. The system is configured to receive a statement. The system is configured to decompose the received statement into one or more sets of question and answer pairs. The system is configured to determine a confidence value of each answer in the one or more question and answer pair sets. The system is configured to combine the determined confidence values. The combined confidence values represent a probability that the received statement is evaluated as true.
    Type: Grant
    Filed: August 31, 2012
    Date of Patent: February 17, 2015
    Assignee: International Business Machines Corporation
    Inventors: David A. Ferrucci, David C. Gondek, Aditya A. Kalyanpur, Adam P. Lally, Siddharth Patwardhan
  • Patent number: 8943051
    Abstract: A system, method and computer program product for automatically estimating the confidence of a detected LAT to provide a more accurate overall score for an obtained candidate answer. A confidence “score” or value of each detected LAT is obtained, and the system and method performs combining the confidence score with a degree of match between a LAT and an AnswerType of the candidate answer to provide improved overall score for the candidate answer.
    Type: Grant
    Filed: June 18, 2013
    Date of Patent: January 27, 2015
    Assignee: International Business Machines Corporation
    Inventors: James J. Fan, David A. Ferrucci, David C. Gondek, Aditya A. Kalyanpur, Adam P. Lally, James W. Murdock, Wlodek W. Zadrozny
  • Patent number: 8880388
    Abstract: In an automated Question Answer (QA) system architecture for automatic open-domain Question Answering, a system, method and computer program product for predicting the Lexical Answer Type (LAT) of a question. The approach is completely unsupervised and is based on a large-scale lexical knowledge base automatically extracted from a Web corpus. This approach for predicting the LAT can be implemented as a specific subtask of a QA process, and/or used for general purpose knowledge acquisition tasks such as frame induction from text.
    Type: Grant
    Filed: August 28, 2012
    Date of Patent: November 4, 2014
    Assignee: International Business Machines Corporation
    Inventors: David A. Ferrucci, Alfio M. Gliozzo, Aditya A. Kalyanpur
  • Publication number: 20140272904
    Abstract: In a method of answering questions, a question is received, a question LAT is determined, and a candidate answer to the question is identified. Preliminary types for the candidate answer are determined using first components to produce the preliminary types. Each of the first components produces a preliminary type using different methods. A first type-score representing a degree of match between the preliminary type and the question LAT is produced. Each preliminary type and each first type-score is evaluated using second components. Each of the second components produces a second score based on a combination of the first type-score and a measure of degree that the preliminary type matches the question LAT. The second components use different methods to produce the second score. A final score representing a degree of confidence that the candidate answer matches the question LAT is calculated based on the second score.
    Type: Application
    Filed: March 15, 2013
    Publication date: September 18, 2014
    Applicant: International Business Machines Corporation
    Inventors: Sugato Bagchi, James J. Fan, David A. Ferrucci, Aditya A. Kalyanpur, James W. Murdock, IV, Christopher A. Welty
  • Patent number: 8750630
    Abstract: An approach that provides hierarchical and index based watermarks represented as trees is described. In one embodiment, a watermark tree is formed from feature watermarks generated from a natural language processing (NLP) stack having NLP analytics. The watermark tree represents a hierarchical relationship between each of the feature watermarks. In particular, the watermark tree defines hierarchical pointers that point out inherited watermarks that exist between the feature watermarks according to the hierarchical relationship. Further, the watermark tree includes a time stamp specifying a time that a data set content residing in a corpus was accessed.
    Type: Grant
    Filed: July 13, 2012
    Date of Patent: June 10, 2014
    Assignee: International Business Machines Corporation
    Inventors: Aaron K. Baughman, Richard L. Darden, James J. Fan, Aditya A. Kalyanpur
  • Patent number: 8738365
    Abstract: Diffusing evidence among candidate answers during question answering may identify a relationship between a first candidate answer and a second candidate answer, wherein the candidate answers are generated by a question-answering computer process, the candidate answers have associated supporting evidence, and the candidate answers have associated confidence scores. All or some of the evidence may be transferred from the first candidate answer to the second candidate answer based on the identified relationship. A new confidence score may be computed for the second candidate answer based on the transferred evidence.
    Type: Grant
    Filed: September 14, 2012
    Date of Patent: May 27, 2014
    Assignee: International Business Machines Corporation
    Inventors: David A. Ferrucci, David C. Gondek, Aditya A. Kalyanpur, Adam P. Lally
  • Patent number: 8738362
    Abstract: Diffusing evidence among candidate answers during question answering may identify a relationship between a first candidate answer and a second candidate answer, wherein the candidate answers are generated by a question-answering computer process, the candidate answers have associated supporting evidence, and the candidate answers have associated confidence scores. All or some of the evidence may be transferred from the first candidate answer to the second candidate answer based on the identified relationship. A new confidence score may be computed for the second candidate answer based on the transferred evidence.
    Type: Grant
    Filed: September 23, 2011
    Date of Patent: May 27, 2014
    Assignee: International Business Machines Corporation
    Inventors: David A. Ferrucci, David C. Gondek, Aditya A. Kalyanpur, Adam P. Lally
  • Publication number: 20140141401
    Abstract: Methods/systems receive a question and automatically search sources of data containing passages to produce candidate answers to the question. The searching identifies passages that support each of the candidate answers based on scoring features that indicate whether the candidate answers are correct answers to the question. These methods/systems automatically create a scoring feature-specific matrix for each scoring feature. Each scoring feature-specific matrix has a score field for each different combination of text passage and question term (vector), and each score field holds a score value (vector value) indicating how each different combination of text passage and question term supports the candidate answers as being a correct answer to the question. Next, such methods/systems automatically combine multiple such vectors to produce a combined vector score for each of the candidate answers, and then rank the candidate answers based on the combined scores.
    Type: Application
    Filed: November 30, 2012
    Publication date: May 22, 2014
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Apoorv Agarwal, Jennifer Chu-Carroll, Aditya A. Kalyanpur, Adam P. Lally, James W. Murdock, IV, Lorrie A. Tomek
  • Publication number: 20140141399
    Abstract: Methods/systems receive a question and automatically search sources of data containing passages to produce candidate answers to the question. The searching identifies passages that support each of the candidate answers based on scoring features that indicate whether the candidate answers are correct answers to the question. These methods/systems automatically create a scoring feature-specific matrix for each scoring feature. Each scoring feature-specific matrix has a score field for each different combination of text passage and question term (vector), and each score field holds a score value (vector value) indicating how each different combination of text passage and question term supports the candidate answers as being a correct answer to the question. Next, such methods/systems automatically combine multiple such vectors to produce a combined vector score for each of the candidate answers, and then rank the candidate answers based on the combined scores.
    Type: Application
    Filed: November 16, 2012
    Publication date: May 22, 2014
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Apoorv Agarwal, Jennifer Chu-Carroll, Aditya A. Kalyanpur, Adam P. Lally, James W. Murdock, IV, Lorrie A. Tomek
  • Publication number: 20140072947
    Abstract: A method of generating secondary questions in a question-answer system. Missing information is identified from a corpus of data using a computerized device. The missing information comprises any information that improves confidence scores for candidate answers to a question. The computerized device automatically generates a plurality of hypotheses concerning the missing information. The computerized device automatically generates at least one secondary question based on each of the plurality of hypotheses. The hypotheses are ranked based on relative utility to determine an order in which the computerized device outputs the at least one secondary question to external sources to obtain responses.
    Type: Application
    Filed: September 11, 2012
    Publication date: March 13, 2014
    Applicant: International Business Machines Corporation
    Inventors: Branimir K. Boguraev, David W. Buchanan, Jennifer Chu-Carroll, David A. Ferrucci, Aditya A. Kalyanpur, James W. Murdock, IV, Siddharth A. Patwardhan
  • Publication number: 20140072948
    Abstract: A method of generating secondary questions in a question-answer system. Missing information is identified from a corpus of data using a computerized device. The missing information comprises any information that improves confidence scores for candidate answers to a question. The computerized device automatically generates a plurality of hypotheses concerning the missing information. The computerized device automatically generates at least one secondary question based on each of the plurality of hypotheses. The hypotheses are ranked based on relative utility to determine an order in which the computerized device outputs the at least one secondary question to external sources to obtain responses.
    Type: Application
    Filed: September 11, 2012
    Publication date: March 13, 2014
    Applicant: International Business Machines Corporation
    Inventors: Branimir K. Boguraev, David W. Buchanan, Jennifer Chu-Carroll, David A. Ferrucci, Aditya A. Kalyanpur, James W. Murdock, IV, Siddharth A. Patwardhan
  • Publication number: 20140016814
    Abstract: An approach that provides hierarchical and index based watermarks represented as trees is described. In one embodiment, a watermark tree is formed from feature watermarks generated from a natural language processing (NLP) stack having NLP analytics. The watermark tree represents a hierarchical relationship between each of the feature watermarks. In particular, the watermark tree defines hierarchical pointers that point out inherited watermarks that exist between the feature watermarks according to the hierarchical relationship. Further, the watermark tree includes a time stamp specifying a time that a data set content residing in a corpus was accessed.
    Type: Application
    Filed: July 13, 2012
    Publication date: January 16, 2014
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Aaron K. Baughman, Richard L. Darden, James J. Fan, Aditya A. Kalyanpur
  • Publication number: 20130332478
    Abstract: A computer-implemented method, system, and article of manufacture for querying and integrating structured and unstructured data. The method includes: receiving entity information that is extracted from a first set of unstructured data using an open domain information extraction system, wherein the entity information comprises relationship information between a first entity and a second entity of the first set of unstructured data; recognizing a pattern based on the relationship information and creating a schema for the first set of unstructured data based on the pattern; and associating an element of the created schema with (i) an entity of a second set of unstructured data or (ii) a schema element of an existing set of structured data if there is sufficient overall similarity between the created schema element and either the second unstructured data entity or the schema element of the existing structured data.
    Type: Application
    Filed: June 11, 2012
    Publication date: December 12, 2013
    Applicant: International Business Machines Corporation
    Inventors: Mihaela Ancuta Bornea, Songyun Duan, James J. Fan, Achille Fokoue-Nkoutche, Alfio M. Gliozzo, Aditya Kalyanpur, Anastasios Kementsietsidis, Kavitha Srinivas, Michael J. Ward
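
Illustrative sketch (not part of any filing): the type-matching steps described in the abstract of publication 20150339299 can be pictured with a small Python example. Everything here is an assumption made for illustration only: the toy taxonomy, the function names, the use of coverage fraction as the supertype score, and the 0.5 threshold for a "significant" supertype. The abstract does not specify any of these details, and this is not the patented implementation.

```python
from collections import Counter

# Hypothetical toy taxonomy: instance name -> set of supertypes.
TAXONOMY = {
    "Paris": {"city", "capital"},
    "Berlin": {"city", "capital"},
    "Lyon": {"city"},
    "France": {"country"},
}


def supertype_scores(lat_instances, taxonomy):
    """Score each supertype by the fraction of mapped LAT instances it covers.

    Assumption: "coverage" is a simple fraction; the abstract does not say.
    """
    mapped = [inst for inst in lat_instances if inst in taxonomy]
    if not mapped:
        return {}
    counts = Counter()
    for inst in mapped:
        counts.update(taxonomy[inst])
    return {stype: n / len(mapped) for stype, n in counts.items()}


def type_match_score(candidate, lat_instances, taxonomy,
                     threshold=0.5, aggregate=max):
    """Score how well a candidate answer matches the LAT in one taxonomy.

    Supertypes scoring below `threshold` (an assumed cutoff) are ignored.
    `aggregate` is max by default; sum is another option.
    """
    scores = supertype_scores(lat_instances, taxonomy)
    significant = {t: s for t, s in scores.items() if s >= threshold}
    matched = [significant[t]
               for t in taxonomy.get(candidate, ())
               if t in significant]
    return aggregate(matched) if matched else 0.0


def best_over_taxonomies(candidate, lat_instances, taxonomies, **kwargs):
    """Try several taxonomies without merging them; keep the best score."""
    return max(type_match_score(candidate, lat_instances, t, **kwargs)
               for t in taxonomies)


# Instances assumed to have been extracted from a corpus for the LAT "city".
lat_instances = ["Paris", "Berlin", "Lyon"]
paris_score = type_match_score("Paris", lat_instances, TAXONOMY)
france_score = type_match_score("France", lat_instances, TAXONOMY)
```

In this sketch "Paris" matches the supertype "city", which covers every extracted instance, while "France" matches no significant supertype and scores zero. Passing a list of taxonomies to `best_over_taxonomies` mirrors the abstract's point that several taxonomies can be consulted without merging them.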