Patents by Inventor Kristen M. Summers

Kristen M. Summers has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Leveraging extracted entity and relation data to automatically filter data streams

Patent number: 10621177

Abstract: Embodiments are directed to an entity extraction and filtering system that enables a close search of documents to build filters necessary for near real-time monitoring of streaming sources of information. According to an embodiment, the entity extraction and filtering system operates based on the following parameters. First, a detection of an entity of interest warrants flagging an arriving article for analyst attention. Nothing more than a match may be required. The list of entities may be derived by an entity extractor from a corpus of data. Secondly, automatic updates may be utilized, so that exports are automatically updated to the filters. Thirdly, information flowing past the filters may update a static corpus whether or not they are flagged for an analyst or user. This allows for new relationships to be detected and extracted, and the filters subsequently updated.

Type: Grant

Filed: March 23, 2017

Date of Patent: April 14, 2020

Assignee: International Business Machines Corporation

Inventors: Charles E. Beller, William G. Dubyak, Joshua G. Hong, Brian L. Keith, Palani Sakthi, Kristen M. Summers
Identifying nonsense passages in a question answering system based on domain specific policy

Patent number: 10585898

Abstract: A mechanism is provided in a data processing system for identifying nonsense passages. An annotator in a natural language processing pipeline configured to execute in the data processing system annotates an input passage in a corpus with linguistic features to form an annotated passage. A domain-specific policy is associated with a domain of the corpus. A metric counters component in the natural language processing pipeline counts a number of instances of each type of linguistic feature in the annotated passage to form a set of feature counts. The metric counters component of the natural language processing pipeline determines a value for a metric based on the set of feature counts. The metric is specified in the domain-specific policy. A comparator component of the natural language processing pipeline compares the value for the metric to a predetermined model threshold. The threshold is specified in the domain-specific policy.

Type: Grant

Filed: May 12, 2016

Date of Patent: March 10, 2020

Assignee: International Business Machines Corporation

Inventors: Charles E. Beller, Michael Drzewucki, Christopher Phipps, Kristen M. Summers, Julie T. Yu
Resolution of acronyms in question answering systems

Patent number: 10572597

Abstract: According to one embodiment, a method, computer system, and computer program product for acronym resolution is provided. The present invention may include receiving documents; identifying explicit expansions within the documents; receiving an input from a user; retrieving passages relevant to the received input from the documents; for each acronym within the one or more relevant passages, determining whether the acronym corresponds with explicit expansions within the relevant passages; for each of the acronyms that do not correspond with explicit expansions, determining whether the acronym corresponds with implicit expansions within the relevant passages; and for each of the acronyms that do not correspond with implicit expansions, determining whether the acronym corresponds with acronyms within a universal acronym list, and transmitting the one or more resolved acronyms to a question answering system.

Type: Grant

Filed: November 30, 2017

Date of Patent: February 25, 2020

Assignee: International Business Machines Corporation

Inventors: Christopher F. Ackermann, Charles E. Beller, Stephen A. Boxwell, Edward G. Katz, Kristen M. Summers
System and method for scoring the geographic relevance of answers in a deep question answering system based on geographic context of a candidate answer

Patent number: 10552461

Abstract: A method, system and a computer program product are provided for scoring candidate answers for geographic relevance by identifying document location information that is associated with a document, associating each token in the document with the document location information, and then comparing geographic foci identified for a candidate answer from the tokens with geographic foci identified for an input question to generate a geographic relevance score for the candidate answer to the input question.

Type: Grant

Filed: September 16, 2016

Date of Patent: February 4, 2020

Assignee: International Business Machines Corporation

Inventors: Edward G. Katz, Kristen M. Summers
DOCUMENT PREPARATION WITH ARGUMENTATION SUPPORT FROM A DEEP QUESTION ANSWERING SYSTEM

Publication number: 20190318001

Abstract: Dynamic semantic processing of text in a word processing application with engagement of question-answering system. A user provides a text stream to a computer system via an input source. The input text stream includes a first natural language statement. The system determines that the first natural language statement includes a fact-based component expressed in natural language form. The system identifies an initial span and an alternative span of the first natural language component, based on the determining. The system engages a question-answering (QA) system by providing the initial span and the alternative span of the first natural language component to the QA system, and by receiving, in response to the providing, a set of natural language results from the QA system. The system evaluates the initial span and the alternative span of the first natural language component based on the received natural language results.

Type: Application

Filed: June 27, 2019

Publication date: October 17, 2019

Inventors: Charles E. Beller, William G. Dubyak, Palani Sakthi, Robert A. Sheets, Kristen M. Summers
DISPERSED TEMPLATE-BASED BATCH INTERACTION WITH A QUESTION ANSWERING SYSTEM

Publication number: 20190318220

Abstract: Embodiments are directed to interaction with an open-domain question and answer system to seek answers to broad and general questions by providing templates and words or phrases to fill slots in the templates that specify alternative specific questions, the answer to any of which may serve the broader purpose. Responses to all of the questions in a batch are considered as candidates, with the strongest general answers being returned. The approach, according to embodiments herein, addresses the need for responses to broad questions in which the user is seeking any response to a known pattern. The user is able to articulate the question as a template that can be instantiated in many forms, and the user may specify how strongly the concrete answers indicate an answer to the underlying general question.

Type: Application

Filed: April 13, 2018

Publication date: October 17, 2019

Inventors: Charles E. Beller, William G. Dubyak, Palani Sakthi, Kristen M. Summers, Hayes McCormick, Jay Oakey
DISPERSED BATCH INTERACTION WITH A QUESTION ANSWERING SYSTEM

Publication number: 20190318221

Abstract: Embodiments are directed to interaction with an open-domain question and answer system by recognizing questions that are highly broad or abstract, and generating and processing a batch of questions expressing alternate, concrete instances of the more general, abstract question. Responses to all of the questions in the batch are considered as candidates, and the strongest general answers are returned. A weighted, ranked answer set, based on weighting individual concrete questions and scaling the answers proportional to the weight of the questions, is provided to the user. The approach, according to embodiments herein, addresses the need for responses to broad questions in which a response to any of a set of more concrete question instances may serve to answer the question.

Type: Application

Filed: April 13, 2018

Publication date: October 17, 2019

Inventors: Charles E. Beller, William G. Dubyak, Palani Sakthi, Kristen M. Summers
CORPUS MANAGEMENT BY AUTOMATIC CATEGORIZATION INTO FUNCTIONAL DOMAINS TO SUPPORT FACETED QUERYING

Publication number: 20190258652

Abstract: Embodiments can provide a computer implemented method, in a data processing system comprising a processor and a memory comprising instructions which are executed by the processor to cause the processor to implement an enhanced corpus management system, the method comprising: identifying one or more functional domain categories; ingesting one or more incoming documents to form an open-domain corpus; for each functional domain category, identifying one or more representative documents to establish a seed sub-corpus; calculating a degree of fit score between each of the one or more incoming documents and the one or more established functional domain category seed sub-corpora; and assigning one or more of the incoming documents to one or more of the functional domain categories based upon the degree of fit score to create an enhanced corpus.

Type: Application

Filed: May 3, 2019

Publication date: August 22, 2019

Inventors: Charles E. Beller, William G. Dubyak, Palani Sakthi, Kristen M. Summers
Document preparation with argumentation support from a deep question answering system

Patent number: 10387576

Abstract: Dynamic semantic processing of text in a word processing application with engagement of question-answering system. A user provides a text stream to a computer system via an input source. The input text stream includes a first natural language statement. The system determines that the first natural language statement includes a fact-based component expressed in natural language form. The system identifies an initial span and an alternative span of the first natural language component, based on the determining. The system engages a question-answering (QA) system by providing the initial span and the alternative span of the first natural language component to the QA system, and by receiving, in response to the providing, a set of natural language results from the QA system. The system evaluates the initial span and the alternative span of the first natural language component based on the received natural language results.

Type: Grant

Filed: November 30, 2017

Date of Patent: August 20, 2019

Assignee: International Business Machines Corporation

Inventors: Charles E. Beller, William G. Dubyak, Palani Sakthi, Robert A. Sheets, Kristen M. Summers
Corpus management by automatic categorization into functional domains to support faceted querying

Patent number: 10346442

Abstract: Embodiments can provide a computer implemented method, in a data processing system comprising a processor and a memory comprising instructions which are executed by the processor to cause the processor to implement an enhanced corpus management system, the method comprising: identifying one or more functional domain categories; ingesting one or more incoming documents to form an open-domain corpus; for each functional domain category, identifying one or more representative documents to establish a seed sub-corpus; calculating a degree of fit score between each of the one or more incoming documents and the one or more established functional domain category seed sub-corpora; and assigning one or more of the incoming documents to one or more of the functional domain categories based upon the degree of fit score to create an enhanced corpus.

Type: Grant

Filed: November 17, 2016

Date of Patent: July 9, 2019

Assignee: International Business Machines Corporation

Inventors: Charles E. Beller, William G. Dubyak, Palani Sakthi, Kristen M. Summers
SUGGESTING FOLLOW UP QUESTIONS FROM USER BEHAVIOR

Publication number: 20190171726

Abstract: An embodiment of the invention may include a method, computer program product and system for generating follow-up questions based on machine learning utilizing a computing device. The embodiment may include receiving an input question from a user. The embodiment may include parsing the received input question to extract input question components. Parsing utilizes natural language processing techniques. The embodiment may include executing trained question component models to predict follow-up question components. The extracted input question components are utilized as inputs to the trained question component models. The embodiment may include combining the predicted follow-up question components to generate one or more follow-up questions. The embodiment may include returning the one or more follow-up questions to the user.

Type: Application

Filed: December 1, 2017

Publication date: June 6, 2019

Inventors: Mohamed N. Ahmed, Charles E. Beller, WILLIAM G. DUBYAK, Palani Sakthi, Kristen M. Summers, Andeep S. Toor
DOCUMENT PREPARATION WITH ARGUMENTATION SUPPORT FROM A DEEP QUESTION ANSWERING SYSTEM

Publication number: 20190163745

Abstract: Dynamic semantic processing of text in a word processing application with engagement of question-answering system. A user provides a text stream to a computer system via an input source. The input text stream includes a first natural language statement. The system determines that the first natural language statement includes a fact-based component expressed in natural language form. The system identifies an initial span and an alternative span of the first natural language component, based on the determining. The system engages a question-answering (QA) system by providing the initial span and the alternative span of the first natural language component to the QA system, and by receiving, in response to the providing, a set of natural language results from the QA system. The system evaluates the initial span and the alternative span of the first natural language component based on the received natural language results.

Type: Application

Filed: November 30, 2017

Publication date: May 30, 2019

Inventors: Charles E. Beller, William G. Dubyak, Palani Sakthi, Robert A. Sheets, Kristen M. Summers
LATENT TOKEN REPRESENTATIONS FOR PASSAGE AND ANSWER SCORING IN QUESTION ANSWERING SYSTEMS

Publication number: 20190163789

Abstract: The present invention may receive the question, a plurality of the candidate answers, and a plurality of documents associated with the plurality of candidate answers in the natural language. Then the present invention may tokenize the question, the plurality of the candidate answers, and the plurality of the documents into a corresponding n-gram sequence. The present invention may map n-gram elements from the tokenized question to the n-gram elements of the plurality of the tokenized candidate answers and the plurality of the tokenized documents using the latent token representation technique. The present invention may score the plurality of tokenized candidate answers based on the latent token representation technique. Then, the present invention may determine the precise answer based on the plurality of scored candidate answers.

Type: Application

Filed: November 29, 2017

Publication date: May 30, 2019

Inventors: CHRISTOPHER F. ACKERMANN, Charles E. BELLER, STEPHEN A. BOXWELL, EDWARD G. KATZ, Kristen M. Summers
LEARNING USER SYNONYMS FROM SEQUENCED QUERY SESSIONS

Publication number: 20190163781

Abstract: A system and a method for determining user synonyms in a query processing system is disclosed. A first query is received from a user of the query processing system. The query is processed through the query processing system to generate results for the first query. The user then provides a second query. The system determines a contextual similarity between the first query and the second query. When the first query and the second query are determined to be contextually similar, a first term is identified in the first query that is different from a second term in the second query. Once identified, the system determines if the first term and the second term are not listed as synonyms in the thesaurus. If they are not listed as synonyms they can be added as synonyms to the thesaurus.

Type: Application

Filed: November 30, 2017

Publication date: May 30, 2019

Inventors: Christopher F. Ackermann, Charles E. Beller, Stephen A. Boxwell, Edward G. Katz, Kristen M. Summers
RESOLUTION OF ACRONYMS IN QUESTION ANSWERING SYSTEMS

Publication number: 20190163740

Abstract: According to one embodiment, a method, computer system, and computer program product for acronym resolution is provided. The present invention may include receiving documents; identifying explicit expansions within the documents; receiving an input from a user; retrieving passages relevant to the received input from the documents; for each acronym within the one or more relevant passages, determining whether the acronym corresponds with explicit expansions within the relevant passages; for each of the acronyms that do not correspond with explicit expansions, determining whether the acronym corresponds with implicit expansions within the relevant passages; and for each of the acronyms that do not correspond with implicit expansions, determining whether the acronym corresponds with acronyms within a universal acronym list, and transmitting the one or more resolved acronyms to a question answering system.

Type: Application

Filed: November 30, 2017

Publication date: May 30, 2019

Inventors: CHRISTOPHER F. ACKERMANN, Charles E. Beller, STEPHEN A. BOXWELL, EDWARD G. KATZ, KRISTEN M. SUMMERS
WATCHED HYPOTHESIS FOR DEEP QUESTION ANSWERING

Publication number: 20190147353

Abstract: In an approach to watching hypotheses in a deep question answering system, one or more processors receive a question from a user and generate a first result set based on the question. One or more processors receive a request from the user to watch one or more hypothesis answers in the first result set. One or more processors generate a second result set based on the question, where the second result set is generated at a later time than the first result set. One or more processors further determine a similarity score between a hypothesis answer in the second set of one or more hypothesis answers and the watched one or more hypothesis answers and, responsive to determining that the similarity score is below a predetermined threshold, one or more processors send a contradiction alert to the user indicating a potential alternative hypothesis.

Type: Application

Filed: November 15, 2017

Publication date: May 16, 2019

Inventors: Charles E. Beller, William G. Dubyak, Palani Sakthi, Kristen M. Summers
HARVESTING QUESTION/ANSWER TRAINING DATA FROM WATCHED HYPOTHESES IN A DEEP QA SYSTEM

Publication number: 20190103035

Abstract: Embodiments can provide a computer implemented method for harvesting training data for a training set for use by a system capable of answering questions, the system comprising a processor and a memory comprising instructions executed by the processor, the method comprising receiving, from a user, an input question; processing the input question and returning, to the user, a result set comprising one or more ranked hypotheses and one or more ranked evidence passages corresponding to the one or more ranked hypotheses; receiving, from the user, an indication that one of the one or more ranked hypotheses is to be designated a watched hypothesis; adding the input question and the watched hypothesis to a to-be-vetted question/answer (QA) pair set comprising one or more to-be-vetted QA pairs; vetting each of the one or more to-be-vetted QA pairs in the to-be-vetted QA pair set through a first-pass automatic vetting procedure; if a vetted QA pair passes the first-pass automatic vetting procedure, adding the vetted QA

Type: Application

Filed: September 29, 2017

Publication date: April 4, 2019

Inventors: Charles E. Beller, William G. Dubyak, Palani Sakthi, Kristen M. Summers
Post-processing for identifying nonsense passages in a question answering system

Patent number: 10169328

Abstract: A mechanism is provided in a data processing system for identifying nonsense passages. The mechanism annotates an input passage with linguistic features to form an annotated passage. The mechanism counts a number of instances of each type of linguistic feature in the annotated passage to form a set of feature counts. The mechanism determines a value for a metric based on the set of feature counts and compares the value for the metric to a predetermined model threshold. The mechanism identifies whether the input passage is a nonsense passage based on a result of the comparison.

Type: Grant

Filed: May 12, 2016

Date of Patent: January 1, 2019

Assignee: International Business Machines Corporation

Inventors: Charles E. Beller, Michael Drzewucki, Christopher Phipps, Kristen M. Summers, Julie T. Yu
LEVERAGING EXTRACTED ENTITY AND RELATION DATA TO AUTOMATICALLY FILTER DATA STREAMS

Publication number: 20180276284

Abstract: Embodiments are directed to an entity extraction and filtering method that enables a close search of documents to build filters necessary for near real-time monitoring of streaming sources of information. According to an embodiment, the entity extraction and filtering method operates based on the following parameters. First, a detection of an entity of interest warrants flagging an arriving article for analyst attention. Nothing more than a match may be required. The list of entities may be derived by an entity extractor from a corpus of data. Secondly, automatic updates may be utilized, so that exports are automatically updated to the filters. Thirdly, information flowing past the filters may update a static corpus whether or not they are flagged for an analyst or user. This allows for new relationships to be detected and extracted, and the filters subsequently updated.

Type: Application

Filed: June 16, 2017

Publication date: September 27, 2018

Inventors: Charles E. Beller, William G. Dubyak, Joshua G. Hong, Brian L. Keith, Palani Sakthi, Kristen M. Summers
LEVERAGING EXTRACTED ENTITY AND RELATION DATA TO AUTOMATICALLY FILTER DATA STREAMS

Publication number: 20180276279

Abstract: Embodiments are directed to an entity extraction and filtering system that enables a close search of documents to build filters necessary for near real-time monitoring of streaming sources of information. According to an embodiment, the entity extraction and filtering system operates based on the following parameters. First, a detection of an entity of interest warrants flagging an arriving article for analyst attention. Nothing more than a match may be required. The list of entities may be derived by an entity extractor from a corpus of data. Secondly, automatic updates may be utilized, so that exports are automatically updated to the filters. Thirdly, information flowing past the filters may update a static corpus whether or not they are flagged for an analyst or user. This allows for new relationships to be detected and extracted, and the filters subsequently updated.

Type: Application

Filed: March 23, 2017

Publication date: September 27, 2018

Inventors: Charles E. Beller, William G. Dubyak, Joshua G. Hong, Brian L. Keith, Palani Sakthi, Kristen M. Summers

prev 1 2 3 next