Patents by Inventor William G. Dubyak
William G. Dubyak has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 11550794Abstract: A candidate document is received, for example, by a document filter. A determination is made based on the content of the candidate document, whether the candidate document is relevant to a document corpus. A determination is made based on the content of the candidate document, whether the candidate document is novel with respect to the document corpus. In response to determining that the candidate document is relevant to the document corpus and novel with respect to the document corpus, the candidate document is added to the document corpus to make at least a portion of the content of the candidate document available for a response to a search query.Type: GrantFiled: June 26, 2019Date of Patent: January 10, 2023Assignee: International Business Machines CorporationInventors: Charles Evan Beller, William G Dubyak, Palani Sakthi, Kristen Maria Summers
-
Patent number: 11501006Abstract: Natural language processing is enhanced by linguistically extracting intelligence about a user. A history of user queries is analyzed by a natural language classifier to determine various user intents, and these intents are combined to form a user intent profile. The profile includes elements of sentiment, emotion and tone. The profile can be used in various ways including restricting access to documents in a collection, or refining a cognitive analysis of a query. For access restriction, a determination is made that the user intent is inconsistent with a document, and the user is denied access to the document. This determination involves a user intent score which is compared to a score of the document. For cognitive analysis, searching of reference documents is filtered by excluding documents based on the user intent. The searching includes a comparison of meta-data tags of the documents to the user intent.Type: GrantFiled: March 5, 2018Date of Patent: November 15, 2022Assignees: HYUNDAI MOTOR COMPANY, KIA CORPORATIONInventors: William G. Dubyak, Vijai Gandikota, Palani Sakthi
-
Patent number: 11379706Abstract: Embodiments are directed to interaction with an open-domain question and answer system by recognizing questions that are highly broad or abstract, and generating and processing a batch of questions expressing alternate, concrete instances of the more general, abstract question. Responses to all of the questions in the batch are considered as candidates, and the strongest general answers are returned. A weighted, ranked answer set, based on weighting individual concrete questions and scaling the answers proportional to the weight of the questions, is provided to the user. The approach, according to embodiments herein, addresses the need for responses to broad questions in which a response to any of a set of more concrete question instances may serve to answer the question.Type: GrantFiled: April 13, 2018Date of Patent: July 5, 2022Assignee: International Business Machines CorporationInventors: Charles E. Beller, William G. Dubyak, Palani Sakthi, Kristen M. Summers
-
Patent number: 11227113Abstract: Batch interaction with a computerized question answering system can produce an answer that more closely relates to a user's information need. A batch of questions can be generated interactively, and provides a context for a first question received from a user. The batch of questions includes or more additional questions which have terms with a nonsynonymous semantic relation to a first term in the first question. A question answering system can process the batch of questions to determine candidate answers. An answer to the first question can be determined based, at least in part, on a combined ranking of the candidate answers.Type: GrantFiled: January 20, 2016Date of Patent: January 18, 2022Assignee: International Business Machines CorporationInventors: Charles Evan Beller, William G Dubyak, Palani Sakthi, Kristen Maria Summers
-
Patent number: 11170660Abstract: Embodiments can provide a computer implemented method for harvesting training data for a training set for use by a system capable of answering questions, the system comprising a processor and a memory comprising instructions executed by the processor, the method comprising receiving, from a user, an input question; processing the input question and returning, to the user, a result set comprising one or more ranked hypotheses and one or more ranked evidence passages corresponding to the one or more ranked hypotheses; receiving, from the user, an indication that one of the one or more ranked hypotheses is to be designated a watched hypothesis; adding the input question and the watched hypothesis to a to-be-vetted question/answer (QA) pair set comprising one or more to-be-vetted QA pairs; vetting each of the one or more to-be-vetted QA pairs in the to-be-vetted QA pair set through a first-pass automatic vetting procedure; if a vetted QA pair passes the first-pass automatic vetting procedure, adding the vetted QAType: GrantFiled: September 29, 2017Date of Patent: November 9, 2021Assignee: International Business Machines CorporationInventors: Charles E. Beller, William G. Dubyak, Palani Sakthi, Kristen M. Summers
-
Patent number: 11170181Abstract: Dynamic semantic processing of text in a word processing application with engagement of question-answering system. A user provides a text stream to a computer system via an input source. The input text stream includes a first natural language statement. The system determines that the first natural language statement includes a fact-based component expressed in natural language form. The system identifies an initial span and an alternative span of the first natural language component, based on the determining. The system engages a question-answering (QA) system by providing the initial span and the alternative span of the first natural language component to the QA system, and by receiving, in response to the providing, a set of natural language results from the QA system. The system evaluates the initial span and the alternative span of the first natural language component based on the received natural language results.Type: GrantFiled: June 27, 2019Date of Patent: November 9, 2021Assignee: International Business Machines CorporationInventors: Charles E. Beller, William G. Dubyak, Palani Sakthi, Robert A. Sheets, Kristen M. Summers
-
Patent number: 11163804Abstract: Embodiments can provide a computer implemented method, in a data processing system comprising a processor and a memory comprising instructions which are executed by the processor to cause the processor to implement an enhanced corpus management system, the method comprising: identifying one or more functional domain categories; ingesting one or more incoming documents to form an open-domain corpus; for each functional domain category, identifying one or more representative documents to establish a seed sub-corpus; calculating a degree of fit score between each of the one or more incoming documents and the one or more established functional domain category seed sub-corpora; and assigning one or more of the incoming documents to one or more of the functional domain categories based upon the degree of fit score to create an enhanced corpus.Type: GrantFiled: May 3, 2019Date of Patent: November 2, 2021Assignee: International Business Machines CorporationInventors: Charles E. Beller, William G. Dubyak, Palani Sakthi, Kristen M. Summers
-
Publication number: 20210149936Abstract: Embodiments can provide a computer implemented method, in a data processing system comprising a processor and a memory comprising instructions which are executed by the processor to cause the processor to implement an improved search query generation system, the method comprising inputting a natural language question; parsing the natural language question into a parse tree; identifying argument positions comprising one or more argument position terms, wherein each argument position term is a single word; for each argument position: comparing a head term's discriminator score against a threshold discriminator score; and if the head term surpasses the threshold discriminator score, adding the head term as a required term to an improved search query; and outputting the improved search query.Type: ApplicationFiled: December 22, 2020Publication date: May 20, 2021Inventors: Charles E. Beller, Sean L. Bethard, William G. Dubyak, Alexander C. Tonetti, Sean T. Thatcher, Julie T. Yu
-
Patent number: 10956463Abstract: Embodiments can provide a computer implemented method, in a data processing system comprising a processor and a memory comprising instructions which are executed by the processor to cause the processor to implement an improved search query generation system, the method comprising inputting a natural language question; parsing the natural language question into a parse tree; identifying argument positions comprising one or more argument position terms; for each argument position: comparing a head term's discriminator score against a threshold discriminator score; and if the head term surpasses the threshold discriminator score, adding the head term as a required term to an improved search query; and outputting the improved search query.Type: GrantFiled: January 18, 2019Date of Patent: March 23, 2021Assignee: International Business Machines CorporationInventors: Charles E. Beller, Sean L. Bethard, William G. Dubyak, Alexander C. Tonetti, Sean T. Thatcher, Julie T. Yu
-
Patent number: 10891324Abstract: A method of retrieving information includes obtaining access to a plurality of documents comprising document terms, receiving a lexicon including a plurality of lexical terms, receiving a query, identifying a plurality of search terms in the query by decomposing the query, determining at least one match of the search terms to the lexical terms, determining a plurality of matches of the search terms to the document terms, and scoring each of the documents based on the at least one match to the lexical terms and the matches to the document terms.Type: GrantFiled: September 21, 2018Date of Patent: January 12, 2021Assignee: International Business Machines CorporationInventors: William G. Dubyak, Edward G. Katz, Brian L. Keith, Nicole O'Connor
-
Patent number: 10878033Abstract: An embodiment of the invention may include a method, computer program product and system for generating follow-up questions based on machine learning utilizing a computing device. The embodiment may include receiving an input question from a user. The embodiment may include parsing the received input question to extract input question components. Parsing utilizes natural language processing techniques. The embodiment may include executing trained question component models to predict follow-up question components. The extracted input question components are utilized as inputs to the trained question component models. The embodiment may include combining the predicted follow-up question components to generate one or more follow-up questions. The embodiment may include returning the one or more follow-up questions to the user.Type: GrantFiled: December 1, 2017Date of Patent: December 29, 2020Assignee: International Business Machines CorporationInventors: Mohamed N. Ahmed, Charles E. Beller, William G. Dubyak, Palani Sakthi, Kristen M. Summers, Andeep S. Toor
-
Patent number: 10803101Abstract: A method for recommending responses to emergent conditions is provided. The present invention may include receiving a query from a user. The present invention may also include retrieving a plurality of recommended responses for the received query from a plurality of entities and a plurality of relations stored in a graph-based knowledge resource. The present invention may further include presenting the retrieved plurality of recommended responses to the user.Type: GrantFiled: December 19, 2017Date of Patent: October 13, 2020Assignee: International Business Machines CorporationInventors: William G. Dubyak, Edward G. Katz, Nicole M. O'Connor
-
Patent number: 10803255Abstract: Natural language processing is enhanced by linguistically extracting intelligence about a user. A history of user queries is analyzed by a natural language classifier to determine various user intents, and these intents are combined to form a user intent profile. The profile includes elements of sentiment, emotion and tone. The profile can be used in various ways including restricting access to documents in a collection, or refining a cognitive analysis of a query. For access restriction, a determination is made that the user intent is inconsistent with a document, and the user is denied access to the document. This determination involves a user intent score which is compared to a score of the document. For cognitive analysis, searching of reference documents is filtered by excluding documents based on the user intent. The searching includes a comparison of meta-data tags of the documents to the user intent.Type: GrantFiled: March 5, 2018Date of Patent: October 13, 2020Assignee: International Business Machines CorporationInventors: William G. Dubyak, Vijai Gandikota, Palani Sakthi
-
Patent number: 10803100Abstract: A computer-implemented method, a computer program product, and a computer processing system are provided. The method includes identifying, by a processor using a topic identification system, topic information for a source topic-domain in a corpus. The method further includes extracting, by the processor, an entity from the source topic-domain. The method also includes tagging, by the processor, the entity with the topic information to obtain a tagged entity that includes a tag specifying the topic information. The method additionally includes storing the tagged entity in a memory device. The method further includes performing, by the processor, downstream processing of the tagged entity in a natural language processing pipeline using the tag of the tagged entity as an additional data point.Type: GrantFiled: November 30, 2017Date of Patent: October 13, 2020Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Christopher F. Ackermann, William G. Dubyak, Edward Graham Katz, Nicole O'Connor
-
Patent number: 10776409Abstract: A method, computer system, and a computer program product for recommending responses to emergent conditions is provided. The present invention may include receiving a query from a user. The present invention may also include retrieving a plurality of recommended responses for the received query from a plurality of entities and a plurality of relations stored in a graph-based knowledge resource. The present invention may further include presenting the retrieved plurality of recommended responses to the user.Type: GrantFiled: June 21, 2017Date of Patent: September 15, 2020Assignee: International Business Machines CorporationInventors: William G. Dubyak, Edward G. Katz, Nicole M. O'Connor
-
Patent number: 10621178Abstract: Embodiments are directed to an entity extraction and filtering method that enables a close search of documents to build filters necessary for near real-time monitoring of streaming sources of information. According to an embodiment, the entity extraction and filtering method operates based on the following parameters. First, a detection of an entity of interest warrants flagging an arriving article for analyst attention. Nothing more than a match may be required. The list of entities may be derived by an entity extractor from a corpus of data. Secondly, automatic updates may be utilized, so that exports are automatically updated to the filters. Thirdly, information flowing past the filters may update a static corpus whether or not they are flagged for an analyst or user. This allows for new relationships to be detected and extracted, and the filters subsequently updated.Type: GrantFiled: June 16, 2017Date of Patent: April 14, 2020Assignee: International Business Machines CorporationInventors: Charles E. Beller, William G. Dubyak, Joshua G. Hong, Brian L. Keith, Palani Sakthi, Kristen M. Summers
-
Patent number: 10621177Abstract: Embodiments are directed to an entity extraction and filtering system that enables a close search of documents to build filters necessary for near real-time monitoring of streaming sources of information. According to an embodiment, the entity extraction and filtering system operates based on the following parameters. First, a detection of an entity of interest warrants flagging an arriving article for analyst attention. Nothing more than a match may be required. The list of entities may be derived by an entity extractor from a corpus of data. Secondly, automatic updates may be utilized, so that exports are automatically updated to the filters. Thirdly, information flowing past the filters may update a static corpus whether or not they are flagged for an analyst or user. This allows for new relationships to be detected and extracted, and the filters subsequently updated.Type: GrantFiled: March 23, 2017Date of Patent: April 14, 2020Assignee: International Business Machines CorporationInventors: Charles E. Beller, William G. Dubyak, Joshua G. Hong, Brian L. Keith, Palani Sakthi, Kristen M. Summers
-
Publication number: 20200097599Abstract: A method of retrieving information includes obtaining access to a plurality of documents comprising document terms, receiving a lexicon including a plurality of lexical terms, receiving a query, identifying a plurality of search terms in the query by decomposing the query, determining at least one match of the search terms to the lexical terms, determining a plurality of matches of the search terms to the document terms, and scoring each of the documents based on the at least one match to the lexical terms and the matches to the document terms.Type: ApplicationFiled: September 21, 2018Publication date: March 26, 2020Inventors: WILLIAM G. DUBYAK, EDWARD G. KATZ, BRIAN L. KEITH, NICOLE O'CONNOR
-
Patent number: 10528660Abstract: Popular trends are predicted by leveraging the language of influencers as found in their electronic publications such as social media, blogs, etc. A list of influencers in a given field is curated along with a lexicon of the field which includes product names and associated modifiers. Natural language processing is performed on the current publications to identify a particular word combination based on syntactic relationships. The current usage frequency of the particular word combination is compared to a historical usage frequency derived from a baseline. If the current usage frequency is significantly higher, an alert is generated indicating that the particular word combination represents a candidate trend. The word combination may be a syntactic n-gram. The current usage frequency is based on a first, recent time window, and the historical usage frequency is based on a second time window preceding the first time window.Type: GrantFiled: December 2, 2017Date of Patent: January 7, 2020Assignee: International Business Machines CorporationInventors: William G. Dubyak, Joshua G. Hong, Brian L. Keith
-
Publication number: 20190324970Abstract: A candidate document is received, for example, by a document filter. A determination is made based on the content of the candidate document, whether the candidate document is relevant to a document corpus. A determination is made based on the content of the candidate document, whether the candidate document is novel with respect to the document corpus. In response to determining that the candidate document is relevant to the document corpus and novel with respect to the document corpus, the candidate document is added to the document corpus to make at least a portion of the content of the candidate document available for a response to a search query.Type: ApplicationFiled: June 26, 2019Publication date: October 24, 2019Inventors: Charles Evan Beller, William G. Dubyak, Palani Sakthi, Kristen Maria Summers