Patents by Inventor William G. Dubyak

William G. Dubyak has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11550794
    Abstract: A candidate document is received, for example, by a document filter. A determination is made based on the content of the candidate document, whether the candidate document is relevant to a document corpus. A determination is made based on the content of the candidate document, whether the candidate document is novel with respect to the document corpus. In response to determining that the candidate document is relevant to the document corpus and novel with respect to the document corpus, the candidate document is added to the document corpus to make at least a portion of the content of the candidate document available for a response to a search query.
    Type: Grant
    Filed: June 26, 2019
    Date of Patent: January 10, 2023
    Assignee: International Business Machines Corporation
    Inventors: Charles Evan Beller, William G Dubyak, Palani Sakthi, Kristen Maria Summers
  • Patent number: 11501006
    Abstract: Natural language processing is enhanced by linguistically extracting intelligence about a user. A history of user queries is analyzed by a natural language classifier to determine various user intents, and these intents are combined to form a user intent profile. The profile includes elements of sentiment, emotion and tone. The profile can be used in various ways including restricting access to documents in a collection, or refining a cognitive analysis of a query. For access restriction, a determination is made that the user intent is inconsistent with a document, and the user is denied access to the document. This determination involves a user intent score which is compared to a score of the document. For cognitive analysis, searching of reference documents is filtered by excluding documents based on the user intent. The searching includes a comparison of meta-data tags of the documents to the user intent.
    Type: Grant
    Filed: March 5, 2018
    Date of Patent: November 15, 2022
    Assignees: HYUNDAI MOTOR COMPANY, KIA CORPORATION
    Inventors: William G. Dubyak, Vijai Gandikota, Palani Sakthi
  • Patent number: 11379706
    Abstract: Embodiments are directed to interaction with an open-domain question and answer system by recognizing questions that are highly broad or abstract, and generating and processing a batch of questions expressing alternate, concrete instances of the more general, abstract question. Responses to all of the questions in the batch are considered as candidates, and the strongest general answers are returned. A weighted, ranked answer set, based on weighting individual concrete questions and scaling the answers proportional to the weight of the questions, is provided to the user. The approach, according to embodiments herein, addresses the need for responses to broad questions in which a response to any of a set of more concrete question instances may serve to answer the question.
    Type: Grant
    Filed: April 13, 2018
    Date of Patent: July 5, 2022
    Assignee: International Business Machines Corporation
    Inventors: Charles E. Beller, William G. Dubyak, Palani Sakthi, Kristen M. Summers
  • Patent number: 11227113
    Abstract: Batch interaction with a computerized question answering system can produce an answer that more closely relates to a user's information need. A batch of questions can be generated interactively, and provides a context for a first question received from a user. The batch of questions includes or more additional questions which have terms with a nonsynonymous semantic relation to a first term in the first question. A question answering system can process the batch of questions to determine candidate answers. An answer to the first question can be determined based, at least in part, on a combined ranking of the candidate answers.
    Type: Grant
    Filed: January 20, 2016
    Date of Patent: January 18, 2022
    Assignee: International Business Machines Corporation
    Inventors: Charles Evan Beller, William G Dubyak, Palani Sakthi, Kristen Maria Summers
  • Patent number: 11170660
    Abstract: Embodiments can provide a computer implemented method for harvesting training data for a training set for use by a system capable of answering questions, the system comprising a processor and a memory comprising instructions executed by the processor, the method comprising receiving, from a user, an input question; processing the input question and returning, to the user, a result set comprising one or more ranked hypotheses and one or more ranked evidence passages corresponding to the one or more ranked hypotheses; receiving, from the user, an indication that one of the one or more ranked hypotheses is to be designated a watched hypothesis; adding the input question and the watched hypothesis to a to-be-vetted question/answer (QA) pair set comprising one or more to-be-vetted QA pairs; vetting each of the one or more to-be-vetted QA pairs in the to-be-vetted QA pair set through a first-pass automatic vetting procedure; if a vetted QA pair passes the first-pass automatic vetting procedure, adding the vetted QA
    Type: Grant
    Filed: September 29, 2017
    Date of Patent: November 9, 2021
    Assignee: International Business Machines Corporation
    Inventors: Charles E. Beller, William G. Dubyak, Palani Sakthi, Kristen M. Summers
  • Patent number: 11170181
    Abstract: Dynamic semantic processing of text in a word processing application with engagement of question-answering system. A user provides a text stream to a computer system via an input source. The input text stream includes a first natural language statement. The system determines that the first natural language statement includes a fact-based component expressed in natural language form. The system identifies an initial span and an alternative span of the first natural language component, based on the determining. The system engages a question-answering (QA) system by providing the initial span and the alternative span of the first natural language component to the QA system, and by receiving, in response to the providing, a set of natural language results from the QA system. The system evaluates the initial span and the alternative span of the first natural language component based on the received natural language results.
    Type: Grant
    Filed: June 27, 2019
    Date of Patent: November 9, 2021
    Assignee: International Business Machines Corporation
    Inventors: Charles E. Beller, William G. Dubyak, Palani Sakthi, Robert A. Sheets, Kristen M. Summers
  • Patent number: 11163804
    Abstract: Embodiments can provide a computer implemented method, in a data processing system comprising a processor and a memory comprising instructions which are executed by the processor to cause the processor to implement an enhanced corpus management system, the method comprising: identifying one or more functional domain categories; ingesting one or more incoming documents to form an open-domain corpus; for each functional domain category, identifying one or more representative documents to establish a seed sub-corpus; calculating a degree of fit score between each of the one or more incoming documents and the one or more established functional domain category seed sub-corpora; and assigning one or more of the incoming documents to one or more of the functional domain categories based upon the degree of fit score to create an enhanced corpus.
    Type: Grant
    Filed: May 3, 2019
    Date of Patent: November 2, 2021
    Assignee: International Business Machines Corporation
    Inventors: Charles E. Beller, William G. Dubyak, Palani Sakthi, Kristen M. Summers
  • Publication number: 20210149936
    Abstract: Embodiments can provide a computer implemented method, in a data processing system comprising a processor and a memory comprising instructions which are executed by the processor to cause the processor to implement an improved search query generation system, the method comprising inputting a natural language question; parsing the natural language question into a parse tree; identifying argument positions comprising one or more argument position terms, wherein each argument position term is a single word; for each argument position: comparing a head term's discriminator score against a threshold discriminator score; and if the head term surpasses the threshold discriminator score, adding the head term as a required term to an improved search query; and outputting the improved search query.
    Type: Application
    Filed: December 22, 2020
    Publication date: May 20, 2021
    Inventors: Charles E. Beller, Sean L. Bethard, William G. Dubyak, Alexander C. Tonetti, Sean T. Thatcher, Julie T. Yu
  • Patent number: 10956463
    Abstract: Embodiments can provide a computer implemented method, in a data processing system comprising a processor and a memory comprising instructions which are executed by the processor to cause the processor to implement an improved search query generation system, the method comprising inputting a natural language question; parsing the natural language question into a parse tree; identifying argument positions comprising one or more argument position terms; for each argument position: comparing a head term's discriminator score against a threshold discriminator score; and if the head term surpasses the threshold discriminator score, adding the head term as a required term to an improved search query; and outputting the improved search query.
    Type: Grant
    Filed: January 18, 2019
    Date of Patent: March 23, 2021
    Assignee: International Business Machines Corporation
    Inventors: Charles E. Beller, Sean L. Bethard, William G. Dubyak, Alexander C. Tonetti, Sean T. Thatcher, Julie T. Yu
  • Patent number: 10891324
    Abstract: A method of retrieving information includes obtaining access to a plurality of documents comprising document terms, receiving a lexicon including a plurality of lexical terms, receiving a query, identifying a plurality of search terms in the query by decomposing the query, determining at least one match of the search terms to the lexical terms, determining a plurality of matches of the search terms to the document terms, and scoring each of the documents based on the at least one match to the lexical terms and the matches to the document terms.
    Type: Grant
    Filed: September 21, 2018
    Date of Patent: January 12, 2021
    Assignee: International Business Machines Corporation
    Inventors: William G. Dubyak, Edward G. Katz, Brian L. Keith, Nicole O'Connor
  • Patent number: 10878033
    Abstract: An embodiment of the invention may include a method, computer program product and system for generating follow-up questions based on machine learning utilizing a computing device. The embodiment may include receiving an input question from a user. The embodiment may include parsing the received input question to extract input question components. Parsing utilizes natural language processing techniques. The embodiment may include executing trained question component models to predict follow-up question components. The extracted input question components are utilized as inputs to the trained question component models. The embodiment may include combining the predicted follow-up question components to generate one or more follow-up questions. The embodiment may include returning the one or more follow-up questions to the user.
    Type: Grant
    Filed: December 1, 2017
    Date of Patent: December 29, 2020
    Assignee: International Business Machines Corporation
    Inventors: Mohamed N. Ahmed, Charles E. Beller, William G. Dubyak, Palani Sakthi, Kristen M. Summers, Andeep S. Toor
  • Patent number: 10803101
    Abstract: A method for recommending responses to emergent conditions is provided. The present invention may include receiving a query from a user. The present invention may also include retrieving a plurality of recommended responses for the received query from a plurality of entities and a plurality of relations stored in a graph-based knowledge resource. The present invention may further include presenting the retrieved plurality of recommended responses to the user.
    Type: Grant
    Filed: December 19, 2017
    Date of Patent: October 13, 2020
    Assignee: International Business Machines Corporation
    Inventors: William G. Dubyak, Edward G. Katz, Nicole M. O'Connor
  • Patent number: 10803255
    Abstract: Natural language processing is enhanced by linguistically extracting intelligence about a user. A history of user queries is analyzed by a natural language classifier to determine various user intents, and these intents are combined to form a user intent profile. The profile includes elements of sentiment, emotion and tone. The profile can be used in various ways including restricting access to documents in a collection, or refining a cognitive analysis of a query. For access restriction, a determination is made that the user intent is inconsistent with a document, and the user is denied access to the document. This determination involves a user intent score which is compared to a score of the document. For cognitive analysis, searching of reference documents is filtered by excluding documents based on the user intent. The searching includes a comparison of meta-data tags of the documents to the user intent.
    Type: Grant
    Filed: March 5, 2018
    Date of Patent: October 13, 2020
    Assignee: International Business Machines Corporation
    Inventors: William G. Dubyak, Vijai Gandikota, Palani Sakthi
  • Patent number: 10803100
    Abstract: A computer-implemented method, a computer program product, and a computer processing system are provided. The method includes identifying, by a processor using a topic identification system, topic information for a source topic-domain in a corpus. The method further includes extracting, by the processor, an entity from the source topic-domain. The method also includes tagging, by the processor, the entity with the topic information to obtain a tagged entity that includes a tag specifying the topic information. The method additionally includes storing the tagged entity in a memory device. The method further includes performing, by the processor, downstream processing of the tagged entity in a natural language processing pipeline using the tag of the tagged entity as an additional data point.
    Type: Grant
    Filed: November 30, 2017
    Date of Patent: October 13, 2020
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Christopher F. Ackermann, William G. Dubyak, Edward Graham Katz, Nicole O'Connor
  • Patent number: 10776409
    Abstract: A method, computer system, and a computer program product for recommending responses to emergent conditions is provided. The present invention may include receiving a query from a user. The present invention may also include retrieving a plurality of recommended responses for the received query from a plurality of entities and a plurality of relations stored in a graph-based knowledge resource. The present invention may further include presenting the retrieved plurality of recommended responses to the user.
    Type: Grant
    Filed: June 21, 2017
    Date of Patent: September 15, 2020
    Assignee: International Business Machines Corporation
    Inventors: William G. Dubyak, Edward G. Katz, Nicole M. O'Connor
  • Patent number: 10621178
    Abstract: Embodiments are directed to an entity extraction and filtering method that enables a close search of documents to build filters necessary for near real-time monitoring of streaming sources of information. According to an embodiment, the entity extraction and filtering method operates based on the following parameters. First, a detection of an entity of interest warrants flagging an arriving article for analyst attention. Nothing more than a match may be required. The list of entities may be derived by an entity extractor from a corpus of data. Secondly, automatic updates may be utilized, so that exports are automatically updated to the filters. Thirdly, information flowing past the filters may update a static corpus whether or not they are flagged for an analyst or user. This allows for new relationships to be detected and extracted, and the filters subsequently updated.
    Type: Grant
    Filed: June 16, 2017
    Date of Patent: April 14, 2020
    Assignee: International Business Machines Corporation
    Inventors: Charles E. Beller, William G. Dubyak, Joshua G. Hong, Brian L. Keith, Palani Sakthi, Kristen M. Summers
  • Patent number: 10621177
    Abstract: Embodiments are directed to an entity extraction and filtering system that enables a close search of documents to build filters necessary for near real-time monitoring of streaming sources of information. According to an embodiment, the entity extraction and filtering system operates based on the following parameters. First, a detection of an entity of interest warrants flagging an arriving article for analyst attention. Nothing more than a match may be required. The list of entities may be derived by an entity extractor from a corpus of data. Secondly, automatic updates may be utilized, so that exports are automatically updated to the filters. Thirdly, information flowing past the filters may update a static corpus whether or not they are flagged for an analyst or user. This allows for new relationships to be detected and extracted, and the filters subsequently updated.
    Type: Grant
    Filed: March 23, 2017
    Date of Patent: April 14, 2020
    Assignee: International Business Machines Corporation
    Inventors: Charles E. Beller, William G. Dubyak, Joshua G. Hong, Brian L. Keith, Palani Sakthi, Kristen M. Summers
  • Publication number: 20200097599
    Abstract: A method of retrieving information includes obtaining access to a plurality of documents comprising document terms, receiving a lexicon including a plurality of lexical terms, receiving a query, identifying a plurality of search terms in the query by decomposing the query, determining at least one match of the search terms to the lexical terms, determining a plurality of matches of the search terms to the document terms, and scoring each of the documents based on the at least one match to the lexical terms and the matches to the document terms.
    Type: Application
    Filed: September 21, 2018
    Publication date: March 26, 2020
    Inventors: WILLIAM G. DUBYAK, EDWARD G. KATZ, BRIAN L. KEITH, NICOLE O'CONNOR
  • Patent number: 10528660
    Abstract: Popular trends are predicted by leveraging the language of influencers as found in their electronic publications such as social media, blogs, etc. A list of influencers in a given field is curated along with a lexicon of the field which includes product names and associated modifiers. Natural language processing is performed on the current publications to identify a particular word combination based on syntactic relationships. The current usage frequency of the particular word combination is compared to a historical usage frequency derived from a baseline. If the current usage frequency is significantly higher, an alert is generated indicating that the particular word combination represents a candidate trend. The word combination may be a syntactic n-gram. The current usage frequency is based on a first, recent time window, and the historical usage frequency is based on a second time window preceding the first time window.
    Type: Grant
    Filed: December 2, 2017
    Date of Patent: January 7, 2020
    Assignee: International Business Machines Corporation
    Inventors: William G. Dubyak, Joshua G. Hong, Brian L. Keith
  • Publication number: 20190324970
    Abstract: A candidate document is received, for example, by a document filter. A determination is made based on the content of the candidate document, whether the candidate document is relevant to a document corpus. A determination is made based on the content of the candidate document, whether the candidate document is novel with respect to the document corpus. In response to determining that the candidate document is relevant to the document corpus and novel with respect to the document corpus, the candidate document is added to the document corpus to make at least a portion of the content of the candidate document available for a response to a search query.
    Type: Application
    Filed: June 26, 2019
    Publication date: October 24, 2019
    Inventors: Charles Evan Beller, William G. Dubyak, Palani Sakthi, Kristen Maria Summers