Patents by Inventor Clemens Drews

Clemens Drews has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11468234
    Abstract: At least some embodiments are directed to a computer-implemented method that comprises receiving original input text that includes a term, comparing a definition of the term to definitions of multiple candidate replacement terms to generate a set of candidate replacement terms, and substituting each of the candidate replacement terms in the set for the term in the original input text to produce a plurality of modified input texts. The method also comprises determining the grammatical accuracy of each of the plurality of modified input texts, comparing meanings of the modified input texts to a meaning of the original input text, and modifying the set of candidate replacement terms based on the determinations of grammatical accuracy and the comparisons of the meanings. The method still further comprises ranking the modified set of candidate replacement terms using one or more criteria, and displaying the ranking on a display.
    Type: Grant
    Filed: June 26, 2017
    Date of Patent: October 11, 2022
    Assignee: International Business Machines Corporation
    Inventors: Alfredo Alba, Clemens Drews, Daniel F. Gruhl, Christian B. Kau, Neal R. Lewis, Pablo N. Mendes, Meenakshi Nagarajan, Cartic Ramakrishnan
  • Patent number: 11068376
    Abstract: Disclosed aspects relate to analytics engine selection management. A set of reference data may be ingested by a first analytics engine to compile a first set of characteristic data. The set of reference data may be ingested by a second analytics engine to compile a second set of characteristic data. The first set of characteristic data may be compiled for the first analytics engine. The second set of characteristic data may be compiled for the second analytics engine. A set of distinct attributes related to the first and second analytics engines may be determined based on the first and second sets of characteristic data. An analytics engine selection operation may be executed.
    Type: Grant
    Filed: March 11, 2017
    Date of Patent: July 20, 2021
    Assignee: International Business Machines Corporation
    Inventors: Alfredo Alba, Clemens Drews, Daniel F. Gruhl, Linda H. Kato, Christian B. Kau, Neal R. Lewis, Pablo N. Mendes, Meenakshi Nagarajan, Cartic Ramakrishnan
  • Patent number: 11043291
    Abstract: Embodiments of the present invention relate to performing entity recognition on a stream while providing ongoing training or supplementation of an entity dictionary. In one embodiment, a method of and computer program product for stream based named entity recognition is provided. A first portion of a textual input is received. A plurality of patterns is applied to the first portion to determine that a predetermined type is present in the first portion. Approval is requested of the presence of the predetermined type. An indication of approval or disapproval of the predetermined type is received. A dictionary is supplemented according to the indication. A second portion of the textual input is received. The plurality of patterns is applied to the second portion.
    Type: Grant
    Filed: May 30, 2014
    Date of Patent: June 22, 2021
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Varun Bhagwan, Clemens Drews, Daniel F. Gruhl, Neal R. Lewis, April L. Webster, Steven R. Welch
  • Patent number: 10902346
    Abstract: One embodiment provides generating a similarity matrix corresponding to an input collection including initializing, by a processor, a working set as a collection of a multiple items. Until the similarity matrix converges: receiving a seed for similarity for at least one pair of items of the multiple items, and obtaining a similarity value for all other item pairs using a Naive Triangle Inequality process. The similarity is generated with obtained similarity values.
    Type: Grant
    Filed: March 28, 2017
    Date of Patent: January 26, 2021
    Assignee: International Business Machines Corporation
    Inventors: Alfredo Alba, Kenneth L. Clarkson, Clemens Drews, Ronald Fagin, Daniel F. Gruhl, Neal R. Lewis, Pablo N. Mendes, Meenakshi Nagarajan, Cartic Ramakrishnan
  • Patent number: 10769382
    Abstract: A method, system and computer program product for disambiguating meaning of terms. A natural language processing system obtains a first corpora of words used in a first sense and a second corpora of the same words that are used in a second sense. Each of these corpora of words may be associated with different linguistic domains. The natural language processing system generates a first and a second set of patterns using both the first and second corpora of words, respectively. A question passage is then received by the natural language processing system. The natural language processing system examines a word of interest in the question passage. The user is then notified that the word of interest is being used in the first sense or the second sense in response to identifying the word of interest in only the first set of patterns or the second set of patterns, respectively.
    Type: Grant
    Filed: January 28, 2019
    Date of Patent: September 8, 2020
    Assignee: International Business Machines Corporation
    Inventors: Alfredo Alba, Clemens Drews, Daniel F. Gruhl, Linda H. Kato, Christian B. Kau, Neal R. Lewis, Pablo N. Mendes, Meenakshi Nagarajan, Cartic Ramakrishnan
  • Patent number: 10740379
    Abstract: A mechanism is provided in a data processing system comprising at least one processor and at least one memory, the at least one memory comprising instructions executed by the at least one processor to cause the at least one processor to implement an automated lexicon expansion for an identified corpus. For a selected corpus in a set of corpora, the mechanism determines an estimated number of new terms in the selected corpus that are not in the lexicon based on a frequency count known terms in the selected corpus. Responsive to the estimated number of new terms in the selected corpus being greater than a threshold, the mechanism performs lexicon expansion using the selected corpus to form an expanded lexicon. Responsive to the estimated number of new terms in the selected corpus not being greater than the threshold, the mechanism halts lexicon expansion.
    Type: Grant
    Filed: December 8, 2017
    Date of Patent: August 11, 2020
    Assignee: International Business Machines Corporation
    Inventors: Alfredo Alba, Clemens Drews, Daniel F. Gruhl, Linda H. Kato, Neal R. Lewis, Pablo N. Mendes, Meenakshi Nagarajan
  • Patent number: 10733224
    Abstract: A mechanism is provided in a data processing system comprising at least one processor and at least one memory, the at least one memory comprising instructions executed by the at least one processor to cause the at least one processor to implement an automated lexicon expansion for an identified corpus. For a selected corpus in a set of corpora, the mechanism determines an estimated number of new terms in the selected corpus that are not in the lexicon based on a frequency count known terms in the selected corpus. Responsive to the estimated number of new terms in the selected corpus being greater than a threshold, the mechanism performs lexicon expansion using the selected corpus to form an expanded lexicon. Responsive to the estimated number of new terms in the selected corpus not being greater than the threshold, the mechanism halts lexicon expansion.
    Type: Grant
    Filed: February 7, 2017
    Date of Patent: August 4, 2020
    Assignee: International Business Machines Corporation
    Inventors: Alfredo Alba, Clemens Drews, Daniel F. Gruhl, Linda H. Kato, Neal R. Lewis, Pablo N. Mendes, Meenakshi Nagarajan
  • Publication number: 20190377740
    Abstract: Influencers (individuals or groups) over a selected audience (observers or recipients of information, objects and/or events) on a given topic are measured based on influence features, which include a sentiment flipping influence feature indicative of ability of an audience member to influence other audience members to change their sentiment on the selected topic. Other influence features include the ability to influence others: to change followership; to express interest in a topic associated with a hashtag pioneered by the influencer, based on the effectiveness and phrasing of language used. The output of the influence engine can be a score representing the relative influence of audience members over the audience on the topic of interest. Influencers may be ranked according to their total influence score over the audience on the topic.
    Type: Application
    Filed: August 22, 2019
    Publication date: December 12, 2019
    Inventors: Alfredo Alba, Clemens Drews, Daniel Gruhl, Neal R. Lewis, Pablo N. Mendes, Meenakshi Nagarajan, CARTIC RAMAKRISHNAN
  • Patent number: 10430422
    Abstract: Influencers (individuals or groups) over a selected audience (observers or recipients of information, objects and/or events) on a given topic are measured based on influence features, which include a sentiment flipping influence feature indicative of ability of an audience member to influence other audience members to change their sentiment on the selected topic. Other influence features include the ability to influence others: to change followership; to express interest in a topic associated with a hashtag pioneered by the influencer, based on the effectiveness and phrasing of language used. The output of the influence engine can be a score representing the relative influence of audience members over the audience on the topic of interest. Influencers may be ranked according to their total influence score over the audience on the topic.
    Type: Grant
    Filed: September 29, 2015
    Date of Patent: October 1, 2019
    Assignee: International Business Machines Corporation
    Inventors: Alfredo Alba, Clemens Drews, Daniel Gruhl, Neal R. Lewis, Pablo N. Mendes, Meenakshi Nagarajan, Cartic Ramakrishnan
  • Publication number: 20190155908
    Abstract: A method, system and computer program product for disambiguating meaning of terms. A natural language processing system obtains a first corpora of words used in a first sense and a second corpora of the same words that are used in a second sense. Each of these corpora of words may be associated with different linguistic domains. The natural language processing system generates a first and a second set of patterns using both the first and second corpora of words, respectively. A question passage is then received by the natural language processing system. The natural language processing system examines a word of interest in the question passage. The user is then notified that the word of interest is being used in the first sense or the second sense in response to identifying the word of interest in only the first set of patterns or the second set of patterns, respectively.
    Type: Application
    Filed: January 28, 2019
    Publication date: May 23, 2019
    Inventors: Alfredo Alba, Clemens Drews, Daniel F. Gruhl, Linda H. Kato, Christian B. Kau, Neal R. Lewis, Pablo N. Mendes, Meenakshi Nagarajan, Cartic Ramakrishnan
  • Patent number: 10255271
    Abstract: A method, system and computer program product for disambiguating meaning of terms. A natural language processing system obtains a first corpora of words used in a first sense and a second corpora of the same words that are used in a second sense. Each of these corpora of words may be associated with different linguistic domains. The natural language processing system generates a first and a second set of patterns using both the first and second corpora of words, respectively. A question passage is then received by the natural language processing system. The natural language processing system examines a word of interest in the question passage. The user is then notified that the word of interest is being used in the first sense or the second sense in response to identifying the word of interest in only the first set of patterns or the second set of patterns, respectively.
    Type: Grant
    Filed: February 6, 2017
    Date of Patent: April 9, 2019
    Assignee: International Business Machines Corporation
    Inventors: Alfredo Alba, Clemens Drews, Daniel F. Gruhl, Linda H. Kato, Christian B. Kau, Neal R. Lewis, Pablo N. Mendes, Meenakshi Nagarajan, Cartic Ramakrishnan
  • Patent number: 10210231
    Abstract: Embodiments of the present invention disclose a method, computer program product, and system for optimizing data analysis. A set of analytics are received and a plurality of subsets of the set of analytics is generated. An optimization goal(s) is defined. The plurality of subsets of analytics are applied to a set of data. The output of applied analytics are recorded on a data storage device by the computer. Performance values of the computer applying the plurality of subsets of analytics are recorded and displayed. An optimal subset of the plurality of subsets of analytics is calculated, using recorded performance values and outputs of the applied plurality of subsets of analytics, being based on the optimization goal. A set of applied analytics are displayed, where the set of applied analytics includes a number of subsets of analytics of the plurality of analytics that meet the criteria of the optimal subset.
    Type: Grant
    Filed: August 6, 2015
    Date of Patent: February 19, 2019
    Assignee: International Business Machines Corporation
    Inventors: Varun Bhagwan, Clemens Drews, Daniel F. Gruhl, Neal R. Lewis, April L. Webster, Steven R. Welch
  • Patent number: 10176889
    Abstract: A computer program product comprising a computer-readable storage medium having program instructions embodied therewith, the program instructions executable by a processor to cause the processor to receive a document having multiple section headers, segment the document into at least first and second sections based on the section headers, segment items in the first section into fragments and identify a section type for each of the fragments, determine that the identified section type for at least one of the fragments better matches a type of the second section than it matches a type of the first section, and re-locate the at least one of the fragments to the second section.
    Type: Grant
    Filed: February 9, 2017
    Date of Patent: January 8, 2019
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Alfredo Alba, Anni R. Coden, Clemens Drews, Daniel F. Gruhl, Neal R. Lewis, Pablo N. Mendes, Cartic Ramakrishnan, Joseph F. Terdiman
  • Patent number: 10176164
    Abstract: A method comprising receiving an item via a computer input device, comparing the item to a plurality of lexicons using a processor, and determining, using the processor and based upon the comparison, that the item is part of a fragment matching at least one of the plurality of lexicons. The method also comprises storing, using the processor, the fragment in a section of a document determined based on the at least one of the plurality of lexicons, and receiving at least one other item via the computer input device after the fragment is stored in the section.
    Type: Grant
    Filed: June 19, 2017
    Date of Patent: January 8, 2019
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Alfredo Alba, Anni R. Coden, Clemens Drews, Daniel F. Gruhl, Neal R. Lewis, Pablo N. Mendes, Cartic Ramakrishnan, Joseph F. Terdiman
  • Patent number: 10176890
    Abstract: A method comprising receiving a document having multiple sections of different types using a processor. The method also comprises obtaining a plurality of lexicons using the processor, each of the lexicons for interpreting fragments in one or more of the section types. The method further comprises interpreting fragments in a first section of the multiple sections using the processor and one or more lexicons. The method still further comprises determining, based upon the interpretation and using the processor, that a fragment in the first section is misplaced. The method still further comprises re-locating, using the processor, the misplaced fragment to a second section of the multiple sections in the document to generate a re-organized document. The method additionally includes storing the re-organized document in a hardware storage system using the processor.
    Type: Grant
    Filed: June 19, 2017
    Date of Patent: January 8, 2019
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Alfredo Alba, Anni R. Coden, Clemens Drews, Daniel F. Gruhl, Neal R. Lewis, Pablo N. Mendes, Cartic Ramakrishnan, Joseph F. Terdiman
  • Patent number: 10169325
    Abstract: A system, comprising an input device configured to receive a first item and a second item, and a processor communicably coupled to the input device and configured to determine that the first item is a fragment matching a lexicon, and place the fragment in a section of a document, the section selected based on the matching lexicon, wherein the processor is configured to perform the determination and the placement after it receives the first item but before it receives the second item.
    Type: Grant
    Filed: February 9, 2017
    Date of Patent: January 1, 2019
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Alfredo Alba, Anni R. Coden, Clemens Drews, Daniel F. Gruhl, Neal R. Lewis, Pablo N. Mendes, Cartic Ramakrishnan, Joseph F. Terdiman
  • Publication number: 20180373691
    Abstract: At least some embodiments are directed to a computer-implemented method that comprises receiving original input text that includes a term, comparing a definition of the term to definitions of multiple candidate replacement terms to generate a set of candidate replacement terms, and substituting each of the candidate replacement terms in the set for the term in the original input text to produce a plurality of modified input texts. The method also comprises determining the grammatical accuracy of each of the plurality of modified input texts, comparing meanings of the modified input texts to a meaning of the original input text, and modifying the set of candidate replacement terms based on the determinations of grammatical accuracy and the comparisons of the meanings. The method still further comprises ranking the modified set of candidate replacement terms using one or more criteria, and displaying the ranking on a display.
    Type: Application
    Filed: June 26, 2017
    Publication date: December 27, 2018
    Inventors: Alfredo ALBA, Clemens DREWS, Daniel F. GRUHL, Christian B. KAU, Neal R. LEWIS, Pablo N. MENDES, Meenakshi NAGARAJAN, Cartic RAMAKRISHNAN
  • Publication number: 20180285762
    Abstract: One embodiment provides generating a similarity matrix corresponding to an input collection including initializing, by a processor, a working set as a collection of a multiple items. Until the similarity matrix converges: receiving a seed for similarity for at least one pair of items of the multiple items, and obtaining a similarity value for all other item pairs using a Naive Triangle Inequality process. The similarity is generated with obtained similarity values.
    Type: Application
    Filed: March 28, 2017
    Publication date: October 4, 2018
    Inventors: Alfredo Alba, Kenneth L. Clarkson, Clemens Drews, Ronald Fagin, Daniel F. Gruhl, Neal R. Lewis, Pablo N. Mendes, Meenakshi Nagarajan, Cartic Ramakrishnan
  • Publication number: 20180260459
    Abstract: Disclosed aspects relate to analytics engine selection management. A set of reference data may be ingested by a first analytics engine to compile a first set of characteristic data. The set of reference data may be ingested by a second analytics engine to compile a second set of characteristic data. The first set of characteristic data may be compiled for the first analytics engine. The second set of characteristic data may be compiled for the second analytics engine. A set of distinct attributes related to the first and second analytics engines may be determined based on the first and second sets of characteristic data. An analytics engine selection operation may be executed.
    Type: Application
    Filed: March 11, 2017
    Publication date: September 13, 2018
    Inventors: Alfredo Alba, Clemens Drews, Daniel F. Gruhl, Linda H. Kato, Christian B. Kau, Neal R. Lewis, Pablo N. Mendes, Meenakshi Nagarajan, Cartic Ramakrishnan
  • Publication number: 20180260460
    Abstract: Disclosed aspects relate to analytics engine selection management. A set of reference data may be ingested by a first analytics engine to compile a first set of characteristic data. The set of reference data may be ingested by a second analytics engine to compile a second set of characteristic data. The first set of characteristic data may be compiled for the first analytics engine. The second set of characteristic data may be compiled for the second analytics engine. A set of distinct attributes related to the first and second analytics engines may be determined based on the first and second sets of characteristic data. An analytics engine selection operation may be executed.
    Type: Application
    Filed: December 29, 2017
    Publication date: September 13, 2018
    Inventors: Alfredo Alba, Clemens Drews, Daniel F. Gruhl, Linda H. Kato, Christian B. Kau, Neal R. Lewis, Pablo N. Mendes, Meenakshi Nagarajan, Cartic Ramakrishnan