Patents by Inventor Clemens Drews
Clemens Drews has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 11468234Abstract: At least some embodiments are directed to a computer-implemented method that comprises receiving original input text that includes a term, comparing a definition of the term to definitions of multiple candidate replacement terms to generate a set of candidate replacement terms, and substituting each of the candidate replacement terms in the set for the term in the original input text to produce a plurality of modified input texts. The method also comprises determining the grammatical accuracy of each of the plurality of modified input texts, comparing meanings of the modified input texts to a meaning of the original input text, and modifying the set of candidate replacement terms based on the determinations of grammatical accuracy and the comparisons of the meanings. The method still further comprises ranking the modified set of candidate replacement terms using one or more criteria, and displaying the ranking on a display.Type: GrantFiled: June 26, 2017Date of Patent: October 11, 2022Assignee: International Business Machines CorporationInventors: Alfredo Alba, Clemens Drews, Daniel F. Gruhl, Christian B. Kau, Neal R. Lewis, Pablo N. Mendes, Meenakshi Nagarajan, Cartic Ramakrishnan
-
Patent number: 11068376Abstract: Disclosed aspects relate to analytics engine selection management. A set of reference data may be ingested by a first analytics engine to compile a first set of characteristic data. The set of reference data may be ingested by a second analytics engine to compile a second set of characteristic data. The first set of characteristic data may be compiled for the first analytics engine. The second set of characteristic data may be compiled for the second analytics engine. A set of distinct attributes related to the first and second analytics engines may be determined based on the first and second sets of characteristic data. An analytics engine selection operation may be executed.Type: GrantFiled: March 11, 2017Date of Patent: July 20, 2021Assignee: International Business Machines CorporationInventors: Alfredo Alba, Clemens Drews, Daniel F. Gruhl, Linda H. Kato, Christian B. Kau, Neal R. Lewis, Pablo N. Mendes, Meenakshi Nagarajan, Cartic Ramakrishnan
-
Patent number: 11043291Abstract: Embodiments of the present invention relate to performing entity recognition on a stream while providing ongoing training or supplementation of an entity dictionary. In one embodiment, a method of and computer program product for stream based named entity recognition is provided. A first portion of a textual input is received. A plurality of patterns is applied to the first portion to determine that a predetermined type is present in the first portion. Approval is requested of the presence of the predetermined type. An indication of approval or disapproval of the predetermined type is received. A dictionary is supplemented according to the indication. A second portion of the textual input is received. The plurality of patterns is applied to the second portion.Type: GrantFiled: May 30, 2014Date of Patent: June 22, 2021Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Varun Bhagwan, Clemens Drews, Daniel F. Gruhl, Neal R. Lewis, April L. Webster, Steven R. Welch
-
Patent number: 10902346Abstract: One embodiment provides generating a similarity matrix corresponding to an input collection including initializing, by a processor, a working set as a collection of a multiple items. Until the similarity matrix converges: receiving a seed for similarity for at least one pair of items of the multiple items, and obtaining a similarity value for all other item pairs using a Naive Triangle Inequality process. The similarity is generated with obtained similarity values.Type: GrantFiled: March 28, 2017Date of Patent: January 26, 2021Assignee: International Business Machines CorporationInventors: Alfredo Alba, Kenneth L. Clarkson, Clemens Drews, Ronald Fagin, Daniel F. Gruhl, Neal R. Lewis, Pablo N. Mendes, Meenakshi Nagarajan, Cartic Ramakrishnan
-
Patent number: 10769382Abstract: A method, system and computer program product for disambiguating meaning of terms. A natural language processing system obtains a first corpora of words used in a first sense and a second corpora of the same words that are used in a second sense. Each of these corpora of words may be associated with different linguistic domains. The natural language processing system generates a first and a second set of patterns using both the first and second corpora of words, respectively. A question passage is then received by the natural language processing system. The natural language processing system examines a word of interest in the question passage. The user is then notified that the word of interest is being used in the first sense or the second sense in response to identifying the word of interest in only the first set of patterns or the second set of patterns, respectively.Type: GrantFiled: January 28, 2019Date of Patent: September 8, 2020Assignee: International Business Machines CorporationInventors: Alfredo Alba, Clemens Drews, Daniel F. Gruhl, Linda H. Kato, Christian B. Kau, Neal R. Lewis, Pablo N. Mendes, Meenakshi Nagarajan, Cartic Ramakrishnan
-
Patent number: 10740379Abstract: A mechanism is provided in a data processing system comprising at least one processor and at least one memory, the at least one memory comprising instructions executed by the at least one processor to cause the at least one processor to implement an automated lexicon expansion for an identified corpus. For a selected corpus in a set of corpora, the mechanism determines an estimated number of new terms in the selected corpus that are not in the lexicon based on a frequency count known terms in the selected corpus. Responsive to the estimated number of new terms in the selected corpus being greater than a threshold, the mechanism performs lexicon expansion using the selected corpus to form an expanded lexicon. Responsive to the estimated number of new terms in the selected corpus not being greater than the threshold, the mechanism halts lexicon expansion.Type: GrantFiled: December 8, 2017Date of Patent: August 11, 2020Assignee: International Business Machines CorporationInventors: Alfredo Alba, Clemens Drews, Daniel F. Gruhl, Linda H. Kato, Neal R. Lewis, Pablo N. Mendes, Meenakshi Nagarajan
-
Patent number: 10733224Abstract: A mechanism is provided in a data processing system comprising at least one processor and at least one memory, the at least one memory comprising instructions executed by the at least one processor to cause the at least one processor to implement an automated lexicon expansion for an identified corpus. For a selected corpus in a set of corpora, the mechanism determines an estimated number of new terms in the selected corpus that are not in the lexicon based on a frequency count known terms in the selected corpus. Responsive to the estimated number of new terms in the selected corpus being greater than a threshold, the mechanism performs lexicon expansion using the selected corpus to form an expanded lexicon. Responsive to the estimated number of new terms in the selected corpus not being greater than the threshold, the mechanism halts lexicon expansion.Type: GrantFiled: February 7, 2017Date of Patent: August 4, 2020Assignee: International Business Machines CorporationInventors: Alfredo Alba, Clemens Drews, Daniel F. Gruhl, Linda H. Kato, Neal R. Lewis, Pablo N. Mendes, Meenakshi Nagarajan
-
Publication number: 20190377740Abstract: Influencers (individuals or groups) over a selected audience (observers or recipients of information, objects and/or events) on a given topic are measured based on influence features, which include a sentiment flipping influence feature indicative of ability of an audience member to influence other audience members to change their sentiment on the selected topic. Other influence features include the ability to influence others: to change followership; to express interest in a topic associated with a hashtag pioneered by the influencer, based on the effectiveness and phrasing of language used. The output of the influence engine can be a score representing the relative influence of audience members over the audience on the topic of interest. Influencers may be ranked according to their total influence score over the audience on the topic.Type: ApplicationFiled: August 22, 2019Publication date: December 12, 2019Inventors: Alfredo Alba, Clemens Drews, Daniel Gruhl, Neal R. Lewis, Pablo N. Mendes, Meenakshi Nagarajan, CARTIC RAMAKRISHNAN
-
Patent number: 10430422Abstract: Influencers (individuals or groups) over a selected audience (observers or recipients of information, objects and/or events) on a given topic are measured based on influence features, which include a sentiment flipping influence feature indicative of ability of an audience member to influence other audience members to change their sentiment on the selected topic. Other influence features include the ability to influence others: to change followership; to express interest in a topic associated with a hashtag pioneered by the influencer, based on the effectiveness and phrasing of language used. The output of the influence engine can be a score representing the relative influence of audience members over the audience on the topic of interest. Influencers may be ranked according to their total influence score over the audience on the topic.Type: GrantFiled: September 29, 2015Date of Patent: October 1, 2019Assignee: International Business Machines CorporationInventors: Alfredo Alba, Clemens Drews, Daniel Gruhl, Neal R. Lewis, Pablo N. Mendes, Meenakshi Nagarajan, Cartic Ramakrishnan
-
Publication number: 20190155908Abstract: A method, system and computer program product for disambiguating meaning of terms. A natural language processing system obtains a first corpora of words used in a first sense and a second corpora of the same words that are used in a second sense. Each of these corpora of words may be associated with different linguistic domains. The natural language processing system generates a first and a second set of patterns using both the first and second corpora of words, respectively. A question passage is then received by the natural language processing system. The natural language processing system examines a word of interest in the question passage. The user is then notified that the word of interest is being used in the first sense or the second sense in response to identifying the word of interest in only the first set of patterns or the second set of patterns, respectively.Type: ApplicationFiled: January 28, 2019Publication date: May 23, 2019Inventors: Alfredo Alba, Clemens Drews, Daniel F. Gruhl, Linda H. Kato, Christian B. Kau, Neal R. Lewis, Pablo N. Mendes, Meenakshi Nagarajan, Cartic Ramakrishnan
-
Patent number: 10255271Abstract: A method, system and computer program product for disambiguating meaning of terms. A natural language processing system obtains a first corpora of words used in a first sense and a second corpora of the same words that are used in a second sense. Each of these corpora of words may be associated with different linguistic domains. The natural language processing system generates a first and a second set of patterns using both the first and second corpora of words, respectively. A question passage is then received by the natural language processing system. The natural language processing system examines a word of interest in the question passage. The user is then notified that the word of interest is being used in the first sense or the second sense in response to identifying the word of interest in only the first set of patterns or the second set of patterns, respectively.Type: GrantFiled: February 6, 2017Date of Patent: April 9, 2019Assignee: International Business Machines CorporationInventors: Alfredo Alba, Clemens Drews, Daniel F. Gruhl, Linda H. Kato, Christian B. Kau, Neal R. Lewis, Pablo N. Mendes, Meenakshi Nagarajan, Cartic Ramakrishnan
-
Patent number: 10210231Abstract: Embodiments of the present invention disclose a method, computer program product, and system for optimizing data analysis. A set of analytics are received and a plurality of subsets of the set of analytics is generated. An optimization goal(s) is defined. The plurality of subsets of analytics are applied to a set of data. The output of applied analytics are recorded on a data storage device by the computer. Performance values of the computer applying the plurality of subsets of analytics are recorded and displayed. An optimal subset of the plurality of subsets of analytics is calculated, using recorded performance values and outputs of the applied plurality of subsets of analytics, being based on the optimization goal. A set of applied analytics are displayed, where the set of applied analytics includes a number of subsets of analytics of the plurality of analytics that meet the criteria of the optimal subset.Type: GrantFiled: August 6, 2015Date of Patent: February 19, 2019Assignee: International Business Machines CorporationInventors: Varun Bhagwan, Clemens Drews, Daniel F. Gruhl, Neal R. Lewis, April L. Webster, Steven R. Welch
-
Patent number: 10176164Abstract: A method comprising receiving an item via a computer input device, comparing the item to a plurality of lexicons using a processor, and determining, using the processor and based upon the comparison, that the item is part of a fragment matching at least one of the plurality of lexicons. The method also comprises storing, using the processor, the fragment in a section of a document determined based on the at least one of the plurality of lexicons, and receiving at least one other item via the computer input device after the fragment is stored in the section.Type: GrantFiled: June 19, 2017Date of Patent: January 8, 2019Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Alfredo Alba, Anni R. Coden, Clemens Drews, Daniel F. Gruhl, Neal R. Lewis, Pablo N. Mendes, Cartic Ramakrishnan, Joseph F. Terdiman
-
Patent number: 10176889Abstract: A computer program product comprising a computer-readable storage medium having program instructions embodied therewith, the program instructions executable by a processor to cause the processor to receive a document having multiple section headers, segment the document into at least first and second sections based on the section headers, segment items in the first section into fragments and identify a section type for each of the fragments, determine that the identified section type for at least one of the fragments better matches a type of the second section than it matches a type of the first section, and re-locate the at least one of the fragments to the second section.Type: GrantFiled: February 9, 2017Date of Patent: January 8, 2019Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Alfredo Alba, Anni R. Coden, Clemens Drews, Daniel F. Gruhl, Neal R. Lewis, Pablo N. Mendes, Cartic Ramakrishnan, Joseph F. Terdiman
-
Patent number: 10176890Abstract: A method comprising receiving a document having multiple sections of different types using a processor. The method also comprises obtaining a plurality of lexicons using the processor, each of the lexicons for interpreting fragments in one or more of the section types. The method further comprises interpreting fragments in a first section of the multiple sections using the processor and one or more lexicons. The method still further comprises determining, based upon the interpretation and using the processor, that a fragment in the first section is misplaced. The method still further comprises re-locating, using the processor, the misplaced fragment to a second section of the multiple sections in the document to generate a re-organized document. The method additionally includes storing the re-organized document in a hardware storage system using the processor.Type: GrantFiled: June 19, 2017Date of Patent: January 8, 2019Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Alfredo Alba, Anni R. Coden, Clemens Drews, Daniel F. Gruhl, Neal R. Lewis, Pablo N. Mendes, Cartic Ramakrishnan, Joseph F. Terdiman
-
Patent number: 10169325Abstract: A system, comprising an input device configured to receive a first item and a second item, and a processor communicably coupled to the input device and configured to determine that the first item is a fragment matching a lexicon, and place the fragment in a section of a document, the section selected based on the matching lexicon, wherein the processor is configured to perform the determination and the placement after it receives the first item but before it receives the second item.Type: GrantFiled: February 9, 2017Date of Patent: January 1, 2019Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Alfredo Alba, Anni R. Coden, Clemens Drews, Daniel F. Gruhl, Neal R. Lewis, Pablo N. Mendes, Cartic Ramakrishnan, Joseph F. Terdiman
-
Publication number: 20180373691Abstract: At least some embodiments are directed to a computer-implemented method that comprises receiving original input text that includes a term, comparing a definition of the term to definitions of multiple candidate replacement terms to generate a set of candidate replacement terms, and substituting each of the candidate replacement terms in the set for the term in the original input text to produce a plurality of modified input texts. The method also comprises determining the grammatical accuracy of each of the plurality of modified input texts, comparing meanings of the modified input texts to a meaning of the original input text, and modifying the set of candidate replacement terms based on the determinations of grammatical accuracy and the comparisons of the meanings. The method still further comprises ranking the modified set of candidate replacement terms using one or more criteria, and displaying the ranking on a display.Type: ApplicationFiled: June 26, 2017Publication date: December 27, 2018Inventors: Alfredo ALBA, Clemens DREWS, Daniel F. GRUHL, Christian B. KAU, Neal R. LEWIS, Pablo N. MENDES, Meenakshi NAGARAJAN, Cartic RAMAKRISHNAN
-
Publication number: 20180285762Abstract: One embodiment provides generating a similarity matrix corresponding to an input collection including initializing, by a processor, a working set as a collection of a multiple items. Until the similarity matrix converges: receiving a seed for similarity for at least one pair of items of the multiple items, and obtaining a similarity value for all other item pairs using a Naive Triangle Inequality process. The similarity is generated with obtained similarity values.Type: ApplicationFiled: March 28, 2017Publication date: October 4, 2018Inventors: Alfredo Alba, Kenneth L. Clarkson, Clemens Drews, Ronald Fagin, Daniel F. Gruhl, Neal R. Lewis, Pablo N. Mendes, Meenakshi Nagarajan, Cartic Ramakrishnan
-
Publication number: 20180260459Abstract: Disclosed aspects relate to analytics engine selection management. A set of reference data may be ingested by a first analytics engine to compile a first set of characteristic data. The set of reference data may be ingested by a second analytics engine to compile a second set of characteristic data. The first set of characteristic data may be compiled for the first analytics engine. The second set of characteristic data may be compiled for the second analytics engine. A set of distinct attributes related to the first and second analytics engines may be determined based on the first and second sets of characteristic data. An analytics engine selection operation may be executed.Type: ApplicationFiled: March 11, 2017Publication date: September 13, 2018Inventors: Alfredo Alba, Clemens Drews, Daniel F. Gruhl, Linda H. Kato, Christian B. Kau, Neal R. Lewis, Pablo N. Mendes, Meenakshi Nagarajan, Cartic Ramakrishnan
-
Publication number: 20180260460Abstract: Disclosed aspects relate to analytics engine selection management. A set of reference data may be ingested by a first analytics engine to compile a first set of characteristic data. The set of reference data may be ingested by a second analytics engine to compile a second set of characteristic data. The first set of characteristic data may be compiled for the first analytics engine. The second set of characteristic data may be compiled for the second analytics engine. A set of distinct attributes related to the first and second analytics engines may be determined based on the first and second sets of characteristic data. An analytics engine selection operation may be executed.Type: ApplicationFiled: December 29, 2017Publication date: September 13, 2018Inventors: Alfredo Alba, Clemens Drews, Daniel F. Gruhl, Linda H. Kato, Christian B. Kau, Neal R. Lewis, Pablo N. Mendes, Meenakshi Nagarajan, Cartic Ramakrishnan