Patents by Inventor Daniel F. Gruhl
Daniel F. Gruhl has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Publication number: 20170220584Abstract: Embodiments of the invention relate to identification of material that contains linguistically related content. Key phrases are filtered through a content store to ascertain the linguistically related content and to move the identified content to a target corpus. At least two iterations of the filtering process are employed. Each subsequent iteration of the filtering process identifies at least one new key phrase within the filtered material. In addition, each subsequent iteration takes place with a union of each previously employed key phrase and each new key phrase. As new content is identified, the content is populated to the target corpus.Type: ApplicationFiled: February 22, 2016Publication date: August 3, 2017Applicant: International Business Machines CorporationInventors: Daniel F. Gruhl, Joseph M. Kaufmann, Joseph N. Kozhaya, Pablo N. Mendes, Sridhar Sudarsan
-
Publication number: 20170220936Abstract: Embodiments of the invention relate to identification of material that contains linguistically related content. Key phrases are filtered through a content store to ascertain the linguistically related content and to move the identified content to a target corpus. At least two iterations of the filtering process are employed. Each subsequent iteration of the filtering process identifies at least one new key phrase within the filtered material. In addition, each subsequent iteration takes place with a union of each previously employed key phrase and each new key phrase. As new content is identified, the content is populated to the target corpus.Type: ApplicationFiled: January 29, 2016Publication date: August 3, 2017Applicant: International Business Machines CorporationInventors: Daniel F. Gruhl, Joseph M. Kaufmann, Joseph N. Kozhaya, Pablo N. Mendes, Sridhar Sudarsan
-
Patent number: 9672593Abstract: Embodiments of the invention provide a system and circuit for image distortion correction. The system includes neurosynaptic core circuits that: receive a set of inputs comprising image dimensions and pixel distortion coefficients for one or more image frames via one or more input core circuits, map each distorted pixel to zero or more undistorted pixels by processing the set of inputs corresponding to each pixel of the one or more image frames by the one or more input core circuits, and route corresponding pixel intensity values of each distorted pixel to output undistorted pixels for each image frame via one or more output core circuits.Type: GrantFiled: June 25, 2014Date of Patent: June 6, 2017Assignee: International Business Machines CorporationInventors: Alexander Andreopoulos, Daniel F. Gruhl, Michael Karasick, Dharmendra S. Modha
-
Publication number: 20170147693Abstract: Methods and systems for optimizing: the retrieval of data from multiple sources are described. A slot map including slots for the storage of data elements can be obtained. The data elements associated with the slots can be prioritized by weighting values with costs of retrieving the data elements from respective data sources. Each value can be associated with a different data element and can indicate a respective degree of importance of the associated data element. Further, the systems and methods can direct the retrieval of data elements from the respective data sources in an order in accordance with the priority of the data elements to optimize the quality of data obtainable within a critical time constraint. In addition, the retrieved data elements can be stored in corresponding slots on a storage medium.Type: ApplicationFiled: February 2, 2017Publication date: May 25, 2017Inventors: VARUN BHAGWAN, TYRONE W. A. GRANDISON, DANIEL F. GRUHL
-
Patent number: 9652627Abstract: Probabilistic surfacing of potentially sensitive identifiers is provided. In one embodiment of the present invention, a method of and computer program product for surfacing of potentially sensitive identifiers are provided. An input string is read. The input string has a length. The input string is divided into a plurality of tokens. Each of the tokens has a predetermined length. A score is determined for each of the plurality of tokens. A composite score is determined based on the scores of each of the plurality of tokens. Whether the input string comprises an identifier is determined by comparing the composite score to a predetermined threshold.Type: GrantFiled: October 22, 2014Date of Patent: May 16, 2017Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Varun Bhagwan, Laura Chiticariu, Daniel F. Gruhl
-
Patent number: 9589065Abstract: Methods and systems for optimizing the retrieval of data from multiple sources are described. A slot map including slots for the storage of data elements can be obtained. The data elements associated with the slots can be prioritized by weighting values with costs of retrieving the data elements from respective data sources. Each value can be associated with a different data element and can indicate a respective degree of importance of the associated data element. Further, the systems and methods can direct the retrieval of data elements from the respective data sources in an order in accordance with the priority of the data elements to optimize the quality of data obtainable within a critical time constraint. In addition, the retrieved data elements can be stored in corresponding slots on a storage medium.Type: GrantFiled: September 5, 2012Date of Patent: March 7, 2017Assignee: International Business Machines CorporationInventors: Varun Bhagwan, Tyrone W. A. Grandison, Daniel F. Gruhl
-
Publication number: 20170046241Abstract: Embodiments of the present invention disclose a computer program product for optimizing data analysis. A set of analytics are received and a plurality of subsets of the set of analytics is generated. An optimization goal(s) is defined. The plurality of subsets of analytics are applied to a set of data. The output of applied analytics are recorded on a data storage device by the computer. Performance values of the computer applying the plurality of subsets of analytics are recorded and displayed. An optimal subset of the plurality of subsets of analytics is calculated, using recorded performance values and outputs of the applied plurality of subsets of analytics, being based on the optimization goal. A set of applied analytics are displayed, where the set of applied analytics includes a number of subsets of analytics of the plurality of analytics that meet the criteria of the optimal subset.Type: ApplicationFiled: November 2, 2016Publication date: February 16, 2017Inventors: Varun Bhagwan, Clemens Drews, Daniel F. Gruhl, Neal R. Lewis, April L. Webster, Steven R. Welch
-
Publication number: 20170046413Abstract: Embodiments of the present invention disclose a computer system for optimizing data analysis. A set of analytics are received and a plurality of subsets of the set of analytics is generated. An optimization goal(s) is defined. The plurality of subsets of analytics are applied to a set of data. The output of applied analytics are recorded on a data storage device by the computer. Performance values of the computer applying the plurality of subsets of analytics are recorded and displayed. An optimal subset of the plurality of subsets of analytics is calculated, using recorded performance values and outputs of the applied plurality of subsets of analytics, being based on the optimization goal. A set of applied analytics are displayed, where the set of applied analytics includes a number of subsets of analytics of the plurality of analytics that meet the criteria of the optimal subset.Type: ApplicationFiled: November 2, 2016Publication date: February 16, 2017Inventors: Varun Bhagwan, Clemens Drews, Daniel F. Gruhl, Neal R. Lewis, April L. Webster, Steven R. Welch
-
Publication number: 20170039241Abstract: Embodiments of the present invention disclose a method for optimizing data analysis. A set of analytics are received and a plurality of subsets of the set of analytics is generated. An optimization goal(s) is defined. The plurality of subsets of analytics are applied to a set of data. The output of applied analytics are recorded on a data storage device by the computer. Performance values of the computer applying the plurality of subsets of analytics are recorded and displayed. An optimal subset of the plurality of subsets of analytics is calculated, using recorded performance values and outputs of the applied plurality of subsets of analytics, being based on the optimization goal. A set of applied analytics are displayed, where the set of applied analytics includes a number of subsets of analytics of the plurality of analytics that meet the criteria of the optimal subset.Type: ApplicationFiled: July 22, 2016Publication date: February 9, 2017Inventors: Varun Bhagwan, Clemens Drews, Daniel F. Gruhl, Neal R. Lewis, April L. Webster, Steven R. Welch
-
Publication number: 20170039249Abstract: Embodiments of the present invention disclose a method, computer program product, and system for optimizing data analysis. A set of analytics are received and a plurality of subsets of the set of analytics is generated. An optimization goal(s) is defined. The plurality of subsets of analytics are applied to a set of data. The output of applied analytics are recorded on a data storage device by the computer. Performance values of the computer applying the plurality of subsets of analytics are recorded and displayed. An optimal subset of the plurality of subsets of analytics is calculated, using recorded performance values and outputs of the applied plurality of subsets of analytics, being based on the optimization goal. A set of applied analytics are displayed, where the set of applied analytics includes a number of subsets of analytics of the plurality of analytics that meet the criteria of the optimal subset.Type: ApplicationFiled: August 6, 2015Publication date: February 9, 2017Inventors: Varun Bhagwan, Clemens Drews, Daniel F. Gruhl, Neal R. Lewis, April L. Webster, Steven R. Welch
-
Patent number: 9563661Abstract: Embodiments of the present invention disclose a method for optimizing data analysis. A set of analytics are received and a plurality of subsets of the set of analytics is generated. An optimization goal(s) is defined. The plurality of subsets of analytics are applied to a set of data. The output of applied analytics are recorded on a data storage device by the computer. Performance values of the computer applying the plurality of subsets of analytics are recorded and displayed. An optimal subset of the plurality of subsets of analytics is calculated, using recorded performance values and outputs of the applied plurality of subsets of analytics, being based on the optimization goal. A set of applied analytics are displayed, where the set of applied analytics includes a number of subsets of analytics of the plurality of analytics that meet the criteria of the optimal subset.Type: GrantFiled: July 22, 2016Date of Patent: February 7, 2017Assignee: International Business Machines CorporationInventors: Varun Bhagwan, Clemens Drews, Daniel F. Gruhl, Neal R. Lewis, April L. Webster, Steven R. Welch
-
Publication number: 20160335238Abstract: Embodiments of the disclosure relate to automatic analytic driven markup for rapid handling of forms. Aspects include receiving a form, identifying one or more characters on the form by performing optical character recognition on the form, and identifying one or more phrases of interest from the one or more characters using automated analytics. Aspects also include generating a map of a spacial location of each of the one or more phrases of interest on the form and creating a revised form based on the form and the map.Type: ApplicationFiled: May 13, 2015Publication date: November 17, 2016Inventors: Varun Bhagwan, Clemens Drews, Daniel F. Gruhl, Neal R. Lewis, April L. Webster, Steven R. Welch
-
Patent number: 9348806Abstract: Embodiments of the present invention relate to a pattern-based system for building dictionaries of terms related to a seed set of terms. In one embodiment, a text is read. The text comprises a plurality of tokens. A first plurality of patterns is read. The first plurality of tokens is searched using the first plurality of patterns to generate a plurality of candidate terms. Each of the plurality of candidate term comprises one or more of the plurality of tokens. A plurality of seed terms is read. Each of the first plurality of patterns is scored based on the plurality of candidate terms and the plurality of seed terms.Type: GrantFiled: September 30, 2014Date of Patent: May 24, 2016Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Kenneth L. Clarkson, Daniel F. Gruhl, Neal R. Lewis, Nimrod Megiddo
-
Publication number: 20160117522Abstract: Probabilistic surfacing of potentially sensitive identifiers is provided. In one embodiment of the present invention, a method of and computer program product for surfacing of potentially sensitive identifiers are provided. An input string is read. The input string has a length. The input string is divided into a plurality of tokens. Each of the tokens has a predetermined length. A score is determined for each of the plurality of tokens. A composite score is determined based on the scores of each of the plurality of tokens. Whether the input string comprises an identifier is determined by comparing the composite score to a predetermined threshold.Type: ApplicationFiled: October 22, 2014Publication date: April 28, 2016Inventors: Varun Bhagwan, Laura Chiticariu, Daniel F. Gruhl
-
Publication number: 20160092435Abstract: Embodiments of the present invention relate to a pattern-based system for building dictionaries of terms related to a seed set of terms. In one embodiment, a text is read. The text comprises a plurality of tokens. A first plurality of patterns is read. The first plurality of tokens is searched using the first plurality of patterns to generate a plurality of candidate terms. Each of the plurality of candidate term comprises one or more of the plurality of tokens. A plurality of seed terms is read. Each of the first plurality of patterns is scored based on the plurality of candidate terms and the plurality of seed terms.Type: ApplicationFiled: September 30, 2014Publication date: March 31, 2016Inventors: Kenneth L. Clarkson, Daniel F. Gruhl, Neal R. Lewis, Nimrod Megiddo
-
Patent number: 9280441Abstract: A race condition in a workflow representation is detected and corrected. First and second contracts are retrieved for respective first and second analytics of the workflow representation, wherein the contracts specify input types and output types of their analytics. Both contracts include information required to execute their respective analytics by a workflow executor. It is determined that the output type of the first analytic matches the input type of the second analytic based on a comparison of the first contract and the second contract, and that the workflow representation does not include a directed edge connecting the first analytic to the second analytic. The inclusion of a directed edge in the workflow representation connecting the first analytic to the second analytic will correct the race condition in the workflow representation.Type: GrantFiled: June 4, 2013Date of Patent: March 8, 2016Assignee: International Business Machines CorporationInventors: Varun Bhagwan, Daniel F. Gruhl, John T. E. Timm, April L. Webster
-
Publication number: 20150379689Abstract: Embodiments of the invention provide a system and circuit for image distortion correction. The system includes neurosynaptic core circuits that: receive a set of inputs comprising image dimensions and pixel distortion coefficients for one or more image frames via one or more input core circuits, map each distorted pixel to zero or more undistorted pixels by processing the set of inputs corresponding to each pixel of the one or more image frames by the one or more input core circuits, and route corresponding pixel intensity values of each distorted pixel to output undistorted pixels for each image frame via one or more output core circuits.Type: ApplicationFiled: June 25, 2014Publication date: December 31, 2015Inventors: Alexander Andreopoulos, Daniel F. Gruhl, Michael Karasick, Dharmendra S. Modha
-
Publication number: 20150347433Abstract: Embodiments of the present invention relate to performing entity recognition on a stream while providing ongoing training or supplementation of an entity dictionary. In one embodiment, a method of and computer program product for stream based named entity recognition is provided. A first portion of a textual input is received. A plurality of patterns is applied to the first portion to determine that a predetermined type is present in the first portion. Approval is requested of the presence of the predetermined type. An indication of approval or disapproval of the predetermined type is received. A dictionary is supplemented according to the indication. A second portion of the textual input is received. The plurality of patterns is applied to the second portion.Type: ApplicationFiled: May 30, 2014Publication date: December 3, 2015Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Varun Bhagwan, Clemens Drews, Daniel F. Gruhl, Neal R. Lewis, April L. Webster, Steven R. Welch
-
Publication number: 20150324341Abstract: A method for updating automated annotations for a paper-based document is provided. The method may include receiving an image of the paper-based document, the image of the paper-based document including a plurality of highlighted annotations having at least one edited annotation. The method may also include extracting the at least one edited annotation from the received image of the paper-based document. The method may further include adding the extracted at least one edited annotation of the paper-based document to a data retention system.Type: ApplicationFiled: May 7, 2014Publication date: November 12, 2015Applicant: International Business Machines CorporationInventors: Varun Bhagwan, Daniel F. Gruhl, Neal R. Lewis, Steven R. Welch
-
Publication number: 20140359625Abstract: A race condition in a workflow representation is detected and corrected. First and second contracts are retrieved for respective first and second analytics of the workflow representation, wherein the contracts specify input types and output types of their analytics. Both contracts include information required to execute their respective analytics by a workflow executor. It is determined that the output type of the first analytic matches the input type of the second analytic based on a comparison of the first contract and the second contract, and that the workflow representation does not include a directed edge connecting the first analytic to the second analytic. The inclusion of a directed edge in the workflow representation connecting the first analytic to the second analytic will correct the race condition in the workflow representation.Type: ApplicationFiled: June 4, 2013Publication date: December 4, 2014Inventors: Varun Bhagwan, Daniel F. Gruhl, John T.E. Timm, April L. Webster