Patents by Inventor Daniel F. Gruhl

Daniel F. Gruhl has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20170220584
    Abstract: Embodiments of the invention relate to identification of material that contains linguistically related content. Key phrases are filtered through a content store to ascertain the linguistically related content and to move the identified content to a target corpus. At least two iterations of the filtering process are employed. Each subsequent iteration of the filtering process identifies at least one new key phrase within the filtered material. In addition, each subsequent iteration takes place with a union of each previously employed key phrase and each new key phrase. As new content is identified, the content is populated to the target corpus.
    Type: Application
    Filed: February 22, 2016
    Publication date: August 3, 2017
    Applicant: International Business Machines Corporation
    Inventors: Daniel F. Gruhl, Joseph M. Kaufmann, Joseph N. Kozhaya, Pablo N. Mendes, Sridhar Sudarsan
  • Publication number: 20170220936
    Abstract: Embodiments of the invention relate to identification of material that contains linguistically related content. Key phrases are filtered through a content store to ascertain the linguistically related content and to move the identified content to a target corpus. At least two iterations of the filtering process are employed. Each subsequent iteration of the filtering process identifies at least one new key phrase within the filtered material. In addition, each subsequent iteration takes place with a union of each previously employed key phrase and each new key phrase. As new content is identified, the content is populated to the target corpus.
    Type: Application
    Filed: January 29, 2016
    Publication date: August 3, 2017
    Applicant: International Business Machines Corporation
    Inventors: Daniel F. Gruhl, Joseph M. Kaufmann, Joseph N. Kozhaya, Pablo N. Mendes, Sridhar Sudarsan
  • Patent number: 9672593
    Abstract: Embodiments of the invention provide a system and circuit for image distortion correction. The system includes neurosynaptic core circuits that: receive a set of inputs comprising image dimensions and pixel distortion coefficients for one or more image frames via one or more input core circuits, map each distorted pixel to zero or more undistorted pixels by processing the set of inputs corresponding to each pixel of the one or more image frames by the one or more input core circuits, and route corresponding pixel intensity values of each distorted pixel to output undistorted pixels for each image frame via one or more output core circuits.
    Type: Grant
    Filed: June 25, 2014
    Date of Patent: June 6, 2017
    Assignee: International Business Machines Corporation
    Inventors: Alexander Andreopoulos, Daniel F. Gruhl, Michael Karasick, Dharmendra S. Modha
  • Publication number: 20170147693
    Abstract: Methods and systems for optimizing: the retrieval of data from multiple sources are described. A slot map including slots for the storage of data elements can be obtained. The data elements associated with the slots can be prioritized by weighting values with costs of retrieving the data elements from respective data sources. Each value can be associated with a different data element and can indicate a respective degree of importance of the associated data element. Further, the systems and methods can direct the retrieval of data elements from the respective data sources in an order in accordance with the priority of the data elements to optimize the quality of data obtainable within a critical time constraint. In addition, the retrieved data elements can be stored in corresponding slots on a storage medium.
    Type: Application
    Filed: February 2, 2017
    Publication date: May 25, 2017
    Inventors: VARUN BHAGWAN, TYRONE W. A. GRANDISON, DANIEL F. GRUHL
  • Patent number: 9652627
    Abstract: Probabilistic surfacing of potentially sensitive identifiers is provided. In one embodiment of the present invention, a method of and computer program product for surfacing of potentially sensitive identifiers are provided. An input string is read. The input string has a length. The input string is divided into a plurality of tokens. Each of the tokens has a predetermined length. A score is determined for each of the plurality of tokens. A composite score is determined based on the scores of each of the plurality of tokens. Whether the input string comprises an identifier is determined by comparing the composite score to a predetermined threshold.
    Type: Grant
    Filed: October 22, 2014
    Date of Patent: May 16, 2017
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Varun Bhagwan, Laura Chiticariu, Daniel F. Gruhl
  • Patent number: 9589065
    Abstract: Methods and systems for optimizing the retrieval of data from multiple sources are described. A slot map including slots for the storage of data elements can be obtained. The data elements associated with the slots can be prioritized by weighting values with costs of retrieving the data elements from respective data sources. Each value can be associated with a different data element and can indicate a respective degree of importance of the associated data element. Further, the systems and methods can direct the retrieval of data elements from the respective data sources in an order in accordance with the priority of the data elements to optimize the quality of data obtainable within a critical time constraint. In addition, the retrieved data elements can be stored in corresponding slots on a storage medium.
    Type: Grant
    Filed: September 5, 2012
    Date of Patent: March 7, 2017
    Assignee: International Business Machines Corporation
    Inventors: Varun Bhagwan, Tyrone W. A. Grandison, Daniel F. Gruhl
  • Publication number: 20170046241
    Abstract: Embodiments of the present invention disclose a computer program product for optimizing data analysis. A set of analytics are received and a plurality of subsets of the set of analytics is generated. An optimization goal(s) is defined. The plurality of subsets of analytics are applied to a set of data. The output of applied analytics are recorded on a data storage device by the computer. Performance values of the computer applying the plurality of subsets of analytics are recorded and displayed. An optimal subset of the plurality of subsets of analytics is calculated, using recorded performance values and outputs of the applied plurality of subsets of analytics, being based on the optimization goal. A set of applied analytics are displayed, where the set of applied analytics includes a number of subsets of analytics of the plurality of analytics that meet the criteria of the optimal subset.
    Type: Application
    Filed: November 2, 2016
    Publication date: February 16, 2017
    Inventors: Varun Bhagwan, Clemens Drews, Daniel F. Gruhl, Neal R. Lewis, April L. Webster, Steven R. Welch
  • Publication number: 20170046413
    Abstract: Embodiments of the present invention disclose a computer system for optimizing data analysis. A set of analytics are received and a plurality of subsets of the set of analytics is generated. An optimization goal(s) is defined. The plurality of subsets of analytics are applied to a set of data. The output of applied analytics are recorded on a data storage device by the computer. Performance values of the computer applying the plurality of subsets of analytics are recorded and displayed. An optimal subset of the plurality of subsets of analytics is calculated, using recorded performance values and outputs of the applied plurality of subsets of analytics, being based on the optimization goal. A set of applied analytics are displayed, where the set of applied analytics includes a number of subsets of analytics of the plurality of analytics that meet the criteria of the optimal subset.
    Type: Application
    Filed: November 2, 2016
    Publication date: February 16, 2017
    Inventors: Varun Bhagwan, Clemens Drews, Daniel F. Gruhl, Neal R. Lewis, April L. Webster, Steven R. Welch
  • Publication number: 20170039241
    Abstract: Embodiments of the present invention disclose a method for optimizing data analysis. A set of analytics are received and a plurality of subsets of the set of analytics is generated. An optimization goal(s) is defined. The plurality of subsets of analytics are applied to a set of data. The output of applied analytics are recorded on a data storage device by the computer. Performance values of the computer applying the plurality of subsets of analytics are recorded and displayed. An optimal subset of the plurality of subsets of analytics is calculated, using recorded performance values and outputs of the applied plurality of subsets of analytics, being based on the optimization goal. A set of applied analytics are displayed, where the set of applied analytics includes a number of subsets of analytics of the plurality of analytics that meet the criteria of the optimal subset.
    Type: Application
    Filed: July 22, 2016
    Publication date: February 9, 2017
    Inventors: Varun Bhagwan, Clemens Drews, Daniel F. Gruhl, Neal R. Lewis, April L. Webster, Steven R. Welch
  • Publication number: 20170039249
    Abstract: Embodiments of the present invention disclose a method, computer program product, and system for optimizing data analysis. A set of analytics are received and a plurality of subsets of the set of analytics is generated. An optimization goal(s) is defined. The plurality of subsets of analytics are applied to a set of data. The output of applied analytics are recorded on a data storage device by the computer. Performance values of the computer applying the plurality of subsets of analytics are recorded and displayed. An optimal subset of the plurality of subsets of analytics is calculated, using recorded performance values and outputs of the applied plurality of subsets of analytics, being based on the optimization goal. A set of applied analytics are displayed, where the set of applied analytics includes a number of subsets of analytics of the plurality of analytics that meet the criteria of the optimal subset.
    Type: Application
    Filed: August 6, 2015
    Publication date: February 9, 2017
    Inventors: Varun Bhagwan, Clemens Drews, Daniel F. Gruhl, Neal R. Lewis, April L. Webster, Steven R. Welch
  • Patent number: 9563661
    Abstract: Embodiments of the present invention disclose a method for optimizing data analysis. A set of analytics are received and a plurality of subsets of the set of analytics is generated. An optimization goal(s) is defined. The plurality of subsets of analytics are applied to a set of data. The output of applied analytics are recorded on a data storage device by the computer. Performance values of the computer applying the plurality of subsets of analytics are recorded and displayed. An optimal subset of the plurality of subsets of analytics is calculated, using recorded performance values and outputs of the applied plurality of subsets of analytics, being based on the optimization goal. A set of applied analytics are displayed, where the set of applied analytics includes a number of subsets of analytics of the plurality of analytics that meet the criteria of the optimal subset.
    Type: Grant
    Filed: July 22, 2016
    Date of Patent: February 7, 2017
    Assignee: International Business Machines Corporation
    Inventors: Varun Bhagwan, Clemens Drews, Daniel F. Gruhl, Neal R. Lewis, April L. Webster, Steven R. Welch
  • Publication number: 20160335238
    Abstract: Embodiments of the disclosure relate to automatic analytic driven markup for rapid handling of forms. Aspects include receiving a form, identifying one or more characters on the form by performing optical character recognition on the form, and identifying one or more phrases of interest from the one or more characters using automated analytics. Aspects also include generating a map of a spacial location of each of the one or more phrases of interest on the form and creating a revised form based on the form and the map.
    Type: Application
    Filed: May 13, 2015
    Publication date: November 17, 2016
    Inventors: Varun Bhagwan, Clemens Drews, Daniel F. Gruhl, Neal R. Lewis, April L. Webster, Steven R. Welch
  • Patent number: 9348806
    Abstract: Embodiments of the present invention relate to a pattern-based system for building dictionaries of terms related to a seed set of terms. In one embodiment, a text is read. The text comprises a plurality of tokens. A first plurality of patterns is read. The first plurality of tokens is searched using the first plurality of patterns to generate a plurality of candidate terms. Each of the plurality of candidate term comprises one or more of the plurality of tokens. A plurality of seed terms is read. Each of the first plurality of patterns is scored based on the plurality of candidate terms and the plurality of seed terms.
    Type: Grant
    Filed: September 30, 2014
    Date of Patent: May 24, 2016
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Kenneth L. Clarkson, Daniel F. Gruhl, Neal R. Lewis, Nimrod Megiddo
  • Publication number: 20160117522
    Abstract: Probabilistic surfacing of potentially sensitive identifiers is provided. In one embodiment of the present invention, a method of and computer program product for surfacing of potentially sensitive identifiers are provided. An input string is read. The input string has a length. The input string is divided into a plurality of tokens. Each of the tokens has a predetermined length. A score is determined for each of the plurality of tokens. A composite score is determined based on the scores of each of the plurality of tokens. Whether the input string comprises an identifier is determined by comparing the composite score to a predetermined threshold.
    Type: Application
    Filed: October 22, 2014
    Publication date: April 28, 2016
    Inventors: Varun Bhagwan, Laura Chiticariu, Daniel F. Gruhl
  • Publication number: 20160092435
    Abstract: Embodiments of the present invention relate to a pattern-based system for building dictionaries of terms related to a seed set of terms. In one embodiment, a text is read. The text comprises a plurality of tokens. A first plurality of patterns is read. The first plurality of tokens is searched using the first plurality of patterns to generate a plurality of candidate terms. Each of the plurality of candidate term comprises one or more of the plurality of tokens. A plurality of seed terms is read. Each of the first plurality of patterns is scored based on the plurality of candidate terms and the plurality of seed terms.
    Type: Application
    Filed: September 30, 2014
    Publication date: March 31, 2016
    Inventors: Kenneth L. Clarkson, Daniel F. Gruhl, Neal R. Lewis, Nimrod Megiddo
  • Patent number: 9280441
    Abstract: A race condition in a workflow representation is detected and corrected. First and second contracts are retrieved for respective first and second analytics of the workflow representation, wherein the contracts specify input types and output types of their analytics. Both contracts include information required to execute their respective analytics by a workflow executor. It is determined that the output type of the first analytic matches the input type of the second analytic based on a comparison of the first contract and the second contract, and that the workflow representation does not include a directed edge connecting the first analytic to the second analytic. The inclusion of a directed edge in the workflow representation connecting the first analytic to the second analytic will correct the race condition in the workflow representation.
    Type: Grant
    Filed: June 4, 2013
    Date of Patent: March 8, 2016
    Assignee: International Business Machines Corporation
    Inventors: Varun Bhagwan, Daniel F. Gruhl, John T. E. Timm, April L. Webster
  • Publication number: 20150379689
    Abstract: Embodiments of the invention provide a system and circuit for image distortion correction. The system includes neurosynaptic core circuits that: receive a set of inputs comprising image dimensions and pixel distortion coefficients for one or more image frames via one or more input core circuits, map each distorted pixel to zero or more undistorted pixels by processing the set of inputs corresponding to each pixel of the one or more image frames by the one or more input core circuits, and route corresponding pixel intensity values of each distorted pixel to output undistorted pixels for each image frame via one or more output core circuits.
    Type: Application
    Filed: June 25, 2014
    Publication date: December 31, 2015
    Inventors: Alexander Andreopoulos, Daniel F. Gruhl, Michael Karasick, Dharmendra S. Modha
  • Publication number: 20150347433
    Abstract: Embodiments of the present invention relate to performing entity recognition on a stream while providing ongoing training or supplementation of an entity dictionary. In one embodiment, a method of and computer program product for stream based named entity recognition is provided. A first portion of a textual input is received. A plurality of patterns is applied to the first portion to determine that a predetermined type is present in the first portion. Approval is requested of the presence of the predetermined type. An indication of approval or disapproval of the predetermined type is received. A dictionary is supplemented according to the indication. A second portion of the textual input is received. The plurality of patterns is applied to the second portion.
    Type: Application
    Filed: May 30, 2014
    Publication date: December 3, 2015
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Varun Bhagwan, Clemens Drews, Daniel F. Gruhl, Neal R. Lewis, April L. Webster, Steven R. Welch
  • Publication number: 20150324341
    Abstract: A method for updating automated annotations for a paper-based document is provided. The method may include receiving an image of the paper-based document, the image of the paper-based document including a plurality of highlighted annotations having at least one edited annotation. The method may also include extracting the at least one edited annotation from the received image of the paper-based document. The method may further include adding the extracted at least one edited annotation of the paper-based document to a data retention system.
    Type: Application
    Filed: May 7, 2014
    Publication date: November 12, 2015
    Applicant: International Business Machines Corporation
    Inventors: Varun Bhagwan, Daniel F. Gruhl, Neal R. Lewis, Steven R. Welch
  • Publication number: 20140359625
    Abstract: A race condition in a workflow representation is detected and corrected. First and second contracts are retrieved for respective first and second analytics of the workflow representation, wherein the contracts specify input types and output types of their analytics. Both contracts include information required to execute their respective analytics by a workflow executor. It is determined that the output type of the first analytic matches the input type of the second analytic based on a comparison of the first contract and the second contract, and that the workflow representation does not include a directed edge connecting the first analytic to the second analytic. The inclusion of a directed edge in the workflow representation connecting the first analytic to the second analytic will correct the race condition in the workflow representation.
    Type: Application
    Filed: June 4, 2013
    Publication date: December 4, 2014
    Inventors: Varun Bhagwan, Daniel F. Gruhl, John T.E. Timm, April L. Webster