Patents by Inventor Paul Joseph Hake

Paul Joseph Hake has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11990214
    Abstract: Aspects include receiving a document and classifying at least a subset of the document as having a first type of data. Features are extracted from the document. The extracting includes initiating processing of the at least a subset of the document by a first processing engine that was previously trained to extract features from the first type of data. The extracting also includes initiating processing of a remaining portion of the document not included in the at least a subset of the document by a second processing engine that was previously trained to extract features from a second type of data. The first type of data is different than the second type of data. Features are received from one or both of the first processing engine and the second processing engine. The received features are stored as features of the document.
    Type: Grant
    Filed: July 21, 2020
    Date of Patent: May 21, 2024
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Paul Joseph Hake, Igor S. Ramos, Andrew J. Lavery, Scott Carrier
  • Patent number: 11755822
    Abstract: Aspects of the invention include a computer-implemented method for generating promise identifiers for documents. Aspects include processing a document including a reference, wherein processing includes performing natural language processing (NLP) the document, and identifying the reference included in the document. Aspects also include generating a promise identifier for the reference in the document, and responsive to processing the document, resolving the promise identifier for the reference by providing data of the reference associated with the promise identifier. Aspects of the invention also include a computer program product and system for generating promise identifiers for documents.
    Type: Grant
    Filed: August 4, 2020
    Date of Patent: September 12, 2023
    Assignee: International Business Machines Corporation
    Inventors: Andrew J. Lavery, Scott Carrier, Paul Joseph Hake, Igor S. Ramos
  • Patent number: 11520972
    Abstract: Aspects of the invention include resolving future reference identifiers for documents. Aspects of the invention include processing a document including a reference to a future event, wherein processing includes performing natural language processing (NLP) on the document, and identifying the reference to the future event included in the document. Aspects of the invention also include generating a future reference identifier for the reference to the future event, and responsive to processing an occurrence of the future event, resolving the future reference identifier by providing data from a subsequent document for the future event associated with the future reference identifier.
    Type: Grant
    Filed: August 4, 2020
    Date of Patent: December 6, 2022
    Assignee: International Business Machines Corporation
    Inventors: Andrew J Lavery, Scott Carrier, Paul Joseph Hake, Igor S. Ramos
  • Patent number: 11347928
    Abstract: Aspects of the invention include detecting and processing sections spanning processed document partitions by caching a document partition. The document partition includes metadata indicating that the document partition is a portion of a whole document. Aspects also include pairing a candidate paragraph from the document partition with a cached paragraph segment and determining, using a coherence model, a probability that the candidate paragraph and the cached paragraph segment constitute a semantically coherent paragraph. Aspects further include discarding the cached paragraph segment and processing the candidate paragraph and the cached paragraph segment separately based on a determination that the probability is less than a threshold level and processing the candidate paragraph and the cached paragraph segment together as a cross-partition paragraph based on a determination that the probability is greater than the threshold level.
    Type: Grant
    Filed: July 27, 2020
    Date of Patent: May 31, 2022
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Andrew J Lavery, Igor S. Ramos, Paul Joseph Hake, Scott Carrier
  • Publication number: 20220043967
    Abstract: Aspects of the invention include a computer-implemented method for generating promise identifiers for documents. Aspects include processing a document including a reference, wherein processing includes performing natural language processing (NLP) the document, and identifying the reference included in the document. Aspects also include generating a promise identifier for the reference in the document, and responsive to processing the document, resolving the promise identifier for the reference by providing data of the reference associated with the promise identifier. Aspects of the invention also include a computer program product and system for generating promise identifiers for documents.
    Type: Application
    Filed: August 4, 2020
    Publication date: February 10, 2022
    Inventors: Andrew J. Lavery, Scott Carrier, Paul Joseph Hake, Igor S. Ramos
  • Publication number: 20220043968
    Abstract: Aspects of the invention include resolving future reference identifiers for documents. Aspects of the invention include processing a document including a reference to a future event, wherein processing includes performing natural language processing (NLP) on the document, and identifying the reference to the future event included in the document. Aspects of the invention also include generating a future reference identifier for the reference to the future event, and responsive to processing an occurrence of the future event, resolving the future reference identifier by providing data from a subsequent document for the future event associated with the future reference identifier.
    Type: Application
    Filed: August 4, 2020
    Publication date: February 10, 2022
    Inventors: Andrew J Lavery, Scott Carrier, Paul Joseph Hake, Igor S. Ramos
  • Publication number: 20220028502
    Abstract: Aspects include receiving a document and classifying at least a subset of the document as having a first type of data. Features are extracted from the document. The extracting includes initiating processing of the at least a subset of the document by a first processing engine that was previously trained to extract features from the first type of data. The extracting also includes initiating processing of a remaining portion of the document not included in the at least a subset of the document by a second processing engine that was previously trained to extract features from a second type of data. The first type of data is different than the second type of data. Features are received from one or both of the first processing engine and the second processing engine. The received features are stored as features of the document.
    Type: Application
    Filed: July 21, 2020
    Publication date: January 27, 2022
    Inventors: Paul Joseph Hake, Igor S. Ramos, Andrew J. Lavery, Scott Carrier
  • Publication number: 20220027612
    Abstract: Aspects of the invention include detecting and processing sections spanning processed document partitions by caching a document partition. The document partition includes metadata indicating that the document partition is a portion of a whole document. Aspects also include pairing a candidate paragraph from the document partition with a cached paragraph segment and determining, using a coherence model, a probability that the candidate paragraph and the cached paragraph segment constitute a semantically coherent paragraph. Aspects further include discarding the cached paragraph segment and processing the candidate paragraph and the cached paragraph segment separately based on a determination that the probability is less than a threshold level and processing the candidate paragraph and the cached paragraph segment together as a cross-partition paragraph based on a determination that the probability is greater than the threshold level.
    Type: Application
    Filed: July 27, 2020
    Publication date: January 27, 2022
    Inventors: ANDREW J LAVERY, IGOR S. RAMOS, PAUL JOSEPH HAKE, SCOTT CARRIER
  • Patent number: 11222165
    Abstract: According to one or more embodiments of the present invention, an input request to a natural language processing (NLP) system is optimized. A window-size is selected for annotating an input corpus. The corpus is divided into partitions of the window-size, each partition processed separately. Further, a first set of entities is identified in a first partition, and a second set of entities in a second partition. Further, a third partition containing a first segment and a second segment is determined. The first segment overlaps the first partition, and the second segment overlaps the second partition. The method further includes identifying a third set of entities in the third partition. In response to the third set of entities being distinct from a set of entities from the first segment and the second segment, the window-size is adjusted. The input request for the NLP system is generated using the adjusted window-size.
    Type: Grant
    Filed: August 18, 2020
    Date of Patent: January 11, 2022
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Igor S. Ramos, Andrew J. Lavery, Scott Carrier, Paul Joseph Hake