Patents by Inventor Harry Urbschat

Harry Urbschat has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20250209846
    Abstract: Described herein are various technologies pertaining to text extraction from a document. A computing device receives the document. The document comprises computer-readable text and a layout, wherein the layout defines positions of the computer-readable text. Responsive to receiving the document, the computing device identifies at least one textual element in the computer-readable text based upon spatial factors between portions of the computer-readable text and contextual relationships between the portions of the computer-readable text. The computing device then outputs the at least one textual element.
    Type: Application
    Filed: March 10, 2025
    Publication date: June 26, 2025
    Inventors: Ralph Meier, Thorsten Wanschura, Johannes Hausmann, Harry Urbschat
  • Publication number: 20250139150
    Abstract: A hierarchical dictionary having methods of storing words based on frequency thereof in one or more documents that includes the steps of identifying a hash value corresponding to an inputted word; storing the word in a first hash map and in a second hash map having a substantially larger word storage capacity than the first hash map based on the identified hash value; clearing the first hash map at every predetermined period or triggering event; determining whether a frequency of the word as stored in the second hash map exceeds a predetermined value; and if so, promoting the word from the second hash map to a third hash map having a substantially larger word storage capacity than the second hash map for long-term storage and later retrieval.
    Type: Application
    Filed: January 6, 2025
    Publication date: May 1, 2025
    Inventors: Ralph Meier, Johannes Hausmann, Harry Urbschat, Thorsten Wanschura
  • Publication number: 20250094479
    Abstract: A hierarchical dictionary having methods of storing words based on frequency thereof in one or more documents which includes the steps of identifying a hash value corresponding to an inputted word; storing the word in a first hash map and in a second hash map having a substantially larger word storage capacity than the first hash map based on the identified hash value; clearing the first hash map at every predetermined period or triggering event; determining whether a frequency of the word as stored in the second hash map exceeds a predetermined value; and if so, promoting the word from the second hash map to a third hash map having a substantially larger word storage capacity than the second hash map for long-term storage and later retrieval.
    Type: Application
    Filed: October 28, 2024
    Publication date: March 20, 2025
    Inventors: Ralph Meier, Johannes Hausmann, Harry Urbschat, Thorsten Wanschura
  • Patent number: 12249171
    Abstract: Described herein are various technologies pertaining to text extraction from a document. A computing device receives the document. The document comprises computer-readable text and a layout, wherein the layout defines positions of the computer-readable text. Responsive to receiving the document, the computing device identifies at least one textual element in the computer-readable text based upon spatial factors between portions of the computer-readable text and contextual relationships between the portions of the computer-readable text. The computing device then outputs the at least one textual element.
    Type: Grant
    Filed: August 29, 2023
    Date of Patent: March 11, 2025
    Assignee: Hyland Software, Inc.
    Inventors: Ralph Meier, Thorsten Wanschura, Johannes Hausmann, Harry Urbschat
  • Patent number: 12189674
    Abstract: A hierarchical dictionary having methods of storing words based on frequency thereof in one or more documents which includes the steps of identifying a hash value corresponding to an inputted word; storing the word in a first hash map and in a second hash map having a substantially larger word storage capacity than the first hash map based on the identified hash value; clearing the first hash map at every predetermined period or triggering event; determining whether a frequency of the word as stored in the second hash map exceeds a predetermined value; and if so, promoting the word from the second hash map to a third hash map having a substantially larger word storage capacity than the second hash map for long-term storage and later retrieval.
    Type: Grant
    Filed: February 12, 2021
    Date of Patent: January 7, 2025
    Assignee: HYLAND SWITZERLAND SARL
    Inventors: Ralph Meier, Johannes Hausmann, Harry Urbschat, Thorsten Wanschura
  • Patent number: 12130854
    Abstract: A hierarchical dictionary having methods of storing words based on frequency thereof in one or more documents which includes the steps of identifying a hash value corresponding to an inputted word; storing the word in a first hash map and in a second hash map having a substantially larger word storage capacity than the first hash map based on the identified hash value; clearing the first hash map at every predetermined period or triggering event; determining whether a frequency of the word as stored in the second hash map exceeds a predetermined value; and if so, promoting the word from the second hash map to a third hash map having a substantially larger word storage capacity than the second hash map for long-term storage and later retrieval.
    Type: Grant
    Filed: February 12, 2021
    Date of Patent: October 29, 2024
    Assignee: HYLAND SWITZERLAND SARL
    Inventors: Ralph Meier, Johannes Hausmann, Harry Urbschat, Thorsten Wanschura
  • Publication number: 20230401879
    Abstract: Described herein are various technologies pertaining to text extraction from a document. A computing device receives the document. The document comprises computer-readable text and a layout, wherein the layout defines positions of the computer-readable text. Responsive to receiving the document, the computing device identifies at least one textual element in the computer-readable text based upon spatial factors between portions of the computer-readable text and contextual relationships between the portions of the computer-readable text. The computing device then outputs the at least one textual element.
    Type: Application
    Filed: August 29, 2023
    Publication date: December 14, 2023
    Inventors: Ralph Meier, Thorsten Wanschura, Johannes Hausmann, Harry Urbschat
  • Patent number: 11763588
    Abstract: Described herein are various technologies pertaining to text extraction from a document. A computing device receives the document. The document comprises computer-readable text and a layout, wherein the layout defines positions of the computer-readable text within a two-dimensional area represented by the document. Responsive to receiving the document, the computing device identifies at least one textual element in the computer-readable text based upon spatial factors between portions of the computer-readable text and contextual relationships between the portions of the computer-readable text. The computing device then outputs the at least one textual element.
    Type: Grant
    Filed: November 1, 2021
    Date of Patent: September 19, 2023
    Inventors: Ralph Meier, Thorsten Wanschura, Johannes Hausmann, Harry Urbschat
  • Publication number: 20220058386
    Abstract: Described herein are various technologies pertaining to text extraction from a document. A computing device receives the document. The document comprises computer-readable text and a layout, wherein the layout defines positions of the computer-readable text within a two-dimensional area represented by the document. Responsive to receiving the document, the computing device identifies at least one textual element in the computer-readable text based upon spatial factors between portions of the computer-readable text and contextual relationships between the portions of the computer-readable text. The computing device then outputs the at least one textual element.
    Type: Application
    Filed: November 1, 2021
    Publication date: February 24, 2022
    Inventors: Ralph Meier, Thorsten Wanschura, Johannes Hausmann, Harry Urbschat
  • Patent number: 11176364
    Abstract: Described herein are various technologies pertaining to text extraction from a document. A computing device receives the document. The document comprises computer-readable text and a layout, wherein the layout defines positions of the computer-readable text within a two-dimensional area represented by the document. Responsive to receiving the document, the computing device identifies at least one textual element in the computer-readable text based upon spatial factors between portions of the computer-readable text and contextual relationships between the portions of the computer-readable text. The computing device then outputs the at least one textual element.
    Type: Grant
    Filed: March 19, 2019
    Date of Patent: November 16, 2021
    Assignee: HYLAND SOFTWARE, INC.
    Inventors: Ralph Meier, Thorsten Wanschura, Johannes Hausmann, Harry Urbschat
  • Publication number: 20210165813
    Abstract: A hierarchical dictionary having methods of storing words based on frequency thereof in one or more documents which includes the steps of identifying a hash value corresponding to an inputted word; storing the word in a first hash map and in a second hash map having a substantially larger word storage capacity than the first hash map based on the identified hash value; clearing the first hash map at every predetermined period or triggering event; determining whether a frequency of the word as stored in the second hash map exceeds a predetermined value; and if so, promoting the word from the second hash map to a third hash map having a substantially larger word storage capacity than the second hash map for long-term storage and later retrieval.
    Type: Application
    Filed: February 12, 2021
    Publication date: June 3, 2021
    Inventors: Ralph Meier, Johannes Hausmann, Harry Urbschat, Thorsten Wanschura
  • Publication number: 20210165814
    Abstract: A hierarchical dictionary having methods of storing words based on frequency thereof in one or more documents which includes the steps of identifying a hash value corresponding to an inputted word; storing the word in a first hash map and in a second hash map having a substantially larger word storage capacity than the first hash map based on the identified hash value; clearing the first hash map at every predetermined period or triggering event; determining whether a frequency of the word as stored in the second hash map exceeds a predetermined value; and if so, promoting the word from the second hash map to a third hash map having a substantially larger word storage capacity than the second hash map for long-term storage and later retrieval.
    Type: Application
    Filed: February 12, 2021
    Publication date: June 3, 2021
    Inventors: Ralph Meier, Johannes Hausmann, Harry Urbschat, Thorsten Wanschura
  • Patent number: 10922347
    Abstract: A hierarchical dictionary having methods of storing words based on frequency thereof in one or more documents which includes the steps of identifying a hash value corresponding to an inputted word; storing the word in a first hash map and in a second hash map having a substantially larger word storage capacity than the first hash map based on the identified hash value; clearing the first hash map at every predetermined period or triggering event; determining whether a frequency of the word as stored in the second hash map exceeds a predetermined value; and if so, promoting the word from the second hash map to a third hash map having a substantially larger word storage capacity than the second hash map for long-term storage and later retrieval.
    Type: Grant
    Filed: December 30, 2016
    Date of Patent: February 16, 2021
    Assignee: HYLAND SWITZERLAND SÀRL
    Inventors: Ralph Meier, Johannes Hausmann, Harry Urbschat, Thorsten Wanschura
  • Publication number: 20200302166
    Abstract: Described herein are various technologies pertaining to text extraction from a document. A computing device receives the document. The document comprises computer-readable text and a layout, wherein the layout defines positions of the computer-readable text within a two-dimensional area represented by the document. Responsive to receiving the document, the computing device identifies at least one textual element in the computer-readable text based upon spatial factors between portions of the computer-readable text and contextual relationships between the portions of the computer-readable text. The computing device then outputs the at least one textual element.
    Type: Application
    Filed: March 19, 2019
    Publication date: September 24, 2020
    Inventors: Ralph Meier, Thorsten Wanschura, Johannes Hausmann, Harry Urbschat
  • Patent number: 9798924
    Abstract: Methods and apparatus delineate grouped together content in documents. Void and unvoid pixels in document images get clustered together. Execution of a histogram and autocorrelation function, including peak detection, against the unvoid clusters reveals the content. Techniques for clustering include iteratively transforming an original image into secondary images with a Haar wavelet transformation, for example. Clustering begins on a lowest image plane and advances to a next highest plane until all void and unvoid pixels in the images are grouped. Void clusters at lower levels remain void clusters at higher levels, thus only unvoid clusters of pixels require processing at higher levels thereby optimizing processing. Imaging devices with scanners define suitable hardware for transformation of the document into images and processors with executable code cluster together pixels to delineate content. Further processing includes executing OCR or other routines post void/unvoid analysis.
    Type: Grant
    Filed: August 17, 2015
    Date of Patent: October 24, 2017
    Assignee: KOFAX INTERNATIONAL SWITZERLAND SARL
    Inventors: Ralph Meier, Johannes Hausmann, Harry Urbschat, Thorsten Wanschura
  • Patent number: 9785830
    Abstract: Methods to select and extract tabular data among the optical character recognition returned strings to automatically process documents, including documents containing academic transcripts.
    Type: Grant
    Filed: December 28, 2015
    Date of Patent: October 10, 2017
    Assignee: KOFAX INTERNATIONAL SWITZERLAND SARL
    Inventors: Johannes Hausmann, Ralph Meier, Harry Urbschat, Thorsten Wanschura
  • Publication number: 20170220679
    Abstract: A hierarchical dictionary having methods of storing words based on frequency thereof in one or more documents which includes the steps of identifying a hash value corresponding to an inputted word; storing the word in a first hash map and in a second hash map having a substantially larger word storage capacity than the first hash map based on the identified hash value; clearing the first hash map at every predetermined period or triggering event; determining whether a frequency of the word as stored in the second hash map exceeds a predetermined value; and if so, promoting the word from the second hash map to a third hash map having a substantially larger word storage capacity than the second hash map for long-term storage and later retrieval.
    Type: Application
    Filed: December 30, 2016
    Publication date: August 3, 2017
    Inventors: Ralph Meier, Johannes Hausmann, Harry Urbschat, Thorsten Wanschura
  • Publication number: 20170147577
    Abstract: A system and method for extracting information from at least one document in at least one set of documents, the method comprising: generating, using at least one ranking and/or matching processor, at least one ranked possible match list comprising at least one possible match for at least one target entry on the at least one document, the at least one ranked possible match list based on at least one attribute score and at least one localization score.
    Type: Application
    Filed: September 21, 2012
    Publication date: May 25, 2017
    Inventors: Gennady Lapir, Harry Urbschat, Ralph Meier, Thorsten Wanschura, Johannes Hausmann
  • Publication number: 20170053163
    Abstract: Methods and apparatus delineate grouped together content in documents. Void and unvoid pixels in document images get clustered together. Execution of a histogram and autocorrelation function, including peak detection, against the unvoid clusters reveals the content. Techniques for clustering include iteratively transforming an original image into secondary images with a Haar wavelet transformation, for example. Clustering begins on a lowest image plane and advances to a next highest plane until all void and unvoid pixels in the images are grouped. Void clusters at lower levels remain void clusters at higher levels, thus only unvoid clusters of pixels require processing at higher levels thereby optimizing processing. Imaging devices with scanners define suitable hardware for transformation of the document into images and processors with executable code cluster together pixels to delineate content. Further processing includes executing OCR or other routines post void/unvoid analysis.
    Type: Application
    Filed: August 17, 2015
    Publication date: February 23, 2017
    Inventors: Ralph Meier, Johannes Hausmann, Harry Urbschat, Thorsten Wanschura
  • Publication number: 20160321499
    Abstract: Storage volumes with historic values from document processing are used to create learn-sets for extraction engine training. Text and locations of the text in documents are obtained, such as with OCR routines or by retrieval from storage. The values of the storage volumes get matched to the text and the locations of the text are associated back to the values. Both the values and their locations are provided to extraction engine(s) for training. The form of the values and text may or may not match exactly. A degree of fuzziness matching occurs depending upon a type of value in storage. Types can be provided as user input, defined by entry in a database, or determined heuristically through characters found in the values and text. Merging of character fragments defines still other embodiments as does arranging executable code into modules for hardware, such as imaging devices.
    Type: Application
    Filed: April 28, 2015
    Publication date: November 3, 2016
    Inventors: Ralph Meier, Johannes Hausmann, Harry Urbschat, Thorsten Wanschura
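
Illustrative sketch: several abstracts above (for example, patent number 10922347) describe a hierarchical dictionary built from three hash maps. The Python below is only a rough reading of the steps named in that abstract, not the patented implementation. The class name, the method names, the use of CRC32 as the hash function, the word-count clearing trigger, and the default thresholds are all assumptions made for illustration. Python dictionaries are unbounded, so the "substantially larger word storage capacity" of each successive map is not modeled, and hash collisions are ignored.

```python
import zlib


class HierarchicalDictionary:
    """Hypothetical three-level word store loosely following the abstract."""

    def __init__(self, promote_above=2, clear_every=4):
        self.first = {}    # short-lived map, cleared periodically
        self.second = {}   # intermediate map holding word frequencies
        self.third = {}    # long-term map for promoted words
        self.promote_above = promote_above  # assumed "predetermined value"
        self.clear_every = clear_every      # assumed "predetermined period"
        self._seen = 0

    @staticmethod
    def _hash(word):
        # Assumption: CRC32 stands in for whatever hash the patent uses.
        return zlib.crc32(word.encode("utf-8"))

    def add(self, word):
        key = self._hash(word)
        if key in self.third:
            # Already promoted: keep counting in long-term storage.
            self.third[key][1] += 1
        else:
            # Store the word in both the first and the second map.
            for level in (self.first, self.second):
                entry = level.setdefault(key, [word, 0])
                entry[1] += 1
            # Promote once the frequency in the second map exceeds the value.
            if self.second[key][1] > self.promote_above:
                self.third[key] = self.second.pop(key)
                self.first.pop(key, None)
        # Clear the first map after every `clear_every` words.
        self._seen += 1
        if self._seen % self.clear_every == 0:
            self.first.clear()

    def frequency(self, word):
        key = self._hash(word)
        for level in (self.third, self.second):
            if key in level:
                return level[key][1]
        return 0
```

With the assumed defaults, adding "invoice" three times among four input words promotes it to the third map, while a word seen only once stays in the second map.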