Patents by Inventor Thomas A. P. Hampp-Bahnmueller

Thomas A. P. Hampp-Bahnmueller has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11593405
    Abstract: Provided is a technique for matching different user representations of a person in a plurality of computer systems. The technique includes collecting information sets about user representations from a plurality of computer systems; normalizing the information sets to a unified format; grouping the information sets in the unified format into indexing buckets based on a user name using a non-phonetic algorithm; determining a similarity score for each pair of information sets in each of the indexing buckets; classifying each information set pair into a set of classes based on the similarity scores, wherein the set of classes comprises at least matches and non-matches; and using a data structure for merging information of information set pairs classified as matches.
    Type: Grant
    Filed: April 21, 2015
    Date of Patent: February 28, 2023
    Assignee: International Business Machines Corporation
    Inventors: Lars Bremer, Thomas A. P. Hampp-Bahnmueller, Markus Lorch, Pavlo Petrenko, Sebastian B. Schmid
  • Patent number: 11239858
    Abstract: A method for determining an encoding used for a sequence of bytes may be provided. The method comprises providing a set of candidate code pages and transforming them into different groups of sequences of bytes, wherein each group of sequences of bytes corresponds to one of the candidate code pages. Thereby each code point is transformed by applying a transformation from one of the candidate code pages to a reference code point value relating to a reference encoding for each code point. The method comprises further separating each of the transformed sequences of bytes into groups of tokens, wherein each group of tokens relates to one candidate code page, and providing an index relating to a text corpus. Furthermore, the method comprises selecting a code page from the set of candidate code pages at least partially based on how many tokens are found in the index.
    Type: Grant
    Filed: May 27, 2016
    Date of Patent: February 1, 2022
    Assignee: International Business Machines Corporation
    Inventors: Michael Baessler, Thomas A. P. Hampp-Bahnmueller, Peng Hui Jiang
  • Patent number: 10891591
    Abstract: A computer processor may extract identity information from a document. The identity information may include at least one custodian identity attribute. After extracting the identity information, the computer processor may determine that the identity information is associated with a specific custodian. The computer processor may then search for the custodian identity attribute in a custodian directory to determine whether the custodian directory contains an entry for the custodian. If the custodian is not in the custodian directory, the computer processor may create a new entry in the custodian directory for the custodian and store the extracted identity information in the new entry.
    Type: Grant
    Filed: April 17, 2018
    Date of Patent: January 12, 2021
    Assignee: International Business Machines Corporation
    Inventors: Michael Baessler, Jochen Doerre, Thomas A. P. Hampp-Bahnmueller, Philipp Hoffmann, Juergen Maletz, Daniel Pittner, Werner Schollenberger
  • Patent number: 10394852
    Abstract: Provided is a technique for matching different user representations of a person in a plurality of computer systems. The technique includes collecting information sets about user representations from a plurality of computer systems; normalizing the information sets to a unified format; grouping the information sets in the unified format into indexing buckets based on a user name using a non-phonetic algorithm; determining a similarity score for each pair of information sets in each of the indexing buckets; classifying each information set pair into a set of classes based on the similarity scores, wherein the set of classes comprises at least matches and non-matches; and using a data structure for merging information of information set pairs classified as matches.
    Type: Grant
    Filed: March 11, 2016
    Date of Patent: August 27, 2019
    Assignee: International Business Machines Corporation
    Inventors: Lars Bremer, Thomas A. P. Hampp-Bahnmueller, Markus Lorch, Pavlo Petrenko, Sebastian B. Schmid
  • Publication number: 20180232698
    Abstract: A computer processor may extract identity information from a document. The identity information may include at least one custodian identity attribute. After extracting the identity information, the computer processor may determine that the identity information is associated with a specific custodian. The computer processor may then search for the custodian identity attribute in a custodian directory to determine whether the custodian directory contains an entry for the custodian. If the custodian is not in the custodian directory, the computer processor may create a new entry in the custodian directory for the custodian and store the extracted identity information in the new entry.
    Type: Application
    Filed: April 17, 2018
    Publication date: August 16, 2018
    Inventors: Michael Baessler, Jochen Doerre, Thomas A. P. Hampp-Bahnmueller, Philipp Hoffmann, Juergen Maletz, Daniel Pittner, Werner Schollenberger
  • Patent number: 10037435
    Abstract: Providing an encrypted search index for performing searches on encrypted documents, the method comprising: (i) providing a set of documents, the documents comprising a plurality of unencrypted phrases; (ii) providing a master key; (iii) providing, based on the master key, for each phrase a set of encryption keys comprising one or more encryption keys; (iv) selecting, for each phrase, one encryption key of the set of encryption keys; (v) encrypting each phrase with the selected encryption key; and (vi) building an index based on the encrypted phrases, the index comprising information regarding which encrypted phrase is comprised within a certain document.
    Type: Grant
    Filed: August 11, 2016
    Date of Patent: July 31, 2018
    Assignee: International Business Machines Corporation
    Inventors: Michael Baessler, Jochen Doerre, Thomas A. P. Hampp-Bahnmueller, Philipp Hoffmann, Daniel Pittner, Werner Schollenberger
  • Patent number: 10013673
    Abstract: A computer processor may extract identity information from a document. The identity information may include at least one custodian identity attribute. After extracting the identity information, the computer processor may determine that the identity information is associated with a specific custodian. The computer processor may then search for the custodian identity attribute in a custodian directory to determine whether the custodian directory contains an entry for the custodian. If the custodian is not in the custodian directory, the computer processor may create a new entry in the custodian directory for the custodian and store the extracted identity information in the new entry.
    Type: Grant
    Filed: May 16, 2016
    Date of Patent: July 3, 2018
    Assignee: International Business Machines Corporation
    Inventors: Michael Baessler, Jochen Doerre, Thomas A. P. Hampp-Bahnmueller, Philipp Hoffmann, Juergen Maletz, Daniel Pittner, Werner Schollenberger
  • Patent number: 10007894
    Abstract: A computer processor may extract identity information from a document. The identity information may include at least one custodian identity attribute. After extracting the identity information, the computer processor may determine that the identity information is associated with a specific custodian. The computer processor may then search for the custodian identity attribute in a custodian directory to determine whether the custodian directory contains an entry for the custodian. If the custodian is not in the custodian directory, the computer processor may create a new entry in the custodian directory for the custodian and store the extracted identity information in the new entry.
    Type: Grant
    Filed: July 22, 2015
    Date of Patent: June 26, 2018
    Assignee: International Business Machines Corporation
    Inventors: Michael Baessler, Jochen Doerre, Thomas A. P. Hampp-Bahnmueller, Philipp Hoffmann, Juergen Maletz, Daniel Pittner, Werner Schollenberger
  • Patent number: 9953023
    Abstract: A computer-implemented method, carried out by one or more processors, for consolidating an index entry of a dictionary. In an embodiment, the method comprises the steps of receiving, by one or more processors, a set of parameters, wherein the set of parameters indicates at least a length of prefix and a length of hash value; receiving, by one or more processors, a first term for entry into an index; converting, by one or more processors, the first term for entry into the index according to the set of parameters; and responsive to determining the converted first term is not present in the index, storing, by one or more processors, the first converted term into the index.
    Type: Grant
    Filed: November 4, 2014
    Date of Patent: April 24, 2018
    Assignee: International Business Machines Corporation
    Inventors: Michael Baessler, Jochen Doerre, Thomas A. P. Hampp-Bahnmueller, Philipp Hoffmann, Markus Lorch, Juergen Maletz, Daniel Pittner
  • Patent number: 9940322
    Abstract: A computer-implemented method, carried out by one or more processors, for consolidating an index entry of a dictionary. In an embodiment, the method comprises the steps of receiving, by one or more processors, a set of parameters, wherein the set of parameters indicates at least a length of prefix and a length of hash value; receiving, by one or more processors, a first term for entry into an index; converting, by one or more processors, the first term for entry into the index according to the set of parameters; and responsive to determining the converted first term is not present in the index, storing, by one or more processors, the first converted term into the index.
    Type: Grant
    Filed: March 31, 2014
    Date of Patent: April 10, 2018
    Assignee: International Business Machines Corporation
    Inventors: Michael Baessler, Jochen Doerre, Thomas A. P. Hampp-Bahnmueller, Philipp Hoffmann, Markus Lorch, Juergen Maletz, Daniel Pittner
  • Patent number: 9898516
    Abstract: A computer manages methods for determining accurate document transformation by rendering the source document into a non-rasterized format, where the non-rasterized format is a rendered source document. The computer renders the target document into a non-rasterized format, where the non-rasterized format is a rendered target document. The computer compares one or more aspects of the rendered source document to corresponding one or more aspects of the rendered target document. The computer determines, based at least in part on the compared one or more aspects, whether or not the source document was accurately transformed to the target document.
    Type: Grant
    Filed: September 30, 2016
    Date of Patent: February 20, 2018
    Assignee: International Business Machines Corporation
    Inventors: Michael Baessler, Thomas A. P. Hampp-Bahnmueller, Philipp Hoffmann, Markus Lorch, Juergen Maletz, Daniel Pittner, Werner Schollenberger, Dirk Seider
  • Patent number: 9722627
    Abstract: A method for determining an encoding used for a sequence of bytes may be provided. The method comprises providing a set of candidate code pages and transforming them into different groups of sequences of bytes, wherein each group of sequences of bytes corresponds to one of the candidate code pages. Thereby each code point is transformed by applying a transformation from one of the candidate code pages to a reference code point value relating to a reference encoding for each code point. The method comprises further separating each of the transformed sequences of bytes into groups of tokens, wherein each group of tokens relates to one candidate code page, and providing an index relating to a text corpus. Furthermore, the method comprises selecting a code page from the set of candidate code pages at least partially based on how many tokens are found in the index.
    Type: Grant
    Filed: August 11, 2015
    Date of Patent: August 1, 2017
    Assignee: International Business Machines Corporation
    Inventors: Michael Baessler, Thomas A. P. Hampp-Bahnmueller, Peng Hui Jiang
  • Patent number: 9633219
    Abstract: Providing an encrypted search index for performing searches on encrypted documents, the method comprising: (i) providing a set of documents, the documents comprising a plurality of unencrypted phrases; (ii) providing a master key; (iii) providing, based on the master key, for each phrase a set of encryption keys comprising one or more encryption keys; (iv) selecting, for each phrase, one encryption key of the set of encryption keys; (v) encrypting each phrase with the selected encryption key; and (vi) building an index based on the encrypted phrases, the index comprising information regarding which encrypted phrase is comprised within a certain document.
    Type: Grant
    Filed: August 26, 2015
    Date of Patent: April 25, 2017
    Assignee: International Business Machines Corporation
    Inventors: Michael Baessler, Jochen Doerre, Thomas A. P. Hampp-Bahnmueller, Philipp Hoffmann, Daniel Pittner, Werner Schollenberger
  • Publication number: 20170060940
    Abstract: Providing an encrypted search index for performing searches on encrypted documents, the method comprising: (i) providing a set of documents, the documents comprising a plurality of unencrypted phrases; (ii) providing a master key; (iii) providing, based on the master key, for each phrase a set of encryption keys comprising one or more encryption keys; (iv) selecting, for each phrase, one encryption key of the set of encryption keys; (v) encrypting each phrase with the selected encryption key; and (vi) building an index based on the encrypted phrases, the index comprising information regarding which encrypted phrase is comprised within a certain document.
    Type: Application
    Filed: August 11, 2016
    Publication date: March 2, 2017
    Inventors: Michael Baessler, Jochen Doerre, Thomas A. P. Hampp-Bahnmueller, Philipp Hoffmann, Daniel Pittner, Werner Schollenberger
  • Publication number: 20170061151
    Abstract: Providing an encrypted search index for performing searches on encrypted documents, the method comprising: (i) providing a set of documents, the documents comprising a plurality of unencrypted phrases; (ii) providing a master key; (iii) providing, based on the master key, for each phrase a set of encryption keys comprising one or more encryption keys; (iv) selecting, for each phrase, one encryption key of the set of encryption keys; (v) encrypting each phrase with the selected encryption key; and (vi) building an index based on the encrypted phrases, the index comprising information regarding which encrypted phrase is comprised within a certain document.
    Type: Application
    Filed: August 26, 2015
    Publication date: March 2, 2017
    Inventors: Michael Baessler, Jochen Doerre, Thomas A. P. Hampp-Bahnmueller, Philipp Hoffmann, Daniel Pittner, Werner Schollenberger
  • Publication number: 20170048069
    Abstract: A method for determining an encoding used for a sequence of bytes may be provided. The method comprises providing a set of candidate code pages and transforming them into different groups of sequences of bytes, wherein each group of sequences of bytes corresponds to one of the candidate code pages. Thereby each code point is transformed by applying a transformation from one of the candidate code pages to a reference code point value relating to a reference encoding for each code point. The method comprises further separating each of the transformed sequences of bytes into groups of tokens, wherein each group of tokens relates to one candidate code page, and providing an index relating to a text corpus. Furthermore, the method comprises selecting a code page from the set of candidate code pages at least partially based on how many tokens are found in the index.
    Type: Application
    Filed: August 11, 2015
    Publication date: February 16, 2017
    Inventors: Michael Baessler, Thomas A. P. Hampp-Bahnmueller, Peng Hui Jiang
  • Publication number: 20170047943
    Abstract: A method for determining an encoding used for a sequence of bytes may be provided. The method comprises providing a set of candidate code pages and transforming them into different groups of sequences of bytes, wherein each group of sequences of bytes corresponds to one of the candidate code pages. Thereby each code point is transformed by applying a transformation from one of the candidate code pages to a reference code point value relating to a reference encoding for each code point. The method comprises further separating each of the transformed sequences of bytes into groups of tokens, wherein each group of tokens relates to one candidate code page, and providing an index relating to a text corpus. Furthermore, the method comprises selecting a code page from the set of candidate code pages at least partially based on how many tokens are found in the index.
    Type: Application
    Filed: May 27, 2016
    Publication date: February 16, 2017
    Inventors: Michael Baessler, Thomas A. P. Hampp-Bahnmueller, Peng Hui Jiang
  • Publication number: 20170024696
    Abstract: A computer processor may extract identity information from a document. The identity information may include at least one custodian identity attribute. After extracting the identity information, the computer processor may determine that the identity information is associated with a specific custodian. The computer processor may then search for the custodian identity attribute in a custodian directory to determine whether the custodian directory contains an entry for the custodian. If the custodian is not in the custodian directory, the computer processor may create a new entry in the custodian directory for the custodian and store the extracted identity information in the new entry.
    Type: Application
    Filed: July 22, 2015
    Publication date: January 26, 2017
    Inventors: Michael Baessler, Jochen Doerre, Thomas A. P. Hampp-Bahnmueller, Philipp Hoffmann, Juergen Maletz, Daniel Pittner, Werner Schollenberger
  • Publication number: 20170024697
    Abstract: A computer processor may extract identity information from a document. The identity information may include at least one custodian identity attribute. After extracting the identity information, the computer processor may determine that the identity information is associated with a specific custodian. The computer processor may then search for the custodian identity attribute in a custodian directory to determine whether the custodian directory contains an entry for the custodian. If the custodian is not in the custodian directory, the computer processor may create a new entry in the custodian directory for the custodian and store the extracted identity information in the new entry.
    Type: Application
    Filed: May 16, 2016
    Publication date: January 26, 2017
    Inventors: Michael Baessler, Jochen Doerre, Thomas A. P. Hampp-Bahnmueller, Philipp Hoffmann, Juergen Maletz, Daniel Pittner, Werner Schollenberger
  • Publication number: 20170017704
    Abstract: A computer manages methods for determining accurate document transformation by rendering the source document into a non-rasterized format, where the non-rasterized format is a rendered source document. The computer renders the target document into a non-rasterized format, where the non-rasterized format is a rendered target document. The computer compares one or more aspects of the rendered source document to corresponding one or more aspects of the rendered target document. The computer determines, based at least in part on the compared one or more aspects, whether or not the source document was accurately transformed to the target document.
    Type: Application
    Filed: September 30, 2016
    Publication date: January 19, 2017
    Inventors: Michael Baessler, Thomas A. P. Hampp-Bahnmueller, Philipp Hoffmann, Markus Lorch, Juergen Maletz, Daniel Pittner, Werner Schollenberger, Dirk Seider
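
Illustrative sketch (encoding detection, cf. patent numbers 11239858 and 9722627): the abstracts describe decoding a byte sequence with each candidate code page, splitting the result into tokens, and choosing the code page whose tokens are found most often in an index built from a text corpus. The Python below is only a rough reading of that description, not the patented implementation. The function name detect_code_page, the use of a plain set as the corpus index, and the regular-expression tokenizer are all assumptions made for illustration.

```python
import re


def detect_code_page(data, candidates, corpus_index):
    """Return the candidate code page whose decoding yields the most known tokens.

    data:         the byte sequence with unknown encoding
    candidates:   list of Python codec names (hypothetical candidate code pages)
    corpus_index: set of lowercase words standing in for the corpus index
    """
    best_page, best_hits = None, -1
    for page in candidates:
        # Map every byte sequence to a reference encoding (a Unicode str here).
        text = data.decode(page, errors="replace")
        # Separate the transformed text into tokens (assumed: runs of word characters).
        tokens = re.findall(r"\w+", text.lower())
        # Count how many tokens are found in the corpus index.
        hits = sum(1 for token in tokens if token in corpus_index)
        if hits > best_hits:
            best_page, best_hits = page, hits
    return best_page


# Usage: German text encoded as Windows-1252, checked against three candidates.
index = {"grüße", "aus", "münchen"}
raw = "Grüße aus München".encode("cp1252")
print(detect_code_page(raw, ["cp437", "cp1251", "cp1252"], index))
```

On ties the first candidate listed wins, which is a simplification; a fuller version would likely weight tokens or fall back to other evidence.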
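
Illustrative sketch (dictionary index consolidation, cf. patent numbers 9953023 and 9940322): the abstracts describe converting a term according to a prefix length and a hash-value length, and storing the converted term only if it is not yet in the index. The abstracts do not say how prefix and hash are combined, so the Python below assumes the converted term is the term's prefix followed by a truncated SHA-1 hex digest. That choice, and the names convert_term and add_term, are hypothetical.

```python
import hashlib


def convert_term(term, prefix_len, hash_len):
    """Convert a term to prefix + truncated hash (assumed combination scheme)."""
    digest = hashlib.sha1(term.encode("utf-8")).hexdigest()
    return term[:prefix_len] + digest[:hash_len]


def add_term(index, term, prefix_len, hash_len):
    """Store the converted term if absent; return True when it was newly stored."""
    converted = convert_term(term, prefix_len, hash_len)
    if converted in index:
        return False
    index.add(converted)
    return True


# Usage: the second insertion of the same term leaves the index unchanged.
index = set()
print(add_term(index, "encyclopedia", prefix_len=3, hash_len=4))
print(add_term(index, "encyclopedia", prefix_len=3, hash_len=4))
```

Shorter hash lengths make distinct terms with the same prefix more likely to collapse into one entry, which is presumably the trade-off the two parameters control.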
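
Illustrative sketch (matching user representations, cf. patent numbers 11593405 and 10394852): the abstracts list the steps normalize, group into indexing buckets by user name with a non-phonetic algorithm, score each pair within a bucket, classify pairs as matches or non-matches, and merge matches using a data structure. The Python below is a minimal sketch of those steps under several assumptions that are not in the source: the bucket key is the first two letters of the normalized name, similarity is difflib's ratio, the threshold is 0.8, and the merging data structure is a union-find array.

```python
from collections import defaultdict
from difflib import SequenceMatcher
from itertools import combinations


def find(parent, i):
    """Union-find lookup with path halving."""
    while parent[i] != i:
        parent[i] = parent[parent[i]]
        i = parent[i]
    return i


def match_users(records, threshold=0.8):
    """Group record positions that appear to describe the same person."""
    # Normalize every record to a unified format (here: trimmed, lowercase name).
    names = [record["name"].strip().lower() for record in records]

    # Group records into indexing buckets with a non-phonetic key (assumed: 2-letter prefix).
    buckets = defaultdict(list)
    for i, name in enumerate(names):
        buckets[name[:2]].append(i)

    # Score each pair inside a bucket and merge the pairs classified as matches.
    parent = list(range(len(records)))
    for members in buckets.values():
        for a, b in combinations(members, 2):
            score = SequenceMatcher(None, names[a], names[b]).ratio()
            if score >= threshold:  # match; anything below counts as a non-match
                parent[find(parent, a)] = find(parent, b)

    groups = defaultdict(list)
    for i in range(len(records)):
        groups[find(parent, i)].append(i)
    return sorted(groups.values())


# Usage: the first two records differ only in spelling and spacing.
users = [{"name": "John Smith"}, {"name": " john smyth "}, {"name": "Maria Lopez"}]
print(match_users(users))
```

Bucketing keeps the pairwise comparison from growing with the square of the full record count, since only records sharing a bucket key are compared.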