Patents by Inventor Albert Maier

Albert Maier has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11860904
    Abstract: Aspects of the present invention disclose a method, computer program product, and system for governing a set of information assets using an information governance system. The method includes one or more processors applying one or more high-level classification assignment rules to one or more information assets of the set of information assets. Furthermore, the method includes one or more processors applying one or more high-level classification propagation rules to the one or more information assets provided with the high-level classification assignments for propagating the respective high-level classification assignments upwards within a containment hierarchy formed by the set of information assets to one or more superordinate information assets of the set of information assets.
    Type: Grant
    Filed: December 1, 2020
    Date of Patent: January 2, 2024
    Assignee: International Business Machines Corporation
    Inventors: Oliver Suhre, Albert Maier, Peter Gerstl, Thomas Schwarz, Michael Baessler
  • Publication number: 20230185786
    Abstract: A computer-implemented method for detecting reference data standardization gaps in data sets is disclosed. The method comprises identifying at least one reference data candidate in a data set, using an index for values of the identified at least one reference data candidate, and determining a difference between an earlier version of a reference data set relating to the reference data candidate and a current version of the reference data set. Furthermore, the method comprises comparing the determined difference with values of the index, and identifying entries in the at least one reference data candidate having a value identical to a value of the difference as reference data standardization gap.
    Type: Application
    Filed: December 13, 2021
    Publication date: June 15, 2023
    Inventors: Albert Maier, Dennis Butterstein, Alexandre Luz Xavier Da Costa, Mike W. Grasselt, Timo Kussmaul, Yevgen Karpenko
  • Publication number: 20230177193
    Abstract: A database system can comprise records, each record including a set of attributes. The database system can further comprise database views, each database view representing a subset of the set of attributes. Data purpose objects indicating a subset of attributes of the set of attributes and a processing purpose can be stored. Each processing purpose can be associated with one or more entities that authorized access to the subset of attributes of the processing purpose. A request for data for a specific processing purpose and a selected view of the database views can be received. A data purpose object that indicates the specific processing purpose can be retrieved. The subset of attributes represented by the selected view can be compared with the subset of the attributes indicated in the retrieved data purpose object. Values of the subset of attributes of the selected view can be provided.
    Type: Application
    Filed: December 8, 2021
    Publication date: June 8, 2023
    Inventors: Lars Bremer, Albert Maier, Mike W. Grasselt, Yannick Saillet, Michael Baessler
  • Publication number: 20230153566
    Abstract: Classification of cell data includes obtaining a target dataset and an artificial intelligence (AI) model trained to identify relationship(s) between cells of a row and classify whether a focus cell of the row is erroneous based on the identified relationship(s), and applying the AI model to the target dataset to identify erroneous cell(s) thereof. The applying includes selecting a row of cells of the target dataset, inputting the selected row of cells to the AI model with an identification of a focus cell, the focus cell to be classified by the AI model, classifying the focus cell to obtain a classification of the focus cell, the classifying identifying whether the focus cell is erroneous, and outputting an indication of the classification of the focus cell.
    Type: Application
    Filed: November 18, 2021
    Publication date: May 18, 2023
    Inventors: Shaikh Shahriar Quader, Omar Al-Shamali, James Miller, Yannick Saillet, Albert Maier, Remus Lazar
  • Patent number: 11593417
    Abstract: In an approach, a processor groups documents into a plurality of groups based on similarity, where: documents of each group have a same document structure; and the document structure is defined by coordinates of text blocks. A processor, for each group of the plurality of groups and for each document of the respective group: retrieves a value of each text block of the respective document in accordance with a document structure of the group; and assigns to each text block of the respective document an attribute that represents the retrieved value of the text block. A processor assigns a first document of the documents to an entity of a database that matches the first document based on the group of text block values and the assigned attributes of the document.
    Type: Grant
    Filed: January 21, 2021
    Date of Patent: February 28, 2023
    Assignee: International Business Machines Corporation
    Inventors: Thomas Schwarz, Albert Maier, Michael Baessler, Oliver Suhre, Peter Gerstl, Werner Schuetz, Jonathan Roesner, Mariya Chkalova
  • Publication number: 20220391848
    Abstract: Embodiments of the present invention provide methods, computer program products, and systems. Embodiments of the present invention can condense a hierarchy in a data governance system, wherein the hierarchy comprises a root node and at least one child node comprising related sub-trees, by determining, for a parent node in the hierarchy of the governance system, governance terms and respective assignment relationships from a plurality of information assets, determining usage of the governance term in at least one of a plurality of governance rules, and marking a governance term of the plurality of governance terms for elimination based on the determined assignment relationships and the determined usage of the governance term in the plurality of governance rules. Embodiments of the present invention can then delete the governance term from the hierarchy if the governance term is marked for elimination.
    Type: Application
    Filed: June 7, 2021
    Publication date: December 8, 2022
    Inventors: Albert Maier, Mike W. Grasselt, Yannick Saillet, Lars Bremer, Michael Baessler
  • Patent number: 11494469
    Abstract: The present disclosure relates to assigning an ownership of a first component of a communication system. A processor may identify a second component of the communication system having a relationship to the first component. The processor may identify an owning entity of the second component, the owning entity may have ownership of the second component, and the communication system may include the owning entity. The processor may generate an owner information, the owner information may relate the owning entity of the second component to the first component. The processor may provide the owner information to the communication system. In response to the providing of the owner information, the processor may receive a command for assigning the ownership of the first component to the owning entity. In response to receiving the command, the processor may assign the ownership of the first component to the owning entity.
    Type: Grant
    Filed: April 29, 2020
    Date of Patent: November 8, 2022
    Assignee: International Business Machines Corporation
    Inventors: Albert Maier, Thomas Schwarz, Peter Gerstl, Oliver Suhre
  • Patent number: 11487770
    Abstract: A computer implemented method is used for sorting data elements of a given set. The method includes performing an evaluation of a first type of usage of each data element. The method includes determining a set of data element candidates dependent on the evaluation of the first type of usage. The method includes performing an evaluation of a second type of usage of each data element of the set of data element candidates. The method includes sorting the data elements of the set of data element candidates dependent on the evaluation of the second type of usage of each data element of the set of data element candidates. The method includes providing the sorted data elements of the set of data element candidates, and in response, receiving a request for a data processing based on the provided sorted data elements of the set of data element candidates.
    Type: Grant
    Filed: May 18, 2020
    Date of Patent: November 1, 2022
    Assignee: International Business Machines Corporation
    Inventors: Albert Maier, Mike W. Grasselt, Yannick Saillet, Lars Bremer, Michael Baessler
  • Publication number: 20220229863
    Abstract: In an approach, a processor groups documents into a plurality of groups based on similarity, where: documents of each group have a same document structure; and the document structure is defined by coordinates of text blocks. A processor, for each group of the plurality of groups and for each document of the respective group: retrieves a value of each text block of the respective document in accordance with a document structure of the group; and assigns to each text block of the respective document an attribute that represents the retrieved value of the text block. A processor assigns a first document of the documents to an entity of a database that matches the first document based on the group of text block values and the assigned attributes of the document.
    Type: Application
    Filed: January 21, 2021
    Publication date: July 21, 2022
    Inventors: Thomas Schwarz, Albert Maier, Michael Baessler, Oliver Suhre, Peter Gerstl, Werner Schuetz, Jonathan Roesner, Mariya Chkalova
  • Patent number: 11366843
    Abstract: The invention relates to a computer-implemented method for classifying a set of data values. For each of the data values of the set of data values, a set of one or more terms associated with the respective data value is determined using one or more first knowledge bases. A set of common terms is determined. The set of common terms comprises terms present in more than one of the sets of terms. For each of the common terms, a number of hits for a lookup query against one or more second knowledge data bases is determined. One or more common terms of the set of common terms with the smallest number of hits are determined and a result is returned. The result comprises the one or more common terms with the smallest number of hits as one or more candidate classes for classifying the set of data values.
    Type: Grant
    Filed: April 23, 2019
    Date of Patent: June 21, 2022
    Assignee: International Business Machines Corporation
    Inventors: Albert Maier, Martin Oberhofer, Yannick Saillet
  • Publication number: 20220188512
    Abstract: A system may receive a data glossary comprising a list of terms. The system may then measure a usage dimension for a set of the terms from the list of terms. The system may select a candidate term from the set based on the usage dimension and perform a maintenance action on the candidate term.
    Type: Application
    Filed: December 13, 2020
    Publication date: June 16, 2022
    Inventors: Albert Maier, Michael Baessler, Peter Gerstl, Oliver Suhre, Thomas Schwarz
  • Publication number: 20220171793
    Abstract: Aspects of the present invention disclose a method, computer program product, and system for governing a set of information assets using an information governance system. The method includes one or more processors applying one or more high-level classification assignment rules to one or more information assets of the set of information assets. Furthermore, the method includes one or more processors applying one or more high-level classification propagation rules to the one or more information assets provided with the high-level classification assignments for propagating the respective high-level classification assignments upwards within a containment hierarchy formed by the set of information assets to one or more superordinate information assets of the set of information assets.
    Type: Application
    Filed: December 1, 2020
    Publication date: June 2, 2022
    Inventors: Oliver Suhre, Albert Maier, Peter Gerstl, Thomas Schwarz, Michael Baessler
  • Publication number: 20220123935
    Abstract: The exemplary embodiments disclose a method, a computer program product, and a computer system for protecting sensitive information. The exemplary embodiments may include using an inverted text index for evaluating one or more statistical measures of an index token of the inverted text index, using the one or more statistical measures for selecting a set of candidate tokens, extracting metadata from the inverted text index, associating the set of candidate tokens with respective token metadata, tokenizing at least one document resulting in one or more document tokens, comparing the one or more document tokens with the set of candidate tokens, selecting a set of document tokens to be masked, selecting at least part of the set of document tokens that comprises sensitive information according to the associated token metadata, masking the at least part of the set of document tokens, and providing one or more masked documents.
    Type: Application
    Filed: October 19, 2020
    Publication date: April 21, 2022
    Inventors: Michael Baessler, Albert Maier, Mike W. Grasselt, Yannick Saillet, Lars Bremer
  • Patent number: 11308128
    Abstract: A method, system and computer program product for classifying a data collection of data of a predefined domain. A hierarchical representation scheme describing terms of the domain and one or more relationships between the terms is provided. At least one classifier may be applied on the data collection, resulting in a set of term assignments. Each term assignment of the term assignments associates a term candidate with a respective confidence value to the collection or to one or more data items of the collection. At least one of the term assignments may be refined based on the representation scheme and the set of term assignments.
    Type: Grant
    Filed: December 11, 2017
    Date of Patent: April 19, 2022
    Assignee: International Business Machines Corporation
    Inventors: Peter Gerstl, Robert Kern, Albert Maier, Thomas Schwarz, Oliver Suhre
  • Publication number: 20220114189
    Abstract: Embodiments of the present invention provide methods, computer program products, and systems. Embodiments of the present invention can extract structured information for unstructured document analysis by identifying tables and columns of a database that correspond to business terms of a business glossary. Embodiments of the present invention can then receive a specification of business terms of interest for recognizing in an unstructured document. Embodiments of the present invention can then generate an analysis module based on the identified tables and columns that enables identifying or recognizing attribute values of attributes of the tables and columns. Embodiments of the present invention can then use the analysis module for automatic extraction of values of at least part of the attributes from the unstructured document based on the specification of business terms of interest.
    Type: Application
    Filed: October 14, 2020
    Publication date: April 14, 2022
    Inventors: Michael Baessler, Albert Maier, Dirk Jahn, Thomas Hampp-Bahnmueller
  • Publication number: 20220100899
    Abstract: In an approach, a processor receives a request of a document. A processor identifies a set of datasets comprising a sensitive dataset, the set of datasets being interrelated in accordance with a relational model. A processor extracts attribute values of the document. A processor determines that a set of one or more attribute values of the extracted attribute values is in the set of datasets, the set of attribute values being values of a set of attributes. A processor determines that one or more entities of the sensitive dataset can be identified based on relations of the relational model between the set of attributes, where at least part of attribute values of the one or more entities comprises sensitive information. A processor, responsive to determining that the one or more entities can be identified, masks at least part of the set of one or more attribute values in the document.
    Type: Application
    Filed: September 25, 2020
    Publication date: March 31, 2022
    Inventors: Yannick Saillet, Albert Maier, Mike W. Grasselt, Michael Baessler, Lars Bremer
  • Publication number: 20210357183
    Abstract: A computer implemented method is used for sorting data elements of a given set. The method includes performing an evaluation of a first type of usage of each data element. The method includes determining a set of data element candidates dependent on the evaluation of the first type of usage. The method includes performing an evaluation of a second type of usage of each data element of the set of data element candidates. The method includes sorting the data elements of the set of data element candidates dependent on the evaluation of the second type of usage of each data element of the set of data element candidates. The method includes providing the sorted data elements of the set of data element candidates, and in response, receiving a request for a data processing based on the provided sorted data elements of the set of data element candidates.
    Type: Application
    Filed: May 18, 2020
    Publication date: November 18, 2021
    Inventors: Albert Maier, Mike W. Grasselt, Yannick Saillet, Lars Bremer, Michael Baessler
  • Publication number: 20210342421
    Abstract: The present disclosure relates to assigning an ownership of a first component of a communication system. A processor may identify a second component of the communication system having a relationship to the first component. The processor may identify an owning entity of the second component, the owning entity may have ownership of the second component, and the communication system may include the owning entity. The processor may generate an owner information, the owner information may relate the owning entity of the second component to the first component. The processor may provide the owner information to the communication system. In response to the providing of the owner information, the processor may receive a command for assigning the ownership of the first component to the owning entity. In response to receiving the command, the processor may assign the ownership of the first component to the owning entity.
    Type: Application
    Filed: April 29, 2020
    Publication date: November 4, 2021
    Inventors: Albert Maier, Thomas Schwarz, Peter Gerstl, Oliver Suhre
  • Patent number: 11165875
    Abstract: The present disclosure relates to a method for accessing data of one or more data sources using a discovery engine. The method comprises: determining a discovery space content from initial metadata of a data source indicated in a data exploration request. The discovery space content may be rendered. The rendered content may be used for determining a set of one or more tasks for generating further metadata from at least part of the data of the data source, wherein the set of tasks comprises a combination of API calls. The API calls may be issued to the discovery engine. Discovery results of the issued API calls may be received. A data discovery status may be evaluated using the discovery results. The discovery space content may be augmented using the further metadata and the data discovery status. The augmented discovery space content may be rendered for receiving further API calls.
    Type: Grant
    Filed: September 16, 2020
    Date of Patent: November 2, 2021
    Assignee: International Business Machines Corporation
    Inventors: Albert Maier, Bernhard Mitschang, Peter Gerstl, Kunjavihari Madhav Kashalikar
  • Patent number: 11106820
    Abstract: The present disclosure relates to a method for data anonymization of a database system. The method comprises: determining if a first dataset and second dataset of the database system have a relationship indicative of an entity having values in the two datasets. A request may be received from a user for at least one of the first and second datasets. In case the first dataset and second dataset have the relationship, at least one of the first and second datasets may be modified such that the indication of the entity is not accessible to the user. And the requested dataset may be provided.
    Type: Grant
    Filed: March 19, 2018
    Date of Patent: August 31, 2021
    Assignee: International Business Machines Corporation
    Inventors: Martin Oberhofer, Albert Maier, Yannick Saillet
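
As an illustration only, the data anonymization flow summarized in the abstract of patent 11106820 could be sketched roughly as follows. This is not the patented implementation: the function names, the row representation, the "***" mask, and the rule that two datasets are "related" when a same-named column shares at least one value are all assumptions made for this example.

```python
from typing import Dict, List, Optional

Row = Dict[str, str]


def shared_entity_column(first: List[Row], second: List[Row]) -> Optional[str]:
    """Return a column whose values overlap in both datasets, if any.

    Assumption: overlapping values in a same-named column stand in for
    "a relationship indicative of an entity having values in the two
    datasets". A real system would likely rely on richer metadata.
    """
    if not first or not second:
        return None
    common_columns = set(first[0]) & set(second[0])
    for column in sorted(common_columns):
        first_values = {row.get(column) for row in first}
        second_values = {row.get(column) for row in second}
        if first_values & second_values:
            return column
    return None


def provide_dataset(requested: List[Row], other: List[Row]) -> List[Row]:
    """Return a copy of the requested dataset, masked if the two are related."""
    column = shared_entity_column(requested, other)
    if column is None:
        # No relationship found: provide the data unchanged.
        return [dict(row) for row in requested]
    # Relationship found: hide the values that would link the datasets.
    return [{**row, column: "***"} for row in requested]


# Hypothetical sample data, invented for this sketch.
patients = [
    {"person_id": "p1", "diagnosis": "flu"},
    {"person_id": "p2", "diagnosis": "cold"},
]
addresses = [
    {"person_id": "p1", "city": "Berlin"},
    {"person_id": "p3", "city": "Ulm"},
]

masked = provide_dataset(patients, addresses)
```

In this sketch the linking column is masked in the requested dataset only; the abstract leaves open which of the two datasets is modified, so masking the other dataset instead would be an equally valid reading.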