Patents by Inventor Martin Oberhofer

Martin Oberhofer has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11418322
    Abstract: An example operation may include one or more of receiving an entry at a blockchain-as-a-service (BaaS) provider, determining whether the entry satisfies a first set of policies, and controlling placement of the entry into a first queue when the first set of policies is satisfied and into a second queue when the first set of policies is not satisfied, wherein the first queue is to store confirmed entries to be submitted for consensus without validation and the second queue is to store pending entries that require validation before consensus.
    Type: Grant
    Filed: March 26, 2019
    Date of Patent: August 16, 2022
    Assignee: International Business Machines Corporation
    Inventors: Dinesh C. Verma, Donna N. Dillenberger, Martin Oberhofer, Namik Hrle
  • Patent number: 11409772
    Abstract: A method includes training a machine learning model using a current set of labeled data points. Each of the data points is multiple data records. A label of a data point indicates a classification of the data point. The training results in a trained machine learning model configured to classify a data point as representing a same entity or different entities. The method includes selecting a subset of unlabeled data points from a current set of unlabeled data points using classification results of the current set of unlabeled data points. The method includes providing the subset of unlabeled data points to a classifier and in response to providing receiving labels of the subset of unlabeled data points. The method may be repeated using the subset of labeled data points in addition to the current set of labeled data points as the current set of labeled data points.
    Type: Grant
    Filed: April 27, 2020
    Date of Patent: August 9, 2022
    Assignee: International Business Machines Corporation
    Inventors: Lars Bremer, Utkarsh Bajpai, Martin Oberhofer, Alexandre Luz Xavier Da Costa
  • Patent number: 11398895
    Abstract: An example operation may include one or more of hosting a first virtual node in a blockchain-as-a-service (BaaS) provider, hosting a second virtual node in the BaaS provider, and controlling transmission of information between the first virtual node and the second virtual node along an internal signal path of the BaaS provider, wherein the information corresponds to a block in a blockchain that includes an entry for the first and second virtual nodes.
    Type: Grant
    Filed: March 26, 2019
    Date of Patent: July 26, 2022
    Assignee: International Business Machines Corporation
    Inventors: Dinesh C. Verma, Donna N. Dillenberger, Martin Oberhofer, Namik Hrle
  • Patent number: 11397855
    Abstract: A method for generating data standardization rules includes receiving a training data set containing tokenized and tagged data values. A set of machine learning models is built using different learning algorithms for identifying tags and tag patterns using the training set. For each data value in a further data set: a tokenization is applied on the data value, resulting in a set of tokens. For each token of the set of tokens one or more tag candidates are determined using a lookup dictionary of tags and tokens and/or at least part of the set of machine learning models, resulting for each token of the set of tokens in a list of possible tags. Unique combinations of the sets of tags of the further data set having highest aggregated confidence values are provided for use as standardization rules.
    Type: Grant
    Filed: December 12, 2017
    Date of Patent: July 26, 2022
    Assignee: International Business Machines Corporation
    Inventors: Yannick Saillet, Martin Oberhofer, Namit Kabra
  • Publication number: 20220222543
    Abstract: A method matches information. A first center node in a first subgraph and a second center node in a second subgraph are identified. Groups of neighboring nodes having the neighboring nodes from both of the subgraphs are identified. A group of the neighboring nodes in the groups has the neighboring nodes with a same node type. A best matching node pair of the neighboring nodes in each cluster is identified. The neighboring nodes in each best matching node pair comprise a first node from the first subgraph and a second node from the second subgraph. Whether the center nodes match is determined based on an overall distance between the center nodes using the first and second center node and the best matching node pairs.
    Type: Application
    Filed: January 13, 2021
    Publication date: July 14, 2022
    Inventors: Mohammad Khatibi, Eitan Daniel Farchi, Martin Oberhofer
  • Patent number: 11366843
    Abstract: The invention relates to a computer-implemented method for classifying a set of data values. For each of the data values of the set of data values, a set of one or more terms associated with the respective data value is determined using one or more first knowledge bases. A set of common terms is determined. The set of common terms comprises terms present in more than one of the sets of terms. For each of the common terms, a number of hits for a lookup query against one or more second knowledge data bases is determined. One or more common terms of the set of common terms with the smallest number of hits are determined and a result is returned. The result comprises the one or more common terms with the smallest number of hits as one or more candidate classes for classifying the set of data values.
    Type: Grant
    Filed: April 23, 2019
    Date of Patent: June 21, 2022
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Albert Maier, Martin Oberhofer, Yannick Saillet
  • Publication number: 20220188299
    Abstract: A computer-implemented method includes determining, by a computer device, composite view rules for combining first data from a first data record and second data from a second data record to create a composite view of an entity in a master data management system; receiving, by the computer device, the first data; receiving, by the computer device, the second data; creating, by the computer device, the composite view from the first data and the second data based on the composite view rules; physically materializing the composite view on a storage device; preserving, by the computer device, the first data record; and preserving, by the computer device, the second data record.
    Type: Application
    Filed: March 3, 2022
    Publication date: June 16, 2022
    Inventors: Martin OBERHOFER, Jason Allen WOODS, Rin DAVIS, Soma Shekar NAGANNA, Pankuj CHACHRA
  • Publication number: 20220188344
    Abstract: The present disclosure relates to a method, computer program product and system. The method may comprise providing a first graph being an instance of a first ontology. Sample values of a plurality of concept attributes may be collected from the first graph. The sample values may be clustered into one or more clusters based on content and/or format of the sample values. A cluster of the clusters that contains sample values representing different concept attributes may be identified. An additional concept and associated set of relations representing the concept attribute values of the cluster may be determined and the first ontology may be updated using the additional concept and associated set of relations.
    Type: Application
    Filed: December 14, 2020
    Publication date: June 16, 2022
    Inventors: Thuany Karoline Stuart, Martin Oberhofer, Lars Bremer, Hemanth Kumar Babu
  • Patent number: 11354282
    Abstract: A computer implemented method for classifying at least one source dataset of a computer system. The method may include providing a plurality of associated reference tables organized and associated in accordance with a reference storage model in the computer system. The method may also include calculating, by a data classifier application of the computer system, a first similarity score between the source dataset and a first reference table of the reference tables based on common attributes in the source dataset and a join of the first reference table with at least one further reference table of the reference tables having a relationship with the first reference table. The method may further include classifying, by the data classifier application, the source dataset by determining using at least the calculated first similarity score whether the source dataset is organized as the first reference table in accordance to the reference storage model.
    Type: Grant
    Filed: January 10, 2020
    Date of Patent: June 7, 2022
    Assignee: International Business Machines Corporation
    Inventors: Martin Oberhofer, Adapala S. Reddy, Yannick Saillet, Jens Seifert
  • Patent number: 11334757
    Abstract: Methods and apparatus, including computer program products, implementing and using techniques for processing suspect duplicate records in a master data management system. A master data management module identifies two or more suspect duplicate records in the master data management system based on scores. A matching engine classifies the two or more suspect duplicate records, by comparing the scores against threshold values, into one of: a match, a non-match, and a possible match. The master data management module re-classifies the suspect duplicate records and adjusts the threshold values of the matching engine for classification of future records, in response to receiving, by a data stewardship client, a user input indicating an incorrect classification of the suspect duplicate records.
    Type: Grant
    Filed: June 11, 2018
    Date of Patent: May 17, 2022
    Assignee: International Business Machines Corporation
    Inventors: Sushain Pandit, Martin Oberhofer, Joerg Rehr, Ivan M. Milman
  • Publication number: 20220138512
    Abstract: Methods, computer program products and/or systems are provided that perform the following operations: obtaining a first graph comprising first nodes representing first entities and first edges representing relationships between first entities, the first nodes being associated with first entity attributes descriptive of the first entities represented by the first nodes, the first edges being associated with first edge attributes descriptive of the relationships represented by the first edges; determining a first subgraph for a certain node of the first nodes of the first graph, the first subgraph including the certain node and at least one neighboring node of the certain node; and determining a data quality issue regarding the certain node based, at least in part, on applying one or more applicable rules of a set of data quality rules to first entity attribute values and first edge attribute values of the first subgraph.
    Type: Application
    Filed: October 29, 2020
    Publication date: May 5, 2022
    Inventors: Yannick Saillet, Claudio Andrea Fanconi, Martin Oberhofer, Hemanth Kumar Babu, Basem Elasioty, Mike W. Grasselt, Robert Kern, Thuany Karoline Stuart
  • Publication number: 20220092469
    Abstract: In an approach to improving machine learning model training for data matching from manual decisions, one or more computer processors detect a correction made to two data records. One or more computer processors determine a common attribute between the two data records. One or more computer processors identify a first machine learning model associated with the common attribute. One or more computer processors add comparison data of the two data records to training data for the machine learning model, where the comparison data includes the correction.
    Type: Application
    Filed: September 23, 2020
    Publication date: March 24, 2022
    Inventors: Lars Bremer, Hemanth Kumar Babu, Thuany Karoline Stuart, Martin Oberhofer
  • Publication number: 20220092064
    Abstract: Selecting entity matching systems sample record pairs by selecting at least one first record pair from entity matching system data records, scoring attribute compare methods of the at least one first record pair according to the entity matching system, adding the at least one first record pair to a no-match set according to the attribute matching score, selecting at least one second record pair from an entity matching system data record bucket, scoring attribute compare methods of the at least one second record pair according to the entity matching system, adding the at least one second record pair to a record pair set, according to the second record pair attribute compare method score, and providing the record pair set to a user.
    Type: Application
    Filed: September 21, 2020
    Publication date: March 24, 2022
    Inventors: Lars Bremer, Martin Oberhofer, Thuany Karoline Stuart, Claudio Andrea Fanconi, Jonathan Roesner, Daniel Suski
  • Patent number: 11275727
    Abstract: A computer-implemented method includes determining, by a computer device, composite view rules for combining first data from a first data record and second data from a second data record to create a composite view of an entity in a master data management system; receiving, by the computer device, the first data; receiving, by the computer device, the second data; creating, by the computer device, the composite view from the first data and the second data based on the composite view rules; physically materializing the composite view on a storage device; preserving, by the computer device, the first data record; and preserving, by the computer device, the second data record.
    Type: Grant
    Filed: September 19, 2019
    Date of Patent: March 15, 2022
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Martin Oberhofer, Jason Allen Woods, Rin Davis, Soma Shekar Naganna, Pankuj Chachra
  • Publication number: 20220075762
    Abstract: A computer implemented method for classifying at least one source dataset of a computer system. The method may include providing a plurality of associated reference tables organized and associated in accordance with a reference storage model in the computer system. The method may also include calculating, by a data classifier application of the computer system, a first similarity score between the source dataset and a first reference table of the reference tables based on common attributes in the source dataset and a join of the first reference table with at least one further reference table of the reference tables having a relationship with the first reference table. The method may further include classifying, by the data classifier application, the source dataset by determining using at least the calculated first similarity score whether the source dataset is organized as the first reference table in accordance to the reference storage model.
    Type: Application
    Filed: November 16, 2021
    Publication date: March 10, 2022
    Inventors: Martin Oberhofer, Adapala S. Reddy, Yannick Saillet, Jens Seifert
  • Patent number: 11269858
    Abstract: An example operation may include one or more of determining data at a first node satisfies a condition, obtaining a pointer to a shared storage area for the data, controlling generation of a block including the pointer, and appending the block to a blockchain without the data, wherein the first node corresponds to a first virtual node hosted by a blockchain-as-a-service (BaaS) provider and wherein the first virtual node receives the data from a network coupled to the BaaS provider.
    Type: Grant
    Filed: March 26, 2019
    Date of Patent: March 8, 2022
    Assignee: International Business Machines Corporation
    Inventors: Dinesh C. Verma, Donna N. Dillenberger, Martin Oberhofer, Namik Hrle
  • Patent number: 11250067
    Abstract: A method for presenting search query results is provided. The method may include detecting an occurrence of a trigger event. The method may include determining a category of information based on data associated with the trigger event. The method may include identifying at least one constraint based on the determined category of information. The method may include appending the identified at least one constraint to the determined category of information. The method may include generating at least one search query. The method may include selecting at least one candidate website based on the category of information. The method may include performing the at least one search query on the at least one candidate website. The method may include filtering each search query result within the search query results. The method may include sending each filtered search query result within the search query results to a user.
    Type: Grant
    Filed: August 14, 2018
    Date of Patent: February 15, 2022
    Assignee: International Business Machines Corporation
    Inventors: Geetika T. Lakshmanan, Martin A. Oberhofer
  • Patent number: 11222131
    Abstract: The present disclosure relates to a method for a secure storage, matching and linking of data records. The method comprises: receiving a current data record having one or more attributes, each attribute having an attribute value. For each attribute of at least part of the attributes a predefined set of variations of the attribute value of the attribute may be generated. The received attribute values may be encrypted resulting in an encrypted record and the generated sets of variations may be encrypted. The encrypted record may be stored in a storage system in association with the respective encrypted sets of variations.
    Type: Grant
    Filed: September 25, 2019
    Date of Patent: January 11, 2022
    Assignee: International Business Machines Corporation
    Inventors: Martin Oberhofer, Soma Shekar Naganna, Scott Schumacher, Abhishek Seth, Geetha Sravanthi Pulipaty
  • Publication number: 20210374525
    Abstract: The present disclosure relates to a method comprising providing a set of one or more records, each record of the set of records having a set of one or more attributes. Values of the set of attributes of the set of records may be input to a trained data representation learning model for receiving, as an output of the trained data representation model, a set of feature vectors representing the set of records respectively. The set of feature vectors may be stored.
    Type: Application
    Filed: December 9, 2020
    Publication date: December 2, 2021
    Inventors: Lars Bremer, Jonathan Roesner, Claudio Andrea Fanconi, Martin Oberhofer, Karin Steckler
  • Patent number: 11176108
    Abstract: Cross-matching of match indexes assigned to respective data sources can be performed. Each match index can store at least a portion of data from a subset of data records in the data source to which the match index is assigned. Such data can be stored in the match index as match index records. Responsive to identifying matches between match records of different match indexes, the data records in the data sources to which the match records correspond can be updated to add links between the respective data records.
    Type: Grant
    Filed: February 4, 2019
    Date of Patent: November 16, 2021
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Martin A. Oberhofer, Jonathan Limburn, Scott Schumacher, Olena Woolf
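
The cross-matching idea summarized in the abstract of patent 11176108 above can be illustrated with a minimal sketch. This is not the patented implementation. The function names (`build_match_index`, `cross_match`, `add_links`), the sample sources `crm` and `billing`, the `links` field, and the use of exact matching on normalized name and email are all assumptions made for illustration only.

```python
from collections import defaultdict


def build_match_index(source_name, records, key_fields):
    """Build a match index that stores only the fields used for matching.

    Each match index record keeps a pointer (source, record_id) back to
    the full data record. Normalization here (strip + lowercase) is an
    assumed, deliberately simple matching rule.
    """
    index = []
    for record_id, record in records.items():
        key = tuple(str(record.get(f, "")).strip().lower() for f in key_fields)
        index.append({"source": source_name, "record_id": record_id, "key": key})
    return index


def cross_match(indexes):
    """Return pairs of match records from different indexes that share a key."""
    by_key = defaultdict(list)
    for index in indexes:
        for match_record in index:
            by_key[match_record["key"]].append(match_record)

    pairs = []
    for group in by_key.values():
        for i, a in enumerate(group):
            for b in group[i + 1:]:
                if a["source"] != b["source"]:
                    pairs.append((a, b))
    return pairs


def add_links(sources, pairs):
    """Update the underlying data records with links to their matches."""
    for a, b in pairs:
        rec_a = sources[a["source"]][a["record_id"]]
        rec_b = sources[b["source"]][b["record_id"]]
        rec_a.setdefault("links", []).append((b["source"], b["record_id"]))
        rec_b.setdefault("links", []).append((a["source"], a["record_id"]))


# Hypothetical sample data: two data sources holding overlapping customers.
sources = {
    "crm": {
        "c1": {"name": "Ada Lovelace", "email": "ada@example.com"},
        "c2": {"name": "Alan Turing", "email": "alan@example.com"},
    },
    "billing": {
        "b7": {"name": "ada lovelace ", "email": "ADA@example.com"},
    },
}

indexes = [
    build_match_index(name, records, key_fields=("name", "email"))
    for name, records in sources.items()
]
add_links(sources, cross_match(indexes))

print(sources["crm"]["c1"]["links"])  # [('billing', 'b7')]
```

The sketch keeps matching work on the small index records and touches the full data records only when a link is added, which is the separation the abstract describes. A real system would presumably use probabilistic or fuzzy comparison rather than exact key equality.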