Patents by Inventor Stefan Klaus Baur

Stefan Klaus Baur has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20240045890
    Abstract: Methods, systems, and computer-readable storage media for a machine learning (ML) system for matching a query entity to one or more target entities, the ML system that reducing a number of query-target entity pairs from consideration as potential matches during inference.
    Type: Application
    Filed: August 4, 2022
    Publication date: February 8, 2024
    Inventors: Hoang-Vu Nguyen, Li Rong Wang, Matthias Frank, Rajesh Vellore Arumugam, Stefan Klaus Baur, Sundeep Gullapudi
  • Publication number: 20230325708
    Abstract: Computer-readable media, methods, and systems are disclosed for feature attribution in a machine learning model. Samples may be generated for a machine learning model based on a normalized probability distribution. The samples may be used to determine a weight for features and feature pairs for the machine learning model. The weights of the features and feature pairs may be used to determine which features are significant for predictions within the machine learning model.
    Type: Application
    Filed: April 12, 2022
    Publication date: October 12, 2023
    Inventors: Stefan Klaus Baur, Matthias Frank, Hoang-Vu Nguyen, Kannan Presanna Kumar
  • Publication number: 20230222147
    Abstract: Methods, systems, and computer-readable storage media for receiving a set of inference results generated by a ML model, the inference results including a set of query entities and a set of target entities, each query entity having one or more target entities matched thereto by the ML model, processing the set of inference results to generate a set of matched sub-sets of target entities by executing a search over target entities in the set of target entities based on constraints, for each problem in a set of problems, providing the problem as a tuple including an index value representative of a target entity in the set of target entities and a value associated with the query entity, the value including a constraint relative to the query entity, and executing at least one task in response to one or more matched sub-sets in the set of matched sub-sets.
    Type: Application
    Filed: January 10, 2022
    Publication date: July 13, 2023
    Inventors: Hoang-Vu Nguyen, Rajesh Vellore Arumugam, Matthias Frank, Stefan Klaus Baur
  • Patent number: 11687575
    Abstract: Methods, systems, and computer-readable storage media for receiving a set of inference results generated by a ML model, the inference results including a set of query entities and a set of target entities, each query entity having one or more target entities matched thereto by the ML model, processing the set of inference results to generate a set of matched sub-sets of target entities by executing a search over target entities in the set of target entities based on constraints, for each problem in a set of problems, providing the problem as a tuple including an index value representative of a target entity in the set of target entities and a value associated with the query entity, the value including a constraint relative to the query entity, and executing at least one task in response to one or more matched sub-sets in the set of matched sub-sets.
    Type: Grant
    Filed: January 10, 2022
    Date of Patent: June 27, 2023
    Assignee: SAP SE
    Inventors: Hoang-Vu Nguyen, Rajesh Vellore Arumugam, Matthias Frank, Stefan Klaus Baur
  • Publication number: 20230128485
    Abstract: Methods, systems, and computer-readable storage media for receiving IRF data sets, the IRF data sets including a set of records including inference results determined by the ML model during production use of the ML model and at least one correction to an inference result, executing incremental training of the ML model to provide an updated ML model by selectively filtering one or more records of the set of records to adjust a negative sample to positive sample proportion of a sub-set of records based on a negative sample to positive sample proportion of initial training of the ML model, for each record in the sub-set of records, determining a weight, and during incremental training, applying the weight of a respective record being in a loss function in determining an accuracy of the ML model based on the respective record, and deploying the updated ML model for production use.
    Type: Application
    Filed: October 27, 2021
    Publication date: April 27, 2023
    Inventors: Rajesh Vellore Arumugam, Andrew Ivan Soegeng, Stefan Klaus Baur
  • Patent number: 11615120
    Abstract: Pairwise entity matching systems and methods are disclosed herein. A deep learning model may be used to match entities from separate data tables. Entities may be preprocessed to fuse textual and numeric data early in the neural network architecture. Numeric data may be represented as a vector of a geometrically progressing function. By fusing textual and numeric data, including dates, early in the neural network architecture the neural network may better learn the relationships between the numeric and textual data. Once preprocessed, the paired entities may be scored and matched using a neural network.
    Type: Grant
    Filed: July 14, 2021
    Date of Patent: March 28, 2023
    Assignee: SAP SE
    Inventors: Stefan Klaus Baur, Matthias Frank, Hoang-Vu Nguyen
  • Publication number: 20220391414
    Abstract: Pairwise entity matching systems and methods are disclosed herein. A deep learning model may be used to match entities from separate data tables. Entities may be preprocessed to fuse textual and numeric data early in the neural network architecture. Numeric data may be represented as a vector of a geometrically progressing function. By fusing textual and numeric data, including dates, early in the neural network architecture the neural network may better learn the relationships between the numeric and textual data. Once preprocessed, the paired entities may be scored and matched using a neural network.
    Type: Application
    Filed: July 14, 2021
    Publication date: December 8, 2022
    Inventors: Stefan Klaus Baur, Matthias Frank, Hoang-Vu Nguyen
  • Publication number: 20220092405
    Abstract: In an example embodiment, a deep neural network may be utilized to determine matches between candidate pairs of entities, as well as confidence scores that reflect how certain the deep neural network is about the corresponding match. The deep neural network is also able to find these matches without requiring domain knowledge that would be required if features for a machine-learned model were handcrafted, which is a drawback of prior art machine-learned models used to match entities in multiple tables. Thus, the deep neural network improves on the functioning of prior art machine learned models designed to perform the same tasks. Specifically, the deep neural network learns the relationships of tabular fields and the patterns that define a match from historical data alone, making this approach generic and applicable independent of the context.
    Type: Application
    Filed: September 18, 2020
    Publication date: March 24, 2022
    Inventors: Matthias Frank, Hoang-Vu Nguyen, Stefan Klaus Baur, Alexey Streltsov, Jasmin Mankad, Cordula Guder, Konrad Schenk, Philipp Lukas Jamscikov, Rohit Kumar Gupta
  • Patent number: 11275934
    Abstract: Disclosed herein are system, method, and computer program product embodiments for generating document labels using positional embeddings. In an embodiment, a label system may identify tokens, such as words, of a document image. The label system may apply a position vector neural network to the document image to analyze the pixels and determine positional embedding vectors corresponding to the words. The label system may then combine the positional embedding vectors to corresponding word vectors for use as an input to a neural network trained to generate document labels. This combination may embed the positional information with the corresponding word information in a serialized manner for processing by the document label neural network. Using this formatting, the label system may generate document labels in a light-weight and fast manner while still preserving spatial relationships between words.
    Type: Grant
    Filed: November 20, 2019
    Date of Patent: March 15, 2022
    Assignee: SAP SE
    Inventors: Christian Reisswig, Stefan Klaus Baur
  • Publication number: 20210150201
    Abstract: Disclosed herein are system, method, and computer program product embodiments for generating document labels using positional embeddings. In an embodiment, a label system may identify tokens, such as words, of a document image. The label system may apply a position vector neural network to the document image to analyze the pixels and determine positional embedding vectors corresponding to the words. The label system may then combine the positional embedding vectors to corresponding word vectors for use as an input to a neural network trained to generate document labels. This combination may embed the positional information with the corresponding word information in a serialized manner for processing by the document label neural network. Using this formatting, the label system may generate document labels in a light-weight and fast manner while still preserving spatial relationships between words.
    Type: Application
    Filed: November 20, 2019
    Publication date: May 20, 2021
    Inventors: Christian Reisswig, Stefan Klaus Baur