Patents by Inventor Sean Saito

Sean Saito has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11861692
    Abstract: Methods, systems, and computer-readable storage media for receiving a first bank statement at a hybrid pipeline including a set of lookup tables and a deep learning (DL) model that can each be used to determine customer IDs from bank statements, providing a first key based on data associated with the first bank statement, and determining that the first key is included in a first lookup table of the set of lookup tables, and in response: identifying a first set of customer IDs from the first lookup table, the first set of customer IDs including one or more customer IDs, and outputting the first set of customer IDs to computer-executable software that matches the first bank statement to one or more electronic documents at least partially based on the first set of customer IDs.
    Type: Grant
    Filed: June 4, 2019
    Date of Patent: January 2, 2024
    Assignee: SAP SE
    Inventors: Auguste Byiringiro, Jiatai Qiang, Atreya Biswas, Sean Saito
  • Patent number: 11645686
    Abstract: Methods, systems, and computer-readable storage media for providing, by a machine learning (ML) platform, a first binary classifier, processing, by the first binary classifier a super-set of invoices to provide a plurality of sets of invoices based on matching pairs of invoices in the super-set of invoices, providing, by the ML platform, a second binary classifier, processing, by the second binary classifier, a bank statement and the plurality of sets of invoices to define two or more super-invoices based on aggregate features of invoices in the plurality of sets of invoices, and match the bank statement to a super-invoice of the two or more super-invoices, and outputting a match of the bank statement to the super-invoice.
    Type: Grant
    Filed: December 5, 2018
    Date of Patent: May 9, 2023
    Assignee: SAP SE
    Inventors: Truc Viet Le, Sean Saito, Chaitanya Krishna Joshi, Rajalingappaa Shanmugamani
  • Patent number: 11537946
    Abstract: Methods, systems, and computer-readable storage media for a machine learning (ML) model and framework for training of the ML model to enable the ML model to correctly match entities even in instances where new entities are added after the ML model has been trained. More particularly, implementations of the present disclosure are directed to a ML model provided as a neural network that is trained to provide a scalar confidence score that indicates whether two entities in a pair of entities are considered a match, even if an entity in the set of entities was not accounted for in training of the ML model.
    Type: Grant
    Filed: March 10, 2020
    Date of Patent: December 27, 2022
    Assignee: SAP SE
    Inventors: Sean Saito, Auguste Byiringiro
  • Patent number: 11507886
    Abstract: Methods, systems, and computer-readable storage media for receiving structured data including a set of columns and a set of rows, determining, for each column, a column width defining a number of characters, providing, for each row, a set of padded values, each padded value corresponding to a column and including a value and one or more padding characters, the value and the one or more padding values collectively having a length equal to a respective column width, defining a set of strings by, for each row, concatenating padded values in the set of padded values to provide a string, and training the ML model by providing, for each string in the set of strings, an embedding as an abstract representation of a record of a respective row and processing the embedding through an attention layer of the ML model.
    Type: Grant
    Filed: March 17, 2020
    Date of Patent: November 22, 2022
    Assignee: SAP SE
    Inventor: Sean Saito
  • Patent number: 11507832
    Abstract: Methods, systems, and computer-readable storage media for tuning behavior of a machine learning (ML) model by providing an alternative loss function used during training of a ML model, the alternative loss function enhancing reliability of the ML model, calibrating the confidence of the ML model after training, and reducing risk in downstream tasks by providing a mapping between the confidence of the ML model to the expected accuracy of the ML model.
    Type: Grant
    Filed: March 10, 2020
    Date of Patent: November 22, 2022
    Assignee: SAP SE
    Inventors: Sean Saito, Auguste Byiringiro
  • Patent number: 11263555
    Abstract: Methods, systems, and computer-readable storage media for receiving a set of documents matched by a ML model, each document in the set of documents including a computer-readable electronic document, processing a set of pairwise features, the ML model, and the set of documents by an explanation framework to generate a set of raw explanations, the set of raw explanations including one or more raw explanations, each raw explanation including a pairwise feature and an importance score, for each raw explanation, identifying a natural language template based on the pairwise feature and the importance score, and populating the natural language template with one or more parameters provided from the set of documents to provide a natural language explanation for matching of the documents in the set of documents by the ML model.
    Type: Grant
    Filed: May 6, 2019
    Date of Patent: March 1, 2022
    Assignee: SAP SE
    Inventor: Sean Saito
  • Patent number: 11250221
    Abstract: Methods, systems, and computer-readable storage media for contextual interpretation of a Japanese word are provided. A first set of characters representing Japanese words is received. The first set of characters are received is input to a neural network. The neural network is trained to processes characters based on bi-directional context interpretation. The first set of characters is processed by the neural network through a plurality of learning layers that process the first set of characters in an order of the first set of characters and in a reverse order to determine semantical meanings of the characters in the first set of characters. An alphabet representation of at least one character of the first set of characters representing a Japanese word is output. The alphabet representation corresponds to a semantical meaning of the at least one character within the first set of characters.
    Type: Grant
    Filed: March 14, 2019
    Date of Patent: February 15, 2022
    Assignee: SAP SE
    Inventor: Sean Saito
  • Publication number: 20210295200
    Abstract: Methods, systems, and computer-readable storage media for receiving structured data including a set of columns and a set of rows, determining, for each column, a column width defining a number of characters, providing, for each row, a set of padded values, each padded value corresponding to a column and including a value and one or more padding characters, the value and the one or more padding values collectively having a length equal to a respective column width, defining a set of strings by, for each row, concatenating padded values in the set of padded values to provide a string, and training the ML model by providing, for each string in the set of strings, an embedding as an abstract representation of a record of a respective row and processing the embedding through an attention layer of the ML model.
    Type: Application
    Filed: March 17, 2020
    Publication date: September 23, 2021
    Inventor: Sean SAITO
  • Publication number: 20210287129
    Abstract: Methods, systems, and computer-readable storage media for a machine learning (ML) model and framework for training of the ML model to enable the ML model to correctly match entities even in instances where new entities are added after the ML model has been trained. More particularly, implementations of the present disclosure are directed to a ML model provided as a neural network that is trained to provide a scalar confidence score that indicates whether two entities in a pair of entities are considered a match, even if an entity in the set of entities was not accounted for in training of the ML model.
    Type: Application
    Filed: March 10, 2020
    Publication date: September 16, 2021
    Inventors: Sean Saito, Auguste Byiringiro
  • Publication number: 20210287081
    Abstract: Methods, systems, and computer-readable storage media for tuning behavior of a machine learning (ML) model by providing an alternative loss function used during training of a ML model, the alternative loss function enhancing reliability of the ML model, calibrating the confidence of the ML model after training, and reducing risk in downstream tasks by providing a mapping between the confidence of the ML model to the expected accuracy of the ML model.
    Type: Application
    Filed: March 10, 2020
    Publication date: September 16, 2021
    Inventors: Sean Saito, Auguste Byiringiro
  • Publication number: 20200387963
    Abstract: Methods, systems, and computer-readable storage media for receiving a first bank statement at a hybrid pipeline including a set of lookup tables and a deep learning (DL) model that can each be used to determine customer IDs from bank statements, providing a first key based on data associated with the first bank statement, and determining that the first key is included in a first lookup table of the set of lookup tables, and in response: identifying a first set of customer IDs from the first lookup table, the first set of customer IDs including one or more customer IDs, and outputting the first set of customer IDs to computer-executable software that matches the first bank statement to one or more electronic documents at least partially based on the first set of customer IDs.
    Type: Application
    Filed: June 4, 2019
    Publication date: December 10, 2020
    Inventors: Auguste Byiringiro, Jiatai Qiang, Atreya Biswas, Sean Saito
  • Patent number: 10839265
    Abstract: Methods, systems, and computer-readable storage media for receiving a set of training images and a set of classification labels, generating a set of target codebooks based on the set of classification labels, the set of target codebooks being provided as a first set of vectors of random value and dimension, generating a set of output codebooks based on the set of training images, the set of output codebooks being provided as a second set of vectors of random value and dimension, training a ML model by minimizing a loss function provided as a mean-squared-error (MSE) loss function, the loss function being measured by the Euclidean distance between an output codebook of the set of output codebooks and a target codebook of the set of target codebooks.
    Type: Grant
    Filed: November 12, 2018
    Date of Patent: November 17, 2020
    Assignee: SAP SE
    Inventors: Sean Saito, Sujoy Roy
  • Publication number: 20200356891
    Abstract: Methods, systems, and computer-readable storage media for receiving a set of documents matched by a ML model, each document in the set of documents including a computer-readable electronic document, processing a set of pairwise features, the ML model, and the set of documents by an explanation framework to generate a set of raw explanations, the set of raw explanations including one or more raw explanations, each raw explanation including a pairwise feature and an importance score, for each raw explanation, identifying a natural language template based on the pairwise feature and the importance score, and populating the natural language template with one or more parameters provided from the set of documents to provide a natural language explanation for matching of the documents in the set of documents by the ML model.
    Type: Application
    Filed: May 6, 2019
    Publication date: November 12, 2020
    Inventor: Sean Saito
  • Publication number: 20200293623
    Abstract: Methods, systems, and computer-readable storage media for contextual interpretation of a Japanese word are provided. A first set of characters representing Japanese words is received. The first set of characters are received is input to a neural network. The neural network is trained to processes characters based on bi-directional context interpretation. The first set of characters is processed by the neural network through a plurality of learning layers that process the first set of characters in an order of the first set of characters and in a reverse order to determine semantical meanings of the characters in the first set of characters. An alphabet representation of at least one character of the first set of characters representing a Japanese word is output. The alphabet representation corresponds to a semantical meaning of the at least one character within the first set of characters.
    Type: Application
    Filed: March 14, 2019
    Publication date: September 17, 2020
    Inventor: Sean Saito
  • Publication number: 20200193511
    Abstract: Methods, systems, and computer-readable storage media for receiving, by a machine learning (ML) platform, a set of invoices including two or more invoices, processing, by the ML platform, each invoice through a neural network to provide respective invoice embeddings, each invoice embedding including a multi-dimensional vector, comparing, by the ML platform, invoice embeddings to define two or more super-invoices, each super-invoice including a sub-set of the set of invoices, and matching a bank statement to a super-invoice of the two or more super-invoices.
    Type: Application
    Filed: December 12, 2018
    Publication date: June 18, 2020
    Inventors: Sean Saito, Chaitanya Krishna Joshi, Rajalingappaa Shanmugamani, Truc Viet Le, Rajesh Vellore Arumugam
  • Publication number: 20200184281
    Abstract: Methods, systems, and computer-readable storage media for providing, by a machine learning (ML) platform, a first binary classifier, processing, by the first binary classifier a super-set of invoices to provide a plurality of sets of invoices based on matching pairs of invoices in the super-set of invoices, providing, by the ML platform, a second binary classifier, processing, by the second binary classifier, a bank statement and the plurality of sets of invoices to define two or more super-invoices based on aggregate features of invoices in the plurality of sets of invoices, and match the bank statement to a super-invoice of the two or more super-invoices, and outputting a match of the bank statement to the super-invoice.
    Type: Application
    Filed: December 5, 2018
    Publication date: June 11, 2020
    Inventors: Truc Viet LE, Sean Saito, Chaitanya Krishna Joshi, Rajalingappaa Shanmugamani
  • Publication number: 20200175559
    Abstract: Methods, systems, and computer-readable storage media for providing a set of column pairs, each column pair including a column of a bank statement table, and a column of a super invoice table, each column pair corresponding to a modality, the super invoice table including at least one row including data associated with multiple invoices, for each column pair, determining a feature descriptor based on an operator, a feature vector being provided based on feature descriptors of the set of column pairs, inputting the feature vector to a ML model that processes the feature vector to determine a probability of a match between the bank statement, and a super invoice represented by the super invoice table, and outputting a binary output representing one of a match and no match between the bank statement, and the super invoice based on the probability.
    Type: Application
    Filed: December 4, 2018
    Publication date: June 4, 2020
    Inventors: Sean Saito, Truc Viet Le, Chaitanya Joshi, Rajalingappaa Shanmugamani
  • Publication number: 20200151505
    Abstract: Methods, systems, and computer-readable storage media for receiving a set of training images and a set of classification labels, generating a set of target codebooks based on the set of classification labels, the set of target codebooks being provided as a first set of vectors of random value and dimension, generating a set of output codebooks based on the set of training images, the set of output codebooks being provided as a second set of vectors of random value and dimension, training a ML model by minimizing a loss function provided as a mean-squared-error (MSE) loss function, the loss function being measured by the Euclidean distance between an output codebook of the set of output codebooks and a target codebook of the set of target codebooks.
    Type: Application
    Filed: November 12, 2018
    Publication date: May 14, 2020
    Inventors: Sean Saito, Sujoy Roy