Patents by Inventor Alexey Streltsov

Alexey Streltsov has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20230334309
    Abstract: Systems, methods, and computer-readable media for generating a synthetic training data set from an original unstructured electronic document are disclosed. The synthetic training data set may be used to train a deep learning model to extract data from the original electronic document. The original electronic document may comprise annotated data fields. Each annotated data field may comprise a bounding box and a label. The original electronic document may comprise a header, a table, and a footer. Macro augmentation operations may be applied to the original electronic document to create sub-templates representative of distinct page layouts in the original electronic document. The synthetic training data set may be generated by applying geometric and semantic data augmentations to the sub-templates and the original electronic documents. The synthetic training data set may then be provided the deep learning model for training.
    Type: Application
    Filed: April 14, 2022
    Publication date: October 19, 2023
    Inventors: Alexey Streltsov, Monit Shah Singh, Dhananjay Tomar, Christian Reisswig, Minh Duc Bui
  • Publication number: 20230053109
    Abstract: The present disclosure involves systems, software, and computer-implemented methods for generating data regulation-compliant data from application interface data. One example method includes receiving a request for creation of document data. The request includes personal data of a user. Document data, including at least some of the personal data, is created based on the request. The document data is encoded into an encoded document that does not include any personal data of the user and includes structural information that describes the structure of the document data. A request to use the encoded document is received and the encoded document is decoded. A synthetic document is generated using the structural information included in the encoded document. Generation of the synthetic document includes insertion of synthetic user data into the synthetic document at positions in the synthetic document that correspond to positions of personal data within the document data.
    Type: Application
    Filed: August 16, 2021
    Publication date: February 16, 2023
    Inventors: Igor Schukovets, Alexey Streltsov
  • Publication number: 20220366301
    Abstract: In an example embodiment, a confidence score is computed for a predicted label (from a first model) for information extracted from a document. The confidence score is computed using a machine learned model different than the first model which is based on a Sliding-Window method. The Sliding-Window method may be based on convolutional neural networks classification, using sliding windows. It receives as input (1) the string of extracted information from an independent previous information extracted step (the “input text”), (2) the string's predicted class label, (3) the string's coordinate location in the document, and (4) the text of the document (for additional context information). The Sliding-Window method's task is to predict the confidence score to determine the correctness of the predicted label for the information.
    Type: Application
    Filed: June 22, 2021
    Publication date: November 17, 2022
    Inventors: Nurzat Rakhmanberdieva, Alexey Streltsov, Christian Reisswig
  • Publication number: 20220092405
    Abstract: In an example embodiment, a deep neural network may be utilized to determine matches between candidate pairs of entities, as well as confidence scores that reflect how certain the deep neural network is about the corresponding match. The deep neural network is also able to find these matches without requiring domain knowledge that would be required if features for a machine-learned model were handcrafted, which is a drawback of prior art machine-learned models used to match entities in multiple tables. Thus, the deep neural network improves on the functioning of prior art machine learned models designed to perform the same tasks. Specifically, the deep neural network learns the relationships of tabular fields and the patterns that define a match from historical data alone, making this approach generic and applicable independent of the context.
    Type: Application
    Filed: September 18, 2020
    Publication date: March 24, 2022
    Inventors: Matthias Frank, Hoang-Vu Nguyen, Stefan Klaus Baur, Alexey Streltsov, Jasmin Mankad, Cordula Guder, Konrad Schenk, Philipp Lukas Jamscikov, Rohit Kumar Gupta