Patents by Inventor Alexey Streltsov

Alexey Streltsov has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Generating data regulation compliant data from application interface data

Patent number: 12079284

Abstract: The present disclosure involves systems, software, and computer-implemented methods for generating data regulation-compliant data from application interface data. One example method includes receiving a request for creation of document data. The request includes personal data of a user. Document data, including at least some of the personal data, is created based on the request. The document data is encoded into an encoded document that does not include any personal data of the user and includes structural information that describes the structure of the document data. A request to use the encoded document is received and the encoded document is decoded. A synthetic document is generated using the structural information included in the encoded document. Generation of the synthetic document includes insertion of synthetic user data into the synthetic document at positions in the synthetic document that correspond to positions of personal data within the document data.

Type: Grant

Filed: August 16, 2021

Date of Patent: September 3, 2024

Assignee: SAP SE

Inventors: Igor Schukovets, Alexey Streltsov
AUGMENTING ELECTRONIC DOCUMENTS TO GENERATE SYNTHETIC TRAINING DATA SETS

Publication number: 20230334309

Abstract: Systems, methods, and computer-readable media for generating a synthetic training data set from an original unstructured electronic document are disclosed. The synthetic training data set may be used to train a deep learning model to extract data from the original electronic document. The original electronic document may comprise annotated data fields. Each annotated data field may comprise a bounding box and a label. The original electronic document may comprise a header, a table, and a footer. Macro augmentation operations may be applied to the original electronic document to create sub-templates representative of distinct page layouts in the original electronic document. The synthetic training data set may be generated by applying geometric and semantic data augmentations to the sub-templates and the original electronic documents. The synthetic training data set may then be provided the deep learning model for training.

Type: Application

Filed: April 14, 2022

Publication date: October 19, 2023

Inventors: Alexey Streltsov, Monit Shah Singh, Dhananjay Tomar, Christian Reisswig, Minh Duc Bui
GENERATING DATA REGULATION COMPLIANT DATA FROM APPLICATION INTERFACE DATA

Publication number: 20230053109

Abstract: The present disclosure involves systems, software, and computer-implemented methods for generating data regulation-compliant data from application interface data. One example method includes receiving a request for creation of document data. The request includes personal data of a user. Document data, including at least some of the personal data, is created based on the request. The document data is encoded into an encoded document that does not include any personal data of the user and includes structural information that describes the structure of the document data. A request to use the encoded document is received and the encoded document is decoded. A synthetic document is generated using the structural information included in the encoded document. Generation of the synthetic document includes insertion of synthetic user data into the synthetic document at positions in the synthetic document that correspond to positions of personal data within the document data.

Type: Application

Filed: August 16, 2021

Publication date: February 16, 2023

Inventors: Igor Schukovets, Alexey Streltsov
MODEL-INDEPENDENT CONFIDENCE VALUE PREDICTION MACHINE LEARNED MODEL

Publication number: 20220366301

Abstract: In an example embodiment, a confidence score is computed for a predicted label (from a first model) for information extracted from a document. The confidence score is computed using a machine learned model different than the first model which is based on a Sliding-Window method. The Sliding-Window method may be based on convolutional neural networks classification, using sliding windows. It receives as input (1) the string of extracted information from an independent previous information extracted step (the “input text”), (2) the string's predicted class label, (3) the string's coordinate location in the document, and (4) the text of the document (for additional context information). The Sliding-Window method's task is to predict the confidence score to determine the correctness of the predicted label for the information.

Type: Application

Filed: June 22, 2021

Publication date: November 17, 2022

Inventors: Nurzat Rakhmanberdieva, Alexey Streltsov, Christian Reisswig
DEEP NEURAL NETWORK FOR MATCHING ENTITIES IN SEMI-STRUCTURED DATA

Publication number: 20220092405

Abstract: In an example embodiment, a deep neural network may be utilized to determine matches between candidate pairs of entities, as well as confidence scores that reflect how certain the deep neural network is about the corresponding match. The deep neural network is also able to find these matches without requiring domain knowledge that would be required if features for a machine-learned model were handcrafted, which is a drawback of prior art machine-learned models used to match entities in multiple tables. Thus, the deep neural network improves on the functioning of prior art machine learned models designed to perform the same tasks. Specifically, the deep neural network learns the relationships of tabular fields and the patterns that define a match from historical data alone, making this approach generic and applicable independent of the context.

Type: Application

Filed: September 18, 2020

Publication date: March 24, 2022

Inventors: Matthias Frank, Hoang-Vu Nguyen, Stefan Klaus Baur, Alexey Streltsov, Jasmin Mankad, Cordula Guder, Konrad Schenk, Philipp Lukas Jamscikov, Rohit Kumar Gupta

Generating data regulation compliant data from application interface data

AUGMENTING ELECTRONIC DOCUMENTS TO GENERATE SYNTHETIC TRAINING DATA SETS

GENERATING DATA REGULATION COMPLIANT DATA FROM APPLICATION INTERFACE DATA

MODEL-INDEPENDENT CONFIDENCE VALUE PREDICTION MACHINE LEARNED MODEL

DEEP NEURAL NETWORK FOR MATCHING ENTITIES IN SEMI-STRUCTURED DATA