Patents by Inventor Ivan Zagaynov

Ivan Zagaynov has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Detecting fields in document images

Patent number: 12354397

Abstract: A method of detecting fields in document images includes: receiving a codebook comprising a set of visual words, each visual word corresponding to a center of a cluster of local descriptors; calculating, based on a set of user labeled document images, for each visual word of the codebook, a respective frequency distribution of a field position of a specified labeled field with respect to the visual word; loading a document image for extraction of target fields; calculating a statistical predicate of a possible position of a target field in the document image based on the frequency distributions; and detecting, using the trained model, fields in the document image based on the calculated statistical predicate.

Type: Grant

Filed: November 6, 2023

Date of Patent: July 8, 2025

Assignee: ABBYY Development Inc.

Inventors: Ivan Zagaynov, Vasily Loginov, Stanislav Semenov, Aleksandr Valiukov
PROCESSING IMAGES OF DEFORMED INDICIA-BEARING SURFACES

Publication number: 20250166403

Abstract: An example method of processing images of deformed indicia-bearing surfaces includes: detecting, within a document image, a plurality of image fragments, wherein each image fragment of the plurality of image fragment contains a respective sequence of alphabet symbols; grouping the plurality of image fragments by lines of text to be reconstructed in the document image; generating a map of isolines associated with the document image, wherein an isoline identifies a set of points that lie on a straight line of an undistorted image corresponding to the document image; generating a reverse transformation matrix that defines a set of transformations to be applied to the document image in order to remove image distortions caused by deformations of an indicia bearing surface; and generating an undistorted document image by applying the reverse transformation matrix to the document image.

Type: Application

Filed: November 17, 2023

Publication date: May 22, 2025

Inventors: Ivan Zagaynov, Dmitry Solntsev, Alina Kalyakina
DIVISION OF IMAGES INTO SEPARATE COLOR LAYERS

Publication number: 20240331346

Abstract: A method of the disclosure includes receiving, by a processing device, a document image, dividing the document image into a plurality of patches and determining, for each patch, whether the patch is monochromatic or polychromatic. It further includes clusterizing a plurality of monochromatic patches into a plurality of clusters within a color space, wherein each cluster corresponds to a color layer of a plurality of color layers of the document image, and segmenting each polychromatic patch into a corresponding plurality of monochromatic segments. The method also includes, for each polychromatic patch, associating each monochromatic segment of the corresponding plurality of monochromatic segments with a cluster of the plurality of clusters, and utilizing the plurality of clusters for performing an information extraction task on the document image.

Type: Application

Filed: June 14, 2024

Publication date: October 3, 2024

Inventors: Vadim Mikhonov, Ivan Zagaynov
DECODING OF LINEAR BARCODES UNDER UNFAVORABLE CONDITIONS

Publication number: 20240330630

Abstract: Aspects and implementations provide for mechanisms of detection and decoding of barcodes in images. The disclosed techniques include estimating dimensions of a module of a barcode based on geometric characteristics of a barcode image, forming hypotheses that group modules into barcode symbols, and assessing viability of formed hypotheses. Various operations of the techniques may involve the use of neural networks, including estimation of module dimensions and assessment of groupings of modules into lines and lines into barcode symbols. The techniques may be used for decoding of barcodes captured in images of unfavorable conditions, including blur, perspective, sub-optimal lighting, barcode deformation, and the like. The techniques may be applied to decoding linear one-dimensional barcodes, two-dimensional barcodes, and stacked linear barcodes.

Type: Application

Filed: June 10, 2024

Publication date: October 3, 2024

Inventors: Ivan Zagaynov, Dmitry Zvonarev, Maksim Baranchikov
DECODING OF TWO-DIMENSIONAL BARCODES UNDER UNFAVORABLE CONDITIONS

Publication number: 20240256809

Abstract: Aspects and implementations provide for mechanisms of detection and decoding of barcodes in images. The disclosed techniques include estimating dimensions of a module of a barcode based on geometric characteristics of a barcode image, forming hypotheses that group modules into barcode symbols, and assessing viability of formed hypotheses. Various operations of the techniques may involve the use of neural networks, including estimation of module dimensions and assessment of groupings of modules into lines and lines into barcode symbols. The techniques may be used for decoding of barcodes captured in images of unfavorable conditions, including blur, perspective, sub-optimal lighting, barcode deformation, and the like. The techniques may be applied to decoding linear one-dimensional barcodes, two-dimensional barcodes, and stacked linear barcodes.

Type: Application

Filed: April 15, 2024

Publication date: August 1, 2024

Inventors: Ivan Zagaynov, Dmitry Zvonarev, Aleksandr Riashchikov
Division of images into separate color layers

Patent number: 12046016

Abstract: A method of the disclosure includes receiving, by a processing device, a document image, dividing the document image into a plurality of patches and determining, for each patch, whether the patch is monochromatic or polychromatic. It further includes clusterizing a plurality of monochromatic patches into a plurality of clusters within a color space, wherein each cluster corresponds to a color layer of a plurality of color layers of the document image, and segmenting each polychromatic patch into a corresponding plurality of monochromatic segments. The method also includes, for each polychromatic patch, associating each monochromatic segment of the corresponding plurality of monochromatic segments with a cluster of the plurality of clusters, and utilizing the plurality of clusters for performing an information extraction task on the document image.

Type: Grant

Filed: December 9, 2021

Date of Patent: July 23, 2024

Assignee: ABBYY Development Inc.

Inventors: Vadim Mikhonov, Ivan Zagaynov
EXTRACTING MULTIPLE DOCUMENTS FROM SINGLE IMAGE

Publication number: 20240212382

Abstract: System and method for document image detection, comprising: producing, using a neural network, a superpixel segmentation map of an input image; generating a superpixel binary mask by associating each superpixel of the superpixel segmentation map with a class of a predetermined set of classes; identifying one or more connected components in the superpixel binary mask; for each connected component of the superpixel binary mask, identifying a corresponding minimum bounding polygon; creating one or more image dividing lines based on the minimum bounding polygons; and defining boundaries of one or more objects of interest based on at least a subset of the image dividing lines.

Type: Application

Filed: March 11, 2024

Publication date: June 27, 2024

Inventors: Ivan Zagaynov, Aleksandra Stepina
DOCUMENT PROCESSING WITH EFFICIENT TYPE-OF-SOURCE CLASSIFICATION

Publication number: 20240202517

Abstract: Aspects and implementations provide for techniques of classifying images by source types for efficient, fast, and economical processing of such images. The disclosed techniques include, for example, obtaining an input into an image processing operation (IPO input). The techniques further include processing, using a first neural network (NN), a first image associated with the IPO input to obtain a first feature vector, and processing, using a second NN, a plurality of second images associated with the IPO input to obtain a second feature vector. The techniques further include identifying, using the first feature vector and the second feature vector, a type of source used to generate the IPO input.

Type: Application

Filed: December 19, 2022

Publication date: June 20, 2024

Inventors: Ivan Zagaynov, Dmitry Solntsev, Alena Vasileva
Decoding of linear barcodes under unfavorable conditions

Patent number: 12008431

Abstract: Aspects and implementations provide for mechanisms of detection and decoding of barcodes in images. The disclosed techniques include estimating dimensions of a module of a barcode based on geometric characteristics of a barcode image, forming hypotheses that group modules into barcode symbols, and assessing viability of formed hypotheses. Various operations of the techniques may involve the use of neural networks, including estimation of module dimensions and assessment of groupings of modules into lines and lines into barcode symbols. The techniques may be used for decoding of barcodes captured in images of unfavorable conditions, including blur, perspective, sub-optimal lighting, barcode deformation, and the like. The techniques may be applied to decoding linear one-dimensional barcodes, two-dimensional barcodes, and stacked linear barcodes.

Type: Grant

Filed: May 16, 2022

Date of Patent: June 11, 2024

Assignee: ABBYY Development Inc.

Inventors: Ivan Zagaynov, Dmitry Zvonarev, Maksim Baranchikov
RELIABLE DETERMINATION OF FIELD VALUES IN DOCUMENTS WITH REMOVAL OF STATIC FIELD ELEMENTS

Publication number: 20240144711

Abstract: Aspects and implementations provide for mechanisms of detection of fields in electronic documents and determination of values of the detected field. The disclosed techniques include obtaining an input into a machine learning model (MLM), the input including a first image of a field extracted from a document and depicting one or more static elements of the field and a field value, the input and further including a second image of the field. The input may be processed using the MLM to identify one or more static regions that correspond to static elements of the field. The identified static regions may be used to modify the first image in which the static regions are removed or have a reduced visibility. The modified image may be used to determine the field value.

Type: Application

Filed: October 31, 2022

Publication date: May 2, 2024

Inventors: Ivan Zagaynov, Stanislav Semenov, Alena Dedigurova
Extracting multiple documents from single image

Patent number: 11972626

Abstract: System and method for document image detection, comprising: producing, using a neural network, a superpixel segmentation map of an input image; generating a superpixel binary mask by associating each superpixel of the superpixel segmentation map with a class of a predetermined set of classes; identifying one or more connected components in the superpixel binary mask; for each connected component of the superpixel binary mask, identifying a corresponding minimum bounding polygon; creating one or more image dividing lines based on the minimum bounding polygons; and defining boundaries of one or more objects of interest based on at least a subset of the image dividing lines.

Type: Grant

Filed: December 24, 2020

Date of Patent: April 30, 2024

Assignee: ABBYY Development Inc.

Inventors: Ivan Zagaynov, Aleksandra Stepina
Decoding of two-dimensional barcodes under unfavorable conditions

Patent number: 11960966

Abstract: Aspects and implementations provide for mechanisms of detection and decoding of barcodes in images. The disclosed techniques include estimating dimensions of a module of a barcode based on geometric characteristics of a barcode image, forming hypotheses that group modules into barcode symbols, and assessing viability of formed hypotheses. Various operations of the techniques may involve the use of neural networks, including estimation of module dimensions and assessment of groupings of modules into lines and lines into barcode symbols. The techniques may be used for decoding of barcodes captured in images of unfavorable conditions, including blur, perspective, sub-optimal lighting, barcode deformation, and the like. The techniques may be applied to decoding linear one-dimensional barcodes, two-dimensional barcodes, and stacked linear barcodes.

Type: Grant

Filed: May 16, 2022

Date of Patent: April 16, 2024

Assignee: ABBYY Development Inc.

Inventors: Ivan Zagaynov, Dmitry Zvonarev, Aleksandr Riashchikov
Zero-footprint image capture by mobile device

Patent number: 11948385

Abstract: A computer-implemented method for image capture by a mobile device, comprising: receiving, by a video capturing application running on a mobile device, a video stream from a camera of the mobile device; identifying a specific frame of the video stream; generating a plurality of hypotheses defining image borders within the specific frame; selecting, by a neural network, a particular hypothesis among the plurality of hypotheses; producing a candidate image by applying the particular hypothesis to the specific frame; determining a value of a quality metric of the candidate image; determining that the value of the quality metric of the candidate image exceeds one or more values of the quality metric of one or more previously processed images extracted from the video stream; wherein the image capture application is a zero-footprint application.

Type: Grant

Filed: May 23, 2022

Date of Patent: April 2, 2024

Assignee: ABBYY Development Inc.

Inventors: Ivan Zagaynov, Stepan Lobastov, Juri Katkov, Vasily Shahov, Olga Titova, Ivan Khintsitskiy
DETECTING FIELDS IN DOCUMENT IMAGES

Publication number: 20240078828

Abstract: A method of detecting fields in document images includes: receiving a codebook comprising a set of visual words, each visual word corresponding to a center of a cluster of local descriptors; calculating, based on a set of user labeled document images, for each visual word of the codebook, a respective frequency distribution of a field position of a specified labeled field with respect to the visual word; loading a document image for extraction of target fields; calculating a statistical predicate of a possible position of a target field in the document image based on the frequency distributions; and detecting, using the trained model, fields in the document image based on the calculated statistical predicate.

Type: Application

Filed: November 6, 2023

Publication date: March 7, 2024

Inventors: Ivan Zagaynov, Vasily Loginov, Stanislav Semenov, Aleksandr Valiukov
Optimization and use of codebooks for document analysis

Patent number: 11893818

Abstract: A method of generating and optimizing a codebooks for document analysis comprises: receiving a first set of document images; extracting a plurality of keypoint regions from each document image of the first set of document images; calculating local descriptors for each keypoint region of the extracted keypoint regions; clustering the local descriptors such that each center of a cluster of local descriptors corresponds to a respective visual word; generating a codebook containing a set of visual words; and optimizing the codebook by maximizing mutual information (MI) between a target field of a second set of document images and at least one visual word of the set of visual words.

Type: Grant

Filed: July 26, 2021

Date of Patent: February 6, 2024

Assignee: ABBYY Development Inc.

Inventors: Ivan Zagaynov, Vasily Loginov, Stanislav Semenov, Aleksandr Valiukov
Assessment of image quality for optical character recognition using machine learning

Patent number: 11893784

Abstract: Aspects of the disclosure provide for systems and processes for assessing image quality for optical character recognition (OCR), including but not limited to: segmenting an image into patches, providing the segmented image as an input into a first machine learning model (MLM), obtaining, using the first MLM, for each patch, first feature vectors representative of a reduction of imaging quality in a respective patch, and second feature vectors representative of a text content of the respective patch, providing to a second MLM the first feature vectors and the second feature vectors, and obtaining, using the second MLM, an indication of suitability of the image for OCR.

Type: Grant

Filed: May 20, 2021

Date of Patent: February 6, 2024

Assignee: ABBYY Development Inc.

Inventors: Ivan Zagaynov, Dmitry Rodin, Vasily Loginov
DECODING OF LINEAR BARCODES UNDER UNFAVORABLE CONDITIONS

Publication number: 20230367984

Abstract: Aspects and implementations provide for mechanisms of detection and decoding of barcodes in images. The disclosed techniques include estimating dimensions of a module of a barcode based on geometric characteristics of a barcode image, forming hypotheses that group modules into barcode symbols, and assessing viability of formed hypotheses. Various operations of the techniques may involve the use of neural networks, including estimation of module dimensions and assessment of groupings of modules into lines and lines into barcode symbols. The techniques may be used for decoding of barcodes captured in images of unfavorable conditions, including blur, perspective, sub-optimal lighting, barcode deformation, and the like. The techniques may be applied to decoding linear one-dimensional barcodes, two-dimensional barcodes, and stacked linear barcodes.

Type: Application

Filed: May 16, 2022

Publication date: November 16, 2023

Inventors: Ivan Zagaynov, Dmitry Zvonarev, Maksim Baranchikov
DECODING OF TWO-DIMENSIONAL BARCODES UNDER UNFAVORABLE CONDITIONS

Publication number: 20230367983

Abstract: Aspects and implementations provide for mechanisms of detection and decoding of barcodes in images. The disclosed techniques include estimating dimensions of a module of a barcode based on geometric characteristics of a barcode image, forming hypotheses that group modules into barcode symbols, and assessing viability of formed hypotheses. Various operations of the techniques may involve the use of neural networks, including estimation of module dimensions and assessment of groupings of modules into lines and lines into barcode symbols. The techniques may be used for decoding of barcodes captured in images of unfavorable conditions, including blur, perspective, sub-optimal lighting, barcode deformation, and the like. The techniques may be applied to decoding linear one-dimensional barcodes, two-dimensional barcodes, and stacked linear barcodes.

Type: Application

Filed: May 16, 2022

Publication date: November 16, 2023

Inventors: Ivan Zagaynov, Dmitry Zvonarev, Aleksandr Riashchikov
Document clusterization using neural networks

Patent number: 11816909

Abstract: An example method of document classification comprises: detecting a set of keypoints in an input image; generating a set of keypoint vectors, wherein each keypoint vector of the set of keypoint vectors is associated with a corresponding keypoint of the set of keypoints; extracting a feature map from the input image; producing a combination of the set of keypoint vectors with the feature map; transforming the combination into a set of keypoint mapping vectors according to a predefined mapping scheme; estimating, based on the set of keypoint mapping vectors, a plurality of importance factors associated with the set of keypoints; and classifying the input image based on the set of keypoints and the plurality of importance factors.

Type: Grant

Filed: August 9, 2021

Date of Patent: November 14, 2023

Assignee: ABBYY Development Inc.

Inventors: Ivan Zagaynov, Stanislav Semenov
DETECTION AND IDENTIFICATION OF OBJECTS IN IMAGES

Publication number: 20230206487

Abstract: Aspects of the disclosure provide for mechanisms for identification of objects in images using neural networks. A method of the disclosure includes: obtaining an image, representing each element of a plurality of elements of the image via an input vector of a plurality of input vectors, each input vector having one or more parameters pertaining to visual appearance of a respective element of the image, providing the plurality of input vectors to a first subnetwork of a neural network to obtain a plurality of output vectors, wherein each of the plurality of output vectors is associated with an element of the image, identifying, based on the plurality of output vectors, a sub-plurality of elements of the image as belonging to the image of the object, and determining, based on locations of the sub-plurality of elements, a location of an image of an object within the image.

Type: Application

Filed: February 17, 2023

Publication date: June 29, 2023

Inventors: Ivan Zagaynov, Andrew Zharkov

1 2 3 next