Patents by Inventor Ivan Zagaynov
Ivan Zagaynov has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Publication number: 20240144711
Abstract: Aspects and implementations provide for mechanisms of detection of fields in electronic documents and determination of values of the detected field. The disclosed techniques include obtaining an input into a machine learning model (MLM), the input including a first image of a field extracted from a document and depicting one or more static elements of the field and a field value, the input further including a second image of the field. The input may be processed using the MLM to identify one or more static regions that correspond to static elements of the field. The identified static regions may be used to modify the first image so that the static regions are removed or have a reduced visibility. The modified image may be used to determine the field value.
Type: Application
Filed: October 31, 2022
Publication date: May 2, 2024
Inventors: Ivan Zagaynov, Stanislav Semenov, Alena Dedigurova
-
Patent number: 11972626
Abstract: System and method for document image detection, comprising: producing, using a neural network, a superpixel segmentation map of an input image; generating a superpixel binary mask by associating each superpixel of the superpixel segmentation map with a class of a predetermined set of classes; identifying one or more connected components in the superpixel binary mask; for each connected component of the superpixel binary mask, identifying a corresponding minimum bounding polygon; creating one or more image dividing lines based on the minimum bounding polygons; and defining boundaries of one or more objects of interest based on at least a subset of the image dividing lines.
Type: Grant
Filed: December 24, 2020
Date of Patent: April 30, 2024
Assignee: ABBYY Development Inc.
Inventors: Ivan Zagaynov, Aleksandra Stepina
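The connected-component and bounding-shape steps named in this abstract can be illustrated with a minimal sketch. It assumes the binary mask already exists as a grid of 0/1 values (the neural-network segmentation is not shown), assumes 4-connectivity, and substitutes an axis-aligned bounding box for the minimum bounding polygon. The function names are hypothetical and are not taken from the patent.

```python
from collections import deque


def connected_components(mask):
    """Return a list of components, each a list of (row, col) cells equal to 1.

    Assumption: 4-connectivity; the abstract does not state the connectivity.
    """
    rows, cols = len(mask), len(mask[0])
    seen = [[False] * cols for _ in range(rows)]
    components = []
    for r in range(rows):
        for c in range(cols):
            if mask[r][c] != 1 or seen[r][c]:
                continue
            # Breadth-first flood fill starting from an unvisited foreground cell.
            queue = deque([(r, c)])
            seen[r][c] = True
            cells = []
            while queue:
                y, x = queue.popleft()
                cells.append((y, x))
                for ny, nx in ((y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1)):
                    if (0 <= ny < rows and 0 <= nx < cols
                            and mask[ny][nx] == 1 and not seen[ny][nx]):
                        seen[ny][nx] = True
                        queue.append((ny, nx))
            components.append(cells)
    return components


def bounding_box(cells):
    """Axis-aligned box (top, left, bottom, right); a stand-in for a bounding polygon."""
    ys = [y for y, _ in cells]
    xs = [x for _, x in cells]
    return (min(ys), min(xs), max(ys), max(xs))
```

Calling `[bounding_box(c) for c in connected_components(mask)]` gives one box per foreground region, which a later step could use to derive dividing lines.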
-
Patent number: 11960966
Abstract: Aspects and implementations provide for mechanisms of detection and decoding of barcodes in images. The disclosed techniques include estimating dimensions of a module of a barcode based on geometric characteristics of a barcode image, forming hypotheses that group modules into barcode symbols, and assessing viability of formed hypotheses. Various operations of the techniques may involve the use of neural networks, including estimation of module dimensions and assessment of groupings of modules into lines and lines into barcode symbols. The techniques may be used for decoding of barcodes captured in images of unfavorable conditions, including blur, perspective, sub-optimal lighting, barcode deformation, and the like. The techniques may be applied to decoding linear one-dimensional barcodes, two-dimensional barcodes, and stacked linear barcodes.
Type: Grant
Filed: May 16, 2022
Date of Patent: April 16, 2024
Assignee: ABBYY Development Inc.
Inventors: Ivan Zagaynov, Dmitry Zvonarev, Aleksandr Riashchikov
-
Patent number: 11948385
Abstract: A computer-implemented method for image capture by a mobile device, comprising: receiving, by a video capturing application running on a mobile device, a video stream from a camera of the mobile device; identifying a specific frame of the video stream; generating a plurality of hypotheses defining image borders within the specific frame; selecting, by a neural network, a particular hypothesis among the plurality of hypotheses; producing a candidate image by applying the particular hypothesis to the specific frame; determining a value of a quality metric of the candidate image; determining that the value of the quality metric of the candidate image exceeds one or more values of the quality metric of one or more previously processed images extracted from the video stream; wherein the image capture application is a zero-footprint application.
Type: Grant
Filed: May 23, 2022
Date of Patent: April 2, 2024
Assignee: ABBYY Development Inc.
Inventors: Ivan Zagaynov, Stepan Lobastov, Juri Katkov, Vasily Shahov, Olga Titova, Ivan Khintsitskiy
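The final comparison in this abstract, where a candidate is retained when its quality value exceeds those of previously processed images, can be read as a running-maximum filter. The sketch below shows only that one reading. The border hypotheses and the neural-network selection are not shown, and `select_best_candidates` and the `quality` callback are hypothetical names, not part of the patent.

```python
def select_best_candidates(frames, quality):
    """Yield (index, frame, score) each time a frame beats every earlier score.

    Assumption: `quality` is any callable returning a comparable number;
    the abstract does not define the metric.
    """
    best = float("-inf")
    for index, frame in enumerate(frames):
        score = quality(frame)
        if score > best:
            best = score
            yield index, frame, score
```

Because the function is a generator, it can consume a live stream of frames and emit a new candidate only when one improves on everything seen so far.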
-
Publication number: 20240078828
Abstract: A method of detecting fields in document images includes: receiving a codebook comprising a set of visual words, each visual word corresponding to a center of a cluster of local descriptors; calculating, based on a set of user labeled document images, for each visual word of the codebook, a respective frequency distribution of a field position of a specified labeled field with respect to the visual word; loading a document image for extraction of target fields; calculating a statistical predicate of a possible position of a target field in the document image based on the frequency distributions; and detecting, using the trained model, fields in the document image based on the calculated statistical predicate.
Type: Application
Filed: November 6, 2023
Publication date: March 7, 2024
Inventors: Ivan Zagaynov, Vasily Loginov, Stanislav Semenov, Aleksandr Valiukov
-
Patent number: 11893818
Abstract: A method of generating and optimizing a codebook for document analysis comprises: receiving a first set of document images; extracting a plurality of keypoint regions from each document image of the first set of document images; calculating local descriptors for each keypoint region of the extracted keypoint regions; clustering the local descriptors such that each center of a cluster of local descriptors corresponds to a respective visual word; generating a codebook containing a set of visual words; and optimizing the codebook by maximizing mutual information (MI) between a target field of a second set of document images and at least one visual word of the set of visual words.
Type: Grant
Filed: July 26, 2021
Date of Patent: February 6, 2024
Assignee: ABBYY Development Inc.
Inventors: Ivan Zagaynov, Vasily Loginov, Stanislav Semenov, Aleksandr Valiukov
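The clustering step in this abstract, where each cluster center becomes a visual word, can be sketched with plain k-means. This is an assumption: the abstract says only "clustering" and does not name an algorithm. Keypoint extraction, descriptor computation, and the mutual-information optimization are not shown, and `build_codebook` and `nearest_word` are hypothetical names.

```python
def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def nearest_word(descriptor, codebook):
    """Index of the visual word (cluster center) closest to the descriptor."""
    return min(range(len(codebook)), key=lambda k: sq_dist(descriptor, codebook[k]))


def build_codebook(descriptors, initial_centers, iterations=10):
    """Run k-means (an assumed choice) and return the centers as visual words."""
    centers = [list(c) for c in initial_centers]
    for _ in range(iterations):
        clusters = [[] for _ in centers]
        for d in descriptors:
            clusters[nearest_word(d, centers)].append(d)
        for k, members in enumerate(clusters):
            if members:  # keep the old center if a cluster ends up empty
                centers[k] = [sum(v) / len(members) for v in zip(*members)]
    return centers
```

Once built, the codebook quantizes any new descriptor to a word index via `nearest_word`, which is the representation later steps would score against target fields.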
-
Patent number: 11893784
Abstract: Aspects of the disclosure provide for systems and processes for assessing image quality for optical character recognition (OCR), including but not limited to: segmenting an image into patches, providing the segmented image as an input into a first machine learning model (MLM), obtaining, using the first MLM, for each patch, first feature vectors representative of a reduction of imaging quality in a respective patch, and second feature vectors representative of a text content of the respective patch, providing to a second MLM the first feature vectors and the second feature vectors, and obtaining, using the second MLM, an indication of suitability of the image for OCR.
Type: Grant
Filed: May 20, 2021
Date of Patent: February 6, 2024
Assignee: ABBYY Development Inc.
Inventors: Ivan Zagaynov, Dmitry Rodin, Vasily Loginov
-
Publication number: 20230367984
Abstract: Aspects and implementations provide for mechanisms of detection and decoding of barcodes in images. The disclosed techniques include estimating dimensions of a module of a barcode based on geometric characteristics of a barcode image, forming hypotheses that group modules into barcode symbols, and assessing viability of formed hypotheses. Various operations of the techniques may involve the use of neural networks, including estimation of module dimensions and assessment of groupings of modules into lines and lines into barcode symbols. The techniques may be used for decoding of barcodes captured in images of unfavorable conditions, including blur, perspective, sub-optimal lighting, barcode deformation, and the like. The techniques may be applied to decoding linear one-dimensional barcodes, two-dimensional barcodes, and stacked linear barcodes.
Type: Application
Filed: May 16, 2022
Publication date: November 16, 2023
Inventors: Ivan Zagaynov, Dmitry Zvonarev, Maksim Baranchikov
-
Publication number: 20230367983
Abstract: Aspects and implementations provide for mechanisms of detection and decoding of barcodes in images. The disclosed techniques include estimating dimensions of a module of a barcode based on geometric characteristics of a barcode image, forming hypotheses that group modules into barcode symbols, and assessing viability of formed hypotheses. Various operations of the techniques may involve the use of neural networks, including estimation of module dimensions and assessment of groupings of modules into lines and lines into barcode symbols. The techniques may be used for decoding of barcodes captured in images of unfavorable conditions, including blur, perspective, sub-optimal lighting, barcode deformation, and the like. The techniques may be applied to decoding linear one-dimensional barcodes, two-dimensional barcodes, and stacked linear barcodes.
Type: Application
Filed: May 16, 2022
Publication date: November 16, 2023
Inventors: Ivan Zagaynov, Dmitry Zvonarev, Aleksandr Riashchikov
-
Patent number: 11816909
Abstract: An example method of document classification comprises: detecting a set of keypoints in an input image; generating a set of keypoint vectors, wherein each keypoint vector of the set of keypoint vectors is associated with a corresponding keypoint of the set of keypoints; extracting a feature map from the input image; producing a combination of the set of keypoint vectors with the feature map; transforming the combination into a set of keypoint mapping vectors according to a predefined mapping scheme; estimating, based on the set of keypoint mapping vectors, a plurality of importance factors associated with the set of keypoints; and classifying the input image based on the set of keypoints and the plurality of importance factors.
Type: Grant
Filed: August 9, 2021
Date of Patent: November 14, 2023
Assignee: ABBYY Development Inc.
Inventors: Ivan Zagaynov, Stanislav Semenov
-
Publication number: 20230206487
Abstract: Aspects of the disclosure provide for mechanisms for identification of objects in images using neural networks. A method of the disclosure includes: obtaining an image, representing each element of a plurality of elements of the image via an input vector of a plurality of input vectors, each input vector having one or more parameters pertaining to visual appearance of a respective element of the image, providing the plurality of input vectors to a first subnetwork of a neural network to obtain a plurality of output vectors, wherein each of the plurality of output vectors is associated with an element of the image, identifying, based on the plurality of output vectors, a sub-plurality of elements of the image as belonging to the image of the object, and determining, based on locations of the sub-plurality of elements, a location of an image of an object within the image.
Type: Application
Filed: February 17, 2023
Publication date: June 29, 2023
Inventors: Ivan Zagaynov, Andrew Zharkov
-
Publication number: 20230186592
Abstract: A method of the disclosure includes receiving, by a processing device, a document image, dividing the document image into a plurality of patches and determining, for each patch, whether the patch is monochromatic or polychromatic. It further includes clusterizing a plurality of monochromatic patches into a plurality of clusters within a color space, wherein each cluster corresponds to a color layer of a plurality of color layers of the document image, and segmenting each polychromatic patch into a corresponding plurality of monochromatic segments. The method also includes, for each polychromatic patch, associating each monochromatic segment of the corresponding plurality of monochromatic segments with a cluster of the plurality of clusters, and utilizing the plurality of clusters for performing an information extraction task on the document image.
Type: Application
Filed: December 9, 2021
Publication date: June 15, 2023
Inventors: Vadim Mikhonov, Ivan Zagaynov
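The first two steps of this abstract, dividing the image into patches and labeling each as monochromatic or polychromatic, can be sketched as follows. The decision rule used here (every RGB channel varies by no more than a tolerance) is an assumption chosen for illustration, since the abstract does not state the criterion. The clustering and segmentation steps are not shown, and both function names are hypothetical.

```python
def split_into_patches(image, size):
    """Split a 2D grid of (r, g, b) pixels into size x size tiles.

    Returns a list of (top, left, pixels); edge tiles may be smaller.
    """
    patches = []
    for top in range(0, len(image), size):
        for left in range(0, len(image[0]), size):
            pixels = [px
                      for row in image[top:top + size]
                      for px in row[left:left + size]]
            patches.append((top, left, pixels))
    return patches


def is_monochromatic(pixels, tolerance=16):
    """Assumed rule: a patch is monochromatic if each channel's range <= tolerance."""
    for channel in range(3):
        values = [p[channel] for p in pixels]
        if max(values) - min(values) > tolerance:
            return False
    return True
```

Patches for which `is_monochromatic` returns `True` would feed the color-space clustering, while the rest would be segmented further.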
-
Patent number: 11587216
Abstract: Aspects of the disclosure provide for mechanisms for identification of objects in images using neural networks. A method of the disclosure includes: obtaining an image, representing each element of a plurality of elements of the image via an input vector of a plurality of input vectors, each input vector having one or more parameters pertaining to visual appearance of a respective element of the image, providing the plurality of input vectors to a first subnetwork of a neural network to obtain a plurality of output vectors, wherein each of the plurality of output vectors is associated with an element of the image, identifying, based on the plurality of output vectors, a sub-plurality of elements of the image as belonging to the image of the object, and determining, based on locations of the sub-plurality of elements, a location of an image of an object within the image.
Type: Grant
Filed: January 22, 2020
Date of Patent: February 21, 2023
Assignee: ABBYY Development Inc.
Inventors: Ivan Zagaynov, Andrew Zharkov
-
Publication number: 20230038097
Abstract: An example method of document classification comprises: detecting a set of keypoints in an input image; generating a set of keypoint vectors, wherein each keypoint vector of the set of keypoint vectors is associated with a corresponding keypoint of the set of keypoints; extracting a feature map from the input image; producing a combination of the set of keypoint vectors with the feature map; transforming the combination into a set of keypoint mapping vectors according to a predefined mapping scheme; estimating, based on the set of keypoint mapping vectors, a plurality of importance factors associated with the set of keypoints; and classifying the input image based on the set of keypoints and the plurality of importance factors.
Type: Application
Filed: August 9, 2021
Publication date: February 9, 2023
Inventors: Ivan Zagaynov, Stanislav Semenov
-
Publication number: 20230028992
Abstract: A method of generating and optimizing a codebook for document analysis comprises: receiving a first set of document images; extracting a plurality of keypoint regions from each document image of the first set of document images; calculating local descriptors for each keypoint region of the extracted keypoint regions; clustering the local descriptors such that each center of a cluster of local descriptors corresponds to a respective visual word; generating a codebook containing a set of visual words; and optimizing the codebook by maximizing mutual information (MI) between a target field of a second set of document images and at least one visual word of the set of visual words.
Type: Application
Filed: July 26, 2021
Publication date: January 26, 2023
Inventors: Ivan Zagaynov, Vasily Loginov, Stanislav Semenov
-
Publication number: 20220366179
Abstract: Aspects of the disclosure provide for systems and processes for assessing image quality for optical character recognition (OCR), including but not limited to: segmenting an image into patches, providing the segmented image as an input into a first machine learning model (MLM), obtaining, using the first MLM, for each patch, first feature vectors representative of a reduction of imaging quality in a respective patch, and second feature vectors representative of a text content of the respective patch, providing to a second MLM the first feature vectors and the second feature vectors, and obtaining, using the second MLM, an indication of suitability of the image for OCR.
Type: Application
Filed: May 20, 2021
Publication date: November 17, 2022
Inventors: Ivan Zagaynov, Dmitry Rodin, Vasily Loginov
-
Publication number: 20220284723
Abstract: A computer-implemented method for image capture by a mobile device, comprising: receiving, by a video capturing application running on a mobile device, a video stream from a camera of the mobile device; identifying a specific frame of the video stream; generating a plurality of hypotheses defining image borders within the specific frame; selecting, by a neural network, a particular hypothesis among the plurality of hypotheses; producing a candidate image by applying the particular hypothesis to the specific frame; determining a value of a quality metric of the candidate image; determining that the value of the quality metric of the candidate image exceeds one or more values of the quality metric of one or more previously processed images extracted from the video stream; wherein the image capture application is a zero-footprint application.
Type: Application
Filed: May 23, 2022
Publication date: September 8, 2022
Inventors: Ivan Zagaynov, Stepan Lobastov, Juri Katkov, Vasily Shahov, Olga Titova, Ivan Khintsitskiy
-
Patent number: 11380117
Abstract: A computer-implemented method for image capture by a mobile device, comprising: receiving, by a video capturing application running on a mobile device, a video stream from a camera of the mobile device; identifying a specific frame of the video stream; generating a plurality of hypotheses defining image borders within the specific frame; selecting, by a neural network, a particular hypothesis among the plurality of hypotheses; producing a candidate image by applying the particular hypothesis to the specific frame; determining a value of a quality metric of the candidate image; determining that the value of the quality metric of the candidate image exceeds one or more values of the quality metric of one or more previously processed images extracted from the video stream; wherein the image capture application is a zero-footprint application.
Type: Grant
Filed: December 29, 2020
Date of Patent: July 5, 2022
Assignee: ABBYY Development Inc.
Inventors: Ivan Zagaynov, Stepan Lobastov, Juri Katkov, Vasily Shahov, Olga Titova, Ivan Khintsitskiy
-
Publication number: 20220198187
Abstract: System and method for document image detection, comprising: producing, using a neural network, a superpixel segmentation map of an input image; generating a superpixel binary mask by associating each superpixel of the superpixel segmentation map with a class of a predetermined set of classes; identifying one or more connected components in the superpixel binary mask; for each connected component of the superpixel binary mask, identifying a corresponding minimum bounding polygon; creating one or more image dividing lines based on the minimum bounding polygons; and defining boundaries of one or more objects of interest based on at least a subset of the image dividing lines.
Type: Application
Filed: December 24, 2020
Publication date: June 23, 2022
Inventors: Ivan Zagaynov, Aleksandra Stepina
-
Publication number: 20220198188
Abstract: A computer-implemented method for image capture by a mobile device, comprising: receiving, by a video capturing application running on a mobile device, a video stream from a camera of the mobile device; identifying a specific frame of the video stream; generating a plurality of hypotheses defining image borders within the specific frame; selecting, by a neural network, a particular hypothesis among the plurality of hypotheses; producing a candidate image by applying the particular hypothesis to the specific frame; determining a value of a quality metric of the candidate image; determining that the value of the quality metric of the candidate image exceeds one or more values of the quality metric of one or more previously processed images extracted from the video stream; wherein the image capture application is a zero-footprint application.
Type: Application
Filed: December 29, 2020
Publication date: June 23, 2022
Inventors: Ivan Zagaynov, Stepan Lobastov, Juri Katkov, Vasily Shahov, Olga Titova, Ivan Khintsitskiy