Patents Assigned to ABBYY DEVELOPMENT INC.
-
Patent number: 12573225
Abstract: Systems and methods are disclosed to receive a training data set comprising a plurality of document images, wherein each document image of the plurality of document images is associated with respective metadata identifying a document field containing a variable text; generate, by processing the plurality of document images, a first heat map represented by a data structure comprising a plurality of heat map elements corresponding to a plurality of document image pixels, wherein each heat map element stores a counter of a number of document images in which the document field contains a document image pixel associated with the heat map element; receive an input document image; and identify, within the input document image, a candidate region comprising the document field, wherein the candidate region comprises a plurality of input document image pixels corresponding to heat map elements satisfying a threshold condition.
Type: Grant
Filed: November 10, 2023
Date of Patent: March 10, 2026
Assignee: ABBYY Development Inc.
Inventors: Stanislav Semenov, Mikhail Lanin
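The heat map described in this abstract can be pictured with a small sketch. It is illustrative only and is not taken from the patent: the function names, the rectangular `Box` representation of a field, and the use of a bounding box for the candidate region are all assumptions.

```python
from typing import List, Optional, Tuple

# Hypothetical field annotation: (left, top, right, bottom); right and bottom are exclusive.
Box = Tuple[int, int, int, int]


def build_heat_map(width: int, height: int, field_boxes: List[Box]) -> List[List[int]]:
    """Count, for every pixel, how many training images place the field over it."""
    heat = [[0] * width for _ in range(height)]
    for left, top, right, bottom in field_boxes:
        for y in range(max(0, top), min(height, bottom)):
            for x in range(max(0, left), min(width, right)):
                heat[y][x] += 1
    return heat


def candidate_region(heat: List[List[int]], min_count: int) -> Optional[Box]:
    """Bounding box of all pixels whose counter is at least min_count, or None."""
    hits = [
        (x, y)
        for y, row in enumerate(heat)
        for x, count in enumerate(row)
        if count >= min_count
    ]
    if not hits:
        return None
    xs = [x for x, _ in hits]
    ys = [y for _, y in hits]
    return (min(xs), min(ys), max(xs) + 1, max(ys) + 1)
```

Each box stands for the labeled field in one training image, and `min_count` plays the role of the threshold condition. A production system would presumably normalize image sizes first, which this sketch leaves out.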
-
Patent number: 12412653
Abstract: Techniques are disclosed for creating event sequences from event data and then providing a visual analysis of event sequences. Event-related data for a set of event sequences is analyzed, and event sequences are grouped. Sequence metrics are calculated for the event sequences, and a user interface is provided to display a visual representation of the set of event sequences and the sequence metrics for the set of event sequences.
Type: Grant
Filed: June 21, 2021
Date of Patent: September 9, 2025
Assignee: ABBYY Development Inc.
Inventors: Scott Opitz, Alex Elkin
-
Patent number: 12412056
Abstract: Aspects and implementations provide for mechanisms of detection and decoding of barcodes in images. The disclosed techniques include estimating dimensions of a module of a barcode based on geometric characteristics of a barcode image, forming hypotheses that group modules into barcode symbols, and assessing viability of formed hypotheses. Various operations of the techniques may involve the use of neural networks, including estimation of module dimensions and assessment of groupings of modules into lines and lines into barcode symbols. The techniques may be used for decoding of barcodes captured in images of unfavorable conditions, including blur, perspective, sub-optimal lighting, barcode deformation, and the like. The techniques may be applied to decoding linear one-dimensional barcodes, two-dimensional barcodes, and stacked linear barcodes.
Type: Grant
Filed: April 15, 2024
Date of Patent: September 9, 2025
Assignee: ABBYY Development Inc.
Inventors: Ivan Zagaynov, Dmitry Zvonarev, Aleksandr Riashchikov
-
Patent number: 12400467
Abstract: Aspects of the disclosure provide for mechanisms for identification of text fields in documents using neural networks. A method of the disclosure includes obtaining vectors representative of objects in a document and processing the vectors to generate key hypotheses associating key(s) with one or more objects and value hypotheses associating value(s) with zero or more objects. The method further includes generating key-value association (KVA) hypotheses associating a selected key hypothesis with a selected value hypothesis and characterized by a KVA likelihood score that is based on at least a key likelihood score associated with the selected key hypothesis and a value likelihood score associated with the selected value hypothesis. The method further includes identifying one or more target KVAs of the document using the KVA likelihood scores of the generated KVA hypotheses.
Type: Grant
Filed: November 21, 2022
Date of Patent: August 26, 2025
Assignee: ABBYY Development Inc.
Inventor: Stanislav Semenov
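One way to read the KVA scoring step is sketched below. The abstract only says the KVA likelihood score is based on at least the key and value likelihood scores, so the product used here, the greedy selection, and every name (`Hypothesis`, `kva_hypotheses`, `select_kvas`) are assumptions made for illustration.

```python
from dataclasses import dataclass
from itertools import product
from typing import List, Tuple


@dataclass(frozen=True)
class Hypothesis:
    """Hypothetical key or value hypothesis with its likelihood score."""
    text: str
    score: float


def kva_hypotheses(keys: List[Hypothesis], values: List[Hypothesis]) -> List[Tuple[float, Hypothesis, Hypothesis]]:
    """Pair every key with every value; assume the KVA score is the product of the two scores."""
    scored = [(k.score * v.score, k, v) for k, v in product(keys, values)]
    return sorted(scored, key=lambda item: item[0], reverse=True)


def select_kvas(keys: List[Hypothesis], values: List[Hypothesis], min_score: float = 0.0) -> List[Tuple[str, str]]:
    """Greedily keep the best-scoring pairs, using each key and each value at most once."""
    used_keys, used_values, chosen = set(), set(), []
    for score, key, value in kva_hypotheses(keys, values):
        if score < min_score or key in used_keys or value in used_values:
            continue
        chosen.append((key.text, value.text))
        used_keys.add(key)
        used_values.add(value)
    return chosen
```

The sketch ignores layout and semantic compatibility between a key and a value, which a real scorer would likely add as further factors.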
-
Patent number: 12387518
Abstract: System and method for document image detection, comprising: producing, using a neural network, a superpixel segmentation map of an input image; generating a superpixel binary mask by associating each superpixel of the superpixel segmentation map with a class of a predetermined set of classes; identifying one or more connected components in the superpixel binary mask; for each connected component of the superpixel binary mask, identifying a corresponding minimum bounding polygon; creating one or more image dividing lines based on the minimum bounding polygons; and defining boundaries of one or more objects of interest based on at least a subset of the image dividing lines.
Type: Grant
Filed: March 11, 2024
Date of Patent: August 12, 2025
Assignee: ABBYY Development Inc.
Inventors: Ivan Zagaynov, Aleksandra Stepina
-
Patent number: 12387370
Abstract: Aspects of the disclosure provide for mechanisms for identification of objects in images using neural networks. A method of the disclosure includes: obtaining an image, representing each element of a plurality of elements of the image via an input vector of a plurality of input vectors, each input vector having one or more parameters pertaining to visual appearance of a respective element of the image, providing the plurality of input vectors to a first subnetwork of a neural network to obtain a plurality of output vectors, wherein each of the plurality of output vectors is associated with an element of the image, identifying, based on the plurality of output vectors, a sub-plurality of elements of the image as belonging to the image of the object, and determining, based on locations of the sub-plurality of elements, a location of an image of an object within the image.
Type: Grant
Filed: February 17, 2023
Date of Patent: August 12, 2025
Assignee: ABBYY Development Inc.
Inventors: Ivan Zagaynov, Andrew Zharkov
-
Patent number: 12373659
Abstract: Aspects and implementations provide for mechanisms of detection and decoding of barcodes in images. The disclosed techniques include estimating dimensions of a module of a barcode based on geometric characteristics of a barcode image, forming hypotheses that group modules into barcode symbols, and assessing viability of formed hypotheses. Various operations of the techniques may involve the use of neural networks, including estimation of module dimensions and assessment of groupings of modules into lines and lines into barcode symbols. The techniques may be used for decoding of barcodes captured in images of unfavorable conditions, including blur, perspective, sub-optimal lighting, barcode deformation, and the like. The techniques may be applied to decoding linear one-dimensional barcodes, two-dimensional barcodes, and stacked linear barcodes.
Type: Grant
Filed: June 10, 2024
Date of Patent: July 29, 2025
Assignee: ABBYY Development Inc.
Inventors: Ivan Zagaynov, Dmitry Zvonarev, Maksim Baranchikov
-
Patent number: 12354397
Abstract: A method of detecting fields in document images includes: receiving a codebook comprising a set of visual words, each visual word corresponding to a center of a cluster of local descriptors; calculating, based on a set of user labeled document images, for each visual word of the codebook, a respective frequency distribution of a field position of a specified labeled field with respect to the visual word; loading a document image for extraction of target fields; calculating a statistical predicate of a possible position of a target field in the document image based on the frequency distributions; and detecting, using the trained model, fields in the document image based on the calculated statistical predicate.
Type: Grant
Filed: November 6, 2023
Date of Patent: July 8, 2025
Assignee: ABBYY Development Inc.
Inventors: Ivan Zagaynov, Vasily Loginov, Stanislav Semenov, Aleksandr Valiukov
-
Patent number: 12314661
Abstract: An example method of language detection includes: identifying a document comprising a plurality of words in one or more natural languages; for each word of at least a subset of words of the document: generating a plurality of sets of tokens representing the word, wherein each set of tokens of the plurality of sets of tokens represents the word using a corresponding plurality of tokens defined for a corresponding natural language of a set of natural languages, and identifying, based on the plurality of sets of tokens, a primary natural language associated with the word; associating each natural language of the set of natural languages with a corresponding word count indicating a number of words of the subset of words for which the natural language has been identified as the primary natural language; identifying, among the set of natural languages, a natural language associated with a maximum word count; and associating the identified natural language with the document.
Type: Grant
Filed: December 16, 2022
Date of Patent: May 27, 2025
Assignee: ABBYY Development Inc.
Inventor: Michael Zatsepin
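The per-word voting scheme in this abstract can be sketched in a few lines. The abstract does not say how a word's primary language is chosen from its token sets, so the rule used here (the language whose vocabulary yields the fewest tokens wins) is an assumption, as are the greedy longest-match tokenizer, the toy vocabularies, and all function names.

```python
from collections import Counter
from typing import Dict, List, Set


def tokenize(word: str, vocab: Set[str]) -> List[str]:
    """Greedy longest-match split; characters not covered by the vocabulary become single tokens."""
    tokens, i = [], 0
    while i < len(word):
        for j in range(len(word), i, -1):
            if word[i:j] in vocab or j == i + 1:
                tokens.append(word[i:j])
                i = j
                break
    return tokens


def primary_language(word: str, vocabs: Dict[str, Set[str]]) -> str:
    """Assumed heuristic: the language that needs the fewest tokens for the word."""
    return min(vocabs, key=lambda lang: len(tokenize(word, vocabs[lang])))


def detect_language(text: str, vocabs: Dict[str, Set[str]]) -> str:
    """Count words per primary language and return the language with the maximum count."""
    counts = Counter(primary_language(word, vocabs) for word in text.lower().split())
    return counts.most_common(1)[0][0]
```

For example, with a hypothetical English vocabulary `{"the", "ing", "read", "er"}` and a German one `{"der", "ung", "les", "en"}`, the text `"reading reader lesung"` gets two English votes and one German vote.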
-
Patent number: 12205391
Abstract: An example method of extracting structured information from document images comprises: receiving a document image; detecting a tabular structure within the document image; identifying a plurality of rows of the tabular structure, wherein each row of the plurality of rows comprises one or more lines; for each row of the plurality of rows, identifying a set of field types of one or more fields comprised by each line of the one or more lines comprised by the respective row; detecting, in each line of the one or more lines, a set of fields corresponding to a respective set of field types; and extracting information from the set of fields.
Type: Grant
Filed: December 27, 2021
Date of Patent: January 21, 2025
Assignee: ABBYY Development Inc.
Inventors: Mikhail Lanin, Stanislav Semenov
-
Patent number: 12190622
Abstract: A computer-implemented method for document clusterization, comprising: receiving an input document; determining, by evaluating a document similarity function, a plurality of similarity measures, wherein each similarity measure of the plurality of similarity measures reflects a degree of similarity between the input document and a corresponding cluster of documents of a plurality of clusters of documents; based on the plurality of similarity measures, determining that the input document does not belong to any of the clusters of documents of the plurality of clusters of documents; creating a new cluster of documents; and associating the input document with the new cluster of documents.
Type: Grant
Filed: November 18, 2020
Date of Patent: January 7, 2025
Assignee: ABBYY Development Inc.
Inventors: Stanislav Semenov, Alexandra Antonova, Aleksey Misyurev
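The "assign or open a new cluster" logic in this abstract can be illustrated as follows. The similarity function is not specified in the abstract; Jaccard similarity over word sets, the representation of a cluster as the union of its documents' words, the 0.5 threshold, and the function names are all assumptions for this sketch.

```python
from typing import List, Set


def jaccard(a: Set[str], b: Set[str]) -> float:
    """Stand-in document similarity function: overlap of two word sets."""
    union = a | b
    return len(a & b) / len(union) if union else 1.0


def assign_to_cluster(doc_words: Set[str], clusters: List[Set[str]], threshold: float = 0.5) -> int:
    """Return the index of the cluster the document joins, creating a new cluster if none is similar enough."""
    similarities = [jaccard(doc_words, cluster) for cluster in clusters]
    if similarities and max(similarities) >= threshold:
        best = similarities.index(max(similarities))
        clusters[best] |= doc_words  # fold the document's words into the matched cluster
        return best
    clusters.append(set(doc_words))  # no cluster qualifies: create a new one
    return len(clusters) - 1
```

Starting from an empty list, the first document always opens cluster 0, and later documents either join an existing cluster or open the next one.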
-
Patent number: 12158900
Abstract: Mechanisms for document processing and analysis can include receiving a document and identifying, in a data structure, a record corresponding to the document. The record can include one or more entries, where each entry contains data reflecting a respective item of information extracted from a corresponding part of the document. The mechanisms can include determining, for each entry of the record, a corresponding degree of association between the entry and a respective item of information referenced by the entry. They can further include updating the corresponding degrees of association, and selecting, among the corresponding degrees of association, a set of corresponding degrees of association whose aggregate degree of association satisfies a criterion.
Type: Grant
Filed: October 28, 2022
Date of Patent: December 3, 2024
Assignee: ABBYY Development Inc.
Inventor: Stanislav Semenov
-
Patent number: 12118816
Abstract: A document processing method includes receiving one or more sets of documents, and assigning each document to one or more basic clusters based on the metadata of the document. It further includes, for each cluster, training a respective basic cluster model detecting one or more visual element types, and responsive to a first threshold criterion measure related to the one or more basic clusters being satisfied, generating one or more superclusters based on an attribute shared by documents comprised by the plurality of basic clusters. The method also includes training a respective supercluster model detecting the one or more element types and generating a generalized cluster from the one or more superclusters. It includes training a generalized model for the generalized cluster, receiving an input document, assigning the input document to corresponding clusters, and detecting visual elements by processing the input document by each of the corresponding models.
Type: Grant
Filed: November 3, 2021
Date of Patent: October 15, 2024
Assignee: ABBYY Development Inc.
Inventors: Stanislav Semenov, Andrei Zyuzin
-
Patent number: 12118813
Abstract: A document processing method includes receiving one or more documents, performing optical character recognition on the one or more documents to detect words comprising symbols in the one or more documents, and determining an encoding value for each of the symbols. It further includes applying a first hash function to each encoding value to generate a first set of hashed symbol values, applying a second hash function to each hashed symbol value to generate a vector array including a second set of hashed symbol values, and applying a linear transformation to each value of the second set of hashed symbol values of the vector array. The method also includes applying an irreversible non-linear activation function to the vector array to obtain abstract values associated with the symbols and saving the abstract values to train a neural network to detect fields in an input document.
Type: Grant
Filed: November 3, 2021
Date of Patent: October 15, 2024
Assignee: ABBYY Development Inc.
Inventor: Stanislav Semenov
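The chain of steps in this abstract (encoding value, two hashes, a linear transformation, a non-invertible activation) can be sketched as below. The abstract names none of the concrete functions, so SHA-256, BLAKE2b, the scale-and-shift transformation with fixed `weight` and `bias`, ReLU as the activation, and the four-element vector size are all assumptions chosen for illustration.

```python
import hashlib
from typing import List


def first_hash(encoding_value: int) -> int:
    """First hash (assumed SHA-256) of a symbol's encoding value, truncated to 32 bits."""
    digest = hashlib.sha256(encoding_value.to_bytes(4, "big")).digest()
    return int.from_bytes(digest[:4], "big")


def second_hash(hashed_value: int, dim: int = 4) -> List[float]:
    """Second hash (assumed BLAKE2b) expanded into a vector of values in [0, 1]."""
    digest = hashlib.blake2b(hashed_value.to_bytes(4, "big"), digest_size=dim).digest()
    return [byte / 255.0 for byte in digest]


def abstract_values(symbol: str, weight: float = 2.0, bias: float = -1.0) -> List[float]:
    """Scale and shift each hashed value, then apply ReLU, which cannot be inverted for clipped inputs."""
    vector = second_hash(first_hash(ord(symbol)))
    return [max(0.0, weight * value + bias) for value in vector]
```

Here the Unicode code point from `ord` stands in for the encoding value. ReLU discards everything below zero, so the original hashed values cannot be fully recovered from the output, which is the property the abstract calls irreversible.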
-
Patent number: 12086647
Abstract: A method for dynamically generating and executing tasks can include executing a worker execution stream, where the worker execution stream includes multiple execution threads associated with a workflow of the workflow service, receiving, by the worker execution stream, from a workflow service, a definition of a task, and responsive to determining that the definition of the task satisfies a predefined criterion, dividing the task into a set of sub-tasks. The method further includes generating a definition of a sub-task workflow for the set of sub-tasks, and causing the workflow service to distribute, based on the definition of the sub-task workflow, the sub-tasks of the set to one or more workers for execution.
Type: Grant
Filed: December 16, 2022
Date of Patent: September 10, 2024
Assignee: ABBYY Development Inc.
Inventors: Vladimir Demidov, Vladimir Bukin, Vladimir Yunev, Alexander Subbotin
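The split-and-redistribute idea in this abstract can be shown with a toy example. The task format, the criterion (more pages than `max_pages`), the sub-task naming, and the shape of the generated workflow definition are all hypothetical; the abstract does not specify any of them.

```python
from typing import Dict, List


def split_task(task: Dict, max_pages: int = 10) -> List[Dict]:
    """Divide a task into sub-tasks when it exceeds the assumed page-count criterion."""
    pages = task["pages"]
    if len(pages) <= max_pages:
        return [task]  # criterion not met: the task runs as a single unit
    return [
        {"id": f"{task['id']}.{n}", "pages": pages[start:start + max_pages]}
        for n, start in enumerate(range(0, len(pages), max_pages))
    ]


def subtask_workflow(task: Dict, max_pages: int = 10) -> Dict:
    """Build a definition that a workflow service could use to hand sub-tasks to workers."""
    subtasks = split_task(task, max_pages)
    return {
        "parent": task["id"],
        "steps": [{"run": sub["id"], "pages": sub["pages"]} for sub in subtasks],
    }
```

A 25-page task with `max_pages=10` yields three steps holding 10, 10, and 5 pages, while a task at or under the limit yields a single step.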
-
Patent number: 12046016
Abstract: A method of the disclosure includes receiving, by a processing device, a document image, dividing the document image into a plurality of patches and determining, for each patch, whether the patch is monochromatic or polychromatic. It further includes clusterizing a plurality of monochromatic patches into a plurality of clusters within a color space, wherein each cluster corresponds to a color layer of a plurality of color layers of the document image, and segmenting each polychromatic patch into a corresponding plurality of monochromatic segments. The method also includes, for each polychromatic patch, associating each monochromatic segment of the corresponding plurality of monochromatic segments with a cluster of the plurality of clusters, and utilizing the plurality of clusters for performing an information extraction task on the document image.
Type: Grant
Filed: December 9, 2021
Date of Patent: July 23, 2024
Assignee: ABBYY Development Inc.
Inventors: Vadim Mikhonov, Ivan Zagaynov
-
Patent number: 12008431
Abstract: Aspects and implementations provide for mechanisms of detection and decoding of barcodes in images. The disclosed techniques include estimating dimensions of a module of a barcode based on geometric characteristics of a barcode image, forming hypotheses that group modules into barcode symbols, and assessing viability of formed hypotheses. Various operations of the techniques may involve the use of neural networks, including estimation of module dimensions and assessment of groupings of modules into lines and lines into barcode symbols. The techniques may be used for decoding of barcodes captured in images of unfavorable conditions, including blur, perspective, sub-optimal lighting, barcode deformation, and the like. The techniques may be applied to decoding linear one-dimensional barcodes, two-dimensional barcodes, and stacked linear barcodes.
Type: Grant
Filed: May 16, 2022
Date of Patent: June 11, 2024
Assignee: ABBYY Development Inc.
Inventors: Ivan Zagaynov, Dmitry Zvonarev, Maksim Baranchikov
-
Patent number: 11972626
Abstract: System and method for document image detection, comprising: producing, using a neural network, a superpixel segmentation map of an input image; generating a superpixel binary mask by associating each superpixel of the superpixel segmentation map with a class of a predetermined set of classes; identifying one or more connected components in the superpixel binary mask; for each connected component of the superpixel binary mask, identifying a corresponding minimum bounding polygon; creating one or more image dividing lines based on the minimum bounding polygons; and defining boundaries of one or more objects of interest based on at least a subset of the image dividing lines.
Type: Grant
Filed: December 24, 2020
Date of Patent: April 30, 2024
Assignee: ABBYY Development Inc.
Inventors: Ivan Zagaynov, Aleksandra Stepina
-
Patent number: 11960966
Abstract: Aspects and implementations provide for mechanisms of detection and decoding of barcodes in images. The disclosed techniques include estimating dimensions of a module of a barcode based on geometric characteristics of a barcode image, forming hypotheses that group modules into barcode symbols, and assessing viability of formed hypotheses. Various operations of the techniques may involve the use of neural networks, including estimation of module dimensions and assessment of groupings of modules into lines and lines into barcode symbols. The techniques may be used for decoding of barcodes captured in images of unfavorable conditions, including blur, perspective, sub-optimal lighting, barcode deformation, and the like. The techniques may be applied to decoding linear one-dimensional barcodes, two-dimensional barcodes, and stacked linear barcodes.
Type: Grant
Filed: May 16, 2022
Date of Patent: April 16, 2024
Assignee: ABBYY Development Inc.
Inventors: Ivan Zagaynov, Dmitry Zvonarev, Aleksandr Riashchikov
-
Patent number: D1089276
Type: Grant
Filed: November 30, 2022
Date of Patent: August 19, 2025
Assignee: ABBYY Development Inc.
Inventors: Julia Karpukhina, Irene Epanechnikova