Patents by Inventor Sergey Zlobin

Sergey Zlobin has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Creating flexible structure descriptions of documents with repetitive non-regular structures

Patent number: 9740692

Abstract: Disclosed are systems, computer-readable mediums, and methods for creating a flexible structure description. To create the flexible structure description an image of a document of a particular document type that contains a table is received. An entry describing an item in the table is received. Title elements within the document are searched for based upon the entry. Data fields and anchor elements are detected for the entry. A flexible structure description for the particular document type is generated that includes a set of search elements for each data field in the image of the document and the title elements. The flexible structure description is matched against the image. Data from the image is extracted based upon the matching of the flexible structure description against the image.

Type: Grant

Filed: November 5, 2014

Date of Patent: August 22, 2017

Assignee: ABBYY Development LLC

Inventors: Sergei Golubev, Irene Filimonova, Sergey Zlobin
Method and system of pre-analysis and automated classification of documents

Patent number: 9633257

Abstract: Automatic classification of different types of documents is disclosed. An image of a form or document is captured. The document is assigned to one or more type definitions by identifying one or more objects within the image of the document. A matching model is selected via identification of the document image. In the case of multiple identifications, a profound analysis of the document type is performed—either automatically or manually. An automatic classifier may be trained with document samples of each of a plurality of document classes or document types where the types are known in advance or a system of classes may be formed automatically without a priori information about types of samples. An automatic classifier determines possible features and calculates a range of feature values and possible other feature parameters for each type or class of document. A decision tree, based on rules specified by a user, may be used for classifying documents.

Type: Grant

Filed: June 25, 2014

Date of Patent: April 25, 2017

Assignee: ABBYY DEVELOPMENT LLC

Inventors: Irina Filimonova, Sergey Zlobin, Andrey Myakutin
Flexible structure descriptions for multi-page documents

Patent number: 9390321

Abstract: A method for processing a batch of scanned images is provided. The method comprises processing the scanned images into documents. For documents of multiple pages, the method comprises maintaining a page-based coordinate system to specify a location of structures within a page and joining the pages to form a multi-page sheet having a sheet-based coordinate system to specify a location of structures within the multi-page sheet. The method comprises performing a data extraction operation to extract data from each document, said data extraction operation including a page mode wherein structures are detected on individual pages using the page-based coordinate system and a document mode wherein structures are detected within the entire document using the sheet-based coordinate system.

Type: Grant

Filed: September 23, 2011

Date of Patent: July 12, 2016

Assignee: ABBYY Development LLC

Inventors: Diar Tuganbaev, Marinos Dimostheons, Sergey Zlobin, Irina Filimonova
CREATING FLEXIBLE STRUCTURE DESCRIPTIONS OF DOCUMENTS WITH REPETITIVE NON-REGULAR STRUCTURES

Publication number: 20150058374

Abstract: Disclosed are systems, computer-readable mediums, and methods for creating a flexible structure description. To create the flexible structure description an image of a document of a particular document type that contains a table is received. An entry describing an item in the table is received. Title elements within the document are searched for based upon the entry. Data fields and anchor elements are detected for the entry. A flexible structure description for the particular document type is generated that includes a set of search elements for each data field in the image of the document and the title elements. The flexible structure description is matched against the image. Data from the image is extracted based upon the matching of the flexible structure description against the image.

Type: Application

Filed: November 5, 2014

Publication date: February 26, 2015

Inventors: Sergei Golubev, Irene Filimonova, Sergey Zlobin
Creating flexible structure descriptions

Patent number: 8908969

Abstract: In one embodiment, the invention provides a method, comprising detecting data fields on a scanned document image; generating a flexible document description based on the detected data fields, including creating a set of search elements for each data field, each search element having associated search criteria; and training or modifying the flexible document description using, for example, a search algorithm to detect the data fields on additional training images based on the set of search elements.

Type: Grant

Filed: July 31, 2012

Date of Patent: December 9, 2014

Assignee: ABBYY Development LLC

Inventors: Konstantin Zuev, Irina Filimonova, Sergey Zlobin, Maryana Skuratovskaya
METHOD AND SYSTEM OF PRE-ANALYSIS AND AUTOMATED CLASSIFICATION OF DOCUMENTS

Publication number: 20140307959

Abstract: Automatic classification of different types of documents is disclosed. An image of a form or document is captured. The document is assigned to one or more type definitions by identifying one or more objects within the image of the document. A matching model is selected via identification of the document image. In the case of multiple identifications, a profound analysis of the document type is performed—either automatically or manually. An automatic classifier may be trained with document samples of each of a plurality of document classes or document types where the types are known in advance or a system of classes may be formed automatically without a priori information about types of samples. An automatic classifier determines possible features and calculates a range of feature values and possible other feature parameters for each type or class of document. A decision tree, based on rules specified by a user, may be used for classifying documents.

Type: Application

Filed: June 25, 2014

Publication date: October 16, 2014

Applicant: ABBYY Development LLC

Inventors: Irina Filimonova, Sergey Zlobin, Andrey Myakutin
Method of pre-analysis of a machine-readable form image

Patent number: 8805093

Abstract: In one embodiment, the invention provides a method for a machine to perform machine-readable form pre-recognition analysis. The method comprises preliminarily assigning at least one graphic image in a form for identification of form type, preliminarily creating at least one model of the said graphic image for identification of the form type, parsing a form image into regions, determining an image form type for the form image, comprising: (a) detecting on the form image at least one of said graphic images for identification of the form type, (b) performing a primary identification of the form image type based on a comparison of the detected graphic image with the said model, and(c) performing a profound analysis using a supplementary data said-primary identification results in multiple possibilities for the form image type.

Type: Grant

Filed: December 22, 2010

Date of Patent: August 12, 2014

Assignee: ABBYY Development LLC

Inventors: Konstantin Zuev, Irina Filimonova, Sergey Zlobin
Data capture from multi-page documents

Patent number: 8547589

Abstract: A method for processing a batch of scanned images is provided. The method comprises processing the scanned images into documents; for documents comprising multiple pages maintaining a page-based coordinate system to specify a location of structures within a page and joining the pages to form a multi-page sheet having a sheet-based coordinate system to specify a location of structures within the multi-page sheet; performing a data extraction operation to extract data from each document, said data extraction operation comprising a page mode wherein structures are detected on individual pages using the page-based coordinate system and a document mode wherein structures are detected within the entire document using the sheet-based coordinate system.

Type: Grant

Filed: May 21, 2009

Date of Patent: October 1, 2013

Assignee: ABBYY Software Ltd.

Inventors: Diar Tuganbaev, Sergey Zlobin, Irina Filimonova
Data capture from multi-page documents

Patent number: 8538162

Abstract: A method for processing a batch of scanned images is disclosed. The method includes processing the scanned images into documents. For documents of multiple pages, the method maintains a page-based coordinate system to specify a location of structures within a page and joins the pages to form a multi-page sheet associated with a sheet-based coordinate system to specify a location of structures within the multi-page sheet. Data may be extracted from each document through a page mode wherein structures are detected on individual pages using the page-based coordinate system and a document mode wherein structures are detected within the entire document using the sheet-based coordinate system.

Type: Grant

Filed: March 27, 2012

Date of Patent: September 17, 2013

Assignee: ABBYY Software Ltd.

Inventors: Diar Tuganbaev, Maryana Skuratovskaya, Sergey Zlobin
Creating Flexible Structure Descriptions

Publication number: 20130198615

Abstract: In one embodiment, the invention provides a method, comprising detecting data fields on a scanned document image; generating a flexible document description based on the detected data fields, including creating a set of search elements for each data field, each search element having associated search criteria; and training or modifying the flexible document description using, for example, a search algorithm to detect the data fields on additional training images based on the set of search elements.

Type: Application

Filed: July 31, 2012

Publication date: August 1, 2013

Applicant: ABBYY SOFTWARE LTD.

Inventors: Konstantin Zuev, Irina Filimonova, Sergey Zlobin, Maryana Skuratovskaya
Method and system for creating a form template for a form

Patent number: 8295590

Abstract: A method and system for creating a form template for a form are disclosed. The method comprises analyzing an image of a form to detect object demarcations in the form. The method also comprises classifying the object demarcations into one of a plurality of predefined object categories and processing each object demarcation based on the object category into which it has been classified, thereby to create the form template automatically.

Type: Grant

Filed: August 27, 2008

Date of Patent: October 23, 2012

Assignee: ABBYY Software Ltd.

Inventors: Irina Filimonova, Sergey Zlobin
Creating a document template for capturing data from a document image and capturing data from a document image

Patent number: 8290272

Abstract: In one embodiment, there is disclosed a method capturing data from a document image. The method 300 comprises processing the document image to identify at least one repetitive structure and performing a capturing operation including creating a plurality of instances of the repetitive structure based on once-described structure properties of the repetitive structure in a document template, and populating each instance with corresponding data from the document image. The method may also include creating a document template for capturing data from a document image.

Type: Grant

Filed: September 8, 2008

Date of Patent: October 16, 2012

Assignee: ABBYY Software Ltd.

Inventors: Irina Filimonova, Sergey Zlobin
Flexible Structure Descriptions for Multi-Page Documents

Publication number: 20120243055

Abstract: A method for processing a batch of scanned images is provided. The method comprises processing the scanned images into documents. For documents of multiple pages, the method comprises maintaining a page-based coordinate system to specify a location of structures within a page and joining the pages to form a multi-page sheet having a sheet-based coordinate system to specify a location of structures within the multi-page sheet. The method comprises performing a data extraction operation to extract data from each document, said data extraction operation including a page mode wherein structures are detected on individual pages using the page-based coordinate system and a document mode wherein structures are detected within the entire document using the sheet-based coordinate system.

Type: Application

Filed: September 23, 2011

Publication date: September 27, 2012

Inventors: Diar Tuganbaev, Marinos Dimosthenos, Sergey Zlobin, Irina Filimonova
Method and system for creating flexible structure descriptions

Patent number: 8233714

Abstract: A method related to data capture from forms involving optical character recognition comprises detecting data fields on a scanned image; generating a flexible document description based on the detected data fields, including creating a set of search elements for each data field, each search element having associated search criteria; and training the flexible document description using a search algorithm to detect the data fields on additional training images based on the set of search elements.

Type: Grant

Filed: February 2, 2009

Date of Patent: July 31, 2012

Assignee: ABBYY Software Ltd.

Inventors: Konstantin Zuev, Diar Tuganbaev, Irina Filimonova, Sergey Zlobin
DATA CAPTURE FROM MULTI-PAGE DOCUMENTS

Publication number: 20120183226

Abstract: A method for processing a batch of scanned images is provided. The method comprises processing the scanned images into documents. For documents comprising multiple pages, the method maintains a page-based coordinate system to specify a location of structures within a page and joins the pages to form a multi-page sheet having a sheet-based coordinate system to specify a location of structures within the multi-page sheet. Data may be extracted from each document, such operation comprising a page mode wherein structures are detected on individual pages using the page-based coordinate system and a document mode wherein structures are detected within the entire document using the sheet-based coordinate system.

Type: Application

Filed: March 27, 2012

Publication date: July 19, 2012

Inventors: Diar Tuganbaev, Sergey Zlobin, Irina Filimonova
Method and System of Pre-Analysis and Automated Classification of Documents

Publication number: 20110188759

Abstract: Automatic classification of different types of documents is disclosed. An image of a form or document is captured. The document is assigned to one or more type definitions by identifying one or more objects within the image of the document. A matching model is selected via identification of the document image. In the case of multiple identifications, a profound analysis of the document type is performed—either automatically or manually. An automatic classifier may be trained with document samples of each of a plurality of document classes or document types where the types are known in advance or a system of classes may be formed automatically without a priori information about types of samples. An automatic classifier determines possible features and calculates a range of feature values and possible other feature parameters for each type or class of document. A decision tree, based on rules specified by a user, may be used for classifying documents.

Type: Application

Filed: April 14, 2011

Publication date: August 4, 2011

Inventors: Irina Filimonova, Sergey Zlobin, Andrey Myakutin
METHOD OF PRE-ANALYSIS OF A MACHINE-READABLE FORM IMAGE

Publication number: 20110091109

Abstract: In one embodiment, the invention provides a method for a machine to perform machine-readable form pre-recognition analysis. The method comprises preliminarily assigning at least one graphic image in a form for identification of form type, preliminarily creating at least one model of the said graphic image for identification of the form type, parsing a form image into regions, determining an image form type for the form image, comprising: (a) detecting on the form image at least one of said graphic images for identification of the form type, (b) performing a primary identification of the form image type based on a comparison of the detected graphic image with the said model, and(c) performing a profound analysis using a supplementary data said-primary identification results in multiple possibilities for the form image type.

Type: Application

Filed: December 22, 2010

Publication date: April 21, 2011

Applicant: ABBYY SOFTWARE LTD

Inventors: Konstantin Zuev, Irina Filimonova, Sergey Zlobin
Method of pre-analysis of a machine-readable form image

Patent number: 7881561

Abstract: The present invention relates generally to an optical character recognition of machine-readable forms, and in particular to a verification of a direction of spatial orientation and a definition of a form type of the document electronic image. The goals of the invention are achieved by preliminarily assigning one or more form objects as elements composing a graphic image unambiguously defining its direction of spatial orientation. Similarly, one or more form objects are preliminarily assigned as elements composing a graphic image unambiguously defining its type. The direction of spatial orientation and the type of the form are verified via identification of said images. The models of graphic images either for verification the direction of spatial orientation or for defining the form type are stored in a special data storage means, one of the embodiment of which is form model description.

Type: Grant

Filed: June 26, 2003

Date of Patent: February 1, 2011

Assignee: Abbyy Software Ltd.

Inventors: Konstantin Zuev, Irina Filimonova, Sergey Zlobin
DATA CAPTURE FROM MULTI-PAGE DOCUMENTS

Publication number: 20100060947

Abstract: A method for processing a batch of scanned images is provided. The method comprises processing the scanned images into documents; for documents comprising multiple pages maintaining a page-based coordinate system to specify a location of structures within a page and joining the pages to form a multi-page sheet having a sheet-based coordinate system to specify a location of structures within the multi-page sheet; performing a data extraction operation to extract data from each document, said data extraction operation comprising a page mode wherein structures are detected on individual pages using the page-based coordinate system and a document mode wherein structures are detected within the entire document using the sheet-based coordinate system.

Type: Application

Filed: May 21, 2009

Publication date: March 11, 2010

Inventors: Diar Tuganbaev, Sergey Zlobin, Irina Filimonova
Method and System for Creating Flexible Structure Descriptions

Publication number: 20090175532

Abstract: In one embodiment, the invention provides a method, comprising detecting data fields on a scanned image; generating a flexible document description based on the detected data fields, including creating a set of search elements for each data field, each search element having associated search criteria; and training the flexible document description using a search algorithm to detect the data fields on additional training images based on the set of search elements.

Type: Application

Filed: February 2, 2009

Publication date: July 9, 2009

Inventors: Konstantin Zuev, Diar Tuganbaev, Irina Filimonova, Sergey Zlobin

1 2 next