Patents by Inventor Irina Filimonova
Irina Filimonova has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 9633257Abstract: Automatic classification of different types of documents is disclosed. An image of a form or document is captured. The document is assigned to one or more type definitions by identifying one or more objects within the image of the document. A matching model is selected via identification of the document image. In the case of multiple identifications, a profound analysis of the document type is performed—either automatically or manually. An automatic classifier may be trained with document samples of each of a plurality of document classes or document types where the types are known in advance or a system of classes may be formed automatically without a priori information about types of samples. An automatic classifier determines possible features and calculates a range of feature values and possible other feature parameters for each type or class of document. A decision tree, based on rules specified by a user, may be used for classifying documents.Type: GrantFiled: June 25, 2014Date of Patent: April 25, 2017Assignee: ABBYY DEVELOPMENT LLCInventors: Irina Filimonova, Sergey Zlobin, Andrey Myakutin
-
Patent number: 9390321Abstract: A method for processing a batch of scanned images is provided. The method comprises processing the scanned images into documents. For documents of multiple pages, the method comprises maintaining a page-based coordinate system to specify a location of structures within a page and joining the pages to form a multi-page sheet having a sheet-based coordinate system to specify a location of structures within the multi-page sheet. The method comprises performing a data extraction operation to extract data from each document, said data extraction operation including a page mode wherein structures are detected on individual pages using the page-based coordinate system and a document mode wherein structures are detected within the entire document using the sheet-based coordinate system.Type: GrantFiled: September 23, 2011Date of Patent: July 12, 2016Assignee: ABBYY Development LLCInventors: Diar Tuganbaev, Marinos Dimostheons, Sergey Zlobin, Irina Filimonova
-
Patent number: 9015573Abstract: Methods for processing machine-readable forms or documents of non-fixed format are disclosed. The methods make use of, for example, a structural description of characteristics of document elements, a description of a logical structure of the document, and methods of searching for document elements by using the structural description. A structural description of the spatial and parametric characteristics of document elements and the logical connections between elements may include a hierarchical logical structure of the elements, specification of an algorithm of determining the search constraints, specification of characteristics of searched elements, and specification of a set of parameters for a compound element identified on the basis of the aggregate of its components. The method of describing the logical structure of a document and methods of searching for elements of a document may be based on the use of the structural description.Type: GrantFiled: April 17, 2012Date of Patent: April 21, 2015Assignee: ABBYY Development LLCInventors: Konstantin Zuev, Diar Tuganbaev, Irina Filimonova
-
Patent number: 8908969Abstract: In one embodiment, the invention provides a method, comprising detecting data fields on a scanned document image; generating a flexible document description based on the detected data fields, including creating a set of search elements for each data field, each search element having associated search criteria; and training or modifying the flexible document description using, for example, a search algorithm to detect the data fields on additional training images based on the set of search elements.Type: GrantFiled: July 31, 2012Date of Patent: December 9, 2014Assignee: ABBYY Development LLCInventors: Konstantin Zuev, Irina Filimonova, Sergey Zlobin, Maryana Skuratovskaya
-
Publication number: 20140307959Abstract: Automatic classification of different types of documents is disclosed. An image of a form or document is captured. The document is assigned to one or more type definitions by identifying one or more objects within the image of the document. A matching model is selected via identification of the document image. In the case of multiple identifications, a profound analysis of the document type is performed—either automatically or manually. An automatic classifier may be trained with document samples of each of a plurality of document classes or document types where the types are known in advance or a system of classes may be formed automatically without a priori information about types of samples. An automatic classifier determines possible features and calculates a range of feature values and possible other feature parameters for each type or class of document. A decision tree, based on rules specified by a user, may be used for classifying documents.Type: ApplicationFiled: June 25, 2014Publication date: October 16, 2014Applicant: ABBYY Development LLCInventors: Irina Filimonova, Sergey Zlobin, Andrey Myakutin
-
Patent number: 8805093Abstract: In one embodiment, the invention provides a method for a machine to perform machine-readable form pre-recognition analysis. The method comprises preliminarily assigning at least one graphic image in a form for identification of form type, preliminarily creating at least one model of the said graphic image for identification of the form type, parsing a form image into regions, determining an image form type for the form image, comprising: (a) detecting on the form image at least one of said graphic images for identification of the form type, (b) performing a primary identification of the form image type based on a comparison of the detected graphic image with the said model, and(c) performing a profound analysis using a supplementary data said-primary identification results in multiple possibilities for the form image type.Type: GrantFiled: December 22, 2010Date of Patent: August 12, 2014Assignee: ABBYY Development LLCInventors: Konstantin Zuev, Irina Filimonova, Sergey Zlobin
-
Patent number: 8750571Abstract: Embodiments of the invention disclose techniques for processing of machine-readable forms of unfixed or flexible format. An auxiliary brief description may be optionally specified to determine the spatial orientation of the image. A method of searching for elements of a document comprises the following main operations in addition to the operations of preliminary image processing: selecting the varieties of structural description from several available variants, determining the orientation of the image, selecting the text objects, where the text must be recognized, and determining the minimal required volume of recognition, recognizing the text objects, searching for elements of the form. Searching for elements of the form comprises the following actions: selecting a searched element in the structural description, gaining the algorithm of search constraints from the structural description, searching for the element, testing the obtained variants.Type: GrantFiled: August 9, 2013Date of Patent: June 10, 2014Assignee: ABBYY Development LLCInventors: Konstantin Zuev, Diar Tuganbaev, Irina Filimonova
-
Publication number: 20130322773Abstract: Embodiments of the invention disclose techniques for processing of machine-readable forms of unfixed or flexible format. An auxiliary brief description may be optionally specified to determine the spatial orientation of the image. A method of searching for elements of a document comprises the following main operations in addition to the operations of preliminary image processing: selecting the varieties of structural description from several available variants, determining the orientation of the image, selecting the text objects, where the text must be recognized, and determining the minimal required volume of recognition, recognizing the text objects, searching for elements of the form. Searching for elements of the form comprises the following actions: selecting a searched element in the structural description, gaining the algorithm of search constraints from the structural description, searching for the element, testing the obtained variants.Type: ApplicationFiled: August 9, 2013Publication date: December 5, 2013Applicant: ABBYY Production LLCInventors: Konstantin Zuev, Diar Tuganbaev, Irina Filimonova
-
Patent number: 8571262Abstract: Embodiments of the invention disclose techniques for processing of machine-readable forms of unfixed or flexible format. An auxiliary brief description may be optionally specified to determine the spatial orientation of the image. A method of searching for elements of a document comprises the following main operations in addition to the operations of preliminary image processing: selecting the varieties of structural description from several available variants, determining the orientation of the image, selecting the text objects, where the text must be recognized, and determining the minimal required volume of recognition, recognizing the text objects, searching for elements of the form. Searching for elements of the form comprises the following actions: selecting a searched element in the structural description, gaining the algorithm of search constraints from the structural description, searching for the element, testing the obtained variants.Type: GrantFiled: September 8, 2010Date of Patent: October 29, 2013Assignee: ABBYY Development LLCInventors: Konstantin Zuev, Diar Tuganbaev, Irina Filimonova
-
Patent number: 8547589Abstract: A method for processing a batch of scanned images is provided. The method comprises processing the scanned images into documents; for documents comprising multiple pages maintaining a page-based coordinate system to specify a location of structures within a page and joining the pages to form a multi-page sheet having a sheet-based coordinate system to specify a location of structures within the multi-page sheet; performing a data extraction operation to extract data from each document, said data extraction operation comprising a page mode wherein structures are detected on individual pages using the page-based coordinate system and a document mode wherein structures are detected within the entire document using the sheet-based coordinate system.Type: GrantFiled: May 21, 2009Date of Patent: October 1, 2013Assignee: ABBYY Software Ltd.Inventors: Diar Tuganbaev, Sergey Zlobin, Irina Filimonova
-
Publication number: 20130198615Abstract: In one embodiment, the invention provides a method, comprising detecting data fields on a scanned document image; generating a flexible document description based on the detected data fields, including creating a set of search elements for each data field, each search element having associated search criteria; and training or modifying the flexible document description using, for example, a search algorithm to detect the data fields on additional training images based on the set of search elements.Type: ApplicationFiled: July 31, 2012Publication date: August 1, 2013Applicant: ABBYY SOFTWARE LTD.Inventors: Konstantin Zuev, Irina Filimonova, Sergey Zlobin, Maryana Skuratovskaya
-
Patent number: 8295590Abstract: A method and system for creating a form template for a form are disclosed. The method comprises analyzing an image of a form to detect object demarcations in the form. The method also comprises classifying the object demarcations into one of a plurality of predefined object categories and processing each object demarcation based on the object category into which it has been classified, thereby to create the form template automatically.Type: GrantFiled: August 27, 2008Date of Patent: October 23, 2012Assignee: ABBYY Software Ltd.Inventors: Irina Filimonova, Sergey Zlobin
-
Patent number: 8290272Abstract: In one embodiment, there is disclosed a method capturing data from a document image. The method 300 comprises processing the document image to identify at least one repetitive structure and performing a capturing operation including creating a plurality of instances of the repetitive structure based on once-described structure properties of the repetitive structure in a document template, and populating each instance with corresponding data from the document image. The method may also include creating a document template for capturing data from a document image.Type: GrantFiled: September 8, 2008Date of Patent: October 16, 2012Assignee: ABBYY Software Ltd.Inventors: Irina Filimonova, Sergey Zlobin
-
Publication number: 20120243055Abstract: A method for processing a batch of scanned images is provided. The method comprises processing the scanned images into documents. For documents of multiple pages, the method comprises maintaining a page-based coordinate system to specify a location of structures within a page and joining the pages to form a multi-page sheet having a sheet-based coordinate system to specify a location of structures within the multi-page sheet. The method comprises performing a data extraction operation to extract data from each document, said data extraction operation including a page mode wherein structures are detected on individual pages using the page-based coordinate system and a document mode wherein structures are detected within the entire document using the sheet-based coordinate system.Type: ApplicationFiled: September 23, 2011Publication date: September 27, 2012Inventors: Diar Tuganbaev, Marinos Dimosthenos, Sergey Zlobin, Irina Filimonova
-
Publication number: 20120201420Abstract: Methods for processing machine-readable forms or documents of non-fixed format are disclosed. The methods make use of, for example, a structural description of characteristics of document elements, a description of a logical structure of the document, and methods of searching for document elements by using the structural description. A structural description of the spatial and parametric characteristics of document elements and the logical connections between elements may include a hierarchical logical structure of the elements, specification of an algorithm of determining the search constraints, specification of characteristics of searched elements, and specification of a set of parameters for a compound element identified on the basis of the aggregate of its components. The method of describing the logical structure of a document and methods of searching for elements of a document may be based on the use of the structural description.Type: ApplicationFiled: April 17, 2012Publication date: August 9, 2012Inventors: Konstantin ZUEV, Diar TUGANBAEV, Irina FILIMONOVA
-
Patent number: 8233714Abstract: A method related to data capture from forms involving optical character recognition comprises detecting data fields on a scanned image; generating a flexible document description based on the detected data fields, including creating a set of search elements for each data field, each search element having associated search criteria; and training the flexible document description using a search algorithm to detect the data fields on additional training images based on the set of search elements.Type: GrantFiled: February 2, 2009Date of Patent: July 31, 2012Assignee: ABBYY Software Ltd.Inventors: Konstantin Zuev, Diar Tuganbaev, Irina Filimonova, Sergey Zlobin
-
Publication number: 20120183226Abstract: A method for processing a batch of scanned images is provided. The method comprises processing the scanned images into documents. For documents comprising multiple pages, the method maintains a page-based coordinate system to specify a location of structures within a page and joins the pages to form a multi-page sheet having a sheet-based coordinate system to specify a location of structures within the multi-page sheet. Data may be extracted from each document, such operation comprising a page mode wherein structures are detected on individual pages using the page-based coordinate system and a document mode wherein structures are detected within the entire document using the sheet-based coordinate system.Type: ApplicationFiled: March 27, 2012Publication date: July 19, 2012Inventors: Diar Tuganbaev, Sergey Zlobin, Irina Filimonova
-
Patent number: 8171391Abstract: The proposed technical solution allows processing of machine-readable forms of unfixed format. It comprises a method of specifying the logical structure of a document characterized by: preliminary specification of the list and descriptions of varieties of elements which may be present in the form, specifying an algorithm of setting the search constraints for every element, description of at least the following characteristics of search for every simple or compound element—the spatial characteristics of the search area and the parametric characteristics of the element, description of the method of identification of obtained elements, testing the type of the element, testing the properties which are typical of the type, testing the completeness of composition of the parts of the element.Type: GrantFiled: November 3, 2006Date of Patent: May 1, 2012Assignee: ABBYY Software, LtdInventors: Konstantin Zuev, Diar Tuganbaev, Irina Filimonova
-
Publication number: 20110188759Abstract: Automatic classification of different types of documents is disclosed. An image of a form or document is captured. The document is assigned to one or more type definitions by identifying one or more objects within the image of the document. A matching model is selected via identification of the document image. In the case of multiple identifications, a profound analysis of the document type is performed—either automatically or manually. An automatic classifier may be trained with document samples of each of a plurality of document classes or document types where the types are known in advance or a system of classes may be formed automatically without a priori information about types of samples. An automatic classifier determines possible features and calculates a range of feature values and possible other feature parameters for each type or class of document. A decision tree, based on rules specified by a user, may be used for classifying documents.Type: ApplicationFiled: April 14, 2011Publication date: August 4, 2011Inventors: Irina Filimonova, Sergey Zlobin, Andrey Myakutin
-
Publication number: 20110091109Abstract: In one embodiment, the invention provides a method for a machine to perform machine-readable form pre-recognition analysis. The method comprises preliminarily assigning at least one graphic image in a form for identification of form type, preliminarily creating at least one model of the said graphic image for identification of the form type, parsing a form image into regions, determining an image form type for the form image, comprising: (a) detecting on the form image at least one of said graphic images for identification of the form type, (b) performing a primary identification of the form image type based on a comparison of the detected graphic image with the said model, and(c) performing a profound analysis using a supplementary data said-primary identification results in multiple possibilities for the form image type.Type: ApplicationFiled: December 22, 2010Publication date: April 21, 2011Applicant: ABBYY SOFTWARE LTDInventors: Konstantin Zuev, Irina Filimonova, Sergey Zlobin