Patents Assigned to ABBYY Software Ltd.
-
Publication number: 20140122479
Abstract: Described herein are methods for determining a type and semi-unique features of electronic files. The methods generally include generating at least one document hypothesis corresponding to the type of the document. For each document hypothesis, the document type is verified. A document type hypothesis is selected. A document name is formed based on the selected document type hypothesis and one or more features of the document. Such steps generally include automatically or programmatically naming electronic files. A unique or semi-unique name is given, one that reproduces some of the document's contents, attributes and/or characteristics. Each document is provided with a name that can be easily understood and that is related to the content of the document.
Type: Application
Filed: December 12, 2012
Publication date: May 1, 2014
Applicant: ABBYY Software Ltd.
Inventors: Vasily Panferov, Andrey Isaev
-
Publication number: 20140118796
Abstract: Methods and devices are described for detecting boundaries of documents on flatbed and multi-function scanners on a first pass of a carriage assembly, and then performing a high resolution scan on a second pass. High resolution images of documents can then be obtained with little or no interaction normally necessary to identify areas of interest on the scanner bed. Patterns on the scanner cover or lid facilitate not only edge determination, but orientation of text and other objects, and straightening of images in preparation for OCR and related functions. Electronic images and files derived from paper documents may be automatically cropped, deskewed, subjected to OCR, and named consistent with content or other information derived from them.
Type: Application
Filed: December 12, 2012
Publication date: May 1, 2014
Applicant: ABBYY Software Ltd.
Inventor: Andrey Isaev
-
Publication number: 20140081620
Abstract: Disclosed is a method that involves acquiring an image with text and displaying all or a portion of the image on an electronic device. In response to detecting a swiping action or gesture, a result of translation is displayed on a screen of the device. A first screen or display becomes a second one. Original text in a first language or source language may be easily and quickly compared to translated text shown on a second screen through a swiping gesture. Electronic dictionaries and machine translation may be used. These services may be independently stored and operated from different locations including on the device performing the translation, on a server or across a network (LAN, WAN, etc.). Optional manual correction of the translated text is also possible.
Type: Application
Filed: September 18, 2012
Publication date: March 20, 2014
Applicant: ABBYY Software Ltd.
Inventor: Ekaterina Solntseva
-
Patent number: 8548795
Abstract: In one embodiment, the invention provides a method for translating a document in an input language into an output language comprising: a) for each document fragment for which a translation is readily available, translating said document fragment based on said readily available translation; and b) for each remaining untranslated fragment for which a translation is not readily available, translating said untranslated fragment based on a model-based machine translation technique. A translation is readily available if a search reveals at least one matching translation for the document fragment in a translation database.
Type: Grant
Filed: August 6, 2008
Date of Patent: October 1, 2013
Assignee: ABBYY Software Ltd.
Inventors: Konstantin Anisimovich, Vladimir Selegey, Konstantin Zuev
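The two-step scheme in this abstract (database lookup first, machine translation for whatever remains) can be illustrated with a minimal sketch. It is not the patented implementation: the names `translate_document`, `translation_db` and `machine_translate` are hypothetical, the translation database is assumed to be an exact-match dictionary, and the machine translation step is a caller-supplied stand-in.

```python
from typing import Callable, Dict, List


def translate_document(
    fragments: List[str],
    translation_db: Dict[str, str],
    machine_translate: Callable[[str], str],
) -> List[str]:
    """Translate fragments, preferring stored translations over machine translation.

    translation_db is a hypothetical exact-match store (source -> target).
    machine_translate is a hypothetical fallback for fragments with no match.
    """
    translated = []
    for fragment in fragments:
        stored = translation_db.get(fragment)
        if stored is not None:
            # Step (a): a matching translation is readily available.
            translated.append(stored)
        else:
            # Step (b): no match found, fall back to machine translation.
            translated.append(machine_translate(fragment))
    return translated


# Toy usage with a placeholder fallback that only tags the fragment.
db = {"Hello": "Bonjour"}
output = translate_document(["Hello", "World"], db, lambda s: f"<MT:{s}>")
```

A real system would likely use fuzzy or normalized matching rather than exact dictionary keys; the exact-match lookup here only keeps the control flow visible.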
-
Patent number: 8547589
Abstract: A method for processing a batch of scanned images is provided. The method comprises processing the scanned images into documents; for documents comprising multiple pages, maintaining a page-based coordinate system to specify a location of structures within a page and joining the pages to form a multi-page sheet having a sheet-based coordinate system to specify a location of structures within the multi-page sheet; performing a data extraction operation to extract data from each document, said data extraction operation comprising a page mode wherein structures are detected on individual pages using the page-based coordinate system and a document mode wherein structures are detected within the entire document using the sheet-based coordinate system.
Type: Grant
Filed: May 21, 2009
Date of Patent: October 1, 2013
Assignee: ABBYY Software Ltd.
Inventors: Diar Tuganbaev, Sergey Zlobin, Irina Filimonova
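The relationship between the page-based and sheet-based coordinate systems can be sketched as follows. The abstract does not say how pages are joined, so this example assumes they are stacked vertically into one tall sheet; `Page`, `page_to_sheet` and `sheet_to_page` are hypothetical names used only for illustration.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class Page:
    width: int
    height: int


def page_to_sheet(pages: List[Page], page_index: int, x: int, y: int) -> Tuple[int, int]:
    """Convert page-based (x, y) to sheet-based coordinates.

    Assumption: pages are stacked top to bottom, so only y is shifted
    by the combined height of all preceding pages.
    """
    offset = sum(page.height for page in pages[:page_index])
    return x, offset + y


def sheet_to_page(pages: List[Page], x: int, y: int) -> Tuple[int, int, int]:
    """Convert sheet-based (x, y) back to (page_index, x, y) on that page."""
    offset = 0
    for index, page in enumerate(pages):
        if y < offset + page.height:
            return index, x, y - offset
        offset += page.height
    raise ValueError("point lies below the last page")


pages = [Page(100, 200), Page(100, 300)]
sheet_point = page_to_sheet(pages, 1, 10, 20)
page_point = sheet_to_page(pages, *sheet_point)
```

Under this assumption, a structure that spans a page break occupies one contiguous range of sheet coordinates, which is what lets a document-mode search treat it as a single object.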
-
Patent number: 8538162
Abstract: A method for processing a batch of scanned images is disclosed. The method includes processing the scanned images into documents. For documents of multiple pages, the method maintains a page-based coordinate system to specify a location of structures within a page and joins the pages to form a multi-page sheet associated with a sheet-based coordinate system to specify a location of structures within the multi-page sheet. Data may be extracted from each document through a page mode wherein structures are detected on individual pages using the page-based coordinate system and a document mode wherein structures are detected within the entire document using the sheet-based coordinate system.
Type: Grant
Filed: March 27, 2012
Date of Patent: September 17, 2013
Assignee: ABBYY Software Ltd.
Inventors: Diar Tuganbaev, Maryana Skuratovskaya, Sergey Zlobin
-
Patent number: 8472719
Abstract: A method of identifying stricken-out characters in handwriting, comprising parsing a scanned image into regions and objects, defining objects containing handwritten characters, applying structural or feature classifiers for primary character recognition, applying one or more supplemental feature classifiers preliminarily trained by strike-out characters, and identifying a stricken-out character if any. The stricken-out character may be further examined by special procedures, either automated or manual.
Type: Grant
Filed: January 22, 2003
Date of Patent: June 25, 2013
Assignee: ABBYY Software Ltd.
Inventors: Diar Tuganbaev, Dmitri Deriaguine
-
Patent number: 8452132
Abstract: Methods and system for processing document images in OCR systems, particularly for selecting a proper file name for a recognized document. The method comprises generating at least one document type hypothesis for the document; verifying each document type hypothesis; selecting a best document type hypothesis and saving the document with a proper name based on the best type hypothesis and unique features. The method further includes determining a logical structure of a document and selecting a best document model hypothesis that has the best degree of correspondence with the selected best block hypotheses for the document. On the basis of the best document model hypothesis the text document reflecting the logical structure of the source document in extended computer-editable format is formed and saved with a proper file name.
Type: Grant
Filed: March 30, 2010
Date of Patent: May 28, 2013
Assignee: ABBYY Software Ltd.
Inventors: Andrey Isaev, Dmitry Deryagin, Konstantin Anisimovich
-
Patent number: 8442810
Abstract: In one embodiment, the invention provides a method for machine translation of a source document in an input language to a target document in an output language, comprising generating translation options corresponding to at least portions of each sentence in the input language; and selecting a translation option for the sentence based on statistics associated with the translation options.
Type: Grant
Filed: September 25, 2012
Date of Patent: May 14, 2013
Assignee: ABBYY Software Ltd.
Inventors: Konstantin Anisimovich, Diar Tuganbaev, Vladimir Selegey, Konstantin Zuev
-
Patent number: 8412513
Abstract: In one embodiment, the invention provides a method for machine translation of a source document in an input language to a target document in an output language, comprising generating translation options corresponding to at least portions of each sentence in the input language; and selecting a translation option for the sentence based on statistics associated with the translation options.
Type: Grant
Filed: February 28, 2012
Date of Patent: April 2, 2013
Assignee: ABBYY Software Ltd.
Inventors: Konstantin Anisimovich, Diar Tuganbaev, Vladimir Selegey, Konstantin Zuev
-
Publication number: 20130054595
Abstract: Described herein are methods for determining a type and unique features of a document. The methods generally include generating at least one document hypothesis corresponding to the type of the document. For each document hypothesis, the document type is verified. A best type hypothesis is selected. A document name is formed based on the best type hypothesis and one or more unique features of the document. Such steps are generally included in automatically or programmatically naming documents. A unique or semi-unique name is given, one that reproduces some of the document's contents, attributes and/or characteristics. Each document is provided with a name that can be easily understood and that is related to the content of the document.
Type: Application
Filed: October 26, 2012
Publication date: February 28, 2013
Applicant: ABBYY Software Ltd.
Inventor: ABBYY Software Ltd.
-
Patent number: 8379119
Abstract: Embodiments of the present invention disclose a method, device and system for restoring a motion-blurred image. The method comprises determining parameters for a one-dimensional Optical Transfer Function (OTF) for the motion-blurred image in Fourier space; determining a signal-to-noise ratio for the motion-blurred image in the Fourier space; and correcting for motion blur based on the parameters of the OTF. Determining the parameters comprises calculating a function ?(p,q) which is based on the square of the modulus of the Fourier transform |G(p,q)|² of the motion-blurred image. The parameters include the absolute value of the one-dimensional OTF, and the phase and sign of the OTF.
Type: Grant
Filed: September 23, 2011
Date of Patent: February 19, 2013
Assignee: ABBYY Software Ltd.
Inventors: Vladimir Rybkin, Sergey Fedorov
-
Publication number: 20130024180
Abstract: In one embodiment, the invention provides a method for machine translation of a source document in an input language to a target document in an output language, comprising generating translation options corresponding to at least portions of each sentence in the input language; and selecting a translation option for the sentence based on statistics associated with the translation options.
Type: Application
Filed: September 25, 2012
Publication date: January 24, 2013
Applicant: ABBYY Software Ltd.
Inventors: Maryana Skuratovskaya, ABBYY Software Ltd.
-
Patent number: 8295590
Abstract: A method and system for creating a form template for a form are disclosed. The method comprises analyzing an image of a form to detect object demarcations in the form. The method also comprises classifying the object demarcations into one of a plurality of predefined object categories and processing each object demarcation based on the object category into which it has been classified, thereby to create the form template automatically.
Type: Grant
Filed: August 27, 2008
Date of Patent: October 23, 2012
Assignee: ABBYY Software Ltd.
Inventors: Irina Filimonova, Sergey Zlobin
-
Patent number: 8290272
Abstract: In one embodiment, there is disclosed a method for capturing data from a document image. The method 300 comprises processing the document image to identify at least one repetitive structure and performing a capturing operation including creating a plurality of instances of the repetitive structure based on once-described structure properties of the repetitive structure in a document template, and populating each instance with corresponding data from the document image. The method may also include creating a document template for capturing data from a document image.
Type: Grant
Filed: September 8, 2008
Date of Patent: October 16, 2012
Assignee: ABBYY Software Ltd.
Inventors: Irina Filimonova, Sergey Zlobin
-
Patent number: 8260049
Abstract: In one embodiment, the invention provides a method for determining a logical structure of a document. The method comprises generating at least one document hypothesis for the whole document; for each document hypothesis, verifying said document hypothesis including (a) generating at least one block hypothesis for each block in the document based on the document hypothesis; and (b) selecting a best block hypothesis for each block; selecting as a best document hypothesis the document hypothesis that has the best degree of correspondence with the selected best block hypotheses for the document; and forming the document based on the best document hypothesis.
Type: Grant
Filed: September 23, 2008
Date of Patent: September 4, 2012
Assignee: ABBYY Software Ltd.
Inventors: Dmitry Deryagin, Konstantin Anisimovich
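The nested hypothesis selection in this abstract can be sketched as a two-level search. This is only an illustration under stated assumptions, not the patented method: each document hypothesis is assumed to be a named list of block labels it allows, the "degree of correspondence" is assumed to be the sum of the best block scores, and `select_document_hypothesis` and the score table are hypothetical.

```python
from typing import Callable, Dict, List, Tuple


def select_document_hypothesis(
    blocks: List[str],
    document_hypotheses: Dict[str, List[str]],
    score: Callable[[str, str], float],
) -> Tuple[str, Dict[str, str], float]:
    """Pick the document hypothesis whose best block hypotheses fit best.

    document_hypotheses maps a hypothesis name to the block labels it allows.
    score(block, label) rates one block hypothesis (higher is better).
    The total fit is assumed to be the sum of the best block scores.
    """
    best_name, best_labels, best_total = "", {}, float("-inf")
    for name, allowed_labels in document_hypotheses.items():
        labels: Dict[str, str] = {}
        total = 0.0
        for block in blocks:
            # (a) generate block hypotheses, (b) keep the best one per block.
            label = max(allowed_labels, key=lambda candidate: score(block, candidate))
            labels[block] = label
            total += score(block, label)
        if total > best_total:
            best_name, best_labels, best_total = name, labels, total
    return best_name, best_labels, best_total


# Toy score table, invented for illustration only.
SCORES = {
    "Dear Sir": {"greeting": 0.9, "body": 0.4, "line_item": 0.1, "total": 0.0},
    "Total: 42": {"greeting": 0.0, "body": 0.5, "line_item": 0.3, "total": 0.9},
}
hypotheses = {"letter": ["greeting", "body"], "invoice": ["line_item", "total"]}
name, labels, total = select_document_hypothesis(
    list(SCORES), hypotheses, lambda block, label: SCORES[block][label]
)
```

Summing block scores is the simplest possible aggregate; a production system might weight blocks or penalize implausible label sequences instead.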
-
Patent number: 8233714
Abstract: A method related to data capture from forms involving optical character recognition comprises detecting data fields on a scanned image; generating a flexible document description based on the detected data fields, including creating a set of search elements for each data field, each search element having associated search criteria; and training the flexible document description using a search algorithm to detect the data fields on additional training images based on the set of search elements.
Type: Grant
Filed: February 2, 2009
Date of Patent: July 31, 2012
Assignee: ABBYY Software Ltd.
Inventors: Konstantin Zuev, Diar Tuganbaev, Irina Filimonova, Sergey Zlobin
-
Patent number: 8218887
Abstract: In one embodiment, the invention discloses a method for processing a document image. The method comprises segmenting the document image into a picture component and a non-picture component; compressing the non-picture component; and saving the uncompressed picture component and the compressed non-picture component in memory so that the document image may be recomposed to form a recomposed image based on the uncompressed picture component and the compressed non-picture component.
Type: Grant
Filed: September 16, 2008
Date of Patent: July 10, 2012
Assignee: ABBYY Software, Ltd.
Inventor: German Zyuzin
-
Patent number: 8214199
Abstract: A method and computer system for translating sentences between languages from an intermediate language-independent semantic representation is provided. On the basis of comprehensive understanding about languages and semantics, exhaustive linguistic descriptions are used to analyze sentences, to build syntactic structures and language-independent semantic structures and representations, and to synthesize one or more sentences in a natural or artificial language. A computer system is also provided to analyze and synthesize various linguistic structures and to perform translation of a wide spectrum of various sentence types. As a result, a generalized data structure, such as a semantic structure, is generated from a sentence of an input language and can be transformed into a natural sentence expressing its meaning correctly in an output language.
Type: Grant
Filed: March 22, 2007
Date of Patent: July 3, 2012
Assignee: ABBYY Software, Ltd.
Inventors: Konstantin Anismovich, Vladimir Selegey, Konstantin Zuev
-
Patent number: 8195447
Abstract: A method and computer system for translating sentences between languages from an intermediate language-independent semantic representation is provided. On the basis of comprehensive understanding about languages and semantics, exhaustive linguistic descriptions are used to analyze sentences, to build syntactic structures and language-independent semantic structures and representations, and to synthesize one or more sentences in a natural or artificial language. A computer system is also provided to analyze and synthesize various linguistic structures and to perform translation of a wide spectrum of various sentence types. As a result, a generalized data structure, such as a semantic structure, is generated from a sentence of an input language and can be transformed into a natural sentence expressing its meaning correctly in an output language.
Type: Grant
Filed: March 22, 2007
Date of Patent: June 5, 2012
Assignee: ABBYY Software Ltd.
Inventors: Konstantin Anismovich, Vladimir Selegey, Konstantin Zuev