Patents Assigned to ABBYY Software Ltd.
  • Publication number: 20140122479
    Abstract: Described herein are methods for determining a type and semi-unique features of electronic files. The methods generally include generating at least one document hypothesis corresponding to the type of the document. For each document hypothesis, the document type is verified. A document type hypothesis is selected. A document name is formed based on the selected document type hypothesis and one or more features of the document. Such steps generally include automatically or programmatically naming of electronic files. A unique or semi-unique name is given, one that reproduces some of the document's contents, attributes and/or characteristics. Each document is provided with a name that can be easily understood and that is related to the content of the document.
    Type: Application
    Filed: December 12, 2012
    Publication date: May 1, 2014
    Applicant: ABBYY Software Ltd.
    Inventors: Vasily Panferov, Andrey Isaev
  • Publication number: 20140118796
    Abstract: Methods and devices are described for detecting boundaries of documents on flatbed and multi-function scanners on a first pass of a carriage assembly, and then performing a high resolution scan on a second pass. High resolution images of documents can then be obtained with little or no interaction normally necessary to identify areas of interest on the scanner bed. Patterns on the scanner cover or lid facilitate not only edge determination, but orientation of text and other objects, and straightening of images in preparation for OCR and related functions. Electronic images and files derived from paper documents may be automatically cropped, deskewed, subjected to OCR, and named consistent with content or other information derived from them.
    Type: Application
    Filed: December 12, 2012
    Publication date: May 1, 2014
    Applicant: ABBYY Software Ltd.
    Inventor: Andrey Isaev
  • Publication number: 20140081620
    Abstract: Disclosed is a method that involves acquiring an image with text, displaying all or a portion of the image on an electronic device. In response to detecting a swiping action or gesture, displaying a result of translation on a screen of the device. A first screen or display becomes a second one. Original text in a first language or source language may be easily and quickly compared to translated text shown on a second screen through a swiping gesture. Electronic dictionaries and machine translation may be used. These services may be independently stored and operated from different locations including on the device performing the translation, on a server or across a network (LAN, WAN, etc.). Optional manual correction of the translated text is also possible.
    Type: Application
    Filed: September 18, 2012
    Publication date: March 20, 2014
    Applicant: ABBYY Software Ltd.
    Inventor: Ekaterina Solntseva
  • Patent number: 8548795
    Abstract: In one embodiment, the invention provides a method for translating a document in an input language into an output language comprising: a) for each document fragment for which a translation is readily available, translating said document fragment based on said readily available translation; and b) for each remaining untranslated fragment for which a translation is not readily available, translating said untranslated fragment based on a model-based machine translation technique. A translation is readily available if a search reveals at least one matching translation for the document fragment in a translation database.
    Type: Grant
    Filed: August 6, 2008
    Date of Patent: October 1, 2013
    Assignee: ABBYY Software Ltd.
    Inventors: Konstantin Anisimovich, Vladimir Selegey, Konstantin Zuev
  • Patent number: 8547589
    Abstract: A method for processing a batch of scanned images is provided. The method comprises processing the scanned images into documents; for documents comprising multiple pages maintaining a page-based coordinate system to specify a location of structures within a page and joining the pages to form a multi-page sheet having a sheet-based coordinate system to specify a location of structures within the multi-page sheet; performing a data extraction operation to extract data from each document, said data extraction operation comprising a page mode wherein structures are detected on individual pages using the page-based coordinate system and a document mode wherein structures are detected within the entire document using the sheet-based coordinate system.
    Type: Grant
    Filed: May 21, 2009
    Date of Patent: October 1, 2013
    Assignee: ABBYY Software Ltd.
    Inventors: Diar Tuganbaev, Sergey Zlobin, Irina Filimonova
  • Patent number: 8538162
    Abstract: A method for processing a batch of scanned images is disclosed. The method includes processing the scanned images into documents. For documents of multiple pages, the method maintains a page-based coordinate system to specify a location of structures within a page and joins the pages to form a multi-page sheet associated with a sheet-based coordinate system to specify a location of structures within the multi-page sheet. Data may be extracted from each document through a page mode wherein structures are detected on individual pages using the page-based coordinate system and a document mode wherein structures are detected within the entire document using the sheet-based coordinate system.
    Type: Grant
    Filed: March 27, 2012
    Date of Patent: September 17, 2013
    Assignee: ABBYY Software Ltd.
    Inventors: Diar Tuganbaev, Maryana Skuratovskaya, Sergey Zlobin
  • Patent number: 8472719
    Abstract: A method of identifying stricken-out characters in handwriting, comprising parsing a scanned image into regions and objects, defining objects containing handwritten characters, applying structural or feature classifiers for primary character recognition, applying one or more supplemental feature classifiers preliminarily trained by strike-out characters, and identifying a stricken-out character if any. The stricken-out character may be further examined by special procedures, either automated or manual.
    Type: Grant
    Filed: January 22, 2003
    Date of Patent: June 25, 2013
    Assignee: ABBYY Software Ltd.
    Inventors: Diar Tuganbaev, Dmitri Deriaguine
  • Patent number: 8452132
    Abstract: Methods and system for processing document images in OCR systems, particularly for selecting a proper file name for a recognized document. The method comprises generating at least one document type hypothesis for the document; verifying each document type hypothesis; selecting a best document type hypothesis and saving the document with a proper name based on the best type hypothesis and unique features. The method further includes determining a logical structure of a document and selecting a best document model hypothesis that has the best degree of correspondence with the selected best block hypotheses for the document. On the basis of the best document model hypothesis the text document reflecting the logical structure of the source document in extended computer-editable format is formed and saved with a proper file name.
    Type: Grant
    Filed: March 30, 2010
    Date of Patent: May 28, 2013
    Assignee: ABBYY Software Ltd.
    Inventors: Andrey Isaev, Dmitry Deryagin, Konstantin Anisimovich
  • Patent number: 8442810
    Abstract: In one embodiment, the invention provides a method for machine translation of a source document in an input language to a target document in an output language, comprising generating translation options corresponding to at least portions of each sentence in the input language; and selecting a translation option for the sentence based on statistics associated with the translation options.
    Type: Grant
    Filed: September 25, 2012
    Date of Patent: May 14, 2013
    Assignee: ABBYY Software Ltd.
    Inventors: Konstantin Anisimovich, Diar Tuganbaev, Vladimir Selegey, Konstantin Zuev
  • Patent number: 8412513
    Abstract: In one embodiment, the invention provides a method for machine translation of a source document in an input language to a target document in an output language, comprising generating translation options corresponding to at least portions of each sentence in the input language; and selecting a translation option for the sentence based on statistics associated with the translation options.
    Type: Grant
    Filed: February 28, 2012
    Date of Patent: April 2, 2013
    Assignee: ABBYY Software Ltd.
    Inventors: Konstantin Anisimovich, Diar Tuganbaev, Vladimir Selegey, Konstantin Zuev
  • Publication number: 20130054595
    Abstract: Described herein are methods for determining a type and unique features of a document. The methods generally include generating at least one document hypothesis corresponding to the type of the document. For each document hypothesis, the document type is verified. A best type hypothesis is selected. A document name is formed based on the best type hypothesis and one or more unique features of the document. Such steps are generally included in automatically or programmatically naming of documents. A unique or semi-unique name is given, one that reproduces some of the document's contents, attributes and/or characteristics. Each document is provided with a name that can be easily understood and that is related to the content of the document.
    Type: Application
    Filed: October 26, 2012
    Publication date: February 28, 2013
    Applicant: ABBYY Software Ltd.
    Inventor: ABBYY Software Ltd.
  • Patent number: 8379119
    Abstract: Embodiments of the present invention disclose a method, device and system for restoring a motion-blurred image. The method comprises determining parameters for a one-dimensional Optical Transfer Function (OTF) for the motion-blurred image in Fourier space; determining a signal-to-noise ratio for the motion-blurred image in the Fourier space; and correcting for motion blur based on the parameters of the OTF. Determining the parameters comprises calculating a function ?(p,q) which is based on the square of the modulus of the Fourier transform |G(p,q)|2 of the motion-blurred image. The parameters include the absolute value of the one-dimensional OTF, and the phase and sign of the OTF.
    Type: Grant
    Filed: September 23, 2011
    Date of Patent: February 19, 2013
    Assignee: ABBYY Software Ltd.
    Inventors: Vladimir Rybkin, Sergey Fedorov
  • Publication number: 20130024180
    Abstract: In one embodiment, the invention provides a method for machine translation of a source document in an input language to a target document in an output language, comprising generating translation options corresponding to at least portions of each sentence in the input language; and selecting a translation option for the sentence based on statistics associated with the translation options.
    Type: Application
    Filed: September 25, 2012
    Publication date: January 24, 2013
    Applicant: ABBYY Software Ltd.
    Inventors: Maryana Skuratovskaya, ABBYY Software Ltd.
  • Patent number: 8295590
    Abstract: A method and system for creating a form template for a form are disclosed. The method comprises analyzing an image of a form to detect object demarcations in the form. The method also comprises classifying the object demarcations into one of a plurality of predefined object categories and processing each object demarcation based on the object category into which it has been classified, thereby to create the form template automatically.
    Type: Grant
    Filed: August 27, 2008
    Date of Patent: October 23, 2012
    Assignee: ABBYY Software Ltd.
    Inventors: Irina Filimonova, Sergey Zlobin
  • Patent number: 8290272
    Abstract: In one embodiment, there is disclosed a method capturing data from a document image. The method 300 comprises processing the document image to identify at least one repetitive structure and performing a capturing operation including creating a plurality of instances of the repetitive structure based on once-described structure properties of the repetitive structure in a document template, and populating each instance with corresponding data from the document image. The method may also include creating a document template for capturing data from a document image.
    Type: Grant
    Filed: September 8, 2008
    Date of Patent: October 16, 2012
    Assignee: ABBYY Software Ltd.
    Inventors: Irina Filimonova, Sergey Zlobin
  • Patent number: 8260049
    Abstract: In one embodiment, the invention provides a method for determining a logical structure of a document. The method comprises generating at least one document hypothesis for the whole document; for each document hypothesis, verifying said document hypothesis including (a) generating at least one block hypothesis for each block in the document based on the document hypothesis; and (b) selecting a best block hypothesis for each block; selecting as a best document hypothesis the document hypothesis that has the best degree of correspondence with the selected best block hypotheses for the document; and forming the document based on the best document hypothesis.
    Type: Grant
    Filed: September 23, 2008
    Date of Patent: September 4, 2012
    Assignee: ABBYY Software Ltd.
    Inventors: Dmitry Deryagin, Konstantin Anisimovich
  • Patent number: 8233714
    Abstract: A method related to data capture from forms involving optical character recognition comprises detecting data fields on a scanned image; generating a flexible document description based on the detected data fields, including creating a set of search elements for each data field, each search element having associated search criteria; and training the flexible document description using a search algorithm to detect the data fields on additional training images based on the set of search elements.
    Type: Grant
    Filed: February 2, 2009
    Date of Patent: July 31, 2012
    Assignee: ABBYY Software Ltd.
    Inventors: Konstantin Zuev, Diar Tuganbaev, Irina Filimonova, Sergey Zlobin
  • Patent number: 8218887
    Abstract: In one embodiment, the invention discloses, a method for processing a document image. The method comprises segmenting the document image into a picture component and a non-picture component; compressing the non-picture component; and saving the uncompressed picture component and the compressed non-picture component in memory so that the document image may be recomposed to form a recomposed image based on the uncompressed picture component and the compressed non-picture component.
    Type: Grant
    Filed: September 16, 2008
    Date of Patent: July 10, 2012
    Assignee: ABBYY Software, Ltd.
    Inventor: German Zyuzin
  • Patent number: 8214199
    Abstract: A method and computer system for translating sentences between languages from an intermediate language-independent semantic representation is provided. On the basis of comprehensive understanding about languages and semantics, exhaustive linguistic descriptions are used to analyze sentences, to build syntactic structures and language independent semantic structures and representations, and to synthesize one or more sentences in a natural or artificial language. A computer system is also provided to analyze and synthesize various linguistic structures and to perform translation of a wide spectrum of various sentence types. As result, a generalized data structure, such as a semantic structure, is generated from a sentence of an input language and can be transformed into a natural sentence expressing its meaning correctly in an output language.
    Type: Grant
    Filed: March 22, 2007
    Date of Patent: July 3, 2012
    Assignee: ABBYY Software, Ltd.
    Inventors: Konstantin Anismovich, Vladimir Selegey, Konstantin Zuev
  • Patent number: 8195447
    Abstract: A method and computer system for translating sentences between languages from an intermediate language-independent semantic representation is provided. On the basis of comprehensive understanding about languages and semantics, exhaustive linguistic descriptions are used to analyze sentences, to build syntactic structures and language independent semantic structures and representations, and to synthesize one or more sentences in a natural or artificial language. A computer system is also provided to analyze and synthesize various linguistic structures and to perform translation of a wide spectrum of various sentence types. As result, a generalized data structure, such as a semantic structure, is generated from a sentence of an input language and can be transformed into a natural sentence expressing its meaning correctly in an output language.
    Type: Grant
    Filed: March 22, 2007
    Date of Patent: June 5, 2012
    Assignee: ABBYY Software Ltd.
    Inventors: Konstantin Anismovich, Vladimir Selegey, Konstantin Zuev