Search Patents

OCR multi-resolution method and apparatus

Patent number: 8611661

Abstract: In some embodiments, provided are procedures for processing images that may have different font sizes. In some embodiments, it involves OCR'ing with multiple passes at different resolutions.

Type: Grant

Filed: December 26, 2007

Date of Patent: December 17, 2013

Assignee: Intel Corporation

Inventors: Oscar Nestares, Badusha Kalathiparambil
OCR of books by word recognition

Patent number: 8014604

Abstract: Disclosed embodiments of the invention provide automated global optimization methods and systems of OCR, tailored to each document being digitized. A document-specific database is created from an OCR scan of a document of interest, which contains an exhaustive listing of words in the document. Images of each word, taken from all the fonts encountered, are entered into the database and mapped to a corresponding textual representation. After entry of a first instance of an image of a word written in a particular font, each new occurrence of the word in that font can be quickly recognized by image processing techniques. The disclosed methods and systems may be used in conjunction with adaptive character recognition training and word recognition training of the OCR engines.

Type: Grant

Filed: April 16, 2008

Date of Patent: September 6, 2011

Assignee: International Business Machines Corporation

Inventors: Asaf Tzadok, Eugeniusz Walach
USER CORRECTION OF ERRORS ARISING IN A TEXTUAL DOCUMENT UNDERGOING OPTICAL CHARACTER RECOGNITION (OCR) PROCESS

Publication number: 20110280481

Abstract: An electronic model of the image document is created by undergoing an OCR process. The electronic model includes elements (e.g., words, text lines, paragraphs, images) of the image document that have been determined by each of a plurality of sequentially executed stages in the OCR process. The electronic model serves as input information which is supplied to each of the stages by a previous stage that processed the image document. A graphical user interface is presented to the user so that the user can provide user input data correcting a mischaracterized item appearing in the document. Based on the user input data, the processing stage which produced the initial error that gave rise to the mischaracterized item corrects the initial error. Stages of the OCR process subsequent to this stage then correct any consequential errors arising in their respective stages as a result of the initial error.

Type: Application

Filed: May 17, 2010

Publication date: November 17, 2011

Applicant: MICROSOFT CORPORATION

Inventors: Bogdan Radakovic, Milan Vugdelija, Nikola Todic, Aleksandar Uzelac, Bodin Dresevic
Method for processing optical character recognition (OCR) data, wherein the output comprises visually impaired character images

Patent number: 8467614

Abstract: The present invention provides a method for an Optical Character Recognition (OCR) system providing recognition of characters that are partly hidden by crossing outs due to for example an imprint of a stamp, handwritten signatures, etc. The method establishes a set of template images of certainly recognized characters from the image of the text being processed by the OCR system, wherein the effect of the crossed out section is modelled into the template images before comparing these images with the image of a visually impaired crossed out character. The modelled template image having the highest similarity with the visually impaired crossed out character is the correct identification for the visually impaired character instance.

Type: Grant

Filed: November 21, 2008

Date of Patent: June 18, 2013

Assignee: Lumex AS

Inventors: Knut Tharald Fosseide, Hans Christian Meyer
Providing in-line previews of a source image for aid in correcting OCR errors

Patent number: 9501853

Abstract: The present disclosure is directed toward systems and methods for assisting users in correcting OCR errors. For example, systems and methods described herein involve identifying the position of a cursor within a machine-readable document. Systems and methods described herein also involve identifying corresponding position co-ordinates in a source image, as well as, capturing an image preview from the source image based on the corresponding position co-ordinates. Systems and methods described herein may also involve providing the preview of the source image within the machine-readable document.

Type: Grant

Filed: January 9, 2015

Date of Patent: November 22, 2016

Assignee: ADOBE SYSTEMS INCORPORATED

Inventors: Sayeed Ullah Khan, Subrato Namata, Uttam Dwivedi
Method and system for OCR-free vehicle identification number localization

Patent number: 9965677

Abstract: Methods and systems for localizing numbers and characters in captured images. A side image of a vehicle captured by one or more cameras can be preprocessed to determine a region of interest. A confidence value of series of windows within regions of interest of different sizes and aspect ratios containing a structure of interest can be calculated. Highest confidence candidate regions can then be identified with respect to the regions of interest and at least one region adjacent to the highest confidence candidate regions. An OCR operation can then be performed in the adjacent region. An identifier can then be returned from the adjacent region in order to localize numbers and characters in the side image of the vehicle.

Type: Grant

Filed: December 9, 2014

Date of Patent: May 8, 2018

Assignee: Conduent Business Services, LLC

Inventors: Orhan Bulan, Howard Mizes, Vladimir Kozitsky, Aaron M. Burry
SEGMENTATION OF A WORD BITMAP INTO INDIVIDUAL CHARACTERS OR GLYPHS DURING AN OCR PROCESS

Publication number: 20110274354

Abstract: An image processing apparatus is provided that includes a character chopper component that segments words into individual characters in a bitmap of a textual image undergoing an OCR process. The Character chopper component is configured to produce a set of (possibly curved) chop-lines which divide a bitmap of any given word into its individual character or glyph candidates. Cases where an input bitmap contains two separate words are handled by marking a place where those words should be split. The character segmentation algorithm computes the set of vertically oriented, curved chop-lines by considering glyph and background colors in a given word bitmap. The set is filtered afterwards using various heuristics, in order to preserve those lines that indeed do separate a word's glyphs and minimize the number of those that do not.

Type: Application

Filed: May 10, 2010

Publication date: November 10, 2011

Applicant: MICROSOFT CORPORATION

Inventor: Djordje Nijemcevic
Techniques for shape clustering and assignment of character codes in post optical character recognition processing

Patent number: 8666174

Abstract: Systems, methods and computer program products on storage devices for shape clustering and applications in processing various documents, including an output of an optical character recognition (OCR) process. The output of an OCR process is classified into a plurality of clusters of clip images and a representative image for each cluster is generated to identify clusters whose clip images were incorrectly assigned character codes by the OCR process.

Type: Grant

Filed: January 17, 2012

Date of Patent: March 4, 2014

Assignee: Google Inc.

Inventors: Luc Vincent, Raymond W. Smith
Multiple image input for optical character recognition processing systems and methods

Patent number: 7734092

Abstract: A method of processing an image includes receiving a digital version of the image, processing the digital version of the image through at least two binarization processes to thereby create a first binarization and a second binarization, and processing the first binarization through a first optical character recognition process to thereby create a first OCR output file. Processing the first binarization through a first optical character recognition process includes compiling first metrics associated with the first OCR output file. The method also includes processing the second binarization through the first optical character recognition process to thereby create a second OCR output file. Processing the second binarization through the first optical character recognition process includes compiling second metrics associated with the second OCR output file. The method also includes using the metrics, at least in part, to select a final OCR output file from among the OCR output files.

Type: Grant

Filed: November 15, 2006

Date of Patent: June 8, 2010

Assignee: Ancestry.com Operations Inc.

Inventors: Donald B. Curtis, Shawn Reid
System for detecting and correcting broken words

Patent number: 9256592

Abstract: The positioning of elements of a broken word can be corrected by receiving an optical character recognition (OCR) conversion of a printed publication and identifying multiple parts of the broken word from the OCR conversion to place in a graphical user interface (GUI). The multiple parts can be placed in the GUI using original positioning data for the printed publication. A user can make a selection in the GUI indicating that multiple parts from the OCR conversion are of the broken word and can automatically adjust bounds of the multiple parts to form a corrected word.

Type: Grant

Filed: November 7, 2012

Date of Patent: February 9, 2016

Assignee: Amazon Technologies, Inc.

Inventors: Satishkumar Kothandapani Shanmugasundaram, Shubham Chandra Gupta, Arpita Agrawal
System for detecting and correcting broken words

Patent number: 10552535

Abstract: The positioning of elements of a broken word can be corrected by receiving an optical character recognition (OCR) conversion of a printed publication and identifying multiple parts of the broken word from the OCR conversion to output in a graphical user interface (GUI). The multiple parts can be placed in the GUI using original positioning data for the printed publication. A user can make a selection in the GUI indicating that multiple parts from the OCR conversion are of the broken word and can automatically adjust bounds of the multiple parts to form a corrected word.

Type: Grant

Filed: January 19, 2016

Date of Patent: February 4, 2020

Assignee: Amazon Technologies, Inc.

Inventors: Satishkumar Kothandapani Shanmugasundaram, Shubham Chandra Gupta, Arpita Agrawal
Template matching with data correction

Patent number: 9530068

Abstract: An approach is provided to generate forms with template inclusions. In the approach, optical character recognition (OCR) text is compared to corresponding text in a selected form. Characters of text in the OCR text are then replaced with text from the template text, the replacing results in a form with template inclusions. The form with template inclusions is then processed by a forms processing operation.

Type: Grant

Filed: November 10, 2014

Date of Patent: December 27, 2016

Assignee: International Business Machines Corporation

Inventors: Keith P. Biegert, Brendan C. Bull, David Contreras, Robert C. Sizemore, Sterling R. Smith
Shape clustering and cluster-level manual identification in post optical character recognition processing

Patent number: 7697758

Abstract: Techniques for shape clustering and applications in processing various documents, including an output of an optical character recognition (OCR) process.

Type: Grant

Filed: September 11, 2006

Date of Patent: April 13, 2010

Assignee: Google Inc.

Inventors: Luc Vincent, Raymond W. Smith
DETECTING POSITION OF WORD BREAKS IN A TEXTUAL LINE IMAGE

Publication number: 20110243445

Abstract: Line segmentation in an OCR process is performed to detect the positions of words within an input textual line image by extracting features from the input to locate breaks and then classifying the breaks into one of two break classes which include inter-word breaks and inter-character breaks. An output including the bounding boxes of the detected words and a probability that a given break belongs to the identified class can then be provided to downstream OCR or other components for post-processing. Advantageously, by reducing line segmentation to the extraction of features, including the position of each break and the number of break features, and break classification, the task of line segmentation is made less complex but with no loss of generality.

Type: Application

Filed: March 30, 2010

Publication date: October 6, 2011

Applicant: MICROSOFT CORPORATION

Inventors: Aleksandar Uzelac, Bodin Dresevic, Sasa Galic, Bogdan Radakovic
Detecting position of word breaks in a textual line image

Patent number: 8345978

Abstract: Line segmentation in an OCR process is performed to detect the positions of words within an input textual line image by extracting features from the input to locate breaks and then classifying the breaks into one of two break classes which include inter-word breaks and inter-character breaks. An output including the bounding boxes of the detected words and a probability that a given break belongs to the identified class can then be provided to downstream OCR or other components for post-processing. Advantageously, by reducing line segmentation to the extraction of features, including the position of each break and the number of break features, and break classification, the task of line segmentation is made less complex but with no loss of generality.

Type: Grant

Filed: March 30, 2010

Date of Patent: January 1, 2013

Assignee: Microsoft Corporation

Inventors: Aleksandar Uzelac, Bodin Dresevic, Sasa Galic, Bogdan Radakovic
Character recognition device and a method therefor

Patent number: 7233697

Abstract: The present invention relates to an optical character recognition device (OCR) for reading a form provided with character frames in reading fields, into which a user fills each character. Characteristic vectors are extracted from the character images of each frame. A number of characters decision unit 16, into which the characteristic vectors are input, decides the number of characters filled in one of the character frames. A character separation unit 18 separates each of characters from the character image based on the number of characters decided by the decision unit 16. The character recognition unit 20 then recognizes each of the character. The OCR according o the present invention is able to read the form correctly, in which a plurality of characters are filled in one of the frames.

Type: Grant

Filed: March 29, 2002

Date of Patent: June 19, 2007

Assignee: Kabushiki Kaisha Toshiba

Inventor: Hiroyuki Mizutani
Post-processing system and method for correcting machine recognized text

Publication number: 20040086179

Abstract: A method of post-processing character data from an optical character recognition (OCR) engine and apparatus to perform the method. This exemplary method includes segmenting the character data into a set of initial words. The set of initial words is word level processed to determine at least one candidate word corresponding to each initial word. The set of initial words is segmented into a set of sentences. Each sentence in the set of sentences includes a plurality of initial words and candidate words corresponding to the initial words. A sentence is selected from the set of sentences. The selected sentence is word disambiguity processed to determine a plurality of final words. A final word is selected from the at least one candidate word corresponding to a matching initial word. The plurality of final words is then assembled as post-processed OCR data.

Type: Application

Filed: November 4, 2002

Publication date: May 6, 2004

Inventors: Yue Ma, Jinhong Katherine Guo, Mu Li, Yu-kun Tong, Tian-shun Yao, Jing-bo Zhu
Reader with optical character recognition

Patent number: 8783570

Abstract: An imaging-based bar code reader that includes an imaging and decoding system. Focusing optics and a sensor array define a field of view. A data processor has a memory for storing a pattern definition of previously imaged OCR characters and comparing a format of said previously stored characters to a present image to determine a character content of the present image.

Type: Grant

Filed: August 21, 2007

Date of Patent: July 22, 2014

Assignee: Symbol Technologies, Inc.

Inventors: Xiaomei Wang, Christopher J. Fjellstad
Method and a machine for processing mail runs using matrix accumulators

Patent number: 8295540

Abstract: A method of processing uniform mailpieces referred to as a “run” of mailpieces, during which method OCR is performed for recognizing certain information in a zone of interest of an image of each mailpiece, and during which method the following steps are performed: a) initializing a matrix accumulator associated with said run and including unitary accumulation elements that correspond to the pixels of the image; b) consolidating said matrix accumulator by incrementing certain unitary accumulation elements by deriving an indication of the spatial position of a block of pixels in which said certain information has been recognized unambiguously, or by using construction and local graphical correlation of blocks of image pixels to derive an optical flow map indicating local graphical movements; and c) defining, in the OCR processing, said zone of interest on the basis of the unitary accumulation elements of the consolidated matrix accumulator that present extreme accumulation values.

Type: Grant

Filed: November 4, 2011

Date of Patent: October 23, 2012

Assignee: SOLYSTIC

Inventors: Belkacem Benyoub, Emmanuel Piegay, Mathieu Letombe
Accuracy of recognition by means of a combination of classifiers

Patent number: 8660371

Abstract: In one embodiment, there is provided a method for an Optical Character Recognition (OCR) system. The method comprises: recognizing an input character based on a plurality of classifiers, wherein each classifier generates an output by comparing the input character with a plurality of trained patterns; grouping the plurality of classifiers based on a classifier grouping criterion; and combining the output of each of the plurality of classifiers based on the grouping.

Type: Grant

Filed: May 6, 2010

Date of Patent: February 25, 2014

Assignee: ABBYY Development LLC

Inventor: Diar Tuganbaev

1 2 next