Abstract: A binarization method for use in an OCR system determines text pixels by checking, for each pixel, that the difference between its value and the values of a plurality of pixels located at a predetermined distance from it is greater than a relative threshold corresponding to the difference in intensity between the text and the background of the image. The image is subsampled at a rate corresponding to at least two pixels in order to detect kernels of text, and the image pixels are then binarized only in tiles that contain text kernels, each tile having sides several stroke widths long, using in each tile an absolute threshold estimated in that tile.
Type: Grant
Filed: May 12, 1999
Date of Patent: August 20, 2002
Assignee: International Business Machines Corp.
Inventors: Andrei Heilper, Yaakov Navon, Eugene Walach
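
The steps named in the abstract (relative-threshold test, subsampled kernel detection, tile-local absolute threshold) can be illustrated with a minimal Python/NumPy sketch. It is not the patented implementation, and several details are assumptions that the abstract does not specify: text is taken to be darker than the background; the "plurality of pixels" is taken to be the four axis-aligned neighbours at distance `dist`; a pixel passes when both horizontal or both vertical neighbours are brighter by more than `rel_thresh`; and the per-tile absolute threshold is taken as the midpoint of the tile's minimum and maximum intensity. The function names and default parameter values are hypothetical.

```python
import numpy as np


def text_pixel_mask(img, dist, rel_thresh):
    """Relative-threshold test: True where a pixel is darker than its
    neighbours at distance `dist` by more than `rel_thresh`.
    Assumption: both horizontal OR both vertical neighbours must pass."""
    img = img.astype(np.int32)  # avoid unsigned underflow when subtracting
    h, w = img.shape
    padded = np.pad(img, dist, mode="edge")

    def brighter(dy, dx):
        # Neighbour image shifted by (dy, dx), compared against the pixel.
        shifted = padded[dist + dy:dist + dy + h, dist + dx:dist + dx + w]
        return (shifted - img) > rel_thresh

    horizontal = brighter(0, -dist) & brighter(0, dist)
    vertical = brighter(-dist, 0) & brighter(dist, 0)
    return horizontal | vertical


def binarize(img, dist=3, rel_thresh=50, step=2, tile=8):
    """Return a boolean image, True for text pixels.
    Only tiles that contain at least one text kernel are binarized."""
    img = np.asarray(img)
    h, w = img.shape

    # Subsampling: keep candidates only at every `step`-th row and column.
    # (For brevity the mask is computed everywhere and then subsampled; a
    # faster version would evaluate the test only at the sampled positions.)
    kernels = np.zeros((h, w), dtype=bool)
    kernels[::step, ::step] = text_pixel_mask(img, dist, rel_thresh)[::step, ::step]

    out = np.zeros((h, w), dtype=bool)
    for y in range(0, h, tile):
        for x in range(0, w, tile):
            if not kernels[y:y + tile, x:x + tile].any():
                continue  # no text kernel here: tile stays background
            block = img[y:y + tile, x:x + tile].astype(np.float64)
            # Assumed estimator for the tile's absolute threshold.
            threshold = (block.min() + block.max()) / 2.0
            out[y:y + tile, x:x + tile] = block < threshold
    return out
```

In this sketch `dist` should exceed the stroke width so that the neighbours of a stroke pixel fall on the background, and `tile` should span several stroke widths, as the abstract describes. Tiles without kernels are skipped entirely, which is where the approach saves work on large background areas.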