Abstract: A method for matching documents based on spatial layout of regions based on a shape similarity model for detecting similarity between general 2D objects. The method uses the shape similarity model to determine if two documents are similar by logical region generation in which logical regions are automatically derived from information in the documents to be matched, region correspondence, in which a correspondence is established between the regions on the documents, pose computation in which the individual transforms relating corresponding regions are recovered, and pose verification in which the extent of spatial similarity is measured by projecting one document onto the other using the computed pose parameters.
Abstract: A method and system of recognizing handwritten words in scanned documents, wherein by processing a document containing handwriting, features for word localization are extracted from handwritten words contained in said document through basis points taken from a single curve of text lines. The method is independent of page orientation, and does not assume that the individual lines of handwritten text are parallel, and the method does not require that word regions be aligned with text line orientation wherein intra-word statistics are derived from sample pages rather than using a fixed threshold. The method has applications in digital libraries, handwriting tokenization, document management and OCR systems.
Abstract: A method for generating forms using halftones along with a set of low-complexity image processing steps to extract the characters for recognition. The method exploits texture differences between character strokes and halftoned boxes or text fields. Form frames are rendered as black and white halftones and characters are extracted by exploiting differences in texture between the frames and the character strokes. A sequence of simple image processing operations, easily done in hardware, eliminates the frames while leaving the characters intact. Halftones, giving the appearance of a light color as in the dropout-color method, are easily produced by page description languages; thus, blank and filled-in forms can be scanned, printed, stored and photocopied at low cost.
Abstract: A method for matching objects based on spatial layout of regions based on a shape similarity model for detecting similarity between general 2D objects. The method uses the shape similarity model to determine if two objects are similar by logical region generation in which logical regions are automatically derived from information in the objects to be matched, region correspondence, in which a correspondence is established between the regions on the objects, pose computation in which the individual transforms relating corresponding regions are recovered, and pose verification in which the extent of spatial similarity is measured by projecting one document onto the other using the computed pose parameters. The method of the invention can be carried out in a microprocessor-based system capable of being programmed to carry out the method of the invention.