Abstract: The present invention handles fully-lined, semi-lined and line-less tables by identifying cells and cell separators during page recomposition as part of optical character recognition. The process iteratively merges word boxes into cells, finds separators, and merges cells bounded by the same separators, repeating these steps until the correct cell structure is found. With this method, rows are estimated, close words are merged into cells, columns are estimated, cells within columns are merged, columns are re-estimated, cells in the same row and column are merged into bigger cells, and then rows and cells are merged according to the detected table style. The method handles large, complex tables with multiple lines of symbols per cell, whether the table is lined, semi-lined or line-less.
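The first three steps named in the abstract (estimating rows, merging close words into cells, and estimating columns) can be illustrated with a minimal Python sketch. It is not the claimed method: it runs a single pass rather than iterating, and the `Box` type, the function names, and the `min_overlap` and `max_gap` thresholds are all hypothetical choices made for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Box:
    """Axis-aligned word or cell box (hypothetical type); y grows downward."""
    x0: float
    y0: float
    x1: float
    y1: float
    text: str


def group_rows(boxes, min_overlap=0.5):
    """Estimate rows: a box joins the first row it overlaps vertically."""
    rows = []
    for b in sorted(boxes, key=lambda b: b.y0):
        for row in rows:
            top = min(r.y0 for r in row)
            bottom = max(r.y1 for r in row)
            overlap = min(bottom, b.y1) - max(top, b.y0)
            # Assumed rule: overlap must cover a fraction of the box height.
            if overlap >= min_overlap * (b.y1 - b.y0):
                row.append(b)
                break
        else:
            rows.append([b])
    return rows


def merge_close_words(row, max_gap):
    """Merge horizontally adjacent words whose gap is at most max_gap."""
    cells = []
    for b in sorted(row, key=lambda b: b.x0):
        if cells and b.x0 - cells[-1].x1 <= max_gap:
            p = cells[-1]
            cells[-1] = Box(p.x0, min(p.y0, b.y0), max(p.x1, b.x1),
                            max(p.y1, b.y1), p.text + " " + b.text)
        else:
            cells.append(b)
    return cells


def column_separators(cells):
    """Estimate column separators as midpoints of x-gaps no cell crosses."""
    intervals = sorted((c.x0, c.x1) for c in cells)
    if not intervals:
        return []
    separators = []
    right = intervals[0][1]
    for x0, x1 in intervals[1:]:
        if x0 > right:
            separators.append((right + x0) / 2)
        right = max(right, x1)
    return separators


def estimate_table(boxes, max_gap=8.0):
    """One pass: rows, then cells per row, then column separators."""
    rows = [merge_close_words(r, max_gap) for r in group_rows(boxes)]
    separators = column_separators([c for r in rows for c in r])
    return rows, separators
```

For example, calling `estimate_table` on word boxes for "Unit", "price" and "Qty" on one text line would merge "Unit" and "price" into a single cell if their gap is below `max_gap`, and place a column separator in the empty vertical band before "Qty". A fuller treatment would repeat these steps, re-estimating columns after each merge and merging cells across text lines, as the abstract describes.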