Abstract: A computer-implemented method of converting a document in an input format to a document in a different output format is disclosed. The method generally comprises locating data in the input document, grouping the data into one or more intermediate format blocks in an intermediate format document, and converting the intermediate format document to the output format document using the intermediate format blocks. Each intermediate format block may be a paragraph, a line, a word, a table, or an image. The input document may be received over a network and the output document is sent over the network. A linked table of contents and/or an index may be generated. A computer executable program may be generated and inserted into the output document for selecting one output format for display.
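The three steps named in the abstract above (locate data, group it into intermediate format blocks, convert the blocks to the output format) can be illustrated with a minimal sketch. It assumes plain text as the input format and HTML as the output format, neither of which the abstract specifies. The names `Block`, `group_into_blocks`, and `to_html` are hypothetical, and only the paragraph block type is handled.

```python
from dataclasses import dataclass
from html import escape


@dataclass
class Block:
    """Hypothetical intermediate format block."""
    kind: str      # one of "paragraph", "line", "word", "table", "image"
    content: str


def group_into_blocks(text: str) -> list[Block]:
    """Locate text separated by blank lines and group it into paragraph blocks."""
    blocks = []
    for chunk in text.split("\n\n"):
        normalized = " ".join(chunk.split())  # collapse internal whitespace
        if normalized:
            blocks.append(Block("paragraph", normalized))
    return blocks


def to_html(blocks: list[Block]) -> str:
    """Convert the intermediate blocks to an assumed HTML output format."""
    return "\n".join(
        f"<p>{escape(b.content)}</p>" for b in blocks if b.kind == "paragraph"
    )


if __name__ == "__main__":
    print(to_html(group_into_blocks("Hello\nworld\n\nA < B")))
```

Keeping the intermediate blocks separate from both parsers and writers means a new output format would only need a new converter over the same block list, which appears to be the point of the intermediate document in the abstract.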
Abstract: A processor-based method for recognizing, capturing, and storing tabular data receives digital-computer data representing a document either as a pixel-format document image or as formatted text. Within the digital computer, either form of the digital-computer data is processed to locate tabular data present therein. After a table has been located, tabular data is extracted from cells present in either form of the digital-computer data. The extracted tabular data is stored into a database present on the digital computer.
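The locate, extract, and store sequence described above can be sketched for the formatted-text case only. The heuristic used here (a line belongs to a table if it splits into two or more cells on runs of at least two whitespace characters) is an assumption made for illustration, not the method the abstract discloses. The function names and the `cells` schema are likewise hypothetical, and the pixel-format image case is not covered.

```python
import re
import sqlite3


def locate_table(lines: list[str]) -> list[list[str]]:
    """Return cell lists for lines that look like table rows.

    Assumed heuristic: columns are separated by two or more whitespace characters.
    """
    rows = []
    for line in lines:
        cells = re.split(r"\s{2,}", line.strip())
        if len(cells) >= 2:
            rows.append(cells)
    return rows


def store_rows(conn: sqlite3.Connection, rows: list[list[str]]) -> None:
    """Store each extracted cell with its row and column index."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS cells "
        "(row_idx INTEGER, col_idx INTEGER, value TEXT)"
    )
    conn.executemany(
        "INSERT INTO cells VALUES (?, ?, ?)",
        [(r, c, v) for r, row in enumerate(rows) for c, v in enumerate(row)],
    )
    conn.commit()


if __name__ == "__main__":
    text = ["Quarterly report", "Item   Qty", "Bolt   12"]
    conn = sqlite3.connect(":memory:")  # in-memory database for the example
    store_rows(conn, locate_table(text))
    print(conn.execute("SELECT * FROM cells").fetchall())
```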