Abstract: A data processing system and corresponding method are presented. The system includes a communication module to retrieve a first document and a second document; a first database to store the first and second documents; and a processor. The processor extracts first data from the first document, orders the first data into a first data structure using a set of translation tables. The processor also extracts second data from the second document, orders the second data into a second data structure using the set of translation tables. The first data structure and the second data structure have a first and a second format, respectively. The first data and the second data comprise a plurality of entities, and the processor is configured to compare entities from the first data structure and the second data structure to identify matching entities.