Abstract: The present invention relates to computer-based technology for linking or matching records in data files, based on at least one identifier in common, with a threshold probability that records are linked, the method uses a Bayesian probabilistic approach to determine the likelihood that the identified records are linked.