📄️ Probabilistic record linkage
Probabilistic linkage, so called because it uses conditional probabilities to compute likelihoods, is one of the most common methods used for record linkage. In this approach, records are compared on a pairwise basis. A comparison of two records involves comparing all the individual fields. Each field comparison results in a score based on specific weights assigned to that field. These scores are summed up for the pair comparison, and if this summed score is over a specific threshold, the two records are designated a match.
📄️ Privacy-preserving record linkage
Privacy preserving record linkage (PPRL) refers to record linkage conducted on encoded or otherwise obfuscated identifiers. It is used to improve the privacy of the record linkage process. It allows linkage to occur without any personally identifying information being transmitted to the linkage unit.
📄️ Estimating probabilities
LinXmart uses a probabilistic linkage engine that requires a number of parameters for operation. A set of default parameter values are automatically created when a new Linkage Project is created. To maximise linkage quality, it may be necessary to modify these to best suit the data in question.
📄️ Envelope file format
Envelopes are used to pass data to the LinXmart data linkage system, with specific information that describes what the data is and what LinXmart should do with it.