
Probabilistic record linkage

Sayers, Adrian, Ben-Shlomo, Yoav, Blom, Ashley W. and Steele, Fiona (2015) Probabilistic record linkage. International Journal of Epidemiology. ISSN 0300-5771

PDF - Published Version
Available under License Creative Commons Attribution.


Identification Number: 10.1093/ije/dyv322


Studies that use probabilistic record linkage are becoming increasingly common. However, the methods underpinning probabilistic record linkage are not widely taught or understood, and these studies can therefore appear to be a ‘black box’ research tool. In this article, we aim to describe the process of probabilistic record linkage through a simple exemplar. We first introduce the concept of deterministic linkage and contrast it with probabilistic linkage. We illustrate each step of the process using the exemplar and describe the data structure required to perform a probabilistic linkage. We describe how to calculate and interpret match weights and how to convert match weights into posterior probabilities of a match using Bayes' theorem. We conclude with a brief discussion of the computational demands of record linkage, how to assess the quality of a linkage algorithm, and how epidemiologists can maximize the value of their record-linked research by using robust record linkage methods.
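The abstract mentions calculating match weights and converting them into posterior probabilities with Bayes' theorem, but this record does not reproduce the formulas. The sketch below is therefore an illustration only, assuming the standard Fellegi-Sunter formulation: each field has an m-probability (agreement given a true match) and a u-probability (agreement given a non-match), fields are treated as conditionally independent, and weights are log base 2 likelihood ratios. The field names, the m and u values, and the function names are all hypothetical and are not taken from the article.

```python
import math

# Hypothetical (m, u) pairs per field: m = P(agree | match), u = P(agree | non-match).
# These numbers are made up for illustration; they do not come from the article.
FIELDS = {
    "surname": (0.95, 0.01),
    "birth_year": (0.98, 0.05),
    "sex": (0.99, 0.50),
}


def field_weight(m, u, agrees):
    """Log2 likelihood ratio contributed by one field (Fellegi-Sunter style)."""
    if agrees:
        return math.log2(m / u)
    return math.log2((1 - m) / (1 - u))


def posterior_match_probability(total_weight, prior):
    """Apply Bayes' theorem on the odds scale: posterior odds = prior odds * 2**weight."""
    prior_odds = prior / (1 - prior)
    posterior_odds = prior_odds * 2 ** total_weight
    return posterior_odds / (1 + posterior_odds)


def score_pair(agreement, prior):
    """Sum field weights for one record pair (assumes conditional independence of fields)."""
    total = sum(
        field_weight(*FIELDS[name], agrees) for name, agrees in agreement.items()
    )
    return total, posterior_match_probability(total, prior)


if __name__ == "__main__":
    # Assumed prior: 1 in 1000 candidate pairs is a true match.
    pattern = {"surname": True, "birth_year": True, "sex": False}
    weight, probability = score_pair(pattern, prior=0.001)
    print(f"total weight = {weight:.2f}, posterior = {probability:.4f}")
```

Working on the odds scale keeps the Bayes step to a single multiplication; the prior here stands in for the expected proportion of true matches among all compared pairs, which in practice has to be estimated for the data sets being linked.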

Item Type: Article
Additional Information: © 2015 The Authors
Divisions: Statistics
Subjects: H Social Sciences > HA Statistics
Date Deposited: 08 Jan 2016 16:08
Last Modified: 20 Oct 2021 00:31
Projects: MR/L01226X/1
Funders: MRC Fellowship
