
Privacy protection from sampling and perturbation in survey microdata

Shlomo, Natalie and Skinner, Chris J. (2012) Privacy protection from sampling and perturbation in survey microdata. Journal of Privacy and Confidentiality, 4 (1). pp. 155-169. ISSN 2575-8527



Statistical agencies release microdata from social surveys as public-use files after applying statistical disclosure limitation (SDL) techniques. Disclosure risk is typically assessed in terms of identification risk, on the supposition that small counts in the cross-classification of identifying key variables (a key) could be used to identify an individual and so reveal confidential information. In this paper we explore the application of definitions of privacy from the computer science literature to the same problem, with a focus on sampling and a form of perturbation which can be represented as misclassification. We consider two privacy definitions: differential privacy and probabilistic differential privacy. Chaudhuri and Mishra (2006) have shown that sampling does not guarantee differential privacy, but that, under certain conditions, it may ensure probabilistic differential privacy. We discuss these definitions and conditions in the context of survey microdata. We then extend this discussion to the case of perturbation. We show that differential privacy can be ensured if and only if the perturbation employs a misclassification matrix with no zero entries. We also show that probabilistic differential privacy is a viable alternative to differential privacy when there are zeros in the misclassification matrix. We discuss some common examples of SDL methods, in some of which zeros may be prevalent in the misclassification matrix.
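To illustrate why zero entries matter, the following is a minimal sketch and not the authors' implementation. It assumes a simplified setting in which a single categorical key value is perturbed independently, with a row-stochastic misclassification matrix whose entry in row j and column k is the probability that original category j is released as category k. The function name `dp_epsilon` and the example matrices are hypothetical. Under these assumptions, the smallest epsilon bounding the ratio of release probabilities for any two original categories is the largest log-ratio within a column, and it becomes infinite as soon as a column mixes zero and non-zero entries.

```python
import math


def dp_epsilon(matrix):
    """Return the smallest epsilon such that, for every released category k
    and every pair of original categories j and j2,
    matrix[j][k] <= exp(epsilon) * matrix[j2][k].

    Returns math.inf when some released category is possible from one
    original category but impossible from another (a zero entry in a
    column that also has a positive entry).
    """
    epsilon = 0.0
    n_cols = len(matrix[0])
    for k in range(n_cols):
        column = [row[k] for row in matrix]
        largest, smallest = max(column), min(column)
        if largest == 0:
            # Category k is never released, so it constrains nothing.
            continue
        if smallest == 0:
            # The probability ratio is unbounded for this column.
            return math.inf
        epsilon = max(epsilon, math.log(largest / smallest))
    return epsilon


# Hypothetical matrices for illustration only.
all_positive = [[0.8, 0.2],
                [0.2, 0.8]]
with_zero = [[0.9, 0.1],
             [0.0, 1.0]]

print(dp_epsilon(all_positive))  # finite bound
print(dp_epsilon(with_zero))     # math.inf: no finite epsilon exists
```

In this sketch a matrix with strictly positive entries always yields a finite epsilon, while a structural zero (for example, a category that is never swapped into another) yields no finite bound, which is the situation in which the abstract proposes probabilistic differential privacy as an alternative.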

Item Type: Article
Additional Information: © 2012 The Authors
Divisions: Statistics
Subjects: H Social Sciences > H Social Sciences (General); H Social Sciences > HA Statistics
Date Deposited: 31 Aug 2012 10:36
Last Modified: 16 May 2024 01:28
