A Measure of Disclosure Risk for Microdata
- 1 October 2002
- journal article
- Published by Oxford University Press (OUP) in Journal of the Royal Statistical Society Series B: Statistical Methodology
- Vol. 64 (4) , 855-867
- https://doi.org/10.1111/1467-9868.00365
Abstract
Summary: Protection against disclosure is important for statistical agencies releasing microdata files from sample surveys. Simple measures of disclosure risk can provide useful evidence to support decisions about release. We propose a new measure of disclosure risk: the probability that a unique match between a microdata record and a population unit is correct. We argue that this measure has at least two advantages. First, we suggest that it may be a more realistic measure of risk than two measures that are currently used with census data. Second, we show that consistent inference (in a specified sense) may be made about this measure from sample data without strong modelling assumptions. This is a surprising finding, in its contrast with the properties of the two ‘similar’ established measures. As a result, this measure has potentially useful applications to sample surveys. In addition to obtaining a simple consistent predictor of the measure, we propose a simple variance estimator and show that it is consistent. We also consider the extension of inference to allow for certain complex sampling schemes. We present a numerical study based on 1991 census data for about 450 000 enumerated individuals in one area of Great Britain. We show that the theoretical results on the properties of the point predictor of the measure of risk and its variance estimator hold to a good approximation for these data.This publication has 11 references indexed in Scilit:
- Proposals for 2001 Samples of Anonymized Records: An Assessment of Disclosure RiskJournal of the Royal Statistical Society Series A: Statistics in Society, 2001
- Elements of Statistical Disclosure ControlPublished by Springer Nature ,2001
- Estimating the Number of Species: A ReviewJournal of the American Statistical Association, 1993
- Disclosure risk for microdata stemming from official statisticsStatistica Neerlandica, 1992
- Strategies for measuring risk in public use microdata filesStatistica Neerlandica, 1992
- Disclosure Control of MicrodataJournal of the American Statistical Association, 1990
- The Risk of Disclosure for MicrodataJournal of Business & Economic Statistics, 1989
- Disclosure Risk and Disclosure Avoidance for MicrodataJournal of Business & Economic Statistics, 1988
- Approximation Theorems of Mathematical StatisticsPublished by Wiley ,1980
- On the Estimation of the Number of Classes in a PopulationThe Annals of Mathematical Statistics, 1949