Abstract
The number of unique records in a released microdata set which are unique in the population is an important measure of re-identification disclosure risk in microdata. However, the microdata sample contains information about the disclosure risk more than the number of unique records. This paper deals with the development of a technique based on a loglinear models to extract more information from the sample about the disclosure risk not only through the number of sample unique records but also through the number of twin and triple records. These information may help microdata release committee in taking decision about releasing the data for public use. For illustration we apply the proposed method to data from a General Household Survey 1996–1997.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Agresti, A.: An Introduction to Categorical Data Analysis. Wiley, New York (1996)
Bethlehem, J.G., Keller, W.J., Pannekoek, J.: Disclosure control of microdata. J. Amer. Statist. Associ. 85, 38–45 (1990)
Bishop, Y., Fienberg, S., Holland, P.: Discrete Multivariate Analysis: Theory and Practice. MIT Press, Cambridge (1975)
Blien, U., Wirth, H., Muller, M.: Disclosure risk for microdata stemming from official statistics. Statistica Neerlandica 46, 69–82 (1992)
Cameron, C.A., Trivedi, P.K.: Regression Analysis of Count Data, Cambridge (1998)
Dale, A., Elliot, M.: Proposals for 2001 samples of anonymized records: An assessment of disclosure risk. J. Roy. Statist. Soci., Ser. A 164, 427–447 (2001)
Dalenius, T.: Finding a needle in a haystack or identifying anonymous census records. J. Official Statist. 2, 329–336 (1986)
Duncan, G.T., Lambert, D.: The risk of disclosure for microdata. J. Business Econom. Statist 7, 207–217 (1989)
Duncan, G.T., Lambert, D.: Disclosure-limited data dissemination. J. Amer. Statist. Associ 81, 10–28 (1986)
Fienberg, S., Makov, U.: Confidentiality, uniqueness and disclosure limitation for categorical data. J. Official Statist. 14, 385–397 (1998)
Skinner, C., Holmes, D.: Modelling population uniqueness. In: Proceedings of the International Seminar on Confidentiality, Dublin, pp. 175–199
Skinner, C., Holmes, D.: Estimating the re-identification risk per record in microdata. J. Official Statist. 14, 361–372 (1998)
Willenborg, L., Waal, T.D.: Elements of Statistical Disclosure Control. Springer, New York (2001)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Elamir, E.A.H. (2004). Analysis of Re-identification Risk Based on Log-Linear Models. In: Domingo-Ferrer, J., Torra, V. (eds) Privacy in Statistical Databases. PSD 2004. Lecture Notes in Computer Science, vol 3050. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-25955-8_21
Download citation
DOI: https://doi.org/10.1007/978-3-540-25955-8_21
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-22118-0
Online ISBN: 978-3-540-25955-8
eBook Packages: Springer Book Archive