Skip to main content

Analysis of Re-identification Risk Based on Log-Linear Models

  • Conference paper
Privacy in Statistical Databases (PSD 2004)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 3050))

Included in the following conference series:

Abstract

The number of unique records in a released microdata set which are unique in the population is an important measure of re-identification disclosure risk in microdata. However, the microdata sample contains information about the disclosure risk more than the number of unique records. This paper deals with the development of a technique based on a loglinear models to extract more information from the sample about the disclosure risk not only through the number of sample unique records but also through the number of twin and triple records. These information may help microdata release committee in taking decision about releasing the data for public use. For illustration we apply the proposed method to data from a General Household Survey 1996–1997.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Agresti, A.: An Introduction to Categorical Data Analysis. Wiley, New York (1996)

    MATH  Google Scholar 

  2. Bethlehem, J.G., Keller, W.J., Pannekoek, J.: Disclosure control of microdata. J. Amer. Statist. Associ. 85, 38–45 (1990)

    Article  Google Scholar 

  3. Bishop, Y., Fienberg, S., Holland, P.: Discrete Multivariate Analysis: Theory and Practice. MIT Press, Cambridge (1975)

    MATH  Google Scholar 

  4. Blien, U., Wirth, H., Muller, M.: Disclosure risk for microdata stemming from official statistics. Statistica Neerlandica 46, 69–82 (1992)

    Article  Google Scholar 

  5. Cameron, C.A., Trivedi, P.K.: Regression Analysis of Count Data, Cambridge (1998)

    Google Scholar 

  6. Dale, A., Elliot, M.: Proposals for 2001 samples of anonymized records: An assessment of disclosure risk. J. Roy. Statist. Soci., Ser. A 164, 427–447 (2001)

    Article  MATH  MathSciNet  Google Scholar 

  7. Dalenius, T.: Finding a needle in a haystack or identifying anonymous census records. J. Official Statist. 2, 329–336 (1986)

    Google Scholar 

  8. Duncan, G.T., Lambert, D.: The risk of disclosure for microdata. J. Business Econom. Statist 7, 207–217 (1989)

    Article  Google Scholar 

  9. Duncan, G.T., Lambert, D.: Disclosure-limited data dissemination. J. Amer. Statist. Associ 81, 10–28 (1986)

    Article  Google Scholar 

  10. Fienberg, S., Makov, U.: Confidentiality, uniqueness and disclosure limitation for categorical data. J. Official Statist. 14, 385–397 (1998)

    Google Scholar 

  11. Skinner, C., Holmes, D.: Modelling population uniqueness. In: Proceedings of the International Seminar on Confidentiality, Dublin, pp. 175–199

    Google Scholar 

  12. Skinner, C., Holmes, D.: Estimating the re-identification risk per record in microdata. J. Official Statist. 14, 361–372 (1998)

    Google Scholar 

  13. Willenborg, L., Waal, T.D.: Elements of Statistical Disclosure Control. Springer, New York (2001)

    Book  MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2004 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Elamir, E.A.H. (2004). Analysis of Re-identification Risk Based on Log-Linear Models. In: Domingo-Ferrer, J., Torra, V. (eds) Privacy in Statistical Databases. PSD 2004. Lecture Notes in Computer Science, vol 3050. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-25955-8_21

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-25955-8_21

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-22118-0

  • Online ISBN: 978-3-540-25955-8

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics