Skip to main content

An Investigation of Model-Based Microdata Masking for Magnitude Tabular Data Release

  • Conference paper
Privacy in Statistical Databases (PSD 2012)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 7556))

Included in the following conference series:

Abstract

Traditionally, magnitude tabular data and microdata masking have been treated as two independent problems. An increasing number of government agencies are exploring establishing remote data access centers where both types of data release may occur. We argue that in these cases, consistency across both types of data release becomes an important component in the assessment of the performance of a certain masking and a common approach to the problem of masking both tabular and microdata would produce better results than approaches that address the two problems separately. Along this line, in this study we investigate the efficacy of using a model based microdata masking method (specifically Data shuffling) when the data is also used for magnitude tabular data release. We identify some aspects of our proposal that are important in addressing this issue further to perform a comprehensive evaluation of techniques suitable for both microdata and magnitude tabular data release.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Chipperfield, J., Yu, F.: Protecting Confidentiality in a Remote Analysis Server for Tabulation and Analysis of Data. UNECE Work Session on Statistical Disclosure Limitation, October 26-28, Tarragona, Spain (2011)

    Google Scholar 

  2. Dandekar, R.A., Cox, L.H.: Synthetic Tabular Data: An Alternative to Complementary Cell Suppression (2002) (unpublished manuscript)

    Google Scholar 

  3. Giessing, S.: Post-tabular Stochastic Noise to Protect Skewed Business Data. UNECE Work Session on Statistical Disclosure Limitation, October 26-28, Tarragona, Spain (2011)

    Google Scholar 

  4. Höhne, J.: Anonymisierungsverfahren für Paneldaten. In: Wirtschafts- und Sozialstatistisches Archiv., Bd. 2, pp. 259–275. Springer (2008)

    Google Scholar 

  5. Honinger, J., Höhne, J.: Morpheus Remote Access to Microdata with a Quality Measure. UNECE Work Session on Statistical Disclosure Limitation, October 26-28, Tarragona, Spain (2011)

    Google Scholar 

  6. Massell, P., Funk, J.: Protecting the Confidentiality of Tables by Adding Noise to the Underlying Microdata. In: Proceedings of the 2007 Third International Conference on Establishment Surveys (ICES-III), Montreal, Canada, June 18-21 (2007)

    Google Scholar 

  7. Massell, P., Zayatz, L., Funk, J.: Protecting the Confidentiality of Survey Tabular Data by Adding Noise to the Underlying Microdata: Application to the Commodity Flow Survey. In: Domingo-Ferrer, J., Franconi, L. (eds.) PSD 2006. LNCS, vol. 4302, pp. 304–317. Springer, Heidelberg (2006)

    Chapter  Google Scholar 

  8. Muralidhar, K., Sarathy, R.: Data Shuffling: A New Approach for Masking Numerical Data. Management Science 52, 658–670 (2006)

    Article  Google Scholar 

  9. O’Keefe, C.M., Good, N.M.: Regression Output from a Remote Analysis Server. Data & Knowledge Engineering 68, 1175–1186 (2009)

    Article  Google Scholar 

  10. Robertson, D.A., Ethier, R.: Cell Suppression: Experience and Theory. In: Domingo-Ferrer, J. (ed.) Inference Control in Statistical Databases. LNCS, vol. 2316, pp. 8–20. Springer, Heidelberg (2002)

    Chapter  Google Scholar 

  11. Simard, M.: Progress with Real Time Remote Access. UNECE Work Session on Statistical Disclosure Limitation, Tarragona, Spain, October 26-28 (2011)

    Google Scholar 

  12. Sparks, R., Carter, C., Donnelly, J.B., O’Keefe, C.M., Duncan, J., Keighley, T., McAullay, D.: Remote Access Methods for Exploratory Data Analysis and Statistical Modelling: Privacy-Preserving Analytics TM. Comput. Methods Programs Biomed. 91, 208–222 (2008)

    Article  Google Scholar 

  13. Tarkoma, J.: Remote Access in Statistics Finland. UNECE Work Session on Statistical Disclosure Limitation, October 26-28, Tarragona, Spain (2011)

    Google Scholar 

  14. Trottini, M., Franconi, L., Polettini, S.: Italian Household Expenditure Survey: A Proposal for Data Dissemination. In: Domingo-Ferrer, J., Franconi, L. (eds.) PSD 2006. LNCS, vol. 4302, pp. 318–333. Springer, Heidelberg (2006)

    Chapter  Google Scholar 

  15. Trottini, M., Muralidhar, K., Sarathy, R.: Maintaining Tail Dependence in Data Shuffling Using t Copula. Statistics & Probability Letters 81(3), 420–428 (2011)

    Article  MathSciNet  MATH  Google Scholar 

  16. Zayatz, L.: New Implementations of Noise for Tabular Magnitude Data, Synthetic Tabular Frequencies and Microdata, and a Remote Microdata Analysis System. Statistics#2007-17, Research Report Series, US Census Bureau (2007)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2012 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Trottini, M., Muralidhar, K., Sarathy, R. (2012). An Investigation of Model-Based Microdata Masking for Magnitude Tabular Data Release. In: Domingo-Ferrer, J., Tinnirello, I. (eds) Privacy in Statistical Databases. PSD 2012. Lecture Notes in Computer Science, vol 7556. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-33627-0_5

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-33627-0_5

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-33626-3

  • Online ISBN: 978-3-642-33627-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics