Abstract
Traditionally, magnitude tabular data and microdata masking have been treated as two independent problems. An increasing number of government agencies are exploring establishing remote data access centers where both types of data release may occur. We argue that in these cases, consistency across both types of data release becomes an important component in the assessment of the performance of a certain masking and a common approach to the problem of masking both tabular and microdata would produce better results than approaches that address the two problems separately. Along this line, in this study we investigate the efficacy of using a model based microdata masking method (specifically Data shuffling) when the data is also used for magnitude tabular data release. We identify some aspects of our proposal that are important in addressing this issue further to perform a comprehensive evaluation of techniques suitable for both microdata and magnitude tabular data release.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Chipperfield, J., Yu, F.: Protecting Confidentiality in a Remote Analysis Server for Tabulation and Analysis of Data. UNECE Work Session on Statistical Disclosure Limitation, October 26-28, Tarragona, Spain (2011)
Dandekar, R.A., Cox, L.H.: Synthetic Tabular Data: An Alternative to Complementary Cell Suppression (2002) (unpublished manuscript)
Giessing, S.: Post-tabular Stochastic Noise to Protect Skewed Business Data. UNECE Work Session on Statistical Disclosure Limitation, October 26-28, Tarragona, Spain (2011)
Höhne, J.: Anonymisierungsverfahren für Paneldaten. In: Wirtschafts- und Sozialstatistisches Archiv., Bd. 2, pp. 259–275. Springer (2008)
Honinger, J., Höhne, J.: Morpheus Remote Access to Microdata with a Quality Measure. UNECE Work Session on Statistical Disclosure Limitation, October 26-28, Tarragona, Spain (2011)
Massell, P., Funk, J.: Protecting the Confidentiality of Tables by Adding Noise to the Underlying Microdata. In: Proceedings of the 2007 Third International Conference on Establishment Surveys (ICES-III), Montreal, Canada, June 18-21 (2007)
Massell, P., Zayatz, L., Funk, J.: Protecting the Confidentiality of Survey Tabular Data by Adding Noise to the Underlying Microdata: Application to the Commodity Flow Survey. In: Domingo-Ferrer, J., Franconi, L. (eds.) PSD 2006. LNCS, vol. 4302, pp. 304–317. Springer, Heidelberg (2006)
Muralidhar, K., Sarathy, R.: Data Shuffling: A New Approach for Masking Numerical Data. Management Science 52, 658–670 (2006)
O’Keefe, C.M., Good, N.M.: Regression Output from a Remote Analysis Server. Data & Knowledge Engineering 68, 1175–1186 (2009)
Robertson, D.A., Ethier, R.: Cell Suppression: Experience and Theory. In: Domingo-Ferrer, J. (ed.) Inference Control in Statistical Databases. LNCS, vol. 2316, pp. 8–20. Springer, Heidelberg (2002)
Simard, M.: Progress with Real Time Remote Access. UNECE Work Session on Statistical Disclosure Limitation, Tarragona, Spain, October 26-28 (2011)
Sparks, R., Carter, C., Donnelly, J.B., O’Keefe, C.M., Duncan, J., Keighley, T., McAullay, D.: Remote Access Methods for Exploratory Data Analysis and Statistical Modelling: Privacy-Preserving Analytics TM. Comput. Methods Programs Biomed. 91, 208–222 (2008)
Tarkoma, J.: Remote Access in Statistics Finland. UNECE Work Session on Statistical Disclosure Limitation, October 26-28, Tarragona, Spain (2011)
Trottini, M., Franconi, L., Polettini, S.: Italian Household Expenditure Survey: A Proposal for Data Dissemination. In: Domingo-Ferrer, J., Franconi, L. (eds.) PSD 2006. LNCS, vol. 4302, pp. 318–333. Springer, Heidelberg (2006)
Trottini, M., Muralidhar, K., Sarathy, R.: Maintaining Tail Dependence in Data Shuffling Using t Copula. Statistics & Probability Letters 81(3), 420–428 (2011)
Zayatz, L.: New Implementations of Noise for Tabular Magnitude Data, Synthetic Tabular Frequencies and Microdata, and a Remote Microdata Analysis System. Statistics#2007-17, Research Report Series, US Census Bureau (2007)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Trottini, M., Muralidhar, K., Sarathy, R. (2012). An Investigation of Model-Based Microdata Masking for Magnitude Tabular Data Release. In: Domingo-Ferrer, J., Tinnirello, I. (eds) Privacy in Statistical Databases. PSD 2012. Lecture Notes in Computer Science, vol 7556. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-33627-0_5
Download citation
DOI: https://doi.org/10.1007/978-3-642-33627-0_5
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-33626-3
Online ISBN: 978-3-642-33627-0
eBook Packages: Computer ScienceComputer Science (R0)