Skip to main content

Designing Multiple Releases from the Small and Medium Enterprises Survey

  • Conference paper
Privacy in Statistical Databases (PSD 2012)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 7556))

Included in the following conference series:

  • 911 Accesses

Abstract

In this paper the problem of multiple releases from a single survey is addressed. The design and release of a public use file derived from a microdata file for research are discussed. Problems and benefits of multiple microdata releases are tackled. In order to satisfy wider variety of user needs, the aim of this work is to extend the microdata portfolio of a National Statistical Institute, without additional resources. Results obtained when applying the proposed methodology on the Italian Small and Medium Enterprises survey are illustrated.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Hundepool, A., Domingo-Ferrer, J., Franconi, L., Giessing, S., Lenz, R., Longhurst, J., Schulte Nordholt, E., Seri, G., De Wolfe, P.: Handbook on Statistical Disclosure Control. ESSnet on Statistical Disclosure Control (2010)

    Google Scholar 

  2. Becker, C., Gather, U.: The masking breakdown point of multivariate outlier identification rules. Journal of the American Statistical Association 94, 947–955 (1999)

    Article  MathSciNet  MATH  Google Scholar 

  3. Casciano, C., Ichim, D., Corallo, L.: Sampling as a way to reduce risk and create a Public Use File maintaining weighted totals. In: Joint UNECE - Eurostat Work Session on Statistical Data Confidentiality, Tarragona, Spain (2011)

    Google Scholar 

  4. Cerioli, A., Farcomeni, A.: Error rates for multivariate outlier detection. Computational Statistics and Data Analysis 55, 544–553 (2011)

    Article  MathSciNet  Google Scholar 

  5. Chew, V.: Confidence, Prediction, and Tolerance Regions for the Multivariate Normal Distribution. Journal of the American Statistical Association 61, 605–617 (1966)

    Article  MathSciNet  Google Scholar 

  6. Deville, J., Tillé, Y.: Efficient Balanced Sampling: The Cube Method. Biometrika 91(4), 893–912 (2004)

    Article  MathSciNet  MATH  Google Scholar 

  7. Elamir, E., Skinner, C.: Record level measures of disclosure risk for survey microdata. J. Official Statist. 22(3), 525–539 (2006)

    Google Scholar 

  8. Foschi, F.: Disclosure risk for high dimensional business microdata. In: Joint UNECE - Eurostat Work Session on Statistical Data Confidentiality, Tarragona, Spain (2011)

    Google Scholar 

  9. Franconi, L., Polettini, S.: Individual Risk Estimation in μ-Argus: A Review. In: Domingo-Ferrer, J., Torra, V. (eds.) PSD 2004. LNCS, vol. 3050, pp. 262–272. Springer, Heidelberg (2004)

    Chapter  Google Scholar 

  10. Hardin, J., Rocke, D.M.: The distribution of robust distances. Journal of Computational and Graphical Statistics 14, 910–927 (2005)

    Article  MathSciNet  Google Scholar 

  11. Ichim, D.: Confidentiality issues related to small area estimation and dissemination. In: Proceedings of New Techniques and Technologies for Statistics (2011)

    Google Scholar 

  12. Neykov, N., Müller, M.: Breakdown Point and Computation of Trimmed Likelihood Estimators in Generalized Linear Models. In: Dutter, R., Filzmoser, P., Gather, U., Rousseeuw, P. (eds.) Developments in Robust Statistics, pp. 277–286. Physica-Verlag, Heidelberg (2003)

    Chapter  Google Scholar 

  13. Neykov, N.M., Filzmoser, P., Dimova, R., Neytchev, P.: Robust fitting of mixtures using the Trimmed Likelihood Estimator. Computational Statistics & Data Analysis 17(3), 299–308 (2007)

    Article  MathSciNet  Google Scholar 

  14. Rousseeuw, P.J., Van Driessen, K.: A fast algorithm for the minimum covariance determinant estimator. Technometrics 41, 212–223 (1999)

    Article  Google Scholar 

  15. Pison, G., Van Aelst, S., Willems, G.: Small sample corrections for LTS and MCD. Metrika 55, 111–123 (2002)

    Article  MathSciNet  Google Scholar 

  16. Tallis, G.M.: Elliptical and radial truncation in normal samples. Annals of Mathematical Statistics 34, 940–944 (1963)

    Article  MathSciNet  MATH  Google Scholar 

  17. Trottini, M., Franconi, L., Polettini, S.: Italian Household Expenditure Survey: A Proposal for Data Dissemination. In: Domingo-Ferrer, J., Franconi, L. (eds.) PSD 2006. LNCS, vol. 4302, pp. 318–333. Springer, Heidelberg (2006)

    Chapter  Google Scholar 

  18. Vandev, D.L., Neykov, N.M.: Robust maximum likelihood in the Gaussian case. In: Morgenthaler, S., Ronchetti, E., Stahel, W. (eds.) New Directions in Statistical Data Analysis and Robustness, pp. 257–264. Birkhauser Verlag, Basel (1993)

    Google Scholar 

  19. Wilks, S.: Multivariate Statistical Outliers Sankhyā, A 25(4), 407–426 (1963)

    Google Scholar 

  20. Willenborg, L., De Waal, T.: Elements of Statistical Disclosure Control. Lecture Notes in Statistics, vol. 155. Springer, New York (2001)

    Book  MATH  Google Scholar 

  21. http://www.census.gov/main/www/pums.html

  22. http://www.forschungsdatenzentrum.de/campus-file.asp

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2012 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Foschi, F., Casciano, M.C., Franconi, L., Ichim, D. (2012). Designing Multiple Releases from the Small and Medium Enterprises Survey. In: Domingo-Ferrer, J., Tinnirello, I. (eds) Privacy in Statistical Databases. PSD 2012. Lecture Notes in Computer Science, vol 7556. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-33627-0_16

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-33627-0_16

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-33626-3

  • Online ISBN: 978-3-642-33627-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics