Abstract
There has been little discussion in the literature on how many multiply imputed datasets an agency should release. From the perspective of the secondary data analyst, a large number of datasets is desirable, since the additional variance introduced by the imputation decreases with the number of released datasets. For example, Reiter (2003) finds nearly a 100% increase in the variance of regression coefficients when going from 50 to two partially synthetic datasets. From the perspective of the agency, a small number of datasets is desirable, since the information available to ill-intentioned users seeking to identify individuals in the released datasets increases with the number of released datasets. Thus, agencies considering the release of partially synthetic data generally are confronted with a trade-off between disclosure risk and data utility.
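The inverse relation between the number of released datasets and the imputation variance can be sketched with the combining rule for partially synthetic data from Reiter (2003), where the total variance is the average within-dataset variance plus the between-dataset variance divided by the number of datasets m. The numeric values below are hypothetical, chosen only to illustrate the order of magnitude of the effect described in the abstract:

```python
def total_variance(v_bar, b, m):
    """Combining-rule variance for a point estimate from m partially
    synthetic datasets (Reiter 2003): T_m = v_bar + b / m, where v_bar is
    the average within-dataset variance and b the between-dataset variance."""
    return v_bar + b / m

# Hypothetical variance components, for illustration only.
v_bar, b = 1.0, 2.0

t_2 = total_variance(v_bar, b, 2)    # m = 2 released datasets
t_50 = total_variance(v_bar, b, 50)  # m = 50 released datasets

print(t_2, t_50, round(t_2 / t_50, 2))  # 2.0 1.04 1.92
```

With these assumed components, releasing only two datasets instead of 50 roughly doubles the total variance, consistent with the "nearly a 100% increase" reported in the abstract; the agency's countervailing concern is that each additional released dataset also gives an intruder more information.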
Most of this chapter is taken from Drechsler and Reiter (2009) and Reiter and Drechsler (2010).
© 2011 Springer Science+Business Media, LLC
Cite this chapter
Drechsler, J. (2011). A Two-Stage Imputation Procedure to Balance the Risk–Utility Trade-Off. In: Synthetic Datasets for Statistical Disclosure Control. Lecture Notes in Statistics(), vol 201. Springer, New York, NY. https://doi.org/10.1007/978-1-4614-0326-5_9
Print ISBN: 978-1-4614-0325-8
Online ISBN: 978-1-4614-0326-5
eBook Packages: Mathematics and Statistics (R0)