Skip to main content

Small Area Estimation in the Presence of Linkage Errors

  • Conference paper
  • First Online:
Soft Methods for Data Science (SMPS 2016)

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 456))

Included in the following conference series:

Abstract

In Official Statistics, interest for data integration has been increasingly growing, though the effect of this procedure on the statistical analyses has been disregarded for a long time. In recent years, however, it is largely recognized that linkage is not an error-free procedure and linkage errors, as false links and missed links can invalidate standard estimates. More recently, growing attention is devoted to the effect of linkage errors on the subsequent analyses. For instance, Samart and Chambers (Samart in Aust N Z J Stat 56, 2014 [14]) consider the effect of linkage errors on mixed effect models. Their proposal finds a natural application in the context of longitudinal studies, where repeated measures are taken on the same individuals. In official statistics, the mixed models is largely exploited for small area estimation to increase detailed information at local level. In this work, an EBLUP estimator that takes account of the linkage errors is derived.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Battese GE, Harter RM, Fuller WA (1988) An error-components model for prediction of crop areas using survey and satellite data. J Am Stat Assoc 83:28–36

    Article  Google Scholar 

  2. Chambers R (2009) Regression analysis of probability-linked data. In: Official Statistics Research Series, vol 4

    Google Scholar 

  3. Di Consiglio L, Tuoto T (2014) When adjusting for bias due to linkage errors: a sensitivity analysis. In: European Conference on Quality in Official Statistics (Q2014), Vienna, 3–5 June 2014

    Google Scholar 

  4. Essnet DI- McLeod, Heasman, Forbes (2011) Simulated data for the on the job training. http://www.cros-portal.eu/content/job-training

  5. Fellegi IP, Sunter AB (1969) A theory for record linkage. J Am Stat Assoc 64:1183–1210

    Article  MATH  Google Scholar 

  6. Harville DA (1977) Maximum likelihood approaches to variance component estimation and to related problems. J Am Stat Assoc 72:320–338

    Article  MathSciNet  MATH  Google Scholar 

  7. Jaro M (1989) Advances in record linkage methodology as applied to matching the 1985 test census of Tampa, Florida. J Am Stat Assoc 84:414–420

    Article  Google Scholar 

  8. Lahiri P, Larsen MD (2005) Regression analysis with linked data. J Am Stat Assoc 100:222–230

    Article  MathSciNet  MATH  Google Scholar 

  9. Neter J, Maynes ES, Ramanathan R (1965) The effect of mismatching on the measurement of response errors. J Am Stat Assoc 60:1005–1027

    MathSciNet  Google Scholar 

  10. Rao JNK (2003) Small area estimation. Wiley, New York

    Book  MATH  Google Scholar 

  11. RELAIS 3.0 Users Guide (2015). http://www.istat.it/it/strumenti/metodi-e-strumenti-it/strumenti-di-elaborazione/relais

  12. Samart K (2011) Analysis of probabilistically linked data. Ph.D. thesis, School of Mathematics and Applied Statistics, University of Wollongong

    Google Scholar 

  13. Samart K, Chambers R (2010) Fitting linear mixed models using linked data, centre for statistical and survey methodology. University of Wollongong, Working Paper 18-10

    Google Scholar 

  14. Samart K, Chambers R (2014) Linear regression with nested errors using probability-linked data. Aust N Z J Stat 56

    Google Scholar 

  15. Scheuren F, Winkler WE (1993) Regression analysis of data files that are computer matched Part I. Surv Methodol 19:39–58

    Google Scholar 

  16. Scheuren F, Winkler WE (1997) Regression analysis of data files that are computer matched- part II. Surv Methodol 23:157–165

    Google Scholar 

  17. Searle SR, Casella G, McCulloch CE (2006) Variance components. Wiley, New York

    MATH  Google Scholar 

  18. Tancredi A, Liseo B (2011) A hierachical Bayesian approach to record linkage and population size problems. Ann Appl Stat 5:1553–1585

    Article  MathSciNet  MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Tiziana Tuoto .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer International Publishing Switzerland

About this paper

Cite this paper

Di Consiglio, L., Tuoto, T. (2017). Small Area Estimation in the Presence of Linkage Errors. In: Ferraro, M., et al. Soft Methods for Data Science. SMPS 2016. Advances in Intelligent Systems and Computing, vol 456. Springer, Cham. https://doi.org/10.1007/978-3-319-42972-4_21

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-42972-4_21

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-42971-7

  • Online ISBN: 978-3-319-42972-4

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics