Abstract
For a long time, when data were needed by a nation, a census was often carried out in an attempt to make measurements on every household or every person in that nation. The move from conducting a census to conducting a sample survey where only a subset of households or persons was measured was met with opposition. Sampling requires that weights be calculated and applied to the observations in the sample, while observations of all units in a census do not require such calculation. Wanting to preserve censuses, the opposition to sampling was best expressed in the translated firm phrase, “No calculation when observation can be made”. With advances in probability sampling theory and applications, this resistance movement eventually failed. Societies have become more complex and people more mobile; data collection costs have risen; and the public’s response rates to government sample surveys continue to fall, though not as sharply as with non-government sample surveys. Other sources of data (e.g., administrative records, big data, commercial data) to measure human behavior and condition are being investigated for increased use in production of official statistics. Though there are technical concerns (e.g., representativeness, data quality, privacy), other features of big data (e.g., relatively inexpensive; largely digital measurement; minimize respondent burden; lot of data; rich, complex, diverse, flexible) and the growing demand for data gathering and managing tools suggest a very useful source of data. However, with big data and the implicit promise of observation of everything, there is a risk that users may move to question the need for sampling or statistical calculation. Indeed, the new phrase might be “No statistical calculation when big data observation can be made”, a phrase lacking scientific merit. In this paper, we take a brief look at the move from censuses to samples. Specifically, we (i) highlight Kiaer (1895, 1897) nonrandom representative method which laid seeds for the current use of survey sampling methodology to make measurements for official statistics; (ii) highlight Neyman’s 1934 contribution with stratified random sampling and optimal sample allocation to achieve representativeness, (iii) note some recent developments (Wright 2012, 2014, 2016, 2017) to improve Neyman’s optimal allocation, and (iv) offer some personal observations looking forward.
Tommy Wright is Chief of the Center for Statistical Research and Methodology, U. S. Census Bureau, Washington, D.C. 20233 (E-mail: tommy.wright@census.gov) and adjunct faculty in statistics at Georgetown University. The views expressed are the author’s and not necessarily those of the U. S. Census Bureau.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Bellhouse, D. R. (1988). A brief history of random sampling methods. In P. R. Krishnaiah, & C. R. Rao (Eds.) Handbook of statistics (Vol. 6, pp. 1–14).
Bethlehem, J. (2009). The rise of survey sampling. In Discussion paper (09015). The Hague/Heerlen: Statistics Netherlands.
Capps, C., & Wright, T. (2013). Toward a vision: Official statistics and big data. In AMSTAT news, No. 434. American Statistical Association, Alexandria, Virginia, 9.
Cochran, W. G. (1977). Sampling techniques (3rd ed.). New York, NY: Wiley.
Cochran, W. G. (1978). Laplace’s ratio estimator. In H. A. David (Ed.) Contributions to survey sampling and applied statistics (pp. 3–10). New York: Academic Press.
Kiaer, A. N. (1895). Observations et expériences concernant des dénombrements représentative. Bulletin of the International Statistical Institute, IX, Book, 2, 176–183.
Kiaer, A. N. (1897). The representative method of statistical surveys (1976 English Translation of the Original Norwegian). In Central Bureau of Statistics of Norway, Oslo. [Kiaer, A. N. (1897). Sur les Mèthods Représentatives ou Typologiques Appliquées à la Statistique, Bulletin of the International Statistical Institute, XI, 180–189].
Kish, L. (1979). Samples and censuses. International Statistical Review, 47, 99–109.
Kruskal, W., & Mosteller, F. (1980). Representative sampling IV: The history of the concept in statistics. 1895–1939. International Statistical Review, 48(2), 169–195.
Neyman, J. (1934). On the two different aspects of the representative method: The method of stratified sampling and the method of purposive selection. Journal of the Royal Statistical Society, 97, 558–606.
Stephan, F. F. (1948). History of the uses of modern sampling procedures. Journal of the American Statistical Association , 43(241), 12–39.
Tschuprow, A. A. (1923). On the mathematical expectation of the moments of frequency distributions in the case of correlated observations. Metron, 2(461–493), 646–683.
Wright, T. (2001). Selected moments in the development of probability sampling: Theory and practice. In Survey research methods section newsletter: American statistical association (pp. 1–6).
Wright, T. (2012). The equivalence of Neyman optimum allocation for sampling and equal proportions for apportioning the U. S. House of representatives. The American Statistician, 66(4), 217–224.
Wright, T. (2014). A simple method of exact optimal sample allocation under stratification with any mixed constraint patterns. In Research report series (Statistics #2014–07). Washington, D.C.: Center for Statistical Research & Methodology, U. S. Bureau of the Census.
Wright, T. (2016). Two optimal exact sample allocation algorithms: Sampling variance decomposition is key. In Research report series (Statistics #2016–03). Washington, D.C.: Center for Statistical Research & Methodology, U. S. Bureau of the Census.
Wright, T. (2017). Exact optimal sample allocation: More efficient than Neyman. Statistics and Probability Letters, 129, 50–57.
Acknowledgements
The author is grateful to a referee and his colleagues John Eltinge, Mary Mulry, Lauren Emanuel, and Michael Leibert for providing helpful comments on earlier drafts of this paper.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Wright, T. (2018). No Calculation When Observation Can Be Made. In: Chattopadhyay, A., Chattopadhyay, G. (eds) Statistics and its Applications. PJICAS 2016. Springer Proceedings in Mathematics & Statistics, vol 244. Springer, Singapore. https://doi.org/10.1007/978-981-13-1223-6_13
Download citation
DOI: https://doi.org/10.1007/978-981-13-1223-6_13
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-1222-9
Online ISBN: 978-981-13-1223-6
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)