Demography in the Big Data Revolution: Changing the Culture to Forge New Frontiers
Despite the widespread and rapidly growing popularity of Big Data, researchers have yet to agree on what the concept entails, what tools are still needed to best interrogate these data, whether or not Big Data’s emergence represents a new academic field or simply a set of tools, and how much confidence we can place on results derived from Big Data. Despite these ambiguities, most would agree that Big Data and the methods for analyzing it represent a remarkable potential for advancing social science knowledge. In my Presidential address to the Southern Demographic Association, I argue that demographers have long collected and analyzed Big Data in a small way, by parsing out the points of information that we can manipulate with familiar models and restricting analyses to what typical computing systems can handle or restricted-access data disseminators will allow. In order to better interrogate the data we already have, we need to change the culture of demography to treat demographic microdata as Big. This includes shaping the definition of Big Data, changing how we conceptualize models, and re-evaluating how we silo confidential data.
KeywordsBig Data Population-generalizable data Data security Complex statistical modeling Big data demography
- Bell, B. A., Onwuegbuzie, A. J., Ferron, J. M., Jiao, Q. G., Hibbard, S. T., & Kromrey, J. D. (2012). Use of design effects and sample weights in complex health survey data: a review of published articles using data from 3 commonly used adolescent health surveys. American Journal of Public Health, 102(7), 1399–1405.CrossRefGoogle Scholar
- Bryant, A., & Raja, U. (2014). In the realm of Big Data. First Monday 19(2). http://firstmonday.org/article/view/4991/3822. Accessed 17 Jan 2018.
- Chantala, K., & Tabor, J. (1999). Strategies to perform a design-based analysis using the Add Health data. Resource document. Carolina Population Center, University of North Carolina at Chapel Hill. http://www.cpc.unc.edu/projects/addhealth/documentation/guides/weight1.pdf. Accessed 17 Jan 2018.
- Crowder, J. A., & Carbone, J. A. (2017). Abductive artificial intelligence learning models. In H. R. Arabnia, D. de la Fuente, E. B. Kozerenko, J. A. Olivas, & F. G. Tinetti (Eds.), Proceedings of the 2017 International Conference on Artificial Intelligence (pp. 90–96). Las Vegas: CSREA Press.Google Scholar
- Cutter, S. L., Emrich, C. T., Mitchell, J. T., Boruff, B. J., Gall, M., Schmidtlein, M. C., et al. (2006). The long road home: race, class, and recovery from Hurricane Katrina. Environment: Science and Policy for Sustainable Development, 4(2), 8–20.Google Scholar
- Davenport, T. H., & Patil, D. J. (2012). Data scientist—the sexiest job of the 21st century: meet the people who can coax treasure out of messy, unstructured data. Harvard Business Review, 95(5), 70–76.Google Scholar
- Fuchs, C., & Sandoval, M. (2013). The diamond model of open access publishing: why policy makers, scholars, universities, libraries, labour unions and the publishing world need to take non-commercial, non-profit open access serious. TripleC: Communication, Capitalism & Critique, 11(2), 428–443.Google Scholar
- Fussell, E., Curran, S. R., Dunbar, M. D., Babb, M. A., Thompson, L., & Meijer-Irons, J. (2017). Weather-related hazards and population change: a study of hurricanes and tropical storms in the United States, 1980-2012. The Annals of the American Academy of Political and Social Science., 669(1), 146–167.CrossRefGoogle Scholar
- Hayden, E. C. (2015). Genome researchers raise alarm over Big Data. Nature: International Weekly Journal of Science. http://www.nature.com/news/genome-researchers- raise-alarm-over-big-data-1.17912. Accessed 17 Jan 2018.
- Head, M. L., Holman, L., Lanfear, R., Kahn, A. T., & Jennions, M. D. (2015). The extent and consequences of p-hacking in science. PLOS Biology. http://journals.plos.org/plosbiology/article?id=10.1371/journal.pbio.1002106. Accessed 17 Jan 2018.
- HLG-PCCB (High-level group for partnership, coordination and capacity-building for statistics for the 20130 agenda for sustainable development). (2016). Global action plan for sustainable development data. Report. https://unstats.un.org/sdgs/files/global-consultation-hlg-1/GAP_HLG-20161021.pdf. Accessed 17 Jan 2018.
- Horrigan, M. W. (2013). Big data and official statistics. presentation for the international year of statistics. Bureau of Labor Statistics, Office of Prices and Living Conditions Washington, DCGoogle Scholar
- Iceland, J., Weinberg, D. H., & Steinmetz, E. (2002). Racial and ethnic residential segregation in the United States: 1980–2000. Washington, DC: US Census Bureau, Series CENSR-3.Google Scholar
- King, G. (2016). Preface: big data is not about the data. In R. Michael Alvarez (Ed.), Computational social science: discovery and prediction. Cambridge: Cambridge University Press.Google Scholar
- Kitchin, R. (2014b). The data revolution: big data, open data, data infrastructures & their consequences. Los Angeles: Sage.Google Scholar
- Letouzé, E. (2015). Demography, meet big data; big data, meet demography: reflections on the data-rich future of population science. In Paper presented at the United Nations EGM on strengthening the demographic evidence base for the post-2015 development agenda. New York, October 5.Google Scholar
- Manovich, L. (2011). Trending: the promises and the challenges of big social data. In M. K. Gold (Ed.), Debates in the Digital Humanities 2 (pp. 460–475). Minneapolis: University of Minnesota.Google Scholar
- Maples, J. N. (2012). Changes in US Ethnic Niches, 2005-2010. Doctoral Dissertation, University of Tennessee. http://trace.tennessee.edu/socioetds/. Accessed 17 Jan 2018.
- Martin, J. A., Hamilton, B. E., Osterman, M. J. K., Driscoll, A. K., & Matthews, T. J. (2017). Births: final data for 2015. National Vital Statistics Reports, 66, 1–70.Google Scholar
- Metzler, K., Kim, D. A., Allum, N., & Denman, A. (2016). Who is doing computational social science? A white paper. Sage Publishing. https://us.sagepub.com/sites/default/files/compsocsci.pdf. Accessed 17 Jan 2018.
- Minnesota Population Center. (2016). Terra populus: integrated data on population and environment: version 1. Minneapolis: University of Minnesota.Google Scholar
- Pokhriyal, N., Dong, W., & Govindaraju, V. (2015). Big data for improved diagnosis of poverty: a case study of Senegal. Washington, DC: A report for the brookings institution africa in focus series.Google Scholar
- Portes, A., & Rumbaut, R. G. (2006). Immigrant America: a portrait. Berkeley: University of California Press.Google Scholar
- Ramakrishnan, S. K. (2005). Democracy in Immigrant America: changing demographics and political participation. Palo Alto: Stanford University Press.Google Scholar
- Singer, A. (2004). The rise of new immigrant gateways. Washington, DC: Brookings Institution, Center on Urban and Metropolitan Policy.Google Scholar
- Udry, J. R. (2003). The national longitudinal study of adolescent health (Add Health), Wave 1, 1994. Chapel Hill: Carolina Population Center, University of North Carolina.Google Scholar
- Vilhuber, L. (2016). Census research nodes: a progress report. In Presentation at the 2016 FSRDC Research Conference. September 15. College Station, Texas.Google Scholar
- Vital Wave Consulting. (2012). Big data, big impact: new possibilities for international development. A report for the World Economic Forum. Geneva, Switzerland.Google Scholar
- Waga, D., & Rabah, K. (2014). Environmental conditions’, big data management, and cloud computing analytics for sustainable agriculture. World Journal of Computer Application and Technology, 2(3), 73–81.Google Scholar