Abstract
In this paper I will discuss Big Data as a suite of new methods for social and political research. I will start by tracing a genealogy of the idea that machine can perform better than human beings in managing extremely huge quantity of data, and that the quantity of information could change the quality of the interrogation posed to those data.
In the second part of the paper I will analyse Big Data as a social and rhetorical construction of the politics of research, claiming in favour of a more detailed account of the consequences for its progressive institutionalization. Without a serious methodological assessment of the changes that these new methods produce in the scientific epistemology of social and political sciences, we risk to underestimate the distortive or uncontrollable effects of the massive use of computer techniques. The challenge is how to avoid situations in which it is very difficult to reproduce the designed experiment, and it is arduous to explain the theories that can justify the output of researches. As an exemplification of the problem I will discuss the work on emotional contagion led by Facebook and published on PNAS in 2014.
Until now it was difficult to explore all the Big Data projects’ consequences on the perception of human intelligence and on the future of social research methods. The vision that there is no way to manage social data than to follow the results of a machine learning algorithm that works on inaccessible, epistemologically opaque and uncontrollable systems is rather problematic and deserve some extra consideration.
Verwisch Die Spuren
[…]
Was immer du sagst, sag es nicht zweimal
Findest du deinen Gedanken bei einem andern: verleugne ihn.
Wer seine Unterschrift nicht gegeben hat, wer kein Bild hinterließ
Wer nicht dabei war, wer nichts gesagt hat
Wie soll der zu fassen sein!
Verwisch die Spuren!
Sorge, wenn du zu sterben gedenkst
Daß kein Grabmal steht und verrät, wo du liegst
Mit einer deutlichen Schrift, die dich anzeigt
Und dem Jahr deines Todes, das dich überführt!
Noch einmal:
Verwisch die Spuren!
(Das wurde mir gelehrt.)
Erase Traces
[…]
Whatever you say, don’t say it twice
If you find your ideas in anyone else, disown them
He who has signed nothing, who has left no picture behind
Who was not there at the time, who has said nothing
How are they to catch him!
Erase the traces!
Make sure, when you turn your thoughts to dying
That no gravestone divulges where you lie
With a clear inscription indicting you
And the year of your death, which convicts you!
Once again,
Erase the traces!
(That’s what I was told.)
Berthold Brecht 1926
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Ahonen, P. 2015. Institutionalizing Big Data methods in social and political research. Big Data & Society 2(2, July). http://bds.sagepub.com/content/2/2/2053951715591224. Accessed 9 Oct 2016.
Anderson, C. 2008. The end of theory: The data deluge makes the scientific method obsolete. Wired, 23 June 2008. http://www.wired.com/science/discoveries/magazine/16-07/pb_theory. Accessed 9 Oct 2016.
Barabási, A.L. 2010. Bursts: The hidden patterns behind everything we do, from your e-mail to bloody crusades. New York: Dutton.
Boyd, D. 2014. It’s complicated. In New Haven. London: Yale University Press.
Bowker, G.C. 2013. Data flakes: An afterword to ‘Raw Data’ is an oxymoron. In: Gitelman. 2013, 167–171.
Bowker, G.C., and S. Leigh Star. 2000. Sorting things out: Classification and its consequences. Cambridge, MA: MIT Press.
Boyd, D. 2010. Privacy and publicity in the context of Big Data. Raleigh (nc), 29 April 2010 http://www.danah.org/papers/talks/2010/WWW2010.html. Accessed 9 Oct 2016.
Boyd, D., and K. Crawford. 2012. Critical questions for Big Data. Information, Communication & Society 15 (5): 662–679.
Burrell, J. 2016. How the machine ‘thinks’: Understanding opacity in machine learning algorithms. Big Data & Society 3(1, January). http://bds.sagepub.com/content/3/1/2053951715622512. Accessed 8 Oct 2016.
Chun, H.K.W. 2011. Programmed visions: Software and memory. Cambridge, MA: MIT Press.
Chun, W.H. 2016. Updating to remain the same. Cambridge, MA: The MIT Press.
Custers, B. 2016. Click here to consent forever: Expiry dates for informed consent. Big Data & Society 3(January–June). http://bds.sagepub.com/content/3/1/2053951715624935. Accessed 9 Oct 2016.
Derrida, J. 1995. Mal d’archive: une impression freudienne. Édition GalilÉe, Paris; En. Trans. Archive fever: A Freudian impression. Chicago: Chicago University Press, 1998.
Epstein, R., and R. E. Robertson. 2015. The search engine manipulation effect (SEME) and its possible impact on the outcomes of elections. In Proceedings of the National Academy of Sciences of the United States of America, PNAS 112(33): E4512–E4521. https://doi.org/10.1073/pnas.1419828112. Accessed 9 Oct 2016.
Fiormonte, D., T. Numerico, and F. Tomasi. 2015. Digital humanist. A critical enquiry. New York: Punctum Books.
Foucault, M. 1969/1982. The archaeology of knowledge. London: Vintage.
Gillespie, T. 2014. The relevance of algorithms. In Media technologies :Essays on communication, materiality, and society, ed. T. Gillespie, P. Boczkowski, and K. Foot. Cambridge, MA: The MIT Press.
Gitelman, L., ed. 2013. “Raw Data” is an oxymoron. Cambridge, MA: MIT Press.
Han, B.C. 2013. Im Schwarm : Ansichten des Digitalen. Berlin: Matthes & Seitz.
Humphreys, P. 2004. Extending Ourselves: Computational Science, Empiricism, and Scientific Method. New York: Oxford University Press.
———. 2009. The philosophical novelty of computer simulation methods. Synthese 169 (3): 615–626.
Kramer, A.I. et al. 2014. Experimental evidence of massive-scale emotional contagion through social networks. PNAS 111(24, 17 June): 8788–8790. www.pnas.org/cgi/doi/10.1073/pnas.1320040111. Accessed 9 Oct 2016.
Laney, D. 2001. 3D data management: Controlling data volume, velocity, and variety, meta group (now Gartner Group) report. http://blogs.gartner.com/doug-laney/files/2012/01/ad949-3D-Data-Management-Controlling-Data-Volume-Velocity-and-Variety.pdf. Accessed 9 Oct 2016.
Lanier, J. 2013. Who owns the future? New York: Simon & Schuster.
Lazer, D., Kennedy, R., King, G., and Vespignani, A. 2014. The parable of Google Flu: Traps in Big data analysis. Science 343(14 March): 1203–1205.
Lazer D., Kennedy R. 2015. What We Can Learn From the Epic Failure of Google Flu Trends. Wired, 10/1/2015. https://www.wired.com/2015/10/can-learn-epic-failure-google-flu-trends/. Accessed 2 nov 2018.
Licklider, J.C.R. 1965. Libraries of the future. Cambridge, MA: The MIT Press.
LIWC (2007) Linguistic Inquiry and Word Count (LIWC) program dictionary, 2007 version http://liwc.wpengine.com/. Accessed 2 Nov 2018
Markham, A.N. 2013. Undermining ‘data’: A critical examination of a core term in scientific inquiry. First Monday, 18, 10, October. http://firstmonday.org/ojs/index.php/fm/article/view/4868/3749. Accessed 9 Oct 2016.
Mayer-Schönberger, V., and K. Cukier. 2013. Big Data: A revolution that will transform how we live, work, and think. London: John Murray.
Metcalf, J., and K. Crawford. 2016. Where are human subjects in Big Data research? The emerging ethics divide. Big Data and Society 3(January–June). http://bds.sagepub.com/content/3/1/2053951716650211. Accessed 9 Oct 2016.
Meyer, J.W., and B. Rowan. 1977. Institutional organizations: Formal structure as myth and ceremony. American Journal of Sociology 83 (2): 340–363.
Nielsen, M. 2011. Reinventing discovery: The new era of networked science. Princeton: Princeton University Press.
O’Neil, C. 2016. Weapons of math destruction. In Allen Lane. London: Penguin.
Open Science Collaboration, B.A. Nosek, A.A. Aarts, C.J. Anderson, J.E. Anderson, H.B. Kappes, et al. 2015. Estimating the reproducibility of psychological science. Science 349 (6251): aac4716–aac4716 ISSN 0036-8075.
Pasquale, F. 2015. The black box society: The secret algorithms that control money and information. Cambridge, MA: Harvard University Press.
Raghavan, P. 2014. It’s time to scale the science in the social sciences. Big Data & Society 1(1, June): 1–4. http://bds.sagepub.com/content/1/1/2053951714532240. Accessed 9 Oct 2016.
Reality check on reproducibility. 2016. Nature 533(26 May): 437.https://www.nature.com/news/reality-check-on-reproducibility-1.19961. Accessed 2 Nov 2018
Rieder, G., and J. Simon. 2016. Datatrust: Or, the political quest for numerical evidence and the epistemologies of Big Data. Big Data and society 3. http://bds.sagepub.com/content/3/1/2053951716649398. Accessed 9 Oct 2016.
Rodotà, S. 2014. Il mondo nella rete. Quali i diritti, quali i vincoli. Repubblica/Laterza, Rome.
Schroeck, M. et al. 2012. Analytics: The real-world use of Big Data, IBM Report. http://www-03.ibm.com/systems/hu/resources/the_real_word_use_of_big_data.pdf. Accessed 9 Oct 2016.
Schroeder, R. 2014. Big Data and the brave new world of social and media research. Big Data and society 1(2, December): 1–11. http://bds.sagepub.com/content/1/2/2053951714563194. Accessed 9 Oct 2016.
Symons, J., and R. Alvarado 2016. Big Can we trust Big Data? Applying philosophy of science to software. Big Data & Society 1–17(July–December). http://bds.sagepub.com/content/spbds/3/2/2053951716664747.full.pdf. Accessed 7 Oct 2016.
Turing A. M. 1945. Proposal for the development in the mathematical division of an automatic computing engine (ACE), Report to the Executive Committee of National Physical Laboratory 1945, reprinted in Turing 1992, Collected Works of A.M. Turing: mechanical intelligence, Ince D. C. ed. Amsterdam: North-Holland, 1–86.
———. 1947. Lecture to the London Mathematical Society on 20 February 1947. In The essential Turing, ed. J. Copeland, 378–394. 2004. Oxford: Clarendon Press.
Zwitter, A. 2014. Big Data Ethics. Big Data & Society 1(2, November): 1–6. http://bds.sagepub.com/content/1/2/2053951714559253. Accessed 9 Oct 2016.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this chapter
Cite this chapter
Numerico, T. (2019). Politics and Epistemology of Big Data: A Critical Assessment. In: Berkich, D., d'Alfonso, M. (eds) On the Cognitive, Ethical, and Scientific Dimensions of Artificial Intelligence. Philosophical Studies Series, vol 134. Springer, Cham. https://doi.org/10.1007/978-3-030-01800-9_8
Download citation
DOI: https://doi.org/10.1007/978-3-030-01800-9_8
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-01799-6
Online ISBN: 978-3-030-01800-9
eBook Packages: Computer ScienceComputer Science (R0)