This chapter discusses how ‘big data’ has become a catchphrase in the technology section of the news media. Through the synergic tools of Corpus-Assisted Discourse Studies (CADS), it identifies the news values and linguistic and discursive features in global big data coverage in English and elicits what kind of rhetoric is emerging. The big data narrative is rife with metaphors and novel lexical compounds. Keywords, concordance lines and collocations construct a mixed semantic prosody that takes a marked negative turn after the recent instances of data leaks and privacy violations. Finally, the analysis focuses on the strategies deployed in the construction and dissemination of expert discourse about big data by observing the processes of reconceptualisation and recontextualisation of knowledge that are activated in its argumentation.
KeywordsBig data Corpus-Assisted Discourse Studies (CADS) Expert discourse Knowledge dissemination News media
- Ali, Samina. 2018. “Newspaper Corpus Design and Representativeness.” WhatEvery1Says Project, 3 July. http://we1s.ucsb.edu.
- Baker, Paul. 2006. Using Corpora in Discourse Analysis. London and New York: Continuum.Google Scholar
- Baker, Paul, Costas Gabrielatos, Majid Khosravinik, Michał Krzyżanowski, Tony McEnery, and Ruth Wodak. 2008. “A Useful Methodological Synergy? Combining Critical Discourse Analysis and Corpus Linguistics to Examine Discourses of Refugees and Asylum Seekers in the UK Press.” Discourse & Society 19, no. 3: 273–306. https://doi.org/10.1177/0957926508088962.CrossRefGoogle Scholar
- Baker, Paul, and Tony McEnery, eds. 2015. Corpora and Discourse Studies: Integrating Discourse and Corpora. Basingstoke and New York: Palgrave Macmillan.Google Scholar
- ———. 2017. The Discourse of News Values: How News Organizations Create ‘Newsworthiness’. Oxford: Oxford University Press.Google Scholar
- Bondi, Marina, Silvia Cacchiani, and Davide Mazzi, eds. 2015. Discourse In and Through the Media: Recontextualizing and Reconceptualizing Expert Discourse. Newcastle upon Tyne: Cambridge Scholars Publishing.Google Scholar
- Economist. 2014. “Self-Made Wealth in America: Robber Barons and Silicon Sultans.” 30 December. https://www.economist.com/briefing/2014/12/30/robber-barons-and-silicon-sultans.
- Garzone, Giuliana, and Francesca Santulli. 2004. “What Can Corpus Linguistics Do for Critical Discourse Analysis?” In Corpora and Discourse, edited by Alan Partington, John Morley, and Louann Haarman, 351–368. Bern: Peter Lang.Google Scholar
- Goodwin, Jean, and Lee Honeycutt. 2009. “When Science Goes Public: From Technical Arguments to Appeals to Authority.” Studies in Communication Sciences 9, no. 2: 19–30.Google Scholar
- Graves, Christopher, and Sandra Matz. 2018. “What Marketers Should Know about Personality-Based Marketing.” Harvard Business Review, May 2. https://hbr.org/2018/05/what-marketers-should-know-about-personality-based-marketing.
- Greco Morasso, Sara, and Carlo Morasso. 2014. “Argumentation from Expert Opinion in Science Journalism: The Case of Eureka’s Fight Club.” In Rhétorique et cognition - Rhetoric and Cognition: Perspectives théoriques et stratégies persuasives - Theoretical Perspectives and Persuasive Strategies, edited by Thierry Herman and Steve Oswald, 185–213. Bern: Peter Lang.Google Scholar
- Kilgarriff, Adam. 2009. “Simple Maths for Keywords.” Proceedings of the Corpus Linguistics Conference CL2009, edited by Michaela Mahlberg, Victorina González-Díaz, and Catherine Smith, article number 171, 1–6. Liverpool: University of Liverpool. http://ucrel.lancs.ac.uk/publications/cl2009.
- Kitchin, Rob. 2014. The Data Revolution: Big Data, Open Data, Data Infrastructures & Their Consequences. London: Sage.Google Scholar
- Koester, Almut. 2010. “Building Small Specialised Corpora.” In The Routledge Handbook of Corpus Linguistics, edited by Anne O’Keeffe and Michael McCarthy, 66–79. Abingdon and New York: Routledge.Google Scholar
- Lohr, Steve. 2012. “How Big Data Became So Big.” New York Times, 11 August. http://www.nytimes.com/2012/08/12/business/how-big-data-became-so-big-unboxed.html.
- Newman, Nic. 2018. Journalism, Media, and Technology Trends and Predictions 2018. Oxford: Reuters Institute for the Study of Journalism, The University of Oxford.Google Scholar
- Newman, Nic, Richard Fletcher, Antonis Kalogeropoulos, David A. L. Levy, and Rasmus Kleis Nielsen. 2018. Reuters Institute Digital News Report 2018, 14 June. Oxford: Reuters Institute for the Study of Journalism, The University of Oxford. https://ssrn.com/abstract=3245355.
- O’ Halloran, Kieran. 2010. “How to Use Corpus Linguistics in the Study of Media Discourse.” In The Routledge Handbook of Corpus Linguistics, edited by Anne O’Keeffe and Michael McCarthy, 563–577. Abingdon and New York: Routledge.Google Scholar
- Oxford Internet Institute. 2017. “Digital Ethics Lab.” https://www.oii.ox.ac.uk.
- Partington, Alan. 2004a. “Corpora and Discourse, A Most Congruous Beast.” In Corpora and Discourse, edited by Alan Partington, John Morley, and Louann Haarman, 11–20. Bern: Peter Lang.Google Scholar
- Perelman, Chaïm, and Lucie Olbrechts-Tyteca. 1969. The New Rhetoric: A Treatise on Argumentation. Notre Dame, IN: University of Notre Dame Press.Google Scholar
- Puaschunder, Julia M. 2017. “The Nudging Divide in the Digital Big Data Era.” International Journal of Research in Business, Economics and Management 4: 11–12, 49–53. https://ssrn.com/abstract=3007085.
- Puschmann, Cornelius, and Jean Burgess. 2014. “Big Data, Big Questions| Metaphors of Big Data.” International Journal of Communication 8: 1690–1709. http://ijoc.org/index.php/ijoc/article/view/2169.
- Quantified Self Institute. n.d. “What Is Quantified Self?” https://qsinstitute.com.
- Schofield, Alexandra, Laure Thompson, and David Mimno. 2017. “Quantifying the Effects of Text Duplication on Semantic Models.” In Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, edited by Martha Palmer, Rebecca Hwa, and Sebastian Riedel, 2737–2747. Copenhagen: Association for Computational Linguistics. https://aclweb.org/anthology/D17-1290.
- ———. 1999. WordSmith Tools Help Manual. Version 3.0. Oxford: Mike Scott and Oxford University Press.Google Scholar
- ———. 2010. “Problems in Investigating Keyness, or Cleaning the Undergrowth and Marking Out Trails…” In Keyness in Texts, edited by Marina Bondi and Mike Scott, 43–57. Bern: Peter Lang.Google Scholar
- Sinclair, John. 1991. Corpus, Concordance, Collocation. Oxford: Oxford University Press.Google Scholar
- Sketch Engine. n.d. “Simple Maths.” https://www.sketchengine.eu/documentation/simple-maths.
- Stubbs, Michael. 1996. Text and Corpus Linguistics: Computer-Assisted Studies of Language and Culture. Oxford: Blackwell.Google Scholar
- ———. 2001. Words and Phrases: Corpus Studies of Lexical Semantics. Oxford: Blackwell. Google Scholar
- Thornbury, Scott. 2010. “What Can a Corpus Tell Us about Discourse?” In The Routledge Handbook of Corpus Linguistics, edited by Anne O’Keeffe and Michael McCarthy, 270–287. Abingdon and New York: Routledge.Google Scholar
- van Dijk, Teun A. 1988. News as Discourse. Hillsdale, NJ: Lawrence Erlbaum Associates.Google Scholar
- Watson, Sara M. 2014. “Data Is the New ‘____’: Sara M. Watson on the Industrial Metaphor of Big Data.” DIS Magazine. http://dismagazine.com/discussion/73298/sara-m-watson-metaphors-of-big-data.
- ———. 2016. “Toward a Constructive Technology Criticism.” Tow Center for Digital Journalism White Papers. New York: Columbia University. https://doi.org/10.7916/D86401Z7.