Introduction

Taming the Corpus

Abstract

Empirical linguistics has always gravitated towards quantification. With the advent of electronic corpora (large, searchable sets of natural language data), quantification has become part and parcel of linguistic studies. In the past few decades in particular, we have witnessed a “quantitative turn” in various schools of linguistics (cf. Janda, 2013 for cognitive linguistics) and in the digital humanities, which was further accelerated by the advent of text corpora. This volume aims to showcase a variety of recent quantitative approaches that “tame the corpus”; it shows how language corpora can be used to address research questions of interest to students and scholars in the humanities and social sciences. It simultaneously fills a lacuna in mainstream English-based quantitative linguistic studies by demonstrating that quantitative methods applied to inflectional languages may reveal novel phenomena.

Notes

  1. The volume was inspired by the Workshop on Quantitative Text Analysis for the Humanities and Social Sciences, which the editors organized at Brown University on April 8 and 9, 2016.

  2. A superficial Internet search often leads one to such an impression; cf. https://www.orau.gov/cdcynergy/soc2web/content/phase05/phase05_step03_deeper_qualitative_and_quantitative.htm and https://keydifferences.com/difference-between-qualitative-and-quantitative-research.html#ComparisonChart. Accessed 25 May 2018.

  3. Even a single appearance represents a quantity (=1), and the difference between a single occurrence and none may determine whether an important property is ascribed to the phenomenon under examination. But usually, even in qualitative studies, multiple examples demonstrating a hypothesis are better than one.

  4. This is unlike many quantitative studies, where the amount of reduction is sometimes explicitly acknowledged. Johnson states that in fact any (statistical) inference about the data is guessing; what quantitative methods can help us with is to quantify how reliable our guesses are (2008, p. 3).

References

  • Baker, P., & McEnery, T. (2005). A corpus-based approach to discourses of refugees and asylum seekers in UN and newspaper texts. Journal of Language and Politics, 4(2), 197–226.

  • Biber, D., & Conrad, S. (2009). Register, genre, and style. Cambridge, UK: Cambridge University Press.

  • Gries, S. T. (2013). 50-something years of work on collocations. International Journal of Corpus Linguistics, 18(1), 137–165.

  • Herdan, G. (1966). The advanced theory of language as choice and chance. Berlin, Germany: Springer.

  • Janda, L. A. (Ed.). (2013). Cognitive linguistics: The quantitative turn. Berlin, Germany: De Gruyter Mouton.

  • Jockers, M. L. (2013). Macroanalysis: Digital methods and literary history. Urbana, IL: University of Illinois Press.

  • Johnson, K. (2008). Quantitative methods in linguistics. Malden, MA: Blackwell Publishing.

  • Popper, K. (1959) [2005]. The logic of scientific discovery. London, UK: Routledge.

  • Rasinger, S. M. (2008). Quantitative research in linguistics: An introduction. London, UK: Continuum.

  • Tognini-Bonelli, E. (2001). Corpus linguistics at work. Amsterdam, Netherlands: John Benjamins.

Acknowledgments

The publication of this volume was made possible by support from the grant Progres Q08 Czech National Corpus, implemented at the Faculty of Arts, Charles University, and from the Humanities Research Grant from Brown University. Special thanks go to Mathew Amboy and Faith Su from Springer, who saw the volume through the entire publication process, and to Marek Nekula for thoughtful and helpful comments on the manuscripts. The editors would also like to thank Andrew Malcovsky for copyediting work. Last but not least, many thanks to Lída Cvrčková Porkertová and Vlastimil Fidler for their support and patience.

Author information

Corresponding author

Correspondence to Václav Cvrček.

Copyright information

© 2018 Springer Nature Switzerland AG

About this chapter

Cite this chapter

Cvrček, V., Fidler, M. (2018). Introduction. In: Fidler, M., Cvrček, V. (eds) Taming the Corpus. Quantitative Methods in the Humanities and Social Sciences. Springer, Cham. https://doi.org/10.1007/978-3-319-98017-1_1
