Skip to main content

Evaluating Logistic Mixed-Effects Models of Corpus-Linguistic Data in Light of Lexical Diffusion

  • Chapter
  • First Online:
Mixed-Effects Regression Models in Linguistics

Abstract

We explore methods for evaluating logistic mixed-effects models of both corpus and experimental data types through simulations. We suggest that the fit of the model should be evaluated by examining the variance explained by the fixed effects alone, rather than both fixed and random effects put together. Nonetheless, for corpus data, in which frequent items contribute more observations, coefficient estimates for fixed effects should be derived from a model that includes the random effects. Including random effects in the model with such datasets allows for better estimates of the fixed-effects predictor coefficients. Not having random effects in the model can cause fixed-effects coefficients to be overly influenced by frequent items, which are often exceptional in linguistic data due to lexical diffusion of ongoing changes.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 109.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 139.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 139.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Labov W (1969) Contraction, deletion, and inherent variability of the English copula. Language 45:715–762

    Article  Google Scholar 

  2. Sankoff D (1988) Sociolinguistics and syntactic variation. In: Newmeyer F (ed) Linguistics: the Cambridge survey, volume IV. Cambridge University Press, Cambridge, pp 140–161

    Chapter  Google Scholar 

  3. Shih S, Grafmiller J, Futrell R, Bresnan J (2015) Rhythm’s role in genitive construction choice in spoken English. In: Vogel R, van de Vijver R (eds) Rhythm in phonetics, grammar and cognition. Mouton de Gruyter, Berlin

    Google Scholar 

  4. Cedergren H, Sankoff D (1974) Variable rules: performance as a statistical reflection of competence. Language 50:333–355

    Article  Google Scholar 

  5. Tagliamonte S, Baayen RH (2012) Models, forests and trees of York English: was/were variation as a case study for statistical practice. Lang Var Chang 24:135–178

    Article  Google Scholar 

  6. Tagliamonte S (2006) Analyzing sociolinguistic variation. Cambridge University Press, Cambridge

    Book  Google Scholar 

  7. Bresnan J, Cueni A, Nikitina T, Baayen RH (2007) Predicting the dative alternation. In: Bouma G, Kraemer I, Zwarts J (eds) Cognitive foundations of interpretation. Royal Netherlands Academy of Arts and Sciences, Amsterdam, pp 69–94

    Google Scholar 

  8. Drager K, Hay J (2012) Exploiting random intercepts: two case studies in sociophonetics. Lang Var Chang 24:59–78

    Article  Google Scholar 

  9. Johnson DE (2014) Progress in regression: why sociolinguistic data calls for mixed-effects models. Ms. Lancaster University. http://danielezrajohnson.com/johnson_2014b.pdf

  10. Johnson DE (2009) Getting off the GoldVarb standard: introducing Rbrul for mixed-effects variable rule analysis. Lang Ling Compass 3:359–383

    Article  Google Scholar 

  11. Sapir E (1921) Language. Harcourt Brace, New York

    Google Scholar 

  12. Bybee J (2001) Phonology and language use. Cambridge University Press, Cambridge

    Book  Google Scholar 

  13. Bybee J, Scheibman J (1999) The effect of usage on degrees of constituency: the reduction of don’t in American English. Linguistics 37:575–596

    Article  Google Scholar 

  14. Clark HH (1973) The language-as-fixed-effect fallacy: a critique of language statistics in psychological research. J Verbal Learn Verbal Behav 12:335–359

    Article  Google Scholar 

  15. Coleman EB (1964) Generalizing to a language population. Psychol Rep 14:219–226

    Article  Google Scholar 

  16. Bybee J (2003) Mechanisms of change in grammaticization: the role of frequency. In: Joseph B, Janda R (eds) Handbook of historical linguistics. Blackwell, Oxford, pp 602–623

    Chapter  Google Scholar 

  17. Bybee J (2002) Word frequency and context of use in the lexical diffusion of phonetically conditioned sound change. Lang Var Chang 14:261–290

    Google Scholar 

  18. Gafos A, Kirov C (2009) A dynamical model of change in phonological representations: the case of lenition. In: Pellegrino F, Marsico E, Chitoran I, Coupé C (eds) Approaches to phonological complexity. Mouton de Gruyter, Berlin, pp 219–240

    Google Scholar 

  19. Kapatsinski V (2010) Rethinking rule reliability: why an exceptionless rule can fail. In: Proceedings from the annual meeting of the Chicago Linguistic Society, vol 44, no. 2, pp 277–291

    Google Scholar 

  20. Phillips BS (1984) Word frequency and the actuation of sound change. Language 60:320–342

    Article  Google Scholar 

  21. Pierrehumbert J (2001) Exemplar dynamics: word frequency, lenition, and contrast. In: Bybee J, Hopper P (eds) Frequency and the emergence of linguistic structure. John Benjamins, Amsterdam

    Google Scholar 

  22. Schuchardt H (1885) Ãœber die lautgesetze: gegen die junggrammatiker. Oppenheim, Berlin

    Google Scholar 

  23. Pitt MA, Johnson K, Hume E, Kiesling S, Raymond W (2005) The Buckeye corpus of conversational speech: labeling conventions and a test of transcriber reliability. Speech Comm 45:89–95

    Article  Google Scholar 

  24. Barth D (2015) To have and to be: function word reduction in child speech, child-directed speech and inter-adult speech. Ph.D. Dissertation, University of Oregon

    Google Scholar 

  25. Barlow GM (2013) Individual differences and usage-based grammar. Int J Corpus Linguist 18:443–478

    Article  Google Scholar 

  26. Baayen RH (2001) Word frequency distributions. Kluwer, Dordrecht

    Book  MATH  Google Scholar 

  27. Zipf GK (1949) Human behaviour and the principle of least effort. Addison-Wesley, Reading

    Google Scholar 

  28. Oldfield RC, Wingfield A (1965) Response latencies in naming objects. Q J Exp Psychol 17(4):272–281

    Google Scholar 

  29. Barabási A-L, Albert R (1999) Emergence of scaling in random networks. Science 286: 509–512

    Article  MathSciNet  MATH  Google Scholar 

  30. Merton RK (1968) The Matthew effect in science. Science 159:56–63

    Article  Google Scholar 

  31. Simon HA (1955) On a class of skew distribution functions. Biometrika 42:425–440

    Article  MathSciNet  MATH  Google Scholar 

  32. Yule GU (1925) A mathematical theory of evolution based on the conclusions of Dr. J. C. Willis. Philos Trans R Soc B 213:21–87

    Article  Google Scholar 

  33. Newman MEJ (2005) Power laws, Pareto distributions and Zipf’s law. Contemp Phys 46(5):323–351

    Article  Google Scholar 

  34. Hay JB, Pierrehumbert JB, Walker AJ, LaShell P (2015) Tracking word frequency effects through 130 years of sound change. Cognition 139:83–91

    Article  Google Scholar 

  35. Pierrehumbert J (2002) Word-specific phonetics. In: Gussenhoven C, Warner N (eds) Laboratory phonology 7. Mouton de Gruyter, Berlin, pp 101–140

    Google Scholar 

  36. Briscoe T, Copestake A (1999) Lexical rules in constraint-based grammars. Comput Linguist 25:487–526

    MathSciNet  Google Scholar 

  37. Goldberg AE (1995) Constructions. Chicago University Press, Chicago

    Google Scholar 

  38. Pinker S (1984) Language learnability and language development. Harvard University Press, Cambridge

    Google Scholar 

  39. Erker D, Guy GR (2012) The role of lexical frequency in syntactic variability: variable subject personal pronoun expression in Spanish. Language 88:526–557

    Article  Google Scholar 

  40. Sankoff D, Labov W (1979) On the uses of variable rules. Lang Soc 8:189–222

    Article  Google Scholar 

  41. Gerard J, Keller F, Palpanas T (2010) Corpus evidence for age effects on priming in child language. In: Proceedings of the 32nd annual meeting of the Cognitive Science Society, pp 218–223

    Google Scholar 

  42. Sonderegger M (2010) Testing for frequency and structural effects in an English stress shift. In: Proceedings of the Berkeley Linguistics Society, vol 36. pp. 411–425. https://dx.doi.org/10.3765/bls.v36i1.3927

  43. Stefanowitsch A (2008) Negative entrenchment: a usage-based approach to negative evidence. Cogn Linguist 19:513–531

    Article  Google Scholar 

  44. Ambridge B, Pine JM, Rowland CF, Chang F (2012) The roles of verb semantics, entrenchment, and morphophonology in the retreat from dative argument-structure overgeneralization errors. Language 88:45–81

    Article  Google Scholar 

  45. Ambridge B, Pine JM, Rowland CF (2012) Semantics versus statistics in the retreat from locative overgeneralization errors. Cognition 123:260–279

    Article  Google Scholar 

  46. Kapatsinski V (2010) What is it I am writing? Lexical frequency effects in spelling Russian prefixes: uncertainty and competition in an apparently regular system. Corpus Linguist Linguist Theory 6:157–215

    Article  Google Scholar 

  47. Kapatsinski V (submitted) Sound change and hierarchical inference. What is being inferred? Effects of words, phones and frequency. Available at http://blogs.uoregon.edu/ublab/files/2014/11/SoundChange-21fo1iz.pdf

  48. Raymond WD, Brown EL (2012) Are effects of word frequency effects of context of use? An analysis of initial fricative reduction in Spanish. In: Gries ST, Divjak D (eds) Frequency effects in language learning and processing. Mouton de Gruyter, Berlin, pp 35–52

    Google Scholar 

  49. Barth D, Kapatsinski V (2017) A multimodel inference approach to categorical variant choice: Construction, priming and frequency effects on the choice between full and contracted forms of am, are and is. Corpus Linguist Linguist Theory 13:203–260. https://doi.org/10.1515/cllt-2014-0022

  50. Burnham KP, Anderson DR (2002) Model selection and multimodel inference: a practical information-theoretic approach. Springer, New York

    MATH  Google Scholar 

  51. Nakagawa S, Schielzeth H (2013) A general and simple method for obtaining R2 from generalized linear mixed-effects models. Methods Ecol Evol 4:133–142

    Article  Google Scholar 

  52. Baayen RH (2008) Analyzing linguistic data: a practical introduction to statistics using R. Cambridge University Press, Cambridge

    Book  Google Scholar 

  53. Keune K, Ernestus M, Van Hout R, Baayen RH (2005) Social, geographical, and register variation in Dutch: from written MOGELIJK to spoken MOK. Corpus Linguist Linguist Theory 1:183–223

    Article  Google Scholar 

  54. Kothari A (2007) Accented pronouns and unusual antecedents: a corpus study. In: Proceedings of the 8th SIGDial workshop on discourse and dialogue. Association for Computational Linguistics, Antwerp, pp 150–157

    Google Scholar 

  55. Lohmann A (2011) Help vs. help to: a multifactorial, mixed-effects account of infinitive marker omission. Engl Lang Linguist 15:499–521

    Article  Google Scholar 

  56. Theijssen D (2009) Variable selection in logistic regression: the British English dative alternation. In: Icard T, Muskens R (eds) Interfaces: explorations in logic, language and computation. Springer, Berlin, pp 87–101

    Google Scholar 

  57. Antić E (2012) Relative frequency effects in Russian morphology. In: Gries ST, Divjak D (eds) Frequency effects in language learning and processing. Mouton de Gruyter, Berlin, pp 83–108

    Google Scholar 

  58. Antić E (2010) The representation of morphemes in the Russian lexicon. Dissertation, UC Berkeley

    Google Scholar 

  59. Yao Y (2011) The effects of phonological neighborhoods on pronunciation variation in conversational speech. Ph.D. Dissertation, UC Berkeley

    Google Scholar 

  60. Pitt MA, Myung IJ (2002) When a good fit can be bad. Trends Cogn Sci 6:421–425

    Article  Google Scholar 

  61. Bresnan J, Ford M (2010) Predicting syntax: processing dative constructions in American and Australian varieties of English. Language 86:168–213

    Article  Google Scholar 

  62. Riordan B (2007) There’s two ways to say it: modeling nonprestige there’s. Corpus Linguist Linguist Theory 3:233–279

    Article  Google Scholar 

  63. Theijssen D, ten Bosch L, Boves L, Cranen B, van Halteren H (2013) Choosing alternatives: using Bayesian networks and memory-based learning to study the dative alternation. Corpus Linguist Linguist Theory 9:227–262

    Article  Google Scholar 

  64. Albright A, Hayes B (2003) Rules vs. analogy in English past tenses: a computational/experimental study. Cognition 90:119–161

    Article  Google Scholar 

  65. Bates D, Maechler M, Dai B (2012) lme4: linear mixed-effects models using S4 classes. R package version 0.999999-0

    Google Scholar 

  66. Kohavi R (1995) A study of cross-validation and bootstrap for accuracy estimation and model selection. In: International joint conference on artificial intelligence (IJCAI), vol 2. Morgan Kaufmann, San Francisco, pp 1136–1143

    Google Scholar 

  67. Efron B, Tibshirani R (1993) An introduction to the bootstrap. Chapman & Hall, London

    Book  MATH  Google Scholar 

Download references

Acknowledgments

We would like to thank the two helpful anonymous reviewers of this article, as well as Daniel Johnson for bringing the Nakagawa and Schielzeth’s [51] paper to our attention. We further thank the audience of the LSD 2012 for helpful comments and questions.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Vsevolod Kapatsinski .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer International Publishing AG, part of Springer Nature

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Barth, D., Kapatsinski, V. (2018). Evaluating Logistic Mixed-Effects Models of Corpus-Linguistic Data in Light of Lexical Diffusion. In: Speelman, D., Heylen, K., Geeraerts, D. (eds) Mixed-Effects Regression Models in Linguistics. Quantitative Methods in the Humanities and Social Sciences. Springer, Cham. https://doi.org/10.1007/978-3-319-69830-4_6

Download citation

Publish with us

Policies and ethics