Evaluating Logistic Mixed-Effects Models of Corpus-Linguistic Data in Light of Lexical Diffusion

Barth, Danielle; Kapatsinski, Vsevolod

doi:10.1007/978-3-319-69830-4_6

Danielle Barth⁹ &
Vsevolod Kapatsinski⁹

Part of the book series: Quantitative Methods in the Humanities and Social Sciences ((QMHSS))

1934 Accesses
9 Citations

Abstract

We explore methods for evaluating logistic mixed-effects models of both corpus and experimental data types through simulations. We suggest that the fit of the model should be evaluated by examining the variance explained by the fixed effects alone, rather than both fixed and random effects put together. Nonetheless, for corpus data, in which frequent items contribute more observations, coefficient estimates for fixed effects should be derived from a model that includes the random effects. Including random effects in the model with such datasets allows for better estimates of the fixed-effects predictor coefficients. Not having random effects in the model can cause fixed-effects coefficients to be overly influenced by frequent items, which are often exceptional in linguistic data due to lexical diffusion of ongoing changes.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 109.00; Price excludes VAT (USA)

Softcover Book: USD 139.99; Price excludes VAT (USA)

Hardcover Book: USD 139.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Labov W (1969) Contraction, deletion, and inherent variability of the English copula. Language 45:715–762
Article Google Scholar
Sankoff D (1988) Sociolinguistics and syntactic variation. In: Newmeyer F (ed) Linguistics: the Cambridge survey, volume IV. Cambridge University Press, Cambridge, pp 140–161
Chapter Google Scholar
Shih S, Grafmiller J, Futrell R, Bresnan J (2015) Rhythm’s role in genitive construction choice in spoken English. In: Vogel R, van de Vijver R (eds) Rhythm in phonetics, grammar and cognition. Mouton de Gruyter, Berlin
Google Scholar
Cedergren H, Sankoff D (1974) Variable rules: performance as a statistical reflection of competence. Language 50:333–355
Article Google Scholar
Tagliamonte S, Baayen RH (2012) Models, forests and trees of York English: was/were variation as a case study for statistical practice. Lang Var Chang 24:135–178
Article Google Scholar
Tagliamonte S (2006) Analyzing sociolinguistic variation. Cambridge University Press, Cambridge
Book Google Scholar
Bresnan J, Cueni A, Nikitina T, Baayen RH (2007) Predicting the dative alternation. In: Bouma G, Kraemer I, Zwarts J (eds) Cognitive foundations of interpretation. Royal Netherlands Academy of Arts and Sciences, Amsterdam, pp 69–94
Google Scholar
Drager K, Hay J (2012) Exploiting random intercepts: two case studies in sociophonetics. Lang Var Chang 24:59–78
Article Google Scholar
Johnson DE (2014) Progress in regression: why sociolinguistic data calls for mixed-effects models. Ms. Lancaster University. http://danielezrajohnson.com/johnson_2014b.pdf
Johnson DE (2009) Getting off the GoldVarb standard: introducing Rbrul for mixed-effects variable rule analysis. Lang Ling Compass 3:359–383
Article Google Scholar
Sapir E (1921) Language. Harcourt Brace, New York
Google Scholar
Bybee J (2001) Phonology and language use. Cambridge University Press, Cambridge
Book Google Scholar
Bybee J, Scheibman J (1999) The effect of usage on degrees of constituency: the reduction of don’t in American English. Linguistics 37:575–596
Article Google Scholar
Clark HH (1973) The language-as-fixed-effect fallacy: a critique of language statistics in psychological research. J Verbal Learn Verbal Behav 12:335–359
Article Google Scholar
Coleman EB (1964) Generalizing to a language population. Psychol Rep 14:219–226
Article Google Scholar
Bybee J (2003) Mechanisms of change in grammaticization: the role of frequency. In: Joseph B, Janda R (eds) Handbook of historical linguistics. Blackwell, Oxford, pp 602–623
Chapter Google Scholar
Bybee J (2002) Word frequency and context of use in the lexical diffusion of phonetically conditioned sound change. Lang Var Chang 14:261–290
Google Scholar
Gafos A, Kirov C (2009) A dynamical model of change in phonological representations: the case of lenition. In: Pellegrino F, Marsico E, Chitoran I, Coupé C (eds) Approaches to phonological complexity. Mouton de Gruyter, Berlin, pp 219–240
Google Scholar
Kapatsinski V (2010) Rethinking rule reliability: why an exceptionless rule can fail. In: Proceedings from the annual meeting of the Chicago Linguistic Society, vol 44, no. 2, pp 277–291
Google Scholar
Phillips BS (1984) Word frequency and the actuation of sound change. Language 60:320–342
Article Google Scholar
Pierrehumbert J (2001) Exemplar dynamics: word frequency, lenition, and contrast. In: Bybee J, Hopper P (eds) Frequency and the emergence of linguistic structure. John Benjamins, Amsterdam
Google Scholar
Schuchardt H (1885) Über die lautgesetze: gegen die junggrammatiker. Oppenheim, Berlin
Google Scholar
Pitt MA, Johnson K, Hume E, Kiesling S, Raymond W (2005) The Buckeye corpus of conversational speech: labeling conventions and a test of transcriber reliability. Speech Comm 45:89–95
Article Google Scholar
Barth D (2015) To have and to be: function word reduction in child speech, child-directed speech and inter-adult speech. Ph.D. Dissertation, University of Oregon
Google Scholar
Barlow GM (2013) Individual differences and usage-based grammar. Int J Corpus Linguist 18:443–478
Article Google Scholar
Baayen RH (2001) Word frequency distributions. Kluwer, Dordrecht
Book MATH Google Scholar
Zipf GK (1949) Human behaviour and the principle of least effort. Addison-Wesley, Reading
Google Scholar
Oldfield RC, Wingfield A (1965) Response latencies in naming objects. Q J Exp Psychol 17(4):272–281
Google Scholar
Barabási A-L, Albert R (1999) Emergence of scaling in random networks. Science 286: 509–512
Article MathSciNet MATH Google Scholar
Merton RK (1968) The Matthew effect in science. Science 159:56–63
Article Google Scholar
Simon HA (1955) On a class of skew distribution functions. Biometrika 42:425–440
Article MathSciNet MATH Google Scholar
Yule GU (1925) A mathematical theory of evolution based on the conclusions of Dr. J. C. Willis. Philos Trans R Soc B 213:21–87
Article Google Scholar
Newman MEJ (2005) Power laws, Pareto distributions and Zipf’s law. Contemp Phys 46(5):323–351
Article Google Scholar
Hay JB, Pierrehumbert JB, Walker AJ, LaShell P (2015) Tracking word frequency effects through 130 years of sound change. Cognition 139:83–91
Article Google Scholar
Pierrehumbert J (2002) Word-specific phonetics. In: Gussenhoven C, Warner N (eds) Laboratory phonology 7. Mouton de Gruyter, Berlin, pp 101–140
Google Scholar
Briscoe T, Copestake A (1999) Lexical rules in constraint-based grammars. Comput Linguist 25:487–526
MathSciNet Google Scholar
Goldberg AE (1995) Constructions. Chicago University Press, Chicago
Google Scholar
Pinker S (1984) Language learnability and language development. Harvard University Press, Cambridge
Google Scholar
Erker D, Guy GR (2012) The role of lexical frequency in syntactic variability: variable subject personal pronoun expression in Spanish. Language 88:526–557
Article Google Scholar
Sankoff D, Labov W (1979) On the uses of variable rules. Lang Soc 8:189–222
Article Google Scholar
Gerard J, Keller F, Palpanas T (2010) Corpus evidence for age effects on priming in child language. In: Proceedings of the 32nd annual meeting of the Cognitive Science Society, pp 218–223
Google Scholar
Sonderegger M (2010) Testing for frequency and structural effects in an English stress shift. In: Proceedings of the Berkeley Linguistics Society, vol 36. pp. 411–425. https://dx.doi.org/10.3765/bls.v36i1.3927
Stefanowitsch A (2008) Negative entrenchment: a usage-based approach to negative evidence. Cogn Linguist 19:513–531
Article Google Scholar
Ambridge B, Pine JM, Rowland CF, Chang F (2012) The roles of verb semantics, entrenchment, and morphophonology in the retreat from dative argument-structure overgeneralization errors. Language 88:45–81
Article Google Scholar
Ambridge B, Pine JM, Rowland CF (2012) Semantics versus statistics in the retreat from locative overgeneralization errors. Cognition 123:260–279
Article Google Scholar
Kapatsinski V (2010) What is it I am writing? Lexical frequency effects in spelling Russian prefixes: uncertainty and competition in an apparently regular system. Corpus Linguist Linguist Theory 6:157–215
Article Google Scholar
Kapatsinski V (submitted) Sound change and hierarchical inference. What is being inferred? Effects of words, phones and frequency. Available at http://blogs.uoregon.edu/ublab/files/2014/11/SoundChange-21fo1iz.pdf
Raymond WD, Brown EL (2012) Are effects of word frequency effects of context of use? An analysis of initial fricative reduction in Spanish. In: Gries ST, Divjak D (eds) Frequency effects in language learning and processing. Mouton de Gruyter, Berlin, pp 35–52
Google Scholar
Barth D, Kapatsinski V (2017) A multimodel inference approach to categorical variant choice: Construction, priming and frequency effects on the choice between full and contracted forms of am, are and is. Corpus Linguist Linguist Theory 13:203–260. https://doi.org/10.1515/cllt-2014-0022
Burnham KP, Anderson DR (2002) Model selection and multimodel inference: a practical information-theoretic approach. Springer, New York
MATH Google Scholar
Nakagawa S, Schielzeth H (2013) A general and simple method for obtaining R2 from generalized linear mixed-effects models. Methods Ecol Evol 4:133–142
Article Google Scholar
Baayen RH (2008) Analyzing linguistic data: a practical introduction to statistics using R. Cambridge University Press, Cambridge
Book Google Scholar
Keune K, Ernestus M, Van Hout R, Baayen RH (2005) Social, geographical, and register variation in Dutch: from written MOGELIJK to spoken MOK. Corpus Linguist Linguist Theory 1:183–223
Article Google Scholar
Kothari A (2007) Accented pronouns and unusual antecedents: a corpus study. In: Proceedings of the 8th SIGDial workshop on discourse and dialogue. Association for Computational Linguistics, Antwerp, pp 150–157
Google Scholar
Lohmann A (2011) Help vs. help to: a multifactorial, mixed-effects account of infinitive marker omission. Engl Lang Linguist 15:499–521
Article Google Scholar
Theijssen D (2009) Variable selection in logistic regression: the British English dative alternation. In: Icard T, Muskens R (eds) Interfaces: explorations in logic, language and computation. Springer, Berlin, pp 87–101
Google Scholar
Antić E (2012) Relative frequency effects in Russian morphology. In: Gries ST, Divjak D (eds) Frequency effects in language learning and processing. Mouton de Gruyter, Berlin, pp 83–108
Google Scholar
Antić E (2010) The representation of morphemes in the Russian lexicon. Dissertation, UC Berkeley
Google Scholar
Yao Y (2011) The effects of phonological neighborhoods on pronunciation variation in conversational speech. Ph.D. Dissertation, UC Berkeley
Google Scholar
Pitt MA, Myung IJ (2002) When a good fit can be bad. Trends Cogn Sci 6:421–425
Article Google Scholar
Bresnan J, Ford M (2010) Predicting syntax: processing dative constructions in American and Australian varieties of English. Language 86:168–213
Article Google Scholar
Riordan B (2007) There’s two ways to say it: modeling nonprestige there’s. Corpus Linguist Linguist Theory 3:233–279
Article Google Scholar
Theijssen D, ten Bosch L, Boves L, Cranen B, van Halteren H (2013) Choosing alternatives: using Bayesian networks and memory-based learning to study the dative alternation. Corpus Linguist Linguist Theory 9:227–262
Article Google Scholar
Albright A, Hayes B (2003) Rules vs. analogy in English past tenses: a computational/experimental study. Cognition 90:119–161
Article Google Scholar
Bates D, Maechler M, Dai B (2012) lme4: linear mixed-effects models using S4 classes. R package version 0.999999-0
Google Scholar
Kohavi R (1995) A study of cross-validation and bootstrap for accuracy estimation and model selection. In: International joint conference on artificial intelligence (IJCAI), vol 2. Morgan Kaufmann, San Francisco, pp 1136–1143
Google Scholar
Efron B, Tibshirani R (1993) An introduction to the bootstrap. Chapman & Hall, London
Book MATH Google Scholar

Download references

Acknowledgments

We would like to thank the two helpful anonymous reviewers of this article, as well as Daniel Johnson for bringing the Nakagawa and Schielzeth’s [51] paper to our attention. We further thank the audience of the LSD 2012 for helpful comments and questions.

Author information

Authors and Affiliations

University of Oregon, Eugene, OR, USA
Danielle Barth & Vsevolod Kapatsinski

Authors

Danielle Barth
View author publications
You can also search for this author in PubMed Google Scholar
Vsevolod Kapatsinski
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Vsevolod Kapatsinski .

Editor information

Editors and Affiliations

Faculty of Arts, Research Group QLVL, KU Leuven, Belgium
Dirk Speelman
Faculty of Arts, Research Group QLVL, KU Leuven, Belgium
Kris Heylen
Faculty of Arts, Research Group QLVL, KU Leuven, Belgium
Dirk Geeraerts

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Barth, D., Kapatsinski, V. (2018). Evaluating Logistic Mixed-Effects Models of Corpus-Linguistic Data in Light of Lexical Diffusion. In: Speelman, D., Heylen, K., Geeraerts, D. (eds) Mixed-Effects Regression Models in Linguistics. Quantitative Methods in the Humanities and Social Sciences. Springer, Cham. https://doi.org/10.1007/978-3-319-69830-4_6

Download citation

DOI: https://doi.org/10.1007/978-3-319-69830-4_6
Published: 08 February 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-69828-1
Online ISBN: 978-3-319-69830-4
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)

Publish with us

Policies and ethics