Abstract
We explore methods for evaluating logistic mixed-effects models of both corpus and experimental data types through simulations. We suggest that the fit of the model should be evaluated by examining the variance explained by the fixed effects alone, rather than both fixed and random effects put together. Nonetheless, for corpus data, in which frequent items contribute more observations, coefficient estimates for fixed effects should be derived from a model that includes the random effects. Including random effects in the model with such datasets allows for better estimates of the fixed-effects predictor coefficients. Not having random effects in the model can cause fixed-effects coefficients to be overly influenced by frequent items, which are often exceptional in linguistic data due to lexical diffusion of ongoing changes.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Labov W (1969) Contraction, deletion, and inherent variability of the English copula. Language 45:715–762
Sankoff D (1988) Sociolinguistics and syntactic variation. In: Newmeyer F (ed) Linguistics: the Cambridge survey, volume IV. Cambridge University Press, Cambridge, pp 140–161
Shih S, Grafmiller J, Futrell R, Bresnan J (2015) Rhythm’s role in genitive construction choice in spoken English. In: Vogel R, van de Vijver R (eds) Rhythm in phonetics, grammar and cognition. Mouton de Gruyter, Berlin
Cedergren H, Sankoff D (1974) Variable rules: performance as a statistical reflection of competence. Language 50:333–355
Tagliamonte S, Baayen RH (2012) Models, forests and trees of York English: was/were variation as a case study for statistical practice. Lang Var Chang 24:135–178
Tagliamonte S (2006) Analyzing sociolinguistic variation. Cambridge University Press, Cambridge
Bresnan J, Cueni A, Nikitina T, Baayen RH (2007) Predicting the dative alternation. In: Bouma G, Kraemer I, Zwarts J (eds) Cognitive foundations of interpretation. Royal Netherlands Academy of Arts and Sciences, Amsterdam, pp 69–94
Drager K, Hay J (2012) Exploiting random intercepts: two case studies in sociophonetics. Lang Var Chang 24:59–78
Johnson DE (2014) Progress in regression: why sociolinguistic data calls for mixed-effects models. Ms. Lancaster University. http://danielezrajohnson.com/johnson_2014b.pdf
Johnson DE (2009) Getting off the GoldVarb standard: introducing Rbrul for mixed-effects variable rule analysis. Lang Ling Compass 3:359–383
Sapir E (1921) Language. Harcourt Brace, New York
Bybee J (2001) Phonology and language use. Cambridge University Press, Cambridge
Bybee J, Scheibman J (1999) The effect of usage on degrees of constituency: the reduction of don’t in American English. Linguistics 37:575–596
Clark HH (1973) The language-as-fixed-effect fallacy: a critique of language statistics in psychological research. J Verbal Learn Verbal Behav 12:335–359
Coleman EB (1964) Generalizing to a language population. Psychol Rep 14:219–226
Bybee J (2003) Mechanisms of change in grammaticization: the role of frequency. In: Joseph B, Janda R (eds) Handbook of historical linguistics. Blackwell, Oxford, pp 602–623
Bybee J (2002) Word frequency and context of use in the lexical diffusion of phonetically conditioned sound change. Lang Var Chang 14:261–290
Gafos A, Kirov C (2009) A dynamical model of change in phonological representations: the case of lenition. In: Pellegrino F, Marsico E, Chitoran I, Coupé C (eds) Approaches to phonological complexity. Mouton de Gruyter, Berlin, pp 219–240
Kapatsinski V (2010) Rethinking rule reliability: why an exceptionless rule can fail. In: Proceedings from the annual meeting of the Chicago Linguistic Society, vol 44, no. 2, pp 277–291
Phillips BS (1984) Word frequency and the actuation of sound change. Language 60:320–342
Pierrehumbert J (2001) Exemplar dynamics: word frequency, lenition, and contrast. In: Bybee J, Hopper P (eds) Frequency and the emergence of linguistic structure. John Benjamins, Amsterdam
Schuchardt H (1885) Ãœber die lautgesetze: gegen die junggrammatiker. Oppenheim, Berlin
Pitt MA, Johnson K, Hume E, Kiesling S, Raymond W (2005) The Buckeye corpus of conversational speech: labeling conventions and a test of transcriber reliability. Speech Comm 45:89–95
Barth D (2015) To have and to be: function word reduction in child speech, child-directed speech and inter-adult speech. Ph.D. Dissertation, University of Oregon
Barlow GM (2013) Individual differences and usage-based grammar. Int J Corpus Linguist 18:443–478
Baayen RH (2001) Word frequency distributions. Kluwer, Dordrecht
Zipf GK (1949) Human behaviour and the principle of least effort. Addison-Wesley, Reading
Oldfield RC, Wingfield A (1965) Response latencies in naming objects. Q J Exp Psychol 17(4):272–281
Barabási A-L, Albert R (1999) Emergence of scaling in random networks. Science 286: 509–512
Merton RK (1968) The Matthew effect in science. Science 159:56–63
Simon HA (1955) On a class of skew distribution functions. Biometrika 42:425–440
Yule GU (1925) A mathematical theory of evolution based on the conclusions of Dr. J. C. Willis. Philos Trans R Soc B 213:21–87
Newman MEJ (2005) Power laws, Pareto distributions and Zipf’s law. Contemp Phys 46(5):323–351
Hay JB, Pierrehumbert JB, Walker AJ, LaShell P (2015) Tracking word frequency effects through 130 years of sound change. Cognition 139:83–91
Pierrehumbert J (2002) Word-specific phonetics. In: Gussenhoven C, Warner N (eds) Laboratory phonology 7. Mouton de Gruyter, Berlin, pp 101–140
Briscoe T, Copestake A (1999) Lexical rules in constraint-based grammars. Comput Linguist 25:487–526
Goldberg AE (1995) Constructions. Chicago University Press, Chicago
Pinker S (1984) Language learnability and language development. Harvard University Press, Cambridge
Erker D, Guy GR (2012) The role of lexical frequency in syntactic variability: variable subject personal pronoun expression in Spanish. Language 88:526–557
Sankoff D, Labov W (1979) On the uses of variable rules. Lang Soc 8:189–222
Gerard J, Keller F, Palpanas T (2010) Corpus evidence for age effects on priming in child language. In: Proceedings of the 32nd annual meeting of the Cognitive Science Society, pp 218–223
Sonderegger M (2010) Testing for frequency and structural effects in an English stress shift. In: Proceedings of the Berkeley Linguistics Society, vol 36. pp. 411–425. https://dx.doi.org/10.3765/bls.v36i1.3927
Stefanowitsch A (2008) Negative entrenchment: a usage-based approach to negative evidence. Cogn Linguist 19:513–531
Ambridge B, Pine JM, Rowland CF, Chang F (2012) The roles of verb semantics, entrenchment, and morphophonology in the retreat from dative argument-structure overgeneralization errors. Language 88:45–81
Ambridge B, Pine JM, Rowland CF (2012) Semantics versus statistics in the retreat from locative overgeneralization errors. Cognition 123:260–279
Kapatsinski V (2010) What is it I am writing? Lexical frequency effects in spelling Russian prefixes: uncertainty and competition in an apparently regular system. Corpus Linguist Linguist Theory 6:157–215
Kapatsinski V (submitted) Sound change and hierarchical inference. What is being inferred? Effects of words, phones and frequency. Available at http://blogs.uoregon.edu/ublab/files/2014/11/SoundChange-21fo1iz.pdf
Raymond WD, Brown EL (2012) Are effects of word frequency effects of context of use? An analysis of initial fricative reduction in Spanish. In: Gries ST, Divjak D (eds) Frequency effects in language learning and processing. Mouton de Gruyter, Berlin, pp 35–52
Barth D, Kapatsinski V (2017) A multimodel inference approach to categorical variant choice: Construction, priming and frequency effects on the choice between full and contracted forms of am, are and is. Corpus Linguist Linguist Theory 13:203–260. https://doi.org/10.1515/cllt-2014-0022
Burnham KP, Anderson DR (2002) Model selection and multimodel inference: a practical information-theoretic approach. Springer, New York
Nakagawa S, Schielzeth H (2013) A general and simple method for obtaining R2 from generalized linear mixed-effects models. Methods Ecol Evol 4:133–142
Baayen RH (2008) Analyzing linguistic data: a practical introduction to statistics using R. Cambridge University Press, Cambridge
Keune K, Ernestus M, Van Hout R, Baayen RH (2005) Social, geographical, and register variation in Dutch: from written MOGELIJK to spoken MOK. Corpus Linguist Linguist Theory 1:183–223
Kothari A (2007) Accented pronouns and unusual antecedents: a corpus study. In: Proceedings of the 8th SIGDial workshop on discourse and dialogue. Association for Computational Linguistics, Antwerp, pp 150–157
Lohmann A (2011) Help vs. help to: a multifactorial, mixed-effects account of infinitive marker omission. Engl Lang Linguist 15:499–521
Theijssen D (2009) Variable selection in logistic regression: the British English dative alternation. In: Icard T, Muskens R (eds) Interfaces: explorations in logic, language and computation. Springer, Berlin, pp 87–101
Antić E (2012) Relative frequency effects in Russian morphology. In: Gries ST, Divjak D (eds) Frequency effects in language learning and processing. Mouton de Gruyter, Berlin, pp 83–108
Antić E (2010) The representation of morphemes in the Russian lexicon. Dissertation, UC Berkeley
Yao Y (2011) The effects of phonological neighborhoods on pronunciation variation in conversational speech. Ph.D. Dissertation, UC Berkeley
Pitt MA, Myung IJ (2002) When a good fit can be bad. Trends Cogn Sci 6:421–425
Bresnan J, Ford M (2010) Predicting syntax: processing dative constructions in American and Australian varieties of English. Language 86:168–213
Riordan B (2007) There’s two ways to say it: modeling nonprestige there’s. Corpus Linguist Linguist Theory 3:233–279
Theijssen D, ten Bosch L, Boves L, Cranen B, van Halteren H (2013) Choosing alternatives: using Bayesian networks and memory-based learning to study the dative alternation. Corpus Linguist Linguist Theory 9:227–262
Albright A, Hayes B (2003) Rules vs. analogy in English past tenses: a computational/experimental study. Cognition 90:119–161
Bates D, Maechler M, Dai B (2012) lme4: linear mixed-effects models using S4 classes. R package version 0.999999-0
Kohavi R (1995) A study of cross-validation and bootstrap for accuracy estimation and model selection. In: International joint conference on artificial intelligence (IJCAI), vol 2. Morgan Kaufmann, San Francisco, pp 1136–1143
Efron B, Tibshirani R (1993) An introduction to the bootstrap. Chapman & Hall, London
Acknowledgments
We would like to thank the two helpful anonymous reviewers of this article, as well as Daniel Johnson for bringing the Nakagawa and Schielzeth’s [51] paper to our attention. We further thank the audience of the LSD 2012 for helpful comments and questions.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer International Publishing AG, part of Springer Nature
About this chapter
Cite this chapter
Barth, D., Kapatsinski, V. (2018). Evaluating Logistic Mixed-Effects Models of Corpus-Linguistic Data in Light of Lexical Diffusion. In: Speelman, D., Heylen, K., Geeraerts, D. (eds) Mixed-Effects Regression Models in Linguistics. Quantitative Methods in the Humanities and Social Sciences. Springer, Cham. https://doi.org/10.1007/978-3-319-69830-4_6
Download citation
DOI: https://doi.org/10.1007/978-3-319-69830-4_6
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-69828-1
Online ISBN: 978-3-319-69830-4
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)