Learning Morphological Constructions

Chapter
Part of the Studies in Morphology book series (SUMO, volume 4)

Abstract

The great variability of morphological structure across languages makes it uncontroversial that morphology is learned. Yet, morphology presents formidable learning challenges, on par with those of syntax. This article takes a constructionist perspective in assuming that morphological constructions are a major outcome of the learning process. However, the existence of morphological paradigms in many languages suggests that they are often not the only outcome. The article reviews domain-general approaches to achieving this outcome. The primary focus is on mechanisms proposed within the associative/connectionist tradition, which are compared with Bayesian approaches. The issues discussed include the role of prediction and prediction error in learning, generative vs. discriminative learning models, directionality of associations, the roles of (unexpectedly) present vs. absent stimuli, general-to-specific vs. specific-to-general learning, and the roles of type and token frequency. In the process, the notion of a construction itself is shown to be more complicated that it first appears.

Keywords

Learning Morphology Connectionism Bayes Type frequency Token frequency Contingency learning Schema Linguistic constructions Morphological paradigms Productivity 

References

  1. Ackerman, F., and R. Malouf. 2013. Morphological organization: The low conditional entropy conjecture. Language 89 (3): 429–464.Google Scholar
  2. Albright, A., and B. Hayes. 2003. Rules vs. analogy in English past tenses: A computational/experimental study. Cognition 90 (2): 119–161.Google Scholar
  3. Ambridge, B., and E.V. Lieven. 2011. Child language acquisition: Contrasting theoretical approaches. Cambridge: Cambridge University Press.Google Scholar
  4. Ambridge, B., J.M. Pine, C.F. Rowland, and C.R. Young. 2008. The effect of verb semantic class and verb frequency (entrenchment) on children’s and adults’ graded judgments of argument-structure overgeneralisation errors. Cognition 106 (1): 87–129.Google Scholar
  5. Ambridge, B., J.M. Pine, C.F. Rowland, and F. Chang. 2012. The roles of verb semantics, entrenchment and morphophonology in the retreat from dative argument structure overgeneralization errors. Language 88 (1): 45–81.Google Scholar
  6. Arcediano, F., M. Escobar, and R.R. Miller. 2003. Temporal integration and temporal backward associations in human and nonhuman subjects. Animal Learning & Behavior 31 (3): 242–256.Google Scholar
  7. ———. 2005. Bidirectional associations in humans and rats. Journal of Experimental Psychology: Animal Behavior Processes 31 (3): 301–318.Google Scholar
  8. Asch, S.E., and S.M. Ebenholtz. 1962. The principle of associative symmetry. Proceedings of the American Philosophical Society 106 (2): 135–163.Google Scholar
  9. Aronoff, M. 2016. Competition and the lexicon. In Livelli di Analisi e fenomeni di interfaccia. Atti del XLVII Congresso Internazionale della Società di Linguistica Italiana, ed. A. Elia, C. Iacobini, and M. Voghera, 39–52. Roma: Bulzoni Editore.Google Scholar
  10. Ashby, F.G., J.M. Ennis, and B.J. Spiering. 2007. A neurobiological theory of automaticity in perceptual categorization. Psychological Review 114: 632–656.Google Scholar
  11. Baayen, R.H. 1992. Quantitative aspects of morphological productivity. In Yearbook of morphology 1991, ed. G. Booij and J. van Marle, 109–149. Dordrecht: Kluwer.Google Scholar
  12. Baayen, R.H., T. Dijkstra, and R. Schreuder. 1997. Singulars and plurals in Dutch: Evidence for a parallel dual-route model. Journal of Memory and Language 37 (1): 94–117.Google Scholar
  13. Baayen, R.H., P. Milin, D.F. Đurđević, P. Hendrix, and M. Marelli. 2011. An amorphous model for morphological processing in visual comprehension based on naive discriminative learning. Psychological Review 118 (3): 438–481.Google Scholar
  14. Barðdal, J. 2008. Productivity: Evidence from case and argument structure in Icelandic. Amsterdam: John Benjamins.Google Scholar
  15. Baroni, M., J. Matiasek, and H. Trost. 2002. Unsupervised discovery of morphologically related words based on orthographic and semantic similarity. In Proceedings of the workshop on Morphological and Phonological Learning of ACL/SIGPHON-2002, 48–57. Philadelphia.Google Scholar
  16. Becker, M., and M. Gouskova. 2016. Source-oriented generalizations as grammar inference in Russian vowel deletion. Linguistic Inquiry 47 (3): 391–425.Google Scholar
  17. Booij, G. 2010. Construction morphology. Oxford: Oxford University Press.Google Scholar
  18. Boudelaa, S., and M.G. Gaskell. 2002. A re-examination of the default system for Arabic plurals. Language & Cognitive Processes 17 (3): 321–343.Google Scholar
  19. Boyd, J.K., and A.E. Goldberg. 2011. Learning what not to say: The role of statistical preemption and categorization in a-adjective production. Language 87 (1): 55–83.Google Scholar
  20. Braine, M.D. 1963. The ontogeny of English phrase structure. Language 39: 1–14.Google Scholar
  21. Braine, M.D., R.E. Brody, P.J. Brooks, V. Sudhalter, J.A. Ross, L. Catalano, and S.M. Fisch. 1990. Exploring language acquisition in children with a miniature artificial language: Effects of item and pattern frequency, arbitrary subclasses, and correction. Journal of Memory & Language 29 (5): 591–610.Google Scholar
  22. Brooks, P.J., M.D.S. Braine, L. Catalano, R.E. Brody, and V. Sudhalter. 1993. Acquisition of gender-like noun subclasses in an artificial language: The contribution of phonological markers to learning. Journal of Memory & Language 32: 79–95.Google Scholar
  23. Buz, E., M.K. Tanenhaus, and T.F. Jaeger. 2016. Dynamically adapted context-specific hyper-articulation: Feedback from interlocutors affects speakers’ subsequent pronunciations. Journal of Memory and Language 89: 68–86.Google Scholar
  24. Bybee, J. 1985. Morphology: A study of the relation between meaning and form. Amsterdam: John Benjamins.Google Scholar
  25. ———. 2001. Phonology and language use. Cambridge: Cambridge University Press.Google Scholar
  26. ———. 2003. Cognitive processes in grammaticalization. In The new psychology of language, ed. M. Tomasello, vol. 2, 145–167. Mahwah: Lawrence Erlbaum.Google Scholar
  27. Caplan, Jeremy B., Kathy L. Boulton, and Christina L. Gagné. 2014. Associative asymmetry of compound words. Journal of Experimental Psychology: Learning, Memory, and Cognition 40: 1163–1171.Google Scholar
  28. Cappelle, B. 2006. Particle placement and the case for “allostructions”. Constructions SV1(7): 1–28.Google Scholar
  29. Carvalho, P.F., and R.L. Goldstone. 2016. The encoding of characteristic and diagnostic properties during interleaved and blocked category learning. Paper presented at the 57th annual meeting of the Psychonomic Society, Boston, MA, November 17–20.Google Scholar
  30. Chomsky, N. 1981. Principles and parameters in syntactic theory. In Explanation in linguistics: The logical problem of language acquisition, ed. N. Hornstein and D. Lightfoot, 32–75. London: Longman.Google Scholar
  31. Clahsen, H. 1999. Lexical entries and rules of language: A multidisciplinary study of German inflection. Behavioral and Brain Sciences 22 (6): 991–1013.Google Scholar
  32. Cohen, A.L., R.M. Nosofsky, and S.R. Zaki. 2001. Category variability, exemplar similarity, and perceptual classification. Memory & Cognition 29 (8): 1165–1175.Google Scholar
  33. Croft, W. 2001. Radical construction grammar: Syntactic theory in typological perspective. Oxford: Oxford University Press.Google Scholar
  34. Dąbrowska, E. 2004. Language, mind and brain: Some psychological and neurological constraints on theories of grammar. Edinburgh: Edinburgh University Press.Google Scholar
  35. ———. 2012. Different speakers, different grammars: Individual differences in native language attainment. Linguistic Approaches to Bilingualism 2 (3): 219–253.Google Scholar
  36. De Houwer, J., and T. Beckers. 2003. Secondary task difficulty modulates forward blocking in human contingency learning. Quarterly Journal of Experimental Psychology 56B (4): 345–357.Google Scholar
  37. Ellis, N.C. 2006. Language acquisition as rational contingency learning. Applied Linguistics 27 (1): 1–24.Google Scholar
  38. Ervin, S.M. 1961. Changes with age in the verbal determinants of word-association. American Journal of Psychology 74: 361–372.Google Scholar
  39. Fellbaum, C. 1996. Co-occurrence and antonymy. International Journal of Lexicography 8 (4): 281–303.Google Scholar
  40. Fillmore, C.J., P. Kay, and M.C. O'Connor. 1988. Regularity and idiomaticity in grammatical constructions: The case of let alone. Language 64 (3): 501–538.Google Scholar
  41. Frigo, L., and J.L. McDonald. 1998. Properties of phonological markers that affect the acquisition of gender-like subclasses. Journal of Memory and Language 39 (2): 218–245.Google Scholar
  42. Gershkoff-Stowe, L., and L.B. Smith. 1997. A curvilinear trend in naming errors as a function of early vocabulary growth. Cognitive Psychology 34 (1): 37–71.Google Scholar
  43. Goldberg, A.E. 1995. Constructions: A construction grammar approach to argument structure. Chicago: Chicago University Press.Google Scholar
  44. ———. 2002. Surface generalizations: An alternative to alternations. Cognitive Linguistics 13 (4): 327–356.Google Scholar
  45. ———. 2006. Constructions at work: The nature of generalization in language. Oxford: Oxford University Press.Google Scholar
  46. Goldberg, A.E., D.M. Casenhiser, and N. Sethuraman. 2004. Learning argument structure generalizations. Cognitive Linguistics 15: 289–316.Google Scholar
  47. Goldstein, M.H., and J.A. Schwade. 2010. From birds to words: Perception of structure in social interactions guides vocal development and language learning. In The Oxford handbook of developmental behavioral neuroscience, ed. M.S. Blumberg, J.H. Freeman, and S.R. Robinson, 708–729. Oxford: Oxford University Press.Google Scholar
  48. Gómez, R.L. 2002. Variability and detection of invariant structure. Psychological Science 13 (5): 431–436.Google Scholar
  49. Hare, M., J.L. Elman, and K.G. Daugherty. 1995. Default generalisation in connectionist networks. Language and Cognitive Processes 10 (6): 601–630.Google Scholar
  50. Harmon, Z., and V. Kapatsinski. 2017. Putting old tools to new uses: The role of form accessibility in semantic extension. Cognitive Psychology 98: 22–44.Google Scholar
  51. Hay, J. 2001. Lexical frequency in morphology: Is everything relative? Linguistics 39 (6): 1041–1070.Google Scholar
  52. ———. 2003. Causes and consequences of word structure. London: Routledge.Google Scholar
  53. Hayes, B. 2004. Phonological acquisition in optimality theory: The early stages. In Constraints in phonological acquisition, ed. R. Kager, J. Pater, and W. Zonneveld, 158–203. Cambridge: Cambridge University Press.Google Scholar
  54. Hebb, D.O. 1949. The organization of behavior: A neuropsychological approach. New York: Wiley.Google Scholar
  55. Hilpert, M. 2008. Germanic future constructions: A usage-based approach to language change. Amsterdam: John Benjamins.Google Scholar
  56. Horwood, G. 2001. Anti-faithfulness and subtractive morphology. Ms., Rutgers University, available as ROA, 466-0901.Google Scholar
  57. Hsu, A., and T.L. Griffiths. 2009. Differential use of implicit negative evidence in generative and discriminative language learning. Advances in Neural Information Processing Systems 22: 754–762.Google Scholar
  58. Hsu, A.S., and T.E. Griffiths. 2010. Effects of generative and discriminative learning on use of category variability. Proceedings of the Annual Conference of the Cognitive Science Society 32: 242–247.Google Scholar
  59. Jones, S., C. Paradis, M.L. Murphy, and C. Willners. 2007. Googling for ‘opposites’: A web-based study of antonym canonicity. Corpora 2 (2): 129–154.Google Scholar
  60. Jun, J., and A. Albright. 2017. Speakers’ knowledge of alternations is asymmetrical: Evidence from Seoul Korean verb paradigms. Journal of Linguistics 53: 567–611.Google Scholar
  61. Justeson, J.S., and S.M. Katz. 1991. Co-occurrences of antonymous adjectives and their contexts. Computational Linguistics 17: 1–19.Google Scholar
  62. Kahana, M.J. 2002. Associative symmetry and memory theory. Memory & Cognition 30: 823–840.Google Scholar
  63. Kamin, L.J. 1969. Predictability, surprise, attention, and conditioning. In Punishment and aversive behavior, ed. B.A. Campbell and R.M. Church, 279–296. New York: Appleton-Century-Crofts.Google Scholar
  64. Kapatsinski, V. 2005. Productivity of Russian stem extensions: Evidence for and a formalization of network theory. MA thesis, The University of New Mexico.Google Scholar
  65. ———. 2013. Conspiring to mean: Experimental and computational evidence for a usage-based harmonic approach to morphophonology. Language 89 (1): 110–148.Google Scholar
  66. ———. 2017. Learning a subtractive morphological system: Statistics and representations. Proceedings of the Boston University Conference on Language Development 41 (1): 357–372.Google Scholar
  67. ———. Forthcoming. Changing minds changing tools: From learning theory to language acquisition to language change. Cambridge, MA: MIT Press.Google Scholar
  68. Kapatsinski, V., and Z. Harmon. 2017. A Hebbian account of entrenchment and (over-)extension in language learning. Proceedings of the Annual Conference of the Cognitive Science Society 39: 2366–2371.Google Scholar
  69. Kirby, S., H. Cornish, and K. Smith. 2008. Cumulative cultural evolution in the laboratory: An experimental approach to the origins of structure in human language. Proceedings of the National Academy of Sciences 105 (31): 10681–10686.Google Scholar
  70. Köpcke, K.-M. 1998. The acquisition of plural marking in English and German revisited: Schemata versus rules. Journal of Child Language 25 (2): 293–319.Google Scholar
  71. Köpcke, K.-M., and V. Wecker. 2017. Source-and product-oriented strategies in L2 acquisition of plural marking in German. Morphology 27: 77–103.Google Scholar
  72. Kruschke, J.K. 2006. Locally Bayesian learning with applications to retrospective revaluation and highlighting. Psychological Review 113 (4): 677–699.Google Scholar
  73. Kurisu, K. 2001. The phonology of morpheme realization. Doctoral dissertation, UC Santa Cruz.Google Scholar
  74. Landauer, T.K., and S.T. Dumais. 1997. A solution to Plato's problem: The latent semantic analysis theory of acquisition, induction, and representation of knowledge. Psychological Review 104 (2): 211–240.Google Scholar
  75. Langacker, R. 1987. Foundations of cognitive grammar, Theoretical prerequisites. Vol. 1. Stanford: Stanford University Press.Google Scholar
  76. Łubowicz, A. 2007. Paradigmatic contrast in polish. Journal of Slavic Linguistics 15: 229–262.Google Scholar
  77. Mackintosh, N.J. 1975. A theory of attention: Variations in the associability of stimuli with reinforcement. Psychological Review 82 (4): 276–298.Google Scholar
  78. Madlener, K. 2016. Input optimization: Effects of type and token frequency manipulations in instructed second language learning. In Experience counts: Frequency effects in language, ed. H. Behrens and S. Pfänder, 133–174. Berlin: Walter de Gruyter.Google Scholar
  79. Maniwa, K., A. Jongman, and T. Wade. 2009. Acoustic characteristics of clearly spoken English fricatives. The Journal of the Acoustical Society of America 125 (6): 3962–3973.Google Scholar
  80. McCarthy, J.J. 1998. Morpheme structure constraints and paradigm occultation. Chicago Linguistic Society 32 (2): 123–150.Google Scholar
  81. McClelland, J.L. 2001. Failures to learn and their remediation: A Hebbian account. In Mechanisms of cognitive development: Behavioral and neural perspectives, ed. J.L. McClelland and R.S. Siegler, 97–121. Mahwah: Erlbaum.Google Scholar
  82. McClelland, J.L., B.L. McNaughton, and R.C. O'Reilly. 1995. Why there are complementary learning systems in the hippocampus and neocortex: Insights from the successes and failures of connectionist models of learning and memory. Psychological Review 102 (3): 419–457.Google Scholar
  83. McKenzie, C.R., and L.A. Mikkelsen. 2007. A Bayesian view of covariation assessment. Cognitive Psychology 54 (1): 33–61.Google Scholar
  84. McMurray, B., J.S. Horst, and L.K. Samuelson. 2012. Word learning emerges from the interaction of online referent selection and slow associative learning. Psychological Review 119 (4): 831–877.Google Scholar
  85. McNeill, D. 1963. The origin of associations within the same grammatical class. Journal of Verbal Learning & Verbal Behavior 2 (3): 250–262.Google Scholar
  86. ———. 1966. A study of word association. Journal of Verbal Learning & Verbal Behavior 5: 548–557.Google Scholar
  87. Murphy, M.L. 2006. Antonyms as lexical constructions: Or, why paradigmatic construction is not an oxymoron. Constructions SV1(8): 1–37.Google Scholar
  88. Naigles, L.G., and S.A. Gelman. 1995. Overextensions in comprehension and production revisited: Preferential-looking in a study of dog, cat, and cow. Journal of Child Language 22 (1): 19–46.Google Scholar
  89. Nesset, T. 2008. Abstract phonology in a concrete model. Cognitive linguistics and the morphology-phonology interface. Berlin: Mouton de Gruyter.Google Scholar
  90. Nosofsky, R.M. 1988. Similarity, frequency, and category representations. Journal of Experimental Psychology: Learning, Memory, & Cognition 14 (1): 54–65.Google Scholar
  91. O’Donnell, T.J. 2015. Productivity and reuse in language: A theory of linguistic computation and storage. Cambridge, MA: MIT Press.Google Scholar
  92. Oldfield, R.C., and A. Wingfield. 1965. Response latencies in naming objects. Quarterly Journal of Experimental Psychology 17: 273–281.Google Scholar
  93. Pavlov, I.P. 1927. Conditioned reflexes. An nvestigation of the physiological activity of the cerebral cortex. Mineola: Dover.Google Scholar
  94. Pereira, A.F., L.B. Smith, and C. Yu. 2014. A bottom-up view of toddler word learning. Psychonomic Bulletin & Review 21 (1): 178–185.Google Scholar
  95. Perfors, A., K. Ransom, and D.J. Navarro. 2014. People ignore token frequency when deciding how widely to generalize. Proceedings of the Annual Conference of the Cognitive Science Society 36: 2759–2764.Google Scholar
  96. Piantadosi, S.T., H. Tily, and E. Gibson. 2012. The communicative function of ambiguity in language. Cognition 122 (3): 280–291.Google Scholar
  97. Pinker, S., and A. Prince. 1988. On language and connectionism: Analysis of a parallel distributed processing model of language acquisition. Cognition 28 (1): 73–193.Google Scholar
  98. Poser, W.J. 1990. Evidence for foot structure in Japanese. Language 66 (1): 78–105.Google Scholar
  99. Ramscar, M., and D. Yarlett. 2007. Linguistic self-correction in the absence of feedback: A new approach to the logical problem of language acquisition. Cognitive Science 31 (6): 927–960.Google Scholar
  100. Ramscar, M., D. Yarlett, M. Dye, K. Denny, and K. Thorpe. 2010. The effects of feature-label-order and their implications for symbolic learning. Cognitive Science 34 (6): 909–957.Google Scholar
  101. Ramscar, M., M. Dye, and J. Klein. 2013a. Children value informativity over logic in word learning. Psychological Science 24 (6): 1017–1023.Google Scholar
  102. Ramscar, M., M. Dye, and S.M. McCauley. 2013b. Error and expectation in language learning: The curious absence of mouses in adult speech. Language 89 (4): 760–793.Google Scholar
  103. Regier, T., and S. Gahl. 2004. Learning the unlearnable: The role of missing evidence. Cognition 93 (2): 147–155.Google Scholar
  104. Rescorla, R.A., and A.R. Wagner. 1972. A theory of Pavlovian conditioning: Variations in the effectiveness of reinforcement and nonreinforcement. In Classical conditioning II: Current research and theory, ed. A.H. Black and W.F. Prokasy, 64–99. New York: Appleton-Century-Crofts.Google Scholar
  105. Rogers, T.T., and J.L. McClelland. 2004. Semantic cognition: A parallel distributed processing approach. Cambridge, MA: MIT Press.Google Scholar
  106. Schertz, J. 2013. Exaggeration of featural contrasts in clarifications of misheard speech in English. Journal of Phonetics 41 (3): 249–263.Google Scholar
  107. Seyfarth, S., E. Buz, and T.F. Jaeger. 2016. Dynamic hyperarticulation of coda voicing contrasts. The Journal of the Acoustical Society of America 139 (2): EL31–EL37.Google Scholar
  108. Smolek, A., and V. Kapatsinski. in preparation. The importance of contiguity for learning paradigmatic mappings. Ms., U Oregon.Google Scholar
  109. Sternberg, D., and J.L. McClelland. 2009. When should we expect indirect effects in human contingency learning. Proceedings of the Annual Conference of the Cognitive Science Society 31: 206–211.Google Scholar
  110. Suttle, L., and A.E. Goldberg. 2011. The partial productivity of constructions as induction. Linguistics 49 (6): 1237–1269.Google Scholar
  111. Taatgen, N.A., and J.R. Anderson. 2002. Why do children learn to say “broke”? A model of learning the past tense without feedback. Cognition 86 (2): 123–155.Google Scholar
  112. Tassoni, C.J. 1995. The least mean squares network with information coding: A model of cue learning. Journal of Experimental Psychology: Learning, Memory, & Cognition 21 (1): 193–204.Google Scholar
  113. Tomasello, M. 2003. Constructing a language: A usage-based theory of language acquisition. Cambridge, MA: Harvard University Press.Google Scholar
  114. Torres Cacoullos, R., and J.A. Walker. 2009. The present of the English future: Grammatical variation and collocations in discourse. Language 85 (2): 321–354.Google Scholar
  115. Vadillo, M.A., and H. Matute. 2010. Augmentation in contingency learning under time pressure. British Journal of Psychology 101 (3): 579–589.Google Scholar
  116. van Hamme, L.J., and E.A. Wasserman. 1994. Cue competition in causality judgments: The role of nonpresentation of compound stimulus elements. Learning & Motivation 25 (2): 127–151.Google Scholar
  117. Waldmann, M.R. 2000. Competition among causes but not effects in predictive and diagnostic learning. Journal of Experimental Psychology: Learning, Memory, & Cognition 26 (1): 53–76.Google Scholar
  118. Waldmann, M.R., and K.J. Holyoak. 1992. Predictive and diagnostic learning within causal models: Asymmetries in cue competition. Journal of Experimental Psychology: General 121 (2): 222–236.Google Scholar
  119. Wasserman, E.A., W.W. Dorner, and S.-F. Kao. 1990. Contributions of specific cell information to judgments to interevent contingency. Journal of Experimental Psychology: Learning, Memory, and Cognition 16: 509–521.Google Scholar
  120. Weir, R.H. 1962. Language in the crib. The Hague: Mouton.Google Scholar
  121. Williams, J.N. 2003. Inducing abstract linguistic representations: Human and connectionist learning of noun classes. In The lexicon-syntax interface in second language acquisition, ed. R. van Hout, A. Hulk, F. Kuiken, and R.J. Towell, 151–174. Amsterdam: John Benjamins.Google Scholar
  122. Xu, J., and W.B. Croft. 1998. Corpus-based stemming using co-occurrence of word variants. ACM Transactions on Information Systems 16 (1): 61–81.Google Scholar
  123. Xu, F., and J.B. Tenenbaum. 2007. Word learning as Bayesian inference. Psychological Review 114 (2): 245–272.Google Scholar
  124. Yu, C., and L.B. Smith. 2007. Rapid word learning under uncertainty via cross-situational statistics. Psychological Science 18 (5): 414–420.Google Scholar
  125. Zaki, S., A. Rich, and S. Stacy. 2016. The sequence of items in category learning: Modeling and eye-tracking data. Paper presented at the 57th annual meeting of the Psychonomic Society, Boston, MA, November 17–20.Google Scholar
  126. Zipf, G.K. 1949. Human behavior and the principle of least effort: An introduction to human ecology. Cambridge, MA: Addison-Wesley Press.Google Scholar

Copyright information

© Springer International Publishing AG, part of Springer Nature 2018

Authors and Affiliations

  1. 1.Department of LinguisticsUniversity of OregonEugeneUSA

Personalised recommendations