Natural Language Complexity and Machine Learning

  • Leonor Becerra-Bonache
  • M. Dolores Jiménez-LópezEmail author
Conference paper
Part of the Advances in Intelligent Systems and Computing book series (AISC, volume 801)


Eventhough complexity is a central notion in linguistics, until recently, it has not been widely researched in the area. During the 20th century, linguistic complexity was supposed to be invariant. In general, recent work on language complexity takes an absolute perspective of the concept while the relative complexity approach –although considered as conceptually coherent– has hardly begun to be developed. In this paper, we introduce machine learning tools that can be used to calculate natural language complexity from a relative point of view by considering the process of first language acquisition.


Complexity Natural language Machine learning 



This research has been supported by the Ministerio de Economía y Competitividad and the Fondo Europeo de Desarrollo Regional under the project number FFI2015-69978-P (MINECO/FEDER, UE) of the Programa Estatal de Fomento de la Investigación Científica y Técnica de Excelencia, Subprograma Estatal de Generación de Conocimiento.

The work of Leonor Becerra-Bonache has been performed during her teaching leave granted by the CNRS (French National Center for Scientific Research) in the Computer Science Department of Aix-Marseille University.


  1. 1.
    Andrason, A.: Language complexity: an insight from complex-system theory. Int. J. Lang. Linguist. 2(2), 74–89 (2014)Google Scholar
  2. 2.
    Angluin, D., Becerra-Bonache, L.: A model of semantics and corrections in language learning. Technical report, Yale University (2010)Google Scholar
  3. 3.
    Angluin, D., Becerra-Bonache, L.: Effects of meaning-preserving corrections on language learning. In: Proceedings of the 15th International Conference on Computational Natural Language Learning, CoNLL 2011, Portland, pp. 97–105 (2011)Google Scholar
  4. 4.
    Angluin, D., Becerra-Bonache, L.: A model of language learning with semantics and meaning preserving corrections. Artif. Intell. 242, 23–51 (2016)MathSciNetCrossRefGoogle Scholar
  5. 5.
    Bane, M.: Quantifying and measuring morphological complexity. In: Chang, C., Haynie, H. (eds.) Proceedings of the 26th West Coast Conference on Formal Linguistics, pp. 69–76. Cascadilla Proceedings Project, Somerville (2008)Google Scholar
  6. 6.
    Becerra-Bonache, L., Blockeel, H., Galván, M., Jacquenet, F.: A first-order-logic based model for grounded language learning. In: Advances in Intelligent Data Analysis XIV - 14th International Symposium, IDA 2015, pp. 49–60 (2015)CrossRefGoogle Scholar
  7. 7.
    Becerra-Bonache, L., Blockeel, H., Galván, M., Jacquenet, F.: Learning language models from images with regll. In: Machine Learning and Knowledge Discovery in Databases - European Conference, ECML PKDD 2016, pp. 55–58 (2016)Google Scholar
  8. 8.
    Becerra-Bonache, L., Blockeel, H., Galván, M., Jacquenet, F.: Relational grounded language learning. In: ECAI 2016 - 22nd European Conference on Artificial Intelligence, 29 August-2 September 2016, The Hague, The Netherlands - Including Prestigious Applications of Artificial Intelligence (PAIS 2016), pp. 1764–1765 (2016)Google Scholar
  9. 9.
    Blache, P.: A computational model for linguistic complexity. In: Bel-Enguix, G., Dahl, V., Jiménez-López, M. (eds.) Biology, Computation and Linguistics. New Interdisciplinary Paradigms, pp. 155–167. IOS Press, Amsterdam (2011)Google Scholar
  10. 10.
    Crystal, D.: The Cambridge Encyclopedia of Language. Cambridge University Press, Cambridge (1997)Google Scholar
  11. 11.
    Dahl, O.: The Growth and Maintenance of Linguistic Complexity. John Benjamins, Amsterdam (2004)CrossRefGoogle Scholar
  12. 12.
    Deutscher, G.: Overall complexity: a wild goose chase? In: Sampson, G., Gil, D., Trudgill, P. (eds.) Language Complexity as an Evolving Variable, pp. 243–251. Oxford University Press, Oxford (2009)Google Scholar
  13. 13.
    Juola, P.: Assessing linguistic complexity. In: Miestamo, M., Sinnemäki, K., Karlsson, F. (eds.) Language Complexity: Typology, Contact, Change, pp. 89–108. John Benjamins, Amsterdam (2009)Google Scholar
  14. 14.
    Kusters, W.: Linguistic Complexity: The Influence of Social Change on Verbal Inflection. LOT, Utrecht (2003)Google Scholar
  15. 15.
    McWhorter, J.: The world’s simplest grammars are creole grammars. Linguist. Typol. 6, 125–166 (2001)Google Scholar
  16. 16.
    Miestamo, M.: On the feasibility of complexity metrics. In: Krista, K., Sepper, M. (eds.) Finest Linguistics. Proceedings of the Annual Finish and Estonian Conference of Linguistics, pp. 11–26. Tallinna Ülikooli Kirjastus, Tallinn (2006)Google Scholar
  17. 17.
    Miestamo, M.: Grammatical complexity in a cross-linguistic perspective. In: Miestamo, M., Sinnemäki, K., Karlsson, F. (eds.) Language Complexity: Typology, Contact, Change, pp. 23–42. John Benjamins, Amsterdam (2009)Google Scholar
  18. 18.
    Miestamo, M.: Implicational hierarchies and grammatical complexity. In: Sampson, G., Gil, D., Trudgill, P. (eds.) Language Complexity as an Evolving Variable, pp. 80–97. Oxford University Press, Oxford (2009)Google Scholar
  19. 19.
    Mitchell, T.M.: Machine Learning. McGraw Hill Series in Computer Science. McGraw-Hill, New York (1997)zbMATHGoogle Scholar
  20. 20.
    Mufwene, S., Coupé, C., Pellegrino, F.: Complexity in Language. Cambridge University Press, New York (2017)CrossRefGoogle Scholar
  21. 21.
    Pallotti, G.: A simple view of linguistic complexity. Second Lang. Res. 31, 117–134 (2015)CrossRefGoogle Scholar
  22. 22.
    Stolcke, A., Feldman, J., Lakoff, G., Weber, S.: Miniature language acquisition: a touchstone for cognitive science. Cogn. Sci. 8, 686–693 (1994)Google Scholar
  23. 23.
    Zitnick, C., Parikh, D.: Bringing semantics into focus using visual abstraction. In: Proceedings of the International Conference on Computer Vision and Pattern Recognition, pp. 3009–3016. Portland (2013)Google Scholar

Copyright information

© Springer Nature Switzerland AG 2019

Authors and Affiliations

  • Leonor Becerra-Bonache
    • 1
  • M. Dolores Jiménez-López
    • 2
    Email author
  1. 1.CNRS, Laboratoire Hubert-Curien UMR 5516Univ. Lyon, UJM-St-EtienneSaint-ÉtienneFrance
  2. 2.Departament de Filologies Romániques, Research Group on Mathematical LinguisticsUniversitat Rovira i VirgiliTarragonaSpain

Personalised recommendations