Computational and Corpus Approaches to Chinese Language Learning: An Introduction

  • Xiaofei Lu
  • Berlin Chen
Part of the Chinese Language Learning Sciences book series (CLLS)


In this introductory chapter, we first discuss the rationale and objectives of the book. We then briefly review the corpus linguistics research that intersects with Chinese language pedagogy and acquisition. This is followed by an overview of the state of the art in computational linguistics and natural language processing research that pertains to Chinese language teaching, learning, and assessment. We conclude by describing the organization of the book.



Copyright information

© Springer Nature Singapore Pte Ltd. 2019

Authors and Affiliations

  1. The Pennsylvania State University, University Park, USA
  2. National Taiwan Normal University, Taipei, Taiwan
