Prediction of Learner Native Language by Writing Error Pattern

  • Brendan FlanaganEmail author
  • Chengjiu Yin
  • Takahiko Suzuki
  • Sachio Hirokawa
Part of the Lecture Notes in Computer Science book series (LNCS, volume 9192)


The native language of a foreign language learner can have an effect on the errors they make because of similarities or differences between the two languages. In order to provide effective error prediction and correction for non-native English language learners it is important to identify their specific characteristic error patterns that are influenced by their native language. In this paper, we examine analyzing error detection scores to predict the native language of an English language learner. 15 categories of error detection scores are combined to create an error prediction score vector representation of each sentence. The native language is predicted by training an SVM classifier with the error vectors. The results are compared to an SVM classifier trained with just word representations of the learner writing sentences.


Native language prediction Writing errors SVM classifier 



This work was partially supported by JSPS KAKENHI Grant Number 24500176.


  1. 1.
    Graddol, D.: English Next: Why Global English May Mean the End of English as a Foreign Language. British Council, London (2006)Google Scholar
  2. 2.
    Guo, Y., Beckett, G.H.: The hegemony of english as a global language: reclaiming local knowledge and culture in china. Convergence 40, 117–132 (2007)Google Scholar
  3. 3.
    Flanagan, B., Yin, C., Suzuki, T., Hirokawa, S.: Classification and clustering english writing errors based on native language. In: IIAI 3rd International Conference on Advanced Applied Informatics (IIAIAAI), pp. 318-323 (2014)Google Scholar
  4. 4.
    Kroll, B.: What does time buy? ESL student performance on home versus class compositions. In: Kroll, B. (ed.) Second Language Writing: Research Insights for the Classroom, pp. 140–154. Cambridge University Press, Cambridge (1990)CrossRefGoogle Scholar
  5. 5.
    Weltig, M.S.: Effects of language errors and importance attributed to language on language and rhetorical-level essay scoring. In: Spaan Fellow Working Papers in . Second or Foreign Language Assess. vol. 2(1001), pp. 53-81 (2004)Google Scholar
  6. 6.
    Flanagan, B., Yin, C., Suzuki, T., Hirokawa, S.: Intelligent Computer Classification of English Writing Errors. In: Proceedings of the 6th International Conference on Intelligent Interactive Multimedia Systems and Services (IIMSS 2013) vol. 254, pp. 174-183, IOS Press (2013)Google Scholar
  7. 7.
    Flanagan, B., Yin, C., Hashimoto, K., Hirokawa, S.: Clustering English Writing Errors based on Error Category Prediction, ISEEE 2013, pp. 733-738 (2013)Google Scholar
  8. 8.
    Flanagan, B., Yin, C., Suzuki, T., Hirokawa, S.: Classification of english language learner writing errors using a parallel corpus with svm. Int. J. Knowl. Web Intell. 5(1), 21–35 (2014)CrossRefGoogle Scholar
  9. 9.
    Wong, S.M.J., Dras, M., Johnson, M.: Exploring adaptor grammars for native language identification. In: Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning. Association for Computational Linguistics, pp. 699-709 (2012)Google Scholar
  10. 10.
    Brooke, J., Hirst, G.: Native language detection with ‘cheap’ learner corpora. In Twenty Years of Learner Corpus Research. Looking Back, Moving Ahead: Proceedings of the First Learner Corpus Research Conference (LCR 2011), pp. 37-57, Presses universitaires de Louvain (2013)Google Scholar
  11. 11.
    Tetreault, J., Blanchard, D., Cahill, A.: A report on the first native language identification shared task. In: Proceedings of the Eighth Workshop on Innovative Use of NLP for Building Educational Applications, pp. 48-57 (2013)Google Scholar
  12. 12.
    Jarvis, S., Bestgen, Y., Pepper, S.: Maximizing classification accuracy in native language identification. NAACL/HLT 2013, 111–118 (2013)Google Scholar
  13. 13.
    Koppel, M., Schler, J., Zigdon, K.: Determining an author’s native language by mining a text for errors. In Proceedings of the Eleventh ACM SIGKDD International Conference on Knowledge Discovery in Data Mining, pp. 624-628, ACM (2005)Google Scholar
  14. 14.
    Kochmar, E.: Identification of a writer’s native language by error analysis. Master’s thesis. University of Cambridge (2011)Google Scholar
  15. 15.
    Bestgen, Y., Granger, S., Thewissen, J.: Error patterns and automatic L1 identification. In: Approaching Language Transfer Through Text Classification, pp. 127-153 (2012)Google Scholar
  16. 16.
    Flanagan, B., Yin, C., Hirokawa, S., Hashimoto, K., Tabata, Y.: An automated method to generate e-learning quizzes from online language learner writing. Int. J Distance Educ. Technol. 11(4), 63–80 (2013)CrossRefGoogle Scholar

Copyright information

© Springer International Publishing Switzerland 2015

Authors and Affiliations

  • Brendan Flanagan
    • 1
    Email author
  • Chengjiu Yin
    • 2
  • Takahiko Suzuki
    • 3
  • Sachio Hirokawa
    • 3
  1. 1.Graduate School of Information Science and Electrical EngineeringKyushu UniversityFukuokaJapan
  2. 2.Faculty of Arts and ScienceKyushu UniversityFukuokaJapan
  3. 3.Research Institute for Information TechnologyKyushu UniversityFukuokaJapan

Personalised recommendations