Abstract
In this paper, we review the literature on named entity recognition in Turkish and point to related open research problems. Unlike well-studied languages such as English and Spanish, Turkish is an agglutinative, morphologically rich, and non-configurational language with limited language processing resources, tools, data sets, and guidelines. Hence, we believe that this paper will serve as a significant reference for computational linguists working on Turkish, for those working on languages with structural characteristics similar to Turkish, and for those working on resource-scarce languages.
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Küçük, D., Arıcı, N., Küçük, D. (2017). Named Entity Recognition in Turkish: Approaches and Issues. In: Frasincar, F., Ittoo, A., Nguyen, L., Métais, E. (eds) Natural Language Processing and Information Systems. NLDB 2017. Lecture Notes in Computer Science, vol 10260. Springer, Cham. https://doi.org/10.1007/978-3-319-59569-6_20
DOI: https://doi.org/10.1007/978-3-319-59569-6_20
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-59568-9
Online ISBN: 978-3-319-59569-6
eBook Packages: Computer Science, Computer Science (R0)