Abstract
The Georgian Dialect Corpus – GDC (http://mygeorgia.ge/gdc) serves as a source to document and study the regional varieties of the Georgian language. The first steps in terms of the Georgian dialect data collection were taken by Prof. Iost Gippert within his research projects [TITUS, ARMAZI].
The Corpus design strategy on one hand is based on an international corpus linguistics practice and on the other hand on the traditions of the Georgian dialectology and dialectography. The Georgian linguistic and cultural characteristics are being considered in the Corpus design.
The dialect dictionaries are incorporated in the corpus for two reasons: (a) to achieve a high level of representativeness and (b) to use the POS markers of the dictionary lemmas for the morphological annotation of the Corpus. The present paper deals with the practical tasks how these dictionaries complement the dialect lexical fund and how the part of speech markers of the dictionaries are applied in the process of morphological annotation.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Jorbenadze, B.: The Kartvelian Languages and Dialects. Mecniereba, Tbilisi (1991)
Biber, D.: Representativeness in corpus design. Lit. Linguist. Comput. 8, 243–257 (1993)
Leech, G.: The Importance of Reference Corpora, Lancaster University. http://www.uzei.com/modulos/usuariosFtp/conexion/archivos59A.pdf
Sinclair, J.: Preliminary recommendations on corpus typology. In: EAG–TCWG–CTYP/P, Version of May (1996)
Leech, G.: New resources, or just better old ones? The Holy Grail of representativeness. In: Hundt, M., Nesselehauf, N., Biewer, C. (eds.) Corpus Linguistics and the Web, pp. 133–149. Rodopi, Amsterdam (2007)
Kryuchkova, O.U., Goldyn, V.E.: Textual dialect corpuses as a model of traditional rural communication. In: Papers of the International Conference on Computational Linguistics, Dialogue-2008, pp. 268–273, Moscow (2008)
Beridze, M., Nadaraia, D.: Dictionary as a textual component of a corpus (Georgian Dialect Corpus). In: Proceedings of the International Conference “Corpus Linguistics-2011”, pp. 92–97, St. Petersburg (2011)
Beridze, M., Nadaraia, D.: The corpus of georgian dialects. In: Fifth International Conference: NLP, Corpus Linguistics, Corpus Based Grammar Research, Slovakia, Bratislava (2009)
Lortkipanidze L.: Record and reproduction of morphological functions. In: Proceedings of the 5th Tbilisi Symposium on Language, Logic and Computation. ILLC, University of Amsterdam CLLS, Tbilisi State University, pp. 105–111 (2003)
Lortkipanidze L.: Interactive system for compilation of multilingual concordancers. In: Conference abstracts of the 6th International Contrastive Linguistics Conference (ICLC6), Berlin (2010)
Dzotsenidze, Q.: Upper Imeretian Dictionary, Tbilisi (1974)
Lortkipanidze, L.: Software Tools for Morphological Annotation of Corpus. In: Proceedings of the International Conference “Corpus Linguistics – 2011”. St. Petersburg, pp. 243–248. (2011)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Beridze, M., Lortkipanidze, L., Nadaraia, D. (2015). Dialect Dictionaries in the Georgian Dialect Corpus. In: Aher, M., Hole, D., Jeřábek, E., Kupke, C. (eds) Logic, Language, and Computation. TbiLLC 2013. Lecture Notes in Computer Science(), vol 8984. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-46906-4_6
Download citation
DOI: https://doi.org/10.1007/978-3-662-46906-4_6
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-662-46905-7
Online ISBN: 978-3-662-46906-4
eBook Packages: Computer ScienceComputer Science (R0)