Abstract
This paper presents a survey of Arabic treebanks to facilitate their reuse for the building of new linguistic resources. In our case, we created from a treebank an automatically induced Property Grammar (GP). So, we discussed characteristics of these treebanks to choose the appropriate one. To build our resource, we adopted an automatic technique, acquiring first a context-free grammar (CFG) from the chosen treebank, and second, inducing a GP by generating relations between grammatical units described in the CFG.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Blache, P.: Les Grammaires de Propriétés: Des contraintes pour le traitement automatique des langues naturelles. Hermès Sciences Publications (2001)
Diab, M.T., Habash, N., Rambow, O., Roth, R.: LDC Arabic Treebanks and Associated Corpora: Data Divisions Manual. Columbia University. Technical Report, Center for Computational Learning Systems (2013)
Dukes, K., Buckwalter, T.: A Dependency Treebank of the Quran using traditional Arabic grammar. Institute of Electrical and Electronics Engineers (2010)
Habash, N., Faraj, R., Roth, R.: Syntactic Annotation in the Columbia Arabic Treebank. In: Conference on Arabic Language Resources and Tools, Cairo, Egypt (2009)
Hajič, J., Smrž, O., Zemánek, P., Snaidauf, J., Beska, E.: Prague Arabic Dependency Treebank: Development in Data and Tools. In: Proceedings of the NEMLAR International Conference on Arabic Language Resources and Tools (2004)
Maamouri, M., Bies, A., Buckwalter, T.: The Penn Arabic Treebank: Building a Large-scale Annotated Arabic Corpus. In: Proceedings of the Network for Euro-Mediterranean Language Resources Conference on Arabic Language Resources, Cairo, Egypt (2004)
Maamouri, M., Bies, A., Krouna, S., Gaddeche, F., Bouziri, B.: Penn Arabic Treebank guidelines v4.8. Technical report, LDC, University of Pennsylvania (2009)
Smrž, O., Bielický, V., Kouřilová, I., Kráčmar, J., Hajič, J., Zemánek, P.: Prague Arabic Dependency Treebank: A Word on the Million Words. In: Proceedings of the Workshop on Arabic and Local Languages (LREC 2008), Marrakech, Morocco, pp. 16–23 (2008)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Bahloul, R.B., Elkarwi, M., Haddar, K., Blache, P. (2014). Building an Arabic Linguistic Resource from a Treebank: The Case of Property Grammar. In: Sojka, P., Horák, A., Kopeček, I., Pala, K. (eds) Text, Speech and Dialogue. TSD 2014. Lecture Notes in Computer Science(), vol 8655. Springer, Cham. https://doi.org/10.1007/978-3-319-10816-2_30
Download citation
DOI: https://doi.org/10.1007/978-3-319-10816-2_30
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-10815-5
Online ISBN: 978-3-319-10816-2
eBook Packages: Computer ScienceComputer Science (R0)