Abstract
Annotating sentences is essential for exploiting the different features of Arabic corpora, and such annotation depends on a robust analyzer. In this paper, we present an improvement of our previous analyzer. We first describe the previous analyzer, identifying its advantages and its gaps. We then choose an improvement method inspired by the earlier approach. Finally, we outline the implementation and experimentation of our new cascade of transducers on the NooJ platform. The obtained results appear satisfactory.
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Ghezaiel Hammouda, N., Haddar, K. (2019). Improvement of Arabic NooJ Parser with Disambiguation Rules. In: Mirto, I., Monteleone, M., Silberztein, M. (eds) Formalizing Natural Languages with NooJ 2018 and Its Natural Language Processing Applications. NooJ 2018. Communications in Computer and Information Science, vol 987. Springer, Cham. https://doi.org/10.1007/978-3-030-10868-7_18
DOI: https://doi.org/10.1007/978-3-030-10868-7_18
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-10867-0
Online ISBN: 978-3-030-10868-7
eBook Packages: Computer Science, Computer Science (R0)