Improvement of Arabic NooJ Parser with Disambiguation Rules

  • Nadia Ghezaiel Hammouda
  • Kais Haddar
Conference paper
Part of the Communications in Computer and Information Science book series (CCIS, volume 987)


Annotating sentences is important for exploiting the different features of Arabic corpora, and such annotation depends on a robust analyzer. In this paper, we therefore present an improvement of our previous analyzer. We first describe the previous analyzer, with its advantages and shortcomings. We then choose an improvement method inspired by the previous one. Finally, we outline the implementation and experimentation of our new cascade of transducers in the NooJ platform. The obtained results appear satisfactory.


Keywords: Arabic analyzer, Disambiguation rules, Disambiguation process, Cascade of transducers, NooJ platform



Copyright information

© Springer Nature Switzerland AG 2019

Authors and Affiliations

  1. Miracl Laboratory, Higher Institute of Computer and Communication Technologies of Hammam Sousse, Sousse, Tunisia
  2. Miracl Laboratory, Faculty of Sciences of Sfax, University of Sfax, Sfax, Tunisia
