Arabic Name Entity Recognition Using Deep Learning

  • Conference paper
Statistical Language and Speech Processing (SLSP 2018)

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 11171)

Abstract

Many applications that we use daily incorporate Natural Language Processing (NLP), from simple tasks such as automatic text correction to speech recognition. Much research has been done on NLP for English, but Arabic NLP has received far less attention. The purpose of this work is to implement a tagging model for Arabic Named Entity Recognition (NER), an important information extraction task in NLP that serves as a building block for more advanced tasks. We developed a deep learning model that combines a Bidirectional Long Short-Term Memory (BiLSTM) network with a Conditional Random Field (CRF), and added further network layers: word embeddings, character embeddings, and a Convolutional Neural Network (CNN). The hyperparameters were tuned to maximize the F1-score.
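
To make the tuning objective concrete, the following is a minimal sketch of how an F1-score for a tagging model is often computed at the entity level. It is not taken from the paper: the BIO tag scheme, the tag names (`PER`, `LOC`), the exact-match criterion, and the function names `extract_spans` and `entity_f1` are all assumptions made for illustration, and the authors may have scored their model differently.

```python
from typing import List, Set, Tuple

# (entity type, start index, end index exclusive); hypothetical representation
Span = Tuple[str, int, int]


def extract_spans(tags: List[str]) -> Set[Span]:
    """Collect entity spans from one BIO-tagged sentence (assumed scheme)."""
    spans: Set[Span] = set()
    start, label = None, None
    # The trailing "O" is a sentinel that closes a span ending the sentence.
    for i, tag in enumerate(tags + ["O"]):
        # An open span continues only on an I- tag of the same entity type.
        continues = tag.startswith("I-") and label == tag[2:]
        if label is not None and not continues:
            spans.add((label, start, i))
            start, label = None, None
        if tag.startswith("B-"):
            start, label = i, tag[2:]
        # An I- tag with no matching open span is ignored (strict reading).
    return spans


def entity_f1(gold: List[List[str]], pred: List[List[str]]) -> float:
    """Micro-averaged F1 over exactly matching entity spans."""
    tp = n_gold = n_pred = 0
    for g, p in zip(gold, pred):
        gold_spans, pred_spans = extract_spans(g), extract_spans(p)
        tp += len(gold_spans & pred_spans)
        n_gold += len(gold_spans)
        n_pred += len(pred_spans)
    if tp == 0:
        return 0.0
    precision, recall = tp / n_pred, tp / n_gold
    return 2 * precision * recall / (precision + recall)


# Illustrative data only: one sentence with two gold entities, one recovered.
gold = [["B-PER", "I-PER", "O", "B-LOC"]]
pred = [["B-PER", "I-PER", "O", "O"]]
score = entity_f1(gold, pred)
print(f"entity-level F1 = {score:.3f}")
```

Under these assumptions, a hyperparameter search would evaluate each candidate configuration on a held-out set with a function like `entity_f1` and keep the configuration with the highest score.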

Author information

Correspondence to David Awad, Caroline Sabty, Mohamed Elmahdy or Slim Abdennadher.

Copyright information

© 2018 Springer Nature Switzerland AG

About this paper

Cite this paper

Awad, D., Sabty, C., Elmahdy, M., Abdennadher, S. (2018). Arabic Name Entity Recognition Using Deep Learning. In: Dutoit, T., Martín-Vide, C., Pironkov, G. (eds) Statistical Language and Speech Processing. SLSP 2018. Lecture Notes in Computer Science, vol 11171. Springer, Cham. https://doi.org/10.1007/978-3-030-00810-9_10

  • DOI: https://doi.org/10.1007/978-3-030-00810-9_10

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-00809-3

  • Online ISBN: 978-3-030-00810-9

  • eBook Packages: Computer Science, Computer Science (R0)
