Skip to main content

A Supervised Approach to Arabic Text Summarization Using AdaBoost

  • Conference paper

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 353))

Abstract

In recent years, research in text summarization has become very active for many languages. Unfortunately, looking at the effort devoted to Arabic text summarization, we find much fewer attention paid to it. This paper presents a Machine Learning-based approach to Arabic text summarization which uses AdaBoost. This technique is employed to predict whether a new sentence is likely to be included in the summary or not. In order to evaluate the approach, we have used a corpus of Arabic articles. This approach was compared against other Machine Learning approaches and the results obtained show that the approach we suggest using AdaBoost outperforms other existing approaches.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   369.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Abdel Fattah, M., Ren, F.: Probabilistic neural network based text summarization. In: International Conference of Natural Language Processing and Knowledge Engineering, NLP-KE 2008, pp. 1–6. IEEE (2008)

    Google Scholar 

  2. Azmi, A., Al-thanyyan, S.: Ikhtasir—a user selected compression ratio arabic text summarization system. In: Proceeding of International Conference of Natural Language Processing and Knowledge Engineering (NLP-KE 2009), pp. 1–7 (2009)

    Google Scholar 

  3. Bauer, E., Kohavi, R.: An empirical comparison of voting classification algorithms: Bagging, boosting, and variants. Machine Learning 36(1-2), 105–139 (1999)

    Article  Google Scholar 

  4. Douzidia, F.S., Lapalme, G.: Lakhas, an arabic summarization system. In: Proceedings of DUC 2004(2004)

    Google Scholar 

  5. El-Haj, M., Kruschwitz, U., Fox, C.: Multi-document arabic text summarisation. In: 2011 3rd Computer Science and Electronic Engineering Conference (CEEC), pp. 40–44. IEEE (2011)

    Google Scholar 

  6. Freund, Y., Schapire, R.E., et al.: Experiments with a new boosting algorithm. In: ICML, vol. 96, pp. 148–156 (1996)

    Google Scholar 

  7. Keskes, I., Boudabous, M.M., Maaloul, M.H., Belguith, L.H.: Étude comparative entre trois approches de résumé automatique de documents arabes. In: Proceedings of the Joint Conference JEP-TALN-RECITAL 2012, Grenoble, France, vol. 2, pp. 225–238. ATALA/AFCP (2012)

    Google Scholar 

  8. Lin, C.-Y.: Rouge: A package for automatic evaluation of summaries. In: Text Summarization Branches Out: Proceedings of the ACL-2004 Workshop, pp. 74–81 (2004)

    Google Scholar 

  9. Lloret, E., Palomar, M.: Text summarisation in progress: a literature review. Artificial Intelligence Review 37(1), 1–41 (2012)

    Article  Google Scholar 

  10. Louis, A., Nenkova, A.: Automatically assessing machine summary content without a gold standard. Computational Linguistics 39(2), 267–300 (2013)

    Article  Google Scholar 

  11. Nenkova, A., McKeown, K.: Automatic summarization. Foundations and Trends in Information Retrieval 5(2-3), 103–233 (2011)

    Article  Google Scholar 

  12. Saggion, H., Poibeau, T.: Automatic text summarization: Past, present and future. In: Multi-source, Multilingual Information Extraction and Summarization, pp. 3–21. Springer (2013)

    Google Scholar 

  13. Schapire, R.E.: The strength of weak learnability. Machine Learning 5(2), 197–227 (1990)

    Google Scholar 

  14. Jones, K.S.: Automatic summarising: The state of the art. Information Processing & Management 43(6), 1449–1481 (2007)

    Article  Google Scholar 

  15. Vapnik, V.N.: The nature of statistical learning theory. statistics for engineering and information science. Springer-Verlag, New York (2000)

    Book  MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Riadh Belkebir .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Belkebir, R., Guessoum, A. (2015). A Supervised Approach to Arabic Text Summarization Using AdaBoost. In: Rocha, A., Correia, A., Costanzo, S., Reis, L. (eds) New Contributions in Information Systems and Technologies. Advances in Intelligent Systems and Computing, vol 353. Springer, Cham. https://doi.org/10.1007/978-3-319-16486-1_23

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-16486-1_23

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-16485-4

  • Online ISBN: 978-3-319-16486-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics