Skip to main content

A Coherence Model for Sentence Ordering

  • Conference paper
  • First Online:
Natural Language Processing and Information Systems (NLDB 2019)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 11608))

  • 1487 Accesses

Abstract

Text generation applications such as machine translation and automatic summarization require an additional post-processing step to enhance readability and coherence of output texts. In this work, we identify a set of coherence features from different levels of discourse analysis. Features have either positive or negative input to the output coherence. We propose a new model that combines these features to produce more coherent summaries for our target application: extractive summarization. The model use a genetic algorithm to search for a better ordering of the extracted sentences to form output summaries. Experimentations on two datasets using an automatic coherence assessment measure show promising results.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 59.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 74.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    https://duc.nist.gov/duc2002/.

References

  1. Slakta, D.: L’ordre du texte (The Order of the Text). Etudes de Linguistique Appliquee 19, 30–42 (1975)

    Google Scholar 

  2. Barzilay, R.: The rhetorical parsing of natural language texts. In: Proceedings of the Eighth Conference on European Chapter of the Association for Computational Linguistics. EACL 1997, pp. 96–103 (1997)

    Google Scholar 

  3. Barzilay, R., Lapata, M.: Modeling local coherence: an entity-based approach. In: Proceedings of the 43rd Annual Meeting of the Association for Computational Linguistics. ACL 2005, pp. 25–30 (2005)

    Google Scholar 

  4. Barzilay, R., Elhadad, N., McKeown, K.R.: Inferring strategies for sentence ordering in multidocument news summarization. J. Artif. Intell. Res. 1(17), 35–55 (2002)

    Article  Google Scholar 

  5. Knight, K., Marcu, D.: Summarization beyond sentence extraction: a probabilistic approach to sentence compression. J. Artif. Intell. 139(1), 91–107 (2002)

    Article  Google Scholar 

  6. Molina, A., Torres-Moreno, J.-M., SanJuan, E., da Cunha, I., Sierra Martínez, G.E.: Discursive sentence compression. In: Gelbukh, A. (ed.) CICLing 2013. LNCS, vol. 7817, pp. 394–407. Springer, Heidelberg (2013). https://doi.org/10.1007/978-3-642-37256-8_33

    Chapter  Google Scholar 

  7. Christensen, J., Soderland, S., Etzioni, O.: Towards coherent multi-document summarization. In: Proceedings of the 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. HLT-NAACL 2013, pp. 1163–1173 (2013)

    Google Scholar 

  8. Li, J., Hovy, E.: A model of coherence based on distributed sentence representation. In: Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing. EMNLP 2014, pp. 2039–2048 (2014)

    Google Scholar 

  9. Nguyen, D. T., Joty, S.: A model of coherence based on distributed sentence representation. In: Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics. ACL 2017, pp. 1320–1330 (2017)

    Google Scholar 

  10. Charolles, M.: Introduction aux problèmes de la cohérence des textes: Approche théorique et étude des pratiques pédagogiques. Langue française 1(38), 7–41 (1978)

    Article  Google Scholar 

  11. Oufaida, H., Blache, P., Nouali, O.: Using distributed word representations and mrmr discriminant analysis for multilingual text summarization. In: Biemann, C., Handschuh, S., Freitas, A., Meziane, F., Métais, E. (eds.) NLDB 2015. LNCS, vol. 9103, pp. 51–63. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-19581-0_4

    Chapter  Google Scholar 

  12. Razali, N. M., Geraghty, J.: Genetic algorithm performance with different selection strategies in solving TSP. In: Proceedings of the World Congress on Engineering, pp. 1–6 (2011)

    Google Scholar 

  13. Fortin, F., Rainville, D., Gardner, M., Parizeau, M., Gagné, C.: DEAP: evolutionary algorithms made easy. J. Mach. Learn. Res. 13(1), 2171–2175 (2012)

    MathSciNet  MATH  Google Scholar 

  14. Lin, Z., Liu, C., Ng, H. T., Kan, M. Y.: Combining coherence models and machine translation evaluation metrics for summarization evaluation. In: Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Long Papers. ACL 2012, vol. 1, pp. 1006–1014 (2012)

    Google Scholar 

  15. Giannakopoulos, G., et al.: MultiLing 2015: multilingual summarization of single and multi-documents, on-line fora, and call-center conversations. In: Proceedings of the 16th Annual Meeting of the Special Interest Group on Discourse and Dialogue, SIGDIAL 2015, pp. 270–274 (2015)

    Google Scholar 

  16. Peng, H., Long, F., Ding, C.: Feature selection based on mutual information: criteria of max-dependency, max-relevance, and min-redundancy. IEEE Trans. Pattern Anal. Mach. Intell. (8), 1226–1238 (2005)

    Google Scholar 

  17. Muller, P., Tannier, X.: Annotating and measuring temporal relations in texts. In: Proceedings of the 20th International Conference on Computational Linguistics. COLING 2004, p. 50 (2004)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Houda Oufaida .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Oufaida, H., Blache, P., Nouali, O. (2019). A Coherence Model for Sentence Ordering. In: Métais, E., Meziane, F., Vadera, S., Sugumaran, V., Saraee, M. (eds) Natural Language Processing and Information Systems. NLDB 2019. Lecture Notes in Computer Science(), vol 11608. Springer, Cham. https://doi.org/10.1007/978-3-030-23281-8_21

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-23281-8_21

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-23280-1

  • Online ISBN: 978-3-030-23281-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics