Skip to main content

Quality Features for Summarizing Text Forum Threads by Selecting Quality Replies

  • Conference paper
  • First Online:

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 843))

Abstract

Text Forum Threads contain a huge volume of user-generated content derived from the discussion and information exchange among users who have similar interests. Often, some of the replies in a thread are completely off-topic which changes the discussion’s direction. This phenomenon impacts negatively on the user’s desire to continue with the discussion hence, a user might be interested in reading a few selected replies that provide a brief summary of the discussion topic. This paper aims at selecting quality replies about a topic raised in the initial-post which provide a short summary. We present a detailed analysis of the text forum threads structure based on a set of quality features for the forum summarization. Moreover, crowdsourcing platforms were used for judging the quality of the replies. Therefore, we have performed a text forum threads summarization based on replies weights and human judgment. TripAdvisor dataset has been used, therefore, the system summary helps the traveler in planning a journey. The experimental results conducted showed that the proposed approach can improve the performance of the text forum threads summarization based on forum quality features and crowdsourcing.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Notes

  1. 1.

    https://www.tripadvisor.com.my/ShowForum-g28953-i4-New_York.html.

  2. 2.

    www.crowdflower.com/.

  3. 3.

    https://www.tripadvisor.com.my/ShowTopic-g60763-i5-k3128263-How_to_get_from_JFK_to_New_Rochelle-New_York_City_New_York.html.

References

  1. Agichtein, E., et al.: Finding high-quality content in social media. In: Proceedings of the 2008 International Conference on Web Search and Data Mining. ACM (2008)

    Google Scholar 

  2. Martínez Carod, N., et al.: Búsqueda de estrategias para la clasificación del contenido en foros técnicos de discusión. In: XIX Workshop de Investigadores en Ciencias de la Computación (WICC 2017, ITBA, Buenos Aires) (2017)

    Google Scholar 

  3. Fan, W.: Effective Search in Online Knowledge Communities: A Genetic Algorithm Approach. Virginia Polytechnic Institute and State University, Virginia (2009)

    Google Scholar 

  4. Krishnamani, J., Zhao, Y., Sunderraman, R.: Forum summarization using topic models and content-metadata sensitive clustering. In: 2013 IEEE/WIC/ACM International Joint Conferences on Web Intelligence (WI) and Intelligent Agent Technologies (IAT). IEEE (2013)

    Google Scholar 

  5. Tigelaar, A.S.: Automatic discussion summarization: a study of Internet fora (2008)

    Google Scholar 

  6. Ren, Z., et al.: Summarizing web forum threads based on a latent topic propagation process. In: Proceedings of the 20th ACM International Conference on Information and Knowledge Management. ACM (2011)

    Google Scholar 

  7. Almahy, I., Salim, N.: Web discussion summarization: study review. In: Proceedings of the First International Conference on Advanced Data and Information Engineering (DaEng-2013). Springer (2014)

    Google Scholar 

  8. Bhatia, S., Biyani, P., Mitra, P.: Summarizing online forum discussions-can dialog acts of individual messages help? In: EMNLP (2014)

    Google Scholar 

  9. Grozin, V.A., Gusarova, N.F., Dobrenko, N.V.: Feature selection for language independent text forum summarization. In: International Conference on Knowledge Engineering and the Semantic Web. Springer (2015)

    Google Scholar 

  10. Verberne, S., et al.: Automatic summarization of domain-specific forum threads: collecting reference data. In: Proceedings of the 2017 Conference on Conference Human Information Interaction and Retrieval. ACM (2017)

    Google Scholar 

  11. Almahy, I., et al.: Discussion summarization based on Crossdocument relation using model selection technique. In: Advances in Neural Networks, Fuzzy Systems and Artificial Intelligence, pp. 218–229 (2014)

    Google Scholar 

  12. Kabadjov, M.A., et al.: The OnForumS corpus from the Shared Task on Online Forum Summarisation at MultiLing 2015. In: LREC (2016)

    Google Scholar 

  13. Buraya, K., et al.: Mining of relevant and informative posts from text forums

    Google Scholar 

  14. Grozin, V., Buraya, K., Gusarova, N.: Comparison of text forum summarization depending on query type for text forums. In: Advances in Machine Learning and Signal Processing, pp. 269–279. Springer (2016)

    Google Scholar 

  15. Altantawy, M., Rafea, A., Aly, S.: Summarizing online discussions by filtering posts. In: IEEE International Conference on Information Reuse and Integration, IRI 2009. IEEE (2009)

    Google Scholar 

  16. Tigelaar, A.S., Den Akker, R.O., Hiemstra, D.: Automatic summarisation of discussion fora. Nat. Lang. Eng. 16(2), 161–192 (2010)

    Article  Google Scholar 

  17. Farrell, R., Fairweather, P.G., Snyder, K.: Summarization of discussion groups. In: Proceedings of the Tenth International Conference on Information and Knowledge Management. ACM (2001)

    Google Scholar 

  18. Hu, M., Sun, A., Lim, E.-P.: Comments-oriented blog summarization by sentence extraction. In: Proceedings of the Sixteenth ACM Conference on Information and Knowledge Management. ACM (2007)

    Google Scholar 

  19. Ying, D., Jiang., J.: Towards Opinion Summarization from Online Forums. ACL, Stroudsburg (2015)

    Google Scholar 

  20. Lloret, E., Plaza, L., Aker, A.: Analyzing the capabilities of crowdsourcing services for text summarization. Lang. Resour. Eval. 47(2), 337–369 (2013)

    Article  Google Scholar 

  21. Bhatia, S., Biyani, P., Mitra, P.: Identifying the role of individual user messages in an online discussion and its use in thread retrieval. J. Assoc. Inf. Sci. Technol. 67(2), 276–288 (2016)

    Article  Google Scholar 

  22. Manning, C.D., Raghavan, E.-P., Schütze, H.: Introduction to Information Retrieval, vol. 1. Cambridge University Press, Cambridge (2008)

    Book  Google Scholar 

  23. Osman, A., Salim, N., Saeed, F.: Quality-based text web forum summarization-a review. Int. J. Soft Comput. 12(1), 31–44 (2017)

    Google Scholar 

  24. Hoogeveen, D., et al.: Web forum retrieval and text analytics: a survey. Foundations and trends®. Inf. Retriev. 12(1), 1–163 (2018)

    Google Scholar 

  25. Chai, K.E.K.: A Machine Learning-Based Approach for Automated Quality Assessment of User Generated Content in Web Forums. Curtin University, Digital Ecosystems and Business Intelligence Institute, Perth (2011)

    Google Scholar 

  26. Albaham, A.T., Salim, N., Adekunle, O.I.: Leveraging post level quality indicators in online forum thread retrieval. In: Proceedings of the First International Conference on Advanced Data and Information Engineering (DaEng-2013). Springer (2014)

    Google Scholar 

  27. Biyani, P., et al.: Online thread retrieval using thread structure and query subjectivity. Google Patents (2016)

    Google Scholar 

Download references

Acknowledgment

This work is supported by the Ministry of Higher Education (MOHE) and the Research Management Centre (RMC) at the Universiti Teknologi Malaysia (UTM) under the Research University Grant Category (VOT Q.J130000.2528.16H74).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Akram Osman .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Osman, A., Salim, N., Saeed, F., Abdelhamid, I. (2019). Quality Features for Summarizing Text Forum Threads by Selecting Quality Replies. In: Saeed, F., Gazem, N., Mohammed, F., Busalim, A. (eds) Recent Trends in Data Science and Soft Computing. IRICT 2018. Advances in Intelligent Systems and Computing, vol 843. Springer, Cham. https://doi.org/10.1007/978-3-319-99007-1_5

Download citation

Publish with us

Policies and ethics