Abstract
Text Forum Threads contain a huge volume of user-generated content derived from the discussion and information exchange among users who have similar interests. Often, some of the replies in a thread are completely off-topic which changes the discussion’s direction. This phenomenon impacts negatively on the user’s desire to continue with the discussion hence, a user might be interested in reading a few selected replies that provide a brief summary of the discussion topic. This paper aims at selecting quality replies about a topic raised in the initial-post which provide a short summary. We present a detailed analysis of the text forum threads structure based on a set of quality features for the forum summarization. Moreover, crowdsourcing platforms were used for judging the quality of the replies. Therefore, we have performed a text forum threads summarization based on replies weights and human judgment. TripAdvisor dataset has been used, therefore, the system summary helps the traveler in planning a journey. The experimental results conducted showed that the proposed approach can improve the performance of the text forum threads summarization based on forum quality features and crowdsourcing.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Agichtein, E., et al.: Finding high-quality content in social media. In: Proceedings of the 2008 International Conference on Web Search and Data Mining. ACM (2008)
Martínez Carod, N., et al.: Búsqueda de estrategias para la clasificación del contenido en foros técnicos de discusión. In: XIX Workshop de Investigadores en Ciencias de la Computación (WICC 2017, ITBA, Buenos Aires) (2017)
Fan, W.: Effective Search in Online Knowledge Communities: A Genetic Algorithm Approach. Virginia Polytechnic Institute and State University, Virginia (2009)
Krishnamani, J., Zhao, Y., Sunderraman, R.: Forum summarization using topic models and content-metadata sensitive clustering. In: 2013 IEEE/WIC/ACM International Joint Conferences on Web Intelligence (WI) and Intelligent Agent Technologies (IAT). IEEE (2013)
Tigelaar, A.S.: Automatic discussion summarization: a study of Internet fora (2008)
Ren, Z., et al.: Summarizing web forum threads based on a latent topic propagation process. In: Proceedings of the 20th ACM International Conference on Information and Knowledge Management. ACM (2011)
Almahy, I., Salim, N.: Web discussion summarization: study review. In: Proceedings of the First International Conference on Advanced Data and Information Engineering (DaEng-2013). Springer (2014)
Bhatia, S., Biyani, P., Mitra, P.: Summarizing online forum discussions-can dialog acts of individual messages help? In: EMNLP (2014)
Grozin, V.A., Gusarova, N.F., Dobrenko, N.V.: Feature selection for language independent text forum summarization. In: International Conference on Knowledge Engineering and the Semantic Web. Springer (2015)
Verberne, S., et al.: Automatic summarization of domain-specific forum threads: collecting reference data. In: Proceedings of the 2017 Conference on Conference Human Information Interaction and Retrieval. ACM (2017)
Almahy, I., et al.: Discussion summarization based on Crossdocument relation using model selection technique. In: Advances in Neural Networks, Fuzzy Systems and Artificial Intelligence, pp. 218–229 (2014)
Kabadjov, M.A., et al.: The OnForumS corpus from the Shared Task on Online Forum Summarisation at MultiLing 2015. In: LREC (2016)
Buraya, K., et al.: Mining of relevant and informative posts from text forums
Grozin, V., Buraya, K., Gusarova, N.: Comparison of text forum summarization depending on query type for text forums. In: Advances in Machine Learning and Signal Processing, pp. 269–279. Springer (2016)
Altantawy, M., Rafea, A., Aly, S.: Summarizing online discussions by filtering posts. In: IEEE International Conference on Information Reuse and Integration, IRI 2009. IEEE (2009)
Tigelaar, A.S., Den Akker, R.O., Hiemstra, D.: Automatic summarisation of discussion fora. Nat. Lang. Eng. 16(2), 161–192 (2010)
Farrell, R., Fairweather, P.G., Snyder, K.: Summarization of discussion groups. In: Proceedings of the Tenth International Conference on Information and Knowledge Management. ACM (2001)
Hu, M., Sun, A., Lim, E.-P.: Comments-oriented blog summarization by sentence extraction. In: Proceedings of the Sixteenth ACM Conference on Information and Knowledge Management. ACM (2007)
Ying, D., Jiang., J.: Towards Opinion Summarization from Online Forums. ACL, Stroudsburg (2015)
Lloret, E., Plaza, L., Aker, A.: Analyzing the capabilities of crowdsourcing services for text summarization. Lang. Resour. Eval. 47(2), 337–369 (2013)
Bhatia, S., Biyani, P., Mitra, P.: Identifying the role of individual user messages in an online discussion and its use in thread retrieval. J. Assoc. Inf. Sci. Technol. 67(2), 276–288 (2016)
Manning, C.D., Raghavan, E.-P., Schütze, H.: Introduction to Information Retrieval, vol. 1. Cambridge University Press, Cambridge (2008)
Osman, A., Salim, N., Saeed, F.: Quality-based text web forum summarization-a review. Int. J. Soft Comput. 12(1), 31–44 (2017)
Hoogeveen, D., et al.: Web forum retrieval and text analytics: a survey. Foundations and trends®. Inf. Retriev. 12(1), 1–163 (2018)
Chai, K.E.K.: A Machine Learning-Based Approach for Automated Quality Assessment of User Generated Content in Web Forums. Curtin University, Digital Ecosystems and Business Intelligence Institute, Perth (2011)
Albaham, A.T., Salim, N., Adekunle, O.I.: Leveraging post level quality indicators in online forum thread retrieval. In: Proceedings of the First International Conference on Advanced Data and Information Engineering (DaEng-2013). Springer (2014)
Biyani, P., et al.: Online thread retrieval using thread structure and query subjectivity. Google Patents (2016)
Acknowledgment
This work is supported by the Ministry of Higher Education (MOHE) and the Research Management Centre (RMC) at the Universiti Teknologi Malaysia (UTM) under the Research University Grant Category (VOT Q.J130000.2528.16H74).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Osman, A., Salim, N., Saeed, F., Abdelhamid, I. (2019). Quality Features for Summarizing Text Forum Threads by Selecting Quality Replies. In: Saeed, F., Gazem, N., Mohammed, F., Busalim, A. (eds) Recent Trends in Data Science and Soft Computing. IRICT 2018. Advances in Intelligent Systems and Computing, vol 843. Springer, Cham. https://doi.org/10.1007/978-3-319-99007-1_5
Download citation
DOI: https://doi.org/10.1007/978-3-319-99007-1_5
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-99006-4
Online ISBN: 978-3-319-99007-1
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)