Can Text Summaries Help Predict Ratings? A Case Study of Movie Reviews

Saggion, Horacio; Lloret, Elena; Palomar, Manuel

doi:10.1007/978-3-642-31178-9_33

Horacio Saggion¹⁹,
Elena Lloret²⁰ &
Manuel Palomar²⁰

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 7337))

Included in the following conference series:

International Conference on Application of Natural Language to Information Systems

2298 Accesses
4 Citations

Abstract

This paper presents an analysis of the rating inference task – the task of correctly predicting the rating associated to a review, in the context of movie reviews. For achieving this objective, we study the use of automatic text summaries instead of the full reviews. An extrinsic evaluation framework is proposed, where full reviews and different types of summaries (positional, generic and sentiment-based) of several compression rates (from 10% to 50%) are evaluated. We are facing a difficult task; however, the results obtained are very promising and demonstrate that summaries are appropriate for the rating inference problem, performing at least equally to the full reviews when summaries are at least 30% compression rate. Moreover, we also find out that the way the review is organised, as well as the style of writing, strongly determines the performance of the different types of summaries.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Cunningham, H., Maynard, D., Bontcheva, K., Tablan, V.: GATE: A Framework and Graphical Development Environment for Robust NLP Tools and Applications. In: Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics, Philadelphia, USA (2002)
Google Scholar
Goldberg, A.B., Zhu, X.: Seeing stars when there aren’t many stars: graph-based semi-supervised learning for sentiment categorization. In: Proceedings of the 1st Workshop on Graph Based Methods for NLP, pp. 45–52 (2006)
Google Scholar
Li, Y., Bontcheva, K., Cunningham, H.: Adapting SVM for Data Sparseness and Imbalance: A Case Study in Information Extraction. Natural Language Engineering 15(2), 241–271 (2009)
Article Google Scholar
Lloret, E.: Text Summarisation based on Human Language Technologies and its Applications. Ph.D. thesis, University of Alicante (2011)
Google Scholar
Lloret, E., Saggion, H., Palomar, M.: Experiments on Summary-based Opinion Classification. In: Proceedings of the Workshop on Computational Approaches to Analysis and Generation of Emotion in Text, pp. 107–115 (2010)
Google Scholar
Pang, B., Lee, L.: Seeing Stars: Exploiting Class Relationships for Sentiment Categorization with Respect to Rating Scales. In: Proceedings of the Association of Computational Linguistics, pp. 115–124 (2005)
Google Scholar
Pang, B., Lee, L.: Opinion Mining and Sentiment Analysis. Foundations and Trends in Information Retrieval 2(1-2), 1–135 (2008)
Article Google Scholar
Qu, L., Ifrim, G., Weikum, G.: The bag-of-opinions method for review rating prediction from sparse text patterns. In: Proceedings of the 23rd International Conference on Computational Linguistics, pp. 913–921 (2010)
Google Scholar
Saggion, H., Funk, A.: Extracting Opinions and Facts for Business Intelligence. RNTI E-17, 119–146 (2009)
Google Scholar
Saggion, H., Lloret, E., Palomar, M.: Using Text Summaries for Predicting Rating Scales. In: Proceedings of the 1st Workshop on Computational Approaches to Subjectivity and Sentiment Analysis (2010)
Google Scholar
Spärck Jones, K.: Automatic Summarising: The State of the Art. Information Processing & Management 43(6), 1449–1481 (2007)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Department of Information and Communication Technologies, Universitat Pompeu Fabra, C/ Tanger 122, 08018, Barcelona, Spain
Horacio Saggion
Department of Software and Computing Systems, University of Alicante, Apdo. de correos, 99, E-03080, Alicante, Spain
Elena Lloret & Manuel Palomar

Authors

Horacio Saggion
View author publications
You can also search for this author in PubMed Google Scholar
Elena Lloret
View author publications
You can also search for this author in PubMed Google Scholar
Manuel Palomar
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Information Science Department, University of Groningen, Oude Kijk in ’t Jatstraat 26, 9712 EK, Groningen, The Netherlands
Gosse Bouma
Faculty of Economics and Business, University of Groningen, Nettelbosje 2, 9747 AE, Groningen, The Netherlands
Ashwin Ittoo & Hans Wortmann &
CNAM-Laboratoire Cédric, 292 rue St. Martin, 75141, Paris Cedex 03, France
Elisabeth Métais

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Saggion, H., Lloret, E., Palomar, M. (2012). Can Text Summaries Help Predict Ratings? A Case Study of Movie Reviews. In: Bouma, G., Ittoo, A., Métais, E., Wortmann, H. (eds) Natural Language Processing and Information Systems. NLDB 2012. Lecture Notes in Computer Science, vol 7337. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-31178-9_33

Download citation

DOI: https://doi.org/10.1007/978-3-642-31178-9_33
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-31177-2
Online ISBN: 978-3-642-31178-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics