Mining Worse and Better Opinions

Unsupervised and Agnostic Aggregation of Online Reviews
  • Michela FazzolariEmail author
  • Marinella Petrocchi
  • Alessandro Tommasi
  • Cesare Zavattari
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 10360)


In this paper, we propose a novel approach for aggregating online reviews, according to the opinions they express. Our methodology is unsupervised, due to the fact that it does not rely on pre-labeled reviews, and it is agnostic, since it does not make any assumption about the domain or the language of the review content. We measure the adherence of a review content to the domain terminology extracted from a review set. First, we demonstrate the informativeness of the adherence metric with respect to the score associated with a review. Then, we exploit the metric values to group reviews, according to the opinions they express. Our experimental campaign has been carried out on two large datasets collected from Booking and Amazon, respectively.


Social web mining Online reviews aggregation Adherence metric Domain terminology Contrastive approach 



Research partly supported by MSCA-ITN-2015-ETN grant agreement #675320 (European Network of Excellence in Cybersecurity) and by Fondazione Cassa di Risparmio di Lucca, financing the project Reviewland.


  1. 1.
    Basili, R., et al.: A contrastive approach to term extraction. In: Terminologie et intelligence artificielle, pp. 119–128. Rencontres (2001)Google Scholar
  2. 2.
    Bonin, F., et al.: A contrastive approach to multi-word extraction from domain-specific corpora. In: Language Resources and Evaluation, ELRA (2010)Google Scholar
  3. 3.
    Bravo-Marquez, F., et al.: Building a twitter opinion lexicon from automatically-annotated tweets. Knowl.-Based Syst. 108, 65–78 (2016)CrossRefGoogle Scholar
  4. 4.
    Cambria, E., Hussain, A.: Sentic Computing: A Common-Sense-Based Framework for Concept-Level Sentiment Analysis. Springer, Heidelberg (2015)CrossRefGoogle Scholar
  5. 5.
    Cambria, E., et al.: SenticNet 3: a common and common-sense knowledge base for cognition-driven sentiment analysis. In: 28th AAAI, pp. 1515–1521 (2014)Google Scholar
  6. 6.
    Chung, T.M., Nation, P.: Identifying technical vocabulary. System 32(2), 251–263 (2004)CrossRefGoogle Scholar
  7. 7.
    Del Vigna, F., Petrocchi, M., Tommasi, A., Zavattari, C., Tesconi, M.: Semi-supervised knowledge extraction for detection of drugs and their effects. In: Social Informatics I (2016)Google Scholar
  8. 8.
    Esuli, A., Sebastiani, F.: SENTIWORDNET: a publicly available lexical resource for opinion mining. In: Language Resources and Evaluation, pp. 417–422 (2006)Google Scholar
  9. 9.
    Li, G., Liu, F.: Application of a clustering method on sentiment analysis. J. Inf. Sci. 38(2), 127–139 (2012)CrossRefGoogle Scholar
  10. 10.
    Ling Lo, S., et al.: A multilingual semi-supervised approach in deriving Singlish sentic patterns for polarity detection. Knowl.-Based Syst. 105, 236–247 (2016)CrossRefGoogle Scholar
  11. 11.
    Liu, B.: Sentiment Analysis and Opinion Mining. Morgan & Claypool, San Rafael (2012)Google Scholar
  12. 12.
    Ma, B., Yuan, H., Wu, Y.: Exploring performance of clustering methods on document sentiment analysis. Inf. Sci. 43, 54–74 (2015)CrossRefGoogle Scholar
  13. 13.
    McAuley, J., Pandey, R., Leskovec, J.: Inferring networks of substitutable and complementary products. In: 21th KDD, pp. 785–794. ACM (2015)Google Scholar
  14. 14.
    McAuley, J., et al.: Image-based recommendations on styles and substitutes. In: 38th Research and Development in Information Retrieval, pp. 43–52. ACM (2015)Google Scholar
  15. 15.
    Mellinas, J.P., María-Dolores, S.M.M., García, J.J.B.: the unexpected scoring system. Tourism Manage. 49, 72–74 (2015)CrossRefGoogle Scholar
  16. 16.
    Muhammad, A., Wiratunga, N., Lothian, R.: Contextual sentiment analysis for social media genres. Knowl.-Based Syst. 108, 92–101 (2016)CrossRefGoogle Scholar
  17. 17.
    Nagamma, P., et al.: An improved sentiment analysis of online movie reviews based on clustering for box-office prediction. In: Computing, Communication and Automation, pp. 933–937 (2015)Google Scholar
  18. 18.
    Pang, B., Lee, L.: Opinion mining and sentiment analysis. Found. Trends Inf. Retr. 2(1–2), 1–135 (2008)CrossRefGoogle Scholar
  19. 19.
    Pazienza, M.T., Zanzotto, F.M.: Terminology extraction: an analysis of linguistic and statistical approaches. In: Sirmakessis, S. (ed.) Knowledge Mining. Studies in Fuzziness and Soft Computing, vol. 185, pp. 255–279. Springer, Heidelberg (2005). doi: 10.1007/3-540-32394-5_20 CrossRefGoogle Scholar
  20. 20.
    Peñas, A., Verdejo, F., Gonzalo, J.: Corpus-based terminology extraction applied to information access. Corpus Linguist. 13, 458–465 (2001)Google Scholar
  21. 21.
    Ren, Y., Zhang, Y., Zhang, M., Ji, D.: Context-sensitive Twitter sentiment classification using neural network. In: Artificial Intelligence, pp. 215–221. AAAI (2016)Google Scholar
  22. 22.
    Reviewland Project: Additional material associated to ICWE 2017 submission (2017). Accessed 15 Mar 2017
  23. 23.
    Salton, G., Buckley, C.: Term-weighting approaches in automatic text retrieval. Inf. Process. Manage. 24(5), 513–523 (1988)CrossRefGoogle Scholar
  24. 24.
    Turney, P.D.: Thumbs up or thumbs down?: semantic orientation applied to unsupervised classification of reviews. In: Computational Linguistics Meeting, pp. 417–424. ACL (2002)Google Scholar
  25. 25.
    Wilson, T., et al.: OpinionFinder: a system for subjectivity analysis. In: HLT/EMNLP on Interactive Demonstrations, pp. 34–35. ACL (2005)Google Scholar
  26. 26.
    Wilson, T., et al.: Recognizing contextual polarity in phrase-level sentiment analysis. In: HLT/EMNLP, pp. 347–354. ACL (2005)Google Scholar
  27. 27.
    Wilson, T., et al.: Recognizing contextual polarity: an exploration of features for phrase-level sentiment analysis. Comput. Linguist. 35(3), 399–433 (2009)CrossRefGoogle Scholar

Copyright information

© Springer International Publishing AG 2017

Authors and Affiliations

  1. 1.Institute of Informatics and Telematics (IIT-CNR)PisaItaly
  2. 2.LUCENSE SCaRLLuccaItaly

Personalised recommendations