Abstract
Automatic aspect identification and clustering are critical tasks for opinion mining/sentiment analysis, as users employ varied terms (explicitly or not) to evaluate objects of interest and their characteristics. In this paper, we focus on aspect clustering methods and present a new approach to group implicit and explicit aspects from online reviews. We evaluate four linguistic methods inspired in the literature and one statistical method (using word embeddings), and also propose a new one, based on varied linguistic knowledge. We test the methods in three commonly used domains and show that the method that we propose significantly outperforms the other methods by a large margin.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
- 2.
- 3.
We only look for coreference, foreignism and diminutive-augmentative relations, because we empirically observed that they were the most accurate ones in this step.
- 4.
References
Abu-Jbara, A., King, B., Diab, M.T., Radev, D.R.: Identifying opinion subgroups in arabic online discussions. In: Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, pp. 829–835 (2013)
Alvarez, M., Lim, S.: A graph modeling of semantic similarity between words. In: Proceedings of the Conference on Semantic Computing, Irvine, United States, pp. 355–362 (2007)
Balage Filho, P.P.: Aspect extraction in sentiment analysis for Portuguese. Ph.D. thesis, University of São Paulo, São Carlos, Brazil (2017)
Chen, Y., Zhao, Y., Qin, B., Liu, T.: Product aspect clustering by incorporating background knowledge for opinion mining. PLOS One 11(8), 1–16 (2016)
Ferreira, J.P., Janssen, M.: Dicionário de Formas Não Adaptadas, 1a edn. Instituto de Linguística Teórica e Computacional (2017)
Fonseca, E., Sesti, V., Antonitsch, A., Vanin, A., Vieira, R.: CORP: Uma abordagem baseada em regras e conhecimento semântico para a resoluão de correferências. Linguamática 9(1), 3–18 (2017). https://doi.org/10.21814/lm.9.1.241. http://linguamatica.com/index.php/linguamatica/article/view/v9n1p1
Fonseca, E.B., Vieira, R., Vanin, A.A.: CORP: coreference resolution for portuguese. In: Proceedings of the 12th International Conference on the Computational Processing of Portuguese, Tomar, Portugal, pp. 9–11 (2016)
García, A., Cuadros, M., Rigau, G., Gaines, S.: V3: unsupervised generation of domain aspect terms for aspect based sentiment analysis. In: Proceedings of the 8th International Workshop on Semantic Evaluation, Dublin, Ireland, pp. 833–837 (2014)
Hartmann, N., Fonseca, E., Shulby, C., Treviso, M., Rodrigues, J., Aluisio, S.: Portuguese word embeddings: evaluating on word analogies and natural language tasks. In: Proceedings of the Symposium in Information and Human Language Technology, Uberlandia, Brazil, pp. 122–131 (2017)
Hughes, T., Ramage, D.: Lexical semantic relatedness with random graph walks. Comput. Linguist. 7(1), 581–589 (2007)
Janssen, M., Ferreira, J.P.: Dicionário de nomes deverbais, 1a edn. Intituto de Linguística Teórica e Computacional (2007)
Lee, L.: Measures of distributional similarity. In: Proceedings of the 37th Annual Meeting of the Association for Computational Linguistics on Computational Linguistics ACL 1999, pp. 25–32. Association for Computational Linguistics, Stroudsburg (1999). https://doi.org/10.3115/1034678.1034693
Liu, B.: Sentiment Analysis and Opinion Mining, 1st edn. Morgan & Claypool Publishers, San Rafael (2012)
Mikolov, T., Chen, K., Corrado, G., Dean, J.: Efficient estimation of word representations in vector space. Computing Research Repository 1301.3781(1) (2013)
Miller, G.A., Beckwith, R., Fellbaum, C., Gross, D., Miller, K.: WordNet: an on-line lexical database. Int. J. Lexicograph. 3, 235–244 (1990)
Oliveira, H.G.: Beyond the automatic construction of a lexical ontology for Portuguese: resources developed in the scope of Onto.PT. In: Proceedings of the Workshop on Tools and Resources for Automatically Processing Portuguese and Spanish, São Carlos, Brazil, pp. 64–68 (2014)
Pang, B., Lee, L., Vaithyanathan, S.: Thumbs up? Sentiment classification using machine learning techniques. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing, Stroudsburg, United States, pp. 79–86 (2002)
Patra, B.G., Mandal, S., Das, D., Bandyopadhyay, S.: Ju\_cse: a conditional random field (CRF) based approach to aspect based sentiment analysis. In: Proceedings of the 8th International Workshop on Semantic Evaluation, Dublin, Ireland, pp. 370–374 (2014)
Pereira, F., Tishby, N., Lee, L.: Distributional clustering of English words. In: Proceedings of the 31st Annual Meeting on Association for Computational Linguistics, Stroudsburg, United States, pp. 183–190 (1993)
Seno, E.R.M.: Um mátodo para fusão automática de sentenas similares em português. Ph.D. thesis, University of São Paulo, São Carlos, Brazil (2010)
Taboada, M.: Sentiment analysis: an overview from linguistics. Ann. Rev. Linguist. 2(1), 325–347 (2016). http://www.annualreviews.org/doi/full/10.1146/annurev-linguistics-011415-040518
Wu, C.W., Liu, C.L.: Ontology-based text summarization for business news articles. In: Proceedings of the 3th International Symposium on Computer Architecture, Honolulu, United States, pp. 389–392 (2003)
Yang, H., Callan, J.: A metric-based framework for automatic taxonomy induction. In: Proceedings of the Joint Conference of the 47th Annual Meeting of the Association for Computational Linguistics and the 4th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, Suntec, Singapore, pp. 271–279 (2009)
Yu, J., Zha, Z., Wang, M., Wang, K., Chua, T.: Domain-assisted product aspect hierarchy generation: towards hierarchical organization of unstructured consumer reviews. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing, Edinburgh, United Kingdom, pp. 140–150 (2011)
Zhai, Z., Liu, B., Xu, H., Jia, P.: Clustering product features for opinion mining. In: Proceedings of the 4th International Conference on Web Search and Data Mining, New York, United States, pp. 347–354 (2011)
Zhang, S., Jia, W., Xia, Y., Meng, Y., Yu, H.: Product features extraction and categorization in Chinese reviews. In: Proceedings of the 6th International Multi-Conference on Computing in the Global Information Technology, Nice, France, pp. 38–42 (2011)
Zhao, L., Li, C.: Ontology based opinion mining for movie reviews. In: Karagiannis, D., Jin, Z. (eds.) KSEM 2009. LNCS (LNAI), vol. 5914, pp. 204–214. Springer, Heidelberg (2009). https://doi.org/10.1007/978-3-642-10488-6_22
Zhou, X., Wan, X., Xiao, J.: Representation learning for aspect category detection in online reviews. In: Proceedings of the 29th AAAI Conference on Artificial Intelligence, Texas, United States, pp. 417–423 (2015)
Acknowledgments
The authors are grateful to FAPESP, CAPES and CNPq for supporting this work.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer Nature Switzerland AG
About this paper
Cite this paper
Vargas, F.A., Pardo, T.A.S. (2018). Aspect Clustering Methods for Sentiment Analysis. In: Villavicencio, A., et al. Computational Processing of the Portuguese Language. PROPOR 2018. Lecture Notes in Computer Science(), vol 11122. Springer, Cham. https://doi.org/10.1007/978-3-319-99722-3_37
Download citation
DOI: https://doi.org/10.1007/978-3-319-99722-3_37
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-99721-6
Online ISBN: 978-3-319-99722-3
eBook Packages: Computer ScienceComputer Science (R0)