Making Sentiment Analysis Algorithms Scalable

Cristani, Marco; Cristani, Matteo; Pesarin, Anna; Tomazzoli, Claudio; Zorzi, Margherita

doi:10.1007/978-3-030-03056-8_12

Marco Cristani¹⁷,
Matteo Cristani¹⁷,
Anna Pesarin¹⁷,
Claudio Tomazzoli¹⁷ &
…
Margherita Zorzi¹⁷

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 11153))

Included in the following conference series:

International Conference on Web Engineering

788 Accesses
4 Citations

Abstract

In this paper we introduce a simplified approach to sentiment analysis: a lexicon-driven method based upon only adjectives and adverbs. This method is compared in cross-validation with other known techniques and then compared directly to the gold standard, a sample of human subjects asked to deliver the same class of judgments computed by the method. We prove that the method is similar in accuracy and precision with the other methods. We finally argue that the approach we employ is more valid than others for it is scalable, and exportable to languages other than English.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
OpenNLP package (http://opennlp.sourceforge.net).
2.
http://sentiwordnet.isti.cnr.it/.

References

Baccianella, S., Esuli, A., Sebastiani, F.: Sentiwordnet 3.0: an enhanced lexical resource for sentiment analysis and opinion mining. In: Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC 2010). European Language Resources Association (ELRA) (2010)
Google Scholar
Balahur, A., Turchi, M.: Multilingual sentiment analysis using machine translation? In: Proceedings of the 3rd Workshop in Computational Approaches to Subjectivity and Sentiment Analysis, WASSA 2012, pp. 52–60 (2012)
Google Scholar
Benamara, F., Cesarano, C., Picariello, A., Reforgiato, D., Subrahmanian, V.S.: Sentiment analysis: adjectives and adverbs are better than adjectives alone. In: Proceedings of the International Conference on Weblogs and Social Media (ICWSM) (2007)
Google Scholar
Combi, C., Zorzi, M., Pozzani, G., Arzenton, E., Moretti, U.: Normalizing spontaneous reports into MedDRA: some experiments with MagiCoder. IEEE J. Biomed. Health Inform. 1–8 (2018). https://doi.org/10.1109/JBHI.2018.2861213
Combi, C., Zorzi, M., Pozzani, G., Moretti, U., Arzenton, E.: From narrative descriptions to MedDRA: automagically encoding adverse drug reactions. J. Biomed. Inform. 84, 184–199 (2018)
Article Google Scholar
Cristani, M., Bertolaso, A., Scannapieco, S., Tomazzoli, C.: Future paradigms of automated processing of business documents. Int. J. Inf. Manag. 40, 67–75 (2018)
Article Google Scholar
Cristani, M., Olivieri, F., Tomazzoli, C.: Viral experiments. In: 3rd International Workshop on Knowledge Discovery on the WEB, KDWeb 2017, vol. 1959. CEUR Workshop Proceedings, pp. 27–35 (2017)
Google Scholar
Di Caro, L., Grella, M.: Sentiment analysis via dependency parsing. Comput. Stand. Interfaces 35(5), 442–453 (2013)
Article Google Scholar
Duda, R.O., Hart, P.E., Stork, D.G.: Pattern Classification. Wiley, New York (2001)
MATH Google Scholar
Esuli, A., Sebastiani, F.: Sentiwordnet: a publicly available lexical resource for opinion mining. In: Proceedings of the 5th Conference on Language Resources and Evaluation, LREC 2006, pp. 417–422 (2006)
Google Scholar
Joachims, T.: Making large-scale SVM learning practical. In: Advances in Kernel Methods - Support Vector Learning. MIT Press (1999)
Google Scholar
Liu, B.: Sentiment analysis and opinion mining (2012)
Google Scholar
Lyu, K., Kim, H.: Sentiment analysis using word polarity of social media. Wirel. Pers. Commun. 89(3), 941–958 (2016)
Article Google Scholar
Miller, G.A.: Wordnet: a lexical database for English. Commun. ACM 38(11), 39–41 (1995)
Article Google Scholar
Pang, B., Lee, L., Vaithyanathan, S.: Thumbs up? Sentiment classification using machine learning techniques. In: Proceedings of the ACL02 Conference on Empirical Methods in Natural Language Processing, EMNLP 2002, vol. 10, pp. 79–86. Association for Computational Linguistics (2002)
Google Scholar
Ramanathan, N., Bing, L., Alok, C.: Sentiment analysis of conditional sentences. In: Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 1, EMNLP 2009, vol. 1, pp. 180–189 (2009)
Google Scholar
Socher, R., Huval, B., Manning, C.D., Ng, A.Y.: Semantic compositionality through recursive matrix-vector spaces. In: Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, EMNLP-CoNLL 2012, pp. 1201–1211 (2012)
Google Scholar
Taboada, M., Brooke, J., Tofiloski, M., Voll, K., Stede, M.: Lexicon-basedmethods for sentiment analysis. Comput. Linguist. 37(2), 267–307 (2011)
Article Google Scholar
Turney, P.D., Littman, M.L.: Measuring praise and criticism: inference of semantic orientation from association. ACM Trans. Inf. Syst. 21(4), 315–346 (2003)
Article Google Scholar
Vapnik, V.: The Nature of Statistical Learning Theory. Springer, New York (1995). https://doi.org/10.1007/978-1-4757-3264-1
Book MATH Google Scholar
Williams, G.K., Anand, S.S.: Predicting the polarity strength of adjectives using wordnet. In: ICWSM. The AAAI Press (2009)
Google Scholar
Zorzi, M., Combi, C., Lora, R., Pagliarini, M., Moretti, U.: Automagically encoding adverse drug reactions in MedDRA. In: Proceedings of the 2015 IEEE International Conference on Healthcare Informatics, ICHI 2015, pp. 90–99 (2015)
Google Scholar
Zorzi, M., Combi, C., Pozzani, G., Arzenton, E., Moretti, U.: A co-occurrence based MedDRA terminology generation: some preliminary results. In: ten Teije, A., Popow, C., Holmes, J.H., Sacchi, L. (eds.) AIME 2017. LNCS (LNAI), vol. 10259, pp. 215–220. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-59758-4_24
Chapter Google Scholar
Zorzi, M., Combi, C., Pozzani, G., Moretti, U.: Mapping free text into MedDRA by natural language processing: a modular approach in designing and evaluating software extensions. In: Proceedings of the 8th ACM International Conference on Bioinformatics, Computational Biology, and Health Informatics, ACM-BCB 2017, pp. 27–35 (2017)
Google Scholar

Download references

Author information

Authors and Affiliations

University of Verona, Verona, Italy
Marco Cristani, Matteo Cristani, Anna Pesarin, Claudio Tomazzoli & Margherita Zorzi

Authors

Marco Cristani
View author publications
You can also search for this author in PubMed Google Scholar
Matteo Cristani
View author publications
You can also search for this author in PubMed Google Scholar
Anna Pesarin
View author publications
You can also search for this author in PubMed Google Scholar
Claudio Tomazzoli
View author publications
You can also search for this author in PubMed Google Scholar
Margherita Zorzi
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Marco Cristani .

Editor information

Editors and Affiliations

Faculty of Informatics, University of Lugano, Lugano, Switzerland
Cesare Pautasso
Quercus Software Engineering Group, Universidad de Extremadura, Cáceres, Spain
Fernando Sánchez-Figueroa
Tampere University of Technology, Tampere, Finland
Kari Systä
Escuela Politécnica, Universidad de Extremadura, Cáceres, Spain
Juan Manuel Murillo Rodríguez

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Cristani, M., Cristani, M., Pesarin, A., Tomazzoli, C., Zorzi, M. (2018). Making Sentiment Analysis Algorithms Scalable. In: Pautasso, C., Sánchez-Figueroa, F., Systä, K., Murillo Rodríguez, J. (eds) Current Trends in Web Engineering. ICWE 2018. Lecture Notes in Computer Science(), vol 11153. Springer, Cham. https://doi.org/10.1007/978-3-030-03056-8_12

Download citation

DOI: https://doi.org/10.1007/978-3-030-03056-8_12
Published: 29 November 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-03055-1
Online ISBN: 978-3-030-03056-8
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics