Abstract
Twitter is an ocean of diverse topics while sentiment classifier is always limited to a specific domain or topic. Twitter lacks data labeling and a mechanism to acquire sentiment labels. Sentiment words extracted from the twitter are generalized. It is important to find the correct sentiment from a tweet; otherwise, it may generate different sentiment than the desired. Sentiment analysis work done up to now has limitation, i.e., it is based on predefined lexicons. Sentiment of a word based on this lexicon is not generalized. State of art of our work suggests a solution that will make sentiment analysis based on the sentence sentiments and not just only based on predefined lexicons. In this paper, we propose a framework to analyze sentiment from the sentence, by applying independent component analysis (ICA) in coordination with probabilistic latent semantic analysis. We view pLSA as word categorization technique based on some topics, wherein a given corpus is split among different topics. We further utilize these topics for tagging sentiment with the help of ICA, in this way, we are able to assign more accurate sentiment of a sentence than the existing approaches. The proposed work is efficient as its outcome is more precise and accurate than the lexicons-based sentiment analysis. With adequate unsupervised machine learning training, accurate outcomes with a normal precision rate of 77.98% are accomplished.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Pang B, Lee L (2008) Opinion mining and sentiment analysis. Found Trends Inf Retr 2(1–2):1–135
Spencer J, Uchyigit G (2012) Sentimentor: sentiment analysis of twitter data. CEUR Workshop Proc 917:56–66. https://doi.org/10.1007/978-3-642-35176-1_32
Mejova YA (2012) Sentiment analysis within and across social media streams, vol 190
Abbasi A, Chen H, Salem A (2008) Sentiment analysis in multiple languages: feature selection for opinion classification in web forums. ACM Trans Inf Syst 26(3):1–34. https://doi.org/10.1145/1361684.1361685
Medhat W, Hassan A, Korashy H (2014) Sentiment analysis algorithms and applications: a survey. Ain Shams Eng J 5(4):1093–1113. https://doi.org/10.1016/j.asej.2014.04.011
Esuli A, Sebastiani F (2006) SENTIWORDNET: a publicly available lexical resource for opinion mining. In: Proceedings of the 5th conference on language resources and evaluation (LREC’06), Genova, IT, pp 417–422
Düsterhöft A, Thalheim B (2003) Natural language processing and information systems. Natural Lang Process Inf Syst (June 2013) 220–233. https://doi.org/10.1007/978-3-319-41754-7
Cambria E et al (2010) SenticNet: a publicly available semantic resource for opinion mining. In: AAAI fall symposium: commonsense knowledge, vol 10, no 0
Tan, S et al (2009) Adapting naive bayes to domain adaptation for sentiment analysis. Adv Inf Retr 337–349
Blei, DM, Ng AY, Jordan MI (2003) Latent dirichlet allocation. J Mach Learn Res 993–1022
Zafar MB, Bhattacharya P, Ganguly N, Gummadi KP, Ghosh S (2015) Sampling content from online social networks. ACM Trans Web 9(3):1–33. https://doi.org/10.1145/2743023
Najaflou Y, Jedari B, Xia F, Member S, Yang LT, Obaidat MS (2013) Mobile social networks. IEEE Syst J 9(3):1–21
Li X, Li J, Wu Y (2015) A global optimization approach to multi-polarity sentiment analysis. PLoS ONE 10(4):1–18. https://doi.org/10.1371/journal.pone.0124672
Varghese R, Jayasree M (2013) Aspect based sentiment analysis using support vector machine classifier. In: 2013 international conference on advances in computing, communications and informatics (ICACCI), 22–25 Aug 2013, pp 1581–1586. https://doi.org/10.1109/ICACCI.2013.6637416
Mejova Y, Srinivasan P (2011) Exploring feature definition and selection for sentiment classifiers. In: Fifth international AAAI conference on weblogs and social media, pp 546–549
Prager J (2006) Open-domain question-answering. foundations and trends®. Inf Retr 1(2):91–231. https://doi.org/10.1561/1500000001
Martineau Justin, Finin Tim (2009) Delta TFIDF: an improved feature space for sentiment analysis. Icwsm 9:106
Wilson T, Wiebe J, Hwa R (2004) Just how mad are you? Finding strong and weak opinion clauses. In: Proceedings of AAAI, pp 761–769 (Extended version in Comput Intel 22(2):73–99, 2006)
Salton Gerard, Buckley Christopher (1988) Term-weighting approaches in automatic text retrieval. Inf Process Manag 24(5):513–523
Hofmann, T (1999) Probabilistic latent semantic analysis. In: Proceedings of the fifteenth conference on uncertainty in artificial intelligence. Morgan Kaufmann Publishers Inc
Author information
Authors and Affiliations
Corresponding authors
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Kumar, P., Vardhan, M. (2019). Aspect-Based Sentiment Analysis of Tweets Using Independent Component Analysis (ICA) and Probabilistic Latent Semantic Analysis (pLSA). In: Kolhe, M., Trivedi, M., Tiwari, S., Singh, V. (eds) Advances in Data and Information Sciences . Lecture Notes in Networks and Systems, vol 39. Springer, Singapore. https://doi.org/10.1007/978-981-13-0277-0_1
Download citation
DOI: https://doi.org/10.1007/978-981-13-0277-0_1
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-0276-3
Online ISBN: 978-981-13-0277-0
eBook Packages: EngineeringEngineering (R0)