Abstract
With the advent of Web 2.0, user-generated content is led to an explosion of data on the Internet. Several platforms such as social networking, microblogging, and picture sharing exist that allow users to express their views on almost any topic. The user views express their emotions and sentiments on products, services, any action by governments, etc. Sentiment analysis allows quantifying popular mood on any product, service or an idea. Twitter is popular microblogging platform, which permits users to express their views in a very concise manner. In this paper, a new framework is crafted which carried out the entire chain of tasks starting with extraction of tweets to presenting the results in multiple formats using an ETL (Extract, Transform, and Load) big data tool called Talend. The framework includes a technique to quantify sentiment in a Twitter stream by normalizing the text and judge the polarity of textual data as positive, negative, or neutral. The technique addresses peculiarities of Twitter communication to enhance accuracy. The technique gives an accuracy of above 84% on standard datasets.
Keywords
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
O’Reilly, T. and Battelle, J., 2004. Opening welcome: State of the internet industry. San Francisco, California, October, 5.
C. Smith, “170 Amazing Twitter Statistics and Facts: Social Media Article,” 2015. http://expandedramblings.com/index.php/march-2013-by-the-numbers-a-few-amazing-twitter-stats/.
Google Search Statistics Internet Live Stats in 1 Second: The Official World Wide Web Anniversary. http://www.internetlivestats.com/one-second/.
A. Sharp, Dispatch from the Denver Debate, 2012. https://blog.twitter.com/2012/dispatch-from-the-denver-debate.
Talend: About Talend. https://www.talend.com/about-us.
Turney, P.D., 2002, July. Thumbs up or thumbs down?: semantic orientation applied to unsupervised classification of reviews. In Proceedings of the 40th annual meeting on association for computational linguistics (pp. 417–424). Association for Computational Linguistics.
Pang, B., Lee, L. and Vaithyanathan, S., 2002, July. Thumbs up?: sentiment classification using machine learning techniques. In Proceedings of the ACL-02 conference on Empirical methods in natural language processing-Volume 10 (pp. 79–86). Association for Computational Linguistics.
Bifet, A. and Frank, E., 2010, October. Sentiment knowledge discovery in twitter streaming data. In Discovery Science (pp. 1–15). Springer Berlin Heidelberg.
Go, A., Bhayani, R. and Huang, L., 2009. Twitter sentiment classification using distant supervision. CS224 N Project Report, Stanford, 1, p. 12.
Agarwal, A., Xie, B., Vovsha, I., Rambow, O. and Passonneau, R., 2011, June. Sentiment analysis of twitter data. In Proceedings of the workshop on languages in social media (pp. 30–38). Association for Computational Linguistics.
Mudinas, A., Zhang, D. and Levene, M., 2012, August. Combining lexicon and learning based approaches for concept-level sentiment analysis. In Proceedings of the First International Workshop on Issues of Sentiment Discovery and Opinion Mining (p. 5). ACM.
Soo-Guan Khoo, C., Nourbakhsh, A. and Na, J.C., 2012. Sentiment analysis of online news text: a case study of appraisal theory. Online Information Review, 36(6), pp. 858–878.
Mane, S.B., Sawant, Y., Kazi, S. and Shinde, V., 2014. Real Time Sentiment Analysis of Twitter Data Using Hadoop. IJCSIT) International Journal of Computer Science and Information Technologies, 5(3), pp. 3098–3100.
Hopper, A.M. and Uriyo, M., 2015. Using sentiment analysis to review patient satisfaction data located on the internet. Journal of health organization and management, 29(2), pp. 221–233.
Hridoy, S.A.A., Ekram, M.T., Islam, M.S., Ahmed, F. and Rahman, R.M., 2015. Localized twitter opinion mining using sentiment analysis. Decision Analytics, 2(1), pp. 1–19.
Twitter Application Management. https://apps.twitter.com/.
Baccianella, S., Esuli, A. and Sebastiani, F., 2010, May. SentiWordNet 3.0: An Enhanced Lexical Resource for Sentiment Analysis and Opinion Mining. In LREC (Vol. 10, pp. 2200–2204).
Hu, M. and Liu, B., 2004, August. Mining and summarizing customer reviews. In Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining (pp. 168–177). ACM.
Liu, B., Hu, M. and Cheng, J., 2005, May. Opinion observer: analyzing and comparing opinions on the web. In Proceedings of the 14th international conference on World Wide Web (pp. 342–351). ACM.
Hansen, L.K., Arvidsson, A., Nielsen, F.Å., Colleoni, E. and Etter, M., 2011. Good friends, bad news-affect and virality in twitter. In Future information technology (pp. 34–43). Springer Berlin Heidelberg.
Finkel, J.R., Grenager, T. and Manning, C., 2005, June. Incorporating non-local information into information extraction systems by gibbs sampling. In Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics (pp. 363–370). Association for Computational Linguistics.
Saif, H., Fernandez, M., He, Y. and Alani, H., 2013. Evaluation datasets for Twitter sentiment analysis: a survey and a new dataset, the STS-Gold.
Acknowledgements
This research was also supported by Tiger Analytics Pvt. Ltd. We are thankful to them for providing insight and expertise that greatly assisted our research.We are also thankful to Prachi Khokhar for her assistance in editing the research and her comments that greatly improved the manuscript.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Sharma, A., Nayak, G.K. (2017). Efficient and Parallel Framework for Analyzing the Sentiment. In: Satapathy, S., Bhateja, V., Udgata, S., Pattnaik, P. (eds) Proceedings of the 5th International Conference on Frontiers in Intelligent Computing: Theory and Applications . Advances in Intelligent Systems and Computing, vol 515. Springer, Singapore. https://doi.org/10.1007/978-981-10-3153-3_14
Download citation
DOI: https://doi.org/10.1007/978-981-10-3153-3_14
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-3152-6
Online ISBN: 978-981-10-3153-3
eBook Packages: EngineeringEngineering (R0)