Abstract
Tweets are cryptic and often laced with insinuation. Hence, tweets cannot be interpreted in isolation. Human beings can interpret tweets because they possess the requisite contextual knowledge. This knowledge enables them to understand the context of tweets and interpret the text. Emulating this interpretive ability in machines requires that the machine acquire the same contextual knowledge. Tweets pertaining to political and societal issues contain domain-specific terms. Interpreting such tweets solely on the basis of the sentiment orientation of words produces incorrect sentiment tags, because the polarity of a term depends on the topic it refers to. Thus, an understanding of the pertinent domain terms and their associated sentiment is essential to guide the sentiment mining process. A resource of relevant domain-specific contextual terms and associated sentiments can help to achieve enhanced sentiment mining performance. With the objective of equipping the machine with the contextual knowledge to facilitate semantic interpretation, we tap Web resources, process them, and structure them as Contextual Knowledge Structures (CKS). We then leverage the CKS to enable a semantic interpretation of tweets. We construct a CKS-based training set to train a Naïve Bayes classifier and classify the tweets. We further transform the CKS into a sentiment training set (STS) and use it for detecting sentiment polarity tags for tweets. CKS provide the necessary background knowledge pertaining to issues, events, and the related domain-specific terms, thus facilitating semantic sentiment mining. All our experiments are conducted in the context of political/public policy, trending topic, and event-related tweets, with the objective of obtaining a pulse of the political climate in India. Our CKS-based classifier exhibits an accuracy of 94.23% in mapping the tweets to the political topic. The distance-based CKS-Sentiment mining algorithm exhibits a consistent performance with an accuracy of 70.90%. The contributions of this work are: (a) a novel method which leverages Web content to derive an optimum training set for tweet analysis, (b) a high degree of accuracy, precision, and recall in tweet classification and sentiment mining with a small CKS-based training set, and (c) a topic-adaptive model which can adapt to any domain or topic and exhibit improved tweet analysis performance.
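To make the topic-classification step more concrete, the following is a minimal sketch of a multinomial Naïve Bayes classifier trained on a small set of topic-labelled term lists. It is an illustration only, not the chapter's implementation: the training documents, the topic labels, the helper names `train_naive_bayes` and `classify`, and the use of add-one smoothing are all assumptions made for this example, and the tiny `TRAINING_SET` merely stands in for a CKS-derived training set.

```python
import math
from collections import Counter, defaultdict


def train_naive_bayes(labelled_docs):
    """Fit a multinomial Naive Bayes model from (tokens, label) pairs."""
    doc_counts = Counter()               # number of training documents per label
    term_counts = defaultdict(Counter)   # term frequencies per label
    vocabulary = set()
    for tokens, label in labelled_docs:
        doc_counts[label] += 1
        term_counts[label].update(tokens)
        vocabulary.update(tokens)
    return doc_counts, term_counts, vocabulary


def classify(tokens, model):
    """Return the label with the highest log-posterior for the tokens."""
    doc_counts, term_counts, vocabulary = model
    total_docs = sum(doc_counts.values())
    best_label, best_score = None, float("-inf")
    for label in doc_counts:
        score = math.log(doc_counts[label] / total_docs)  # log prior
        label_total = sum(term_counts[label].values())
        for token in tokens:
            if token not in vocabulary:
                continue  # ignore terms never seen in training
            # Add-one (Laplace) smoothing; an assumption of this sketch
            likelihood = (term_counts[label][token] + 1) / (label_total + len(vocabulary))
            score += math.log(likelihood)
        if score > best_score:
            best_label, best_score = label, score
    return best_label


# Hypothetical stand-in for a CKS-derived training set (not from the chapter)
TRAINING_SET = [
    ("demonetization banknotes cash atm queue".split(), "demonetization"),
    ("banknotes withdrawn currency black money".split(), "demonetization"),
    ("jnu students protest campus sedition".split(), "jnu_protest"),
    ("students campus slogans protest arrest".split(), "jnu_protest"),
]

model = train_naive_bayes(TRAINING_SET)
tweet = "long queue at the atm for new banknotes".split()
predicted_topic = classify(tweet, model)
```

In this toy setting the tweet shares the terms "queue", "atm", and "banknotes" with the first topic only, so `predicted_topic` comes out as `"demonetization"`. A real training set would be far larger, and tweets would need normalization before tokenizing.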
Notes
1. Pareto Analysis is a statistical decision-making technique for selecting a limited number of tasks or features that produce a significant overall effect. It uses the Pareto Principle (also known as the 80/20 rule): the idea that doing 20% of the work can generate 80% of the benefit of doing the entire job.
2. Chunking is the process of grouping words that carry Part-Of-Speech (POS) tags into phrases such as noun phrases and verb phrases.
3. On February 9, 2016, students of Jawaharlal Nehru University (JNU) held a protest on their campus against the capital punishment meted out to the 2001 Indian Parliament attack convict Afzal Guru.
4. On 8 November 2016, the Government of India announced the demonetization of all ₹500 (US$7.80) and ₹1,000 (US$16) banknotes of the Mahatma Gandhi Series.
5. The Wu and Palmer [39] similarity metric takes the depths of the two given concepts in the WordNet taxonomy, together with the depth of their least common subsumer (LCS), and combines these figures into a similarity score.
6. SentiWordNet is a lexical resource for opinion mining. SentiWordNet assigns to each synset of WordNet three sentiment scores: positivity, negativity, and objectivity. Each score lies in the range [0,1], and the three scores sum to one: positive_score + negative_score + objective_score = 1.
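The Wu and Palmer score described in note 5 is commonly written as 2 × depth(LCS) / (depth(c1) + depth(c2)). The sketch below applies that form to a small hand-made taxonomy so that the arithmetic can be followed by eye. The `PARENT` table and all concept names in it are hypothetical and are not taken from WordNet, and depth is counted in nodes (the root has depth 1), which is one common convention rather than something the chapter specifies.

```python
# Hypothetical toy taxonomy (child -> parent); not taken from WordNet
PARENT = {
    "entity": None,
    "person": "entity",
    "leader": "person",
    "politician": "leader",
    "minister": "politician",
    "student": "person",
    "organization": "entity",
    "party": "organization",
}


def path_to_root(concept):
    """Return the list of concepts from `concept` up to the root."""
    path = []
    while concept is not None:
        path.append(concept)
        concept = PARENT[concept]
    return path


def depth(concept):
    """Depth counted in nodes, so the root has depth 1."""
    return len(path_to_root(concept))


def least_common_subsumer(a, b):
    """Deepest concept that is an ancestor of (or equal to) both a and b."""
    ancestors_of_a = set(path_to_root(a))
    for concept in path_to_root(b):  # walks upward from b, deepest first
        if concept in ancestors_of_a:
            return concept
    raise ValueError("concepts do not share a root")


def wu_palmer(a, b):
    """Wu and Palmer similarity: 2 * depth(LCS) / (depth(a) + depth(b))."""
    lcs = least_common_subsumer(a, b)
    return 2 * depth(lcs) / (depth(a) + depth(b))


score_close = wu_palmer("minister", "student")  # LCS is "person"
score_far = wu_palmer("minister", "party")      # LCS is the root "entity"
```

Here "minister" has depth 5 and "student" depth 3, with "person" (depth 2) as their LCS, giving 2 × 2 / (5 + 3) = 0.5. "minister" and "party" meet only at the root, giving 2 × 1 / (5 + 3) = 0.25. The score therefore rises as the shared ancestor sits deeper in the taxonomy.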
References
Pang, B., & Lee, L. (2008). Opinion mining and sentiment analysis. Foundations and Trends in Information Retrieval, 2(1–2), 1–135.
Ribeiro, F. N., Araújo, M., Gonçalves, P., Gonçalves, M. A., & Benevenuto, F. (2016). SentiBench: A benchmark comparison of state-of-the-practice sentiment analysis methods. EPJ Data Science, 5(1), 1–29.
Zeng, D., Chen, H., Lusch, R., & Li, S. H. (2010). Social media analytics and intelligence. IEEE Intelligent Systems, 25(6), 13–16.
Li, C., Sun, A., Weng, J., & He, Q. (2015). Tweet segmentation and its application to named entity recognition. IEEE Transactions on Knowledge and Data Engineering, 27(2), 558–570.
Kaufmann, M., & Kalita, J. (2010, January). Syntactic normalization of twitter messages. In International Conference on Natural Language Processing, Kharagpur, India.
Volkova, S., Bachrach, Y., Armstrong, M., & Sharma, V. (2015, January). Inferring latent user properties from texts published in social media. In AAAI (pp. 4296–4297).
Burnap, P., Rana, O. F., Avis, N., Williams, M., Housley, W., Edwards, A., et al. (2015). Detecting tension in online communities with computational Twitter analysis. Technological Forecasting and Social Change, 95, 96–108.
Gomadam, K., Yeh, P. Z., Verma, K., & Miller, J. A. (2012, June). Data enrichment using web APIs. In 2012 IEEE First International Conference on Services Economics (SE) (pp. 46–53). IEEE.
Jadhav, A. S., Purohit, H., Kapanipathi, P., Anantharam, P., Ranabahu, A. H., Nguyen, V., et al. (2010). Twitris 2.0: Semantically empowered system for understanding perceptions from social data.
Villanueva, D., González-Carrasco, I., López-Cuadrado, J. L., & Lado, N. (2016). SMORE: Towards a semantic modeling for knowledge representation on social media. Science of Computer Programming, 121, 16–33.
Chai, X., Deshpande, O., Garera, N., Gattani, A., Lam, W., Lamba, D. S., et al. (2013). Social media analytics: The Kosmix story. IEEE Data Engineering Bulletin, 36(3), 4–12.
Vosoughi, S., Zhou, H., & Roy, D. (2016). Enhanced twitter sentiment classification using contextual information. arXiv preprint arXiv:1605.05195.
Theodotou, A., & Stassopoulou, A. (2015, November). A system for automatic classification of twitter messages into categories. In International and Interdisciplinary Conference on Modeling and Using Context (pp. 532–537). Cham: Springer.
Derczynski, L., Maynard, D., Rizzo, G., van Erp, M., Gorrell, G., Troncy, R., et al. (2015). Analysis of named entity recognition and linking for tweets. Information Processing & Management, 51(2), 32–49.
Liu, S., Cheng, X., Li, F., & Li, F. (2015). TASC: Topic-adaptive sentiment classification on dynamic tweets. IEEE Transactions on Knowledge and Data Engineering, 27(6), 1696–1709.
Ritter, A., Clark, S., & Etzioni, O. (2011, July). Named entity recognition in tweets: An experimental study. In Proceedings of the Conference on Empirical Methods in Natural Language Processing (pp. 1524–1534). Association for Computational Linguistics.
McDonald, G., Deveaud, R., McCreadie, R., Macdonald, C., & Ounis, I. (2015). Tweet enrichment for effective dimensions classification in online reputation management.
He, Y., Yang, C. S., Yu, L. C., Lai, K. R., & Liu, W. (2015, December). Sentiment classification of short texts based on semantic clustering. In 2015 International Conference on Orange Technologies (ICOT) (pp. 54–57). IEEE.
Hatzivassiloglou, V., & Wiebe, J. M. (2000, July). Effects of adjective orientation and gradability on sentence subjectivity. In Proceedings of the 18th Conference on Computational linguistics—Volume 1 (pp. 299–305). Association for Computational Linguistics.
Hu, M., & Liu, B. (2004, August). Mining and summarizing customer reviews. In Proceedings of the Tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (pp. 168–177). ACM.
Taboada, M., & Grieve, J. (2004, March). Analyzing appraisal automatically. In Proceedings of AAAI Spring Symposium on Exploring Attitude and Affect in Text, Stanford University, CA (AAAI Technical Report SS-04-07) (pp. 158–161). AAAI Press.
Read, J., & Carroll, J. (2009, November). Weakly supervised techniques for domain-independent sentiment classification. In Proceedings of the 1st International CIKM Workshop on Topic-Sentiment Analysis for Mass Opinion (pp. 45–52). ACM.
Pang, B., Lee, L., & Vaithyanathan, S. (2002, July). Thumbs up? Sentiment classification using machine learning techniques. In Proceedings of the ACL-02 Conference on Empirical Methods in Natural Language Processing (Vol. 10, pp. 79–86). Association for Computational Linguistics.
Pang, B., & Lee, L. (2004, July). A sentimental education: Sentiment analysis using subjectivity summarization based on minimum cuts. In Proceedings of the 42nd Annual Meeting on Association for Computational Linguistics (p. 271). Association for Computational Linguistics.
Boiy, E., Hens, P., Deschacht, K., & Moens, M. F. (2007, June). Automatic sentiment analysis in on-line text. In ELPUB (pp. 349–360).
Zhao, J., Liu, K., & Wang, G. (2008, October). Adding redundant features for CRFs-based sentence sentiment classification. In Proceedings of the Conference on Empirical Methods in Natural Language Processing (pp. 117–126). Association for Computational Linguistics.
Narayanan, R., Liu, B., & Choudhary, A. (2009, August). Sentiment analysis of conditional sentences. In Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 1 (pp. 180–189). Association for Computational Linguistics.
Hassan, A., Abbasi, A., & Zeng, D. (2013, September). Twitter sentiment analysis: A bootstrap ensemble framework. In 2013 International Conference on Social Computing (SocialCom) (pp. 357–364). IEEE.
Khan, F. H., Bashir, S., & Qamar, U. (2014). TOM: Twitter opinion mining framework using hybrid classification scheme. Decision Support Systems, 57, 245–257.
Abel, F., Celik, I., Houben, G. J., & Siehndel, P. (2011). Leveraging the semantics of tweets for adaptive faceted search on twitter. The Semantic Web–ISWC 2011, 1–17.
Simeon, C., & Hilderman, R. (2015, October). Evaluating the effectiveness of hashtags as predictors of the sentiment of tweets. In International Conference on Discovery Science (pp. 251–265). Cham: Springer.
Saif, H., He, Y., Fernandez, M., & Alani, H. (2016). Contextual semantics for sentiment analysis of Twitter. Information Processing & Management, 52(1), 5–19.
Ghiassi, M., Skinner, J., & Zimbra, D. (2013). Twitter brand sentiment analysis: A hybrid system using n-gram analysis and dynamic artificial neural network. Expert Systems with Applications, 40(16), 6266–6282.
Saif, H., He, Y., & Alani, H. (2012). Semantic sentiment analysis of twitter. The Semantic Web–ISWC 2012, 508–524.
Bahrainian, S. A., & Dengel, A. (2013, December). Sentiment analysis and summarization of twitter data. In 2013 IEEE 16th International Conference on Computational Science and Engineering (CSE) (pp. 227–234). IEEE.
Kontopoulos, E., Berberidis, C., Dergiades, T., & Bassiliades, N. (2013). Ontology-based sentiment analysis of twitter posts. Expert Systems with Applications, 40(10), 4065–4074.
Javed, N., & Muralidhara, B. L. (2015). Automating corpora generation with semantic cleaning and tagging of tweets for multi-dimensional social media analytics. International Journal of Computer Applications, 127(12), 11–16.
Han, J., Pei, J., & Kamber, M. (2011). Data mining: Concepts and techniques. Amsterdam: Elsevier.
Wu, Z., & Palmer, M. (1994, June). Verb semantics and lexical selection. In Proceedings of the 32nd Annual Meeting on Association for Computational Linguistics (pp. 133–138). Association for Computational Linguistics.
Liu, H. (2004). MontyLingua: An end-to-end natural language processor with common sense.
Pennebaker, J. W., Booth, R. J., Boyd, R. L., & Francis, M. E. (2015). Linguistic inquiry and word count: LIWC2015. Austin, TX: Pennebaker Conglomerates.
Hutto, C. J., & Gilbert, E. (2014, May). VADER: A parsimonious rule-based model for sentiment analysis of social media text. In Eighth International AAAI Conference on Weblogs and Social Media.
Nielsen, F. Å. (2011). A new ANEW: Evaluation of a word list for sentiment analysis in microblogs. arXiv preprint arXiv:1103.2903.
Go, A., Bhayani, R., & Huang, L. (2009). Twitter sentiment classification using distant supervision. CS224N Project Report, Stanford, 1(2009), 12.
© 2018 Springer Nature Singapore Pte Ltd.
About this chapter
Cite this chapter
Javed, N., & Muralidhara, B. L. (2018). Semantic Interpretation of Tweets: A Contextual Knowledge-Based Approach for Tweet Analysis. In: Margret Anouncia, S., Wiil, U. (eds) Knowledge Computing and Its Applications. Springer, Singapore. https://doi.org/10.1007/978-981-10-6680-1_4
DOI: https://doi.org/10.1007/978-981-10-6680-1_4
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-6679-5
Online ISBN: 978-981-10-6680-1
eBook Packages: Computer Science, Computer Science (R0)