Abstract
With advances in Web technologies, more and more people are turning to popular social media platforms such as Twitter to express their feelings and opinions on a variety of topics and current issues online. Sentiment analysis of Web data is becoming a fast and effective way of evaluating public opinion and sentiment for use in marketing and social behavioral studies. This research investigates the enhancement techniques in machine-learning methods for sentiment classification of Web data. Feature selection, negation dealing, and emoticon handling are studied in this paper for their ability to improve the performance of machine-learning methods. The range of enhancement techniques is tested using different text data sets, such as tweets and movie reviews. The results show that different enhancement methods can improve classification efficacy and accuracy differently.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Kontopoulos, E., Berberidis, C., Dergiades, T., Bassiliades, N.: Ontology-based Sentiment Analysis of Twitter Posts. Expert Systems with Applications 40(10), 4065–4074 (2013)
Go, A., Bhayani, R., Huang, L.: Twitter Sentiment Classification using Distant Supervision.CS224N Project Report, Stanford 1–12 (2009)
Ghiassi, M., Skinner, J., Zimbra, D.: Twitter Brand Sentiment Analysis: A Hybrid System using N-gram Analysis and Dynamic Artificial Neural Network. Expert Systems with Applications 40(16), 6266–6282 (2013)
Montoyo, A., MartÃnez-Barco, P., Balahur, A.: Subjectivity and Sentiment Analysis: An Overview of the Current State of the Area and Envisaged Developments. Decision Support Systems 53(4), 675–679 (2012)
Trilla, A., AlÃas, F.: Sentence-based Sentiment Analysis for Expressive Text-to-speech. IEEE Transactions on Audio, Speech, and Language Processing 21(2), 223–233 (2013)
Chung, W., Tseng, T.-L.(B.): Discovering Business Intelligence from Online Product Reviews: A Rule-induction Framework. Expert Systems with Applications 39(15), 11870–11879 (2012)
Maks, I., Vossen, P.: A Lexicon Model for Deep Sentiment Analysis and Opinion Mining Applications. Decision Support Systems 53(4), 680–688 (2012)
Boiy, E., Moens, M.-F.: A Machine Learning Approach to Sentiment Analysis in Multilingual Web texts. Information Retrieval 12(5), 526–558 (2008)
Haddi, E., Liu, X., Shi, Y.: The Role of Text Pre-processing in Sentiment Analysis. Procedia Computer Science 17, 26–32 (2013)
Pang, B., Lee, L., Vaithyanathan, S.: Thumbs up? Sentiment Classification using Machine Learning Techniques. In: Proceedings of the ACL-02 Conference on Empirical Methods in Natural Language processing, vol. 10, pp. 79–86. Association for Computational Linguistics (2002)
Pan, S.J., Ni, X., Sun, J.-T., Yang, Q., Chen, Z.: Cross-domain Sentiment Classification via Spectral Feature Alignment. In: Proceedings of the 19th international conference on World wide web, pp. 751–760. ACM (2010)
Duric, A., Song, F.: Feature Selection for Sentiment Analysis based on Content and Syntax Models. Decision Support Systems 53(40), 704–711 (2012)
Alshalabi, H., Tiun, S., Omar, N., Albared, M.: Experiments on the Use of Feature Selection and Machine Learning Methods in Automatic Malay Text Categorization. Procedia Technology 11, 748–754 (2013)
Li, S., Xia, R., Zong, C., Huang, C.-R.: A Framework of Feature Selection Methods for Text Categorization. In: Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP, pp. 692–700 (2009)
Li, S., Yat, S., Lee, M., Chen, Y., Guodong, C.-R.H.: Sentiment Classification and Polarity Shifting. In: Proceedings of the 23rd International Conference on Computational Linguistics, Association for Computational Linguistics, pp. 635–643 (2010)
Hogenboom, A., Bal, D., Frasincar, F., Bal, M., de Jong, F., Kaymak, U.: Exploiting Emoticons in Sentiment Analysis. In: Proceedings of the 28th Annual ACM Symposium on Applied Computing - SAC 2013, p. 703 (2013)
Davidov, D., Tsur, O., Rappoport, A.: Enhanced Sentiment Learning Using Twitter Hashtags and Smileys. In: Proceedings of the 23rd International Conference on Computational Linguistics: Posters, Association for Computational Linguistics, pp. 241–249 (2010)
Ortigosa-Hernández, J., RodrÃguez, J.D., Alzate, L., Lucania, M., Inza, I., Lozano, J.A.: Approaching Sentiment Analysis by using Semi-supervised Learning of Multi-dimensional Classifiers. Neurocomputing 92, 98–115 (2012)
Glorot, X., Bordes, A., Bengio, Y.: Domain Adaptation for Large-scale Sentiment Classification: A Deep Learning Approach. In: Proceedings of the 28th International Conference on Machine Learning (ICML-2011), pp. 513–520 (2011)
Ji, H., Deng, H., Han, J.: Uncertainty Reduction for Knowledge Discovery and Information Extraction on the World Wide Web. Proceedings of the IEEE 100(9), 2658–2674 (2012)
Chang, C.-C., Lin, C.-J.: LIBSVM: A Library for Support Vector Machines.Software pp. 1-39 (2013), http://www.csie.ntu.edu.tw/~cjlin/libsvm
Wilson, T., Wiebe, J., Hoffmann, P.: Recognizing Contextual Polarity: An Exploration of Features for Phrase-level Sentiment Analysis. Computational linguistics 35(3) (2009)
Twitter-sentiment-analyzer Data, https://github.com/ravikiranj/twitter-sentiment-analyzer/tree/master/data
Movie Review data, http://www.cs.cornell.edu/people/pabo/movie-review-data/
Wang, F.-Y., Zeng, D., Carley, K.M., Mao, W.: Social Computing: From Social Informatics to Social Intelligence. IEEE Intelligence Systems 22(2), 79–83 (2007)
Byvatov, E., Fechner, U., Sadowski, J., Schneider, G.: Comparison of Support Vector Machine and Artificial Neural Network Systems for Drug/nondrug Classification. Journal of Chemical Information and Computer Sciences 43(6), 1882–1889 (2003)
Na, J.-C., Khoo, C.S.G.: Aspect-based Sentiment Analysis of Movie Reviews on Discussion Boards. Journal of Information Science 36(6), 823–848 (2010)
Bae, Y., Lee, H.: Sentiment Analysis of Twitter Audiences: Measuring the Positive or Negative Influence of Popular Twitterers. Journal of the American Society for Information Science and Technology 36(12), 2521–2535 (2012)
Gunter, B., Koteyko, N., Atanasova, D.: Sentiment Analysis: A Market-relevant and Reliable Measure of Public Feeling? International Journal of Market Research 56(2), 231 (2014)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Wang, Z., Tong, V.J.C., Chin, H.C. (2014). Enhancing Machine-Learning Methods for Sentiment Classification of Web Data. In: Jaafar, A., et al. Information Retrieval Technology. AIRS 2014. Lecture Notes in Computer Science, vol 8870. Springer, Cham. https://doi.org/10.1007/978-3-319-12844-3_34
Download citation
DOI: https://doi.org/10.1007/978-3-319-12844-3_34
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-12843-6
Online ISBN: 978-3-319-12844-3
eBook Packages: Computer ScienceComputer Science (R0)