Abstract
Many business organizations use social media data in order to understand their customer on an individual level. Consumers are keen to share their views on certain products or commodities. This leads to the generation of large amount of unstructured social media data. Thus, text data is being formed gradually in many areas like automated business, education, health care, show business. Opinion mining, the subfield of text mining, deals with mining of review text and classifying the opinions or the sentiments of that text as positive or negative. The work in this paper develops a framework for opinion mining. It includes a novel feature selection method called Most Persistent Feature Selection (MPFS). MPFS method uses information gain of the features in the review documents. The performance of the three different classifiers, namely Naïve Bayes, Maximum Entropy, and Support Vector Machine, with the proposed feature selection method is evaluated on movie reviews using the parameters accuracy, precision, recall, and F-score. The different classifier models generated show the acceptable performance in comparison with the other existing models.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Liu, B.: Sentiment Analysis and Opinion Mining. vol. 5, No. 1. Morgan & Claypool Publishers (2012)
Pang, B., Lee, L.: Opinion Mining and Sentiment Analysis (2008)
Pang, B., Lee, L.: A sentimental education: sentiment analysis using subjectivity summarization based on minimum cuts. In: Proceedings of ACL (2004)
Hu, M., Liu, B.: Mining and summarizing customer reviews. In: Proceedings ACM SIGKDD, pp. 168–177 (2004)
Wang, S., Manning, C.D.: Baselines and bigrams: simple, good sentiment and topic classification. In: Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, 8–14 July 2012, pp. 90–94, Jeju, Republic of Korea (2012)
Pang, B., Lee, L., Vaithyanathan, S.: Thumbs up? Sentiment classification using machine learning techniques. In: Proceedings of EMNLP, pp. 79–86 (2002)
Jurek, A., Mulvenna, M.D., Bi, Y.: Improved lexicon–based sentiment analysis for social media analytics. Secur. Informat. (2015)
Fu, G. Wang, X.: Chinese sentence-level sentiment classification based on fuzzy sets. In: 2010 Proceedings of the 23rd International Conference on Computational Linguistics, August 2010, Poster Volume, pp. 312–319, Beijing (2010)
Xing, F., Zhan, J.: Sentiment analysis using product review data. J Big Data (2015)
Tripathy, A., Anand, A., Rath, S.K.: Classification of sentiment reviews using n-gram machine learning approach. Exp. Syst. Appl. 57, 117–126 (2016)
Sohail, S.S., Siddiqui, J., Ali, R.: Feature extraction and analysis of online reviews for the recommendation of books using opinion mining technique. Perspect. Sci. 8, 754–756 (2016)
Zhou, X., Wan, X., Xiao, J.: CL opinion miner: opinion target extraction in a cross-language scenario. IEEE/ACM Trans. Audio Speech Lang. Process. 23(4) (2015)
Tartir, S., Abdul-Nabi, I.: Semantic sentiment analysis in Arabic social media. J. King Saud Univ. Comput. Informat. Sci. 29, 229–233 (2017)
Tripathy, A., Anand, A., Rath, S.K.: Document-level Sentiment Classification Using Hybrid Machine Learning Approach. Springer, London (2017)
Zainuddin, N., Selamat, A.: Sentiment Analysis Using Support Vector Machine, pp. 333–337. IEEE I4CT, Langkawi, Kedah, Malaysia (2014)
Jurafsky, D., Martin, J.H.: Naive Bayes and sentiment classification. Speech Lang. Process. (2016)
Manek, A.S., Shenoy, P.D., Mohan M.C., Venugopal, K.: Aspect Term Extraction for Sentiment Analysis in Large Movie Reviews Using Gini Index Feature Selection Method and SVM Classifier. Springer Science + Business Media, New York (2016)
Chunping, O., Yongbin, L., Shuqing, Z., Xiaohua, Y.: Opinion objects identification and sentiment analysis. Int. J. Database Theor. Appl. 8(6), 1–12 (2015)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Sangam, S., Shinde, S. (2019). Most Persistent Feature Selection Method for Opinion Mining of Social Media Reviews. In: Fong, S., Akashe, S., Mahalle, P. (eds) Information and Communication Technology for Competitive Strategies. Lecture Notes in Networks and Systems, vol 40. Springer, Singapore. https://doi.org/10.1007/978-981-13-0586-3_22
Download citation
DOI: https://doi.org/10.1007/978-981-13-0586-3_22
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-0585-6
Online ISBN: 978-981-13-0586-3
eBook Packages: EngineeringEngineering (R0)