A Subjectivity Detection-Based Approach to Sentiment Analysis
- 14 Downloads
With the rise of Web 2.0 where loads of complex data are generated every day, effective subjectivity classification has become a difficult task in these days. Subjectivity classification refers to classifying information into subjective (expressing feelings) or objective (expressing facts). In this paper, we use Yelp reviews dataset. Our aim is to prove that a dataset with the objective sentences removed from each review gives better results than the dataset containing both subjective and objective sentences. To achieve this, we have used two approaches, each divided into two phases. The first phase of both the approaches is mainly the subjectivity classification phase where we filter out the objective sentences and keep the subjective sentences in the reviews, thus creating a new dataset with purely subjective reviews. The second phase of the first approach uses CountVectorizer which creates word vectors, and we fit the model to the classifiers. The second phase of first approach is repeated for both the datasets, and we get better results for the newly created dataset which contains purely subjective reviews. The second phase of the second approach uses Word2Vec, an implementation of neural network which creates distributed word vectors. We fit this Word2Vec model to the classifier, and we analyze the results. Again, the newly created dataset gives better results after we repeat this phase of the second approach for both the datasets.
KeywordsSentiment analysis Subjectivity detection Opinion mining Natural language processing
- 4.Pandey, S., S. Sagnika, and B.S.P. Mishra. 2018. A technique to handle negation in sentiment analysis on movie reviews. In:2018 IEEE international conference on communication and signal processing (ICCSP), 0737–0743.Google Scholar
- 6.Keshavarz, H.R., and M. Saniee Abadeh. 2018. MHSubLex: Using metaheuristic methods for subjectivity classification of microblogs. Journal of AI and Data Mining 6 (2): 341–353.Google Scholar
- 7.Kamal, A. 2013. Subjectivity classification using machine learning techniques for mining feature-opinion pairs from web opinion sources. arXiv preprint arXiv:1312.6962.
- 9.Dey, K., R. Shrivastava, and S. Kaushik. 2017. Twitter stance detection—A subjectivity and sentiment polarity inspired two-phase approach. In 2017 IEEE international conference on data mining workshops (ICDMW), pp 365–372.Google Scholar
- 10.Rashid, A., N. Anwer, M. Iqbal, and M. Sher. 2013. A survey paper: areas, techniques and challenges of opinion mining. International Journal of Computer Science Issues (IJCSI) 10 (6): 18–31.Google Scholar
- 11.Esuli, A., and F. Sebastiani. 2006. Determining term subjectivity and term orientation for opinion mining. In 11th Conference of the European chapter of the association for computational linguistics.Google Scholar
- 12.Zhuang, L., F. Jing, and X.Y. Zhu. 2006. Movie review mining and summarization. In Proceedings of the 15th ACM international conference on Information and knowledge management, 43–50.Google Scholar
- 13.Kim, S.M., and E. Hovy. 2006. Automatic identification of pro and con reasons in online reviews. In Proceedings of the COLING/ACL on main conference poster sessions. Association for Computational Linguistics, 483–490.Google Scholar
- 14.Xuan, H.N.T., A.C. Le, and L.M. Nguyen. 2012. Linguistic features for subjectivity classification. In 2012 IEEE international conference on asian language processing, 17–20.Google Scholar
- 15.Rustamov, S. 2018. A hybrid system for subjectivity analysis. In Advances in fuzzy systems.Google Scholar