Abstract
With the rise of Web 2.0 where loads of complex data are generated every day, effective subjectivity classification has become a difficult task in these days. Subjectivity classification refers to classifying information into subjective (expressing feelings) or objective (expressing facts). In this paper, we use Yelp reviews dataset. Our aim is to prove that a dataset with the objective sentences removed from each review gives better results than the dataset containing both subjective and objective sentences. To achieve this, we have used two approaches, each divided into two phases. The first phase of both the approaches is mainly the subjectivity classification phase where we filter out the objective sentences and keep the subjective sentences in the reviews, thus creating a new dataset with purely subjective reviews. The second phase of the first approach uses CountVectorizer which creates word vectors, and we fit the model to the classifiers. The second phase of first approach is repeated for both the datasets, and we get better results for the newly created dataset which contains purely subjective reviews. The second phase of the second approach uses Word2Vec, an implementation of neural network which creates distributed word vectors. We fit this Word2Vec model to the classifier, and we analyze the results. Again, the newly created dataset gives better results after we repeat this phase of the second approach for both the datasets.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Pawar, A.B., M.A. Jawale, and D.N. Kyatanavar. 2016. Fundamentals of sentiment analysis: Concepts and methodology. Sentiment analysis and ontology engineering, 25–48. Cham: Springer.
Bravo-Marquez, F., M. Mendoza, and B. Poblete. 2014. Meta-level sentiment models for big social data analysis. Knowledge-Based Systems 69: 86–99.
Liu, B. 2012. Sentiment analysis and opinion mining. Synthesis Lectures on Human Language Technologies 5 (1): 1–167.
Pandey, S., S. Sagnika, and B.S.P. Mishra. 2018. A technique to handle negation in sentiment analysis on movie reviews. In:2018 IEEE international conference on communication and signal processing (ICCSP), 0737–0743.
Baldonado, M., C.-C.K. Chang, L. Gravano, and A. Paepcke. 1997. The stanford digital library metadata architecture. International Journal on Digital Libraries 1: 108–121.
Keshavarz, H.R., and M. Saniee Abadeh. 2018. MHSubLex: Using metaheuristic methods for subjectivity classification of microblogs. Journal of AI and Data Mining 6 (2): 341–353.
Kamal, A. 2013. Subjectivity classification using machine learning techniques for mining feature-opinion pairs from web opinion sources. arXiv preprint arXiv:1312.6962.
Chaturvedi, I., E. Cambria, R.E. Welsch, and F. Herrera. 2018. Distinguishing between facts and opinions for sentiment analysis: Survey and challenges. Information Fusion 44: 65–77.
Dey, K., R. Shrivastava, and S. Kaushik. 2017. Twitter stance detection—A subjectivity and sentiment polarity inspired two-phase approach. In 2017 IEEE international conference on data mining workshops (ICDMW), pp 365–372.
Rashid, A., N. Anwer, M. Iqbal, and M. Sher. 2013. A survey paper: areas, techniques and challenges of opinion mining. International Journal of Computer Science Issues (IJCSI) 10 (6): 18–31.
Esuli, A., and F. Sebastiani. 2006. Determining term subjectivity and term orientation for opinion mining. In 11th Conference of the European chapter of the association for computational linguistics.
Zhuang, L., F. Jing, and X.Y. Zhu. 2006. Movie review mining and summarization. In Proceedings of the 15th ACM international conference on Information and knowledge management, 43–50.
Kim, S.M., and E. Hovy. 2006. Automatic identification of pro and con reasons in online reviews. In Proceedings of the COLING/ACL on main conference poster sessions. Association for Computational Linguistics, 483–490.
Xuan, H.N.T., A.C. Le, and L.M. Nguyen. 2012. Linguistic features for subjectivity classification. In 2012 IEEE international conference on asian language processing, 17–20.
Rustamov, S. 2018. A hybrid system for subjectivity analysis. In Advances in fuzzy systems.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Das, N., Sagnika, S. (2020). A Subjectivity Detection-Based Approach to Sentiment Analysis. In: Swain, D., Pattnaik, P., Gupta, P. (eds) Machine Learning and Information Processing. Advances in Intelligent Systems and Computing, vol 1101. Springer, Singapore. https://doi.org/10.1007/978-981-15-1884-3_14
Download citation
DOI: https://doi.org/10.1007/978-981-15-1884-3_14
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-15-1883-6
Online ISBN: 978-981-15-1884-3
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)