A Subjectivity Detection-Based Approach to Sentiment Analysis

  • Nilanjana DasEmail author
  • Santwana Sagnika
Conference paper
Part of the Advances in Intelligent Systems and Computing book series (AISC, volume 1101)


With the rise of Web 2.0 where loads of complex data are generated every day, effective subjectivity classification has become a difficult task in these days. Subjectivity classification refers to classifying information into subjective (expressing feelings) or objective (expressing facts). In this paper, we use Yelp reviews dataset. Our aim is to prove that a dataset with the objective sentences removed from each review gives better results than the dataset containing both subjective and objective sentences. To achieve this, we have used two approaches, each divided into two phases. The first phase of both the approaches is mainly the subjectivity classification phase where we filter out the objective sentences and keep the subjective sentences in the reviews, thus creating a new dataset with purely subjective reviews. The second phase of the first approach uses CountVectorizer which creates word vectors, and we fit the model to the classifiers. The second phase of first approach is repeated for both the datasets, and we get better results for the newly created dataset which contains purely subjective reviews. The second phase of the second approach uses Word2Vec, an implementation of neural network which creates distributed word vectors. We fit this Word2Vec model to the classifier, and we analyze the results. Again, the newly created dataset gives better results after we repeat this phase of the second approach for both the datasets.


Sentiment analysis Subjectivity detection Opinion mining Natural language processing 


  1. 1.
    Pawar, A.B., M.A. Jawale, and D.N. Kyatanavar. 2016. Fundamentals of sentiment analysis: Concepts and methodology. Sentiment analysis and ontology engineering, 25–48. Cham: Springer.CrossRefGoogle Scholar
  2. 2.
    Bravo-Marquez, F., M. Mendoza, and B. Poblete. 2014. Meta-level sentiment models for big social data analysis. Knowledge-Based Systems 69: 86–99.CrossRefGoogle Scholar
  3. 3.
    Liu, B. 2012. Sentiment analysis and opinion mining. Synthesis Lectures on Human Language Technologies 5 (1): 1–167.CrossRefGoogle Scholar
  4. 4.
    Pandey, S., S. Sagnika, and B.S.P. Mishra. 2018. A technique to handle negation in sentiment analysis on movie reviews. In:2018 IEEE international conference on communication and signal processing (ICCSP), 0737–0743.Google Scholar
  5. 5.
    Baldonado, M., C.-C.K. Chang, L. Gravano, and A. Paepcke. 1997. The stanford digital library metadata architecture. International Journal on Digital Libraries 1: 108–121.CrossRefGoogle Scholar
  6. 6.
    Keshavarz, H.R., and M. Saniee Abadeh. 2018. MHSubLex: Using metaheuristic methods for subjectivity classification of microblogs. Journal of AI and Data Mining 6 (2): 341–353.Google Scholar
  7. 7.
    Kamal, A. 2013. Subjectivity classification using machine learning techniques for mining feature-opinion pairs from web opinion sources. arXiv preprint arXiv:1312.6962.
  8. 8.
    Chaturvedi, I., E. Cambria, R.E. Welsch, and F. Herrera. 2018. Distinguishing between facts and opinions for sentiment analysis: Survey and challenges. Information Fusion 44: 65–77.CrossRefGoogle Scholar
  9. 9.
    Dey, K., R. Shrivastava, and S. Kaushik. 2017. Twitter stance detection—A subjectivity and sentiment polarity inspired two-phase approach. In 2017 IEEE international conference on data mining workshops (ICDMW), pp 365–372.Google Scholar
  10. 10.
    Rashid, A., N. Anwer, M. Iqbal, and M. Sher. 2013. A survey paper: areas, techniques and challenges of opinion mining. International Journal of Computer Science Issues (IJCSI) 10 (6): 18–31.Google Scholar
  11. 11.
    Esuli, A., and F. Sebastiani. 2006. Determining term subjectivity and term orientation for opinion mining. In 11th Conference of the European chapter of the association for computational linguistics.Google Scholar
  12. 12.
    Zhuang, L., F. Jing, and X.Y. Zhu. 2006. Movie review mining and summarization. In Proceedings of the 15th ACM international conference on Information and knowledge management, 43–50.Google Scholar
  13. 13.
    Kim, S.M., and E. Hovy. 2006. Automatic identification of pro and con reasons in online reviews. In Proceedings of the COLING/ACL on main conference poster sessions. Association for Computational Linguistics, 483–490.Google Scholar
  14. 14.
    Xuan, H.N.T., A.C. Le, and L.M. Nguyen. 2012. Linguistic features for subjectivity classification. In 2012 IEEE international conference on asian language processing, 17–20.Google Scholar
  15. 15.
    Rustamov, S. 2018. A hybrid system for subjectivity analysis. In Advances in fuzzy systems.Google Scholar

Copyright information

© Springer Nature Singapore Pte Ltd. 2020

Authors and Affiliations

  1. 1.TATA Consultancy ServicesKolkataIndia
  2. 2.School of Computer EngineeringKalinga Institute of Industrial Technology (Deemed to be University)BhubaneswarIndia

Personalised recommendations