Provision of Efficient Sentiment Analysis for Unstructured Data

  • C. PriyaEmail author
  • K. Santhi
  • P. M. Durairaj Vincent
Conference paper
Part of the Advances in Intelligent Systems and Computing book series (AISC, volume 862)


Applications on sentiment analysis via diverse context in analyzing the individual opinion on various issues such as political events, e-governance, and product reviews. Decision-making through the sentiment analysis improves the understanding of the public opinion. Opinion mining can be achieved by retrieving data through social network, microblogs, blogs, and search engines. Twitter tweets are an invaluable source of knowing the individual opinion from lots and lots of unique personality. However, the unstructured data to the huge volume and unwanted punctuation used in context and the emoticons used in the context is the main task to analyze the efficiency data and with greater accuracy. Most of the existing computational methods/algorithms identify sentiment on unstructured data via algorithms on machine learning like (BOW) bags of word approach. Here is the work of both the supervised and unsupervised approaches on various training datasets used. Automatic generation of the sentiment for the tweets extracted is provided by the unsupervised approach. And various algorithms in machine learning are used to determine the sentiment analysis they are as Maximum entropy (ME), Multinomial Naïve Bayes (MNB) and support vector machines (SVM) and are used to identify sentiment from the tweets. Here in this work I have achieved an accuracy of 87% unsupervised 97% in supervised approach. The ngram, unigram, bigram, and parts-of-speech (POS) were combined together to identify the hidden emotion and sentiment in the context that mentioned in the tweet that are all in an unstructured format. The lexicon-based approach 75.20% is achieved, based on the sentiment prediction the opinion is given.


Opinion Unstructured data Ngram Bigram Unigram Supervised approach Unsupervised approach Decision-making 


  1. 1.
    N. Jindal, B. Liu, Mining comparative sentences and relations, in Proceedings of the 21st National Conference on Artificial Intelligence, vol 2, pp. 1331–1336 (2006)Google Scholar
  2. 2.
    E. Cambria, B. Schuller, Y. Xia, C. Havasi, New avenues in opinion mining and sentiment analysis. IEEE Intell. Syst. 28(2), 15–21 (2013)CrossRefGoogle Scholar
  3. 3.
    R. Chen, W. Xu, The determinants of online customer ratings: a combined domain ontology and topic text analytics approach. Electron. Commer. Res. (2016)Google Scholar
  4. 4.
    R. Feldman, M. Fresco, J. Goldenberg, O. Netzer, L. Ungar, Extracting product comparisons from discussion boards, in Seventh IEEE International Conference on Data Mining (ICDM 2007), pp. 469–474 (2007)Google Scholar
  5. 5.
    Y. Li, Z. Qin, W. Xu, J. Guo, A holistic model of mining product aspects and associated sentiments from online reviews. Multimedia Tools Appl. 74(23), 10177–10194 (2015). Scholar
  6. 6.
    B. Liu, Opinion mining and sentiment analysis, in Web Data Mining: Exploring Hyperlinks, Contents, and Usage Data (pp. 459–526) (2011). Scholar
  7. 7.
    Y. Ma, G. Chen, Q. Wei, Finding users preferences from large-scale online reviews for personalized recommendation. Electron. Commer. Res. 17(1), 3–29 (2017).
  8. 8.
    N. Godbole, M. Srinivasaiah, S. Skiena, Large-scale sentiment analysis for news and blogs. Proc. Int. Conf. Weblogs Soc. Media (ICWSM) 7(21), 219–222 (2007)Google Scholar
  9. 9.
    M. Van de Kauter, D. Breesch, V. Hoste, Fine-grained analysis of explicit and implicit sentiment in financial news articles. Expert Syst. Appl. 42(11), 4999–5010 (2015). Scholar
  10. 10.
    J. Li, S. Fong, Y. Zhuang, R. Khoury, Hierarchical classification in text mining for sentiment analysis of online news. Soft Comput. 20, 3411–3420 (2015). Scholar
  11. 11.
    P. Liu, J.A. Gulla, L. Zhang, Dynamic topic-based sentiment analysis of large-scale online news, in Proceedings of the 17th International Conference on Web Information Systems Engineering (pp. 3–18) (2016)Google Scholar
  12. 12.
    S.Y.K. Mo, A. Liu, S.Y. Yang, News sentiment to market impact and its feedback effect. Environ. Syst. Decisions 36(2), 158–166 (2016). Scholar
  13. 13.
    A.K. Nassirtoussi, S. Aghabozorgi, T.Y. Wah, D.C.L. Ngo, Text mining of newsheadlines for forex market prediction: A multi-layer dimension reduction algorithm with semantics and sentiment. Expert Syst. Appl. 42(1), 306–324 (2015). Scholar
  14. 14.
    X. Ding, B. Liu, P.S. Yu, A holistic lexicon-based approach to opinion mining, in Proceedings of the 2008 international conference on web search and data mining, pp. 231–240 (2008).
  15. 15.
    A. Esuli, F. Sebastiani, Sentiwordnet: a publicly available lexical resource for opinion mining, in Proceedings of 5th language resources and evaluation, vol 6, pp. 417–422 (2006)Google Scholar
  16. 16.
    B. Ohana, Opinion mining with the sentiwordnet lexical resource. M.Sc. dissertation, Dublin Institute of Technology (2009)Google Scholar
  17. 17.
    A. Hamouda, M. Rohaim, Reviews classification using sentiwordnet lexicon, in World Congress on Computer Science and Information Technology (2011)Google Scholar
  18. 18.
    G. Fei, B. Liu, M. Hsu, M. Castellanos, R. Ghosh, A dictionary-based approach to identifying aspects implied by adjectives for opinion mining, in Proceedings of 24th International Conference on Computational Linguistics, p. 309 (2012)Google Scholar

Copyright information

© Springer Nature Singapore Pte Ltd. 2019

Authors and Affiliations

  1. 1.Vellore Institute of TechnologyVelloreIndia

Personalised recommendations