Skip to main content

A Novel Feature Selection Method Based on Genetic Algorithm for Opinion Mining of Social Media Reviews

  • Conference paper
  • First Online:
Book cover Information, Communication and Computing Technology (ICICCT 2018)

Abstract

Use of social media for sharing the opinions about the products or the services by individuals or business organizations is becoming very common nowadays. Consumers are keen to share their views on certain products or commodities. This leads to the generation of large amount of unstructured social media data. Thus text data is being formed gradually in many areas like automated business, education, health care, show business and so on. Opinion mining, the sub field of text mining, deals with mining of review text and classifying the opinions or the sentiments of that text as positive or negative. The work in this paper develops a framework for opinion mining. It includes a novel feature selection method called Most Persistent Feature Selection (MPFS) for feature selection and a genetic algorithm (GA) based optimization technique for optimizing the feature set. MPFS method uses information gain of the features in the review documents. The feature set thus produced is optimized using GA technique to get the most effective feature set for sentiment classification. Then a Support Vector Machine (SVM) algorithm is used for classifying the sentiments of reviews expressed in text with the proposed feature selection and optimization method. The classifier models generated show the acceptable performance in terms of accuracy when compared with the other existing models.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Liu, B.: Sentiment Analysis and Opinion Mining, vol. 5, no. 1. Morgan & Claypool Publishers, San Rafael, May 2012

    Google Scholar 

  2. Pang, B., Lee, L.: Opinion mining and sentiment analysis (2008)

    Article  Google Scholar 

  3. Pang, B., Lee, L.: A sentimental education: sentiment analysis using subjectivity summarization based on minimum cuts. In: Proceedings of ACL (2004)

    Google Scholar 

  4. Hu, M., Liu, B.: Mining and summarizing customer reviews. In: Proceedings ACM SIGKDD, pp. 168–177 (2004)

    Google Scholar 

  5. Wang, S., Manning, C.D.: Baselines and bigrams: simple, good sentiment and topic classification. In: Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, pp. 90–94, Jeju, Republic of Korea, 8–14 July 2012

    Google Scholar 

  6. Pang, B., Lee, L., Vaithyanathan, S.: Thumbs up? sentiment classification using machine learning techniques. In: Proceedings of EMNLP, pp. 79–86 (2002)

    Google Scholar 

  7. Jurek, A., Mulvenna, M.D., Bi, Y.: Improved lexicon-based sentiment analysis for social media analytics. Secur. Inform. 4(1), 9 (2015)

    Google Scholar 

  8. Fu, G., Wang, X.: Chinese Sentence-Level Sentiment Classification Based on Fuzzy Sets, Coling 2010: Poster Volume, pp. 312–319, Beijing, August 2010

    Google Scholar 

  9. Fang, X., Zhan, J.: Sentiment analysis using product review data. J. Big Data 2(1), 5 (2015)

    Google Scholar 

  10. Tripathy, A., Anand, A., Rath, S.K.: Classification of sentiment reviews using N-gram machine learning approach. Expert Syst. Appl. 57, 117–126 (2016)

    Article  Google Scholar 

  11. Sohail, S.S., Siddiqui, J., Ali, R.: Feature extraction and analysis of online reviews for the recommendation of books using opinion mining technique. Perspect. Sci. 8, 754–756 (2016)

    Article  Google Scholar 

  12. Zhou, X., Wan, X., Xiao, J.: CL opinion miner: opinion target extraction in a cross-language scenario. In: IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 23, no. 4, April 2015

    Article  Google Scholar 

  13. Tartir, S., Nabi, I.A.: Semantic sentiment analysis in arabic social media. J. King Saud Univ. Comput. Inf. Sci. 29, 229–233 (2017)

    Article  Google Scholar 

  14. Tripathy, A., Anand, A., Rath, S.K.: Document-level sentiment classification using hybrid machine learning approach. Knowl. Inf. Syst. 53, 805 (2017)

    Article  Google Scholar 

  15. Zainuddin, N., Selamat, A.: Sentiment Analysis Using Support Vector Machine, IEEE I4CT, Langkawi, Kedah, Malaysia, pp. 333–337 (2014)

    Google Scholar 

  16. Jurafsky, D., Martin, J.H.: Naive Bayes and Sentiment Classification, Speech and Language Processing, 7 November 2016

    Google Scholar 

  17. Manek, A.S., Shenoy, P.D., Mohan, M.C., Venugopal, K.: Aspect term extraction for sentiment analysis in large movie reviews using Gini Index feature selection method and SVM classifier. world wide web 20(2), 135–154 (2016)

    Article  Google Scholar 

  18. Chunping, O., Yongbin, L., Shuqing, Z., Xiaohua, Y.: Opinion objects identification and sentiment analysis. Int. J. Database Theor. Appl. 8(6), 1–12 (2015)

    Article  Google Scholar 

  19. Ferreira, L.C., Dosciatti, M.M., Nievola, J.C., Paraiso, E.C.: Using a genetic algorithm approach to study the impact of imbalanced corpora in sentiment analysis. In: Proceedings of the Twenty-Eighth International Florida Artificial Intelligence Research Society Conference

    Google Scholar 

  20. Catak, F., Bilgem, T.: Genetic algorithm based feature selection in high dimensional text dataset classification. WSEAS Trans. Inf. Sci. Appl. 12(1), 290–296 (2015)

    Google Scholar 

  21. Gómez, F., Quesada, A.: Genetic algorithms for feature selection in data analytics. www.neuraldesigner.com. Artelnics

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Savita Sangam .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Sangam, S., Shinde, S. (2019). A Novel Feature Selection Method Based on Genetic Algorithm for Opinion Mining of Social Media Reviews. In: Minz, S., Karmakar, S., Kharb, L. (eds) Information, Communication and Computing Technology. ICICCT 2018. Communications in Computer and Information Science, vol 835. Springer, Singapore. https://doi.org/10.1007/978-981-13-5992-7_15

Download citation

  • DOI: https://doi.org/10.1007/978-981-13-5992-7_15

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-13-5991-0

  • Online ISBN: 978-981-13-5992-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics