Skip to main content

Integrated Feature Selection Methods Using Metaheuristic Algorithms for Sentiment Analysis

  • Conference paper
Intelligent Information and Database Systems (ACIIDS 2016)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 9621))

Included in the following conference series:

  • 2383 Accesses

Abstract

In text mining, the feature selection process can potentially improve classification accuracy by reducing the high-dimensional feature space to a low-dimensional feature space resulting in an optimal subset of available features. In this paper, a hybrid method and two meta-heuristic algorithms are employed to find an optimal feature subset. The feature selection task is performed in two steps: first, different feature subsets (called local-solutions) are obtained using a hybrid filter and wrapper approaches to reduce high-dimensional feature space; second, local-solutions are integrated using two meta-heuristic algorithms (namely, the harmony search algorithm and the genetic algorithm) in order to find an optimal feature subset. The results of a wide range of comparative experiments on three widely-used datasets in sentiment analysis show that the proposed method for feature selection outperforms other baseline methods in terms of accuracy.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Ekbal, A., Saha, S.: Combining feature selection and classifier ensemble using a multiobjective simulated annealing approach: application to named entity recognition. Soft. Comput. 17, 1–16 (2013)

    Article  Google Scholar 

  2. Saeys, Y., Inza, I., Larrañaga, P.: A review of feature selection techniques in bioinformatics. Bioinformatics 23, 2507–2517 (2007)

    Article  Google Scholar 

  3. Yousefpour, A., Ibrahim, R., Abdull Hamed, H.N., Hajmohammadi, M.S.: A comparative study on sentiment analysis. Adv. Environ. Biol. 8, 53–68 (2014)

    Google Scholar 

  4. Diao, R., Shen, Q.: Feature selection with harmony search. Syst., Man, Cybern., Part B: Cybern., IEEE Trans. 42, 1509–1523 (2012)

    Article  Google Scholar 

  5. Oreski, S., Oreski, G.: Genetic algorithm-based heuristic for feature selection in credit risk assessment. Expert Syst. Appl. 41, 2052–2064 (2014)

    Article  Google Scholar 

  6. Rogati, M., Yang, Y.: High-performing feature selection for text classification. In: Proceedings of the Eleventh International Conference on Information and knowledge management, pp. 659–661 (2002)

    Google Scholar 

  7. Uğuz, H.: A two-stage feature selection method for text categorization by using information gain, principal component analysis and genetic algorithm. Knowl.-Based Syst. 24, 1024–1032 (2011)

    Article  Google Scholar 

  8. Yousefpour, A., Ibrahim, R., Abdull Hamed, H.N., Hajmohammadi, M.S.: Feature reduction using standard deviation with different subsets selection in sentiment analysis. In: Nguyen, N.T., Attachoo, B., Trawiński, B., Somboonviwat, K. (eds.) ACIIDS 2014, Part II. LNCS, vol. 8398, pp. 33–41. Springer, Heidelberg (2014)

    Chapter  Google Scholar 

  9. Warren Liao, T.: T.: Feature extraction and selection from acoustic emission signals with an application in grinding wheel condition monitoring. Eng. Appl. Artif. Intell. 23, 74–84 (2010)

    Article  Google Scholar 

  10. Wang, Y., Liu, Y., Feng, L., Zhu, X.: Novel feature selection method based on harmony search for email classification. Knowl.-Based Syst. 73, 311–323 (2015)

    Article  Google Scholar 

  11. Blitzer, J., Dredze, M., Pereira, F.: Biographies, bollywood, boom-boxes and blenders: Domain adaptation for sentiment classification. In: ACL, pp. 440–447 (2007)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Roliana Ibrahim .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Yousefpour, A., Ibrahim, R., Hamed, H.N.A., Yokoi, T. (2016). Integrated Feature Selection Methods Using Metaheuristic Algorithms for Sentiment Analysis. In: Nguyen, N.T., Trawiński, B., Fujita, H., Hong, TP. (eds) Intelligent Information and Database Systems. ACIIDS 2016. Lecture Notes in Computer Science(), vol 9621. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-49381-6_13

Download citation

  • DOI: https://doi.org/10.1007/978-3-662-49381-6_13

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-662-49380-9

  • Online ISBN: 978-3-662-49381-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics