Skip to main content

Skyline-Based Feature Selection for Polarity Classification in Social Networks

  • Conference paper
  • First Online:

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 10438))

Abstract

This paper deals with the feature selection in sentiment analysis for the purpose of polarity classification. We propose a method for selecting a subset of non-redundant and discriminating features, providing better performance in classification. This method relies on the skyline paradigm often used in multi criteria decision and Database fields. To demonstrate the effectiveness of our method with regard to dimensionality reduction and classification rate, some experiments are conducted on real data sets.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Notes

  1. 1.

    https://www.cs.cornell.edu/people/pabo/movie-review-data/.

  2. 2.

    http://ankara.lti.cs.cmu.edu/side/download.html.

References

  1. Maks, I., Vossen, P.: A lexicon model for deep sentiment analysis and opinion mining applications. Decis. Support Syst. 53, 680–688 (2012)

    Article  Google Scholar 

  2. Neviarouskaya, A., Prendinger, H., Ishizuka, M.: SentiFul: a lexicon for sentiment analysis. IEEE Trans. Affect. Comput. 2, 22–36 (2011)

    Article  Google Scholar 

  3. Boiy, E., Moens, M.F.: A machine learning approach to sentiment analysis in multilingual Web texts. Inf. Retrieval 12, 526–558 (2009)

    Article  Google Scholar 

  4. Turney, P.D.: Mining the Web for synonyms: PMI-IR versus LSA on TOEFL. In: Raedt, L., Flach, P. (eds.) ECML 2001. LNCS, vol. 2167, pp. 491–502. Springer, Heidelberg (2001). doi:10.1007/3-540-44795-4_42

    Chapter  Google Scholar 

  5. Turney, P.D.: Thumbs up or thumbs down? Semantic orientation applied to unsupervised classification of reviews. In: 40th Annual Meeting on Association for Computational Linguistics, pp. 417–424. ACL, Stroudsburg (2002)

    Google Scholar 

  6. Cortes, C., Vapnik, V.: Support-vector networks. Mach. Learn. 20, 273–297 (1995)

    MATH  Google Scholar 

  7. Murphy, K.P.: Naive Bayes Classifiers. University of British Columbia (2006)

    Google Scholar 

  8. Kumar, V., Minz, S.: Feature selection. SmartCR 4, 211–229 (2014)

    Google Scholar 

  9. Börzsönyi, S., Kossmann, D., Stocker, K.: The skyline operator. In: 17th International Conference on Data Engineering, pp. 421–430. IEEE, New York (2001)

    Google Scholar 

  10. Abbasi, A., Chen, H., Salem, A.: Sentiment analysis in multiple languages: feature selection for opinion classification in web forums. ACM Trans. Inf. Syst. 26, 12 (2008)

    Google Scholar 

  11. O’Keefe, T., Koprinska, I.: Feature selection and weighting methods in sentiment analysis. In: Proceedings of the 14th Australasian Document Computing Symposium, Sydney, pp. 67–74 (2009)

    Google Scholar 

  12. Duric, A., Song, F.: Feature selection for sentiment analysis based on content and syntax models. Decis. Support Syst. 53, 704–711 (2012)

    Article  Google Scholar 

  13. Xu, H., Zhang, F., Wang, W.: Implicit feature identification in Chinese reviews using explicit topic mining model. Knowl.-Based Syst. 76, 166–175 (2015)

    Article  Google Scholar 

  14. Wang, S., Li, D., Song, X., Wei, Y., Li, H.: A feature selection method based on improved fishers discriminant ratio for text sentiment classification. Expert Syst. Appl. 38, 8696–8702 (2011)

    Article  Google Scholar 

  15. Wang, Y., Li, Z., Liu, J., He, Z., Huang, Y., Li, D.: Word vector modeling for sentiment analysis of product reviews. In: Zong, C., Nie, J.Y., Zhao, D., Feng, Y. (eds.) NLPCC 2014. CCIS, vol. 496, pp. 168–180. Springer, Heidelberg (2014). doi:10.1007/978-3-662-45924-9_16

    Google Scholar 

  16. Onan, A., Korukoğlu, S.: A feature selection model based on genetic rank aggregation for text sentiment classification. J. Inf. Sci. 43, 25–38 (2015)

    Article  Google Scholar 

  17. Kalaivani, P., Shunmuganathan, K.L.: Feature reduction based on genetic algorithm and hybrid model for opinion mining. Sci. Program.-Neth. 12, 15–26 (2015)

    Google Scholar 

  18. Shaw, J.: Term-relevance computations and perfect retrieval performance. Commun. Comput. Inf. Sci. 31, 491–498 (1995)

    Google Scholar 

  19. Belkasmi, D., Hadjali, A.: MP2R: a human-centric skyline relaxation approach. In: Andreasen, T., et al. (eds.) Flexible Query Answering Systems 2015. AISC, vol. 400, pp. 227–241. Springer, Cham (2016). doi:10.1007/978-3-319-26154-6_18

    Chapter  Google Scholar 

  20. Hall, M., Frank, E., Holmes, G., Pfahringer, B.: The WEKA data mining software: an update. ACM SIGKDD Explor. Newslett. J. 11, 10–18 (2009)

    Article  Google Scholar 

  21. Yang, Y., Pedersen, J.O.: A comparative study on feature selection in text categorization. In: ICML, vol. 97, pp. 412–420 (1997)

    Google Scholar 

  22. Church, K.W., Hanks, P.: Word association norms, mutual information, and lexicography. Comput. Linguist. 16, 22–29 (1990)

    Google Scholar 

  23. Mukras, R., Wiratunga, N., Lothian, R.: Selecting bi-tags for sentiment analysis of text. In: Bramer, M., Coenen, F., Petridis, M. (eds.) Research and Development in Intelligent Systems XXIV, pp. 181–194. Springer, London (2008). doi:10.1007/978-1-84800-094-0_14

    Chapter  Google Scholar 

  24. Banea, C., Mihalcea, R., Wiebe, J.: Sense-level subjectivity in a multilingual setting. Comput. Speech Lang. 28, 7–19 (2014)

    Article  Google Scholar 

  25. Deng, Z.W., Luo, K.H., Yu, H.L.: A study of supervised term weighting scheme for sentiment analysis. Expert Syst. Appl. 41, 3506–3513 (2014)

    Article  Google Scholar 

  26. Liu, Y., Jin, J., Ji, P., Harding, J.A., Fung, R.Y.: Identifying helpful online reviews: a product designer’s perspective. Comput. Aided Des. 45, 180–194 (2013)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Fayçal Rédha Saidani .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer International Publishing AG

About this paper

Cite this paper

Saidani, F.R., Hadjali, A., Rassoul, I., Belkasmi, D. (2017). Skyline-Based Feature Selection for Polarity Classification in Social Networks. In: Benslimane, D., Damiani, E., Grosky, W., Hameurlain, A., Sheth, A., Wagner, R. (eds) Database and Expert Systems Applications. DEXA 2017. Lecture Notes in Computer Science(), vol 10438. Springer, Cham. https://doi.org/10.1007/978-3-319-64468-4_29

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-64468-4_29

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-64467-7

  • Online ISBN: 978-3-319-64468-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics