Skip to main content

A New Way for Combining Filter Feature Selection Methods

  • Conference paper
  • First Online:
  • 1084 Accesses

Part of the book series: Smart Innovation, Systems and Technologies ((SIST,volume 43))

Abstract

This study investigates the issue of obtaining stable ranking from the fusion of the result of multiple filtering methods. Rank aggregation is the process of performing multiple runs of feature selection and then aggregating the results into a final ranked list. However, a fundamental question of is how to aggregate the individual results into a single robust ranked feature list. There are a number of available methods, ranging from simple to complex. Hence we present a new rank aggregation approach. The proposed approach is composed of two stages: in the first we evaluate he similarity and stability of single filtering methods then, in the second we aggregate the results of the stable ones. The obtained results on the Australian and German credit datasets using support vector machine and decision tree confirms that ensemble feature ranking have a major impact in the performance improvement.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD   219.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. Ben Brahim, A., Bouaguel, W., Limam, M.: Feature selection aggregation versus classifiers aggregation for several data dimensionalities. In: Proceedings of the International Conference on Control, Engineering & Information Technology (CEIT13) (2013)

    Google Scholar 

  2. Ben brahim, A., Bouaguel, W., Limam, M.: Combining feature selection and data classification using ensemble approaches: application to cancer diagnosis and credit scoring. In: Francisr, T. (ed.) Case Studies in Intelligent Computing: Achievements and Trendss. CRC Press, Boca Raton (2013)

    Google Scholar 

  3. Fernandez, G.: Statistical data mining using SAS applications. In; Chapman & Hall/Crc: Data Mining and Knowledge Discovery. Taylor and Francis, Boca Raton (2010)

    Google Scholar 

  4. Forman, G.: BNS feature scaling: an improved representation over TF-IDF for SVM text classification. In: Proceedings of the 17th ACM Conference on Information and Knowledge Mining, pp. 263–270. ACM, New York, NY, USA (2008)

    Google Scholar 

  5. Rodriguez, I., Huerta, R., Elkan, C., Cruz, C.S.: Quadratic programming feature selection. J. Mach. Learn. Res. 11(4), 1491–1516 (2010)

    MATH  MathSciNet  Google Scholar 

  6. Saeys, Y., Inza, I.N., Larrañaga, P.: A review of feature selection techniques in bioinformatics. Bioinformatics 23(19), 2507–2517 (2007)

    Google Scholar 

  7. Bouaguel, W., Bel Mufti, G., Limam, M.: A new feature selection technique applied to credit scoring data using a rank aggregation approach based on: optimization, genetic algorithm and similarity. In: Francisr, T. (ed.) Knowledge Discovery & Data Mining (KDDM) for Economic Development: Applications, Strategies and Techniques. CRC Press, Chicago (2014)

    Google Scholar 

  8. Wu, O., Zuo, H., Zhu, M., Hu, W., Gao, J., Wang, H.: Rank aggregation based text feature selection. In: Proceedings of the Web Intelligence, pp. 165–172. (2009)

    Google Scholar 

  9. Wang, C.M., Huang, W.F.: Evolutionary-based feature selection approaches with new criteria for data mining: a case study of credit approval data. Expert Syst. Appl. 36(3), 5900–5908 (2009)

    Article  Google Scholar 

  10. Bouaguel, W., Bel Mufti, G.: An improvement direction for filter selection techniques using information theory measures and quadratic optimization. Int. J. Adv. Res. Artif. Intell. 1(5), 7–11 (2012)

    Google Scholar 

  11. Dittman, D.J., Khoshgoftaar, T.M., Wald, R., Napolitano, A.: Classification performance of rank aggregation techniques for ensemble gene selection. In: Boonthum-Denecke, C., Youngblood, G.M. (eds.) Proceedings of the International Conference of the Florida Artificial Intelligence Research Society (FLAIRS), AAAI Press, Coconut Grove (2013)

    Google Scholar 

  12. Saeys, Y., Abeel, T., Peer, Y.: Robust feature selection using ensemble feature selection techniques. In: Proceedings of the European conference on Machine Learning and Knowledge Discovery in Databases—Part II. ECML PKDD ‘08, pp. 313–325. Springer, Berlin, Heidelberg (2008)

    Google Scholar 

  13. Molina, L.C., Belanche, L., Nebot, A.: Feature selection algorithms: a survey and experimental evaluation. In: Proceedings of the IEEE International Conference on Data Mining, pp. 306–313. IEEE Computer Society (2002)

    Google Scholar 

  14. Dash, M., Liu, H.: Consistency-based search in feature selection. Artif. Intell. 151(1–2), 155–176 (2003)

    Article  MATH  MathSciNet  Google Scholar 

  15. Krishnaiah, P., Kanal, L.: Preface. In: Krishnaiah, P., Kanal, L. (eds.) Classification Pattern Recognition and Reduction of Dimensionality. Handbook of Statistics, vol. 2, pp. v–ix. Elsevier (1982)

    Google Scholar 

  16. Guyon, I., Elisseeff, A.: An introduction to variable and feature selection. J. Mach. Learn. Res. 3(9), 1157–1182 (2003)

    MATH  Google Scholar 

  17. Hastie, T., Tibshirani, R., Friedman, J.: The Elements of Statistical Learning. Springer Series in Statistics. Springer New York Inc, New York (2001)

    Book  MATH  Google Scholar 

  18. Prati, R.C.: Combining feature ranking algorithms through rank aggregation. In: The 2012 International Joint Conference on Neural Networks (IJCNN), pp. 1–8. Brisbane, Australia, 10–15 June 2012

    Google Scholar 

  19. Kalousis, A., Prados, J., Hilario, M.: Stability of feature selection algorithms: a study on high-dimensional spaces. Knowl. Inf. Syst. 12(1), 95–116 (2007)

    Article  Google Scholar 

  20. Pihur, V., Datta, S., Datta, S.: RankAggreg, an R package for weighted rank aggregation. BMC Bioinform. 10(1), 62–72 (2009)

    Article  Google Scholar 

  21. Mak, M.W., Kung, S.Y.: Fusion of feature selection methods for pairwise scoring svm. Neurocomputing 71(16–18), 3104–3113 (2008)

    Article  Google Scholar 

  22. Bouckaert, R.R., Frank, E., Hall, M., Kirkby, R., Reutemann, P., Seewald, A., Scuse, D.: Weka manual (3.7.1) (2009)

    Google Scholar 

  23. Kolde, R., Laur, S., Adler, P., Vilo, J.: Robust rank aggregation for gene list integration and meta-analysis. Bioinformatics 28(4), 573–580 (2012)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Waad Bouaguel .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer India

About this paper

Cite this paper

Bouaguel, W., Limam, M. (2016). A New Way for Combining Filter Feature Selection Methods. In: Nagar, A., Mohapatra, D., Chaki, N. (eds) Proceedings of 3rd International Conference on Advanced Computing, Networking and Informatics. Smart Innovation, Systems and Technologies, vol 43. Springer, New Delhi. https://doi.org/10.1007/978-81-322-2538-6_43

Download citation

  • DOI: https://doi.org/10.1007/978-81-322-2538-6_43

  • Published:

  • Publisher Name: Springer, New Delhi

  • Print ISBN: 978-81-322-2537-9

  • Online ISBN: 978-81-322-2538-6

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics