Advertisement

On Stopping Rules in Dependency-Aware Feature Ranking

  • Petr Somol
  • Jiří Grim
  • Jiří Filip
  • Pavel Pudil
Part of the Lecture Notes in Computer Science book series (LNCS, volume 8258)

Abstract

Feature Selection in very-high-dimensional or small sample problems is particularly prone to computational and robustness complications. It is common to resort to feature ranking approaches only or to randomization techniques. A recent novel approach to the randomization idea in form of Dependency-Aware Feature Ranking (DAF) has shown great potential in tackling these problems well. Its original definition, however, leaves several technical questions open. In this paper we address one of these questions: how to define stopping rules of the randomized computation that stands at the core of the DAF method. We define stopping rules that are easier to interpret and show that the number of randomly generated probes does not need to be extensive.

Keywords

dimensionality reduction feature selection randomization stopping rule 

References

  1. 1.
    Fan, J., Li, R.: Statistical challenges with high dimensionality: Feature selection in knowledge discovery (2006)Google Scholar
  2. 2.
    Kuncheva, L.I.: A stability index for feature selection. In: Proc. 25th IASTED International Multi-Conference, AIAP 2007, pp. 390–395. ACTA Press (2007)Google Scholar
  3. 3.
    Kononenko, I.: Estimating attributes: Analysis and extensions of RELIEF. In: Bergadano, F., De Raedt, L. (eds.) ECML 1994. LNCS, vol. 784, pp. 171–182. Springer, Heidelberg (1994)CrossRefGoogle Scholar
  4. 4.
    Gheyas, I.A., Smith, L.S.: Feature subset selection in large dimensionality domains. Pattern Recognition 43(1), 5–13 (2010)CrossRefzbMATHGoogle Scholar
  5. 5.
    Glover, F.W., Kochenberger, G.A. (eds.): Handbook of Metaheuristics. Int. Series in Operations Research & Management Science, vol. 57. Springer (2003)Google Scholar
  6. 6.
    Lai, C., Reinders, M.J.T., Wessels, L.: Random subspace method for multivariate feature selection. Pattern Recogn. Lett. 27(10), 1067–1076 (2006)CrossRefGoogle Scholar
  7. 7.
    Somol, P., Grim, J., Pudil, P.: Fast dependency-aware feature selection in very-high-dimensional pattern recognition. In: Proceedings of the IEEE International Conference on Systems, Man and Cybernetics (SMC), pp. 502–509. IEEE (2011)Google Scholar
  8. 8.
    Liu, H., Motoda, H.: Feature Selection for Knowledge Discovery and Data Mining. Kluwer Academic Publishers, Norwell (1998)CrossRefzbMATHGoogle Scholar
  9. 9.
    Newman, D., Hettich, S., Blake, C., Merz, C.: UCI repository of machine learning databases (1998)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2013

Authors and Affiliations

  • Petr Somol
    • 1
    • 2
  • Jiří Grim
    • 2
  • Jiří Filip
    • 2
  • Pavel Pudil
    • 1
  1. 1.Faculty of ManagementPrague University of EconomicsCzech Republic
  2. 2.Institute of Information Theory and Automation of the AS CRCzech Republic

Personalised recommendations