Strengthening the Forward Variable Selection Stopping Criterion

Herrera, Luis Javier; Rubio, G.; Pomares, H.; Paechter, B.; Guillén, A.; Rojas, I.

doi:10.1007/978-3-642-04277-5_22

Luis Javier Herrera¹⁸,
G. Rubio¹⁸,
H. Pomares¹⁸,
B. Paechter¹⁸,
A. Guillén¹⁸ &
…
I. Rojas¹⁸

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 5769))

Included in the following conference series:

International Conference on Artificial Neural Networks

3689 Accesses
3 Citations

Abstract

Given any modeling problem, variable selection is a preprocess step that selects the most relevant variables with respect to the output variable. Forward selection is the most straightforward strategy for variable selection; its application using the mutual information is simple, intuitive and effective, and is commonly used in the machine learning literature. However the problem of when to stop the forward process doesn’t have a direct satisfactory solution due to the inaccuracies of the Mutual Information estimation, specially as the number of variables considered increases. This work proposes a modified stopping criterion for this variable selection methodology that uses the Markov blanket concept. As it will be shown, this approach can increase the performance and applicability of the stopping criterion of a forward selection process using mutual information.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

François, D., Rossi, F., Wertz, V., Verleysen, M.: Resampling methods for parameter-free and robust feature selection with mutual information. Neurocomputing 70, 1276–1288 (2007)
Article Google Scholar
Rossi, F., Lendasse, A., François, D., Wertz, V., Verleysen, M.: Mutual information for the selection of relevant variables in spectrometric nonlinear modelling. Chem. and Int. Lab. Syst. 80, 215–226 (2006)
Article Google Scholar
Kraskov, A., Stogbauer, H., Grassberger, P.: Estimating mutual information. Phys.Rev. E 69, 66138 (2004)
MathSciNet Google Scholar
Bellman, R.: Adaptive Control Processes: A Guided Tour. Princeton University Press, Princeton (1961)
Book MATH Google Scholar
Koller, D., Sahami, M.: Toward optimal feature selection. In: Proc. Int. Conf. on Machine Learning, pp. 284–292 (1996)
Google Scholar
Herrera, L., Pomares, H., Rojas, I., Verleysen, M., Guillén, A.: Effective input variable selection for function approximation. In: Kollias, S.D., Stafylopatis, A., Duch, W., Oja, E. (eds.) ICANN 2006. LNCS, vol. 4131, pp. 41–50. Springer, Heidelberg (2006)
Chapter Google Scholar
Suykens, J., Gestel, T.V., Brabanter, J.D., Moor, J.D., Vandewalle, B.: Least Squares Support Vector Machines. World Scientific, Singapore (2002)
Book MATH Google Scholar
Saunders, C., Gammerman, A., Vovk, V.: Ridge regression learning algorithm in dual variables. In: Proceedings of the 15th International Conference on Machine Learning, pp. 515–521. Morgan Kaufmann, San Francisco (1998)
Google Scholar
An, S., Liu, W., Venkatesh, S.: Fast cross-validation algorithms for least squares support vector machine and kernel ridge regression. Pattern Recogn. 40(8), 2154–2162 (2007)
Article MATH Google Scholar
Guillen, A., Rojas, I., Rubio, G., Pomares, H., Herrera, L., Gonzalez, J.: A new interface for mpi in matlab and its application over a genetic algorithm. In: ESTSP 2008: Proceedings of the European Symposium on Time Series Prediction, pp. 37–46 (2008)
Google Scholar
Hyndman, R.: Time series data library (1994), http://www-personal.buseco.monash.edu.au/~hyndman/TSDL/hydrology.html
Herrera, L., Pomares, H., Rojas, I., Guillén, A., Prieto, A., Valenzuela, O.: Recursive prediction for long term time series forecasting using advanced models. Neurocomputing 70, 2870–2880 (2007)
Article Google Scholar
Astakhov, S., Grassberger, P., Kraskov, A., Stögbauer, H.: Mutual information least dependent component analysis (2004), http://www.klab.caltech.edu/~kraskov/MILCA/

Download references

Author information

Authors and Affiliations

Department of Computer Architecture and Technology, University of Granada, Spain
Luis Javier Herrera, G. Rubio, H. Pomares, B. Paechter, A. Guillén & I. Rojas

Authors

Luis Javier Herrera
View author publications
You can also search for this author in PubMed Google Scholar
G. Rubio
View author publications
You can also search for this author in PubMed Google Scholar
H. Pomares
View author publications
You can also search for this author in PubMed Google Scholar
B. Paechter
View author publications
You can also search for this author in PubMed Google Scholar
A. Guillén
View author publications
You can also search for this author in PubMed Google Scholar
I. Rojas
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Dipartimento di Elettronica, Politecnico di Milano, Piazza L. da Vinci 32, 20133, Milano, Italy
Cesare Alippi
Department of Electrical and Computer Engineering, University of Cyprus, 75 Kallipoleos Street, 1678, Nicosia, Cyprus
Marios Polycarpou , Christos Panayiotou & Georgios Ellinas , &

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Herrera, L.J., Rubio, G., Pomares, H., Paechter, B., Guillén, A., Rojas, I. (2009). Strengthening the Forward Variable Selection Stopping Criterion. In: Alippi, C., Polycarpou, M., Panayiotou, C., Ellinas, G. (eds) Artificial Neural Networks – ICANN 2009. ICANN 2009. Lecture Notes in Computer Science, vol 5769. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-04277-5_22

Download citation

DOI: https://doi.org/10.1007/978-3-642-04277-5_22
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-04276-8
Online ISBN: 978-3-642-04277-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics