Where Should We Stop? An Investigation on Early Stopping for GP Learning

Nguyen, Thi Hien; Nguyen, Xuan Hoai; McKay, Bob; Nguyen, Quang Uy

doi:10.1007/978-3-642-34859-4_39

Thi Hien Nguyen²¹,
Xuan Hoai Nguyen²²,
Bob McKay²³ &
…
Quang Uy Nguyen²¹

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 7673))

Included in the following conference series:

Asia-Pacific Conference on Simulated Evolution and Learning

1592 Accesses
2 Citations

Abstract

We investigate the impact of early stopping on the speed and accuracy of Genetic Programming (GP) learning from noisy data. Early stopping, using a popular stopping criterion, maintains the generalisation capacity of GP while significantly reducing its training time.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Koza, J.R.: Genetic Programming: On the Programming of Computers by Means of Natural Selection. The MIT Press, Cambridge (1992)
MATH Google Scholar
Poli, R., McPhee, W.L.N.: A Field Guide to Genetic Programming (2008), http://lulu.com
Mitchell, T.M.: Machine Learning. McGraw Hill (1997)
Google Scholar
Costelloe, D., Ryan, C.: On Improving Generalisation in Genetic Programming. In: Vanneschi, L., Gustafson, S., Moraglio, A., De Falco, I., Ebner, M. (eds.) EuroGP 2009. LNCS, vol. 5481, pp. 61–72. Springer, Heidelberg (2009)
Chapter Google Scholar
Foreman, N., Evett, M.: Preventing overfitting in GP with canary functions. In: Proceedings of the 2005 Conference on Genetic and Evolutionary Computation (GECCO 2005), pp. 1779–1780. ACM (2005)
Google Scholar
Gagné, C., Schoenauer, M., Parizeau, M., Tomassini, M.: Genetic Programming, Validation Sets, and Parsimony Pressure. In: Collet, P., Tomassini, M., Ebner, M., Gustafson, S., Ekárt, A. (eds.) EuroGP 2006. LNCS, vol. 3905, pp. 109–120. Springer, Heidelberg (2006)
Chapter Google Scholar
Kushchu, I.: Genetic programming and evolutionary generalization. IEEE Transactions on Evolutionary Computation 6, 431–442 (2002)
Article Google Scholar
Uy, N.Q., Hien, N.T., Hoai, N.X., O’Neill, M.: Improving the Generalisation Ability of Genetic Programming with Semantic Similarity based Crossover. In: Esparcia-Alcázar, A.I., Ekárt, A., Silva, S., Dignum, S., Uyar, A.Ş. (eds.) EuroGP 2010. LNCS, vol. 6021, pp. 184–195. Springer, Heidelberg (2010)
Chapter Google Scholar
Panait, L., Luke, S.: Methods for Evolving Robust Programs. In: Cantú-Paz, E., Foster, J.A., Deb, K., Davis, L., Roy, R., O’Reilly, U.-M., Beyer, H.-G., Kendall, G., Wilson, S.W., Harman, M., Wegener, J., Dasgupta, D., Potter, M.A., Schultz, A., Dowsland, K.A., Jonoska, N., Miller, J., Standish, R.K. (eds.) GECCO 2003. LNCS, vol. 2724, pp. 1740–1751. Springer, Heidelberg (2003)
Chapter Google Scholar
Paris, G., Robilliard, D., Fonlupt, C.: Exploring Overfitting in Genetic Programming. In: Liardet, P., Collet, P., Fonlupt, C., Lutton, E., Schoenauer, M. (eds.) EA 2003. LNCS, vol. 2936, pp. 267–277. Springer, Heidelberg (2004)
Chapter Google Scholar
Vanneschi, L., Gustafson, S.: Using crossover based similarity measure to improve genetic programming generalization ability. In: Proceedings of the 11th Annual Conference on Genetic and Evolutionary Computation (GECCO 2009), pp. 1139–1146. ACM (2009)
Google Scholar
Prechelt, L.: Early Stopping - But When? In: Orr, G.B., Müller, K.-R. (eds.) NIPS-WS 1996. LNCS, vol. 1524, pp. 55–69. Springer, Heidelberg (1998)
Chapter Google Scholar
Finno, W., Hergert, F., Zimmermann, H.: Improving model selection by nonconvergent methods. Neural Networks 6, 771–783 (1993)
Article Google Scholar
Zhang, B.T., Muhlenbein, H.: Balancing accuracy and parsimony in genetic programming. Evolutionary Computation 3, 17–38 (1995)
Article Google Scholar
Hooper, D., Flann, N.: Improving the accuracy and robustness of genetic programming through expression simplification. In: Proceedings of the First Annual Conference on Genetic Programming 1996, vol. 428. MIT Press (1996)
Google Scholar
Becker, L., Seshadri, M.: Comprehensibility and overfitting avoidance in genetic programming for technical trading rules. Technical report, Worcester Polytechnic Institute (2003)
Google Scholar
Liu, Y., Khoshgoftaar, T.: Reducing overfitting in genetic programming models for software quality classification. In: Proceedings of the Eighth IEEE Symposium on International High Assurance Systems Engineering, pp. 56–65 (2004)
Google Scholar
Silva, S., Vanneschi, L.: Operator equalisation, bloat and overfitting: a study on human oral bioavailability prediction. In: Proceedings of the 11th Annual Conference on Genetic and Evolutionary Computation (GECCO 2009), pp. 1115–1122 (2009)
Google Scholar
Tuite, C., Agapitos, A., O’Neill, M., Brabazon, A.: Early stopping criteria to counteract overfitting in genetic programming. In: Proceedings of the 13th Annual Conference Companion on Genetic and Evolutionary Computation, GECCO 2011, pp. 203–204. ACM, New York (2011)
Google Scholar
Hien, N.T., Hoai, N.X., Uy, N.Q., McKay, R.: Where should we stop? - an investigation on early stopping for gp learning. Technical Report TRSNUSC:2011:001, Strutural Complexity Laboratory, Seoul National University, Seoul, Korea (February 2011)
Google Scholar
Francone, F., Nordin, P., Banzhaf, W.: Benchmarking the generalization capabilities of a compiling genetic programming system using sparse data sets. In: Proceedings of the First Annual Conference on Genetic Programming 1996, pp. 72–80. MIT Press (1996)
Google Scholar
Iba, H.: Bagging, boosting, and bloating in genetic programming. In: Proceedings of the Genetic and Evolutionary Computation Conference, pp. 1053–1060. Morgan Kaufmann (1999)
Google Scholar
Paris, G., Robilliard, D., Fonlupt, C.: Exploring Overfitting in Genetic Programming. In: Liardet, P., Collet, P., Fonlupt, C., Lutton, E., Schoenauer, M. (eds.) EA 2003. LNCS, vol. 2936, pp. 267–277. Springer, Heidelberg (2004)
Chapter Google Scholar
Mahler, S., Robilliard, D., Fonlupt, C.: Tarpeian Bloat Control and Generalization Accuracy. In: Keijzer, M., Tettamanzi, A.G.B., Collet, P., van Hemert, J., Tomassini, M. (eds.) EuroGP 2005. LNCS, vol. 3447, pp. 203–214. Springer, Heidelberg (2005)
Chapter Google Scholar
Keijzer, M.: Improving Symbolic Regression with Interval Arithmetic and Linear Scaling. In: Ryan, C., Soule, T., Keijzer, M., Tsang, E.P.K., Poli, R., Costa, E. (eds.) EuroGP 2003. LNCS, vol. 2610, pp. 70–82. Springer, Heidelberg (2003)
Chapter Google Scholar
Gustafson, S., Burke, E.K., Krasnogor, N.: On improving genetic programming for symbolic regression. In: Proceedings of the 2005 IEEE Congress on Evolutionary Computation, vol. 1, pp. 912–919. IEEE Press, Edinburgh (2005)
Chapter Google Scholar
Vanneschi, L., Castelli, M., Silva, S.: Measuring bloat, overfitting and functional complexity in genetic programming. In: Proceedings of the 12th Annual Conference on Genetic and Evolutionary Computation (GECCO 2010), pp. 877–884. ACM (2010)
Google Scholar
Shafi, K., Abbass, H.A., Zhu, W.: The Role of Early Stopping and Population Size in XCS for Intrusion Detection. In: Wang, T.-D., Li, X., Chen, S.-H., Wang, X., Abbass, H.A., Iba, H., Chen, G.-L., Yao, X. (eds.) SEAL 2006. LNCS, vol. 4247, pp. 50–57. Springer, Heidelberg (2006)
Chapter Google Scholar
Blake, C., Keogh, E., Merz, C.J.: UCI machine learning repository (1998)
Google Scholar
Vlachos, P.: Statlib project repository (2000)
Google Scholar

Download references

Author information

Authors and Affiliations

Le Quy Don University, Vietnam
Thi Hien Nguyen & Quang Uy Nguyen
Hanoi University, Vietnam
Xuan Hoai Nguyen
Seoul National University, Korea
Bob McKay

Authors

Thi Hien Nguyen
View author publications
You can also search for this author in PubMed Google Scholar
Xuan Hoai Nguyen
View author publications
You can also search for this author in PubMed Google Scholar
Bob McKay
View author publications
You can also search for this author in PubMed Google Scholar
Quang Uy Nguyen
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Faculty of Information Technology, Le Quy Don Technical University, 100 Hoang Quoc Viet Street, Cau Giay District,, Hanoi, Vietnam
Lam Thu Bui
School of Computer Engineering, Nanyang Technological University, Block N4, 2b-39, Nanyang Avenue, 639798,, Singapore, Singapore
Yew Soon Ong
Faculty of Information Technology,, Hanoi University, km9 Nguyen Trai Road, Hanoi, Vietnam
Nguyen Xuan Hoai
Graduate School of Engineering, Department of Computer Science, Osaka Prefecture University, 1-1 Gakuen-cho, Nakaku, 599-8531, Sakai, Osaka, Japan
Hisao Ishibuchi
School of Electrical and Electronic Engineering, Nanyang Technological University, Block N4, 2b-39, Nanyang Avenue, 639798, Singapore, Singapore
Ponnuthurai Nagaratnam Suganthan

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Nguyen, T.H., Nguyen, X.H., McKay, B., Nguyen, Q.U. (2012). Where Should We Stop? An Investigation on Early Stopping for GP Learning. In: Bui, L.T., Ong, Y.S., Hoai, N.X., Ishibuchi, H., Suganthan, P.N. (eds) Simulated Evolution and Learning. SEAL 2012. Lecture Notes in Computer Science, vol 7673. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-34859-4_39

Download citation

DOI: https://doi.org/10.1007/978-3-642-34859-4_39
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-34858-7
Online ISBN: 978-3-642-34859-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics