Skip to main content

Robust Pareto Front Genetic Programming Parameter Selection Based on Design of Experiments and Industrial Data

  • Chapter

Part of the book series: Genetic and Evolutionary Computation ((GEVO))

Abstract

Symbolic regression based on Pareto-Front GP is a very effective approach for generating high-performance parsimonious empirical models acceptable for industrial applications. The chapter addresses the issue of finding the optimal parameter settings of Pareto-Front GP which direct the simulated evolution toward simple models with acceptable prediction error. A generic methodology based on statistical design of experiments is proposed. It includes determination of the number of replicates by half width confidence intervals, determination of the significant factors by fractional factorial design of experiments, approaching the optimum by steepest ascent/descent, and local exploration around the optimum by Box Behnken design of experiments. The results from implementing the proposed methodology to different types of industrial data sets show that the statistically significant factors are the number of cascades, the number of generations, and the population size. The optimal values for the three parameters have been defined based on second order regression models with R 2 hig herthan 0.97 for small, medium, and large-sized data sets. The robustness of the optimal parameters toward the types of data sets was explored and a robust setting for the three significant parameters was obtained. It reduces the calculation time by 30% to 50% without statistically significant reduction in the mean response.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   129.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD   169.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  • Banzhaf, W., Nordin, P., Keller, R., and Francone, F. (1998). Genetic Programming: An Introduction. Morgan Kaufmann, San Francisco, CA.

    MATH  Google Scholar 

  • Box, G., Hunter, W., and Hunter, J. (2005). Statistics for Experiments: An Itroduction to Design, Data Analysis, and Model Building. Wiley, New York, NY, 2 edition.

    Google Scholar 

  • Castillo, F., Kordon, A., Smits, G., Christenson, B., and Dickerson, D. (2006). Pareto front genetic programming parameter selection based on design of experiments and industrial data. In Proceedings of GECCO 2006.

    Google Scholar 

  • Chan, Kit Yan, Aydin, M. Emin, and Fogarty, Terence C. (2003). New factorial design theoretic crossover operator for parametrical problem. In Ryan, Conor, Soule, Terence, Keijzer, Maarten, Tsang, Edward, Poli, Riccardo, and Costa, Ernesto, editors, Genetic Programming, Proceedings of EuroGP’2003, volume 2610 of LNCS, pages 22–33, Essex. Springer-Verlag.

    Google Scholar 

  • Derringer, G. and Suich, R. (1980). Simultaneous optimization of several response variables. Journal of Quality Technology, 28(1):61–70.

    Google Scholar 

  • Feldt, Robert and Nordin, Peter (2000). Using factorial experiments to evaluate the effect of genetic programming parameters. In Poli, Riccardo, Banzhaf, Wolfgang, Langdon, William B., Miller, Julian F., Nordin, Peter, and Fogarty, Terence C, editors, Genetic Programming, Proceedings of EuroGP’ 2000, volume 1802 of LNCS, pages 271–282, Edinburgh. Springer-Verlag.

    Google Scholar 

  • Kordon, A., Kalos, A., and Adams, B. (2003a). Empirical emulators for process monitoring and optimization. In Proceedings of the IEEE 11th Conference on Control and Automation MED’2003, page 111, Greece. Rhodes.

    Google Scholar 

  • Kordon, A., Smits, G., Kalos, A., and Jordaan, E. (2003b). Robust soft sensor development using genetic programming. In Leardi, R., editor, Nature-Inspired Methods in Chemometrics, Amsterdam. Elsevier.

    Google Scholar 

  • Kordon, Arthur, Castillo, Flor, Smits, Guido, and Kotanchek, Mark (2005). Application issues of genetic programming in industry. In Yu, Tina, Riolo, Rick L., and Worzel, Bill, editors, Genetic Programming Theory and Practice III, volume 9 of Genetic Programming, chapter 16, pages 241–258. Springer, Ann Arbor.

    Google Scholar 

  • Kotanchek, M., Smits, G., and Vladislavleva, E. (2006). Pursuing the pareto paradigm. In Yu, Tina, Riolo, Rick L., and Soule, Terry, editors, Genetic Programming Theory and Practice IV. Kluwer, Ann Arbor.

    Google Scholar 

  • Koza, John R. (1992). Genetic Programming: On the Programming of Computers by Means of Natural Selection. MIT Press, Cambridge, MA, USA.

    MATH  Google Scholar 

  • Montgomery, D. (1995). Design and Analysis of Experiments. Wiley, New York, NY.

    Google Scholar 

  • Myers, R. and Montgomery, D. (1995). Design and Analysis of Experiments.

    Google Scholar 

  • Petrovski, A., Brownless, A., and McCall, J. (2005). Statistical optimization and tuning of ga factors. In Proceedings of the Congress of Evolutionary Computation (CEC’2005), pages 758–764, Edinburgh, UK.

    Google Scholar 

  • Reeves, C. and Wright, C. (1995). An experimental design prospective on genetic algorithms. In Whitley, D. and Vose, M., editors, Foundations of Genetic Algorithms 3, San Mateo, CA. Morgan Kaufmann.

    Google Scholar 

  • Smits, G. and Kotanchek, M. (2004). Pareto front exploitation in symbolic regression. In Yu, Tina, Riolo, Rick L., O’Reilly, U.M., and Worzel, Bill, editors, Genetic Programming Theory and Practice II, pages 283–300. Springer, New York.

    Google Scholar 

  • Welch, B.L. (1951). On the comparison of several mean values: An alternative approach. Biometrika, 38:330–336.

    MATH  MathSciNet  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2007 Springer Science+Business Media, LLC

About this chapter

Cite this chapter

Castillo, F., Kordon, A., Smits, G. (2007). Robust Pareto Front Genetic Programming Parameter Selection Based on Design of Experiments and Industrial Data. In: Riolo, R., Soule, T., Worzel, B. (eds) Genetic Programming Theory and Practice IV. Genetic and Evolutionary Computation. Springer, Boston, MA. https://doi.org/10.1007/978-0-387-49650-4_10

Download citation

  • DOI: https://doi.org/10.1007/978-0-387-49650-4_10

  • Publisher Name: Springer, Boston, MA

  • Print ISBN: 978-0-387-33375-5

  • Online ISBN: 978-0-387-49650-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics