Abstract
From a real-world perspective, good enough has been achieved in the core representations and evolutionary strategies of genetic programming assuming state-of-the-art algorithms and implementations are being used. What is needed for industrial symbolic regression are tools to (a) explore and refine the data, (b) explore the developed model space and extract insight and guidance from the available sample of the infinite possibilities of model forms and (c) identify appropriate models for deployment as predictors, emulators, etc. This chapter focuses on the approaches used in DataModeler to address the modeling life cycle. A special focus in this chapter is the identification of driving variables and metavariables. Exploiting the diversity of search paths followed during independent evolutions and, then, looking at the distributions of variables and metavariable usage also provides an opportunity to gather key insights. The goal in this framework, however, is not to replace the modeler but, rather, to augment the inclusion of context and collection of insight by removing mechanistic requirements and facilitating the ability to think. We believe that the net result is higher quality and more robust models.
Key words
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Bleuler S, Brack M, Thiele L, Zitzler E (2001) Multiobjective Genetic Programming: Reducing Bloat by Using SPEA2. In: Congress on Evolutionary Computation (CEC 2001), IEEE, Piscataway, NJ, pp 536–543
Evolved Analytics (2011) DataModeler Release 8.0 Documentation. Evolved Analytics LLC, URL http://www.evolved-analytics.com
Kotanchek M, Smits G, Vladislavleva E (2006) Pursuing the pareto paradigm tournaments, algorithm variations & ordinal optimization. In: Riolo RL, Soule T, Worzel B (eds) Genetic Programming Theory and Practice IV, Genetic and Evolutionary Computation, vol 5, Springer, Ann Arbor, chap 12, pp 167–186
Kotanchek M, Smits G, Vladislavleva E (2008) Exploiting trustable models via pareto GP for targeted data collection. In: Riolo RL, Soule T, Worzel B (eds) Genetic Programming Theory and Practice VI, Genetic and Evolutionary Computation, Springer, Ann Arbor, chap 10, pp 145–163
Kotanchek ME, Vladislavleva EY, Smits GF (2009) Symbolic regression via GP as a discovery engine: Insights on outliers and prototypes. In: Riolo RL, O’Reilly UM, McConaghy T (eds) Genetic Programming Theory and Practice VII, Genetic and Evolutionary Computation, Springer, Ann Arbor, chap 4, pp 55–72
McConaghy T (2008) Variation-aware structural synthesis and knowledge extraction of analog circuits. PhD thesis, Katholieke Universiteit Leuven, Leuven, Belgium
Moore JH, White BC (2006) Genome-wide genetic analysis using genetic programming: The critical need for expert knowledge. In: Riolo RL, Soule T, Worzel B (eds) Genetic Programming Theory and Practice IV, Genetic and Evolutionary Computation, vol 5, Springer, Ann Arbor, chap 11, pp –
Schmidt M, Lipson H (2009) Symbolic regression of implicit equations. In: Riolo RL, O’Reilly UM, McConaghy T (eds) Genetic Programming Theory and Practice VII, Genetic and Evolutionary Computation, Springer, Ann Arbor, chap 5, pp 73–85
Smits G, Kotanchek M (2004) Pareto-front exploitation in symbolic regression. In: O’Reilly UM, Yu T, Riolo RL, Worzel B (eds) Genetic Programming Theory and Practice II, Springer, Ann Arbor, chap 17, pp 283–299, DOI doi:10.1007/0-387-23254-0-17
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer Science+Business Media New York
About this chapter
Cite this chapter
Kotanchek, M.E., Vladislavleva, E., Smits, G. (2013). Symbolic Regression Is Not Enough: It Takes a Village to Raise a Model. In: Riolo, R., Vladislavleva, E., Ritchie, M., Moore, J. (eds) Genetic Programming Theory and Practice X. Genetic and Evolutionary Computation. Springer, New York, NY. https://doi.org/10.1007/978-1-4614-6846-2_13
Download citation
DOI: https://doi.org/10.1007/978-1-4614-6846-2_13
Published:
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4614-6845-5
Online ISBN: 978-1-4614-6846-2
eBook Packages: Computer ScienceComputer Science (R0)