Skip to main content

Predicting the Toxicity of Chemical Compounds Using GPTIPS: A Free Genetic Programming Toolbox for MATLAB

  • Chapter

Part of the book series: Lecture Notes in Electrical Engineering ((LNEE,volume 70))

Abstract

In this contribution GPTIPS, a free, open source MATLAB toolbox for performing symbolic regression by genetic programming (GP) is introduced. GPTIPS is specifically designed to evolve mathematical models of predictor response data that are “multigene” in nature, i.e. linear combinations of low order non-linear transformations of the input variables. The functionality of GPTIPS is demonstrated by using it to generate an accurate, compact QSAR (quantitative structure activity relationship) model of existing toxicity data in order to predict the toxicity of chemical compounds. It is shown that the low-order “multigene” GP methods implemented by GPTIPS can provide a useful alternative, as well as a complementary approach, to currently accepted empirical modelling and data analysis techniques. GPTIPS and documentation is available for download at http://sites.google.com/site/gptips4matlab/.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   129.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD   169.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. Alfaro-Cid, E., Esparcia-Alcázar, A.I., Moya, P., Femenia-Ferrer, B., Sharman, K., Merelo, J.J.: Modeling pheromone dispensers using genetic programming. In: Lecture Notes in Computer Science, vol. 5484/2009, pp. 635–644. Springer, Berlin/Heidelberg (2009)

    Google Scholar 

  2. Greeff, D.J., Aldrich, C.: Empirical modeling of chemical process systems with evolutionary programming. Comp. Chem. Eng. 22, 995–1005 (1998)

    Article  Google Scholar 

  3. Grosman, B., Lewin, D.R.: Automated nonlinear model predictive control using genetic programming. Comp. Chem. Eng. 26, 631–640 (2002)

    Article  Google Scholar 

  4. Hinchliffe, M.P., Willis, M.J.: Dynamic systems modelling using genetic programming. Comp. Chem. Eng. 27(12), 1841–1854 (2003)

    Article  Google Scholar 

  5. Hinchliffe, M.P., Willis, M.J., Hiden, H., Tham, M.T., McKay, B., Barton, G.W.: Modelling chemical process systems using a multi-gene genetic programming algorithm. In: Genetic Programming: Proceedings of the First Annual Conference (late breaking papers), pp. 56–65. MIT Press, Cambridge (1996)

    Google Scholar 

  6. Koza, J.R.: Genetic Programming: On the Programming of Computers by Means of Natural Selection. MIT Press, Cambridge (1992)

    Google Scholar 

  7. Luke, S., Panait, L.: Lexicographic parsimony pressure. In: Proceedings of the Genetic and Evolutionary Computation Conference (GECCO 2002), 2002

    Google Scholar 

  8. Madar, J., Abonyi, J., Sziefert, F.: Genetic programming for the identification of nonlinear input-output models. Ind. Eng. Chem. Res. 44, 3178–3186 (2005)

    Article  Google Scholar 

  9. McKay, B., Willis, M.J., Barton, G.W.: Steady-state modeling of chemical process systems using genetic programming. Comp. Chem. Eng. 21, 981–996 (1997)

    Article  Google Scholar 

  10. Poli, R., Langdon, W.B., McPhee, N.F.: A field guide to genetic programming. Published via http://lulu.com and freely available at http://www.gp-field-guide.org.uk (2008)

  11. Schultz, T.W., Yarbrough, J.W., Woldemeskel, M.: Toxicity to Tetrahymena and abiotic thiol reactivity of aromatic isothiocyanates. Cell Biol. Toxicol. 21, 181–189 (2005)

    Article  Google Scholar 

  12. Searson, D.P., Leahy, D.E., Willis, M.J.: GPTIPS: An open source genetic programming toolbox for multigene symbolic regression. In: Lecture Notes in Engineering and Computer Science: Proceedings of the International Multiconference of Engineers and Computer Scientists, IMECS 2010, Hong Kong, 17–19 March 2010

    Google Scholar 

  13. Searson, D.P., Willis, M.J., Montague, G.A.: Co-evolution of non-linear PLS model components. J. Chemom. 2, 592–603 (2007)

    Article  Google Scholar 

  14. Seavey, K.C., Jones, A.T., Kordon, A.K.: Hybrid genetic programming – First-principles approach to process and product modeling. Ind. Eng., Chem. Res. 49, 2273–2285 (2010)

    Article  Google Scholar 

  15. Steinbeck, C., Han, Y., Kuhn, S., Horlacher, O., Luttmann, E., Willighagen, E.: The Chemistry Development Kit (CDK): an open-source Java library for chemo- and bioinformatics. J. Chem. Inf. Comput. Sci. 43, 493–500 (2003)

    Article  Google Scholar 

  16. Vapnik, V.N.: The Nature of Statistical Learning Theory, second edn. Springer, New York (2000)

    Book  MATH  Google Scholar 

  17. Wang, X., Li, Y.: Synthesis of multicomponent product separation sequences via stochastic GP method. Ind. Eng. Chem. Res. 47, 8815–8822 (2008)

    Article  Google Scholar 

  18. Zhu, H., Tropsha, A., Fourches, D., Varnek, A., Papa, E., Gramatica, P., Oberg, T., Dao, P., Cherkasov, A., Tetko, I.V.: Combinatorial QSAR modeling of chemical toxicants tested against Tetrahymena pyriformis. J. Chem. Inf. Model. 48, 766–784 (2008)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Dominic P. Searson .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2011 Springer Science+Business Media B.V.

About this chapter

Cite this chapter

Searson, D.P., Leahy, D.E., Willis, M.J. (2011). Predicting the Toxicity of Chemical Compounds Using GPTIPS: A Free Genetic Programming Toolbox for MATLAB. In: Ao, SI., Castillo, O., Huang, X. (eds) Intelligent Control and Computer Engineering. Lecture Notes in Electrical Engineering, vol 70. Springer, Dordrecht. https://doi.org/10.1007/978-94-007-0286-8_8

Download citation

  • DOI: https://doi.org/10.1007/978-94-007-0286-8_8

  • Publisher Name: Springer, Dordrecht

  • Print ISBN: 978-94-007-0285-1

  • Online ISBN: 978-94-007-0286-8

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics