Skip to main content

Abstract

Data of metabolite concentrations in tissue samples is a powerful source of information about metabolic activity in organisms. Several methods have already been reported that allow for inferences about genetic regulatory networks using expression profiling data. Here we adapted these techniques for metabolic concentration data. While somewhat accurate in predicting certain properties these methods encounter problems when dealing with networks containing a high number of vertices (τ 50). To circumvent these difficulties, larger data sets are usually reduced a priori by means of preprocessing techniques. Our approach allows to make network inferences using ensembles of decision trees that can handle almost any amount of vertices, and thus to avoid time consuming preprocessing steps. The technique works on a bootstrap principle and heuristically searches for partially correlated relations between all objects. We applied this approach to synthetically generated data as well as on data taken from real experiments.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  • FIEHN, O., KOPKA, J., DÖRMANN, P., ALTMANN, T., TRETHEWEY, R.N., and WILMITZER, L. (2000): Metabolite profiling for plant functional genomics. In: Proc. Natl. Acad. Sci. USA, 95, 14863–14868.

    Google Scholar 

  • HORIMOTO, K. and TOH, H. (2001): Automatic System for Inferring a Network from Gene Expression Profiles. Genome Informatics, 12, 270–271.

    Google Scholar 

  • KOSE, F., WECKWETH, W., LINKE, T., and FIEHN, O. (2001): Visualising plant metabolomic correlation networks using clique-metabolite matrices. Bioinformatics, 17, 1198–1208.

    Article  Google Scholar 

  • MITCHELL, T. (1997): Machine Learning. McGraw-Hill, Boston, USA.

    Google Scholar 

  • PE’ER, D, REGEV, A, ELIDAN, G., and FFRIEDMAN, N. (2001): Inferring subnetworks from perturbed expression profiles. Bioinformatics, 17,Suppl. 1, 215–S224.

    Google Scholar 

  • SHIPLEY, B. (2002): Cause and Correlation in Biology. Cambridge University Press, Cambridge, UK.

    Google Scholar 

  • SILVERMAN, B.W. (1981): Using Kernel Density Estimates to investigate Multimodality. Journal of Royal Statistical Society, Series B, 43, 97–99.

    Google Scholar 

  • STATISTICA (2002): Electronic Textbook StatSoft. http://www.statsoftinc.com/textbook/stathome.html.

    Google Scholar 

  • QUINLAN, J.R. (1993): C4.5: Programs for Machine Learning. Morgan Kaufmann, San Mateo, USA.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2005 Springer-Verlag Berlin · Heidelberg

About this paper

Cite this paper

Flöter, A., Selbig, J., Schaub, T. (2005). Finding Metabolic Pathways in Decision Forests. In: Baier, D., Wernecke, KD. (eds) Innovations in Classification, Data Science, and Information Systems. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-26981-9_24

Download citation

Publish with us

Policies and ethics