Abstract
Data of metabolite concentrations in tissue samples is a powerful source of information about metabolic activity in organisms. Several methods have already been reported that allow for inferences about genetic regulatory networks using expression profiling data. Here we adapted these techniques for metabolic concentration data. While somewhat accurate in predicting certain properties these methods encounter problems when dealing with networks containing a high number of vertices (τ 50). To circumvent these difficulties, larger data sets are usually reduced a priori by means of preprocessing techniques. Our approach allows to make network inferences using ensembles of decision trees that can handle almost any amount of vertices, and thus to avoid time consuming preprocessing steps. The technique works on a bootstrap principle and heuristically searches for partially correlated relations between all objects. We applied this approach to synthetically generated data as well as on data taken from real experiments.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
FIEHN, O., KOPKA, J., DÖRMANN, P., ALTMANN, T., TRETHEWEY, R.N., and WILMITZER, L. (2000): Metabolite profiling for plant functional genomics. In: Proc. Natl. Acad. Sci. USA, 95, 14863–14868.
HORIMOTO, K. and TOH, H. (2001): Automatic System for Inferring a Network from Gene Expression Profiles. Genome Informatics, 12, 270–271.
KOSE, F., WECKWETH, W., LINKE, T., and FIEHN, O. (2001): Visualising plant metabolomic correlation networks using clique-metabolite matrices. Bioinformatics, 17, 1198–1208.
MITCHELL, T. (1997): Machine Learning. McGraw-Hill, Boston, USA.
PE’ER, D, REGEV, A, ELIDAN, G., and FFRIEDMAN, N. (2001): Inferring subnetworks from perturbed expression profiles. Bioinformatics, 17,Suppl. 1, 215–S224.
SHIPLEY, B. (2002): Cause and Correlation in Biology. Cambridge University Press, Cambridge, UK.
SILVERMAN, B.W. (1981): Using Kernel Density Estimates to investigate Multimodality. Journal of Royal Statistical Society, Series B, 43, 97–99.
STATISTICA (2002): Electronic Textbook StatSoft. http://www.statsoftinc.com/textbook/stathome.html.
QUINLAN, J.R. (1993): C4.5: Programs for Machine Learning. Morgan Kaufmann, San Mateo, USA.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin · Heidelberg
About this paper
Cite this paper
Flöter, A., Selbig, J., Schaub, T. (2005). Finding Metabolic Pathways in Decision Forests. In: Baier, D., Wernecke, KD. (eds) Innovations in Classification, Data Science, and Information Systems. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-26981-9_24
Download citation
DOI: https://doi.org/10.1007/3-540-26981-9_24
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-23221-6
Online ISBN: 978-3-540-26981-6
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)