Abstract
In bioinformatics, biochemical signal pathways can be modeled by many differential equations. It is still an open problem how to fit the huge amount of parameters of the equations to the available data. Here, the approach of systematically obtaining the most appropriate model and learning its parameters is extremely interesting.
One of the most often used approaches for model selection is to choose the least complex model which “fits the needs”. For noisy measurements, the model with the smallest mean squared error of the observed data results in a model which fits too accurately to the data – it is overfitting. Such a model will perform good on the training data, but worse on unknown data.
This paper proposes as model selection criterion the least complex description of the observed data by the model, the minimum description length MDL. For the small, but important example of inflammation modeling the performance of the approach is evaluated.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Brause, R., Hanisch, E., Paetz, J., Arlt, B.: The MEDAN project: results and practical meanings, 3. Int. Symposium. In: Nierhaus, A., Schulte am Esch, J. (eds.) Sepsis, SIRS, Immune Response - Concepts, Diagnostics and Therapy, pp. 122–129. PABST Science Publishers, Lengerich (2003)
D’haeseleer, P., Liang, S., Somogyi, R.: Genetic network inference: from co-expression clustering to reverse engineering. Bioinformatics 16, 707–726 (2000)
Steffen, M., Petti, A., D’haeseleer, P., Aach, J., Church, G.M.: Automated Modeling of Signal Transduction Networks. BMC Bioinformatics 3, 34–44 (2002)
Yeung, M.K., Tegner, J., Collins, J.J.: Reverse engineering gene networks using singular value decomposition and robust regression. In: Proc. Natl. Acad. Sci., USA, vol. 99, pp. 6163–6168 (2002)
Paetz, J., Brause, R.: A Frequent Patterns Tree Approach for Rule Generation with Categorical Septic Shock Patient Data. In: Crespo, J., Maojo, V., Martin, F. (eds.) Medical Data Analysis, pp. 207–212. Springer, Heidelberg (2001)
Chow, C.: Development of a Dynamical Systems Model of Acute Inflammation. In: Neugebauer, E. (ed.) 2nd workshop on complex systems: Analysis in Shock and Trauma Research, University of Cologne (2003)
Brause, R.: Adaptive modeling of biochemical pathways. In: IEEE 15th Int. Conf. on Tools with Art. Intell. ICTAI 2003, pp. 62–68. IEEE Press, Los Alamitos (2003)
Lapedes, A., Farber, R.: How Neural Nets Work, Report LA-UR-88-418, Los Alamos Nat. Lab. In: Lee, Y.C. (ed.) Evolution, Learning and Cognition. World Scientific, Singapore (1988)
Rechenberg, I.: Evolutionsstrategie. Problemata frommann-holzboog (1973)
Schwefel, H.-P.: Evolution and Optimum Seeking. J.Wiley, Chichester (1995)
Bronstein, I.N., et al.: Handbook of Mathematics, Van Nostrand Reinhold, 3rd edn. (1991)
Hansen, M., Yu, B.: Model selection and the principle of Minimum Description Length. Journal of the American Statistical Association 96(454), 746–774 (2001)
Cover, T., Thomas, J.: Elements of Information Theory. John Wiley, New York (1991)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Brause, R.W. (2004). Model Selection and Adaptation for Biochemical Pathways. In: Barreiro, J.M., Martín-Sánchez, F., Maojo, V., Sanz, F. (eds) Biological and Medical Data Analysis. ISBMDA 2004. Lecture Notes in Computer Science, vol 3337. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30547-7_44
Download citation
DOI: https://doi.org/10.1007/978-3-540-30547-7_44
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-23964-2
Online ISBN: 978-3-540-30547-7
eBook Packages: Springer Book Archive