Abstract
We present an application of machine learning methods to modelling retention constants in thin layer chromatography. First, a feature selection algorithm is applied to reduce the feature space; the regression models are then built with the random forest algorithm. The models obtained in this way correlate better with the experimental data than reference models built with linear regression. They are also robust: cross-validation tests show that the accuracy on unseen data is, on average, identical to the cross-validated accuracy obtained on the training set.
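The robustness claim above compares two error estimates: one from cross-validation inside the training set, and one from data the model never saw. The following is a minimal sketch of such a check, not the authors' procedure. The data are synthetic, the function names (`fit_line`, `cv_rmse`, `robustness_check`) are hypothetical, and a one-feature least-squares line stands in for the regressor only to keep the example dependency-free; the chapter itself uses random forest after feature selection.

```python
import random
from statistics import mean

def fit_line(xs, ys):
    """Least squares on one feature; a stand-in for any regressor (assumption)."""
    mx, my = mean(xs), mean(ys)
    sxx = sum((x - mx) ** 2 for x in xs)
    slope = sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / sxx
    intercept = my - slope * mx
    return lambda x: slope * x + intercept

def rmse(model, xs, ys):
    """Root mean squared error of a fitted model on the given points."""
    return mean((model(x) - y) ** 2 for x, y in zip(xs, ys)) ** 0.5

def cv_rmse(xs, ys, k=5):
    """k-fold cross-validated RMSE, computed inside the training set only."""
    idx = list(range(len(xs)))
    errors = []
    for fold in range(k):
        held = set(idx[fold::k])  # every k-th point forms one fold
        train = [i for i in idx if i not in held]
        model = fit_line([xs[i] for i in train], [ys[i] for i in train])
        errors.append(rmse(model, [xs[i] for i in held], [ys[i] for i in held]))
    return mean(errors)

def robustness_check(xs, ys, repeats=20, test_fraction=0.25, seed=0):
    """Average inner CV error vs. error on held-out data over random splits."""
    rng = random.Random(seed)
    n = len(xs)
    n_test = int(n * test_fraction)
    inner, outer = [], []
    for _ in range(repeats):
        order = list(range(n))
        rng.shuffle(order)
        test, train = order[:n_test], order[n_test:]
        tx, ty = [xs[i] for i in train], [ys[i] for i in train]
        inner.append(cv_rmse(tx, ty))  # estimate from training data alone
        model = fit_line(tx, ty)       # refit on the whole training part
        outer.append(rmse(model, [xs[i] for i in test], [ys[i] for i in test]))
    return mean(inner), mean(outer)

# Synthetic data: a linear signal plus Gaussian noise with unit variance.
data_rng = random.Random(1)
xs = [data_rng.uniform(0, 10) for _ in range(200)]
ys = [0.5 * x + 1.0 + data_rng.gauss(0, 1.0) for x in xs]

inner_rmse, outer_rmse = robustness_check(xs, ys)
print(f"cross-validated RMSE on training data: {inner_rmse:.3f}")
print(f"RMSE on held-out data:                 {outer_rmse:.3f}")
```

When the two averages agree, the cross-validated error is a trustworthy estimate of performance on new compounds; a held-out error that is clearly larger would suggest overfitting, for example from selecting features on the full data set before splitting.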
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Kursa, M.B., Komsta, Ł., Rudnicki, W.R. (2011). The Robust Models of Retention for Thin Layer Chromatography. In: Czachórski, T., Kozielski, S., Stańczyk, U. (eds) Man-Machine Interactions 2. Advances in Intelligent and Soft Computing, vol 103. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-23169-8_19
DOI: https://doi.org/10.1007/978-3-642-23169-8_19
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-23168-1
Online ISBN: 978-3-642-23169-8
eBook Packages: Engineering, Engineering (R0)