Abstract
Discovery of drugs has been a complex process, time-consuming and expensive until an alternative of making drug has been found i.e. using in silico method to discover potential inhibitor. During the process of drug design, compound classification is carried out through docking score steps. The aim of this research is to predict the docking score results using proper methods for classification i.e. a computationally based method and a standard statistical method. This research examined three target enzymes listed in DUD-E database i.e. aofb, cah2 and hs90a. Each enzyme consists of different compounds that will be classified as good inhibitor (ligand) and bad inhibitor (decoy). In this research, the docking score step is conducted by binary logistic regression and logistic regression ensemble (Lorens). Binary logistic regression yields on 90.4% of accuracy for aofb, 91.7% for cah2 and 94% for hs90a enzyme. Meanwhile, logistic regression ensemble (Lorens) results on the accuracy levels of 88.95, 92.1 and 100% for aofb, cah2 and hs90a consecutively. This paper showed that logistic regression ensemble method outperforms standard logistic regression to be used for the inhibitor classification.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
DiMasi, J. A., Hansen, R. W., & Grabowski, H. G. (2003). The price of innovation: new estimates of drug development costs. Journal of Health Economics, 22(2), 151–185.
Jenwitheesuk, E. H. (2008). Novel paradigms for drug discovery computational multitarget screening. Trends in Pharmacological Sciences, 29, 62–71.
Okada, M., Ohwada, H., & Aoki, S. (2013). Docking score calculation using machine learning with an enhanced inhibitor database. Bioinformatics and Computational Biology, 1.
Cortes, C., & Vapnik, V. (1995). Support vector networks. Machine Learning, 20, 273.
Lim, N., Ahn, H., Moon, H., & Chen, J. J. (2010). Classification high dimensional data with ensemble of logistic regression models. Journal of Biopharmaceutical Statistics, 20, 160–17.
Pombo, N., Garcia, N., Bousson, K., & Felizardo, V. (2015). Machine learning approaches to automated medical decision support systems. In Handbook of Research on Artificial Intelligence Techniques and Algorithms. Chapter, 6, 183–203.
Lim, N. (2007). Classification by ensembles from random partitions using logistic models. In Applied Mathematics and Statistics. Stony Brook University.
Kuswanto, H., Asfihani, A., Sarumaha, Y., & Ohwada, H. (2015). Logistic regression ensemble for predicting customer defection with very large sample size. Procedia Computer Science, 72, 86–93.
Lee, K., Ahn, H., Moon, H., Kodell, R. L., & Chen, J. J. (2013). Multinomial logistic regression ensembles. Journal of Biopharmaceutical Statistics, 23(3), 681–694.
Hosmer, Watson D., & Lemeshow, S. (1995). Applied logistic regression. New York: Wiley.
Agresti, A. (1990). Categorical Data Analysis. New York: Wiley.
Fawcett, T. (2006). An introduction to ROC analysis. Pattern Recognition Letters, 27(8), 861–874.
Witten, I. H., Frank, E., & Hall, M. A. (2011). Data mining: Practical machine learning tools and techniques (3rd ed.). Burlington: Morgan Kaufmann.
Ahn, H., Moon, H., Fazzari, M. J., Lim, N., Chen, J. J., & Kodell, R. L. (2006). Classification by ensemble from random partitions of high-dimensional data. Computational Statistic and Data Analysis, 4–6.
Acknowledgements
The authors gratefully acknowledge the financial support from The Ministry of Research, Technology and Higher Education Indonesia through Research Grant for International Collaboration and Scientific Publication. Moreover, the authors would like to thank also to the First EAI International Conference on Computer Science and Engineering, NOVEMBER 11–12, 2016, PENANG, MALAYSIA as well as the anonymous refrees.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer International Publishing AG
About this chapter
Cite this chapter
Kuswanto, H., Melasasi, J.N., Ohwada, H. (2018). Enzyme Classification on DUD-E Database Using Logistic Regression Ensemble (Lorens). In: Zelinka, I., Vasant, P., Duy, V., Dao, T. (eds) Innovative Computing, Optimization and Its Applications. Studies in Computational Intelligence, vol 741. Springer, Cham. https://doi.org/10.1007/978-3-319-66984-7_6
Download citation
DOI: https://doi.org/10.1007/978-3-319-66984-7_6
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-66983-0
Online ISBN: 978-3-319-66984-7
eBook Packages: EngineeringEngineering (R0)