Development of an in silico prediction model for chemical-induced urinary tract toxicity by using naïve Bayes classifier
- 42 Downloads
The urinary tract toxicity is one of the major reasons for investigational drugs not coming into the market and even marketed drugs being restricted or withdrawn. The objective of this investigation is to develop an easily interpretable and practically applicable in silico prediction model of chemical-induced urinary tract toxicity by using naïve Bayes classifier. The genetic algorithm was used to select important molecular descriptors related to urinary tract toxicity, and the ECFP-6 fingerprint descriptors were applied to the urinary tract toxic/non-toxic fragments production. The established naïve Bayes classifier (NB-2) produced 87.3% overall accuracy of fivefold cross-validation for the training set and 84.2% for the external test set, which can be employed for the chemical-induced urinary tract toxicity assessment. Furthermore, six important molecular descriptors (e.g., number of N atoms, AlogP, molecular weight, number of H acceptors, number of H donors and molecular fractional polar surface area) and toxic and non-toxic fragments were obtained, which would help medicinal chemists interpret the mechanisms of urinary tract toxicity, and even provide theoretical guidance for hit and lead optimization.
KeywordsUrinary tract toxicity Naïve Bayes classifier Molecular descriptors Genetic algorithm Extended-connectivity fingerprints (ECFP-6)
This work was supported by the National Natural Science Foundation of China (Grant Nos. 81660589 and 31660101).
Compliance with ethical standards
Conflict of interest
The authors declare that they have no conflict of interest.
- 18.Berger JO (2013) Statistical decision theory and Bayesian analysis. Springer, BerlinGoogle Scholar
- 19.Box GE, Tiao CC (2011) Bayesian inference in statistical analysis. Wiley, New YorkGoogle Scholar
- 22.Lei T, Chen F, Liu H, Sun H, Kang Y, Li D, Li Y, Hou T (2017) ADMET evaluation in drug discovery. 17. Development of quantitative and qualitative prediction models for chemical-induced respiratory toxicity. Accurate prediction of rat oral acute toxicity using relevance vector machine and consensus modeling. Mol Pharm 14:2407–2421CrossRefGoogle Scholar
- 25.U.S. EPA toxicity category. https://www.ecfr.gov/cgi-bin/text-idx?SID=01a7e4bc780b0368240af36919256d1c&mc=true&node=se40.26.156_162&rgn=div84. Accessed 18 July 2017
- 26.Valdés-Martiní JR, Marrero-Ponce Y, García-Jacas CR, Martinez-Mayorga K, Barigye SJ, Vaz d’Almeida YS, Pham-The H, Pérez-Giménez F, Morell CA (2017) QuBiLS-MAS, open source multi-platform software for atom- and bond-based topological (2D) and chiral (2.5D) algebraic molecular descriptors computations. J Cheminformatics 9:35CrossRefGoogle Scholar
- 29.Davis L (ed) (1991) Handbook of genetic algorithms. Van Nostrand Reinhold, New YorkGoogle Scholar
- 34.Matthews EJ, Ursem CJ, Kruhlak NL, Benz RD, Sabaté DA, Yang C, Klopman G, Contrera JF (2009) Identification of structure-activity relationships for adverse effects of pharmaceuticals in humans: part B. Use of (Q)SAR systems for early detection of druginduced hepatobiliary and urinary tract toxicities. Regul Toxicol Pharmacol 54:23–42CrossRefGoogle Scholar