LogP prediction performance with the SMD solvation model and the M06 density functional family for SAMPL6 blind prediction challenge molecules
This work presents a quantum mechanical model for predicting octanol-water partition coefficients of small protein-kinase inhibitor fragments as part of the SAMPL6 LogP Prediction Challenge. The model calculates solvation free energy differences using the M06-2X functional with SMD implicit solvation and the def2-SVP basis set. This model was identified as dqxk4 in the SAMPL6 Challenge and was the third highest performing model in the physical methods category with 0.49 log Root Mean Squared Error (RMSE) for predicting the 11 compounds in SAMPL6 blind prediction set. We also collaboratively investigated the use of empirical models to address model deficiencies for halogenated compounds at minimal additional computational cost. A mixed model consisting of the dqxk4 physical and hdpuj empirical models found improved performance at 0.34 log RMSE on the SAMPL6 dataset. This collaborative mixed model approach shows how empirical models can be leveraged to expediently improve performance in chemical spaces that are difficult for ab initio methods to simulate.
KeywordsSAMPL6 LogP Computational chemistry Implicit solvation DFT
We acknowledge the National Institutes of Health for supporting the experimental work carried out in the SAMPL6 logP Prediction Challenge.
- 6.Neese F (2012) The ORCA program system. Wiley Interdiscip Rev: Computat Mol Sci 2(1):73–78Google Scholar
- 7.Neese F (2018) Software update: the ORCA program system, version 4.0. Wiley Interdiscip Rev: Comput Mol Sci 8(1):e1327Google Scholar
- 8.Zhao Y, Truhlar DG (2008) The M06 suite of density functionals for main group thermochemistry, thermochemical kinetics, noncovalent interactions, excited states, and transition elements: two new functionals and systematic testing of four M06-class functionals and 12 other functionals. Theor Chem Acc 120(1):215–241CrossRefGoogle Scholar
- 11.Zhao Y, Truhlar DG (2008) The M06 suite of density functionals for main group thermochemistry, thermochemical kinetics, noncovalent interactions, excited states, and transition elements: two new functionals and systematic testing of four M06-class functionals and 12 other functionals. Theor Chem Acc 120(1–3):215–241CrossRefGoogle Scholar
- 14.OECD (2004) Test No. 117: Partition coefficient (n-octanol/water), HPLC method, OECD guidelines for the testing of chemicals, Section 1, OECD Publishing, Paris. https://doi.org/10.1787/9789264069824-en
- 15.Berthold MR et al (2008) KNIME: the Konstanz information miner. Springer, BerlinGoogle Scholar
- 17.RDKit: Open-source cheminformatics. http://www.rdkit.org
- 24.D. Laikov, C. van Wüllen. Lebedev-Laikov-Grids. http://www.ccl.net/cca/software/SOURCES/FORTRAN/Lebedev-Laikov-Grids/index.shtml
- 28.Viswanadhan VN et al (1989) Atomic physicochemical parameters for three dimensional structure directed quantitative structure-activity relationships. 4. Additional parameters for hydrophobic and dispersive interactions and their application for an automated superposition of certain naturally occurring nucleoside antibiotics. J Chem Inf Comput Sci 29(3):163–172CrossRefGoogle Scholar