Computation of 3D queries for ROCS based virtual screens
Rapid overlay of chemical structures (ROCS) is a method that aligns molecules based on shape and/or chemical similarity. It is often used in 3D ligand-based virtual screening. Given a query consisting of a single conformation of an active molecule, ROCS can generate highly enriched hit lists. Typically, the chosen query conformation is a minimum energy structure. Can better enrichment be obtained using conformations other than the minimum energy structure? To answer this question, a methodology called CORAL (COnformational analysis, Rocs ALignment) has been developed. For a given set of molecule conformations, it computes optimized conformations for ROCS screening. It does so by clustering all conformations of a chosen molecule set using pairwise ROCS combo scores. The best representative conformation of each cluster is the one with the highest average overlap with the other conformations in that cluster. These best representative conformations are then used as the queries for virtual screening. CORAL was tested by performing virtual screening experiments on the 40 DUD (Directory of Useful Decoys) data sets, using both CORAL and minimum energy queries. The recognition capability of each query was quantified as the area under the receiver operating characteristic (ROC) curve (AUC). The results show that the CORAL AUC values are, on average, larger than the minimum energy AUC values. This demonstrates that one can indeed obtain better ROCS enrichments with conformations other than the minimum energy structure. As a result, CORAL analysis can be a valuable first step in virtual screening workflows using ROCS.
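The two quantitative steps described above, selecting a cluster representative and scoring a query by AUC, can be illustrated with a minimal Python sketch. It is not the authors' implementation. The function names, the toy score matrix, and the screening scores are hypothetical; the pairwise combo scores and the cluster membership are assumed to have been computed already (no ROCS call is made); and the AUC is computed through its standard equivalence to the probability that a randomly chosen active outscores a randomly chosen decoy.

```python
def best_representative(cluster, score):
    """Return the cluster member with the highest mean score to the others.

    cluster: list of conformation indices (hypothetical labels).
    score:   symmetric matrix of precomputed pairwise overlap scores.
    """
    if len(cluster) == 1:
        return cluster[0]

    def mean_overlap(i):
        others = [j for j in cluster if j != i]
        return sum(score[i][j] for j in others) / len(others)

    return max(cluster, key=mean_overlap)


def roc_auc(active_scores, decoy_scores):
    """AUC as the fraction of (active, decoy) pairs ranked correctly.

    Ties between an active and a decoy count as half a correct pair.
    """
    wins = 0.0
    for a in active_scores:
        for d in decoy_scores:
            if a > d:
                wins += 1.0
            elif a == d:
                wins += 0.5
    return wins / (len(active_scores) * len(decoy_scores))


# Toy pairwise overlap scores for four conformations (made-up values).
toy_scores = [
    [2.0, 1.4, 1.1, 0.3],
    [1.4, 2.0, 1.5, 0.4],
    [1.1, 1.5, 2.0, 0.2],
    [0.3, 0.4, 0.2, 2.0],
]
toy_cluster = [0, 1, 2]  # assumed output of a prior clustering step
representative = best_representative(toy_cluster, toy_scores)

# Made-up screening scores obtained with one query.
toy_actives = [1.6, 1.2, 0.9]
toy_decoys = [1.0, 0.8, 0.7, 0.5]
auc = roc_auc(toy_actives, toy_decoys)

print("representative conformation:", representative)
print("AUC:", round(auc, 3))
```

In this toy case, conformation 1 has the highest mean overlap with the other two cluster members (1.45, against 1.25 and 1.30), and 11 of the 12 active-decoy pairs are ranked correctly. The pairwise loop in `roc_auc` is quadratic in the number of molecules; for data sets of realistic size, a rank-based calculation would be the more practical choice.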
Keywords: Ligand-based virtual screening; ROCS; Optimized query conformation; ROC curve analysis; Statistical significance; Virtual screening workflow
The authors would like to thank Will Somers and Tarek Mansour of Wyeth Chemical Sciences for their support, Dave Diller for manuscript suggestions, Ramaswamy Nilikantan for help with the diversity analysis and Youping Huang for help in performing the statistical analysis.