Skip to main content

PGVL Hub: An Integrated Desktop Tool for Medicinal Chemists to Streamline Design and Synthesis of Chemical Libraries and Singleton Compounds

  • Protocol
  • First Online:
Chemical Library Design

Abstract

PGVL Hub is an integrated molecular design desktop tool that has been developed and globally deployed throughout Pfizer discovery research units to streamline the design and synthesis of combinatorial libraries and singleton compounds. This tool supports various workflows for design of singletons, combinatorial libraries, and Markush exemplification. It also leverages the proprietary PGVL virtual space (which contains 1014 molecules spanned by experimentally derived synthesis protocols and suitable reactants) for lead idea generation, lead hopping, and library design. There had been an intense focus on ease of use, good performance and robustness, and synergy with existing desktop tools such as ISIS/Draw and SpotFire. In this chapter we describe the three-tier enterprise software architecture, key data structures that enable a wide variety of design scenarios and workflows, major technical challenges encountered and solved, and lessons learned during its development and deployment throughout its production cycles. In addition, PGVL Hub represents an extendable and enabling platform to support future innovations in library and singleton compound design while being a proven channel to deliver those innovations to medicinal chemists on a global scale.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Protocol
USD 49.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 89.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 119.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 169.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Hogan, J. C. Jr. (1997) Combinatorial chemistry in drug discovery. Nat Biotechnol 15, 328–330.

    Article  PubMed  CAS  Google Scholar 

  2. Hall, S. E. (1997) The future of combinatorial chemistry as drug discovery paradigm. Pharm Res 14(9), 1104–1105.

    Google Scholar 

  3. Salemme, F. R., Spurlino, J., Bone, R. (1997) serendipity meets precision: the integration of structure-based drug design and combinatorial chemistry for efficient drug discovery. Structure 5, 319–324.

    Article  PubMed  CAS  Google Scholar 

  4. Floyd, C. D., Leblanc, C., Whittaker, M. (1999) Combinatorial chemistry as a tool for drug discovery. Prog Med Chem 36, 91–163.

    Article  PubMed  CAS  Google Scholar 

  5. Beeley, N., Berger, A. (2000) A revolution in drug discovery. Combinatorial chemistry still needs logic to drive science forward. BMJ [Br Med J] 321(7261), 581–582.

    Google Scholar 

  6. Singh, J., Ator, M. A., Jaeger, E. P., Allen, M. P., Whipple, D. A., Soloweiij, J. E., Chowdhary, S., Treasurywala, A. M. (1996) Application of genetic algorithm to combinatorial synthesis: a computational approach to lead identification and lead optimization. J Am Chem Soc 118, 1669–1676.

    Article  CAS  Google Scholar 

  7. Blaney, J. M., Martin, E. J. (1997) Computational approaches for combinatorial library design and molecular diversity analysis. Curr Opin Chem Biol 1, 54–59.

    Article  PubMed  CAS  Google Scholar 

  8. Brown, R. D., Martin, Y. C. (1997) Design combinatorial library mixtures using a genetic algorithm. J Med Chem 40, 2304–2313.

    Article  PubMed  CAS  Google Scholar 

  9. Agrafiotis, D. K., Myslik, J. C., Salemme F. R. (1998) Advances in diversity profiling and combinatorial series design. Mol Diversity 4, 1–22.

    Article  CAS  Google Scholar 

  10. Bures, M. G., Martin, Y. C. (1998) computational methods in molecular diversity and combinatorial chemistry. Curr Opin Chem Biol 2, 376–380.

    Article  PubMed  CAS  Google Scholar 

  11. Zheng, W., Cho, S. J., Waller, C. L, Tropsha, A. (1999) Rational combinatorial library design. 3. Simulated annealing guided evaluation (SAGE) of molecular diversity: a novel computational tool for universal library design and database mining. J Chem Inf Comput Sci 39, 738–746.

    Article  PubMed  CAS  Google Scholar 

  12. Gillet, V. J., Willett, P., Bradshaw, J., Green, D. V. S. (1999) Selecting combinatorial libraries to optimize diversity and physical properties. J Chem Inf Comput Sci 39, 169–177.

    Article  CAS  Google Scholar 

  13. Spellmeyer, D. C., Grootenhuis, P. D. J. (1999) Computational approaches to combinatorial chemistry. Annu Rep Med Chem 34, 287–296.

    Article  CAS  Google Scholar 

  14. Brown, R. D., Hassan, M., Waldman, M. (2000) Combinatorial library design for diversity, cost efficiency, and drug-like characters. J Mol Graph Model 18, 427–437.

    Article  PubMed  CAS  Google Scholar 

  15. Jamois, E. A., Hassan, M., Waldman, M. (2000) Evaluation of reagent-based and product-based strategies in the design of combinatorial library subsets. J Chem Inf Comput Sci 40, 63–70.

    Article  PubMed  CAS  Google Scholar 

  16. Douguet, D., Thoreau, E., Grassy, G. (2000) A genetic algorithm for the automated generation of small organic molecules: drug design using an evolutionary algorithm. J Comput Aided Mol Design 14, 449–466.

    Article  CAS  Google Scholar 

  17. Sheridan, R. P., SanFeliciano, S. G., Kearsley, S. K. (2000) Designing targeted libraries with genetic algorithm. J Mol Graph Model 18, 320–334.

    Article  PubMed  CAS  Google Scholar 

  18. Gillet, V. J., Khatib, W., Willett, P., Fleming, P. J., Green, D. V. S. (2002) Combinatorial library design using a multiobjective genetic algorithm. J Chem Inf Comput Sci 42, 375–385.

    Article  PubMed  CAS  Google Scholar 

  19. Chen, G., Zheng, S., Luo, X., Shen, J., Zhu, W., Liu, H., Gui, C., Zhang, J., Zheng, M., Puah, C. M., Chen, K., and Jiang, H. (2005) Focused combinatorial library design based on structural diversity, drug likeness and binding affinity score. J Comb Chem 7 (3), 398–406.

    Article  PubMed  CAS  Google Scholar 

  20. Mason, J. S., Beno, B. R. (2000) Library design using BCUT chemistry-space descriptors and multiple four-point pharmacophore fingerprints: simultaneous optimization and structure-based diversity. J Mol Graph Model 18, 438–451.

    Article  PubMed  CAS  Google Scholar 

  21. Shi, S., Peng, Z., Kostrowicki, J., Paderes, G., Kuki, A. (2000) Efficient combinatorial filtering for desired molecular properties of reaction products. J Mol Graph Model 18, 478–496.

    Article  PubMed  CAS  Google Scholar 

  22. Gobbi, A., Poppinger, D., Rohde, B. (1997) Developing an in-house system to support combinatorial chemistry. Perspect Drug Discov Des 7/8 (Combinatorial Methods for the Analysis of Molecular Diversity), 131–158.

    Google Scholar 

  23. Polinsky, P., Feinstein, R. D., Shi, S., Kuki, A. (1996) LiBrain: software for automated design of exploratory and targeted combinatorial libraries, in (Chaiken, I. M., Handa, K. D., eds.) Molecular Diversity and Combinatorial Chemistry. American Chemical Society, Washington, DC, pp. 219–232.

    Google Scholar 

  24. Shi, S., Kuki, K., Zhou, Z., Na, J., Thacher, T., Yanovsky, A., Polinsky, P. LiBrainTM, An Intelligent System for the High-Throughput Design of Combinatorial libraries in Drug Discovery. Poster Presentations at the Fifth International Conference on Chemical Structures (1999) and the Second European Conference on Strategies and Technologies for Identification of NOVEL BIOACTIVE COMPOUNDS (1998).

    Google Scholar 

  25. Leach, A. R., Bradshaw, J., Green, D. V. S., Hann, M. M., Delany, J. J., III (1999) Implementation of a system for reagent selection and library enumeration, profiling, and design. J Chem Inf Comput Sci 39, 1161–1172; also see a review article from Leach, A. R., Hann, M. M. (2000) The in silico world of virtual libraries. Drug Discovery Today 5, 326–336.

    Google Scholar 

  26. Yasri, A., Berthelot, D., Gijsen, H., Thielemans, T., Marichal, P., Engles, M., Hoflack, J. (2004) REALISIS: a medicinal chemistry-oriented reagent selection, library design, and profiling platform. J Chem Inf Comput Sci 44, 2199–2206.

    Article  PubMed  CAS  Google Scholar 

  27. (a) Truchon, J., Bayly, C. I. (2006) GLARE: A new approach for filtering large reagent lists in combinatorial library design using product properties. J Chem Inf Model 46, 1536–1548. (b) Stanton, R. V., Mount, J., Miller, J. L. (2000) Combinatorial library design: maximizing model-fitting compounds within matrix synthesis constraints. J Chem Inf Comput Sci 40, 701–705.

    Google Scholar 

  28. Peng, Z., et al. PGVL: a vast virtual space of synthetic feasible compounds based on captured knowledge of combinatorial chemistry synthesis protocols at the enterprise level. Manuscript in preparation.

    Google Scholar 

  29. Hu, Q., Peng, Z., Kostrowicki, Kuki, A. (2011) LEAP into the Pfizer Global Virtual Library (PGVL) space – creation of readily synthesizable design ideas automatically, in (Zhou, J. Z. ed.) , Chemical Library Design. Humana Press, New York, Chapter 13.

  30. ISIS-Draw, MDL Information Systems, Inc. http://www.mdli.com.

  31. Microsoft Excel: http://office.microsoft.com/en-us/excel/FX100487621033.aspx

  32. SpotFire for data visualization and decision making: http://spotfire.tibco.com/

  33. Java web Start from Sun Microsystems, Inc: http://java.sun.com/products/javawebstart/

  34. Pipeline Pilot from SciTegic: http://www.scitegic.com/

  35. Dalby, A., Nourse, J. G., Hounshell, W .D., Gushurst, A. K. I., Grier, D. L., Leland, B. A., Laufer, J. (1992) Description of several chemical structure file formats used by computer programs developed at molecular design limited. J Chem Inf Comput Sci 32, 244–255.

    Article  CAS  Google Scholar 

  36. A good resource for BASE64 encoding could be found in: http://en.wikipedia.org/wiki/Base64

  37. Resource on data compression in Java could be found in: http://java.sun.com/developer/technicalArticles/Programming/compression/

  38. Resource about XML (http://en.wikipedia.org/wiki/XML ) and its special characters (http://www.devx.com/tips/Tip/14068)

  39. Lipinski, C. A., Lombardo, F., Dominy, B. W., Feeney, P. J. (1997) Experimental and computational approaches to estimate solubility and permeability in drug discovery and development settings. Adv Drug Deliv Rev 23 (1–3), 3–25.

    Article  CAS  Google Scholar 

  40. More info on the LogD prediction tool from of the ACD Lab could be found: http://www.acdlabs.com/products/phys_chem_lab/logd/

  41. a) A Pfizer in-house service and framework for development, validation, and deployment of in silico models for ADME-Tox and target specific SAR prediction. New ADME-Tox models published into this service will be automatically visible to all client software packages (such as PGVL Hub) subscribed into this service. b) Kalgutkar, A. S., Gardner, I., Obach, R. S., Shaffer, C. L., Callegari, E., Henne, K. R., Mutlib, A. E., Dalvie, D. K., Lee, J. S., Nakai, Y., O’Donnell, J. P., Boer, J., Harriman, S. P. (2005) A comprehensive listing of bioactivation pathways of organic functional groups. Curr Drug Metab 6, 161–225.

    Google Scholar 

  42. Steuer, R. E. (1986) Multiple Criteria Optimization: Theory, Computations, and Application, John Wiley & Sons, Inc., New York, ISBN 047188846X.

    Google Scholar 

  43. Sawaragi, Y., Nakayama, H., Tanino, T. (1985) Theory of Multiobjective Optimization (vol. 176 of Mathematics in Science and Engineering). Academic Press Inc., Orlando, FL, ISBN 0126203709.

    Google Scholar 

  44. Messac, A., Ismail-Yahaya, A., Mattson, C. A. (2003) The normalized normal constraint method for generating the pareto frontier. Struct Multidis Optim 25(2), 86–98.

    Article  Google Scholar 

  45. Das, I., Dennis, J. E. (1998) Normal-boundary intersection: a new method for generating the pareto surface in nonlinear multicriteria optimization problems. SIAM J Optim 8, 631–657.

    Article  Google Scholar 

  46. Deb, K., Pratap, A., Agarwal, S., Meyarivan, T. (2002) A fast and elitist multiobjective genetic algorithm. NSGA-II. IEEE Trans Evol Comput 6(2), 182–197.

    Article  Google Scholar 

  47. Two examples: Mobius from Coalesix, Inc.: A GA driven, R-group based compound/library design tools (http://www.coalesix.com/FAQ.html); and C2-LibX from Accelrys, Inc. contains a GA based library design module (more from http://accelrys.com/ ).

  48. Zhou, Z., Shi, S., Na, J., Peng, Z., Thacher, T. (2009) Combinatorial library-based design with basis products. J Comput Aided Mol Des 23, 725–736.

    Article  PubMed  CAS  Google Scholar 

  49. SciTegic’s re-implementation of Canonical SMILES string based on the original publication: Weininger, D., Weininger, A., Weininger, J. L. (1989) SMILES. 2. Algorithm for Generation of Unique SMILES Notation. J Chem Inf Comput Sci 29(2), 97–101.

    Google Scholar 

  50. ChemSelect AQB (Advanced Query Builder): A Pfizer in-house developed re-usable Java component that allows users to query various molecular structure databases within Pfizer based on molecular structural information and/or other properties, retrieve, manage, and export retrieved hits. It has a functional role similar to MDL ISIS/BASE, but with many enhanced capabilities. PGVL Hub has embedded this re-usable Java component within itself so that users can search for suitable reactants from various corporate reactants databases and inventory houses and return the hits seamlessly back into PGVL Hub design session. From user’s point of view, the ChemSelect AQB component is just part of PGVL Hub. This is one of the successful examples that re-usable components are developed and shared among multiple software development projects within Pfizer.

    Google Scholar 

  51. WebLogic is a J2EE middle-tier software suit from the former BEA Systems, now part of Oracle (http://www.oracle.com/appserver/weblogic/weblogic-suite.html)

  52. Oracle is a database application software from Oracle (http://www.oracle.com/index.html)

  53. Smith, G. F. (2006) Enabling HTS Hit follow-up via Chemo informatics, File Enrichment, and Outsourcing. High Throughput Medicinal Chemistry II; MMS Conferencing & Events Ltd., Institute of Physics; London, 2006. This article is also available on-line via this web link (http://www.mmsconferencing.com/pdf/htmc/g.smith.pdf).

  54. Clark, J. D., Hu, Q., Kuki, A., Peng, Z., Sciammetta, N., Smith, G. F., Ramirez-Weinhouse, M., Van Hoorn, W. (2006) Pfizer global virtual library: one-stop-shop for design on the desktop. A poster given by Nunzio Sciammetta during the 2006 Gordon Conference on Combinatorial Chemistry, August 20–25, 2006 in The Queen’s College Oxford, United Kingdom.

    Google Scholar 

  55. Teng, M., Zhu, J., Johnson, M. D., Chen, P., Kornmann, J., Chen, E., Blasina, A., Register, J., Anderes, K., Rogers, C., Deng, Y., Ninkovic, S., Grant, S., Hu, Q., Lundgren, K., Peng, Z., Kania, R. S. (2007) Structure-based design of (5-Arylamino-2H-pyrazol-3-yl)-biphenyl-2',4'-diols as Novel and Potent Human CHK1 Inhibitors. J Med Chem 50 (22), 5253–5256.

    Article  PubMed  CAS  Google Scholar 

  56. Peng, Z., Hu Q. (2011) Design of targeted libraries against the human Chk1 kinase using PGVL Hub, in (Zhou, J. Z. ed.) Chemical Library Design. Humana Press, New York, Chapter 16.

Download references

Acknowledgments

Over the years, the PGVL development team has received strong support and help from Pfizer research management, PGVL site champions and steering committee, user communities in medicinal chemistry and computational chemistry, research informatics, and sister software development projects. We would like to express our deepest gratitude and apologize for not being able to list all their names explicitly here.

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2011 Springer-Science+Business Media, LLC

About this protocol

Cite this protocol

Peng, Z. et al. (2011). PGVL Hub: An Integrated Desktop Tool for Medicinal Chemists to Streamline Design and Synthesis of Chemical Libraries and Singleton Compounds. In: Zhou, J. (eds) Chemical Library Design. Methods in Molecular Biology, vol 685. Humana Press. https://doi.org/10.1007/978-1-60761-931-4_15

Download citation

  • DOI: https://doi.org/10.1007/978-1-60761-931-4_15

  • Published:

  • Publisher Name: Humana Press

  • Print ISBN: 978-1-60761-930-7

  • Online ISBN: 978-1-60761-931-4

  • eBook Packages: Springer Protocols

Publish with us

Policies and ethics