Skip to main content

LINGO-DOSM: LINGO for Descriptors of Outline Shape of Molecules

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 7803))

Abstract

The linear notations are more compact than connection tables so they can be useful for storing and transmitting large number of chemical structures. Implicitly they contain the information needed to compute all kinds of molecular structures and, thus, molecular properties derived from these structures. In this DOSM is a new method of obtaining a rough description of 2D molecular structure from its 2D connection graph in the form of character string. Our method is based on the fragmentation of DOSM strings into overlapping substrings of a defined size that we call LINGO-DOSM. The integral set of LINGO-DOSM derived from a given DOSM string, LINGO-DOSM allows rigorous structure specification using very small and simple rule. In this paper, we study the possibility of using the textual descriptor for describing the 2D structure of the molecule. Simulated virtual screening experiments with the MDDR database show clearly the superiority of the LINGO-DOSM descriptor compared to many standard descriptors tested in this paper.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Agrafiotis, D.K., Myslik, J.C., Salemme, F.R.: Advances in diversity profiling and combinatorial series design. Mol. Diversity 4, 1–22 (1999)

    Article  Google Scholar 

  2. Jorgensen, W.L.: The many roles of computation in drug discovery. Science 303 (2004)

    Google Scholar 

  3. Flower, D.R.: On the properties of bit-string-based measures of chemical similarity. J. Chem. Inf. Comput. Sci. 38, 379–386 (1998)

    Article  Google Scholar 

  4. Willett, P., Barnard, J.M., Downs, G.M.: Chemical similarity searching. J. Chem. Inf. Comput. Sci. 38, 983–996 (1998)

    Article  Google Scholar 

  5. Brown, R.D.: Descriptors for Diversity Analysis. Perspect. Drug. Discovery Des. 7/8, 31–49 (1997)

    Google Scholar 

  6. David, V., Michael, T., Miquel, P.: LINGO, an Efficient Holographic Text Based Method To Calculate Biophysical Properties and Intermolecular Similarities. J. Chem. Inf. Model. 45, 386–393 (2005)

    Article  Google Scholar 

  7. UNITY Reference Manual, Tripos Inc., St. Louis, MO (1995)

    Google Scholar 

  8. Winkler, D.A., Burden, F.R.: Holographic QSAR of benzodiazepine. Quant. Struct.-Act. Relat. 17, 224–231 (1998)

    Article  Google Scholar 

  9. Leach, A.R., Gillet, V.J.: An Introduction to Chemoinformatics. Kluwer, Dordrecht (2003)

    Google Scholar 

  10. Wild, D.J., Willett, P.: Similarity Searching in Files of Three-Dimensional Chemical Structures. Alignment of Molecular Electrostatic Potential Fields with a Genetic Algorithm. J. Chem. Inf. Comput. Sci. 36, 159–167 (1996)

    Article  Google Scholar 

  11. Kirchmair, J., Distinto, S., Markt, P., Schuster, D., Spitzer, G.M., Liedl, K.R., Wolber, G.: How To Optimize Shape-Based Virtual Screening: Choosing the Right Query and Including Chemical Information. J. Chem. Inf. Model. 49, 678–692 (2009)

    Article  Google Scholar 

  12. Rush, T.S., Grant, J.A., Mosyak, L., Nicholls, A.: A Shape-Based 3-D Scaffold Hopping Method and Its Application to a Bacterial Protein−Protein Interaction. J. Med. Chem. 48, 1489–1495 (2005)

    Article  Google Scholar 

  13. Warr, W.A.: Representation of chemical structures. Wiley Interdisciplinary Reviews: Computational Molecular Science 1, 557–579 (2011)

    Article  Google Scholar 

  14. Hall, L.H., Kier, L.B.: Issues in representation of molecular structure: The development of molecular connectivity. J. Mol. Graph. 20, 4–18 (2001)

    Article  Google Scholar 

  15. Kogej, T., Engkvist, O., Blomberg, N., Muresan, S.: Multifingerprint Based Similarity Searches for Targeted Class Compound Selection. J. Chem. Inf. Model. 46, 1201–1213 (2006)

    Article  Google Scholar 

  16. Weininger, D.: SMILES, A chemical language and information system. 1. Introduction to methodology and encoding rules. J. Chem. Inf. Comp. Sci. 28, 31–36 (1988)

    Article  Google Scholar 

  17. SciTegicAccelrys Inc.

    Google Scholar 

  18. Yap, C.W.: PaDEL-descriptor: An open source software to calculate molecular descriptors and fingerprints. J. Comput. Chem. 32, 1466–1474 (2011)

    Article  Google Scholar 

  19. Abdo, A., Chen, B., Mueller, C., Salim, N., Willett, P.: Ligand-Based Virtual Screening Using Bayesian Networks. J. Chem. Inf. Model. 50, 1012–1020 (2010)

    Article  Google Scholar 

  20. Altschul, S.F., Madden, T.L., Schäffer, A.A., Zhang, J., Zhang, Z., Miller, W., Lipman, D.J.: Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 25, 3389–3402 (1997)

    Article  Google Scholar 

  21. Brown, R.D., Martin, Y.C.: Use of Structure-Activity Data To Compare Structure-Based Clustering Methods and Descriptors for Use in Compound Selection. J. Chem. Inf. Comput. Sci. 36, 572–584 (1996)

    Article  Google Scholar 

  22. Siegel, S., Castellan, N.J.: Nonparametric Statistics for The Behavioral Sciences. McGraw-Hill, New York (1988)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Hentabli, H., Salim, N., Abdo, A., Saeed, F. (2013). LINGO-DOSM: LINGO for Descriptors of Outline Shape of Molecules. In: Selamat, A., Nguyen, N.T., Haron, H. (eds) Intelligent Information and Database Systems. ACIIDS 2013. Lecture Notes in Computer Science(), vol 7803. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-36543-0_33

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-36543-0_33

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-36542-3

  • Online ISBN: 978-3-642-36543-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics