Approximation Algorithms for 3-D Common Substructure Identification in Drug and Protein Molecules

Chakraborty, Samarjit; Biswas, Somenath

doi:10.1007/3-540-48447-7_26

Samarjit Chakraborty⁷ &
Somenath Biswas⁸

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 1663))

Included in the following conference series:

Workshop on Algorithms and Data Structures

786 Accesses
10 Citations

Abstract

Identifying the common 3-D substructure between two drug or protein molecules is an important problem in synthetic drug design and molecular biology. This problem can be represented as the following geometric pattern matching problem: given two point sets A and B in three-dimensions, and a real number∈ > 0, find the maximum cardinality subset S ⊆ S for which there is an isometry I, such that each point of I(S) is within (ie253-1) distance of a distinct point of B. Since it is difficult to solve this problem exactly, in this paper we have proposed several approximation algorithms with guaranteed approximation ratio. Our algorithms can be classifed into two groups. In the first we extend the notion of partial decision algorithms for ∈-congruence of point sets in 2-D in order to approximate the size of S. All the algorithms in this class exactly satisfy the constraint imposed by ∈. In the second class of algorithms this constraint is satisfied only approximately. In the latter case, we improve the known approximation ratio for this class of algorithms, while keeping the time complexity unchanged. For the existing approximation ratio, we propose algorithms with substantially better running times. We also suggest several improvements of our basic algorithms, all of which have a running time of O(n ^8.5). These improvements consist of using randomization, and/or an approximate maximum matching scheme for bipartite graphs.

This work was carried out when the author was at IIT Kanpur.

Author to whom all correspondence should be directed. Currently visiting University of Nebraska-Lincoln.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Proc. 12th. Annual ACM Symp. on Computational Geometry, 1996.
Google Scholar
Proc. 3rd. Annual Intl. Conf. on Computational Molecular Biology, April, 1999.
Google Scholar
T. Akutsu. Protein structure alignment using dynamic programming and iterative improvement. IEICE Trans. Information and Systems, E79-D:1629–1636, 1996.
Google Scholar
T. Akutsu, H. Tamaki, and T. Tokuyama. Distribution of distances and triangles in a point set and algorithms for computing the largest common point sets. Discrete and Computational Geometry, 20:307–331, 1998.
Article MathSciNet MATH Google Scholar
S. Arya, D. M. Mount, N. S. Netanyahu, R. Silverman, and A. Wu. An optimal algorithm for approximate nearest neighbor searching. In Proc. 5th. Annual ACM-SIAM Symp. on Discrete Algorithms, pages 573–582, 1994.
Google Scholar
S. Chakraborty and S. Biswas. Approximation algorithms for 3-D common substructure identification in drug and protein molecules. Technical Report TIK Report No. 69, Eidgenössische Technische Hochschule Zürich, 1999. ftp://ftp.tik.ee.ethz.ch/pub/people/samarjit/paper/CB99a.ps.gz.
L. P. Chew, K. Kedem, J. Kleinberg, and D. Huttenlocher. Fast detection of common geometric substructure in proteins. In Proc. RECOMB’99-3rd. Annual International Conference on Computational Molecular Biology [2].
Google Scholar
A. Efrat and A. Itai. Improvements on bottleneck matching and related problems using geometry. In Proc. 12th. Annual ACM Symp. on Computational Geometry [1], pages 301–310.
Google Scholar
P. W. Finn, L. E. Kavraki, J-C. Latombe, R. Motwani, C. Shelton, S. Venkatasubramanian, and A. Yao. RAPID: Randomized pharmacophore identification for drug design. In Proc. 13th. Annual ACM Symp. on Computational Geometry, pages 324–333, Centre Universitaire Méditerranéen, Nice, France, 1997.
Google Scholar
D. Fischer, R. Nussinov, and H. J. Wolfson. 3-D substructure matching in protein molecules. In Proc. 3rd. Annual Symposium on Combinatorial Pattern Matching, April 1992. LNCS 644, pages 136–150.
Chapter Google Scholar
M. T. Goodrich, J. S. B. Mitchell, and M. W. Orletsky. Practical methods for approximate geometric patern matching under rigid motions. In Proc. 10th. Annual ACM Symp. On Computational Geometry, pages 103–112, 1994.
Google Scholar
J. Hopcroft and R. M. Karp. An n^5/2 algorithm for maximum matchings in bipartite graphs. SIAM J. Computing, 2:225–231, 1973.
Article MathSciNet MATH Google Scholar
P. Indyk, R. Motwani, and S. Venkatasubramanian. Geometric matching under noise: Combinatorial bounds and algorithms. In Proc. 10th. Annual ACM-SIAM Symp. on Discrete Algorithms, 1999.
Google Scholar
S. Irani and P. Raghavan. Combinatorial and experimental results for randomized point matching algorithms. In Proc. 12th. Annual ACM Symp. on Computational Geometry [1], pages 68–77.
Google Scholar
S. Lavalle, P. Finn, L. Kavraki, and J-C. Latombe. Efficient database screening for rational drug design using pharmacophore-constrained conformational search. In Proc. RECOMB’99-3rd. Annual International Conference on Computational Molecular Biology [2].
Google Scholar
K. Mehlhorn. Data Structures and Algorithms 3: Multi-dimensional Searching and Computational Geometry. Springer Verlag, Berlin, 1984.
MATH Google Scholar
S. Schirra. Approximate decision algorithms for approximate congruence. Information Processing Letters, 43:29–34, 1992.
Article MathSciNet MATH Google Scholar

Download references

Author information

Authors and Affiliations

Eidgenöossische Technische Hochschule Züurich, Germany
Samarjit Chakraborty
Indian Institute of Technology Kanpur, India
Somenath Biswas

Authors

Samarjit Chakraborty
View author publications
You can also search for this author in PubMed Google Scholar
Somenath Biswas
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

School of Computer Science, Carleton University, 1125 Colonel By Drive, Ottawa, Canada, K1S 5B6
Frank Dehne & Jörg-Rüdiger Sack &
School of Computer Science, Simon Fraser University, Burnaby, BC, Canada, V5A 1S6
Arvind Gupta
Center for Geometric Computing Providence, Brown University, RI, 02912-1910, USA
Roberto Tamassia

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Chakraborty, S., Biswas, S. (1999). Approximation Algorithms for 3-D Common Substructure Identification in Drug and Protein Molecules. In: Dehne, F., Sack, JR., Gupta, A., Tamassia, R. (eds) Algorithms and Data Structures. WADS 1999. Lecture Notes in Computer Science, vol 1663. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-48447-7_26

Download citation

DOI: https://doi.org/10.1007/3-540-48447-7_26
Published: 18 July 2002
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-66279-2
Online ISBN: 978-3-540-48447-9
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics