Abstract
Protein structure motif detection is one of the fundamental problems in Structural Bioinformatics. Compared with sequence motifs, structural motifs are more sensitive in detecting the evolutionary relationships among proteins. A variety of algorithms have been proposed to attack this problem. However, they are either heuristic without theoretical performance guarantee, or inefficient for employing an exhaustive search strategy. Here, we study a reasonably restricted version of this problem: the compact structural motif problem. In this paper, we prove that this restricted version is still NP-hard, and we present a polynomial-time approximation scheme to solve it. To the best of our knowledge, this is the first approximation algorithm with a guarantee ratio for the protein structural motif problem.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Aloy, P., Querol, E., Aviles, F.X., Sternberg, M.J.: Automated structure-based prediction of functional sites in proteins: Applications to assessing the validity of inheriting protein function from homology in genome annotation and to protein docking. Journal of Molecular Biology 311, 395–C408 (2001)
Arun, K.S., Huang, T.S., Blostein, S.D.: Least square fitting of two 3-d point sets. IEEE Transactions on Pattern Analysis and Machine Intelligence 9(5), 698–700 (1987)
Bandyopadhyay, D., Huan, J., Liu, J., Prins, J., Snoeyink, J., Wang, W., Tropsha, A.: Structure-based function inference using protein family-specific fingerprints. Journal of Protein Science 15, 1537–1543 (2006)
Chew, L.P., Kedem, K.: Finding the consensus shape of a protein family. In: Proc. 18th Annual ACM Symposium on Computational Geometry, pp. 64–73 (2002)
Gelfand, I., Kister, A., Kulikowski, C., Stoyanov, O.: Geometric invariant core for the vl and vh domains of immunoglobulin molecules. Protein Engineering 11, 1015–1025 (1998)
Gerstein, M., Altman, R.B.: Average core structure and variability measures for protein families: application to the immunoglobins. Journal of Molecular Biology 112, 535–542 (1995)
Holm, L., Sander, C.: Dali: a network tool for protein structure comparison. Trends Biochem Sci. 20(11), 478–480 (1995)
Huang, T.S., Blostein, S.D., Margerum, E.A.: Least-square estimation of motion parameters from 3-d point correspondences. In: Proc of the IEEE Conference on Computer Vision and Pattern Recognition, vol. 69 pp. 198–201 (1986)
Kolodny, R., Linial, N.: Approximate protein structural alignment in polynomial time. Proc. Natl Acad. Sci. 101, 12201–12206 (2004)
Leibowitz, N., Fligelman, Z.Y., Nussinov, R.: Multiple structural alignment and core detection by geometric hashing. In: Proc. 7th Int. Conf. Intell. Sys. Mol. Biol, pp. 169–177 (1999)
Li, M., Ma, B., Wang, L.: Finding similar regions in many strings. In: Proceedings of the thirty-first annual ACM symposium on Theory of computing (STOC), pp. 473–482, Atlanta (May 1999)
Moan, C., Rusu, I.: Hard problems in similarity searching. Discrete Appl. Math. 144(1-2), 213–227 (2004)
Orengo, C.: Cora-topological fingerprints for protein structural familie. Protein Science 8, 699–715 (1999)
Orengo, C., Taylor, W.: Ssap: Sequential structure alignment program for protein structure comparison. Methods in Enzymology 266, 617–635 (1996)
Shatsky, M., Shulman-Peleg, A., Nussinov, R., Wolfson, H.: The multiple common point set problem and its application to molecule binding pattern detection. Journal of Computational Biology 13(2), 407–428 (2006)
Shindyalov, I.N., Bourne, P.E.: Protein structure alignment by incremental combinatorial extension ce of the optimal path. Protein Eng. 11(9), 739–747 (1998)
Subbiah, S., Laurents, D.V., Levitt, M.: Structural similarity of dna-binding domains ofbacteriophage repressors and the globin core. Current Biology 3, 141–148 (1993)
Xu, J., Jiao, F., Berger, B.: A parameterized algorithm for protein structure alignment. In: RECOMB, pp. 488–499 (2006)
Ye, J., Janardan, R.: Approximate multiple protein structure alignment using the sum-of-pairs distance. Journal of Computational Biology 11(5), 986–1000 (2004)
Zhang, Y., Skolnick, J.: Tm-align: a protein structure alignment algorithm based on the tm-score. Nucleic Acids Research 33, 2302–2309 (2005)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Qian, J., Li, S.C., Bu, D., Li, M., Xu, J. (2007). Finding Compact Structural Motifs. In: Ma, B., Zhang, K. (eds) Combinatorial Pattern Matching. CPM 2007. Lecture Notes in Computer Science, vol 4580. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-73437-6_16
Download citation
DOI: https://doi.org/10.1007/978-3-540-73437-6_16
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-73436-9
Online ISBN: 978-3-540-73437-6
eBook Packages: Computer ScienceComputer Science (R0)