Ballast: A Ball-Based Algorithm for Structural Motifs
Structural motifs encapsulate local sequence-structure-function relationships characteristic of related proteins, enabling the prediction of functional characteristics of new proteins, providing molecular-level insights into how those functions are performed, and supporting the development of variants specifically maintaining or perturbing function in concert with other properties. Numerous computational methods have been developed to search through databases of structures for instances of specified motifs. However, it remains an open problem as to how best to leverage the local geometric and chemical constraints underlying structural motifs in order to develop motif-finding algorithms that are both theoretically and practically efficient. We present a simple, general, efficient approach, called Ballast (Ball-based algorithm for structural motifs), to match given structural motifs to given structures. Ballast combines the best properties of previously developed methods, exploiting the composition and local geometry of a structural motif and its possible instances in order to effectively filter candidate matches. We show that on a wide range of motif matching problems, Ballast efficiently and effectively finds good matches, and we provide theoretical insights into why it works well. By supporting generic measures of compositional and geometric similarity, Ballast provides a powerful substrate for the development of motif matching algorithms.
Keywordsprotein structure structural motif sequence-structure-function relationship geometric matching motif matching algorithm probabilistic analysis
Unable to display preview. Download preview PDF.
- 5.Bandyopadhyay, D., Snoeyink, J.: Almost-delaunay simplices: nearest neighbor relations for imprecise points. In: Proc. SODA, pp. 410–419 (2004)Google Scholar
- 16.Lueker, G.S.: A data structure for orthogonal range queries. In: Proc. FOCS, pp. 28–34. IEEE Computer Society, Washington, DC (1978)Google Scholar
- 21.Muthukrishnan, S., Pandurangan, G.: The bin-covering technique for thresholding random geometric graph properties. In: Proc. SODA, pp. 989–998 (2005)Google Scholar
- 25.Penrose, M.D.: Random Geometric Graphs. Oxford University Press (2003)Google Scholar
- 31.Willard, D.E.: Predicate-Oriented Database Search Algorithms. Outstanding Dissertations in the Computer Sciences. Garland Publishing, New York (1978)Google Scholar
- 32.Wolfson, H.J., Rigoutsos, I.: Geometric hashing: An overview. Computing in Science and Engineering 4, 10–21 (1997)Google Scholar