Protein Structure Comparison Based on a Measure of Information Discrepancy
Protein structure comparison is an important tool to explore and understand the different aspects of protein 3D structures. In this paper, a novel representation of protein structure (complete information set of C α –C α distances, CISD) is formulated at first. Then an FDOD score scheme is developed to measure the similarity between two representations. Numerical experiments of the new method are conducted in four different protein datasets and clustering analyses are given to verify the effectiveness of this new similarity measure. Furthermore, preliminary results of detecting homologous protein pairs of an existing non-redundant subset of CATH v2.5.1 based on the new similarity are given as a pilot study. All the results show that this new approach to measure the similarities between protein structures is simple to implement, computationally efficient and fast.
KeywordsRoot Mean Square Deviation Protein Dataset Score Scheme Protein Structure Comparison Subsequence Distribution
Unable to display preview. Download preview PDF.
- Michalewicz, Z., Eidhammer, I., Jonassen, I., Taylor, W.R.: Structure Comparison and Structure Patterns. Journal of Computational Biology 7(7), 685–716 (2000)Google Scholar
- Chen, L., Wu, L.Y., Wang, R., Wang, Y., Zhang, S., Zhang, X.S.: Comparison of Protein Structures by Multi-Objective Optimization. Genome Informatics 16(2) (2005)Google Scholar
- Krasnogor, N., Pelta, D.A.: Measuring the Similarity of Protein Structures by Means of the Universal Similarity Metric. Bioinformatics 20(7) (2004)Google Scholar
- Orengo, C.A.: A review of methods for protein structure comparison. In: Taylor, W.R. (ed.) Patterns in Protein Sequence and Structure, vol. 7, pp. 159–188. Springer, Heidelberg (1992)Google Scholar