Abstract
Sequence analysis often does not take the physicochemical properties into account. On the other hand, some of these properties when identified may be useful in inferring the folding and functional attributes of the molecule when considered with the original sequence information. We evaluated here an analysis using multiple aligned sequences incorporating five physicochemical properties. In addition to site invariance information, we also consider the covariation or interdependence patterns between aligned sites using an information measure. We propose a method based on analyzing the expected mutual information between sites that is statistically significant with a confidence level. When summing the measured information along the aligned sites, we compare the pattern from the measure to the structural and active site of the molecule. In the experiments, the model enzyme molecule lysozyme is chosen. The aligned sequence data are evaluated based on the mapped physicochemical properties of the amino acid residues. Analysis between the original and the transformed sequence data incorporating the physicochemical properties are then compared subtracted and visualized. From the comparisons, the plots show that some of the selected physicochemical properties in the analysis correlate to the locations of active sites and certain folding structure such as helices. The experiments generally support the useful role of incorporating additional physicochemical properties into sequence analysis, when significance of the statistical variations is taken into account.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Ladunga, I.: PHYSEAN: PHYsical Sequence Analysis for the identification of protein domains on the basis of physical and chemical properties of amino acids. Bioinformatics 15(12), 1028–1038 (1999)
Jones, D.D.: Amino acid properties and side-chain orientation in proteins: a cross correlation approach. J. Theor. Biol. 50(1), 167–183 (1975)
Branden, C., Toolze, J.: Introduction to Protein Structure, 2nd edn. Garland Publishing (1999)
Stolorz, P., Lapedes, A., Xia, Y.: Predicting protein secondary structure using neural net and statistical methods. J. Mol. Biol. 225(2), 363–377 (1992)
Crooks, G.E., Brenner, S.E.: Protein secondary structure: entropy, correlations and prediction. Bioinformatics 20(10), 1603–1611 (2004)
Haberman, S.J.: The analysis of residuals in cross-classified tables. Biometrics 29, 205–220 (1990)
Li, W.: Mutual Information Functions Versus Correlation Functions. Journal of Statistical Physics 60(5-6), 823–837 (1990)
Wong, A.K.C., Wang, Y.: High-order pattern discovery from discrete-valued data. IEEE Trans. Knowledge and Data Eng. 9(6), 877–893 (1997)
Jolles, P.: Lysozymes: Model Enzymes in Biochemistry and Biology (1996)
Eisenberg, D., Schwarz, E., Komarony, M., Wall, R.: Analysis of membrane and surface protein sequences with the hydrophobic moment plot. J. Mol. Biol. 179(1), 125–142 (1984)
Zimmerman, J.M., Eliezer, N., Simha, R.: The characterization of amino acid sequences in proteins by statistical methods. J.Theor. Biol. 21(2), 170–201 (1968)
Darnell, J., Lodish, H., Baltimore, D.: Molecular Cell Biology. Scientific American Books
Lesk, M.A.: Introduction to Protein Architecture: The Structural Biology of Proteins. Garland Publishing (1999); 2nd edition(1990)
Iyer, L.K., Qasba, P.K.: Molecular dynamics simulation of a-Lactalbumin and calcium binding c-type lysozyme. Protein Engineering 12(2), 129–139 (1999)
Phillips, D.: The Hen-White Lysozyme Molecule. Proceedings of the National Academy of Sciences of the United States of America 57, 483–495 (1967)
Hooke, S.D., Radford, S.E., Dobson, C.M.: The Refolding of Human Lysozyme: A Comparison with the Structurally Homologous Hen. Biochemistry 33(19), 5867–5876 (1994)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Gadish, M.A., Chiu, D.K.Y. (2008). Detecting Statistical Covariations of Sequence Physicochemical Properties. In: Bazzan, A.L.C., Craven, M., Martins, N.F. (eds) Advances in Bioinformatics and Computational Biology. BSB 2008. Lecture Notes in Computer Science(), vol 5167. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-85557-6_6
Download citation
DOI: https://doi.org/10.1007/978-3-540-85557-6_6
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-85556-9
Online ISBN: 978-3-540-85557-6
eBook Packages: Computer ScienceComputer Science (R0)