Skip to main content

Detecting Statistical Covariations of Sequence Physicochemical Properties

  • Conference paper
  • 429 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNBI,volume 5167))

Abstract

Sequence analysis often does not take the physicochemical properties into account. On the other hand, some of these properties when identified may be useful in inferring the folding and functional attributes of the molecule when considered with the original sequence information. We evaluated here an analysis using multiple aligned sequences incorporating five physicochemical properties. In addition to site invariance information, we also consider the covariation or interdependence patterns between aligned sites using an information measure. We propose a method based on analyzing the expected mutual information between sites that is statistically significant with a confidence level. When summing the measured information along the aligned sites, we compare the pattern from the measure to the structural and active site of the molecule. In the experiments, the model enzyme molecule lysozyme is chosen. The aligned sequence data are evaluated based on the mapped physicochemical properties of the amino acid residues. Analysis between the original and the transformed sequence data incorporating the physicochemical properties are then compared subtracted and visualized. From the comparisons, the plots show that some of the selected physicochemical properties in the analysis correlate to the locations of active sites and certain folding structure such as helices. The experiments generally support the useful role of incorporating additional physicochemical properties into sequence analysis, when significance of the statistical variations is taken into account.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   54.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   69.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Ladunga, I.: PHYSEAN: PHYsical Sequence Analysis for the identification of protein domains on the basis of physical and chemical properties of amino acids. Bioinformatics 15(12), 1028–1038 (1999)

    Article  Google Scholar 

  2. Jones, D.D.: Amino acid properties and side-chain orientation in proteins: a cross correlation approach. J. Theor. Biol. 50(1), 167–183 (1975)

    Article  Google Scholar 

  3. Branden, C., Toolze, J.: Introduction to Protein Structure, 2nd edn. Garland Publishing (1999)

    Google Scholar 

  4. Stolorz, P., Lapedes, A., Xia, Y.: Predicting protein secondary structure using neural net and statistical methods. J. Mol. Biol. 225(2), 363–377 (1992)

    Article  Google Scholar 

  5. Crooks, G.E., Brenner, S.E.: Protein secondary structure: entropy, correlations and prediction. Bioinformatics 20(10), 1603–1611 (2004)

    Article  Google Scholar 

  6. Haberman, S.J.: The analysis of residuals in cross-classified tables. Biometrics 29, 205–220 (1990)

    Article  Google Scholar 

  7. Li, W.: Mutual Information Functions Versus Correlation Functions. Journal of Statistical Physics 60(5-6), 823–837 (1990)

    Article  MATH  MathSciNet  Google Scholar 

  8. Wong, A.K.C., Wang, Y.: High-order pattern discovery from discrete-valued data. IEEE Trans. Knowledge and Data Eng. 9(6), 877–893 (1997)

    Article  Google Scholar 

  9. Jolles, P.: Lysozymes: Model Enzymes in Biochemistry and Biology (1996)

    Google Scholar 

  10. Eisenberg, D., Schwarz, E., Komarony, M., Wall, R.: Analysis of membrane and surface protein sequences with the hydrophobic moment plot. J. Mol. Biol. 179(1), 125–142 (1984)

    Article  Google Scholar 

  11. Zimmerman, J.M., Eliezer, N., Simha, R.: The characterization of amino acid sequences in proteins by statistical methods. J.Theor. Biol. 21(2), 170–201 (1968)

    Article  Google Scholar 

  12. Darnell, J., Lodish, H., Baltimore, D.: Molecular Cell Biology. Scientific American Books

    Google Scholar 

  13. Lesk, M.A.: Introduction to Protein Architecture: The Structural Biology of Proteins. Garland Publishing (1999); 2nd edition(1990)

    Google Scholar 

  14. Iyer, L.K., Qasba, P.K.: Molecular dynamics simulation of a-Lactalbumin and calcium binding c-type lysozyme. Protein Engineering 12(2), 129–139 (1999)

    Article  Google Scholar 

  15. Phillips, D.: The Hen-White Lysozyme Molecule. Proceedings of the National Academy of Sciences of the United States of America 57, 483–495 (1967)

    Article  Google Scholar 

  16. Hooke, S.D., Radford, S.E., Dobson, C.M.: The Refolding of Human Lysozyme: A Comparison with the Structurally Homologous Hen. Biochemistry 33(19), 5867–5876 (1994)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Ana L. C. Bazzan Mark Craven Natália F. Martins

Rights and permissions

Reprints and permissions

Copyright information

© 2008 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Gadish, M.A., Chiu, D.K.Y. (2008). Detecting Statistical Covariations of Sequence Physicochemical Properties. In: Bazzan, A.L.C., Craven, M., Martins, N.F. (eds) Advances in Bioinformatics and Computational Biology. BSB 2008. Lecture Notes in Computer Science(), vol 5167. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-85557-6_6

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-85557-6_6

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-85556-9

  • Online ISBN: 978-3-540-85557-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics