Abstract
The mutual information function is used to describe the autocorrelation of amino acids in proteins. We find two interesting phenomena: (1) for any given large protein, the mutual information function I(k) is almost constant, where k is the gap length; (2) for any two proteins with similar sequences, the mutual information functions are nearly the same. As a consequence, the mutual information of a protein may be used as a characteristic for sequence comparison.
Additional information
Foundation item: Supported by the National Natural Science Foundation of China (30170214)
Biography: Shi Feng (1966-), male, Ph.D., associate professor, research direction: bioinformatics.
About this article
Cite this article
Feng, S., Jing, H., Yuan-xiang, L. et al. Dependence of mutual information of big protein sequence. Wuhan Univ. J. of Nat. Sci. 8, 294–296 (2003). https://doi.org/10.1007/BF02899496