A Markov Model for Protein Sequences

The primary sequence of a protein encodes information for its folding. Analyzing the dependencies between adjacent amino acids and estimating the associated entropies may therefore be informative. The present work shows that Markov dependencies are clearly evident in the protein primary sequences of the various databases studied. Higher-order Markov approximations and their entropy calculations indicate short-range interactions between neighboring amino acids in the primary sequence. Moreover, as expected, a strong correlation was observed between secondary structure elements.
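As a rough illustration of the kind of entropy calculation referred to above, the following Python sketch estimates the conditional entropy of an order-k Markov approximation from raw counts. It is not the procedure used in this work: the function name `markov_entropy`, the plug-in (maximum-likelihood) estimator, and the toy sequences are all assumptions made for illustration only.

```python
from collections import Counter
from math import log2


def markov_entropy(sequences, order):
    """Plug-in estimate of H(X_i | previous `order` residues), in bits per residue.

    Hypothetical helper for illustration; order=0 gives the entropy of the
    single-residue composition.
    """
    context_counts = Counter()  # counts of each length-`order` context
    joint_counts = Counter()    # counts of each (context, next residue) pair
    for seq in sequences:
        for i in range(order, len(seq)):
            context = seq[i - order:i]
            joint_counts[(context, seq[i])] += 1
            context_counts[context] += 1

    total = sum(joint_counts.values())
    if total == 0:
        raise ValueError("sequences are too short for the requested order")

    entropy = 0.0
    for (context, _residue), n in joint_counts.items():
        p_joint = n / total                   # P(context, residue)
        p_cond = n / context_counts[context]  # P(residue | context)
        entropy -= p_joint * log2(p_cond)
    return entropy


if __name__ == "__main__":
    # Toy sequences, invented for this example (not real database entries).
    toy = ["MKVLAAGIVGLLLA", "MKTLLAAGLVALLA", "MAVLKAGIVGLALA"]
    for k in range(3):
        print(f"order {k}: {markov_entropy(toy, k):.3f} bits per residue")
```

If the estimated entropy drops as the order increases, the preceding residues carry information about the next one, which is what a Markov dependency means in this setting. With limited data the plug-in estimate is biased downward at higher orders, so any such drop should be compared against shuffled sequences before it is interpreted.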

