Classification and Function Prediction of Proteins Using Diagnostic Amino Acid Patterns

  • Aiala Reizer
  • Milton H. SaierJr
  • Jonathan Reizer
Part of the Methods in Molecular Biology book series (MIMB, volume 25)


Protein sequence similarities offer a convenient means for the classification and identification of protein families and superfamilies. Frequently, proteins descended from a common ancestor preserve their basic three-dimensional conformations even when they have accumulated large numbers of amino acid substitutions and short insertions or deletions. These may prohibit establishment of homology or evolutionary relationships by traditional global sequence alignment means. Limited regions of sequence similarity can also be the result of evolutionary convergence driven by a need for a common function. Regardless of whether divergent or convergent evolution played a role in the appearance of local sequence similarities, these confined regions of similarity can provide insight into structural and functional relationships of proteins that otherwise fail to show significant similarity by global alignment methods.


Output File Motif Program Sequence File Genetic Computer Group PROSITE Database 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


  1. 1.
    Bairoch, A (1991) PROSITE: a dictionary of sites and patterns in proteins. Nucleic Acids Res. 19, 2241–2245.PubMedGoogle Scholar
  2. 2.
    Dayhoff, M. O, Barker, W. C, and Hunt, L. T. (1983) Establishing homolo ies in protein sequences. Meth. Enzymol. 91, 524–545PubMedCrossRefGoogle Scholar
  3. 3.
    Taylor, W. R, and Jones, D. T. (1991) Templates, consensus patterns and motifs Curr. Opin. Struct. Biol. 1, 327–333CrossRefGoogle Scholar
  4. 4.
    Staden, R. (1988) Methods to define and locate patterns of motifs in sequences ABIOS 4, 53–60.Google Scholar
  5. 5.
    Staden, R (1989) Methods for calculating the probabilities of finding patterns n sequences CABIOS 5, 89–96PubMedGoogle Scholar
  6. 6.
    Barton, G J. and Sternberg, M. J. E. (1990) Flexible protein sequence pat erns. A sensitive method to detect weak structural similarities J. Mol Biol. 12, 389–402.CrossRefGoogle Scholar
  7. 7.
    Sibbald, P R. and Argos, P. (1990) Scrutineer, a computer program that flex bly seeks and describes motifs and profiles in protein sequence databases ABIOS 6, 279–288.Google Scholar
  8. 8.
    Hodgman, T, C. (1989) The elucidation of protein function by sequence motif nalysis CABIOS 5, 1–13.PubMedGoogle Scholar
  9. 9.
    Sternberg, M. J. E (1991) PROMOT: a FORTRAN program to scan protein equences against a library of known motifs. CABIOS 7, 257–260PubMedGoogle Scholar
  10. 10.
    Seto, Y, Ikeuchi, Y, and Kanehisa, M. (1990) Fragment peptide library for lassification and functional prediction of proteins Proteins 8, 341–351.PubMedCrossRefGoogle Scholar
  11. 11.
    Argos, P, Vingron, M, and Vogt, G (1991) Protein sequence comparison ethods and significance. Prot Engin. 4, 375–383.CrossRefGoogle Scholar

Copyright information

© Humana Press Inc, Totowa, NJ 1994

Authors and Affiliations

  • Aiala Reizer
    • 1
  • Milton H. SaierJr
    • 1
  • Jonathan Reizer
    • 2
  1. 1.Department of BiologyUniversity of California at San DiegoLa JollaCA
  2. 2.Department of BiologyUniversity of California at San DiegoLa JollaCA

Personalised recommendations