Skip to main content

Part of the book series: Statistics for Biology and Health ((SBH))

  • 1547 Accesses

Abstract

One of the first chores after a gene is mapped and cloned is to search for similarities between the gene and previously cloned genes. These homologies can illuminate the evolutionary history of the new gene and the structure and function of its derived protein. Indeed, it is fair to say that sequence comparison is the single most useful application of the burgeoning genetic databases. Cross species comparisons are being done on a massive scale to identify gene families, regulatory motifs, and conserved regions within genes. In this chapter, we explore some of the principles and algorithms applied in recognizing DNA sequence patterns and producing optimal alignments between two sequences. Our treatments of both problems are necessarily superficial.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 79.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 99.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 129.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Aldous D (1989) Probability Approximations via the Poisson Clumping Heuristic. Springer-Verlag, New York

    MATH  Google Scholar 

  2. Durbin R, Eddy S, Krogh A, Mitchison G (1998) Biological Sequence Analysis: Probabilistic Models of Proteins and Nucleic Acids. Cambridge University Press, Cambridge

    Book  MATH  Google Scholar 

  3. Feller W (1969) An Introduction to Probability Theory and its Applications, Vol 1, 2nd ed. Wiley, New York

    Google Scholar 

  4. Grimmett GR, Stirzaker DR (1992) Probability and Random Processes, 2nd ed. Oxford University Press, Oxford

    Google Scholar 

  5. Kruskal JB (1983) An overview of sequence comparison: Time warps, string edits, and macromolecules. SIAM Review 25: 201–237

    Article  MathSciNet  MATH  Google Scholar 

  6. Needleman SB, Wunsch CD (1970) A general method applicable to the search for similarities in the amino acid sequences of two proteins. J Mol Biol 48: 443–453

    Article  Google Scholar 

  7. Smith TF, Waterman MS (1981) The identification of common mole-cular subsequences. J Mol Biol 147 195–197

    Article  Google Scholar 

  8. Sankoff D, Cedergren RJ (1983) Simultaneous comparisons of three or more sequences related by a tree. In Time Warps String Edits and Macromolecules: The Theory and Practice of Sequence Comparison. edited by Sankoff D, Kruskal JB, Addison-Wesley, Reading, MA, pp 253–264

    Google Scholar 

  9. Schilling MF (1990) The longest run of heads. The College Math J 21: 196–207

    Article  MathSciNet  MATH  Google Scholar 

  10. Setubal J, Meidanis J (1997) Introduction to Computational Molecular Biology. PWS Publishing, Boston

    Google Scholar 

  11. Waterman MS (1989) Sequence alignments Mathematical Methods for DNA Sequencesedited by Waterman MS, CRC Press, Boca Raton, FL, pp 53–92

    Google Scholar 

  12. Waterman MS (1995) Introduction to Computational Biology: Maps Sequences and Genomes. Chapman and Hall, London

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

Copyright information

© 2002 Springer-Verlag New York, Inc.

About this chapter

Cite this chapter

Lange, K. (2002). Sequence Analysis. In: Mathematical and Statistical Methods for Genetic Analysis. Statistics for Biology and Health. Springer, New York, NY. https://doi.org/10.1007/978-0-387-21750-5_13

Download citation

  • DOI: https://doi.org/10.1007/978-0-387-21750-5_13

  • Publisher Name: Springer, New York, NY

  • Print ISBN: 978-1-4684-9556-0

  • Online ISBN: 978-0-387-21750-5

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics