Skip to main content

Assembly of Shotgun Sequencing Data

  • Conference paper
Genetic Mapping and DNA Sequencing

Part of the book series: The IMA Volumes in Mathematics and its Applications ((IMA,volume 81))

  • 475 Accesses

Abstract

We present a simple algorithm for construction of the DNA sequence from a set of fragments generated in a shotgun sequencing project. The algorithm is based on rigorous detection of overlaps among fragments. We report assembly results of the algorithm on two genomic data sets.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 109.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Chang, W. I. and Lawler, E. L., Approximate string matching in sublinear expected time, 31st IEEE Symp. Found. Comput. Sci., 116–124, 1990.

    Google Scholar 

  2. Edwards A., Voss H., Rice P., Civitello A., Stegemann J., Schwager C., Zimmermann J., Erfle H., Caskey, C. T. and Ansorge, W., Automated DNA sequencing of the human HPRT locus, Genomics 6, 593–608, 1990.

    Article  Google Scholar 

  3. [3]Gallant J., Maier, D. and Storer, J., On fìnding minimal length superstring, J. Comput. Sys. Sci. 20, 50–58, 1980.

    Article  MathSciNet  MATH  Google Scholar 

  4. Hirschberg, D.S., A linear space algorithm for computing maximal common subsequences,Comm. ACM 18, 341–343, 1975.

    Article  MathSciNet  MATH  Google Scholar 

  5. Huang, X., A contig assembly program based on sensitive detection of fragment overlaps, Genomics 14, 18–25, 1992.

    Article  Google Scholar 

  6. [6]Huang, X., On global sequence alignment, Comput. Applic. Biosci. 10, 227–235, 1994.

    Google Scholar 

  7. Kececioglu, J. D. and Myers, E. W., Combinatorial algorithms for DNA sequence assembly, Algorithmica 13, 7–51, 1995.

    Article  MathSciNet  MATH  Google Scholar 

  8. Myers, E. W., Incremental alignment algorithms and their applications, Technical Report 86-2, Department of Computer Science, The University of Arizona, Tucson, AZ, 1986.

    Google Scholar 

  9. [9]Myers, E. W. and Miller, W., Optimal alignments in linear space, Comput. Applic. Biosci. 4, 11–17, 1988.

    Google Scholar 

  10. Peltola H., Soderlund H., Tarhio, J. and Ukkonen, E., Algorithms for some string matching problems arising in molecular genetics,Information Processing 83 (Proc. IFIP Congress), 53–64, 1983.

    Google Scholar 

  11. [11]Peltola H., Soderlund, H. and Ukkonen E., Seqaid: a DNA sequence assembling program based on a mathematical model, Nucleic Acids Res. 12, 307–321, 1984.

    Article  Google Scholar 

  12. Seto D., Koop, B. F. and Hood, L.,, An experimentally derived data set constructed for testing large-scale DNA sequence assembly algorithms, Genomics 15, 673–676, 1993.

    Article  Google Scholar 

  13. [13]Smith, T. F. and Waterman, M. S., Identifìcation of common molecular subsequences, J. Mol. Biol. 147, 195–197, 1981.

    Article  Google Scholar 

  14. Staden R., A new computer method for the storage and manipulation of DNA gel reading data, Nucleic Acids Res. 8, 3673–3694, 1980.

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 1996 Springer Science+Business Media New York

About this paper

Cite this paper

Huang, X. (1996). Assembly of Shotgun Sequencing Data. In: Speed, T., Waterman, M.S. (eds) Genetic Mapping and DNA Sequencing. The IMA Volumes in Mathematics and its Applications, vol 81. Springer, New York, NY. https://doi.org/10.1007/978-1-4612-0751-1_11

Download citation

  • DOI: https://doi.org/10.1007/978-1-4612-0751-1_11

  • Publisher Name: Springer, New York, NY

  • Print ISBN: 978-1-4612-6890-1

  • Online ISBN: 978-1-4612-0751-1

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics