DNA Sequencing Data Analysis

  • Keyi Long
  • Lei Cai
  • Lin He
Part of the Methods in Molecular Biology book series (MIMB, volume 1754)


Among the many kinds of biological data, DNA sequence is undoubtedly one of the most fundamental. By obtaining and analyzing specific DNA sequence data, biologists can understand living systems more precisely. This chapter gives an overview of DNA sequencing technology and its data analysis methods, covering DNA sequencing itself, several different methods, and the tools applied in data analysis. The advantages and disadvantages of each are discussed.

Key words

  • DNA sequence
  • DNA sequencing
  • Data analysis
  • Sequence comparison
  • Methods and tools



Copyright information

© Springer Science+Business Media, LLC, part of Springer Nature 2018

Authors and Affiliations

  • Keyi Long (1)
  • Lei Cai (1)
  • Lin He (1)
  1. Bio-X Institutes, Key Laboratory for the Genetics of Developmental and Neuropsychiatric Disorders (Ministry of Education), Collaborative Innovation Center for Genetics and Development, Shanghai Jiao Tong University, Shanghai, China
