Tools and Methods in Analysis of Complex Sequences
Genome sequencing is an important molecular technique used in diverse types of biological investigations. Since the time of Sanger’s sequencing of bacteriophage genome is completed, continuous search has been ongoing to innovate new sequencing methods which can offer faster, reliable, and extensive coverage of genomic sequences. One such latest innovation is NGS (next-generation sequencing) platforms, which utilizes different molecular techniques to sequence the entire genome of any organism. The NGS technology is customized for different types of applications, including WGS (whole genome sequencing), WES (whole exome sequencing), and targeted resequencing, de novo assembly sequencing and transcriptome sequencing at the DNA or RNA level. NGS technologies are widely used to analyze quantitative differences in gene expression, discovering transcribed sequences of miRNAs, and also in mutation screening.
Illumina SBS (sequencing by synthesis) is one of the most advanced methodologies of gene sequencing. In this method, hybridization of denatured DNA with the oligonucleotides of cell lanes takes place, and base pairs are generated. There are various software available to assess the quality of genetic sequences produced. In WES, the whole exome is sequenced, and the data is taken for primary analysis. This process involves three steps including sequence alignment, postprocessing, and variant analysis. After that, secondary analysis is performed in which generated variants are annotated and prioritized.
Finally, these variants are filtered to determine any casual genetic mutation which is associated with any disease or lethality in the human genome.
KeywordsSequencing WGS WES NGS Variants
- Al-Aama JY, Shaik NA, Banaganapalli B, Salama MA, Rashidi O, Sahly AN, Mohsen MO, Shawoosh HA, Shalabi HA, Edreesi MA, Alharthi SE, Wang J, Elango R, Saadah OI (2017) Whole exome sequencing of a consanguineous family identifies the possible modifying effect of a globally rare AK5 allelic variant in celiac disease development among Saudi patients. PLoS One 12:e0176664PubMedPubMedCentralGoogle Scholar
- Butkiewicz M, Bush WS (2016) In Silico functional annotation of genomic variation. Curr Protoc Hum Genet 88, Unit 6 15Google Scholar
- Huang H, Fang M, Jostins L, Umicevic Mirkov M, Boucher G, Anderson CA, Andersen V, Cleynen I, Cortes A, Crins F, D’amato M, Deffontaine V, Dmitrieva J, Docampo E, Elansary M, Farh KK, Franke A, Gori AS, Goyette P, Halfvarson J, Haritunians T, Knight J, Lawrance IC, Lees CW, Louis E, Mariman R, Meuwissen T, Mni M, Momozawa Y, Parkes M, Spain SL, Theatre E, Trynka G, Satsangi J, Van Sommeren S, Vermeire S, Xavier RJ, International Inflammatory Bowel Disease Genetics, C, Weersma RK, Duerr RH, Mathew CG, Rioux JD, Mcgovern DPB, Cho JH, Georges M, Daly MJ, Barrett JC (2017) Fine-mapping inflammatory bowel disease loci to single-variant resolution. Nature 547:173–178PubMedPubMedCentralGoogle Scholar
- Lander ES, Linton LM, Birren B, Nusbaum C, Zody MC, Baldwin J, Devon K, Dewar K, Doyle M, Fitzhugh W, Funke R, Gage D, Harris K, Heaford A, Howland J, Kann L, Lehoczky J, Levine R, Mcewan P, Mckernan K, Meldrim J, Mesirov JP, Miranda C, Morris W, Naylor J, Raymond C, Rosetti M, Santos R, Sheridan A, Sougnez C, Stange-Thomann Y, Stojanovic N, Subramanian A, Wyman D, Rogers J, Sulston J, Ainscough R, Beck S, Bentley D, Burton J, Clee C, Carter N, Coulson A, Deadman R, Deloukas P, Dunham A, Dunham I, Durbin R, French L, Grafham D, Gregory S, Hubbard T, Humphray S, Hunt A, Jones M, Lloyd C, Mcmurray A, Matthews L, Mercer S, Milne S, Mullikin JC, Mungall A, Plumb R, Ross M, Shownkeen R, Sims S, Waterston RH, Wilson RK, Hillier LW, Mcpherson JD, Marra MA, Mardis ER, Fulton LA, Chinwalla AT, Pepin KH, Gish WR, Chissoe SL, Wendl MC, Delehaunty KD, Miner TL, Delehaunty A, Kramer JB, Cook LL, Fulton RS, Johnson DL, Minx PJ, Clifton SW, Hawkins T, Branscomb E, Predki P, Richardson P, Wenning S, Slezak T, Doggett N, Cheng JF, Olsen A, Lucas S, Elkin C, Uberbacher E, Frazier M et al (2001) Initial sequencing and analysis of the human genome. Nature 409:860–921PubMedPubMedCentralGoogle Scholar
- Lee H, Deignan JL, Dorrani N, Strom SP, Kantarci S, Quintero-Rivera F, Das K, Toy T, Harry B, Yourshaw M, Fox M, Fogel BL, Martinez-Agosto JA, Wong DA, Chang VY, Shieh PB, Palmer CG, Dipple KM, Grody WW, Vilain E, Nelson SF (2014b) Clinical exome sequencing for genetic identification of rare Mendelian disorders. JAMA 312:1880–1887PubMedPubMedCentralGoogle Scholar
- Pruitt KD, Harrow J, Harte RA, Wallin C, Diekhans M, Maglott DR, Searle S, Farrell CM, Loveland JE, Ruef BJ, Hart E, Suner MM, Landrum MJ, Aken B, Ayling S, Baertsch R, Fernandez-Banet J, Cherry JL, Curwen V, Dicuccio M, Kellis M, Lee J, Lin MF, Schuster M, Shkeda A, Amid C, Brown G, Dukhanina O, Frankish A, Hart J, Maidak BL, Mudge J, Murphy MR, Murphy T, Rajan J, Rajput B, Riddick LD, Snow C, Steward C, Webb D, Weber JA, Wilming L, Wu W, Birney E, Haussler D, Hubbard T, Ostell J, Durbin R, Lipman D (2009) The consensus coding sequence (CCDS) project: identifying a common protein-coding gene set for the human and mouse genomes. Genome Res 19:1316–1323PubMedPubMedCentralGoogle Scholar
- Regier AA, Farjoun Y, Larson DE, Krasheninina O, Kang HM, Howrigan DP, Chen BJ, Kher M, Banks E, Ames DC, English AC, Li H, Xing J, Zhang Y, Matise T, Abecasis GR, Salerno W, Zody MC, Neale BM, Hall IM (2018) Functional equivalence of genome sequencing analysis pipelines enables harmonized variant calling across human genetics projects. Nat Commun 9:4038PubMedPubMedCentralGoogle Scholar
- Worthey EA, Mayer AN, Syverson GD, Helbling D, Bonacci BB, Decker B, Serpe JM, Dasu T, Tschannen MR, Veith RL, Basehore MJ, Broeckel U, Tomita-Mitchell A, Arca MJ, Casper JT, Margolis DA, Bick DP, Hessner MJ, Routes JM, Verbsky JW, Jacob HJ, Dimmock DP (2011) Making a definitive diagnosis: successful clinical application of whole exome sequencing in a child with intractable inflammatory bowel disease. Genet Med 13:255–262PubMedGoogle Scholar