Genomes or exomes: evaluation of cost, time and coverage
KeywordsGenome Sequencing Sequencing Data Code Region Sequencing Technology Genomic Variation
Next-generation sequencing technology platforms are driving the development of a variety of approaches to study genomic variation associated with disease. One of these approaches, exome sequencing, specifically targets the coding regions of the genome, which are captured and sequenced. Compared with whole genome sequencing, exome sequencing offers the advantages of being cost- and time-effective while providing deeper coverage of coding variants, which are more likely to affect function.
However, the protocol is known to be only partially reliable and might miss some of the coding regions. To assess how much coding region could be missed or of target, we compared whole genome and exome sequencing data derived from one sample that was processed by the Illumina GA-IIx platform.
Our in-house-developed workflow named TREAT (Targeted RE-sequencing and Annotation Tool) was used to align and annotate the data. We provide a summary of the comparison between the two datasets, including the total number of reads produced, the time needed for sequencing and analysis, the coverage of coding regions and the agreement between called variants.
This article is published under license to BioMed Central Ltd. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.