The Next-generation sequencing technology is a widely used sequencing method, and many genome researches are based on its sequencing data. Currently, there are many methods of detection of genomic Structure Variation, based on NGS data. And a lot of Copy Number Variation (CNV) detection methods based on the statistical models of read depth. However, since CNV has multiple subtypes and long variant lengths, the traditional detection tools have many limitations. Therefore, this paper proposes AssCNV, a new detection method for CNV, which integrated sequence assembly strategy and read depth strategy. The subtypes of CNV considered in this paper are insertion, deletion, and duplication. Our experimental results showed that AssCNV maintains a higher level of precision and sensitivity in the simulation data of different coverage, which is much better than other available tools.
This is a preview of subscription content, log in to check access.
Project supported by the National Natural Science Foundation of China (Grant No. 61472026) and Beijing Natural Science Foundation (5182018).
Eichler, E.E., Nickerson, D.A., Altshuler, D., et al.: Completing the map of human genetic variation. Nature 447(7141), 161–165 (2007)CrossRefGoogle Scholar
Redon, R., Ishikawa, S., Fitch, K.R., et al.: Global variation in copy number in the human genome. Nature 444(7118), 444–454 (2006)CrossRefGoogle Scholar
Thuresson, A.C., Van Buggenhout, G., Sheth, F., et al.: Whole gene duplication of SCN2A and SCN3A is associated with neonatal seizures and a normal intellectual development. Clin. Genet. 91(1) (2017)CrossRefGoogle Scholar
Hackmann, K., Kuhlee, F., Betcheva-Krajcir, E., et al.: Ready to clone: CNV detection and breakpoint fine-mapping in breast and ovarian cancer susceptibility genes by high-resolution array CGH. Breast Cancer Res. Treat. 159(3), 585–590 (2016)CrossRefGoogle Scholar
Miller, C.A., Hampton, O., Coarfa, C., et al.: ReadDepth: a parallel R package for detecting copy number alterations from short sequence reads. Plos One 6(1), e16327 (2011)CrossRefGoogle Scholar
Abyzov, A., Urban, A.E., Snyder, M., et al.: CNVnator: an approach to discover, genotype, and characterize typical and atypical CNVs from family and population genome sequence. Genome Res. 21(6), 974–984 (2011)CrossRefGoogle Scholar
Chen, K., Wallis, J.W., McLellan, M.D., et al.: BreakDancer: an algorithm for high-resolution mapping of genomic structural variation. Nat. Methods 6(9), 677–681 (2009)CrossRefGoogle Scholar
Ye, K., Schulz, M.H., Long, Q., et al.: Pindel: a pattern growth approach to detect break points of large deletions and medium sized insertions from paired-end short reads. Bioinformatics 25(21), 2865–2871 (2009)CrossRefGoogle Scholar
Rausch, T., Zichner, T., Schlattl, A., et al.: DELLY: structural variant discovery by integrated paired-end and split-read analysis. Bioinformatics 28(18), i333–i339 (2012)CrossRefGoogle Scholar
Layer, R.M., Chiang, C., Quinlan, A.R., et al.: Lumpy: a probabilistic framework for structural variant discovery. Genome Biol. 15(6), R84 (2014)CrossRefGoogle Scholar
Boeva, V., Popova, T., Bleakley, K., et al.: Control-FREEC: a tool for assessing copy number and allelic content using next-generation sequence data. Bioinformatics 28(3), 423–425 (2012)CrossRefGoogle Scholar
Chiang, C., Layer, R.M., Faust, G.G., et al.: SpeedSeq: ultra-fast personal genome analysis and interpretation. Nat. Methods 12(10), 966 (2015)CrossRefGoogle Scholar
Garcíaalcalde, F., Okonechnikov, K., Carbonell, J., et al.: Qualimap: evaluating next-generation sequence alignment data. Bioinformatics 28(20), 2678 (2012)CrossRefGoogle Scholar