Skip to main content

Accelerating De Novo Assembler WTDBG2 on Commodity Servers

  • Conference paper
  • First Online:
Algorithms and Architectures for Parallel Processing (ICA3PP 2020)

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 12452))

  • 1560 Accesses

Abstract

De novo genome assembly reconstructs the chromosomes from massive relatively short fragmented reads and serves as fundamental for studying new species where there is no reference genome. Wtdbg2 is a de novo assembler for long reads that is up to hundreds of kilobases. It is based on fuzzy-Bruijn graph (FBG) and is ten times faster than the cutting-edge assemblers such as Canu. However, the performance of wtdbg2 still requires further improvement: 1) it requires up to terabytes of memory to compute the assembly, which is infeasible to run on commodity server; 2) it requires tens of hours for assembling on large datasets such as genomes of homo sapiens. To address the above drawbacks, we propose several optimization techniques for accelerating wtdbg2 on commodity server, including a memory auto-tuning scheme, sequence alignment optimization and intermediate result elimination in the output procedure. We compare the optimized wtdbg2 with the original implementation and two cutting-edge assemblers on real-world datasets. The experiment results demonstrate that optimized wtdbg2 achieves maximum and average speedup of 2.31\(\times \) and 1.54\(\times \) respectively. In addition, our proposed optimization reduces the memory usage of wtdbg2 by 39.5% without affecting the correctness.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. C.elegans genome dataset (2019). http://datasets.pacb.com.s3.amazonaws.com/2014/c_elegans

  2. D.melanogaster iso1 genome dataset (2019). https://www.ebi.ac.uk/ena/data/view/SRR6702603

  3. Human hg00733 genome dataset (2019). https://www.ebi.ac.uk/ena/data/view/SRR7615963

  4. Human na24385 genome dataset (2019). https://ftp-trace.ncbi.nlm.nih.gov/giab/ftp/data/AshkenazimTrio/HG002_NA24385_son/PacBio_CCS_15kb

  5. wtdbg2 (2019). https://github.com/ruanjue/wtdbg2

  6. Adhianto, L., et al.: HPCTOOLKIT: tools for performance analysis of optimized parallel programs. Concurr. Comput. Pract. Exp. 22(6), 685–701 (2010)

    Google Scholar 

  7. Ahmed, N., Mushtaq, H., Bertels, K., Al-Ars, Z.: GPU accelerated API for alignment of genomics sequencing data. In: 2017 IEEE International Conference on Bioinformatics and Biomedicine (BIBM), pp. 510–515. IEEE (2017)

    Google Scholar 

  8. Berlin, K., Koren, S., Chin, C.S., Drake, J.P., Landolin, J.M., Phillippy, A.M.: Assembling large genomes with single-molecule sequencing and locality-sensitive hashing. Nat. Biotechnol. 33(6), 623 (2015)

    Article  Google Scholar 

  9. Denisov, G., et al.: Consensus generation and variant detection by Celera assembler. Bioinformatics 24(8), 1035–1040 (2008)

    Article  Google Scholar 

  10. Ghosh, P., Krishnamoorthy, S.: PaKman: scalable assembly of large genomes on distributed memory machines. In: 2019 IEEE International Parallel and Distributed Processing Symposium (IPDPS) (2019)

    Google Scholar 

  11. Gurevich, A., Saveliev, V., Vyahhi, N., Tesler, G.: QUAST: quality assessment tool for genome assemblies. Bioinformatics 29(8), 1072–1075 (2013)

    Article  Google Scholar 

  12. Jiang, T., Fu, Y., Liu, B., Wang, Y.: Long-read based novel sequence insertion detection with rCANID. IEEE Trans. Nanobiosci. 18(3), 343–352 (2019)

    Article  Google Scholar 

  13. Kolmogorov, M., Yuan, J., Lin, Y., Pevzner, P.A.: Assembly of long, error-prone reads using repeat graphs. Nat. Biotechnol. 37(5), 540 (2019)

    Article  Google Scholar 

  14. Koren, S., Walenz, B.P., Berlin, K., Miller, J.R., Bergman, N.H., Phillippy, A.M.: Canu: scalable and accurate long-read assembly via adaptive k-mer weighting and repeat separation. Genome Res. 27(5), 722–736 (2017)

    Article  Google Scholar 

  15. Liu, Y., Wirawan, A., Schmidt, B.: CUDASW++ 3.0: accelerating smith-waterman protein database search by coupling CPU and GPU SIMD instructions. BMC Bioinform. 14(1) (2013). Article number: 117. https://doi.org/10.1186/1471-2105-14-117

  16. Nichols, B., Buttlar, D., Farrell, J., Farrell, J.: Pthreads Programming: A POSIX Standard for Better Multiprocessing. O’Reilly Media Inc., Sebastopol (1996)

    Google Scholar 

  17. Pan, T.C., Misra, S., Aluru, S.: Optimizing high performance distributed memory parallel hash tables for DNA k-mer counting. In: SC 2018: International Conference for High Performance Computing, Networking, Storage and Analysis, pp. 135–147. IEEE (2018)

    Google Scholar 

  18. Pantaleoni, J., Subtil, N.: NVBIO: a library of reusable components designed by NVIDIA corporation to accelerate bioinformatics applications using CUDA (2014). http://nvlabs.github.io/nvbio

  19. Peng, Y., Leung, H.C., Yiu, S.M., Chin, F.Y.: IDBA-UD: a de novo assembler for single-cell and metagenomic sequencing data with highly uneven depth. Bioinformatics 28(11), 1420–1428 (2012)

    Article  Google Scholar 

  20. Qiu, M., et al.: Data allocation for hybrid memory with genetic algorithm. IEEE Trans. Emerg. Top. Comput. 3(4), 544–555 (2015)

    Article  Google Scholar 

  21. Qiu, M., Ming, Z., Li, J., Liu, S., Wang, B., Lu, Z.: Three-phase time-aware energy minimization with DVFS and unrolling for chip multiprocessors. J. Syst. Archit. 58(10), 439–445 (2012)

    Article  Google Scholar 

  22. Ruan, J., Li, H.: Fast and accurate long-read assembly with wtdbg2. Nat. Methods 17(2), 155–158 (2020)

    Article  Google Scholar 

  23. Simpson, J.T., Durbin, R.: Efficient de novo assembly of large genomes using compressed data structures. Genome Res. 22(3), 549–556 (2012)

    Article  Google Scholar 

  24. Wenger, A.M., et al.: Highly-accurate long-read sequencing improves variant detection and assembly of a human genome, p. 519025. bioRxiv (2019)

    Google Scholar 

  25. Xiao, C.L., et al.: MECAT: fast mapping, error correction, and de novo assembly for single-molecule sequencing reads. Nat. Methods 14(11), 1072 (2017)

    Article  Google Scholar 

Download references

Acknowledgment

This work is supported by National Key Research and Development Program of China (Grant No. 2016YFB1000304), National Natural Science Foundation of China (Grant No. 61502019), and the Open Project Program of the State Key Laboratory of Mathematical Engineering and Advanced Computing (Grant No. 2019A12).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Hailong Yang .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Dun, M., Li, Y., You, X., Sun, Q., Luan, Z., Yang, H. (2020). Accelerating De Novo Assembler WTDBG2 on Commodity Servers. In: Qiu, M. (eds) Algorithms and Architectures for Parallel Processing. ICA3PP 2020. Lecture Notes in Computer Science(), vol 12452. Springer, Cham. https://doi.org/10.1007/978-3-030-60245-1_16

Download citation

Publish with us

Policies and ethics