Disaggregating Non-Volatile Memory for Throughput-Oriented Genomics Workloads

  • Aaron CallEmail author
  • Jordà Polo
  • David Carrera
  • Francesc Guim
  • Sujoy Sen
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 11339)


Massive exploitation of next-generation sequencing technologies requires dealing with both: huge amounts of data and complex bioinformatics pipelines. Computing architectures have evolved to deal with these problems, enabling approaches that were unfeasible years ago: accelerators and Non-Volatile Memories (NVM) are becoming widely used to enhance the most demanding workloads. However, bioinformatics workloads are usually part of bigger pipelines with different and dynamic needs in terms of resources. The introduction of Software Defined Infrastructures (SDI) for data centers provides roots to dramatically increase the efficiency in the management of infrastructures. SDI enables new ways to structure hardware resources through disaggregation, and provides new hardware composability and sharing mechanisms to deploy workloads in more flexible ways. In this paper we study a state-of-the-art genomics application, SMUFIN, aiming to address the challenges of future HPC facilities.


Genomics Disaggregation Composability NVM NVMeOF Characterization Orchestration 



This work is partially supported by the European Research Council (ERC) under the EU Horizon 2020 programme (GA 639595), the Spanish Ministry of Economy, Industry and Competitivity (TIN2015-65316-P) and the Generalitat de Catalunya (2014-SGR-1051).


  1. 1.
    Cadenelli, N., Polo, J., Carrera, D.: Accelerating k-mer frequency counting with GPU and non-volatile memory. In: Proceedings of the 19th IEEE International Conference on High Performance Computing and Communications (HPCC). IEEE Computer Society, December 2017Google Scholar
  2. 2.
    Chen, F., Lee, R., Zhang, X.: Essential roles of exploiting internal parallelism of flash memory based solid state drives in high-speed data processing. In: 2011 IEEE 17th International Symposium on High Performance Computer Architecture, pp. 266–277, February 2011Google Scholar
  3. 3.
    Ciciani, B., et al.: Automated workload characterization in cloud-based transactional data grids. In: 2012 IEEE 26th International Parallel and Distributed Processing Symposium Workshops Ph.D. Forum, pp. 1525–1533, May 2012Google Scholar
  4. 4.
    NVMexpress: NVMe over fabrics overview. Tech. rep., NVM express (2017).
  5. 5.
    Facebook: Facebook disaggregated rack (2016).
  6. 6.
    Gao, P.X., et al.: Network requirements for resource disaggregation. In: Proceedings of the 12th USENIX Conference on Operating Systems Design and Implementation. USENIX Association, Berkely, CA, USA, November 2016Google Scholar
  7. 7.
    Guz, Z., Li, H.H., Shayesteh, A., Balakrishnan, V.: NVMe-over-fabrics performance characterization and the path to low-overhead flash disaggregation. In: Proceedings of the 10th ACM International Systems and Storage Conference SYSTOR 2017, pp. 16:1–16:9. ACM, New York (2017)Google Scholar
  8. 8.
    Intel: Intel rack scale design. Tech. Rep. 332937–004, Intel Corporation, August 2016.
  9. 9.
  10. 10.
    Kawalia, A., et al.: Leveraging the power of high performance computing for next generation sequencing data analysis: tricks and twists from a high throughput exome workflow. PloS one 10(5), e0126321 (2015)CrossRefGoogle Scholar
  11. 11.
    Klimovic, A., Litz, H., Kozyrakis, C.: Reflex: remote flash & local flash. In: Proceedings of the Twenty-Second International Conference on Architectural Support for Programming Languages and Operating Systems ASPLOS (2017)Google Scholar
  12. 12.
    Li, H., Durbin, R.: Fast and accurate short read alignment with burrows-wheeler transform. Bioinformatics 25(14), 1754–1760 (2009)CrossRefGoogle Scholar
  13. 13.
    Li, R., et al.: SNP detection for massively parallel whole-genome resequencing. Genome Res. 19(6), 1124–1132 (2009)CrossRefGoogle Scholar
  14. 14.
    Medvedev, P., Stanciu, M., Brudno, M.: Computational methods for discovering structural variation with next-generation sequencing. Nat. Methods 6, S13–S20 (2009)CrossRefGoogle Scholar
  15. 15.
    Moncunill, V., et al.: Comprehensive characterization of complex structural variations in cancer by directly comparing genome sequence reads. Nat. Biotechnol. 32(11), 1106–1112 (2014)CrossRefGoogle Scholar
  16. 16.
    Puckelwartz, M.J., et al.: Supercomputing for the parallelization of whole genome analysis. Bioinformatics 30(11), 1508 (2014)CrossRefGoogle Scholar
  17. 17.
    Sivashankar, S., Ramasamy, S.: Design and implementation of non-volatile memory express. In: 2014 International Conference on Recent Trends in Information Technology, Chennai, India, April 2014Google Scholar
  18. 18.
    Weerasinghe, J., Abel, F., Hagleitner, C., Herkersdorf, A.: Disaggregated FPGAs: network performance comparison against bare-metal servers, virtual machines and linux containers. In: Proceedings of the 8th IEEE International Conference on Cloud Computing Technology and Science, December 2016Google Scholar

Copyright information

© Springer Nature Switzerland AG 2019

Authors and Affiliations

  • Aaron Call
    • 1
    • 2
    Email author
  • Jordà Polo
    • 1
  • David Carrera
    • 1
    • 2
  • Francesc Guim
    • 3
  • Sujoy Sen
    • 3
  1. 1.Barcelona Supercomputing Center (BSC)BarcelonaSpain
  2. 2.Universitat Politècnica de Catalunya (UPC)BarcelonaSpain
  3. 3.Intel CorporationSanta ClaraUSA

Personalised recommendations