Storage Policy for Genomic Data in Hybrid Federated Clouds

  • Conference paper
Advances in Bioinformatics and Computational Biology (BSB 2014)

Part of the book series: Lecture Notes in Computer Science (LNBI, volume 8826)

Abstract

The execution performance of bioinformatics workflows in federated cloud environments is strongly affected by data storage and retrieval, owing to the large volumes of information in genomic sequences. This paper presents a storage policy for the files used in a typical bioinformatics application with genomic data; the policy aims to reduce file transfer time and thereby speed up workflow execution. We discuss a case study using the BioNimbuZ federated cloud platform. Our results show that the storage policy significantly reduced file transfer times and thus lowered the total workflow execution time.

Copyright information

© 2014 Springer International Publishing Switzerland

About this paper

Cite this paper

Gallon, R., Holanda, M., Araújo, A., Walter, M.E. (2014). Storage Policy for Genomic Data in Hybrid Federated Clouds. In: Campos, S. (eds) Advances in Bioinformatics and Computational Biology. BSB 2014. Lecture Notes in Computer Science, vol 8826. Springer, Cham. https://doi.org/10.1007/978-3-319-12418-6_14

  • DOI: https://doi.org/10.1007/978-3-319-12418-6_14

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-12417-9

  • Online ISBN: 978-3-319-12418-6

  • eBook Packages: Computer Science (R0)
