Abstract
Cloud computing provides a promising solution to the big data problem associated with next generation sequencing applications. The increasing number of cloud service providers, who compete in terms of performance and price, is a clear indication of a growing market with high demand. However, current cloud computing based applications in bioinformatics do not profit from this progress, because they are still limited to just one cloud service provider. In this paper, we present different use case scenarios using hybrid services and resources from multiple cloud providers for bioinformatics applications. We also present a new version of the elasticHPC package to realize these scenarios and to support the creation of cloud computing resources over multiple cloud platforms, including Amazon, Google, Azure, and clouds supporting OpenStack. The instances created on these cloud environments are pre-configured to run big sequence analysis tasks using a large set of pre-installed software tools and parallelization techniques. In addition to its flexibility, we show by experiments that the use of hybrid cloud resources from different providers can save time and cost.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Angiuoli, S., Matalka, M., Gussman, A., et al.: CloVR: a virtual machine for automated and portable sequence analysis from the desktop using cloud computing. BMC Bioinformatics 12(1), 356+ (2011)
Gregory, J., Kuczynski, J., Stombaugh, J.: QIIME allows analysis of high-throughput community sequencing data. Nat. Meth. 7(5), 335–336 (2010)
Langmead, B., Schatz, M.C., Lin, J., Pop, M., Salzberg, S.L.: Searching for snps with cloud computing. Genome Biology 10(R134) (2009)
Wall, D.P., Kudtarkar, P., Fusaro, V.A., Pivovarov, R., Patil, P., Tonellato, P.J.: Cloud computing for comparative genomics. BMC Bioinformatics 11, 259 (2010)
Langmead, B., Hansen, K., Leek, J.: Cloud-scale rna-sequencing differential expression analysis with Myrna. Genome Biology 11(8), R83+ (2010)
StarCluster, http://web.mit.edu/stardev/cluster
Vappio, http://vappio.sf.net
Afgan, E., Coraor, N., Chapman, B., et al.: Galaxy CloudMan: delivering cloud compute clusters. BMC Bioinformatics 11(suppl. 12), S4+ (2010)
El-Kalioby, M., Abouelhoda, M., Krüger, J., et al.: Personalized cloud-based bioinformatics services for research and education: use cases and the elastichpc package. BMC Bioinformatics 13(S-17), S22 (2012)
AWS: Amazon Web Services, http://aws.amazon.com
Azure, W., http://www.microsoft.com/windowsazure
Magellan-a cloud for Science, http://magellan.alcf.anl.gov
DIAG-Data Intensive Academic Grid, http://diagcomputing.org
Li, H., Durbin, R.: Fast and accurate short read alignment with burrows and wheeler transform. Bioinformatics 25(14), 1754–1760 (2009)
DePristo, M., Banks, E., et al.: A framework for variation discovery and genotyping using next-generation DNA sequencing data. Nature Genetics 43(5), 491–498 (2011)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Ali, A.A., El-Kalioby, M., Abouelhoda, M. (2015). Supporting Bioinformatics Applications with Hybrid Multi-cloud Services. In: Ortuño, F., Rojas, I. (eds) Bioinformatics and Biomedical Engineering. IWBBIO 2015. Lecture Notes in Computer Science(), vol 9043. Springer, Cham. https://doi.org/10.1007/978-3-319-16483-0_41
Download citation
DOI: https://doi.org/10.1007/978-3-319-16483-0_41
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-16482-3
Online ISBN: 978-3-319-16483-0
eBook Packages: Computer ScienceComputer Science (R0)