Abstract
Many bioinformatics tasks involve creating a computational pipeline from existing software components and algorithms. The traditional approach is to glue components together using scripts written in a programming language such as Perl. However, a new, more powerful approach is emerging that promises to revolutionise the way bioinformaticians create applications from existing components, an approach based on the concept of the scientific workflow.
Scientific workflows are created in graphical environments known as workflow management systems. They have many benefits over traditional program scripts, including speed of development, portability, and their suitability for developing complex, distributed applications. This chapter explains how to design and implement bioinformatics workflows using free, Open Source software tools, such as the Tav-erna workflow management system. We also demonstrate how new and existing tools can be deployed as Web services so that they can be easily integrated into novel computational pipelines using the scientific workflow paradigm.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
van der Aalst, W., van Hee, K. (2002)Workflow Management: Models, Methods, and Systems. MIT Press, Cambridge, MA.
Halling-Brown, M. D., Cohen, J., Darlington, J., et al. (2006) Constructing an antigenic peptide prediction pipeline within ICENI II. BMCBioinformatics, submitted.
Hamscher, V., Schwiegelshohn, U., Streit, A., et al. (2000)Evaluation of Job-Scheduling Strategies for Grid Computing. Springer-Verlag, Heidelberg, Germany.
Yu, J., Buyya, R (2005)A Taxonomy of Workflow Management Systems for Grid Computing. Technical Report GRIDS-TR-2005–1,Grid Computing and Distributed Systems Laboratory. University of Melbourne.
Hwang, W. H., Huang, S. Y. (2003) Estimation in capture—recapture models when covariates are subject to measurement errors. Biometrics 59, 1113–1122.
Oinn, T., Addis, M., Ferris, J., et al. (2004) Taverna: a tool for the composition and enactment of bioinformatics workflows. Bioinformatics 20, 3045–3054.
Stevens, R. D., Robinson, A. J., Goble, C. A. (2003) myGrid: personalised bioinfor-matics on the information grid. Bioinfor-matics 19, i302–i304.
Taylor, I., Shields, M., Wang, I. (2003)Resource Management of Triana P2P Services. Grid Resource Management. Kluwer, Netherlands.
Allen, G., Davis, K., Dolkas, K. N., et al. (2003)Enabling Applications on the Grid-A GridLab Overview. SAGE, London.
The GEO600 Team (1994)GEO600: Proposal for a 600 m Laser-Interferomet-ric Gravitational Wave Antenna. MPQ Report 190.
Furmento, N., Lee, W., Mayer, A., et al. (2002)ICENI: An Open Grid Service Architecture Implemented with Jini. Super-Computing, Baltimore.
von Laszewski, G., Amin, A., Hategan, M., et al. (2004)GridAnt:A ClientControllable Grid Workflow System. IEEE CS Press, Los Alamitos, CA.
Amin K, von Laszewski G. (2003) GridAnt: A Grid Workflow System. Manual.http://www.globus.org/cog/projects/gridant/gridant-manual.pdf
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2008 Humana Press, a part of Springer Science+Business Media, LLC
About this protocol
Cite this protocol
Halling-Brown, M., Shepherd, A.J. (2008). Constructing Computational Pipelines. In: Keith, J.M. (eds) Bioinformatics. Methods in Molecular Biology™, vol 453. Humana Press. https://doi.org/10.1007/978-1-60327-429-6_24
Download citation
DOI: https://doi.org/10.1007/978-1-60327-429-6_24
Publisher Name: Humana Press
Print ISBN: 978-1-60327-428-9
Online ISBN: 978-1-60327-429-6
eBook Packages: Springer Protocols