Constructing Computational Pipelines

  • Protocol
Bioinformatics

Part of the book series: Methods in Molecular Biology™ (MIMB, volume 453)

Abstract

Many bioinformatics tasks involve creating a computational pipeline from existing software components and algorithms. The traditional approach is to glue components together using scripts written in a programming language such as Perl. However, a new, more powerful approach is emerging that promises to revolutionise the way bioinformaticians create applications from existing components, an approach based on the concept of the scientific workflow.

Scientific workflows are created in graphical environments known as workflow management systems. They have many benefits over traditional program scripts, including speed of development, portability, and their suitability for developing complex, distributed applications. This chapter explains how to design and implement bioinformatics workflows using free, Open Source software tools, such as the Taverna workflow management system. We also demonstrate how new and existing tools can be deployed as Web services so that they can be easily integrated into novel computational pipelines using the scientific workflow paradigm.

Copyright information

© 2008 Humana Press, a part of Springer Science+Business Media, LLC

About this protocol

Cite this protocol

Halling-Brown, M., Shepherd, A.J. (2008). Constructing Computational Pipelines. In: Keith, J.M. (eds) Bioinformatics. Methods in Molecular Biology™, vol 453. Humana Press. https://doi.org/10.1007/978-1-60327-429-6_24

  • DOI: https://doi.org/10.1007/978-1-60327-429-6_24

  • Publisher Name: Humana Press

  • Print ISBN: 978-1-60327-428-9

  • Online ISBN: 978-1-60327-429-6

  • eBook Packages: Springer Protocols
