Abstract
Genome research is becoming increasingly dependent on access to multiple, distributed data sources, and bioinformatic tools. The importance of integration across distributed databases and Web services will continue to grow as the number of requisite resources expands. Use of bioinformatic workflows has seen considerable growth in recent years as scientific research becomes increasingly dependent on the analysis of large sets of data and the use of distributed resources. The BioExtract Server (http://bioextract.org) is a Web-based system designed to aid researchers in the analysis of distributed genomic data by providing a platform to facilitate the creation of bioinformatic workflows. Scientific workflows are created within the system by recording the analytic tasks preformed by researchers. These steps may include querying multiple data sources, saving query results as searchable data extracts, and executing local and Web-accessible analytic tools. The series of recorded tasks can be saved as a computational workflow simply by providing a name and description.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
K. Verdi, H. Ellis, and M. Gryk, Conceptual-level workflow modeling of scientific experiments using NMR as a case study, BMC Bioinformatics, 8:31, 2007
S.F. Altschul, T.L. Madden, A.A. Schäffer, J. Zhang, Z. Zhang, W. Miller, and D.J. Lipman, Gapped BLAST and PSI-BLAST: a new generation of protein database search programs, Nucleic Acids Research, 25(17):3389–3402, 1997
R. Chenna, H. Sugawara, T. Koike, R. Lopez, T.J. Gibson, D.G. Higgins, and J.D. Thompson, Multiple sequence alignment with the Clustal series of programs, Nucleic Acids Research, 31(13):3497–3500, 2003
M.I. Abouelhoda, S. Kurtz, and E. Ohlebusch, The enhanced suffix array and its application to genome analysis, Lecture Notes in Computer Science, 2452:449–463, 2002. http://www.vmatch.de/
E. Deelman and Y. Gil, Workshop on the Challenges of Scientific Workflows; Sponsored by the National Science Foundation, http://vtcpc.isi.edu/wiki/images/3/3a/NSFWorkflowFinal.pdf, May 1–2, 2006
D. De Roure and C. Goble, Software design for empowering scientists, IEEE Software, 26(1):88–95, 2009
D. De Roure, C. Goble, and R. Stevens, The design and realization of the myExperiment Virtual Research Environment for social sharing of workflows, Future Generation Computer Systems, 25(5):561–567, 2009. corrected proof available as: DOI http://dx.doi.org/10.1016/j.future.2008.06.010
D. Hull, K. Wolstencroft, R. Stevens, C. Goble, M. Pocock, P. Li, and T. Oinn, Taverna: a tool for building and running workflows of services, Nucleic Acids Research, 34(Web Server issue):W729–W732, 2006
B. Ludäscher, I. Altintas, C. Berkley, D. Higgins, E. Jaeger, M. Jones, E.A. Lee, J. Tao, and Y. Zhao, Scientific workflow management and the Kepler system, Concurrency and Computation: Practice & Experience, 18(10):1039–1065, 2006
A. Harrison, I. Taylor, I. Wang, and M. Shields, WS-RF workflow in Triana, International Journal of High Performance Computing Applications (IJHPCA), 22(3):268–283, 2008
J. Elhai, A. Taton, J. Massar, J. Myers, M. Travers, J. Casey, M. Slupesky, and J. Shrager, BioBIKE: A Web-based, programmable, integrated biological knowledge base, Nucleic Acids Research, 37(Web Server issue):W28–W32. doi10.1093, 2009
S. Bowers, T. McPhillips, B. Ludäscher, S.Cohen, and S. Davidson, A Model for user-oriented data provenance in pipelined scientific workflows, Lecture Notes in Computer Science, Springer, Berlin, ISBN: 978-3-540-46302-3, pp 133–147
C. Goble, Position statement: musings on provenance, workflow and (semantic web) annotations for bioinformatics, Proceedings of the Workshop on Data Derivation and Provenance, 2002; http://people.cs.uchicago.edu/yongzh/papers/provenance_workshop_3.doc
L. Moreau, B Ludäscher, I. Altintas, R. Barga, S. Bowers, , S. Callahan, G. Chin, B. Clifford, S. Cohen, S. Cohen-Boulakia, S. Davidson, E. Deelman, L. Digiampietri, I. Foster, J. Freire, J. Frew, J. Futrelle, T. Gibson, Y. Gil, C. Goble, J. Golbeck, P. Groth, D. A. Holland, S. Jiang, J. Kim, D. Koop, A. Krenek, T. McPhillips, G. Mehta, S. Miles, D. Metzger, S. Munroe, J. Myers, B. Plale, N. Podhorszki, V. Ratnakar, E. Santos, C. Scheidegger, K. Schuchardt, M. Seltzer, Y. Simmhan, C. Silva, P. Slaughter, E. Stephan, R. Stevens, D. Turi, H. Vo, M. Wilde, J. Zhao, and Y. Zhao, The First Provenance Challenge, Concurrency and Computation: Practice & Experience, 20(5):409–418, 2008
L. Moreau, J. Futrelle, R. McGrath, J. Myers, and P. Pualson, The open provenance model: an overview, Lecture Notes in Computer Science, Springer, Berlin/Heidelberg, ISBN 978-3-540-89964-8, 5272:323–326, 2008
Acknowledgments
The BioExtract Server project is currently supported in part by the National Science Foundation grant DBI-0606909.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer Science+Business Media, LLC
About this paper
Cite this paper
Lushbough, C.M., Brendel, V.P. (2010). An Overview of the BioExtract Server: A Distributed, Web-Based System for Genomic Analysis. In: Arabnia, H. (eds) Advances in Computational Biology. Advances in Experimental Medicine and Biology, vol 680. Springer, New York, NY. https://doi.org/10.1007/978-1-4419-5913-3_41
Download citation
DOI: https://doi.org/10.1007/978-1-4419-5913-3_41
Published:
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4419-5912-6
Online ISBN: 978-1-4419-5913-3
eBook Packages: Biomedical and Life SciencesBiomedical and Life Sciences (R0)