An Overview of the BioExtract Server: A Distributed, Web-Based System for Genomic Analysis

Lushbough, C. M.; Brendel, V. P.

doi:10.1007/978-1-4419-5913-3_41

Part of the book series: Advances in Experimental Medicine and Biology (AEMB, volume 680)

Abstract

Genome research is becoming increasingly dependent on access to multiple, distributed data sources and bioinformatic tools. The importance of integration across distributed databases and Web services will continue to grow as the number of requisite resources expands. Use of bioinformatic workflows has seen considerable growth in recent years as scientific research comes to rely more heavily on the analysis of large sets of data and the use of distributed resources. The BioExtract Server (http://bioextract.org) is a Web-based system designed to aid researchers in the analysis of distributed genomic data by providing a platform to facilitate the creation of bioinformatic workflows. Scientific workflows are created within the system by recording the analytic tasks performed by researchers. These steps may include querying multiple data sources, saving query results as searchable data extracts, and executing local and Web-accessible analytic tools. The series of recorded tasks can be saved as a computational workflow simply by providing a name and description.
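
The record-then-save idea described in the abstract can be illustrated with a minimal sketch. This is not the BioExtract Server's code or API: the class names (Step, Workflow, Recorder), the step labels, and the data source and tool names are all hypothetical, chosen only to show how a series of recorded tasks might become a named workflow.

```python
from dataclasses import dataclass, field


@dataclass
class Step:
    kind: str     # "query", "save_extract", or "run_tool" (labels assumed for illustration)
    params: dict  # free-form parameters captured when the task was performed


@dataclass
class Workflow:
    name: str
    description: str
    steps: list


@dataclass
class Recorder:
    """Hypothetical session recorder; not the BioExtract Server API."""
    steps: list = field(default_factory=list)

    def record(self, kind, **params):
        # Append each analytic task in the order the researcher performs it.
        self.steps.append(Step(kind, params))

    def save_as_workflow(self, name, description):
        # Copy the list so that later recording does not alter the saved workflow.
        return Workflow(name, description, list(self.steps))


session = Recorder()
session.record("query", sources=["source_a", "source_b"], term="example term")
session.record("save_extract", extract_name="my_extract")
session.record("run_tool", tool="alignment_tool", input_extract="my_extract")
workflow = session.save_as_workflow("demo", "Query, extract, then analyze")
```

In this sketch the only information the user supplies at save time is a name and a description, mirroring the abstract's statement; everything else comes from the recorded steps.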

Acknowledgments

The BioExtract Server project is currently supported in part by the National Science Foundation grant DBI-0606909.

Author information

Authors and Affiliations

Department of Computer Science, University of South Dakota, Vermillion, SD, USA
C. M. Lushbough

Corresponding author

Correspondence to C. M. Lushbough.

Editor information

Editors and Affiliations

Dept. Computer Science, University of Georgia, Athens, 30602-7404, Georgia, USA
Hamid R. Arabnia

About this paper

Cite this paper

Lushbough, C.M., Brendel, V.P. (2010). An Overview of the BioExtract Server: A Distributed, Web-Based System for Genomic Analysis. In: Arabnia, H. (ed.) Advances in Computational Biology. Advances in Experimental Medicine and Biology, vol 680. Springer, New York, NY. https://doi.org/10.1007/978-1-4419-5913-3_41

DOI: https://doi.org/10.1007/978-1-4419-5913-3_41
Published: 09 August 2010
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4419-5912-6
Online ISBN: 978-1-4419-5913-3
eBook Packages: Biomedical and Life Sciences (R0)
