An Overview of the BioExtract Server: A Distributed, Web-Based System for Genomic Analysis

  • Conference paper
Advances in Computational Biology

Part of the book series: Advances in Experimental Medicine and Biology ((AEMB,volume 680))

Abstract

Genome research is becoming increasingly dependent on access to multiple, distributed data sources and bioinformatic tools. The importance of integration across distributed databases and Web services will continue to grow as the number of requisite resources expands. Use of bioinformatic workflows has seen considerable growth in recent years as scientific research becomes increasingly dependent on the analysis of large sets of data and the use of distributed resources. The BioExtract Server (http://bioextract.org) is a Web-based system designed to aid researchers in the analysis of distributed genomic data by providing a platform to facilitate the creation of bioinformatic workflows. Scientific workflows are created within the system by recording the analytic tasks performed by researchers. These steps may include querying multiple data sources, saving query results as searchable data extracts, and executing local and Web-accessible analytic tools. The series of recorded tasks can be saved as a computational workflow simply by providing a name and description.
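
The record-then-save idea described in the abstract can be illustrated with a minimal sketch. This is not the BioExtract Server's API or implementation: the class names (`Recorder`, `RecordedStep`, `Workflow`), the step kinds, and every parameter value below are hypothetical, chosen only to show how a series of recorded tasks might become a named, replayable workflow.

```python
from dataclasses import dataclass, field
from typing import Any, Callable, Dict, List


@dataclass
class RecordedStep:
    """One analytic task captured during a session (hypothetical model)."""
    kind: str                      # assumed kinds: "query", "extract", "tool"
    label: str
    params: Dict[str, Any] = field(default_factory=dict)


@dataclass
class Workflow:
    """A saved series of recorded steps, identified by name and description."""
    name: str
    description: str
    steps: List[RecordedStep]


class Recorder:
    """Hypothetical session recorder; not part of any real BioExtract API."""

    def __init__(self) -> None:
        self._steps: List[RecordedStep] = []

    def record(self, kind: str, label: str, **params: Any) -> None:
        # Append the task in the order the researcher performed it.
        self._steps.append(RecordedStep(kind, label, dict(params)))

    def save_as_workflow(self, name: str, description: str) -> Workflow:
        # Copy the list so later recording does not alter the saved workflow.
        return Workflow(name, description, list(self._steps))


def replay(workflow: Workflow,
           handlers: Dict[str, Callable[[RecordedStep], Any]]) -> List[Any]:
    """Re-run each step by dispatching on its kind, in recorded order."""
    return [handlers[step.kind](step) for step in workflow.steps]


# Illustrative session; source names, search term, and tool name are made up.
session = Recorder()
session.record("query", "search data sources",
               sources=["source_a", "source_b"], term="kinase")
session.record("extract", "save query results", extract_name="kinase_hits")
session.record("tool", "run analytic tool", tool="aligner")

wf = session.save_as_workflow(
    "kinase-survey", "Query two sources, save an extract, run a tool.")

# Stand-in handlers that only report what would be executed.
handlers = {kind: (lambda step: f"{step.kind}: {step.label}")
            for kind in ("query", "extract", "tool")}
replayed = replay(wf, handlers)
```

In this sketch, saving copies the recorded steps, so a workflow stays fixed even if the session continues; a real system would also need to persist inputs and tool versions, which the abstract does not describe.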

Acknowledgments

The BioExtract Server project is currently supported in part by the National Science Foundation grant DBI-0606909.

Author information

Corresponding author

Correspondence to C. M. Lushbough.

Copyright information

© 2010 Springer Science+Business Media, LLC

About this paper

Cite this paper

Lushbough, C.M., Brendel, V.P. (2010). An Overview of the BioExtract Server: A Distributed, Web-Based System for Genomic Analysis. In: Arabnia, H. (eds) Advances in Computational Biology. Advances in Experimental Medicine and Biology, vol 680. Springer, New York, NY. https://doi.org/10.1007/978-1-4419-5913-3_41
