Bioinformatics Data Access Service in the ProGenGrid System

  • Conference paper

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 3292)

Abstract

Current bioinformatics workflows require collecting results from different tools hosted on several Web sites. High-throughput services integrated through Web Services give researchers access to a virtual organization that provides large computational and storage resources. Running a high-throughput application carries considerable costs, including hardware, storage, maintenance, and bandwidth. Moreover, such tools often use biological data banks that are heterogeneous in format and semantics, which makes enabling their composition and cooperation even more difficult. Researchers are now taking advantage of economies of scale by building large shared systems for bioinformatics processing. Integrating Computational Grid and Web Services technologies can be a key solution for simplifying the interaction between bioinformatics tools and biological databases. This paper presents a data access service for retrieving input data from heterogeneous data banks and transferring it to high-throughput applications wrapped as Web Services.

Copyright information

© 2004 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Aloisio, G., Cafaro, M., Fiore, S., Mirto, M. (2004). Bioinformatics Data Access Service in the ProGenGrid System. In: Meersman, R., Tari, Z., Corsaro, A. (eds) On the Move to Meaningful Internet Systems 2004: OTM 2004 Workshops. OTM 2004. Lecture Notes in Computer Science, vol 3292. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30470-8_38

  • DOI: https://doi.org/10.1007/978-3-540-30470-8_38

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-23664-1

  • Online ISBN: 978-3-540-30470-8

  • eBook Packages: Springer Book Archive
