Abstract
Current bioinformatics workflows require the collection of results coming from different tools on several Web sites. High-throughput services integrated through Web Services allow researchers to access a virtual organization by providing large computational and storage resources. There are considerable costs associated with running a high-throughput application including hardware, storage, maintenance, and bandwidth. Moreover, often such tools use biological data banks heterogeneous in the format and semantic, so the task of enabling their composition and cooperation is even more difficult. Researchers are now taking advantage of economies of scale by building large shared systems for bioinformatics processing. Integrating Computational Grids and Web Services technologies can be a key solution to simplify interaction between bioinformatics tools and biological databases. This paper presents a data access service for retrieving and transferring input data coming from heterogeneous data banks to high throughput applications, wrapped as Web Services.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
WfMC. Workflow management coalition reference model. Site address, http://www.wfmc.org/
Foster, I., Kesselman, C.: The Grid: Blueprint for a New Computing Infrastructure. Morgan Kaufmann, San Francisco (1998)
Kreger, H.: Web Services Conceptual Architecture. WSCA 1.0. IBM (2001)
Altschul, Stephen, F., Warren, G., Miller, W., Myers, E.W., Lipman, D.J.: Basic local alignment search tool. J. Mol. Biol. 215, 403–410 (1990)
Boeckmann, B., Bairoch, A., Apweiler, R., Blatter, M., Estreicher, A., Gasteiger, E.E., Martin, M.J., Michoud, K., O’Donovan, C., Phan, I., Pilbout, S., Schneider, M.: The Swiss-Prot protein knowledgebase and its supplement TrEMBL. Nucleic Acids Research 31, 365–370 (2003), Site address: http://www.ebi.ac.uk/swissprot/
Berman, H.M., Westbrook, J., Feng, Z., Gilliland, G., Bhat, T.N., Weissig, H., Shindyalov, I.N., Bourne, P.E.: The Protein Data Bank. Nucleic Acids Research 28, 235–242 (2000), Site address: http://www.rcsb.org/pdb/
Global Grid Forum (GGF), Site address: www.gridforum.org
Liu, D.T., Franklin, M.J.: GridDB: A Data-Centric Overlay for Scientific Grids. Technical Report UCB//CSD-04-1311, 3/16/04, Site address: http://www.cs.berkeley.edu/dtliu/pubs/griddb_tr.pdf
Open Grid Services Architecture Data Access and Integration OGSA-DAI, Site Address: http://www.ogsadai.org.uk/
Aloisio, G., Cafaro, M., Fiore, S., Mirto, M.: The GRelC Project: Towards GRIDDBMS. In: Proceedings of Parallel and Distributed Computing and Networks (PDCN) IASTED, Innsbruck (Austria) February 17-19 (2004), Site address: http://gandalf.unile.it
Aloisio, G., Cafaro, M., Fiore, S., Mirto, M.: ProGenGrid: A Grid Framework for Bioinformatics. In: The Proceeding of International Meeting on Computational Intelligence Methods for Bioinformatics and Biostatistics (CIBB 2004), September 14-15, Perugia, Italy (2004) (to appear)
Foster, I., Kesselman, C.: Globus: A Metacomputing Infrastructure Toolkit. Intl. J. Supercomputer Applications 11(2), 115–128 (1997)
National Center for Biotechnology Information, Site address: http://www.ncbi.nlm.nih.gov/
Swiss Institute of Bioinformatics, Site address: http://www.isb-sib.ch/
EBML-EBI European Bioinformatics Institute, Site address: http://www.ebi.ac.uk/swissprot/
Grillo, G., Licciulli, F., Liuni, S., Sbisa’, E., Pesole, G.: PatSearch: a program for the detection of patterns and structural motifs in nucleotide sequences. Nucleic Acid Research 31(13), 3608–3612 (2003)
Rice, P., Longden, I., Bleasby, A.: EMBOSS: The European Molecular Biology Open Software Suite. Trends in Genetics 16(6), 276–277 (2000), Site address: http://www.ch.embnet.org/EMBOSS/
SRS Network Browser, Site address: http://www.ebi.ac.uk/srs/srsc/
Tuecke, S.: Grid Security Infrastructure (GSI) Roadmap. Internet Draft (2001), Site address: http://www.gridforum.org/security/ggf1_2001-03/drafts/draft-ggf-gsi-roadmap-02.pdf
Aloisio, G., Cafaro, M., Lezzi, D., Van Engelen, R.: Secure Web Services with Globus GSI and gSOAP. In: Kosch, H., Böszörményi, L., Hellwagner, H. (eds.) Euro-Par 2003. LNCS, vol. 2790, pp. 421–426. Springer, Heidelberg (2003)
van Driel, Marc, A., Hekkelman, M.L., Rodriguez, R.: dBlast. A wrapper to run NCBI BLAST parallel/distributed (submitted), Site address: http://www.cmbi.kun.nl/software/dBlast/
BioPerl Project, Chervitz, S.A., Fuellen, G., Dagdigian, C., Resnick, R., Brenner, S.E.: Bioperl: Object-Oriented Perl Modules for Bioinformatics. Objects in Bioinformatics Meeting (1997), Site address: http://bio.perl.org/Projects/Blast/
Mathog, D.R.: Parallel BLAST on split databases. Bioinformatics 19(14), 1865–1866 (2003)
Darling, A., Carey, L., Feng, W.: The Design, Implementation, and Evaluation of mpiBLAST. In: ClusterWorld Conference & Expo in conjunction with the 4th International Conference on Linux Clusters: The HPC Revolution 2003, San Jose, CA (June 2003), Site address: http://mpiblast.lanl.gov/
IBM. Web services flow language - wsfl, Site address: http://www-306.ibm.com/software/solutions/webservices/pdf/WSFL.pdf
IBM. Business process execution language for web services - bpel4ws, Site address: http://www-106.ibm.com/developerworks/webservices/library/ws-bpel/
OMG. Uml - unified modeling language: Extensions for workflow process definition, Site address: http://www.omg.org/uml/
Eshuis, R., Wieringa, R.: Verification support for workflow design with UML activity graphs. In: CSE 2002, Springer, Heidelberg (2002)
Sayle, R.A., Milner-White, E.J.: RasMol: Biomolecular graphics for all. Trends in Biochemical Science (TIBS) 20(9), 374 (1995), Site address: http://www.umass.edu/microbio/rasmol/
North Carolina BioGrid Project, Site address: http://www.ncbiogrid.org/index.html
Foster, I., Kesselman, C., Nick, J., Tuecke, S.: The Physiology of the Grid: An Open Grid Services Architecture for Distributed System Integration. Technical Report for the Globus project (2002), Site address: http://www.globus.org/research/papers/ogsa.pdf
WS - Resource Framework (WSRF), Site address: http://www.globus.org/wsrf/
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Aloisio, G., Cafaro, M., Fiore, S., Mirto, M. (2004). Bioinformatics Data Access Service in the ProGenGrid System. In: Meersman, R., Tari, Z., Corsaro, A. (eds) On the Move to Meaningful Internet Systems 2004: OTM 2004 Workshops. OTM 2004. Lecture Notes in Computer Science, vol 3292. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30470-8_38
Download citation
DOI: https://doi.org/10.1007/978-3-540-30470-8_38
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-23664-1
Online ISBN: 978-3-540-30470-8
eBook Packages: Springer Book Archive