Abstract
Recent scientific advances have produced a growing number of services and data sets available for use. These resources, though scattered across disparate locations, are often loosely coupled both semantically and operationally. This loose coupling makes it possible to link operations and data sets together to answer queries. This task, generally known as automatic service composition, hides the process of complex scientific workflow planning from the user. We have been exploring a metadata-driven approach toward automatic service workflow composition, among other enabling mechanisms, in our system, Auspice: Automatic Service Planning in Cloud/Grid Environments. In this paper, we present a complete overview of our system’s distinguishing features and the outlook for future deployment as the Cloud computing paradigm becomes increasingly prominent in enabling scientific computing.
Notes
1. For instance, a series of scientific observations is easily represented by arrays but not relational tables.
2. Other derivation paths may exist within a certain ontology, but for simplicity, we show just one here.
Acknowledgements
This work is supported by NSF grants 0541058, 0619041, and 0833101. The equipment used for the experiments reported here was purchased under grant 0403342.
Copyright information
© 2011 Springer-Verlag London Limited
About this chapter
Cite this chapter
Chiu, D., Agrawal, G. (2011). Auspice: Automatic Service Planning in Cloud/Grid Environments. In: Cafaro, M., Aloisio, G. (eds) Grids, Clouds and Virtualization. Computer Communications and Networks. Springer, London. https://doi.org/10.1007/978-0-85729-049-6_5
DOI: https://doi.org/10.1007/978-0-85729-049-6_5
Publisher Name: Springer, London
Print ISBN: 978-0-85729-048-9
Online ISBN: 978-0-85729-049-6
eBook Packages: Computer Science, Computer Science (R0)