Abstract
The execution of scientific workflows in Grid environments imposes many challenges due to the dynamic nature of such environments and the characteristics of scientific applications. This work presents an algorithm that dynamically schedules tasks of workflows to Grid sites based on the performance of these sites when running previous jobs from the same workflow. The algorithm captures the dynamic characteristics of Grid environments without the need to probe the remote sites. We evaluated the algorithm running a workflow in the Open Science Grid using twelve sites. The results showed improvements up to 150% relative to other four usual scheduling strategies.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Alhusaini, A.H., Prasanna, V.K., Raghavendra, C.S.: A Unified Resource Scheduling Framework for Heterogeneous Computing Environments. In: HCW, Eighth Heterogeneous Computing Workshop, p. 156 (1999)
Casanova, H., et al.: The AppLeS Parameter Sweep Template: User-Level Middleware for the Grid. In: SuperComputing 2000, Denver, USA (2000)
Cameron, D.G., et al.: Evaluating Scheduling and Replica Optimisation Strategies in OptorSim. In: Proc. of 4th International Workshop on Grid Computing (Grid2003), Phoenix, USA (November 2003)
Cameron, D.G., et al.: Evaluation of an Economic-Based File Replication Strategy for a Data Grid. In: Int. Workshop on Agent Based Cluster and Grid Computing at Int. Symposium on Cluster Computing and the Grid (CCGrid2003), Tokyo, Japan (May 2003)
Deelman, E., et al.: Across Grids Conference 2004, Nicosia, Cyprus (2004)
Deelman, E., et al.: Workflow Management in GriPhyn. In: The Grid Resource Management, Kluwer, Dordrecht (2003)
Dumitrescu, C., Foster, I.: Experiences in Running Workloads over Grid3. In: Zhuge, H., Fox, G.C. (eds.) GCC 2005. LNCS, vol. 3795, pp. 274–286. Springer, Heidelberg (2005)
Foster, I., Kesselman, C.: The Grid 2: Blueprint for a New Computing Infrastructure (Chapter 4). Morgan Kaufmann, San Francisco (2004)
Foster, I., et al.: Chimera: A Virtual Data System for Representing, Querying, and Automating Data Derivation. In: 14th International Conference on Scientific and Statistical Database Management (SSDBM 2002), Edinburgh (July 2002)
Foster, I., et al.: The Grid2003 Production Grid: Principles and Practice. In: 13th International Symposium on High Performance Distributed Computing (2004)
Goodale, T., Taylor, I., Wang, I.: Integrating Cactus Simulations within Triana Workflows. In: Proceedings of 13th Annual Mardi Gras Conference - Frontiers of Grid Applications and Technologies, Louisiana State University, February 2005, pp. 47–53 (2005)
Mandal, A., et al.: Scheduling Strategies for Mapping Application Workflows onto the Grid. In: 14th IEEE International Symposium on High-Performance Distributed Computing (HPDC-14), Research Triangle Park, NC, USA, July 2005, IEEE Computer Society Press, Los Alamitos (2005)
Mohamed, H.H., Epema, D.H.J.: An Evaluation of the Close-to-Files Processor and Data Co-Allocation Policy in Multiclusters. In: IEEE International Conference on Cluster Computing, San Diego, USA, September 2004, IEEE Computer Society Press, Los Alamitos (2004)
Oinn, T., Addis, M., Ferris, J., et al.: Taverna: a Tool for the Composition and Enactment of Bioinformatis Workflow. Bioinformatics 20(17), 3045–3054 (2004)
Open Science Grid, http://www.opensciencegrid.org
Ranganathan, K., Foster, I.: Simulation Studies of Computation and Data Scheduling Algorithms for Data Grids. Journal of Grid Computing 1(1) (2003)
Ranganathan, K., Foster, I.: Computation Scheduling and Data Replication Algorithms for Data Grids. In: Nabrzyski, J., Schopf, J., Weglarz, J. (eds.) Grid Resource Management: State of the Art and Future Trends, Kluwer Academic Publishers, Dordrecht (2003)
Shan, H., et al.: Scheduling in Heterogeneous Grid Environments: The Effects of Data Migration. In: International Conference on Advanced Computing and Communication, Gujarat, India (2004)
Singh, G., Kesselman, C., Deelman, E.: Optimizing Grid-Based Workflow Execution. Work submitted to 14th IEEE International Symposium on High Performance Distributing Computing (July 2005)
Thain, D., et al.: Pipeline and Batch Sharing in Grid Workloads. In: 12th Symposium on High Performance Distributing Computing, Seattle (June 2003)
Wieczorek, M., Prodan, R., Fahringer, T.: Scheduling of Scientific Workflows in the ASKALON Grid Environment. SIGMOD Record 34(3) (2005)
Yu, J., Buyya, R.: A Taxonomy of Scientific Workflow Systems for Grid Computing. SIGMOD Record 34(3) (2005)
Zhang, X., Schopf, J.: Performance Analysis of the Globus Toolkit Monitoring and Discovery Service. In: Proceedings of the International Workshop on Middleware Performance (MP 2004), part of the 23rd International Performance Computing and Communications Conference (IPCCC) (April 2004)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2007 Springer Berlin Heidelberg
About this paper
Cite this paper
Meyer, L., Scheftner, D., Vöckler, J., Mattoso, M., Wilde, M., Foster, I. (2007). An Opportunistic Algorithm for Scheduling Workflows on Grids. In: Daydé, M., Palma, J.M.L.M., Coutinho, Á.L.G.A., Pacitti, E., Lopes, J.C. (eds) High Performance Computing for Computational Science - VECPAR 2006. VECPAR 2006. Lecture Notes in Computer Science, vol 4395. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-71351-7_1
Download citation
DOI: https://doi.org/10.1007/978-3-540-71351-7_1
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-71350-0
Online ISBN: 978-3-540-71351-7
eBook Packages: Computer ScienceComputer Science (R0)