Skip to main content

An Opportunistic Algorithm for Scheduling Workflows on Grids

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 4395))

Abstract

The execution of scientific workflows in Grid environments imposes many challenges due to the dynamic nature of such environments and the characteristics of scientific applications. This work presents an algorithm that dynamically schedules tasks of workflows to Grid sites based on the performance of these sites when running previous jobs from the same workflow. The algorithm captures the dynamic characteristics of Grid environments without the need to probe the remote sites. We evaluated the algorithm running a workflow in the Open Science Grid using twelve sites. The results showed improvements up to 150% relative to other four usual scheduling strategies.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Alhusaini, A.H., Prasanna, V.K., Raghavendra, C.S.: A Unified Resource Scheduling Framework for Heterogeneous Computing Environments. In: HCW, Eighth Heterogeneous Computing Workshop, p. 156 (1999)

    Google Scholar 

  2. Casanova, H., et al.: The AppLeS Parameter Sweep Template: User-Level Middleware for the Grid. In: SuperComputing 2000, Denver, USA (2000)

    Google Scholar 

  3. DagMan, http://www.cs.wisc.edu/condor/dagman/

  4. Cameron, D.G., et al.: Evaluating Scheduling and Replica Optimisation Strategies in OptorSim. In: Proc. of 4th International Workshop on Grid Computing (Grid2003), Phoenix, USA (November 2003)

    Google Scholar 

  5. Cameron, D.G., et al.: Evaluation of an Economic-Based File Replication Strategy for a Data Grid. In: Int. Workshop on Agent Based Cluster and Grid Computing at Int. Symposium on Cluster Computing and the Grid (CCGrid2003), Tokyo, Japan (May 2003)

    Google Scholar 

  6. Deelman, E., et al.: Across Grids Conference 2004, Nicosia, Cyprus (2004)

    Google Scholar 

  7. Deelman, E., et al.: Workflow Management in GriPhyn. In: The Grid Resource Management, Kluwer, Dordrecht (2003)

    Google Scholar 

  8. Dumitrescu, C., Foster, I.: Experiences in Running Workloads over Grid3. In: Zhuge, H., Fox, G.C. (eds.) GCC 2005. LNCS, vol. 3795, pp. 274–286. Springer, Heidelberg (2005)

    Chapter  Google Scholar 

  9. Foster, I., Kesselman, C.: The Grid 2: Blueprint for a New Computing Infrastructure (Chapter 4). Morgan Kaufmann, San Francisco (2004)

    Google Scholar 

  10. Foster, I., et al.: Chimera: A Virtual Data System for Representing, Querying, and Automating Data Derivation. In: 14th International Conference on Scientific and Statistical Database Management (SSDBM 2002), Edinburgh (July 2002)

    Google Scholar 

  11. Foster, I., et al.: The Grid2003 Production Grid: Principles and Practice. In: 13th International Symposium on High Performance Distributed Computing (2004)

    Google Scholar 

  12. Goodale, T., Taylor, I., Wang, I.: Integrating Cactus Simulations within Triana Workflows. In: Proceedings of 13th Annual Mardi Gras Conference - Frontiers of Grid Applications and Technologies, Louisiana State University, February 2005, pp. 47–53 (2005)

    Google Scholar 

  13. Mandal, A., et al.: Scheduling Strategies for Mapping Application Workflows onto the Grid. In: 14th IEEE International Symposium on High-Performance Distributed Computing (HPDC-14), Research Triangle Park, NC, USA, July 2005, IEEE Computer Society Press, Los Alamitos (2005)

    Google Scholar 

  14. Mohamed, H.H., Epema, D.H.J.: An Evaluation of the Close-to-Files Processor and Data Co-Allocation Policy in Multiclusters. In: IEEE International Conference on Cluster Computing, San Diego, USA, September 2004, IEEE Computer Society Press, Los Alamitos (2004)

    Google Scholar 

  15. Oinn, T., Addis, M., Ferris, J., et al.: Taverna: a Tool for the Composition and Enactment of Bioinformatis Workflow. Bioinformatics 20(17), 3045–3054 (2004)

    Article  Google Scholar 

  16. Open Science Grid, http://www.opensciencegrid.org

  17. Ranganathan, K., Foster, I.: Simulation Studies of Computation and Data Scheduling Algorithms for Data Grids. Journal of Grid Computing 1(1) (2003)

    Google Scholar 

  18. Ranganathan, K., Foster, I.: Computation Scheduling and Data Replication Algorithms for Data Grids. In: Nabrzyski, J., Schopf, J., Weglarz, J. (eds.) Grid Resource Management: State of the Art and Future Trends, Kluwer Academic Publishers, Dordrecht (2003)

    Google Scholar 

  19. Shan, H., et al.: Scheduling in Heterogeneous Grid Environments: The Effects of Data Migration. In: International Conference on Advanced Computing and Communication, Gujarat, India (2004)

    Google Scholar 

  20. Singh, G., Kesselman, C., Deelman, E.: Optimizing Grid-Based Workflow Execution. Work submitted to 14th IEEE International Symposium on High Performance Distributing Computing (July 2005)

    Google Scholar 

  21. Thain, D., et al.: Pipeline and Batch Sharing in Grid Workloads. In: 12th Symposium on High Performance Distributing Computing, Seattle (June 2003)

    Google Scholar 

  22. Wieczorek, M., Prodan, R., Fahringer, T.: Scheduling of Scientific Workflows in the ASKALON Grid Environment. SIGMOD Record 34(3) (2005)

    Google Scholar 

  23. Yu, J., Buyya, R.: A Taxonomy of Scientific Workflow Systems for Grid Computing. SIGMOD Record 34(3) (2005)

    Google Scholar 

  24. Zhang, X., Schopf, J.: Performance Analysis of the Globus Toolkit Monitoring and Discovery Service. In: Proceedings of the International Workshop on Middleware Performance (MP 2004), part of the 23rd International Performance Computing and Communications Conference (IPCCC) (April 2004)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Michel Daydé José M. L. M. Palma Álvaro L. G. A. Coutinho Esther Pacitti João Correia Lopes

Rights and permissions

Reprints and permissions

Copyright information

© 2007 Springer Berlin Heidelberg

About this paper

Cite this paper

Meyer, L., Scheftner, D., Vöckler, J., Mattoso, M., Wilde, M., Foster, I. (2007). An Opportunistic Algorithm for Scheduling Workflows on Grids. In: Daydé, M., Palma, J.M.L.M., Coutinho, Á.L.G.A., Pacitti, E., Lopes, J.C. (eds) High Performance Computing for Computational Science - VECPAR 2006. VECPAR 2006. Lecture Notes in Computer Science, vol 4395. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-71351-7_1

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-71351-7_1

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-71350-0

  • Online ISBN: 978-3-540-71351-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics