Skip to main content

Computational Grids

Invited Talk

  • Conference paper
  • First Online:
Book cover Vector and Parallel Processing — VECPAR 2000 (VECPAR 2000)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 1981))

Included in the following conference series:

Abstract

In this introductory chapter, we lay the groundwork for the rest of the book by providing a more detailed picture of the expected purpose, shape, and architecture of future grid systems. We structure the chapter in terms of six questions that we believe are central to this discussion: Why do we need computational grids? What types of applications will grids be used for? Who will use grids? How will grids be used? What is involved in building a grid? And, what problems must be solved to make grids commonplace? We provide an overview of each of these issues here, referring to subsequent chapters for more detailed discussion.

This work was supported by the Mathematical, Information, and Computational Sciences Division subprogram of the Office of Advanced Scientific Computing Research, U.S. Department of Energy, under Contract W-31-109-Eng-38.

Reprinted by permission of Morgan Kauffman Publishers from The Grid: Blueprint for a New Computing Infrastructure, I. Foster and C. Kesselman (Eds), 1998

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. D. Abramson, R. Sosic, J. Giddy, and B. Hall. Nimrod: A tool for performing parameterised simulations using distributed workstations. In Proc. 4th IEEE Symp. on High Performance Distributed Computing. IEEE Computer Society Press, 1995.

    Google Scholar 

  2. A. D. Alexandrov, M. Ibel, K. E. Schauser, and C. J. Scheiman. Extending the operating system at the user level: The UFO global file system. In 1997 Annual Technical Conference on UNIX and Advanced Computing Systems (USENIX’97), January 1997.

    Google Scholar 

  3. T. Anderson. Glunix: A global layer Unix for NOW. http://now.cs.berkeley.edu/Glunix/glunix.html.

  4. R. Arpaci, A. Dusseau, A. Vahdat, L. Liu, T. Anderson, and D. Patterson. The interaction of parallel and sequential workloads on a network of workstations. In Proc. SIGMETRICS, 1995.

    Google Scholar 

  5. J. Baldeschwieler, R. Blumofe, and E. Brewer. ATLAS: An infrastructure for global computing. In Proc. Seventh ACM SIGOPS European Workshop on System Support for Worldwide Applications, 1996.

    Google Scholar 

  6. A. Baratloo, M. Karaul, Z. Kedem, and P. Wyckoff. Charlotte: Metacomputing on the Web. In Proc. 9th Conference on Parallel and Distributed Computing Systems, 1996.

    Google Scholar 

  7. K. P. Birman and R. van Rennesse. Reliable Distributed Computing Using the Isis Toolkit. IEEE Computer Society Press, 1994.

    Google Scholar 

  8. Kenneth P. Birman and Robbert van Renesse. Software for reliable networks. Scientific American, May 1996.

    Google Scholar 

  9. Richard Bukowski and Carlo Sequin. Interactive simulation of fire in virtual building environments. In Proceedings of SIGGRAPH 97, 1997.

    Google Scholar 

  10. N. Camiel, S. London, N. Nisan, and O. Regev. The POPCORN project: Distributed computation over the Internet in Java. In Proc. 6th International World Wide Web Conference, 1997.

    Google Scholar 

  11. Henri Casanova and Jack Dongarra. Netsolve: A network server for solving computational science problems. Technical Report CS-95-313, University of Tennessee, November 1995.

    Google Scholar 

  12. J. Casazza. The Development of Electric Power Transmission: The Role Played by Technology, Institutions and People. IEEE Computer Society Press, 1993.

    Google Scholar 

  13. C. Catlett and L. Smarr. Metacomputing. Communications of the ACM, 35(6):44–52, 1992.

    Article  Google Scholar 

  14. A. Chien, S. Pakin, M. Lauria, M. Buchanan, K. Hane, L. Giannini, and J. Prusakova. High performance virtual machines (HPVM): Clusters with supercomputing APIs and performance. In Eighth SIAM Conference on Parallel Processing for Scientific Computing (PP97), March 1997.

    Google Scholar 

  15. B. Christiansen, P. Cappello, M. Ionescu, M. Neary, K. Schauser, and D. Wu. Javelin: Internet-based parallel computing using Java. In Proc. 1997 Workshop on Java in Computational Science and Engineering, 1997.

    Google Scholar 

  16. K. Czajkowski, I. Foster, N. Karonis, C. Kesselman, S. Martin, W. Smith, and S. Tuecke. A resource management architecture for metacomputing systems. In The 4th Workshop on Job Scheduling Strategies for Parallel Processing, 1998.

    Google Scholar 

  17. Joseph Czyzyk, Michael P. Mesnier, and Jorge J. Moré. The Network-Enabled Optimization System (NEOS) Server. Preprint MCS-P615-0996, Argonne National Laboratory, Argonne, Illinois, 1996.

    Google Scholar 

  18. W. Dally. A VLSI Architecture for Concurrent Data Structures. KluwerAcademic Publishers, 1987.

    Google Scholar 

  19. T. DeFanti, I. Foster, M. Papka, R. Stevens, and T. Kuhfuss. Overview of thei-way: Wide area visual supercomputing. International Journal of Supercomputer Applications, 10(2):123–130, 1996.

    Article  Google Scholar 

  20. D. Diachin, L. Freitag, D. Heath, J. Herzog, W. Michels, and P. Plassmann. Remote engineering tools for the design of pollution control systems for commercial boilers. International Journal of Supercomputer Applications, 10(2):208–218, 1996.

    Article  Google Scholar 

  21. F. Douglis and J. Ousterhout. Transparent process migration: Design alternatives and the Sprite implementation. Software Practice and Experience, 21(8):757–85, 1991.

    Article  Google Scholar 

  22. Peter Druschel, Mark B. Abbott, Michael A. Pagels, and Larry L. Peterson. Network subsystem design. IEEE Network, 7(4):8–17, July 1993.

    Article  Google Scholar 

  23. Andrea C. Dusseau, Remzi H. Arpaci, and David E. Culler. Effective distributed scheduling of parallel workloads. In ACM SIGMETRICS’ 96 Conference on the Measurement and Modeling of Computer Systems, 1996.

    Google Scholar 

  24. S. Dwarkadas, P. Keleher, A. Cox, and W. Zwaenepoel. An evaluation of software distributed shared memory for next-generation processors and networks. In Proceedings of the 20th International Symposium on Computer Architecture, San Diego, CA, May 1993.

    Google Scholar 

  25. D. Engler, M. Kaashoek,, and J. O’Toole Jr. Exokernel: An operating system architecture for application-level resource management. In Proceedings of the Fifteenth ACM Symposium on Operating Systems Principles, pages 251–266. ACM Press, 1995.

    Google Scholar 

  26. I. Foster. Designing and Building Parallel Programs. Addison-Wesley, 1995.

    Google Scholar 

  27. I. Foster, J. Geisler, W. Gropp, N. Karonis, E. Lusk, G. Thiruvathukal, and S. Tuecke. A wide-area implementation of the Message Passing Interface. Parallel Computing, 1998. to appear.

    Google Scholar 

  28. Amy Friedlander. In God We Trust All Others Pay Cash: Banking as an American Infrastructure 1800–1935. Corporation for National Research Initiatives, Reston, VA, 199.

    Google Scholar 

  29. Amy Friedlander. Emerging Infrastructure: The Growth of Railroads. Corporation for National Research Initiatives, Reston, VA, 1995.

    Google Scholar 

  30. Amy Friedlander. Natural Monopoly and Universal Service: Telephones and Telegraphs in the U.S. Telecommunications Infrastructure 1837–1940. Corporation for National Research Initiatives, Reston, VA, 1995.

    Google Scholar 

  31. Amy Friedlander. Power and Light: Electricity in the U.S. Energy Infrastructure 1870–1940. Corporation for National Research Initiatives, Reston, VA, 1996.

    Google Scholar 

  32. I. Goldberg, D. Wagner, R. Thomas, and E. Brewer. A secure environment for untrusted helper applications. In Proceedings of the Sixth Usenix Security Symposium, July 1996.

    Google Scholar 

  33. Jr Harold Lockhart. OSF DCE: Guide to Developing Distributed Applications. McGraw Hill, 1994.

    Google Scholar 

  34. Bernardo Huberman, editor. The Ecology of Computation. Elsevier Science Publishers /North-Holland, 1988.

    MATH  Google Scholar 

  35. Van Jacobson. Efficient protocol implementation. In ACM SIGCOMM’ 90 tutorial, September 1990.

    Google Scholar 

  36. JavaSoft. RMI, The JDK 1.1 Specification. http://javasoft.com/products/jdk/1.1/docs/guide/rmi/index.html. 1997.

  37. C. Lee, C. Kesselman, and S. Schwab. Near-realtime satellite image processing: Metacomputing in CC++. IEEE Computer Graphics and Applications, 16(4):79–84, 1996.

    Article  Google Scholar 

  38. Jason Leigh, Andrew Johnson, and Thomas A. DeFanti. CAVERN: A distributed architecture for supporting scalable persistence and interoperability in collaborative virtual environments. Virtual Reality: Research, Development and Applications, 2(2):217–237, December 1997.

    Google Scholar 

  39. A. Lenstra. Factoring integers using theWeb and the number field sieve. Technical report, Bellcore, August 1995.

    Google Scholar 

  40. Michael J. Litzkow, Miron Livny, and Matt W. Mutka. Condor—a hunter of idle workstations. In Proceedings of the 8th International Conference of Distributed Computing Systems, pages 104–111, June 1988.

    Google Scholar 

  41. P. Lyster, L. Bergman, P. Li, D. Stanfill, B. Crippe, R. Blom, C. Pardo, and D. Okaya. CASA gigabit supercomputing network: CALCRUST threedimensional real-time multi-dataset rendering. In Proc. Supercomputing’ 92, 1992.

    Google Scholar 

  42. K. Marzullo, M. Ogg, A. Ricciardi, A. Amoroso, F. Calkins, and E. Rothfus. NILE: Wide-area computing for high energy physics. Proceedings of the 1996 SIGOPS Conference, 1996.

    Google Scholar 

  43. G. McGraw and E. Felten. Java Security: Hostile Applets, Holes and Antidotes. John Wiley and Sons, 1996.

    Google Scholar 

  44. C. Mechoso, C.-C. Ma, J. Farrara, J. Spahr, and R. Moore. Parallelization and distribution of a coupled atmosphere-ocean general circulation model. Mon. Wea. Rev., 121:2062, 1993.

    Article  Google Scholar 

  45. L. E. Moser, P. M. Melliar-Smith, D. A. Agarwal, R. K. Budhia, and C. A. Lingley-Papadopoulos. Totem: A fault-tolerant multicast group communication system. Communications of the ACM, 39(4):54–63, April 1996.

    Article  Google Scholar 

  46. M. Mutka and M. Livny. The available capacity of a privately owned workstation environment. Performance Evaluation, 12(4):269–84, 1991.

    Article  MATH  Google Scholar 

  47. National Research Council. National Collaboratories: Applying Information Technology for Scientific Research. National Academy Press, 1993.

    Google Scholar 

  48. National Research Council. Evolving the High Performance Computing and Communications Initiative to Support the Nation’s Information Infrastructure. National Academy Press, 1995.

    Google Scholar 

  49. National Research Council. More Than Screen Deep: Toward Every-Citizen Interfaces to the Nation’s Information Infrastructure. National Academy Press, 1997.

    Google Scholar 

  50. B. Clifford Neuman and Theodore Ts’o. Kerberos: An authentication service for computer networks. IEEE Communications, 32(9), September 1994.

    Google Scholar 

  51. J. Nieplocha and R. Harrison. Shared memory NUMA programming on the IWAY. In Proc. 5th IEEE Symp. on High Performance Distributed Computing, pages 432–441. IEEE Computer Society Press, 1996.

    Google Scholar 

  52. J. Nieplocha, R. J. Harrison, and R. J. Littlefield. Global Arrays: A portable “shared-memory” programming model for distributed memory computers. In Proceedings of Supercomputing’ 94, pages 340–349. IEEE Computer Society Press, 1994.

    Google Scholar 

  53. M. Norman, P. Beckman, G. Bryan, J. Dubinski, D. Gannon, L. Hernquist, K. Keahey, J. Ostriker, J. Shalf, J. Welling, and S. Yang. Galaxies collide on the I-WAY: An example of heterogeneous wide-area collaborative supercomputing. International Journal of Supercomputer Applications, 10(2):131–140, 1996.

    Google Scholar 

  54. Object Management Group, Inc., Framingham, MA. The Common Object Request Broker Architecture and Specifications, version 2.0 edition, July 1996.

    Google Scholar 

  55. Scott Pakin, Vijay Karamcheti, and Andrew A. Chien. Fast Messages: Efficient, portable communication for workstation clusters and mpps. IEEE Concurrency, 5(2):60–73, April-June 1997.

    Article  Google Scholar 

  56. C. Potter, R. Brady, P. Moran, C. Gregory, B. Carragher, N. Kisseberth, J. Lyding, and J. Lindquist. EVAC: A virtual environment for control of remote imaging instrumentation. IEEE Computer Graphics and Applications, pages 62–66, 1996.

    Google Scholar 

  57. C. Potter, Z-P. Liang, C. Gregory, H. Morris, and P. Lauterbur. Toward a neuroscope: A real-time system for the evaluation of brain function. In Proc. First IEEE Int’l Conf. on Image Processing, volume 3, pages 25–29. IEEE Computer Society Press, 1994.

    Google Scholar 

  58. I Richer and B Fuller. The MAGIC project: From vision to reality. IEEE Network, May/June 1996.

    Google Scholar 

  59. Maria Roussos, Andrew Johnson, Jason Leigh, Christina Valsilakis, Craig Barnes, and Thomas Moher. NICE: Combining constructionism, narrative, and collaboration in a virtual learning environment. Computer Graphics, 31(3):62–63, August 1997.

    Article  Google Scholar 

  60. A. Silberschatz, J. Peterson, and P. Galvin. Operating Systems Concepts. Addison-Wesley, 1991.

    Google Scholar 

  61. Larry Smarr. Computational infrastructure: Toward the 21st century. Communications of the ACM, 40(11), November 1997.

    Google Scholar 

  62. Patrick G. Sobalvarro and William E. Weihl. Demand-based coscheduling of parallel jobs on multiprogrammed multiprocessors. In Proceedings of the Parallel Job Scheduling Workshop at IPPS’ 95, 1995.

    Google Scholar 

  63. W. Sullivan, D. Werthimer, S. Bowyer, J. Cobb, D. Gedye, and D. Anderson. A new major SETI project based on project SERENDIP data and 100,000 personal computers. In Astronomical and Biochemical Origins and the Search for the Life in the Universe, 1997. IAU Colloquium No. 161.

    Google Scholar 

  64. R. Unrau, O. Krieger, B. Gamsa, and M. Stumm. Hierarchical clustering: A structure for scalable multiprocessor operating system design. The Journal of Supercomputing, 9(1/2):105–134, 1995.

    Article  Google Scholar 

  65. A. Vahdat, P. Eastham, and T. Anderson. WebFS: A global cache coherent filesystem. Technical report, Department of Computer Science, UC Berkeley, 1996.

    Google Scholar 

  66. A. Vahdat, P. Eastham, C. Yoshikawa, E. Belani, T. Anderson, D. Culler, and M. Dahlin. WebOS: Operating system services for wide area applications. Technical Report UCB CSD-97-938, U.C. Berkeley, 1997.

    Google Scholar 

  67. R. van Renesse, K. P. Birman, and S. Maffeis. Horus: A flexible group communication system. Communications of the ACM, 39(4):76–83, April 1996.

    Google Scholar 

  68. M. van Steen, P. Homburg, L. van Doorn, A. Tanenbaum, and W. de Jonge. Towards object-based wide area distributed systems. In Proc. International Workshop on Object Orientation in Operating Systems, pages 224–227, 1995.

    Google Scholar 

  69. T. von Eicken, D. Culler, S. Goldstein, and K. Schauser. Active messages: A mechanism for integrated communication and computation. In Proceedings of the 19th International Symposium on Computer Architecture, pages 256–266. ACM Press, May 1992.

    Google Scholar 

  70. R. Wahbe, S. Lucco, T. Anderson, and S. Graham. Efficient software-based fault isolation. In Proc. 14th Symposium on Operating System Principles. 1993.

    Google Scholar 

  71. D. Wallach, D. Balfanz, D. Dean, and E. Felten. Extensible security in Java. Technical Report 546-97, Dept of Computer Science, Princeton University, 1997.

    Google Scholar 

  72. R. Watson and R. Coyne. The parallel I/O architecture of the high performance storage system (HPSS). In 14th IEEE Symposium Mass Storage Systems, Monterey, CA, September 1995. Comp. Soc. Press.

    Google Scholar 

  73. Glen H. Wheless, Cathy M. Lascara, Arnoldo Valle-Levinson, Donald P. Brutzman, William Sherman, William L. Hibbard, and Brian E. Paul. Virtual chesapeake bay: Interacting with a coupled physical/biological model. IEEE Computer Graphics and Applications, 16(4):42–43, July 1996.

    Article  Google Scholar 

  74. S. Zhou. LSF: Load sharing in large-scale heterogeneous distributed systems. In Proc. Workshop on Cluster Computing, 1992.

    Google Scholar 

  75. S. Zhou, M. Stumm, K. Li, and D. Wortmann. Heterogeneous distributed shared memory (Mermaid). IEEE Transactions on Parallel and Distributed Systems, 3(5):540–554, September 1992.

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2001 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Foster, I., Kesselman, C. (2001). Computational Grids. In: Palma, J.M.L.M., Dongarra, J., Hernández, V. (eds) Vector and Parallel Processing — VECPAR 2000. VECPAR 2000. Lecture Notes in Computer Science, vol 1981. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-44942-6_2

Download citation

  • DOI: https://doi.org/10.1007/3-540-44942-6_2

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-41999-0

  • Online ISBN: 978-3-540-44942-3

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics