Data intensive distributed computing: A medical application example
Modern scientific computing involves organizing, moving, visualizing, and analyzing massive amounts of data from around the world, as well as employing large-scale computation. The distributed systems that solve large-scale problems will always involve aggregating and scheduling many resources. Data must be located and staged, cache and network capacity must be available at the same time as computing capacity, etc. Every aspect of such a system is dynamic: locating and scheduling resources, adapting running application systems to availability and congestion in the middleware and infrastructure, responding to human interaction, etc. The technologies, the middleware services, and the architectures that are used to build useful high-speed, wide area distributed systems, constitute the field of data intensive computing. This paper explores some of the history and future directions of that field, and describes a specific medical application example.
Unable to display preview. Download preview PDF.
- DPSS, “The Distributed Parallel Storage System”, http://www-didc.lbl.gov/DPSS/Google Scholar
- Globus, “The Globus Project”, http://www.globus.org/Google Scholar
- Greiman, W., W. E. Johnston, C. McParland, D. Olson, B. Tierney, C. Tull, “High-Speed Distributed Data Handling for HENP”, Computing in High Energy Physics, April, 1997. Berlin, Germany, http://www-itg.lbl.gov/STAR/Google Scholar
- Grimshaw, A., a. Ferrari, G. Lindahl, K. Holcomb, “Metasystems” Communications of the ACM, November, 1998, Volume 41, no 11Google Scholar
- Foster, I., C. Kesselman, eds., “The Grid: Blueprint for a New Computing Infrastructure”, Morgan Kaufmann, publisher. August, 1998.Google Scholar
- B. Fuller and I. Richer “The MAGIC Project: From Vision to Reality”, IEEE Network, May, 1996, Vol. 10, no. 3. //www.magic.net/Google Scholar
- NTON, “National Transparent Optical Network Consortium”. See http://www.ntonc.org/.Google Scholar
- Johnston, W., G. Jin, C. Larsen J. Lee, G. Hoo, M. Thompson, B. Tierney, J. Terdiman, “Real-Time Generation and Cataloguing of Large Data-Objects in Widely Distributed Environments”, International Journal of Digital Libraries— Special Issue on “Digital Libraries in Medicine”. November, 1997. (Available at http://www-itg.lbl.gov/WALDO/)Google Scholar
- Thompson, M., W. Johnston, J. Guojun, J. Lee, B. Tierney, and J. F. Terdiman, “Distributed health care imaging information systems”, PACS Design and Evaluation: Engineering and Clinical Issues, SPIE Medical Imaging 1997. (Available at http://www-itg.lbl.gov/Kaiser.IMG)Google Scholar
- Tierney, B., W. Johnston, B. Crowley, G. Hoo, C. Brooks, D. Gunter, “The Net-Logger Methodology for High Performance Distributed Systems Performance Analysis”, Seventh IEEE International Symposium on High Performance Distributed Computing, Chicago, Ill., July 28–31, 1998. Available at http://www-itg.lbl.gov/DPSS/papers.html.Google Scholar
- Tierney, B., W. Johnston, J. Lee, and G. Hoo, “Performance Analysis in High-Speed Wide Area ATM Networks: Top-to-bottom end-to-end Monitoring”, IEEE Networking, May 1996. (Available at http://www-itg.lbl.gov/DPSS/papers.)Google Scholar