Abstract
Several recent studies suggest that co-allocation techniques can improve user performance for distributed data retrieval in replicated grid systems. These studies demonstrate that co-allocation techniques can improve network bandwidth and network transfer times by concurrently utilizing as many data grid replicas as possible. However, these prior studies evaluate their techniques from a single user’s perspective and overlook evaluations of system wide performance when multiple users are using co-allocation techniques. In our study, we provide multi-user evaluations of a co-allocation technique for replicated data in a controlled grid environment. We find that co-allocation works well under low-load conditions when there are only a few users using co-allocation. However, co-allocation works very poorly for medium and high-load conditions since the response time for co-allocating users grows rapidly as the number of grid users increases. The decreased performance for co-allocating users can be directly attributed to the increased workload that their greedy retrieval technique places on the replicas in the grid. Overall, we determine that uninformed, blind utilization of greedy co-allocation techniques by multiple users is detrimental to global system performance.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Minoli, D.: A Networking Approach to Grid Computing. Wiley Press, Chichester (2005)
Nicholson, C., Cameron, D.G., Doyle, A.T., Millar, A.P., Stockinger, K.: Dynamic data replication in lcg 2008. In: UK e-Science All Hands Conference, Nottingham (2006)
Lamehamedi, H., Szymanski, B., Shentu, Z., Deelman, E.: Data replication strategies in grid environments. In: ICA3PP (2002)
Ranganathan, K., Foster, I.T.: Identifying dynamic replication strategies for a high-performance data grid. In: GRID, pp. 75–86 (2001)
Slota, R., Nikolow, D., Skital, L., Kitowski, J.: Implementation of replication methods in the grid environment. Advances in Grid Computing, 474–484 (2005)
Allcock, W., Bester, J., Bresnahan, J., Chervenak, A., Foster, I., Kesselman, C., Meder, S., Nefedova, V., Quesnel, D., Tuecke, S.: Secure, efficient data transport and replica management for high-performance data-intensive computing. In: IEEE Mass Storage (2001)
Allcock, W., Bester, J., Bresnahan, J., Chervenak, A., Foster, I., Kesselman, C., Meder, S., Nefedova, V., Quesnel, D., Tuecke, S.: Data management and transfer in high performance computational grid environments. Parallel Computing Journal 28, 749–771 (2002)
Yang, C.T., Yang, I.H., Chen, C.H., Wang, S.Y.: Implementation of a dynamic adjustment mechanism with efficient replica selection in data grid environments. In: SAC (2006)
Wolski, R., Spring, N.T., Hayes, J.: The network weather service. Future Gener. Comput. Syst. 15, 757–768 (1999)
Farley, M.: Storage Networking Fundamentals: An Introduction to Storage Devices, Subsystems, Applications, Management, and Filing Systems. Cisco Press (2004)
Vazhkudai, S., Schopf, J.M.: Using disk throughput data in predictions of end-to-end grid data transfers. In: GRID, pp. 291–304 (2002)
Gray, J., Shenoy, P.: Rules of thumb in data engineering. In: IEEE International Conference on Data Engineering, April 2000, pp. 3–12 (2000)
Gilder, G.: Fiber keeps its promise. Forbes (April 7, 1997)
Chervenak, A., Foster, I., Kesselman, C., Salisbury, C., Tuecke, S.: The data grid: Towards an architecture for the distributed management and analysis of large scientific datasets. Journal of Network and Computer Applications, 187–200 (2001)
Venugopal, S., Buyya, R., Ramamohanarao, K.: A taxonomy of data grids. ACM Comput. Surv. 38, 3 (2006)
DiStefano, M.: Distributed Data Management for Grid Computing. John Wiley and Sons, Inc., Chichester (2005)
Vazhkudai, S.: Enabling the co-allocation of grid data transfers. In: GRID (2003)
Vazhkudai, S.: Distributed downloads of bulk, replicated grid data. Journal of Grid Computing 2, 31–42 (2004)
Feng, J., Humphrey, M.: Eliminating replica selection - using multiple replicas to accelerate data transfer on grids. In: ICPADS, p. 359 (2004)
Chervenak, A., Deelman, E., Foster, I., Guy, L., Hoschek, W., Iamnitchi, A., Kesselman, C., Kunszt, P., Ripeanu, M., Schwartzkopf, B., Stockinger, H., Stockinger, K., Tierney, B.: Giggle: a framework for constructing scalable replica location services. Supercomputing, 1–17 (2002)
Bresnahan, J., Link, M., Khanna, G., Imani, Z., Kettimuthu, R., Foster, I.: Globus gridftp: What’s new in 2007. In: GridNets (2007)
Zhou, X., Kim, E., Kim, J.W., Yeom, H.Y.: Recon: A fast and reliable replica retrieval service for the data grid. In: CCGRID, pp. 446–453 (2006)
Yang, C.T., Chi, Y.C., Fu, C.P.: Redundant parallel file transfer with anticipative adjustment mechanism in data grids. Journal of Information Technology and Applications (2007)
Yang, C.T., Yang, I.H., Li, K.C., Wang, S.Y.: Improvements on dynamic adjustment mechanism in co-allocation data grid environments. The Journal of Supercomputing (2007)
Buyya, R., Murshed, M.: Gridsim: A toolkit for the modeling and simulation of scheduling for grid computing. In: CCPE (2002)
Sulistio, A., Poduval, G., Buyya, R., Tham, C.K.: On incorporating differentiated levels of network service into gridsim. Future Gen. Computer Systems (2007)
Sulistio, A., Cibej, U., Robic, B., Buyya, R.: A tool for modelling and simulation of data grids. Technical Report GRIDS-TR-2005-13, Grid Computing Laboratory, University of Melbourne (2005)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Villa, A.H., Varki, E. (2008). Co-allocation in Data Grids: A Global, Multi-user Perspective. In: Wu, S., Yang, L.T., Xu, T.L. (eds) Advances in Grid and Pervasive Computing. GPC 2008. Lecture Notes in Computer Science, vol 5036. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-68083-3_17
Download citation
DOI: https://doi.org/10.1007/978-3-540-68083-3_17
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-68081-9
Online ISBN: 978-3-540-68083-3
eBook Packages: Computer ScienceComputer Science (R0)