Enhanced Data Replication Broker

  • Rafah M. Almuttairi
  • Rajeev Wankar
  • Atul Negi
  • Chillarige Raghavendra Rao
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 7080)


Data Replication Broker is one of the most important components in data grid architecture as it reduces latencies related to file access and file transfers (replica). Thus it enhances performance since it avoids single site congestion by the numerous requesters. To facilitate access and transfer of the data sets, replicas of data are distributed across multiple sites. The effectiveness of a replica selection strategy in data replication broker depends on its ability to serve the requirement posed by the users’ jobs or grid application. Most jobs are required to be executed at a specific execution time. To achieve the QoS perceived by the users, response time metrics should take into account a replica selection strategy. Total execution time needs to factor latencies due to network transfer rates and latencies due to search and location. Network resources affect the speed of moving the required data and searching methods can reduce scope for replica selection. In this paper we propose an approach that extends the data replication broker with policies that factor in user quality of service by reducing time costs when transferring data. The extended broker uses a replica selection strategy called Efficient Set Technique (EST) that adapts its criteria dynamically so as to best approximate application providers’ and clients’ requirements. A realistic model of the data grid was created to simulate and explore the performance of the proposed model. The policy displayed an effective means of improving the performance of the network traffic and is indicated by the improvement of speed and cost of transfers by brokers.


Data Grid Replica Selection technique Association Rules Broker 


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    Buyya, R., Venugopal, S.: The Gridbus toolkit for service oriented grid and utility computing: an overview and status report. In: 1st IEEE International Workshop Grid Economics and Business Models, GECON 2004, pp. 19–66, 23 (April 2004)Google Scholar
  2. 2.
    Abbas, A.: Grid Computing: A Practical Guide to Technology and Applications (2006)Google Scholar
  3. 3.
    Venugopal, S., Buyya, R., Ramamohanarao, K.: A taxonomy of Data Grids for distributed data sharing, managment, and proce. ACM Comput. Surv. 38(1), Article 3 (June 2006)Google Scholar
  4. 4.
    Vazhkudai, S., Tuecke, S., Foster, I.: Replica selection in the globus data grid. In: First IEEE /ACM Int. Conf. on Cluster Computing and the Grid, CCGrid 2001 (2001)Google Scholar
  5. 5.
    Rahman, R.M., Barker, K., Alhajj, R.: Replica selection strategies in data grid. Journal of Parallel and Dis. Computing 68(12), 1561–1574 (2008)CrossRefzbMATHGoogle Scholar
  6. 6.
    Almuttairi, R.M., Wankar, R., Negi, A., Rao, C.R., Almahna, M.S.: New replica selection technique for binding replica sites. In: 2010 1st International Conference on Data Grids, Energy, Power and Control (EPC-IQ), pp. 187–194 (December 2010)Google Scholar
  7. 7.
  8. 8.
    Lin, H., Abawajy, J., Buyya, R.: Economy-Based Data Replication Broker. In: Proceedings of the 2nd IEEE Int’l Con. on E-Science and Grid Computing (E-Science 2006), Amsterdam, Netherlands. IEEE CS Press, Los Alamitos (2006)Google Scholar
  9. 9.
    Earl, A.D., Menken, H.L.: Supporting the challenge of LHC produced data with ScotGrid, The University of Edinburgh CERN-THESIS-2006-014 (April 2006)Google Scholar
  10. 10.
    Tirumala, A., Ferguson, J.: Iperf 1.2 - The TCP/UDP Bandwidth Measurement Tool (2002)Google Scholar
  11. 11.
  12. 12.
    Almuttari, R.M., Wankar, R., Negi, A., Rao, C.R.: Intelligent Replica Selection Strategy for Data Grid. In: Proceeding of the 10th Int. Conf. on Parallel and Distributed Proceeding Techniques and Applications, LasVegas, USA, vol. 3, pp. 95–100 (July 2010)Google Scholar
  13. 13.
    Matsunaga, H., Isobe, T., Mashimo, T., Sakamoto, H., Ueda, I.: Data transfer over the wide area network with large round trip time. In: IOP Science, 17th Int. Conf. in High Energy and Nuclear Physics (2010)Google Scholar
  14. 14.
    Wang, J., Huang, L.: Intelligent File Transfer Protocol for Grid Environment. In: Current Trends in High Performance Computing and Its Applications, Part II, pp. 469–476 (2005), doi:10.1007/3-540-27912-1_63Google Scholar
  15. 15.
    Kavitha, R., Foster, I.: Design and evaluation of replication strategies for a high performance data grid. In: Proceedings of Computing and High Energy and Nuclear Physics (2001)Google Scholar
  16. 16.
    Ceryen, T., Kevin, M.: Performance characterization of decentralized algorithms for replica selection in distributed object systems. In: Proceedings of 5th International Workshop on Software and Performance, Palma, de Mallorca, Spain, July 11 -14, pp. 257–262 (2005)Google Scholar
  17. 17.
    Almuttari, R.M., Wankar, R., Negi, A., Rao, C.R.: Smart Replica Selection for Data Grids Using Rough Set Approximations (RSDG). In: CICN, pp. 466–471 (November 2010)Google Scholar
  18. 18.
    Almuttari, R.M., Wankar, R., Negi, A., Rao, C.R.: Rough set clustering approach to replica selection in data grids (RSCDG). In: ISDA 2010, pp. 1195–1200 (November 2010)Google Scholar
  19. 19.
    Almuttari, R.M., Wankar, R., Negi, A., Rao, C.R.: Replica Selection in Data Grids Using Preconditioning of Decision Attributes by K-means Clustering (K-RSDG). In: Information Technology for Real World Problems (VCON), pp. 18–23 (December 2010)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2011

Authors and Affiliations

  • Rafah M. Almuttairi
    • 1
  • Rajeev Wankar
    • 1
  • Atul Negi
    • 1
  • Chillarige Raghavendra Rao
    • 1
  1. 1.DCISUniversity of HyderabadHyderabadIndia

Personalised recommendations