Skip to main content

Hybrid Approaches for Distributed Storage Systems

  • Conference paper
Data Management in Grid and Peer-to-Peer Systems (Globe 2011)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 6864))

Included in the following conference series:

Abstract

Distributed or peer-to-peer storage solutions rely on the introduction of redundant data to be fault-tolerant and to achieve high reliability. One way to introduce redundancy is by simple replication. This strategy allows an easy and fast access to data, and a good bandwidth efficiency to repair the missing redundancy when a peer leaves or fails in high churn systems.

However, it is known that erasure codes, like Reed-Solomon, are an efficient solution in terms of storage space to obtain high durability when compared to replication.

Recently, the Regenerating Codes were proposed as an improvement of erasure codes to better use the available bandwidth when reconstructing the missing information.

In this work, we compare these codes with two hybrid approaches. The first was already proposed and mixes erasure codes and replication. The second one is a new proposal that we call Double Coding. We compare these approaches with the traditional Reed-Solomon code and also Regenerating Codes from the point of view of availability, durability and storage space. This comparison uses Markov Chain Models that take into account the reconstruction time of the systems.

Partially supported by the INRIA associated team EWIN, by ANR AGAPE, DIMAGREEN, GRATEL, and by Strep EULER.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 54.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 69.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Ahlswede, R., Cai, N., Yen Robert, S., Li, R., Yeung, W., Member, S., Member, S.: Network information flow. IEEE Transactions on Information Theory 46, 1204–1216 (2000)

    Article  MATH  Google Scholar 

  2. Alouf, S., Dandoush, A., Nain, P.: Performance analysis of peer-to-peer storage systems. In: Mason, L.G., Drwiega, T., Yan, J. (eds.) ITC 2007. LNCS, vol. 4516, pp. 642–653. Springer, Heidelberg (2007)

    Chapter  Google Scholar 

  3. Bhagwan, R., Tati, K., Cheng, Y.-C., Savage, S., Voelker, G.M.: Total recall: system support for automated availability management. In: Proceedings of Usenix NSDI, Berkeley, CA, USA, pp. 25–25 (2004)

    Google Scholar 

  4. Bolosky, W.J., Douceur, J.R., Ely, D., Theimer, M.: Feasibility of a serverless distributed file system deployed on an existing set of desktop pcs. SIGMETRICS Performance Evalualtion Rev. 28(1), 34–43 (2000)

    Article  Google Scholar 

  5. Chun, B.-G., Dabek, F., Haeberlen, A., Sit, E., Weatherspoon, H., Kaashoek, M.F., Kubiatowicz, J., Morris, R.: Efficient replica maintenance for distributed storage systems. In: Proceedings of Usenix NSDI, Berkeley, USA, pp. 45–58 (2006)

    Google Scholar 

  6. Dalle, O., Giroire, F., Monteiro, J., Pérennes, S.: Analysis of failure correlation impact on peer-to-peer storage systems. In: Proceedings of IEEE P2P, pp. 184–193 (September 2009)

    Google Scholar 

  7. Datta, A., Aberer, K.: Internet-scale storage systems under churn – a study of the steady-state using markov models. In: Proceedings of IEEE P2P, pp. 133–144. IEEE Computer Society, Los Alamitos (2006)

    Google Scholar 

  8. Dimakis, A., Godfrey, P., Wainwright, M., Ramchandran, K.: Network coding for distributed storage systems. In: Proceedings of IEEE INFOCOM, pp. 2000–2008 (May 2007)

    Google Scholar 

  9. Dimakis, A.G., Ramchandran, K., Wu, Y., Suh, C.: A survey on network codes for distributed storage. CoRR, abs/1004.4438 (2010)

    Google Scholar 

  10. Duminuco, A., Biersack, E.: A practical study of regenerating codes for peer-to-peer backup systems. In: ICDCS 2009: Proceedings of the 2009 29th IEEE International Conference on Distributed Computing Systems, pp. 376–384. IEEE Computer Society, Washington, DC, USA (2009)

    Chapter  Google Scholar 

  11. Kubiatowicz, J., Bindel, D., Chen, Y., Czerwinski, S., Eaton, P., Geels, D., Gummadi, R., Rhea, S., Weatherspoon, H., Wells, C., et al.: OceanStore: an architecture for global-scale persistent storage. ACM SIGARCH Computer Architecture News 28(5), 190–201 (2000)

    Article  Google Scholar 

  12. Lin, W., Chiu, D., Lee, Y.: Erasure code replication revisited. In: Proceedings of IEEE P2P, pp. 90–97 (2004)

    Google Scholar 

  13. Rabin, M.O.: Efficient dispersal of information for security, load balancing, and fault tolerance. Journal of ACM 36(2), 335–348 (1989)

    Article  MathSciNet  MATH  Google Scholar 

  14. Ramabhadran, S., Pasquale, J.: Analysis of long-running replicated systems. In: Proceedings of IEEE INFOCOM, pp. 1–9 (April 2006)

    Google Scholar 

  15. Rashmi, K.V., Shah, N.B., Kumar, P.V., Ramchandran, K.: Explicit construction of optimal exact regenerating codes for distributed storage. In: Allerton 2009: Proceedings of the 47th annual Allerton conference on Communication, control, and computing, Piscataway, NJ, USA, pp. 1243–1249. IEEE Press, Los Alamitos (2009)

    Chapter  Google Scholar 

  16. Reed, I.S., Solomon, G.: Polynomial codes over certain finite fields. Journal of the Society for Industrial and Applied Mathematics 8(2), 300–304 (1960)

    Article  MathSciNet  MATH  Google Scholar 

  17. Rodrigues, R., Liskov, B.: High availability in dhts: Erasure coding vs. replication. In: van Renesse, R. (ed.) IPTPS 2005. LNCS, vol. 3640, pp. 226–239. Springer, Heidelberg (2005)

    Chapter  Google Scholar 

  18. Weatherspoon, H., Kubiatowicz, J.: Erasure coding vs. replication: A quantitative comparison. In: Druschel, P., Kaashoek, M.F., Rowstron, A. (eds.) IPTPS 2002. LNCS, vol. 2429, pp. 328–338. Springer, Heidelberg (2002)

    Chapter  Google Scholar 

  19. Wu, Y., Dimakis, R., Ramchandran, K.: Deterministic regenerating codes for distributed storage. In: Allerton 2009: Proceedings of the 47th annual Allerton conference on Communication, control, and computing, Piscataway, NJ, USA. IEEE Press, Los Alamitos (2009)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2011 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Araujo, J., Giroire, F., Monteiro, J. (2011). Hybrid Approaches for Distributed Storage Systems. In: Hameurlain, A., Tjoa, A.M. (eds) Data Management in Grid and Peer-to-Peer Systems. Globe 2011. Lecture Notes in Computer Science, vol 6864. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-22947-3_1

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-22947-3_1

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-22946-6

  • Online ISBN: 978-3-642-22947-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics