Advertisement

HDUMP: A Data Recovery Tool for Hadoop

  • Zhongsheng Li
  • Qiuhong LiEmail author
  • Wei Wang
  • Qitong Wang
  • Fengbin Qi
  • Yimin Liu
  • Peng Wang
Conference paper
  • 2.4k Downloads
Part of the Lecture Notes in Computer Science book series (LNCS, volume 10828)

Abstract

Hadoop is a popular distributed framework for massive data processing. HDFS is the underlying file system of Hadoop. More and more companies use Hadoop as data processing platform. Once Hadoop crashes, the data stored in HDFS can not be accessed directly. We present HDUMP, a light-weight bypassing file system, which aims to recover the data stored in HDFS when Hadoop crashes.

References

  1. 1.
  2. 2.
    Comer, D.: The ubiquitous B-tree. ACM Comput. Surv. 11, 121–137 (1979)CrossRefGoogle Scholar
  3. 3.
    Dean, J., Ghemawat, S.: MapReduce: simplified data processing on large clusters. Commun. ACM 51, 107–113 (2008)CrossRefGoogle Scholar
  4. 4.
    Deshpande, P., Bora, D.: The recovery system for Hadoop cluster. In: The 20th International Conference on Distributed Multimedia Systems: Research Papers on Distributed Multimedia Systems, Distance Education Technologies and Visual Languages and Computing, Pittsburgh, PA, USA, 27–29 August 2014, pp. 416–420 (2014)Google Scholar

Copyright information

© Springer International Publishing AG, part of Springer Nature 2018

Authors and Affiliations

  • Zhongsheng Li
    • 1
  • Qiuhong Li
    • 2
    Email author
  • Wei Wang
    • 2
  • Qitong Wang
    • 2
  • Fengbin Qi
    • 1
  • Yimin Liu
    • 3
  • Peng Wang
    • 2
  1. 1.JiangNan Institute of Computing TechnologyWuxiChina
  2. 2.School of Computer ScienceFudan UniversityShanghaiChina
  3. 3.Third Affiliated Hospital of Second Military Medical UniversityChongqingChina

Personalised recommendations