Energy Efficient Deduplication System Exploiting Similarity Information

  • Ho Min Jung
  • Jae Min Park
  • Sun Jeong Kim
  • Young Woong Ko
Part of the Lecture Notes in Electrical Engineering book series (LNEE, volume 179)


In this paper, we propose a file deduplication system that takes file similarity information into account. The key idea is to reduce the energy consumption of a mobile device by minimizing network traffic. The proposed system estimates the duplication ratio of a file using a file similarity algorithm. For a file with high data redundancy, data deduplication is performed during file transfer; otherwise, a conventional file transfer scheme is used without deduplication. We implemented fixed-length chunking and variable-length chunking deduplication algorithms that use file similarity. Experimental results show that the proposed scheme significantly reduces the energy consumption of data deduplication.
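The decision described in the abstract (deduplicate only when a file looks sufficiently redundant) can be illustrated with a minimal sketch. The abstract does not specify the similarity algorithm, so the sketch substitutes a simple stand-in: it samples fixed-length chunks and checks how many of their hashes already exist on the receiver. The chunk size, the threshold, the sampling rate, and all function names are assumptions made for illustration, not values or interfaces from the paper, and the cost of sending chunk hashes is ignored.

```python
import hashlib

CHUNK_SIZE = 4096           # assumed fixed chunk length in bytes
SIMILARITY_THRESHOLD = 0.5  # assumed cut-off for enabling deduplication


def fixed_chunks(data: bytes, size: int = CHUNK_SIZE) -> list:
    """Split data into fixed-length chunks (the last one may be shorter)."""
    return [data[i:i + size] for i in range(0, len(data), size)]


def chunk_hash(chunk: bytes) -> str:
    """Identify a chunk by its SHA-1 digest."""
    return hashlib.sha1(chunk).hexdigest()


def build_index(data: bytes, size: int = CHUNK_SIZE) -> set:
    """Hypothetical receiver-side index: hashes of chunks already stored."""
    return {chunk_hash(c) for c in fixed_chunks(data, size)}


def estimate_duplication_ratio(data: bytes, index: set,
                               sample_every: int = 4,
                               size: int = CHUNK_SIZE) -> float:
    """Stand-in similarity estimate: fraction of sampled chunks already indexed."""
    sampled = fixed_chunks(data, size)[::sample_every]
    if not sampled:
        return 0.0
    hits = sum(1 for c in sampled if chunk_hash(c) in index)
    return hits / len(sampled)


def plan_transfer(data: bytes, index: set,
                  threshold: float = SIMILARITY_THRESHOLD,
                  sample_every: int = 4) -> dict:
    """Choose between plain transfer and deduplicated transfer."""
    ratio = estimate_duplication_ratio(data, index, sample_every)
    if ratio < threshold:
        # Low redundancy: skip deduplication and send the whole file.
        return {"mode": "plain", "ratio": ratio, "bytes_sent": len(data)}
    # High redundancy: send only the chunks the receiver does not have.
    missing = [c for c in fixed_chunks(data) if chunk_hash(c) not in index]
    return {"mode": "dedup", "ratio": ratio,
            "bytes_sent": sum(len(c) for c in missing)}
```

A variable-length variant would replace `fixed_chunks` with a content-defined boundary function while leaving the decision logic unchanged. Sampling keeps the estimate cheap, which matters here because hashing itself consumes energy on a mobile device.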


Keywords: Deduplication, File Similarity, Energy, Chunking, Backup





Copyright information

© Springer Science+Business Media Dordrecht 2012

Authors and Affiliations

  • Ho Min Jung (1)
  • Jae Min Park (1)
  • Sun Jeong Kim (1)
  • Young Woong Ko (1)
  1. Department of Computer Engineering, Hallym University, Chuncheon, Korea
