Abstract
Data de-duplication is a process that stores a single copy of each piece of data by eliminating redundant copies and replacing them with references to the existing unique copy. Meanwhile, cloud storage keeps growing because of the large volumes of data generated every day, and users rely on the cloud to store the large amounts of data they hold. Many Internet services, such as blogs and social networks, produce huge amounts of data that may contain considerable redundancy. De-duplication is a way to store and manage such data efficiently. This paper applies a data de-duplication framework in the cloud environment and assesses the compressed storage footprint under two de-duplication strategies: file level and chunk level. Combining de-duplication with compression also improved the compression rate of the storage device. The approach achieves substantial storage savings, and the experiments show that chunk-level de-duplication performs better than file-level de-duplication.
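The difference between the two strategies can be illustrated with a minimal sketch. It is not the framework evaluated in the paper: the fixed-size chunking, the 4096-byte chunk size, SHA-256 fingerprints, zlib compression, and all function names are assumptions chosen for illustration.

```python
import hashlib
import random
import zlib


def file_level_store(files):
    """Keep one compressed copy per distinct file, keyed by its SHA-256 digest."""
    store = {}
    for data in files:
        digest = hashlib.sha256(data).hexdigest()
        if digest not in store:
            store[digest] = zlib.compress(data)
    return store


def chunk_level_store(files, chunk_size=4096):
    """Split files into fixed-size chunks (an assumed scheme) and keep one
    compressed copy per distinct chunk. Returns the store and, per file,
    the ordered list of chunk digests needed to rebuild it."""
    store = {}
    recipes = []
    for data in files:
        recipe = []
        for start in range(0, len(data), chunk_size):
            chunk = data[start:start + chunk_size]
            digest = hashlib.sha256(chunk).hexdigest()
            if digest not in store:
                store[digest] = zlib.compress(chunk)
            recipe.append(digest)
        recipes.append(recipe)
    return store, recipes


def restore(store, recipe):
    """Rebuild a file from its chunk digests."""
    return b"".join(zlib.decompress(store[d]) for d in recipe)


def stored_bytes(store):
    """Total size of the compressed blobs held in a store."""
    return sum(len(blob) for blob in store.values())


def pseudo_random_bytes(seed, n):
    """Deterministic, poorly compressible test data (illustrative only)."""
    rng = random.Random(seed)
    return bytes(rng.getrandbits(8) for _ in range(n))


# Two files share an 8 KiB prefix; a third is an exact copy of the first.
shared = pseudo_random_bytes(0, 8192)
file_a = shared + pseudo_random_bytes(1, 4096)
file_b = shared + pseudo_random_bytes(2, 4096)
files = [file_a, file_b, file_a]

file_store = file_level_store(files)
chunk_store, recipes = chunk_level_store(files)

print("file-level bytes stored: ", stored_bytes(file_store))
print("chunk-level bytes stored:", stored_bytes(chunk_store))
```

In this toy data set, file-level de-duplication only removes the exact copy, whereas chunk-level de-duplication also stores the shared prefix once. Real systems often use content-defined chunking rather than fixed offsets, which this sketch does not attempt.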
Copyright information
© 2015 Springer India
About this paper
Cite this paper
Deivamani, M., Vikraman, R., Abirami, S., Baskaran, R. (2015). Data Storage Optimization in Cloud Environment. In: Suresh, L., Dash, S., Panigrahi, B. (eds) Artificial Intelligence and Evolutionary Algorithms in Engineering Systems. Advances in Intelligent Systems and Computing, vol 325. Springer, New Delhi. https://doi.org/10.1007/978-81-322-2135-7_45
DOI: https://doi.org/10.1007/978-81-322-2135-7_45
Publisher Name: Springer, New Delhi
Print ISBN: 978-81-322-2134-0
Online ISBN: 978-81-322-2135-7
eBook Packages: Engineering, Engineering (R0)