Abstract
In present scenario of data storage world, selection of storage system is performed on the basis of performance which can be hardware compatibility, cost, response time or application features. In case of big data, data is collected in different format. At the other end, each and every data storage system is not suitable for storing any types of data. Here, the question is, how to prepare a storage system that can be able to store any types of data in the corresponding data storage system. Again, the data format conversion of heterogeneous raw data into a particular format is costly and time consuming. Therefore, it is require to have such data storage mechanism which can store the raw data in the data storage system in their originated data format. In this paper, we proposed an xml based storage model which is going to solve the above mentioned problem. The proposed model makes a decision regarding the allocation of storage resources (or databases) on the basis of data type. Our storage model can customized the storage structure of each and individual storage resource. Collection of data fragmentation and selection of the compatible database are the responsibility of our storage model. Also, the applicability of this storage model is described through an experiment in this paper.
References
Cassandra. http://cassandra.apache.org/
Mongodb. https://www.mongodb.org/
Anderson, J.C., Lehnardt, J., Slater, N.: CouchDB: the Definitive Guide. O’Reilly Media Inc, Sebastopol (2010)
Armbrust, M., Fox, A., Griffith, R., Joseph, A.D., Katz, R., Konwinski, A., Lee, G., Patterson, D., Rabkin, A., Stoica, I., Zaharia, M.: A view of cloud computing. ACM Commun. 53(4), 50–58 (2010)
Chen, M., Mao, S., Liu, Y.: Big data: a survey. Mob. Netw. Appl. 19(2), 171–209 (2014)
Gudivada, V.N., Baeza-Yates, R., Raghavan, V.V.: Big data: promises and problems. IEEE Comput. J. 3, 20–23 (2015)
Kaisler, S., Armour, F., Espinosa, J.A., Money, W.: Big data: issues and challenges moving forward. In: 46th Hawaii International Conference on System Sciences (HICSS 2013) (2013)
Kulkarni, G., Waghmare, R., Palwe, R., Waykule, V., Bankar, H., Koli, K.: Cloud storage architecture. In: 7th International Conference on Telecommunication Systems, Services, and Applications (TSSA 2012), pp. 76–81. IEEE (2012)
Li, Y., Guo, L., Guo, Y.: Cacss: Towards a generic cloud storage service. In: CLOSER (2012)
Mathur, G., Desnoyers, P., Ganesan, D., Shenoy, P.: Capsule: an energy-optimized object storage system for memory-constrained sensor devices. ACM (2006)
Momjian, B.: PostgreSQL: Introduction and Concepts. Addison-Wesley, New York (2001)
Palankar, M.R., Iamnitchi, A., Ripeanu, M., Garfinkel, S.: Amazon s3 for science grids: a viable solution?. In: Proceedings of the 2008 International Workshop on Data-aware Distributed Computing, New York, NY, USA (2008)
Panda, P.R., Catthoor, F., Dutt, N.D., Danckaert, K., Brockmeyer, E., Kulkarni, C., Vandercappelle, A., Kjeldsberg, P.G.: Data and memory optimization techniques for embedded systems. ACM Trans. Des. Autom. Electron. 6(2), 149–206 (2001)
Pepple, K.: Deploying Openstack. O’Reilly Media Inc, Sebastopol (2011)
Ruiz-Alvarez, A., Humphrey, M.: An automated approach to cloud storage service selection. In: Proceedings of the 2nd International Workshop on Scientific Cloud Computing, pp. 39–48. ACM (2011)
Ruiz-Alvarez, A., Humphrey, M.: A model and decision procedure for data storage in cloud computing. In: 12th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid 2012), pp. 572–579. IEEE (2012)
Shvachko, K., Kuang, H., Radia, S., Chansler, R.: The hadoop distributed file system. In: IEEE 26th Symposium on Mass Storage Systems and Technologies (MSST 2010) (2010)
Sidwell, L.P.: System for modifying jcl parameters to optimize data storage allocations (2000)
Spillner, J., Müller, J., Schill, A.: Creating optimal cloud storage systems. Future Gener. Comput. Syst. 29(4), 1062–1072 (2013)
Takahashi, K., Yamamoto, S., Okushi, A., Matsumoto, S., Nakamura, M.: Design and implementation of service api for large-scale house log in smart city cloud. In: IEEE 4th International Conference on Cloud Computing Technology and Science (CloudCom 2012), pp. 815–820 (2012)
Timmaraju, S., Ravi, V., Gangadharan, G.: Ranking of cloud services using opinion mining and multi-attribute decision making. In: Handbook of Research on Advanced Data Mining Techniques and Applications for Business Intelligence, p. 379 (2017)
Vrable, M., Savage, S., Voelker, G.M.: Bluesky: a cloud-backed file system for the enterprise. In: Proceedings of the 10th USENIX Conference on File and Storage Technologies, p. 19. USENIX Association (2012)
Wu, K., Vassileva, J., Zhao, Y.: Understanding users’ intention to switch personal cloud storage services: evidence from the chinese market. Comput. Hum. Behav. 68, 300–314 (2017)
Yamamoto, S., Matsumoto, S., Saiki, S., Nakamura, M.: Materialized view as a service for large-scale house log in smart city. In: IEEE 5th International Conference on Cloud Computing Technology and Science (CloudCom 2013), vol. 2, pp. 311–316 (2013)
Zhang, M., Ranjan, R., Haller, A., Georgakopoulos, D., Menzel, M., Nepal, S.: An ontology-based system for cloud infrastructure services’ discovery. In: 8th International Conference on Collaborative Computing: Networking, Applications and Worksharing (CollaborateCom 2012), pp. 524–530. IEEE (2012)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Sarkar, A., Chattopadhyay, S. (2017). A Storage Model for Handling Big Data Variety. In: Mandal, J., Dutta, P., Mukhopadhyay, S. (eds) Computational Intelligence, Communications, and Business Analytics. CICBA 2017. Communications in Computer and Information Science, vol 775. Springer, Singapore. https://doi.org/10.1007/978-981-10-6427-2_5
Download citation
DOI: https://doi.org/10.1007/978-981-10-6427-2_5
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-6426-5
Online ISBN: 978-981-10-6427-2
eBook Packages: Computer ScienceComputer Science (R0)