Abstract
Big Data has been described as a four-dimensional model with Volume, Variety, Velocity, and Veracity. In this paper we discuss the potential of a model-driven approach (MDA) to tackle design issues of Big Data taking into account the effect of the four dimensions. Our approach considers NoSQL graph databases. The approach is applied to the case of Neo4j database. Our main contribution is an MDA methodology that enables to tackle the four V’s dimensions described above. It consists of two major steps: (i) a forward engineering approach based on MDA as well as a set of transformations rules enabling the development of a conceptual, logical, and physical model for big data encompassing the four V’s, (ii) a volume-guided approach supporting the generation of test bases dedicated to performance evaluation. We present an illustrative scenario of our forward engineering approach.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Schroeck, M., Shockley, R., Smart, J., Romero-Morales, D., Tufano, P.: Analytics: the real-world use of big data. IBM Institute for Business Value - Executive Report (2012)
Llewellyn, A.: NASA Tournament Lab’s Big Data Challenge, October 2012. https://open.nasa.gov/blog/2012/10/03/nasa-tournament-labs-big-data-challenge/
Lukoianova, T., Rubin, V.L.: Veracity roadmap: is big data objective, truthful and credible? In: Advances in Classification Research Online, vol. 24(1), pp. 4–15 (2014)
Sänger, J., et al.: Trust and big data: a roadmap for research. In: Database and Expert Systems Applications (DEXA), pp. 278–282, September 2014
Aggarwal, D., Davis, K.C.: Employing graph databases as a standardization model towards addressing heterogeneity. In: 2016 IEEE 17th International Conference on Information Reuse and Integration (IRI), pp. 198–207 (2016)
Daniel, G., Sunyé, G., Cabot, J.: UMLtoGraphDB: mapping conceptual schemas to graph databases. In: Comyn-Wattiau, I., Tanaka, K., Song, I.-Y., Yamamoto, S., Saeki, M. (eds.) ER 2016. LNCS, vol. 9974, pp. 430–444. Springer, Cham (2016). doi:10.1007/978-3-319-46397-1_33
Bugiotti, F., Cabibbo, L., Atzeni, P., Torlone, R.: Database design for NoSQL systems. In: Yu, E., Dobbie, G., Jarke, M., Purao, S. (eds.) ER 2014. LNCS, vol. 8824, pp. 223–231. Springer, Cham (2014). doi:10.1007/978-3-319-12206-9_18
Sevilla Ruiz, D., Morales, S.F., García Molina, J.: Inferring versioned schemas from NoSQL databases and its applications. In: Johannesson, P., Lee, M.L., Liddle, S.W., Opdahl, A.L., López, Ó.P. (eds.) ER 2015. LNCS, vol. 9381, pp. 467–480. Springer, Cham (2015). doi:10.1007/978-3-319-25264-3_35
Boulil, K., Bimonte, S., Pinet, F.: Conceptual model for spatial data cubes: a UML profile and its automatic implementation. Comput. Stand. Interfaces 38, 113–132 (2015)
Cuzzocrea, A., do N. Fidalgo, R.: Enhancing coverage and expressive power of spatial data warehousing modeling: the SDWM approach. In: Cuzzocrea, A., Dayal, U. (eds.) DaWaK 2012. LNCS, vol. 7448, pp. 15–29. Springer, Heidelberg (2012). doi:10.1007/978-3-642-32584-7_2
Toreador Project. http://www.toreador-project.eu/
Boulil, K., Bimonte, S., Pinet, F.: Spatial OLAP integrity constraints: from UML-based specification to automatic implementation: application to energetic data in agriculture. J. Dec. Syst. 23(4), 460–480 (2014)
Curé, O., Hecht, R., Le Duc, C., Lamolle, M.: Data integration over NoSQL stores using access path based mappings. In: Hameurlain, A., Liddle, Stephen W., Schewe, K.-D., Zhou, X. (eds.) DEXA 2011. LNCS, vol. 6860, pp. 481–495. Springer, Heidelberg (2011). doi:10.1007/978-3-642-23088-2_36
Abdelhedi, F., Brahim, A.A., Atigui, F., Zurfluh, G.: Big data and knowledge management: how to implement conceptual models in NoSQL systems? In: Proceedings of the 8th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management, Porto, Portugal, pp. 235–240 (2016)
Prat, N., Akoka, J., Comyn-Wattiau, I.: An MDA approach to knowledge engineering. Expert Syst. Appl. 39(12), 10420–10437 (2012)
Bouhali, R., Laurent, A.: Exploiting RDF open data using NoSQL graph databases. In: Chbeir, R., Manolopoulos, Y., Maglogiannis, I., Alhajj, R. (eds.) AIAI 2015. IAICT, vol. 458, pp. 177–190. Springer, Cham (2015). doi:10.1007/978-3-319-23868-5_13
Zhu, Y., Yan, E., Song, I.Y.: The use of a graph-based system to improve bibliographic information retrieval: system design, implementation, and evaluation. J. Assoc. Inf. Sci. Technol. 68(2), 480–490 (2017)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Akoka, J., Comyn-Wattiau, I., Prat, N. (2017). A Four V’s Design Approach of NoSQL Graph Databases. In: de Cesare, S., Frank, U. (eds) Advances in Conceptual Modeling. ER 2017. Lecture Notes in Computer Science(), vol 10651. Springer, Cham. https://doi.org/10.1007/978-3-319-70625-2_6
Download citation
DOI: https://doi.org/10.1007/978-3-319-70625-2_6
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-70624-5
Online ISBN: 978-3-319-70625-2
eBook Packages: Computer ScienceComputer Science (R0)