Abstract
Over the last decade, social media have dominated our lives. The exploding number of data produced by these platforms triggered a wave of research works that mainly focus on the storage and analysis of this data. In this paper, we propose an original information warehouse architecture for the storage and analysis of social media information. A multidimensional model is defined and the information is extracted, transformed and loaded in the warehouse using ETL (Extract, Transform, Load). The described framework is implemented for Twitter and a data mining analysis is performed on the collected tweets using a clustering algorithm to uncover most discussed topics. The preliminary results are satisfactory and the proposed paradigm can be applied for various information sources such as newspapers and scientific papers.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Khan, J.: Universal information warehouse system and method, 11 May 2004
Holten, R.: Framework and method for information warehouse development processes, pp. 135–163. Physica-Verlag HD, Heidelberg (2000)
Post, A.R., Kurc, T., Cholleti, S., Gao, J., Lin, X., Bornstein, W., Cantrell, D., Levine, D., Hohmann, S., Saltz, J.H.: The analytic information warehouse (AIW): a platform for analytics using electronic health record data. J. Biomed. Inform. 46(3), 410–424 (2013)
Kamal, J., Pasuparthi, K., Rogers, P., Buskirk, J., Mekhjian, H.: Using an information warehouse for clinical trials: a prototype. AMIA Ann. Symp. Proc. 2005, 1004–1004 (2005)
Choo, C.W.: The Knowing Organization: How Organisations Use Information to Construct Meaning, Create Knowledge, and Make Decisions. Oxford University Press, New York (2006)
Rehman, N.U., Mansmann, S., Weiler, A., Scholl, M.H.: Building a data warehouse for twitter stream exploration. In: 2012 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, pp. 1341–1348 (2012)
Aboubi, Y., Drias, H., Kamel, N.: BSO-CLARA: bees swarm optimization for clustering large applications. In: Prasath, R., Vuppala, A., Kathirvalavakumar, T. (eds.) Mining Intelligence and Knowledge Exploration. Lecture Notes in Computer Science, vol. 9468. Springer, Cham (2015)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Moulai, H., Drias, H. (2019). Towards the Paradigm of Information Warehousing: Application to Twitter. In: Demigha, O., Djamaa, B., Amamra, A. (eds) Advances in Computing Systems and Applications. CSA 2018. Lecture Notes in Networks and Systems, vol 50. Springer, Cham. https://doi.org/10.1007/978-3-319-98352-3_16
Download citation
DOI: https://doi.org/10.1007/978-3-319-98352-3_16
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-98351-6
Online ISBN: 978-3-319-98352-3
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)