Abstract
In this work, we present a hybrid miner-network analyzer (HMNA) system includes three main stages; the first stage called preparing and preprocessing stage that includes building initial network from citation file and Find Keywords through apply Rake and cleaning. The second stage involves building classification model including parameters detection and apply LDA for find topics, final stage, add the topics of document into initial network to construction multi-layer network, where each level represent community related of that topic. We can summarize the main points of HMNA system as: (i) It deals with real, complex, huge database of papers ‘citation’. (ii) The preprocessing stage involves retrieve keywords from corpus using Rake after add constructions on it and cleaning without using any feature selection method. (iii) It building digital corpus that combines with dictionary to clustering the clean text into multi groups based on LDA model. (iv) It reconstructed the initial network by add the labeling of topic results from above step to it. (v) It builds multi communities (multi-level network), each level in that network represent single communities, (vi) It computes the main characteristics measures for each level or communities to determine the similarity in it structure with cluster in citation network.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Zanin, M.: Complex networks and data mining: toward a new perspective for the understanding of Complex Systems (2014)
Ali, S.H.: A novel tool (FP-KC) for handle the three main dimensions reduction and association rule mining. In: IEEE, 2012 6th International Conference on Sciences of Electronics, Technologies of Information and Telecommunications (SETIT), Sousse, pp. 951–961 (2012). https://doi.org/10.1109/SETIT.2012.6482042, http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6482042&isnumber=6481878
Ali, S.H.: Miner for OACCR: Case of medical data analysis in knowledge discovery. In: IEEE, 2012 6th International Conference on Sciences of Electronics, Technologies of Information and Telecommunications (SETIT), Sousse, pp. 962–975 (2012). https://doi.org/10.1109/SETIT.2012.6482043, http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6482043&isnumber=6481878
Ohsawa, Y., Benson, N.E., Yachida, M., Science, H.: KeyGraph: automatic indexing by co-occurrence graph based on building construction metaphor. In: International Forum on Research and Technology Research and Technology Advances in Digital Libraries, pp. 12–18 (1998)
Litvak, M., Last, M.: Graph-based keyword extraction for single-document summarization. In: Proceedings of the workshop on Multi-source Multilingual Information Extraction and Summarization, pp. 17–24, August 2008
Abu-errub, A.: Arabic text classification algorithm using TFIDF and chi square measurements. Int. J. Comput. Appl. 93(6), 40–45 (2014)
Abilhoa, W.D., De Castro, L.N.: A keyword extraction method from twitter messages represented as graphs. Appl. Math. Comput. 240, 308–325 (2014)
Jain, A., Mishra, R.D.: Text categorization: by combining Naïve Bayes and modified maximum entropy classifiers. Int. J. Adv. Electron. Comput. Sci., 22–126, Sep 2016. Special Issue, ISSN: 2393-2835. http://www.iraj.in/journal/journal_file/journal_pdf/12-295-1475473979122-126.pdf
Brahimi, B., Touahria, M., Tari, A.: Data and text mining techniques for classifying Arabic tweet polarity. J. Digit. Inf. Manag. 14(1), 15–25 (2016)
Shah, M., Shinde, S., Sawant, R.S., Wagh, P.P.: Analysis of text review using hybrid classifier, vol. 7, no. 4, pp. 10914–10916 (2017)
Rezaeian, M., Montazeri, H., Loonen, R.C.G.M.: Science foresight using life-cycle analysis, text mining and clustering: a case study on natural ventilation. Technol. Forecast. Soc. Change 118, 270–280 (2017)
Al_Janabi, S.: Smart System to create optimal higher education environment using IDA and IOTs. Int. J. Comput. Appl. (2018). https://doi.org/10.1080/1206212x.2018.1512460
Dutta, A.: A novel extension for automatic keyword extraction. Int. J. Adv. Res. Comput. Sci. Softw. Eng. 6(5), 160–163 (2016)
Rose, S., Engel, D., Cramer, N., Cowley, W.: Automatic keyword extraction from individual documents. In: Text Mining: Application and Theory, pp. 1–20. Wiley (2010). ISBN: 978-0-470-74982-1. https://doi.org/10.1002/9780470689646.ch1
Erciyes, K.: Complex networks: an algorithmic perspective (2015)
Al-Janabi, S., Salman, M.A., Fanfakh, A.: Recommendation system to improve time management for people in education environments. J. Eng. Appl. Sci. 13, 10182–10193 (2018). https://doi.org/10.3923/jeasci.2018.10182.10193, http://medwelljournals.com/abstract/?doi=jeasci.2018.10182.10193
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Al_Janabi, S., Salman, M.A., Mohammad, M. (2019). Multi-level Network Construction Based on Intelligent Big Data Analysis. In: Farhaoui, Y., Moussaid, L. (eds) Big Data and Smart Digital Environment. ICBDSDE 2018. Studies in Big Data, vol 53. Springer, Cham. https://doi.org/10.1007/978-3-030-12048-1_13
Download citation
DOI: https://doi.org/10.1007/978-3-030-12048-1_13
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-12047-4
Online ISBN: 978-3-030-12048-1
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)