Abstract
In this paper, we explore the automatic construction of concept maps for various knowledge domains. We propose a simple three-step algorithm that extracts the key elements of a concept map (nodes and links) from a given collection of domain documents. The algorithm operates on a term-document matrix that records how frequently each term occurs in each document of the collection. In the first step, we decompose this matrix into a scores matrix (terms-by-factors) and a loadings matrix (factors-by-documents) using non-negative matrix factorization, where each factor represents one topic of the collection. Since the scores matrix specifies the relative contribution of each term to each factor, we select the most strongly contributing terms and use them as concept map nodes. In the second step, we associate the selected key terms with the corresponding row vectors of the term-document matrix and calculate pairwise cosine distances between them. Since small distances identify pairs of strongly related key terms, we select the strongest relations as concept map links. Finally, we construct the resulting concept map as a graph with the selected nodes and links. The benefits of our statistical algorithm are its simplicity, efficiency, and applicability to any domain, any language, and any document collection.
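The three steps described above can be illustrated with a minimal sketch. It is not the authors' implementation: the toy matrix, the multiplicative-update solver for the factorization, and the parameters `k`, `top_n`, and `threshold` are all assumptions chosen for illustration. The sketch scores term pairs by cosine similarity, which is one minus the cosine distance, so a high similarity corresponds to a small distance.

```python
import math
import random


def matmul(a, b):
    """Multiply two matrices stored as lists of rows."""
    cols = list(zip(*b))
    return [[sum(x * y for x, y in zip(row, col)) for col in cols] for row in a]


def transpose(m):
    return [list(col) for col in zip(*m)]


def nmf(v, k, iters=200, seed=0, eps=1e-9):
    """Factor v (terms x docs) into non-negative w (terms x k) and h (k x docs).

    Uses multiplicative updates; the choice of solver is an assumption,
    since the abstract does not name one.
    """
    rng = random.Random(seed)
    n, m = len(v), len(v[0])
    w = [[rng.random() + 0.1 for _ in range(k)] for _ in range(n)]
    h = [[rng.random() + 0.1 for _ in range(m)] for _ in range(k)]
    for _ in range(iters):
        wt = transpose(w)
        num, den = matmul(wt, v), matmul(matmul(wt, w), h)
        h = [[h[i][j] * num[i][j] / (den[i][j] + eps) for j in range(m)]
             for i in range(k)]
        ht = transpose(h)
        num, den = matmul(v, ht), matmul(w, matmul(h, ht))
        w = [[w[i][j] * num[i][j] / (den[i][j] + eps) for j in range(k)]
             for i in range(n)]
    return w, h


def cosine(a, b):
    """Cosine similarity of two vectors (0.0 if either is all zeros)."""
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(x * x for x in b))
    if na == 0 or nb == 0:
        return 0.0
    return sum(x * y for x, y in zip(a, b)) / (na * nb)


def build_concept_map(terms, v, k=2, top_n=2, threshold=0.5):
    # Step 1: factorize and take the top-scoring terms of each factor as nodes.
    w, _ = nmf(v, k)
    nodes = []
    for f in range(k):
        ranked = sorted(range(len(terms)), key=lambda i: w[i][f], reverse=True)
        for i in ranked[:top_n]:
            if i not in nodes:
                nodes.append(i)
    # Step 2: compare the term-document rows of the selected terms pairwise.
    links = []
    for a in range(len(nodes)):
        for b in range(a + 1, len(nodes)):
            i, j = nodes[a], nodes[b]
            sim = cosine(v[i], v[j])
            if sim >= threshold:  # hypothetical cut-off for "strong" relations
                links.append((terms[i], terms[j], round(sim, 3)))
    # Step 3: the concept map is the graph given by these nodes and links.
    return [terms[i] for i in nodes], links


# Toy term-document counts (rows: terms, columns: 4 documents); invented data.
TERMS = ["matrix", "factor", "vector", "cell", "protein", "gene"]
V = [
    [3, 2, 0, 0],
    [2, 3, 0, 0],
    [1, 2, 0, 0],
    [0, 0, 3, 1],
    [0, 0, 2, 2],
    [0, 0, 1, 3],
]

nodes, links = build_concept_map(TERMS, V)
print("nodes:", nodes)
print("links:", links)
```

In this toy collection the first three terms occur only in the first two documents and the last three only in the last two, so any link that survives the threshold connects terms from the same group. On a real collection the number of factors, the number of terms kept per factor, and the link threshold would all need tuning.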
Copyright information
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Nugumanova, A., Mansurova, M., Alimzhanov, E., Zyryanov, D., Apayev, K. (2016). An Automatic Construction of Concept Maps Based on Statistical Text Mining. In: Helfert, M., Holzinger, A., Belo, O., Francalanci, C. (eds) Data Management Technologies and Applications. DATA 2015. Communications in Computer and Information Science, vol 584. Springer, Cham. https://doi.org/10.1007/978-3-319-30162-4_3
DOI: https://doi.org/10.1007/978-3-319-30162-4_3
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-30161-7
Online ISBN: 978-3-319-30162-4
eBook Packages: Computer Science (R0)