An Approach for Knowledge Extraction from Source Code (KNESC) of Typed Programming Languages

Jiomekong, Azanzi; Camara, Gaoussou

doi:10.1007/978-3-319-77703-0_12

An Approach for Knowledge Extraction from Source Code (KNESC) of Typed Programming Languages

Azanzi Jiomekong⁶ &
Gaoussou Camara⁷

Conference paper

8534 Accesses
2 Citations

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 745))

Abstract

Knowledge extraction is the discovery of knowledge from structured and/or unstructured sources. This knowledge can be used to build or enrich a domain ontology. Source code is rarely used. But implementation platforms evolve faster than business logic and these evolutions are usually integrated directly into source code without updating the conceptual model. In this paper, we present a generic approach for knowledge extraction from source code of typed programming languages using Hidden Markov Models. This approach consist of the definition of the HMM so that it can be used to extract any type of knowledge from the source code. The method is experimented on EPICAM and GeoServer developed in Java and on MapServer developed in C/C++. Structural evaluation shows that source code contains a structure that permit to build a domain ontology and functional evaluation shows that source code contains more knowledge than those contained in both databases and meta-models.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 259.00; Price excludes VAT (USA)

Softcover Book: USD 329.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

Azanzi, J., Gaoussou, C.: Knowledge extraction from source code based on Hidden Markov Model: application to EPICAM. In: Proceedings of the 14th ACS/IEEE International Conference on Computer Systems and Applications (2017)
Google Scholar
Bontcheva, K., Sabou, M.: Learning ontologies from software artifacts: exploring and combining multiple choices. Seman. Web Enabled Softw. Eng. 17, 235 (2014)
Google Scholar
Dellschaft, K., Staab, S.: Strategies for the evaluation of ontology learning. In: Buitelaar, P., Cimiano, P. (eds.) Bridging the Gap between Text and Knowledge Selected Contributions to Ontology Learning and Population from Text. IOS Press (2008)
Google Scholar
Djuric, D., Gasevic, D., Devedzic, V.: Ontology modeling and MDA. J. Object Technol. 4, 109–128 (2005)
Article Google Scholar
Foundation OSG: Geoserver (2014). http://geoserver.org/
Gómez-Pérez, A., Fernández-López, M., Corcho, O.: Ontological Engineering: with examples from the areas of Knowledge Management, e-Commerce and the Semantic Web. Springer, New York (2007)
Google Scholar
Maedche, A., Staab, S.: Semi-automatic engineering of ontologies from text. In: Proceedings of the 12th Internal Conference on Software and Knowledge Engineering Chicago, USA (2000)
Google Scholar
Russell, S.J., Norvig, P.: Artificial Intelligence: A Modern Approach, 2nd edn. Pearson Education, Upper Saddle River (2003)
MATH Google Scholar
Seymore, K., Mccallum, A., Rosenfeld, R.: Learning Hidden Markov Model structure for information extraction. In: AAAI 1999 Workshop on Machine Learning for Information Extraction, pp. 37–42 (1999)
Google Scholar
Shamsfard, M., Abdollahzadeh Barforoush, A.: The state of the art in ontology learning: a framework for comparison. Knowl. Eng. Rev. 18(4), 293–316 (2003)
Article Google Scholar
UMMISCO, MEDES, PNLT, CPC: Plate-forme de surveillance de la tuberculose (2016). http://github.com/UMMISCO/EPICAM
Unbehauen, J., Hellmann, S., Auer, S., Stadler, C.: Knowledge extraction from structured sources. In: Search computing, pp 34–52. Springer, Heidelberg (2012)
Chapter Google Scholar
University of Minnesota: MapServer (2017). www.mapserver.org
Zhao, S., Chang, E., Dillon, T.S.: Knowledge extraction from web-based application source code: an approach to database reverse engineering for ontology development. In: IRI, IEEE Systems, Man, and Cybernetics Society, pp. 153–159 (2008)
Google Scholar
Zhou, L.: Ontology learning: state of the art and open issues. Inf. Technol. Manage. 8, 241–252 (2007)
Article Google Scholar

Download references

Author information

Authors and Affiliations

UMMISCO, Faculty of Science, University of Yaoundé I, Yaoundé, Cameroon
Azanzi Jiomekong
EIR-IMTICE, Université Alioune Diop de Bambey, Bambey, Senegal
Gaoussou Camara

Authors

Azanzi Jiomekong
View author publications
You can also search for this author in PubMed Google Scholar
Gaoussou Camara
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Azanzi Jiomekong .

Editor information

Editors and Affiliations

Departamento de Engenharia Informática, Universidade de Coimbra, Coimbra, Portugal
Álvaro Rocha
College of Engineering, The Ohio State University, Columbus, OH, USA
Hojjat Adeli
DSI/EEUM, Universidade do Minho, Guimarães, Portugal
Luís Paulo Reis
DIMES, Università della Calabria, Arcavacata di Rende, Italy
Sandra Costanzo

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Jiomekong, A., Camara, G. (2018). An Approach for Knowledge Extraction from Source Code (KNESC) of Typed Programming Languages. In: Rocha, Á., Adeli, H., Reis, L.P., Costanzo, S. (eds) Trends and Advances in Information Systems and Technologies. WorldCIST'18 2018. Advances in Intelligent Systems and Computing, vol 745. Springer, Cham. https://doi.org/10.1007/978-3-319-77703-0_12

Download citation

DOI: https://doi.org/10.1007/978-3-319-77703-0_12
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-77702-3
Online ISBN: 978-3-319-77703-0
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics