Skip to main content

An Approach for Knowledge Extraction from Source Code (KNESC) of Typed Programming Languages

  • Conference paper

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 745))

Abstract

Knowledge extraction is the discovery of knowledge from structured and/or unstructured sources. This knowledge can be used to build or enrich a domain ontology. Source code is rarely used. But implementation platforms evolve faster than business logic and these evolutions are usually integrated directly into source code without updating the conceptual model. In this paper, we present a generic approach for knowledge extraction from source code of typed programming languages using Hidden Markov Models. This approach consist of the definition of the HMM so that it can be used to extract any type of knowledge from the source code. The method is experimented on EPICAM and GeoServer developed in Java and on MapServer developed in C/C++. Structural evaluation shows that source code contains a structure that permit to build a domain ontology and functional evaluation shows that source code contains more knowledge than those contained in both databases and meta-models.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   259.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   329.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. Azanzi, J., Gaoussou, C.: Knowledge extraction from source code based on Hidden Markov Model: application to EPICAM. In: Proceedings of the 14th ACS/IEEE International Conference on Computer Systems and Applications (2017)

    Google Scholar 

  2. Bontcheva, K., Sabou, M.: Learning ontologies from software artifacts: exploring and combining multiple choices. Seman. Web Enabled Softw. Eng. 17, 235 (2014)

    Google Scholar 

  3. Dellschaft, K., Staab, S.: Strategies for the evaluation of ontology learning. In: Buitelaar, P., Cimiano, P. (eds.) Bridging the Gap between Text and Knowledge Selected Contributions to Ontology Learning and Population from Text. IOS Press (2008)

    Google Scholar 

  4. Djuric, D., Gasevic, D., Devedzic, V.: Ontology modeling and MDA. J. Object Technol. 4, 109–128 (2005)

    Article  Google Scholar 

  5. Foundation OSG: Geoserver (2014). http://geoserver.org/

  6. Gómez-Pérez, A., Fernández-López, M., Corcho, O.: Ontological Engineering: with examples from the areas of Knowledge Management, e-Commerce and the Semantic Web. Springer, New York (2007)

    Google Scholar 

  7. Maedche, A., Staab, S.: Semi-automatic engineering of ontologies from text. In: Proceedings of the 12th Internal Conference on Software and Knowledge Engineering Chicago, USA (2000)

    Google Scholar 

  8. Russell, S.J., Norvig, P.: Artificial Intelligence: A Modern Approach, 2nd edn. Pearson Education, Upper Saddle River (2003)

    MATH  Google Scholar 

  9. Seymore, K., Mccallum, A., Rosenfeld, R.: Learning Hidden Markov Model structure for information extraction. In: AAAI 1999 Workshop on Machine Learning for Information Extraction, pp. 37–42 (1999)

    Google Scholar 

  10. Shamsfard, M., Abdollahzadeh Barforoush, A.: The state of the art in ontology learning: a framework for comparison. Knowl. Eng. Rev. 18(4), 293–316 (2003)

    Article  Google Scholar 

  11. UMMISCO, MEDES, PNLT, CPC: Plate-forme de surveillance de la tuberculose (2016). http://github.com/UMMISCO/EPICAM

  12. Unbehauen, J., Hellmann, S., Auer, S., Stadler, C.: Knowledge extraction from structured sources. In: Search computing, pp 34–52. Springer, Heidelberg (2012)

    Chapter  Google Scholar 

  13. University of Minnesota: MapServer (2017). www.mapserver.org

  14. Zhao, S., Chang, E., Dillon, T.S.: Knowledge extraction from web-based application source code: an approach to database reverse engineering for ontology development. In: IRI, IEEE Systems, Man, and Cybernetics Society, pp. 153–159 (2008)

    Google Scholar 

  15. Zhou, L.: Ontology learning: state of the art and open issues. Inf. Technol. Manage. 8, 241–252 (2007)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Azanzi Jiomekong .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer International Publishing AG, part of Springer Nature

About this paper

Cite this paper

Jiomekong, A., Camara, G. (2018). An Approach for Knowledge Extraction from Source Code (KNESC) of Typed Programming Languages. In: Rocha, Á., Adeli, H., Reis, L.P., Costanzo, S. (eds) Trends and Advances in Information Systems and Technologies. WorldCIST'18 2018. Advances in Intelligent Systems and Computing, vol 745. Springer, Cham. https://doi.org/10.1007/978-3-319-77703-0_12

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-77703-0_12

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-77702-3

  • Online ISBN: 978-3-319-77703-0

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics