Towards a Semantic Search Engine for Open Source Software

  • Sihem Ben SassiEmail author
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 9679)


To be able to use or reuse an open source software, we must be aware of its existence and find it first. Existing search engines on the Web do not allow finding open source software satisfying given requirements while taking into account special semantics. In this paper, we propose a semantic search system for open source software allowing to identify and localize them. It relies on an ontology providing a mechanism to describe knowledge about open source software. Moreover, it can infer knowledge for semantic identification through some defined rules. Preliminary results are encouraging. The evaluation results compared to a non-semantic version of the same search engine confirm the added value of the consideration of the semantic aspect in search.


Open source software Ontology Search engine Semantic FLOSS 



Acknowledgment to Atef Charef and Raja Lagha for their participation during the elaboration of Se2FLOSS.


  1. 1.
    Survey Analysis: Open-Source Software Adoption and Governance-Worldwide-2014, February 2015.
  2. 2.
    2015 Future of Open Source Survey Results - Black Duck Software.
  3. 3.
    Widespread Use of Open-Source Software Demands Strong and Effective Governance, August 2014.
  4. 4.
    Inoue, K., Yokomori, R., Yamamoto, T., Matsushita, M., Kusumoto, S.: Ranking significance of software components based on use relations. IEEE Trans. Softw. Eng. 31(3), 213–225 (2005)CrossRefGoogle Scholar
  5. 5.
    Gysin, F.S.: Improved social trustability of code search results. In: 32nd ACM/IEEE International Conference on Software Engineering, Cape Town, South Africa, pp. 513–514. ACM Press (2010)Google Scholar
  6. 6.
    Krugle OpenSearch.
  7. 7.
    BlackDuck Open HUB.
  8. 8.
    Reiss, S.P.: Semantics-based code search. In: 31st ACM/IEEE International Conference on Software Engineering, Vancouver, Canada, pp. 243–253. IEEE Computer Society (2009)Google Scholar
  9. 9.
    Merobase Source Code Search.
  10. 10.
    Hummel, O., Janjic, W., Atkinson, W.: Code conjurer: pulling reusable software out of thin air. IEEE Softw. 25(5), 45–52 (2008)CrossRefGoogle Scholar
  11. 11.
    Linstead, E., Bajracharya, S., Ngo, T., Rigor, P., Lopes, C., Baldi, P.: Sourcerer: mining and searching internet-scale software repositories. Data Min. Knowl. Disc. 18(2), 300–336 (2009)MathSciNetCrossRefGoogle Scholar
  12. 12.
    McMillan, C., Grechanik, M., Poshyvanyk, D., Fu, C., Xie, Q.: Exemplar: a source code search engine for finding highly relevant applications. IEEE Trans. Softw. Eng. 38(5), 1069–1087 (2012)CrossRefGoogle Scholar
  13. 13.
    Open source software.
  14. 14.
  15. 15.
    Gruber, T.: Ontology. In: Liu, L., Tamer Özsu, M. (eds.) Encyclopedia of Database Systems, pp. 1963–1965. Springer, Heidelberg (2009)Google Scholar
  16. 16.
    Guarino, N., Welty, C.: Evaluating ontological decisions with OntoClean. Commun. ACM 45(2), 61–65 (2002)CrossRefGoogle Scholar
  17. 17.
    Sure, Y., Studer, R.: On-to-knowledge methodology. In: Staab, S., Studer, R. (eds.) Handbook on Ontologies. Springer, Heidelberg (2003)Google Scholar
  18. 18.
    Gomez-Perez, A., Fernandez-Lopez, M., Corcho, O.: Ontological Engineering with Examples from the Areas of Knowledge Management, e-Commerce and the Semantic Web. Springer, London (2004)Google Scholar
  19. 19.
    Bachimont, B., Isaac, A., Troncy, R.: Semantic commitment for designing ontologies: a proposal. In: Gómez-Pérez, A., Benjamins, V.R. (eds.) EKAW 2002. LNCS (LNAI), vol. 2473, pp. 114–121. Springer, Heidelberg (2002)CrossRefGoogle Scholar
  20. 20.
  21. 21.
    Various Licenses and Comments about Them.
  22. 22.
  23. 23.
    Beltaifa, R.: Une infrastructure pour la rutilisation de composants logiciels. Ph.D. thesis, National School of Computer Sciences, Tunisia (2004)Google Scholar
  24. 24.
    Spinellis, D., Gousios, G., Karakoidas, V., Louridas, P., Admas, P.J., Samoladas, I., Stamelos, I.: Evaluating the quality of open source software. Electron. Notes Theor. Comput. Sci. 223, 5–28 (2009)CrossRefGoogle Scholar
  25. 25.
    Bollmann, P.: The normalized recall and related measures. In: 6th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Maryland, USA, pp. 122–128. ACM Press (1983)Google Scholar
  26. 26.
    Dyer, R., Nguyen, H.A., Rajan, H., Nguyen, T.N.: Boa: a language and infrastructure for analyzing ultra-large-scale software repositories. In: 35th International Conference on Software Engineering, San Francisco, CA, USA, pp. 422–431. IEEE Press (2013)Google Scholar

Copyright information

© Springer International Publishing Switzerland 2016

Authors and Affiliations

  1. 1.RIADI Laboratory, National School of Computer SciencesManouba UniversityLa ManoubaTunisia
  2. 2.High Institute of Telecommunication, Technoparc ElGhazalaArianaTunisia

Personalised recommendations