A Tagger for Glossary of Terms Extraction from Ontology Competency Questions
Competency Questions (CQs) are questions expressed in natural language aimed to indicate ontology’s scope, which are later formalized according to the language used to represent the ontology. One intermediate step that facilitates formalizing CQs, proposed in ontology engineering methodologies, is to extract so-called Glossary of Terms from them, which is so far a manual process. To automate this intermediate step, we propose a tagger, which for the given sequence of words, in a CQ, decides whether it should be considered as a suggestion of vocabulary (a class, an instance or a property) in the created ontology, and in this way being a good candidate entry to the Glossary of Terms. We also report about preliminary evaluation of the tagger.
KeywordsOntology engineering Competency Questions Knowledge extraction
- 2.Lafferty, J.D., McCallum, A., Pereira, F.C.N.: Conditional random fields: probabilistic models for segmenting and labeling sequence data. In: Proceedings of ICML (2001)Google Scholar
- 3.Ren, Y., Parvizi, A., Mellish, C., Pan, J.Z., van Deemter, K., Stevens, R.: Towards competency question-driven ontology authoring. In: Presutti, V., d’Amato, C., Gandon, F., d’Aquin, M., Staab, S., Tordai, A. (eds.) ESWC 2014. LNCS, vol. 8465, pp. 752–767. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-07443-6_50CrossRefGoogle Scholar
- 4.Suarez-Figueroa, M.C., et al.: NeOn methodology for building contextualized ontology networks. NeOn Deliverable D5.4.1, NeOn Project (2008)Google Scholar
- 5.Wisniewski, D., Potoniec, J., Lawrynowicz, A., Keet, C.M.: Competency questions and SPARQL-OWL queries dataset and analysis. Technical report 1811.09529, November 2018. https://arxiv.org/abs/1811.09529