Supporting Semantic Search on Heterogeneous Semi-structured Documents
This paper presents SHIRI-Querying, an approach for semantic search on semi-structured documents. We propose a solution to tackle incompleteness and imprecision of semantic annotations of semistructured documents at querying time. We particularly introduce three elementary reformulations that rely on the notion of aggregation and on the document structure. We present the Dynamic Reformulation and Execution of Queries algorithm (DREQ) which combines these elementary transformations to construct reformulated queries w.r.t. a defined order relation. Experiments on two real datasets show that these reformulations greatly increase the recall and that returned answers are effectively ranked according to their precision.
KeywordsSemantic Relation Domain Ontology User Query Semantic Annotation Annotation Model
- 3.Bhagdev, R., Chapman, S., Ciravegna, F., Lanfranchi, V., Petrelli, D.: Hybrid Serach: Effectively Combining Keywords and Semantic Searches. In: Bechhofer, S., Hauswirth, M., Hoffmann, J., Koubarakis, M. (eds.) ESWC 2008. LNCS, vol. 5021, pp. 554–568. Springer, Heidelberg (2008)CrossRefGoogle Scholar
- 7.Castells, P., Fernàndez, M., Vallet, D.: An adaptation of the vector-space model for ontology-based information retreival. IEEE T. on Know. and Data Eng. 19(2) (2007)Google Scholar