Entity-Based Semantic Search on Conversational Transcripts Semantic
- 1k Downloads
This paper describes the implementation of a semantic web search engine on conversation styled transcripts. Our choice of data is Hansard, a publicly available conversation style transcript of parliamentary debates. The current search engine implementation on Hansard is limited to running search queries based on keywords or phrases hence lacks the ability to make semantic inferences from user queries. By making use of knowledge such as the relationship between members of parliament, constituencies, terms of office, as well as topics of debates the search results can be improved in terms of both relevance and coverage. Our contribution is not algorithmic instead we describe how we exploit a collection of external data sources, ontologies, semantic web vocabularies and named entity extraction in the analysis of underlying semantics of user queries as well as the semantic enrichment of the search index thereby improving the quality of results.
KeywordsHansard Named Entities Semantic Search RDF
Unable to display preview. Download preview PDF.
- 1.Emmerich, W.W.: Distributed Component Technologies and their Software Engineering Implications. In: Proceedings of the 24th International Conference on Software Engineering, Orlando, Florida, pp. 537–546 (2002)Google Scholar
- 5.Navigli, R.: Word Sense Disambiguation: A Survey. ACM Computing Surveys 41(2), Article No. 10 (February 2009)Google Scholar