Abstract
The Sindice Semantic Web index provides search capabilities over 260 million documents. Reasoning over web data enables to make explicit what would otherwise be implicit knowledge: it adds value to the information and enables Sindice to ultimately be more competitive in terms of precision and recall. However, due to the scale and heterogeneity of web data, a reasoning engine for the Sindice system must (1) scale out through parallelisation over a cluster of machines; and (2) cope with unexpected data usage. In this paper, we report our experiences and lessons learned in building a large scale reasoning engine for Sindice. The reasoning approach has been deployed, used and improved since 2008 within Sindice and has enabled Sindice to reason over billions of triples.
A preliminary version [6] of this article was presented at the 4th International Workshop on Scalable Semantic Web Knowledge Base Systems (SSWS 2008). We have extended it with a comparison with other large scale reasoning approaches, a performance evaluation, and reports on using and optimising the presented reasoning approach in a production system – the Sindice Semantic Web index – since 2008.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Bechhofer, S., van Harmelen, F., Hendler, J., Horrocks, I., McGuinness, D.L., Patel-Schneider, P.F., Stein, L.A.: OWL Web Ontology Language Reference. W3C Recommendation, W3C (February 2004)
Berners-Lee, T.: Linked data. W3C Design Issues (July 2006), http://www.w3.org/DesignIssues/LinkedData.html
Bouquet, P., Giunchiglia, F., van Harmelen, F., Serafini, L., Stuckenschmidt, H.: Contextualizing ontologies. Journal of Web Semantics 1(4), 325–343 (2004)
Carroll, J., Bizer, C., Hayes, P., Stickler, P.: Named graphs, provenance and trust. In: WWW 2005: Proceedings of the 14th international conference on World Wide Web, pp. 613–622. ACM Press, New York (2005)
d’Aquin, M., Baldassarre, C., Gridinoc, L., Angeletou, S., Sabou, M., Motta, E.: Characterizing Knowledge on the Semantic Web with Watson. In: EON, pp. 1–10 (2007)
Delbru, R., Polleres, A., Tummarello, G., Decker, S.: Context Dependent Reasoning for Semantic Documents in Sindice. In: Proceedings of the 4th International Workshop on Scalable Semantic Web Knowledge Base Systems, SSWS 2008 (2008)
Ding, L., Tao, J., McGuinness, D.L.: An initial investigation on evaluating semantic web instance data. In: WWW 2008: Proceeding of the 17th International Conference on World Wide Web, pp. 1179–1180. ACM, New York (2008)
Giunchiglia, F.: Contextual reasoning. Epistemologia, special issue on I Linguaggi e le Macchine 345, 345–364 (1993)
Guha, R.V.: Contexts: a formalization and some applications. Ph.D. thesis, Stanford, CA, USA (1992)
Guha, R.V., McCool, R., Fikes, R.: Contexts for the Semantic Web. In: International Semantic Web Conference, pp. 32–46 (2004)
Hayes, P.: RDF Semantics. W3C Recommendation, W3C (February 2004)
Hogan, A., Harth, A., Polleres, A.: Scalable Authoritative OWL Reasoning for the Web. International Journal on Semantic Web and Information Systems 5(2), 49–90 (2009)
Hogan, A., Pan, J.Z., Polleres, A., Decker, S.: SAOR: Template Rule Optimisations for Distributed Reasoning over 1 Billion Linked Data Triples. In: Proceedings of the 9th International Semantic Web Conference. Springer, Heidelberg (2010)
ter Horst, H.J.: Completeness, decidability and complexity of entailment for RDF Schema and a semantic extension involving the OWL vocabulary. Journal of Web Semantics 3(2-3), 79–115 (2005)
de Kleer, J.: An Assumption-Based TMS. Artif. Intell. 28(2), 127–162 (1986)
Mayfield, J., Finin, T.: Information retrieval on the Semantic Web: Integrating inference and retrieval. In: Proceedings of the SIGIR Workshop on the Semantic Web (August 2003)
McCarthy, J.: Notes On Formalizing Context. In: Proceedings of IJCAI 1993, pp. 555–560 (1993)
Miles, A., Baker, T., Swick, R.: Best Practice Recipes for Publishing RDF Vocabularies. W3C working group note, W3C (2008), http://www.w3.org/TR/swbp-vocab-pub/
Polleres, A., Feier, C., Harth, A.: Rules with contextually scoped negation. In: Sure, Y., Domingue, J. (eds.) ESWC 2006. LNCS, vol. 4011, pp. 332–347. Springer, Heidelberg (2006), http://www.polleres.net/publications/poll-etal-2006b.pdf
Serafini, L., Bouquet, P.: Comparing formal theories of context in AI. Artificial Intelligence 155(1-2), 41 (2004)
Stickler, P.: CBD - Concise Bounded Description. W3C Member Submission, W3C (June 2005)
Stoermer, H., Bouquet, P., Palmisano, I., Redavid, D.: A Context-Based Architecture for RDF Knowledge Bases: Approach, Implementation and Preliminary Results. In: Marchiori, M., Pan, J.Z., Marie, C.d.S. (eds.) RR 2007. LNCS, vol. 4524, pp. 209–218. Springer, Heidelberg (2007)
Urbani, J., Kotoulas, S., Maassen, J., van Harmelen, F., Bal, H.: OWL Reasoning with WebPIE: Calculating the Closure of 100 Billion Triples. In: Aroyo, L., Antoniou, G., Hyvönen, E., Teije, A., Stuckenschmidt, H., Cabral, L., Tudorache, T. (eds.) ESWC 2010. LNCS, vol. 6088, pp. 213–227. Springer, Heidelberg (2010), doi:10.1007/978-3-642-13486-9_15
Urbani, J., Kotoulas, S., Oren, E., van Harmelen, F.: Scalable distributed reasoning using mapreduce. In: International Semantic Web Conference, pp. 634–649 (2009)
Weaver, J., Hendler, J.A.: Parallel materialization of the finite rdfs closure for hundreds of millions of triples. In: Bernstein, A., Karger, D.R., Heath, T., Feigenbaum, L., Maynard, D., Motta, E., Thirunarayan, K. (eds.) ISWC 2009. LNCS, vol. 5823, pp. 682–697. Springer, Heidelberg (2009)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Delbru, R., Tummarello, G., Polleres, A. (2011). Context-Dependent OWL Reasoning in Sindice - Experiences and Lessons Learnt. In: Rudolph, S., Gutierrez, C. (eds) Web Reasoning and Rule Systems. RR 2011. Lecture Notes in Computer Science, vol 6902. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-23580-1_5
Download citation
DOI: https://doi.org/10.1007/978-3-642-23580-1_5
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-23579-5
Online ISBN: 978-3-642-23580-1
eBook Packages: Computer ScienceComputer Science (R0)