Skip to main content

A Distributed Publish/Subscribe System for RDF Data

  • Conference paper
Data Management in Cloud, Grid and P2P Systems (Globe 2013)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 8059))

Abstract

The pub/sub communication style is a prevalent messaging pattern for filtering information from distributed and large-scale network (e.g., from the real-time web, sensor networks, etc.) thanks to the decoupling between publishers and subscribers. At the same time, persisting the published information is a prerequisite for any further batch analytics on such big amount of data. As data can be heterogeneous, reliance on format from the semantic web such as RDF is unavoidable. In this paper we introduce two versions of a content-based pub/sub matching algorithm for RDF described events, working on an adapted version of the CAN structured P2P network designed to both store and disseminate RDF events. In contrary to existing pub/sub solutions based upon structured overlay networks that index semantic events several times due to the use of hash functions, we leverage the lexicographic order of the event elements. Thus, only subscriptions and not publications have to be duplicated, which is better given that in real settings, publications may occur more frequently than subscriptions. Furthermore, our system allows to publish events made of any number of elements and the subscription language leverages the SPARQL query language. The first algorithm we introduce initially derives from the ideas discussed by Liarou. et al. based upon rewriting continuous queries along matching RDF elements (CSBV) with the purpose to perform the matching between subscriptions and several RDF elements on multiple nodes. The experimental results discuss the applicability of the presented algorithms to some synthetic scenarios and identify, accordingly, which pub/sub matching algorithm is the more relevant.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 54.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 72.00
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Broekstra, J., Kampman, A., van Harmelen, F.: Sesame: A generic architecture for storing and querying RDF and RDF schema. In: Horrocks, I., Hendler, J. (eds.) ISWC 2002. LNCS, vol. 2342, pp. 54–68. Springer, Heidelberg (2002)

    Chapter  Google Scholar 

  2. Cai, M., Frank, M., Chen, J., Szekely, P.: Maan: A multi-attribute addressable network for grid information services. Journal of Grid Computing 2(1), 3–14 (2004)

    Article  MATH  Google Scholar 

  3. Cai, M., Frank, M., Yan, B., MacGregor, R.: A subscribable peer-to-peer rdf repository for distributed metadata management. Web Semantics: Science, Services and Agents on the World Wide Web 2(2), 109–130 (2004)

    Article  Google Scholar 

  4. Carroll, J., Dickinson, I., Dollin, C., Reynolds, D., Seaborne, A., Wilkinson, K.: Jena: implementing the semantic web recommendations. In: Proceedings of the 13th International World Wide Web Conference on Alternate Track Papers & Posters, pp. 74–83. ACM (2004)

    Google Scholar 

  5. Carzaniga, A., Rosenblum, D., Wolf, A.: Design and evaluation of a wide-area event notification service. ACM Transactions on Computer Systems (TOCS) 19(3), 332–383 (2001)

    Article  Google Scholar 

  6. Castro, M., Druschel, P., Kermarrec, A., Rowstron, A.: Scribe: A large-scale and decentralized application-level multicast infrastructure. IEEE Journal on Selected Areas in Communications 20(8), 1489–1499 (2002)

    Article  Google Scholar 

  7. Fitzpatrick, B., Slatkin, B., Atkins, M.: Pubsubhubbub protocol (2010), http://pubsubhubbub.googlecode.com/svn/trunk/pubsubhubbub-core-0.3.html

  8. Forgy, C.L.: Rete: A fast algorithm for the many pattern/many object pattern match problem. Artificial Intelligence 19(1), 17–37 (1982), http://dx.doi.org/10.1016/0004-3702(82)90020-0

    Article  Google Scholar 

  9. Gupta, A., Sahin, O.D., Agrawal, D.P., El Abbadi, A.: Meghdoot: Content-based publish/Subscribe over P2P networks. In: Jacobsen, H.-A. (ed.) Middleware 2004. LNCS, vol. 3231, pp. 254–273. Springer, Heidelberg (2004)

    Chapter  Google Scholar 

  10. Jelasity, M., Kermarrec, A.M.: Ordered slicing of very large-scale overlay networks. In: Sixth IEEE International Conference on Peer-to-Peer Computing, P2P 2006, pp. 117–124. IEEE (2006)

    Google Scholar 

  11. Kiryakov, A., Ognyanov, D., Manov, D.: OWLIM – A pragmatic semantic repository for OWL. In: Dean, M., Guo, Y., Jun, W., Kaschek, R., Krishnaswamy, S., Pan, Z., Sheng, Q.Z. (eds.) WISE 2005 Workshops. LNCS, vol. 3807, pp. 182–192. Springer, Heidelberg (2005)

    Chapter  Google Scholar 

  12. Lakshman, A., Malik, P.: Cassandra decentralized structured storage system. Operating Systems Review 44(2), 35 (2010)

    Article  Google Scholar 

  13. Li, M., Ye, F., Kim, M., Chen, H., Lei, H.: A scalable and elastic publish/subscribe service. In: 2011 IEEE International Parallel & Distributed Processing Symposium (IPDPS), pp. 1254–1265. IEEE (2011)

    Google Scholar 

  14. Liarou, E., Idreos, S., Koubarakis, M.: Continuous RDF query processing over DHTs. In: Aberer, K., et al. (eds.) ASWC 2007 and ISWC 2007. LNCS, vol. 4825, pp. 324–339. Springer, Heidelberg (2007)

    Chapter  Google Scholar 

  15. Pellegrino, L., Baude, F., Alshabani, I.: Towards a scalable cloud-based rdf storage offering a pub/sub query service. In: The Third International Conference on Cloud Computing, GRIDs, and Virtualization, pp. 243–246 (2012)

    Google Scholar 

  16. Pietzuch, P., Bacon, J.: Hermes: A distributed event-based middleware architecture. In: Proceedings of 22nd International Conference on Distributed Computing Systems Workshops, pp. 611–618. IEEE (2002)

    Google Scholar 

  17. Prud Hommeaux, E., Seaborne, A., et al.: Sparql query language for rdf. W3C Recommendation 15 (2008)

    Google Scholar 

  18. Ranger, D., Cloutier, J.-F.: Scalable peer-to-peer RDF query algorithm. In: Dean, M., Guo, Y., Jun, W., Kaschek, R., Krishnaswamy, S., Pan, Z., Sheng, Q.Z. (eds.) WISE 2005 Workshops. LNCS, vol. 3807, pp. 266–274. Springer, Heidelberg (2005)

    Chapter  Google Scholar 

  19. Ratnasamy, S., Francis, P., Handley, M., Karp, R., Shenker, S.: A scalable content-addressable network. ACM SIGCOMM Computer Communication Review 31(4), 160–172 (2001)

    Article  Google Scholar 

  20. Rowstron, A., Druschel, P.: Pastry: Scalable, decentralized object location, and routing for large-scale peer-to-peer systems. In: Guerraoui, R. (ed.) Middleware 2001. LNCS, vol. 2218, pp. 329–350. Springer, Heidelberg (2001)

    Chapter  Google Scholar 

  21. Shvartzshnaider, Y., Ott, M., Levy, D.: Publish/Subscribe on top of DHT using RETE algorithm. In: Berre, A.J., Gómez-Pérez, A., Tutschku, K., Fensel, D. (eds.) FIS 2010. LNCS, vol. 6369, pp. 20–29. Springer, Heidelberg (2010)

    Chapter  Google Scholar 

  22. Stoica, I., Morris, R., Karger, D., Kaashoek, M., Balakrishnan, H.: Chord: A scalable peer-to-peer lookup service for internet applications. ACM SIGCOMM Computer Communication Review 31(4), 149–160 (2001)

    Article  Google Scholar 

  23. TIBCO, I.: Tib/rendezvous white paper, Palo Alto, California (1999)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Pellegrino, L., Huet, F., Baude, F., Alshabani, A. (2013). A Distributed Publish/Subscribe System for RDF Data. In: Hameurlain, A., Rahayu, W., Taniar, D. (eds) Data Management in Cloud, Grid and P2P Systems. Globe 2013. Lecture Notes in Computer Science, vol 8059. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-40053-7_4

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-40053-7_4

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-40052-0

  • Online ISBN: 978-3-642-40053-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics