Building a Distributed Infrastructure for Scalable Triple Stores

  • Regular Paper
Journal of Computer Science and Technology

Abstract

Built specifically for the Semantic Web, triple stores must accommodate large numbers of RDF triples, yet they remain primarily centralized. As triple stores grow and evolve over time, there is a pressing need for scalable techniques that remove resource and performance bottlenecks in such systems. To this end, we propose a fully decentralized peer-to-peer architecture for large-scale triple stores in which triples are maintained by individual stakeholders, together with a semantics-directed search protocol, mediated by topology reorganization, for locating triples of interest. We test our design through simulations, and the results show the anticipated improvements over existing techniques for distributed triple stores. In addition to informing the engineering of future large-scale triple stores, our work will in particular benefit the federation of today's stand-alone triple stores to achieve the desired scalability.

Author information

Correspondence to Jing Zhou.

Additional information

This research was primarily conducted while Jing Zhou was affiliated with the School of Electronics and Computer Science, University of Southampton, U.K. and is supported in part by the Leading Academic Discipline Program, 211 Project for Communication University of China (the 3rd phase).

About this article

Cite this article

Zhou, J., Hall, W. & De Roure, D. Building a Distributed Infrastructure for Scalable Triple Stores. J. Comput. Sci. Technol. 24, 447–462 (2009). https://doi.org/10.1007/s11390-009-9236-1

  • DOI: https://doi.org/10.1007/s11390-009-9236-1
