A Subject Partitioning Based SPARQL Query Engine and Its NoSQL Implementation

  • Chuanlei Ni
  • Wei Hu
  • Yuzhong Qu
Part of the Communications in Computer and Information Science book series (CCIS, volume 406)


The schema-free nature of RDF makes processing its query language, SPARQL, challenging. In this paper, we propose a subject partitioning approach that stores RDF data in units of entity documents to reduce join operations. To transform a query graph into a query plan, we design a size estimation model specific to SPARQL processing. We implement a prototype system called sp-NoSQL on top of the MongoDB NoSQL database. We evaluate the effectiveness of our approach by comparing it with two representative SPARQL query engines.
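To illustrate the general idea of subject partitioning, the following is a minimal sketch, not the sp-NoSQL implementation. It groups triples that share a subject into a single entity document, so a star-shaped pattern over one subject can be answered by inspecting one document rather than joining per-triple records. The document layout (`_id`, `properties`) and the helper names `build_entity_documents` and `match_star` are assumptions made for this example; the paper's actual MongoDB schema may differ.

```python
def build_entity_documents(triples):
    """Group (subject, predicate, object) triples into one document per subject.

    Assumed layout: {"_id": subject, "properties": {predicate: [objects]}}.
    """
    docs = {}
    for subject, predicate, obj in triples:
        doc = docs.setdefault(subject, {"_id": subject, "properties": {}})
        doc["properties"].setdefault(predicate, []).append(obj)
    return docs


def match_star(docs, constraints):
    """Return subjects whose document satisfies every (predicate, object) pair.

    An object of None acts like an unbound variable: the predicate only has
    to be present. No join is needed because all triples of a subject live
    in the same document.
    """
    matches = []
    for subject, doc in docs.items():
        props = doc["properties"]
        if all(
            p in props and (o is None or o in props[p])
            for p, o in constraints
        ):
            matches.append(subject)
    return matches
```

In a document store such as MongoDB, each dictionary produced here would correspond to one stored document. The in-memory scan in `match_star` only stands in for a database query.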





Copyright information

© Springer-Verlag Berlin Heidelberg 2013

Authors and Affiliations

  • Chuanlei Ni (1)
  • Wei Hu (1)
  • Yuzhong Qu (1)
  1. State Key Laboratory for Novel Software Technology, Nanjing University, China
