Skip to main content

HDT Bitmap Triple Indices for Efficient RDF Data Exploration

  • Conference paper
  • First Online:
The Semantic Web (ESWC 2021)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 12731))

Included in the following conference series:

  • 2405 Accesses

Abstract

The exploration of large, unknown RDF data sets is difficult even for users who are familiar with Semantic Web technologies as, e.g., the SPARQL query language. The concept of faceted navigation offers a user-friendly exploration method through filters that are chosen such that no empty result sets occur. Computing such filters is resource intensive, especially for large data sets, and may cause considerable delays in the user interaction. One possibility for improving the performance is the generation of indices for partial solutions. In this paper, we propose and evaluate indices in form of the Bitmap Triple (BT) data structure, generated over the Header-Dictionary-Triples (HDT) RDF compression format. We show that the resulting indices can be utilized to efficiently compute the required exploratory operations for data sets with up to 150 million triples. In the experiments, the BT indices exhibit a stable performance and outperform other deployed approaches in four out of five compared operations.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    https://github.com/rdfhdt/hdt-java.

  2. 2.

    https://github.com/MaximilianWenzel/hdt-bt-indices-java-lib.

  3. 3.

    https://jena.apache.org/documentation/query/index.html.

References

  1. Semspect by derivo. http://semspect.de/. Accessed 14 Jan 2020

  2. Arenas, M., Cuenca Grau, B., Kharlamov, E., Marciuška, Š., Zheleznyakov, D.: Faceted search over RDF-based knowledge graphs. J. Web Semant. 37, 55–74 (2016)

    Article  Google Scholar 

  3. Bast, H., Bäurle, F., Buchhold, B., Haußmann, E.: Easy access to the freebase dataset. In: Proceedings of 23rd International Conference on World Wide Web, pp. 95–98 (2014)

    Google Scholar 

  4. Brisaboa, N.R., Cánovas, R., Claude, F., Martínez-Prieto, M.A., Navarro, G.: Compressed string dictionaries. In: Pardalos, P.M., Rebennack, S. (eds.) SEA 2011. LNCS, vol. 6630, pp. 136–147. Springer, Heidelberg (2011). https://doi.org/10.1007/978-3-642-20662-7_12

    Chapter  Google Scholar 

  5. Fernández, J.D., Martínez-Prieto, M.A., Gutiérrez, C., Polleres, A., Arias, M.: Binary RDF representation for publication and exchange (HDT). J. Web Semant. 19, 22–41 (2013)

    Article  Google Scholar 

  6. Ferré, S.: Sparklis: an expressive query builder for SPARQL endpoints with guidance in natural language. Semant. Web 8(3), 405–418 (2017)

    Article  Google Scholar 

  7. Heim, P., Ziegler, J., Lohmann, S.: gFacet: a browser for the web of data. In: Proceedings of International Workshop on Interacting with Multimedia Content in the Social Semantic Web, vol. 417, pp. 49–58. CEUR-WS.org (2008)

    Google Scholar 

  8. Hildebrand, M., van Ossenbruggen, J., Hardman, L.: /facet: a browser for heterogeneous semantic web repositories. In: Cruz, I., et al. (eds.) ISWC 2006. LNCS, vol. 4273, pp. 272–285. Springer, Heidelberg (2006). https://doi.org/10.1007/11926078_20

    Chapter  Google Scholar 

  9. Hyvönen, E., Saarela, S., Viljanen, K.: Ontogator: combining view- and ontology-based search with semantic browsing. J. Inf. Retrieval 16, 17 (2003)

    Google Scholar 

  10. Kharlamov, E., Giacomelli, L., Sherkhonov, E., Cuenca Grau, B., Kostylev, E.V., Horrocks, I.: SemFacet: making hard faceted search easier. In: Proceedings of ACM on Conference on Information and Knowledge Management, pp. 2475–2478 (2017)

    Google Scholar 

  11. Liebig, T., Vialard, V., Opitz, M.: Connecting the dots in million-nodes knowledge graphs with SemSpect. In: Proceedings of International Semantic Web Conference (Posters, Demos & Industry Tracks) (2017)

    Google Scholar 

  12. Martínez-Prieto, M.A., Arias Gallego, M., Fernández, J.D.: Exchange and consumption of huge RDF data. In: Simperl, E., Cimiano, P., Polleres, A., Corcho, O., Presutti, V. (eds.) ESWC 2012. LNCS, vol. 7295, pp. 437–452. Springer, Heidelberg (2012). https://doi.org/10.1007/978-3-642-30284-8_36

    Chapter  Google Scholar 

  13. Moreno-Vega, J., Hogan, A.: GraFa: scalable faceted browsing for RDF graphs. In: Vrandečić, D., et al. (eds.) ISWC 2018. LNCS, vol. 11136, pp. 301–317. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-00671-6_18

    Chapter  Google Scholar 

  14. Nenov, Y., Piro, R., Motik, B., Horrocks, I., Wu, Z., Banerjee, J.: RDFox: a highly-scalable RDF store. In: Arenas, M., et al. (eds.) ISWC 2015. LNCS, vol. 9367, pp. 3–20. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-25010-6_1

    Chapter  Google Scholar 

  15. Oren, E., Delbru, R., Decker, S.: Extending faceted navigation for RDF data. In: Cruz, I., et al. (eds.) ISWC 2006. LNCS, vol. 4273, pp. 559–572. Springer, Heidelberg (2006). https://doi.org/10.1007/11926078_40

    Chapter  Google Scholar 

  16. Wenzel, M.: HDT bitmap triple indices - results of the experiments (Mar 2021). https://doi.org/10.5281/zenodo.4608413

Download references

Author information

Authors and Affiliations

Authors

Corresponding authors

Correspondence to Maximilian Wenzel , Thorsten Liebig or Birte Glimm .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Wenzel, M., Liebig, T., Glimm, B. (2021). HDT Bitmap Triple Indices for Efficient RDF Data Exploration. In: Verborgh, R., et al. The Semantic Web. ESWC 2021. Lecture Notes in Computer Science(), vol 12731. Springer, Cham. https://doi.org/10.1007/978-3-030-77385-4_7

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-77385-4_7

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-77384-7

  • Online ISBN: 978-3-030-77385-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics