Abstract
DBpedia and Wikidata are two online projects that offer structured data from Wikipedia in order to ease its exploitation on the Linked Data Web. This paper presents a comparison of these two widely used structured data sources. The comparison considers the most relevant data quality dimensions identified in the state of the art. As fundamental differences between the two projects, we highlight that Wikidata has an open, centralised nature, whereas DBpedia is more popular in the Semantic Web and Linked Open Data communities and depends on the different language editions of Wikipedia.
Notes
- 1.
- 2.
- 3.
- 4. These data were obtained on May 15th, 2017.
- 5.
- 6.
- 7.
- 8.
- 9. The dimensions “Cost-effectiveness” and “Flexibility” defined in [5] are considered closely related to “Performance” and “Versatility” defined in [6], respectively, so they are shown in green. Note also that, although “Interlinking” is shown in green, it is not discarded because it appears in the same category in [6, 7].
References
1. Auer, S., Bizer, C., Kobilarov, G., Lehmann, J., Cyganiak, R., Ives, Z.: DBpedia: a nucleus for a web of open data. In: Aberer, K., et al. (eds.) ASWC/ISWC 2007. LNCS, vol. 4825, pp. 722–735. Springer, Heidelberg (2007). https://doi.org/10.1007/978-3-540-76298-0_52
2. Vrandečić, D.: Wikidata: a new platform for collaborative data collection. In: 21st International Conference on World Wide Web, pp. 1063–1064. ACM, France (2012)
3. Lehmann, J., Isele, R., Jakob, M., Jentzsch, A., Kontokostas, D., Mendes, P., Hellmann, S., Morsey, M., van Kleef, P., Auer, S., Bizer, C.: DBpedia - a large-scale, multilingual knowledge base extracted from Wikipedia. Semantic Web J. 6(2), 167–195 (2015)
4. Vrandečić, D., Krötzsch, M.: Wikidata: a free collaborative knowledgebase. Commun. ACM 57(10), 78–85 (2014)
5. Wang, R.Y., Strong, D.M.: Beyond accuracy: what data quality means to data consumers. J. Manag. Inf. Syst. 12(4), 5–33 (1996)
6. Zaveri, A., Rula, A., Maurino, A., Pietrobon, R., Lehmann, J., Auer, S.: Quality assessment for linked data: a survey. Semantic Web 7(1), 63–93 (2015)
7. Wikidata discussion page about Data Quality. https://www.wikidata.org/wiki/Wikidata:Requests_for_comment/Data_quality_framework_for_Wikidata. Accessed May 2017
8. Kontokostas, D., Auer, S., Lehmann, J., Hellmann, S.: Wikidata through the Eyes of DBpedia. CoRR, vol. abs/1507.04180 (2015)
9. Carothers, G., Machina, L.: RDF 1.1 N-Quads. A line-based syntax for RDF datasets. W3C Recommendation (2014)
10. Provenance meta-data in DBpedia Datasets. http://wiki.dbpedia.org/services-resources/datasets/dbpedia-datasets#h434-17. Accessed May 2017
11. The Linking Open Data diagram. http://lod-cloud.net. Accessed June 2017
12. Antoniou, G., van Harmelen, F.: A Semantic Web Primer. The MIT Press, Cambridge (2008)
13. Rodríguez, I.: DBpedia Mappings Front-End Administration. Google Summer of Code Project (2017)
14. Rodriguez-Hernandez, I., Trillo-Lado, R., Yus, R.: WikInfoboxer: a tool to create Wikipedia infoboxes using DBpedia. XXI Jornadas de Ingeniería del Software y Bases de Datos at Congreso Español De Informática (2016)
15. Fernández, J.D., Martínez-Prieto, M.A., Gutierrez, C., Polleres, A.: Binary RDF Representation for Publication and Exchange (HDT). W3C Member Submission (2011)
Acknowledgments
This work has been partially funded by: Action COST Keystone IC-1302, TIN2016-78011-C4-3-R (AEI/FEDER, UE), TIN2013-46238-C4-4-R. We thank Á. Poc, D. Martínez, X. Pan and F. del Molino for their support on this work.
Copyright information
© 2018 Springer International Publishing AG
About this paper
Cite this paper
Abián, D., Guerra, F., Martínez-Romanos, J., Trillo-Lado, R. (2018). Wikidata and DBpedia: A Comparative Study. In: Szymański, J., Velegrakis, Y. (eds) Semantic Keyword-Based Search on Structured Data Sources. IKC 2017. Lecture Notes in Computer Science, vol 10546. Springer, Cham. https://doi.org/10.1007/978-3-319-74497-1_14
DOI: https://doi.org/10.1007/978-3-319-74497-1_14
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-74496-4
Online ISBN: 978-3-319-74497-1
eBook Packages: Computer Science (R0)