Querying the Edit History of Wikidata

  • Thomas Pellissier Tanon
  • Fabian Suchanek
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 11762)

Abstract

In its 7 years of existence, Wikidata has accumulated an edit history of millions of contributions. In this paper, we propose a system that makes this data accessible through a SPARQL endpoint. We index not only the diff introduced by each revision, but also the global state of the Wikidata graph after any given revision. This allows users to answer complex SPARQL 1.1 queries on the Wikidata history, for example to trace the contributions of human vs. automated contributors, the areas of vandalism, the major schema changes, or the adoption of different values for the “gender” property over time.
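
To make the distinction between per-revision diffs and global states concrete, here is a minimal in-memory sketch in Python. It is not the system described in the paper: the class `ToyHistoryIndex`, its method names, and the validity-interval layout are assumptions chosen for illustration, and the three revisions in the usage part are invented. The sketch assumes revisions are applied in increasing order of their identifiers.

```python
from collections import defaultdict
from typing import Dict, Iterable, List, Optional, Set, Tuple

Triple = Tuple[str, str, str]


class ToyHistoryIndex:
    """Hypothetical toy index: stores, for each triple, the revision
    intervals during which it was part of the graph."""

    def __init__(self) -> None:
        # triple -> list of [start_rev, end_rev]; end_rev None means "still present"
        self._intervals: Dict[Triple, List[List[Optional[int]]]] = defaultdict(list)
        # revision -> triples added / removed by that revision (the diff)
        self._additions: Dict[int, Set[Triple]] = defaultdict(set)
        self._deletions: Dict[int, Set[Triple]] = defaultdict(set)

    def apply_revision(
        self, rev: int, added: Iterable[Triple], removed: Iterable[Triple]
    ) -> None:
        """Record one revision. Assumes rev is larger than all previous ones."""
        for triple in removed:
            spans = self._intervals.get(triple)
            if spans and spans[-1][1] is None:  # only close an open interval
                spans[-1][1] = rev
                self._deletions[rev].add(triple)
        for triple in added:
            spans = self._intervals[triple]
            if not spans or spans[-1][1] is not None:  # not currently present
                spans.append([rev, None])
                self._additions[rev].add(triple)

    def diff(self, rev: int) -> Tuple[Set[Triple], Set[Triple]]:
        """Return (added, removed) triples for a single revision."""
        return set(self._additions.get(rev, ())), set(self._deletions.get(rev, ()))

    def state_after(self, rev: int) -> Set[Triple]:
        """Return the whole graph as it stood right after revision rev."""
        return {
            triple
            for triple, spans in self._intervals.items()
            if any(s <= rev and (e is None or rev < e) for s, e in spans)
        }


# Invented example history (not real Wikidata revisions).
gender = ("Q42", "P21", "Q6581097")
instance = ("Q42", "P31", "Q5")

index = ToyHistoryIndex()
index.apply_revision(1, added=[gender], removed=[])
index.apply_revision(2, added=[instance], removed=[])
index.apply_revision(3, added=[], removed=[gender])

print(index.state_after(2))  # both triples are present
print(index.diff(3))         # nothing added, the gender triple removed
```

Storing validity intervals rather than full snapshots is one possible design: a single pass over the intervals answers "what did the graph look like after revision r", while the per-revision sets answer "what did revision r change". A real deployment at Wikidata scale would need persistent, compressed indexes and a SPARQL layer on top, which this sketch does not attempt.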

Notes

Acknowledgement

Partially supported by the grant ANR-16-CE23-0007-01 (“DICOS”).

Copyright information

© Springer Nature Switzerland AG 2019

Authors and Affiliations

  1. Télécom ParisTech, Paris, France