Skip to main content

Comparing DBpedia, Wikidata, and YAGO for Web Information Retrieval

  • Conference paper
  • First Online:

Part of the book series: Lecture Notes in Networks and Systems ((LNNS,volume 67))

Abstract

Knowledge graphs serve as the primary sources of structured data in many Semantic Web applications. In this paper, the three most popular cross-domain knowledge graphs (KGs), namely, DBpedia, YAGO, and Wikidata were empirically explored and compared. These knowledge graphs were compared from the perspectives of completeness of the relations, timeliness of the data and accessibility of the KG. Three fundamental categories of named entities were queried within the KGs for detailed analysis of the data returned. From the experimental results and findings, Wikidata scores the highest in term of the timeliness of the data provided owing to the effort of global community update, with DBpedia LIVE being the next. Regarding accessibility, it was observed that DBpedia and Wikidata gave continuous access using public SPARQL endpoint, while YAGO endpoints were intermittently inaccessible. With respect to completeness of predicates, none of the KGs have a remarkable lead for any of the selected categories. From the analysis, it is observed that none of the KG can be considered complete on its own with regard to the relations of an entity.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   129.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Notes

  1. 1.

    http://wiki.dbpedia.org/.

  2. 2.

    http://wiki.dbpedia.org/online-access/DBpediaLive.

  3. 3.

    https://www.mpi-inf.mpg.de/departments/databases-and-information-systems.

  4. 4.

    https://www.wikidata.org/wiki/Wikidata:Main_Page.

  5. 5.

    https://gate.d5.mpi-inf.mpg.de/webyago3spotlxComp/SvgBrowser.

  6. 6.

    https://gate.d5.mpi-inf.mpg.de/webyagospotlxComp/WebInterface.

  7. 7.

    https://en.wikipedia.org/wiki/China_(disambiguation).

References

  1. Tim B, Lee B, Hendler J, Lassila O (2001) The semantic web will enable machines to comprehend semantic documents, no. May, pp 1–5

    Google Scholar 

  2. Bizer C et al (2008) Linked data on the web. WWW2008 Work. Linked Data Web, pp 1265–1266

    Google Scholar 

  3. Zaveri A, Kontokostas D, Leipzig U, Hellmann S (2017) Linked data quality of DBpedia, freebase Semant. Web, 0(0):1–53

    Google Scholar 

  4. Pipino LL, Lee YW, Wang RY, Lowell Yang Lee MW, Yang RY (2002) Data Quality Assessment. Commun. ACM 45(4):211

    Article  Google Scholar 

  5. Li TRBY, Wang H, Zhao L (2016) The semantic web. Latest advances and new domains 9678:52–68

    Google Scholar 

  6. Paulheim H (2015) Knowledge graph refinement: a survey of approaches and evaluation methods. Semant. Web, 0:1–0

    Google Scholar 

  7. Ringler D, HPB (2017) KI 2017: advances in artificial intelligence 10505:366–372

    Chapter  Google Scholar 

  8. Vrandečić D, Krötzsch M (2014) Wikidata: a free collaborative knowledgebase. Commun ACM 57(10):78–85

    Article  Google Scholar 

  9. Hoffart J, Suchanek FM, Berberich K, Weikum G (2013) YAGO2: a spatially and temporally enhanced knowledge base from Wikipedia. IJCAI Int Jt Conf Artif Intell 3161–3165

    Google Scholar 

  10. Demner-Fushman D et al (2013) YAGO3 : a knowledge base from multilingual wikipedia. J Biomed Inform 46(SUPPL):129–132

    Google Scholar 

  11. Auer S, Bizer C, Kobilarov G, Lehman J, Cyganiak R, Ives Z (2007) DBedpia: a nucleus for a web od open data, Emantic Web Lect Notes Comput Sci 4825:722

    Article  Google Scholar 

  12. Verborgh R et al (2014) Querying datasets on the web with high availability. Lect. Notes Comput. Sci. (including Subser. Lect. Notes Artif. Intell. Lect. Notes Bioinformatics) 8796(Iswc 2014):180–196

    Google Scholar 

  13. Voigt M, Mitschick A, Schulz J (2012) Yet another triple store benchmark? Practical experiences with real-world data. CEUR Workshop Proc, 912(Sda):85–94

    Google Scholar 

Download references

Acknowledgements

This work is partially funded by Fundamental Research Grant Scheme (FRGS) by Malaysia Ministry of Higher Education (Ref: FRGS/1/2017/ICT02/MMU/02/6).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Lay-Ki Soon .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Pillai, S.G., Soon, LK., Haw, SC. (2019). Comparing DBpedia, Wikidata, and YAGO for Web Information Retrieval. In: Piuri, V., Balas, V., Borah, S., Syed Ahmad, S. (eds) Intelligent and Interactive Computing. Lecture Notes in Networks and Systems, vol 67. Springer, Singapore. https://doi.org/10.1007/978-981-13-6031-2_40

Download citation

Publish with us

Policies and ethics