Abstract
Creating links manually between large datasets becomes an extremely tedious task. Although the linked data production is growing massively, the interconnecting needs improvement. This paper presents our work regarding detecting and extending links between Wikidata and COURAGE entities with respect to cultural heritage data. The COURAGE project explored the methods for cultural opposition in the socialist era (cc. 1950–1990), highlighting the variety of alternative cultural scenes that flourished in Eastern Europe before 1989. We describe our methods and results in discovering common entities in the two datasets, and our solution for automating this task. Furthermore, it is shown how it was possible to enrich the data in Wikidata and to establish new, bi-directional connections between COURAGE and Wikidata. Hence, the audience of both databases will have a more complete view of the matched entities.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
COURAGE project. http://cultural-opposition.eu/
Apor, B., Apor, P., Horváth, S. (eds.): The Handbook of COURAGE, Budapest (2018). https://doi.org/10.24389/handbook
Micsik, A.: Courage registry - open dataset 1.1, July 2019. https://doi.org/10.5281/zenodo.3333540
Erxleben, F., Günther, M., Krötzsch, M., Mendez, J., Vrandečić, D.: Introducing Wikidata to the Linked Data Web. In: Mika, P., et al. (eds.) ISWC 2014. LNCS, vol. 8796, pp. 50–65. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-11964-9_4
Vrandečić, D., Krötzsch, M.: Wikidata: a free collaborative knowledgebase. Commun. ACM 57(10) (2014). https://doi.org/10.1145/2629489
Wikidata Statistics. https://www.wikidata.org/wiki/Wikidata:Statistics
WikiProject Cultural heritage. https://www.wikidata.org/wiki/Wikidata:WikiProject_Cultural_heritage
Why data partners should link their vocabulary to Wikidata: a new case study. Europeana pro page. https://pro.europeana.eu/post/why-data-partners-should-link-their-vocabulary-to-wikidata-a-new-case-study
Malyshev, S., Krötzsch, M., González, L., Gonsior, J., Bielefeldt, A.: Getting the most out of Wikidata: semantic technology usage in Wikipedia’s knowledge graph. In: Vrandečić, D., et al. (eds.) ISWC 2018. LNCS, vol. 11137, pp. 376–394. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-00668-6_23
Allison-Cassin, S., Scott, D.: Wikidata: a platform for your library’s linked open data. Code4Lib 40 (2018)
Nentwig, M., Hartung, M., Cyrille, A., Ngomo, N., Rahm E.: A survey of current link discovery frameworks. Semant. Web J. 2(224) (2017). https://doi.org/10.3233/sw-150210
Isele, R., Jentzsch, A., Bizer, C.: Efficient multidimensional blocking for link discovery without losing recall. In: 14th International Workshop on the Web and Databases, WebDB, Athens (2011)
Ngomo, A.C.N., Auer, S.: LIMES - a time-efficient approach for large-scale link discovery on the web of data. In: IJCAI, pp. 2312–2317 (2011). https://doi.org/10.5591/978-1-57735-516-8/ijcai11-385
Nikolov, A., Uren, V., Motta, E.: KnoFuss: a comprehensive architecture for knowledge fusion. In: Proceedings of the 4th International Conference on Knowledge Capture, pp. 185–186. ACM (2007)
Mix‘n’match Manual Wikimedia. https://meta.wikimedia.org/wiki/Mix%27n%27match/Manual
Hickey, T.B., Toves, J.A.: Managing ambiguity in VIAF. D-Lib Mag. 20(7/8). https://doi.org/10.1045/july2014-hickey
Larson, R.R., Janakiraman, K.: Connecting archival collections: the social networks and archival context project. In: Gradmann, S., Borri, F., Meghini, C., Schuldt, H. (eds.) TPDL 2011. LNCS, vol. 6966, pp. 3–14. Springer, Heidelberg (2011). https://doi.org/10.1007/978-3-642-24469-8_3
QuickStatements help. https://www.wikidata.org/wiki/Help:QuickStatements
Acknowledgement
The project has been supported by the European Union, co-financed by the European Social Fund (EFOP-3.6.3-VEKOP-16-2017-00002).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Faraj, G., Micsik, A. (2019). Enriching Wikidata with Cultural Heritage Data from the COURAGE Project. In: Garoufallou, E., Fallucchi, F., William De Luca, E. (eds) Metadata and Semantic Research. MTSR 2019. Communications in Computer and Information Science, vol 1057. Springer, Cham. https://doi.org/10.1007/978-3-030-36599-8_37
Download citation
DOI: https://doi.org/10.1007/978-3-030-36599-8_37
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-36598-1
Online ISBN: 978-3-030-36599-8
eBook Packages: Computer ScienceComputer Science (R0)