Building Knowledge Graphs from Survey Data: A Use Case in the Social Sciences (Extended Version)
Many research endeavors in the social sciences rely on high-quality empirical data. Survey data is often used as a foundation to investigate social behavior. The GESIS Panel is a probability-based mixed-mode panel survey in Germany providing high-quality survey and statistical data about e.g. political opinions, well-being, and other contemporary societal topics. In general, the integration and analysis of relevant data is a time-consuming process for researchers. This is due to the fact that search, discovery, and retrieval of the survey data requires accessing various data sources providing different information in different file formats. In this paper, we present our architecture for building a Knowledge Graph of the GESIS Panel data. We present the relevant heterogeneous data sources and demonstrate how we semantically lift and interlink the data in a shared RDF model. At the core of our architecture is a Knowledge Graph representing all aspects of the surveys. It is generated in a modular fashion and, therefore, our solution can be transferred to the existing infrastructure of other survey data publishers.
KeywordsKnowledge Graph Survey data RDF DDI
This work was carried out with the support of the German Research Foundation (DFG) within the project “SoRa - Sozial-Raumwissenschaftliche Forschungsdateninfrastruktur” (see footnote 17).
- 1.Bosch, T., Cyganiak, R., Gregory, A., Wackerow, J.: DDI-RDF discovery vocabulary: a metadata vocabulary for documenting research and survey data. In: LDOW (2013)Google Scholar
- 2.Bosch, T., Wackerow, J., Cyganiak, R., Zapilko, B.: Leveraging the DDI model for linked statistical data in the social, behavioural, and economic sciences, p. 10 (2012)Google Scholar
- 4.Chaves-Fraga, D., Priyatna, F., Santana-Pérez, I., Corcho, Ó.: Virtual statistics knowledge graph generation from CSV files. In: Emerging Topics in Semantic Technologies - ISWC 2018 Satellite Events (best papers from 13 of the Workshops Co-located with the ISWC 2018 Conference), pp. 235–244 (2018). https://doi.org/10.3233/978-1-61499-894-5-235
- 5.Gherghina, S., Geissel, B.: Citizens’ conceptions of democracy and political participation in Germany. In: Workshops of European Consortium for Political Research, p. 25 (2015)Google Scholar
- 6.Gottron, T., Hachenberg, C., Harth, A., Zapilko, B.: Towards a semantic data library for the social sciences, p. 13 (2011)Google Scholar
- 7.Mayer, S.J., Schultze, M.: The effects of political involvement and cross-pressures on multiple party identifications in multi-party systems - evidence from Germany. J. Elections Public Opin. Parties 29, 1–17 (2018)Google Scholar
- 11.Zapilko, B., Schaible, J., Mayr, P., Mathiak, B.: TheSoz: a SKOS representation of the thesaurus for the social sciences, p. 7 (2012)Google Scholar