Skip to main content

Enriching Media Collections for Event-Based Exploration

  • Conference paper
  • First Online:
Metadata and Semantic Research (MTSR 2017)

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 755))

Included in the following conference series:

Abstract

Scholars currently have access to large heterogeneous media collections on the Web, which they use as sources for their research. Exploration of such collections is an important part in their research, where scholars make sense of these heterogeneous datasets. Knowledge graphs which relate media objects, people and places with historical events can provide a valuable structure for more meaningful and serendipitous browsing. Based on extensive requirements analysis done with historians and media scholars, we present a methodology to publish, represent, enrich, and link heritage collections so that they can be explored by domain expert users. We present four methods to derive events from media object descriptions. We also present a case study where four datasets with mixed media types are made accessible to scholars and describe the building blocks for event-based proto-narratives in the knowledge graph.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    http://diveplus.beeldengeluid.nl.

  2. 2.

    https://www.clariah.nl/en/.

  3. 3.

    https://www.w3.org/2004/02/skos/.

  4. 4.

    http://dublincore.org/documents/dcmi-terms/.

  5. 5.

    https://www.w3.org/TR/annotation-model/.

  6. 6.

    http://lov.okfn.org/dataset/lov/vocabs/mads.

  7. 7.

    http://crowdtruth.org/.

  8. 8.

    http://cultuurlink.beeldengeluid.nl.

  9. 9.

    http://diveproject.beeldengeluid.nl.

  10. 10.

    http://www.beeldengeluid.nl.

  11. 11.

    http://openimages.eu.

  12. 12.

    http://gtaa.beeldengeluid.nl.

  13. 13.

    http://radiobulletins.delpher.nl/.

  14. 14.

    This conversion code is available at https://github.com/biktorrr/dive/.

  15. 15.

    https://www.amsterdammuseum.nl.

  16. 16.

    https://tropenmuseum.nl/en.

  17. 17.

    http://svcn.nl.

  18. 18.

    http://data.dive.beeldengeluid.nl/browse/list_triples?graph=http%3A//purl.org/collections/nl/am/am_additions.ttl shows the 12 triples added for Amsterdam Museum. These include mappings of object-image relations, object-entity relations as well as object classes.

  19. 19.

    http://xtas.net.

  20. 20.

    http://www.opener-project.eu/.

  21. 21.

    http://tinyurl.com/diveplusexample2 shows an example event in the DIVE+ UI.

  22. 22.

    The triple store can be accessed at http://data.dive.beeldengeluid.nl/.

  23. 23.

    https://github.com/biktorrr/diveplusdata/.

References

  1. van den Akker, C., van Nuland, A., van der Meij, L., van Erp, M., Legne, S., Aroyo, L., Schreiber, G.: From information delivery to interpretation support: evaluating cultural heritage access on the web. In: Proceedings of the 5th Annual ACM Web Science Conference, WebSci 2013, pp. 431–440. ACM, New York (2013)

    Google Scholar 

  2. Akker, C.v.d., Legêne, S., Erp, M.v., Aroyo, L., Segers, R., Meij, L.v.D., Ossenbruggen Van, J., Schreiber, G., Wielinga, B., Oomen, J., et al.: Digital hermeneutics: agora and the online understanding of cultural heritage. In: Proceedings of the 3rd International Web Science Conference, p. 10. ACM (2011)

    Google Scholar 

  3. Aroyo, L., Welty, C.: The three sides of CrowdTruth. J. Hum. Comput. 1, 31–34 (2014)

    Google Scholar 

  4. Baca, M.: Practical issues in applying metadata schemas and controlled vocabularies to cultural heritage information. Cat. Classif. Q. 36(3–4), 47–55 (2003)

    Google Scholar 

  5. Bizer, C., Heath, T., Berners-Lee, T.: Linked data-the story so far. In: Semantic Services, Interoperability and Web Applications: Emerging Concepts, pp. 205–227 (2009)

    Google Scholar 

  6. van den Bosch, A., Busser, B., Canisius, S., Daelemans, W.: An efficient memory-based morphosyntactic tagger and parser for dutch. LOT Occas. 7, 191–206 (2007)

    Google Scholar 

  7. Bron, M., van Gorp, J., de Rijke, M.: Media studies research in the data-driven age: How research questions evolve. J. Assoc. Inf. Sci. Technol. 67(7), 1535–1554 (2015)

    Article  Google Scholar 

  8. Coburn, E., Light, R., McKenna, G., Stein, R., Vitzthum, A.: LIDO-lightweight information describing objects version 1.0. ICOM International Committee of Museums (2010)

    Google Scholar 

  9. de Boer, V., Oomen, J., Inel, O., Aroyo, L., van Staveren, E., Helmich, W., de Beurs, D.: DIVE into the event-based browsing of linked historical media. Web Semant. Sci. Serv. Agents WWW 35, 152–158 (2015)

    Article  Google Scholar 

  10. de Boer, V., Priem, M., Hildebrand, M., Verplancke, N., de Vries, A., Oomen, J.: Exploring Audiovisual Archives Through Aligned Thesauri, pp. 211–222 (2016)

    Google Scholar 

  11. de Boer, V., Wielemaker, J., van Gent, J., Oosterbroek, M., Hildebrand, M., Isaac, A., van Ossenbruggen, J., Schreiber, G.: Amsterdam museum linked open data. Semant. Web 4(3), 237–243 (2013)

    Google Scholar 

  12. de Boer, V., Wielemaker, J., Gent, J., Hildebrand, M., Isaac, A., Ossenbruggen, J., Schreiber, G.: Supporting linked data production for cultural heritage institutes: the amsterdam museum case study. In: Simperl, E., Cimiano, P., Polleres, A., Corcho, O., Presutti, V. (eds.) ESWC 2012. LNCS, vol. 7295, pp. 733–747. Springer, Heidelberg (2012). https://doi.org/10.1007/978-3-642-30284-8_56

    Chapter  Google Scholar 

  13. Dijkshoorn, C., Leyssen, M.H., Nottamkandath, A., Oosterman, J., Traub, M.C., Aroyo, L., Bozzon, A., Fokkink, W., Houben, G.J., Hovelmann, H., et al.: Personalized nichesourcing: acquisition of qualitative annotations from niche communities. In: UMAP Workshops (2013)

    Google Scholar 

  14. Doerr, M.: The CIDOC conceptual reference module: an ontological approach to semantic interoperability of metadata. AI Mag. 24(3), 75 (2003)

    Google Scholar 

  15. Doerr, M., Gradmann, S., Hennicke, S., Isaac, A., Meghini, C., van de Sompel, H.: The europeana data model (edm). In: World Library and Information Congress: 76th IFLA General Conference and Assembly, pp. 10–15 (2010)

    Google Scholar 

  16. Gangemi, A.: A comparison of knowledge extraction tools for the semantic web. In: Cimiano, P., Corcho, O., Presutti, V., Hollink, L., Rudolph, S. (eds.) ESWC 2013. LNCS, vol. 7882, pp. 351–366. Springer, Heidelberg (2013). https://doi.org/10.1007/978-3-642-38288-8_24

    Chapter  Google Scholar 

  17. Grover, C., Givon, S., Tobin, R., Ball, J.: Named entity recognition for digitised historical texts. In: LREC (2008)

    Google Scholar 

  18. van Hage, W.R., Malais, V., Segers, R., Hollink, L., Schreiber, G.: Design and use of the simple event model (SEM). Web Semant. Sci. Serv. Agent World Wide Web 9(2), 128–136 (2011)

    Article  Google Scholar 

  19. Hagedoorn, B., Sauer, S.: Getting the Bigger Picture: Exploratory Search and Narrative Creation for Media Research into Disruptive Events. Utrecht (2017)

    Google Scholar 

  20. van Hooland, S., De Wilde, M., Verborgh, R., Steiner, T., van de Walle, R.: Exploring entity recognition and disambiguation for cultural heritage collections. Digit. Sch. Humanit. 30(2), 262–279 (2013)

    Article  Google Scholar 

  21. Inel, O., Aroyo, L.: Harnessing diversity in crowds and machines for better NER performance. In: Blomqvist, E., Maynard, D., Gangemi, A., Hoekstra, R., Hitzler, P., Hartig, O. (eds.) ESWC 2017. LNCS, vol. 10249, pp. 289–304. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-58068-5_18

    Chapter  Google Scholar 

  22. Kim, J.D., Ohta, T., Pyysalo, S., Kano, Y., Tsujii, J.: Overview of bionlp’09 shared task on event extraction. In: Proceedings of the Workshop on Current Trends in Biomedical Natural Language Processing: Shared Task, pp. 1–9. ACL (2009)

    Google Scholar 

  23. Lee, K., Artzi, Y., Choi, Y., Zettlemoyer, L.: Event detection and factuality assessment with non-expert supervision. In: EMNLP, pp. 1643–1648 (2015)

    Google Scholar 

  24. Melgar Estrada, L., Koolen, M., Huurdeman, H., Blom, J.: A process model of time-based media annotation in a scholarly context. In: ACM SIGIR Conference on Human Information Interaction & Retrieval (CHIIR), Oslo (2017)

    Google Scholar 

  25. Palmer, C.L., Teffeau, L.C., Pirmann, C.M.: Scholarly information practices in the online environment: themes from the literature and implications for library service development. Technical report, OCLC Research, Dublin, Ohio (2009)

    Google Scholar 

  26. Richards, J.D., Tudhope, D., Vlachidis, A.: Text mining in archaeology: extracting information from archaeological reports. In: Barcelo, J., Bogdanovic, I. (eds.) Mathematics and Archaeology, p. 240. CRC Press, Boca Raton (2015)

    Chapter  Google Scholar 

  27. Sauer, S., de Rijke, M.: Seeking serendipity: a living lab approach to understanding creative retrieval in broadcast media production. In: Proceedings of the 39th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2016, pp. 989–992. ACM, New York (2016)

    Google Scholar 

  28. Schreiber, G., Amin, A., Aroyo, L., van Assem, M., de Boer, V., Hardman, L., Hildebrand, M., Omelayenko, B., van Osenbruggen, J., Tordai, A., et al.: Semantic annotation and search of cultural-heritage collections: the multimedian e-culture demonstrator. Web Semant. Sci. Serv. Agents World Wide Web 6(4), 243–249 (2008)

    Article  Google Scholar 

  29. Shaw, R., Troncy, R., Hardman, L.: LODE: linking open descriptions of events. In: Gómez-Pérez, A., Yu, Y., Ding, Y. (eds.) ASWC 2009. LNCS, vol. 5926, pp. 153–167. Springer, Heidelberg (2009). https://doi.org/10.1007/978-3-642-10871-6_11

    Chapter  Google Scholar 

  30. van Veen, T., Lonij, J., Faber, W.J.: Linking named entities in dutch historical newspapers. In: Garoufallou, E., Subirats Coll, I., Stellato, A., Greenberg, J. (eds.) MTSR 2016. CCIS, vol. 672, pp. 205–210. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-49157-8_18

    Chapter  Google Scholar 

Download references

Acknowledgements

This work was partially supported by CLARIAH (http://clariah.nl/) and by the Netherlands eScience Center (http://esciencecenter.nl/) DIVE+ project. We furthermore thank Victor Kramer, Jaap Blom and Werner Helmich.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Victor de Boer .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer International Publishing AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

de Boer, V., Melgar, L., Inel, O., Ortiz, C.M., Aroyo, L., Oomen, J. (2017). Enriching Media Collections for Event-Based Exploration. In: Garoufallou, E., Virkus, S., Siatri, R., Koutsomiha, D. (eds) Metadata and Semantic Research. MTSR 2017. Communications in Computer and Information Science, vol 755. Springer, Cham. https://doi.org/10.1007/978-3-319-70863-8_18

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-70863-8_18

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-70862-1

  • Online ISBN: 978-3-319-70863-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics