Abstract
The ever-growing number of textual historical collections calls for methods that can meaningfully connect and explore them. Different collections offer different perspectives, expressing the views held at the time of writing or even the subjective view of an author. We propose to connect heterogeneous digital collections through the temporal references found in documents as well as through their textual content. We evaluate our approach and find that it works very well on digital-native collections. Digitized collections pose interesting challenges, and with improved preprocessing our approach also performs well on them. We introduce a novel search interface for exploring and analyzing the connected collections; it highlights different perspectives and requires little domain knowledge. In our approach, perspectives are expressed as complex queries. Our approach supports humanities scholars in exploring collections in a novel way and makes digital collections more accessible by adding new connections and new means of access.
Notes
- 1. In English: The Kingdom of the Netherlands during WWII.
- 2.
- 3.
- 4.
- 5. The exported RDF triples are ingested in the “Verrijkt Koninkrijk” triple store. The updated triple store can be found at http://semanticweb.cs.vu.nl/verrijktkoninkrijk/.
- 6. The fully functional application can be accessed at http://qhp.science.uva.nl.
- 7. In Dutch: overval, dieven, handhaving, toegang, verschaft, verwijderd, zaak.
- 8. In Dutch: aanleiding, gebeurtenissen, Februaristaking, gearresteerd, eigenaar.
- 9. The source code is available on https://bitbucket.org/qhp.
Acknowledgements
This research was supported by Amsterdam Data Science and the Dutch national program COMMIT.
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Odijk, D. et al. (2015). Supporting Exploration of Historical Perspectives Across Collections. In: Kapidakis, S., Mazurek, C., Werla, M. (eds) Research and Advanced Technology for Digital Libraries. TPDL 2015. Lecture Notes in Computer Science, vol 9316. Springer, Cham. https://doi.org/10.1007/978-3-319-24592-8_18
DOI: https://doi.org/10.1007/978-3-319-24592-8_18
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-24591-1
Online ISBN: 978-3-319-24592-8
eBook Packages: Computer Science (R0)