Abstract
In this paper we describe a practical approach to the challenge of linguistic retrodigitization. We propose to distinguish strictly between a base digitization and a separate interpretation of the sources. The base digitization consists only of a literal electronic transcript of the source. All sources are thus treated simply as strings of characters, i.e. as unstructured corpora. The often complex structure found in many dictionaries and grammars is added subsequently (and possibly much later) as Linked Data in the form of standoff annotation. A further advantage of this approach is that both the digitization and the interpretation can be performed collaboratively without a complex organizational superstructure.
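The separation between base digitization and standoff annotation might be pictured roughly as follows. This is only a minimal sketch, not the chapter's implementation: the sample dictionary line, the `Annotation` class, its field names, and the labels are all invented for illustration, and plain Python objects stand in for what the abstract describes as Linked Data.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Annotation:
    """A hypothetical standoff annotation: it points into the base text
    by character offsets and never modifies the text itself."""
    start: int  # inclusive character offset into the base text
    end: int    # exclusive character offset into the base text
    label: str  # illustrative label; not taken from the chapter


def annotated_text(base: str, ann: Annotation) -> str:
    """Return the substring of the base text that an annotation covers."""
    return base[ann.start:ann.end]


# Base digitization: a literal transcript, treated as a plain string.
# The entry below is an invented example, not data from any cited source.
base_text = "casa n. house"

# Interpretation: added separately (possibly much later) as standoff records.
annotations = [
    Annotation(0, 4, "headword"),
    Annotation(5, 7, "pos"),
    Annotation(8, 13, "translation"),
]

# Resolve each annotation against the untouched base text.
spans = {a.label: annotated_text(base_text, a) for a in annotations}
```

Because the annotations only reference offsets, several contributors could add or revise interpretations independently while the transcript stays fixed, which seems to be the point of the collaborative workflow the abstract mentions.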
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Bouda, P., Cysouw, M. (2012). Treating Dictionaries as a Linked-Data Corpus. In: Chiarcos, C., Nordhoff, S., Hellmann, S. (eds) Linked Data in Linguistics. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-28249-2_2
DOI: https://doi.org/10.1007/978-3-642-28249-2_2
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-28248-5
Online ISBN: 978-3-642-28249-2
eBook Packages: Computer Science, Computer Science (R0)