Linked Data in Linguistics

Representing and Connecting Language Data and Language Metadata

  • Christian Chiarcos
  • Sebastian Nordhoff
  • Sebastian Hellmann

Table of contents

  1. Front Matter
    Pages I-XIV
  2. Christian Chiarcos, Sebastian Hellmann, Sebastian Nordhoff
    Pages 1-12
  3. Lexical Resources

    1. Front Matter
      Pages 13-13
    2. Peter Bouda, Michael Cysouw
      Pages 15-23
    3. John McCrae, Elena Montiel-Ponsoda, Philipp Cimiano
      Pages 25-34
    4. Axel Herold, Lothar Lemnitzer, Alexander Geyken
      Pages 35-44
    5. David Lewis, Alexander O’Connor, Sebastien Molines, Leroy Finn, Dominic Jones, Stephen Curran et al.
      Pages 45-54
  4. Corpus Building and Annotation

  5. Terminology Repositories and Knowledge Bases

    1. Front Matter
      Pages 97-97
    2. Menzo Windhouwer, Sue Ellen Wright
      Pages 99-107
    3. Thierry Declerck, Piroska Lendvai, Karlheinz Mörth, Gerhard Budin, Tamás Váradi
      Pages 109-116
  6. Towards a Linguistic Linked Open Data Cloud: Recent Activities of the Open Linguistics Working Group

    1. Front Matter
      Pages 151-151
    2. Christian Chiarcos, Sebastian Hellmann, Sebastian Nordhoff
      Pages 153-160
    3. Christian Chiarcos
      Pages 161-179
    4. Sebastian Hellmann, Claus Stadler, Jens Lehmann
      Pages 181-190
    5. Christian Chiarcos, Sebastian Hellmann, Sebastian Nordhoff
      Pages 201-216

About this book


The explosion of information technology has led to substantial growth of web-accessible linguistic data in terms of quantity, diversity and complexity. These resources become even more useful when interlinked with each other to generate network effects.

The general trend of providing data online is thus accompanied by newly developing methodologies to interconnect linguistic data and metadata. This includes linguistic data collections, general-purpose knowledge bases (e.g., DBpedia, a machine-readable edition of Wikipedia), and repositories with specific information about languages, linguistic categories and phenomena. The Linked Data paradigm provides a framework for interoperability and access management, and thereby makes it possible to integrate information from such a diverse set of resources.

The contributions assembled in this volume illustrate the range of applications of the Linked Data paradigm for representative types of language resources. They cover lexical-semantic resources, annotated corpora, typological databases, and terminology and metadata repositories. The book includes representative applications from diverse fields, ranging from academic linguistics (e.g., typology and corpus linguistics) through applied linguistics (e.g., lexicography and translation studies) to technical applications (in computational linguistics, Natural Language Processing and information technology).

This volume accompanies the Workshop on Linked Data in Linguistics 2012 (LDL-2012) in Frankfurt/M., Germany, organized by the Open Linguistics Working Group (OWLG) of the Open Knowledge Foundation (OKFN). It assembles contributions from the workshop participants and also summarizes the initial steps toward forming a Linked Open Data cloud of linguistic resources, the Linguistic Linked Open Data cloud (LLOD).


  • Linguistic Linked Open Data
  • interoperable representation formalisms
  • knowledge integration (linking) from various sources
  • linguistic databases, linguistic corpora, lexical resources
  • modeling linguistic resources

Editors and affiliations

  • Christian Chiarcos (1)
  • Sebastian Nordhoff (2)
  • Sebastian Hellmann (3)

  1. Information Science Institute, University of Southern California, Marina del Rey, USA
  2. Department of Linguistics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany
  3. Business Information Systems, University of Leipzig, Leipzig, Germany
