Skip to main content

Linked Data for Linguistic Diversity Research: Glottolog/Langdoc and ASJP Online

  • Chapter

Abstract

Most of the linguistic resources available to day are about the world’s major languages. This paper discusses two projects which have world-wide coverage as their aim. Glottolog/Langdoc is an attempt to attain near-complete bibliographical coverage for the world’s lesser-known languages (i.e. 95% of the world’s linguistic diversity), which then provides solid empirical ground for extensional definitions of languages and language classification. Automated Similarity Judgment Program (ASJP) online provides standardized lexical distance data for 5800 languages as Linked Data. These two projects are the first attempt at a Typological Linked Data Cloud, to which PHOIBLE by other resources can easily be added in the future.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD   54.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  • Bird S, Simons G (2001) The olac metadata set and controlled vocabularies. In: Proceedings of the ACL 2001 Workshop on Sharing Tools and Resources - Volume 15, Association for Computational Linguistics, Stroudsburg, PA, USA, STAR ’01, pp 7–18. DOI 10.3115/1118062.1118065. Online version http://www.language-archives.org.

    Google Scholar 

  • Brown CH, Holman EW, Wichmann S, Velupillai V (2008) Automated classification of the world’s languages: A description of the method and preliminary results. STUF 61(4):286–308

    Google Scholar 

  • Dryer MS (2005) Genealogical language list. In: Comrie B, Dryer MS, Gil D, Haspelmath M (eds) World Atlas of Language Structures, Oxford University Press, pp 584–644

    Google Scholar 

  • Fabre A (2005) Diccionario etnolingüístico y guía bibliográfica de los pueblos indigenas sudamericanos. Book in Progress at http://butler.cc.tut.fi/~fabre/BookInternetVersio/Alkusivu.html accessed May 2005.

  • Farrar S, Langendoen D (2003a) A linguistic ontology for the semantic web. Glot International 7(3):97–100

    Google Scholar 

  • Farrar S, Langendoen D (2003b) Markup and the GOLD ontology. In: EMELD Workshop on Digitizing and Annotating Text and Field Recordings, Michigan State University

    Google Scholar 

  • Good J, Hendryx-Parker C (2006) Modeling contested categorization in linguistic databases. In: Proceedings of the EMELD Workshop on Digital Language Documentation, East Lansing, Michigan

    Google Scholar 

  • Holman EW, Brown CH, Wichmann S, Müller A, Velupillai V, Hammarström H, Sauppe S, Jung H, Bakker D, Brown P, Belyaev O, Urban M, Mailhammer R, List JM, Egorov D (2011) Automated dating of the world’s language families based on lexical similarity. Current Anthropology 52:841–875

    Article  Google Scholar 

  • Lewis M (ed) (2009a) Ethnologue: Languages of the World, Sixteenth edition. SIL International, Dallas, online version available at http://www.ethnologue.com/. Accessed on 2011-11-27.

    Google Scholar 

  • Lewis MP (ed) (2009b) Ethnologue: Languages of the World, 16th edn. SIL, Dallas.

    Google Scholar 

  • Lewis WD (2006) Odin: A model for adapting and enriching legacy infrastructure. In: Proceedings of the e-Humanities Workshop, held in cooperation with e-Science 2006: 2nd IEEE International Conference on e-Science and Grid Computing, Amsterdam, URL http://faculty.washington.edu/wlewis2/papers/ODIN-eH06.pdf, online version available at http://www.csufresno.edu/odin/

  • Maho J (2001) African Languages Country by Country: A Reference Guide, Göteborg Africana Informal Series, vol 1, 5th edn. Department of Oriental and African Languages, Göteborg University

    Google Scholar 

  • de Melo G, Weikum G (2008) Language as a foundation of the Semantic Web. In: Bizer C, Joshi A (eds) Proceedings of the Poster and Demonstration Session at the 7th International Semantic Web Conference (ISWC 2008), CEUR, Karlsruhe, Germany, CEUR WS, vol 401

    Google Scholar 

  • Moran S (this vol.) Using Linked Data to create a typological knowledge base. pp 129–138

    Google Scholar 

  • Nordhoff S, Hammarström H (2011) Glottolog/Langdoc: Defining dialects, languages, and language families collections of resources. In: Proceedings of ISWC 2011, URL http://iswc2011.semanticweb.org/fileadmin/iswc/Papers/Workshops/LISC/nordhoff.pdf

  • Weibel S, Kunze J, Lagoze C, Wolf M (1998) RFC 2413 - Dublin Core metadata for resource discovery. http://www.isi.edu/in-notes/rfc2413.txt

  • Xie Y, Aristar-Dry H, Aristar A, Lockwood H, Thompson J, Parker D, Cool B (2009) Language and location: Map annotation project - a gis-based infrastructure for linguistics information management. In: Computer Science and Information Technology, 2009. IMCSIT ’09. International Multiconference on, pp 305–311, DOI 10.1109/IMCSIT.2009.5352710, URL http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5352710, online version at http://www.llmap.org

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Sebastian Nordhoff .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2012 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Nordhoff, S. (2012). Linked Data for Linguistic Diversity Research: Glottolog/Langdoc and ASJP Online. In: Chiarcos, C., Nordhoff, S., Hellmann, S. (eds) Linked Data in Linguistics. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-28249-2_18

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-28249-2_18

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-28248-5

  • Online ISBN: 978-3-642-28249-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics