Skip to main content

The LAPPS Interchange Format

  • Conference paper
  • First Online:
Worldwide Language Service Infrastructure (WLSI 2015)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 9442))

Included in the following conference series:

Abstract

We describe and motivate the LAPPS Interchange Format, a JSON-LD format that is used for data transfer between language services in the Language Application Grid. The LAPPS Interchange Format enables syntactic and semantic interoperability of language services by providing a uniform syntax for common linguistic data and by using the Linked Data aspect of JSON-LD to refer to external definitions of linguistic categories. It is tightly integrated with the Web Services Exchange Vocabulary, which specifies a terminology for a core of linguistic objects and features exchanged by services.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    The USE of LIF objects is actually not a strict requirement imposed for LAPPS Services. For example, services based on an existing pipeline of GATE components may choose to use the GATE format until the output of the very last component. We have created a suite of translation services that translate common linguistic objects between LIF and common existing formats.

  2. 2.

    See http://www.json.org/ and http://json-ld.org/.

  3. 3.

    See http://vocab.lappsgrid.org/Document.html#language.

  4. 4.

    See http://vocab.lappsgrid.org/Annotation.

  5. 5.

    A related issue is that versioning not only applies to the version of the wrapped NLP component, but also the version of the wrapper itself as well as to versions of definitions in the vocabulary. This issue is currently under discussion.

  6. 6.

    See http://vocab.lappsgrid.org/Sentence.html and http://vocab.lappsgrid.org/Token.html.

  7. 7.

    An alternative to this approach is to make PosTag a kind of annotation rather than a feature of an annotation, which can help in reducing potential redundancies that will become obvious later in this section. However, it was important to us to keep the number of annotation categories in the vocabulary as low as possible. We also felt that conceptually a part-of-speech tag is a natural feature of a token and not a standalone category.

References

  1. Cunningham, H., Maynard, D., Bontcheva, K., Tablan, V.: GATE: a framework and graphical development environment for robust NLP tools and applications. In: Proceedings of the 40th Anniversary Meeting of the Association for Computational Linguistics (2002)

    Google Scholar 

  2. Ferrucci, D., Lally, A.: UIMA: an architectural approach to unstructured information processing in the corporate research environment. Nat. Lang. Eng. 10(3–4), 327–348 (2004). Cambridge University Press

    Article  Google Scholar 

  3. Ide, N., Suderman, K.: The linguistic annotation framework: a standard for annotation interchange and merging. Lang. Resour. Eval. 48(3), 395–418 (2014)

    Article  Google Scholar 

  4. Ide, N., Pustejovsky, J., Cieri, C., Nyberg, E., Wang, D., Suderman, K., Verhagen, M., Wright, J.: The language application grid. In: The Ninth Language Resources and Evaluation Conference (LREC 2014), Reykjavik, Iceland

    Google Scholar 

  5. Ide, N., Pustejovsky, J., Suderman, K., Verhagen, M.: The language application grid web service exchange vocabulary. In: Workshop on Open Infrastructures and Analysis Frameworks for HLT (OIAF4HLT), Held in Conjunction with COLING, Dublin, Ireland (2014)

    Google Scholar 

  6. Ishida, T. (ed.): The Language Grid: Service-Oriented Collective Intelligence for Language Resource Interoperability. Springer, Heidelberg (2011). ISBN: 978-3-642-21177-5. Observation of strains. Infect Dis Ther. 3(1), 35–43 (2011)

    Google Scholar 

  7. Phillips, A., Davis, M.: Tags for Identifying Languages. IETF Best Current Practice, September 2009. http://tools.ietf.org/html/bcp47

Download references

Acknowledgements

This work was supported by two National Science Foundation grants: NSF-ACI 1147944 and NSF-ACI 1147912.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Marc Verhagen .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing Switzerland

About this paper

Cite this paper

Verhagen, M. et al. (2016). The LAPPS Interchange Format. In: Murakami, Y., Lin, D. (eds) Worldwide Language Service Infrastructure. WLSI 2015. Lecture Notes in Computer Science(), vol 9442. Springer, Cham. https://doi.org/10.1007/978-3-319-31468-6_3

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-31468-6_3

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-31467-9

  • Online ISBN: 978-3-319-31468-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics