Practical Considerations in the Use of TEI Headers in a Large Corpus

Abstract

Many aspects of the guidelines of the Text Encoding Initiative (TEI) are applicable to corpora and text collections, and to the texts that these contain. As the first large corpus developed using mark-up conforming to the guidelines, the British National Corpus (BNC) is a test-bed for many TEI-developed mechanisms. This is particularly true in the case of the TEI header, which has three intended applications — to describe a corpus, to describe an individual text, and as a free-standing bibliographic record — all of them used by the BNC. This paper describes the application of the TEI header to the BNC. It is intended that this information should, through a description of experience on a practical project, serve as a guide for those wishing to use TEI headers in the documentation and management of other corpora and collections of texts.

Dominic Dunlop is project manager for the British National Corpus at Oxford University Computing Services. Before taking up this post, he held a variety of roles in the development and support of the UNIX operating system, and was active in the POSIX initiative for the standardization of UNIX.

Copyright information

© 1995 Springer Science+Business Media Dordrecht

About this chapter

Cite this chapter

Dunlop, D. (1995). Practical Considerations in the Use of TEI Headers in a Large Corpus. In: Ide, N., Véronis, J. (eds) Text Encoding Initiative. Springer, Dordrecht. https://doi.org/10.1007/978-94-011-0325-1_7

  • DOI: https://doi.org/10.1007/978-94-011-0325-1_7

  • Publisher Name: Springer, Dordrecht

  • Print ISBN: 978-0-7923-3704-1

  • Online ISBN: 978-94-011-0325-1

  • eBook Packages: Springer Book Archive
