Speaking with One Voice: Encoding Standards and the Prospects for an Integrated Approach to Computing in History

Greenstein, Daniel; Burnard, Lou

doi:10.1007/978-94-011-0325-1_11

Daniel Greenstein³ &
Lou Burnard⁴

75 Accesses
1 Citations

Abstract

This paper focusses on the types of questions that are raised in the encoding of historical documents. Using the example of a 17th century Scottish Sasine, the authors show how TEI-based encoding can produce a text which will be of major value to a variety of future historical researchers. Firstly, they show how to produce a machine-readable transcription which would be comprehensible to a word-processor as a text stream filled with print and formatting instructions; to a text analysis package as compilation of named text segments of some known structure; and to a statistical package as a set of observations each of which comprises a number of defined and named variables. Secondly, they make provision for a machine-readable transcription where the encoder’s research agenda and assumptions are reversible or alterable by secondary analysts who will have access to a maximum amount of information contained in the original source.

Daniel I. Greenstein is a Senior Lecturer in Modern History at Glasgow University where he teaches and publishes in American urban and political history and culture, modern British education, and computer methods. Computing works include A Historian’s Guide to Computing (Oxford, 1994).

Lou Burnard is Director of the Oxford Text Archive at Oxford University Computing Services, with interests in electronic text and database technology. He is European Editor of the Text Encoding Initiative’s Guidelines.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

eBook: USD 16.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Hardcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Notes

See I. H. Kropac, “Medieval Documents”, in D. I. Greenstein, ed., Modelling Historical Data: Towards a Standard for Encoding and Exchanging Machine-Readable Texts (St Katharinen, 1991), 111–16.
Google Scholar
Historians have made too little use of linguistic content analyses and stylistic measures which are so fruitfully exploited in other humanities disciplines. Fortunately there is at last some movement in this direction after a slow start. For an early discussion of how content analysis might be applied in history see T. F. Carney, “Content Analysis: A Review Essay”, Historical Methods Newsletter, 4 (1971), 52–61. For substantive work see R. L. Merritt, “The Emergence of American Nationalism: A Quantitative Approach”, American Quarterly, (1965), 319–35. More recent work includes P. Dawdler, “Les Déclarations des Droits de l’Homme: Une Approche Quantitative”, Centre National de la Recherche Scientifique, IVe Congrès, History and Computing, Talence, 14–16 Septembre 1989: Volume Des Actes (Bordeaux, 1990), 65–73
Google Scholar
P. Tavernier, “L’héritage de 1789 et de 1848 dans la Déclaration universelle de 1948”, Les droits de l’homme et la conquête des libertés (Grenoble, 1988)
Google Scholar
M. Olsen and L-G. Harvey, “Computers in Intellectual History: Lexical Statistics and the Analysis of Political Discourse”, Journal of Interdisciplinary History; 18 (1988), 456–8
Article Google Scholar
D. I. Greenstein, A Historian’s Guide to Computing (Oxford, 1994), chapter 5.
Google Scholar
Database and statistical processing is much more developed amongst historians than linguistic content analysis. For a brief review see L. D. Burnard, “Relational Theory, SQL and Historical Practice”, in P. R. Denley et al. eds., History and Computing II (Manchester, 1989), 63–71
Google Scholar
L. D. Burnard, “The Principles of Database Design”, in Sebastian Rahtz, ed., Information Technology in the Humanities (Chichester, 1987), 54–68
Google Scholar
D. I. Greenstein, “Multi-Sourced and Integrated Databases for the Prosopographer”, in E. Mawdsley et al. eds., History and Computing III: Historians, Computers and Data. Applications in Research and Teaching (Manchester, 1990), 60–6
Google Scholar
D. I. Greenstein “A Source-Oriented Approach to History and Computing: The Relational Database”, Historical Social Research, 14:3 (1989), 9–16
Google Scholar
P. Hartland and C. Harvey, “Information Engineering and Historical Databases”, in P. R. Denley et al. eds., History and Computing II, 44–62
Google Scholar
C. Harvey and J. Press, “Relational Data Analysis: Value, Concepts and Methods”, History and Computing, 4 (1992), 98–109
Google Scholar
S. Pasleau, Les Bases de Données en Sciences Humaines (Liège, 1988). For the use of relational databases in medieval history see C. Bourlet and J-L. Minel, “An Expert System Decision Support System for a Prosopographical Database”, in L. J. McCrank, ed., Databases in the Humanities and Social Sciences (Auburn, 1987)
Google Scholar
D. I. Greenstein, A Historian’s Guide to Computing (Oxford, 1994), chapter 3.
Google Scholar
Amongst the most obviously contentious interpretions made when encoding machine-readable data are those concerned with the classification of occupational data. See M. B. Katz, “Occupational Classification in History”, Journal of Interdisciplinary History, 3 (1972)
Google Scholar
D. J. Treiman, “A Standard Occupational Prestige Scale for Use with Historical Data”, Journal of Interdisciplinary History, 7 (1976), 283–304
Article Google Scholar
D. I. Greenstein, “Standard, Meta-Standard: A Framework for Coding Occupational Data”, Historical Social Research, 16 (1991), 3–22.
Google Scholar
Coding notoriously fuzzy data including dates, place names, and currency measures is also problematic. See M. Thaller, “The Need for Standards: Data Modelling and Exchange”, 5, and “A Draft Proposal for the Coding of Machine Readable Sources”, 39, both in D. I. Greenstein, ed., Modelling Historical Data, pp. 1–64.
Google Scholar
See J. M. Clubb, “Computer Technology and the Source Materials of Social History”, Social Science History, 10 (1986), 97–114
Article Google Scholar
D. I. Greenstein, “Historians as Producers or Consumers of Standard-Conformant, Full-Text Datasets? Some Sources of Modern History as a Test Case”, in D. I. Greenstein ed., Modelling Historical Data, 179–94
Google Scholar
R. W. Zweig, “Virtual Records and Real History”, History andComputing, 4 (1992), 174–82.
Google Scholar
See I. H. Kropac, “Medieval Documents”.
Google Scholar

Download references

Author information

Authors and Affiliations

Department of History, Glasgow University, Scotland, UK
Daniel Greenstein
Oxford University Computing Service, 13 Banbury Road, Oxford, OX2 6NN, England, UK
Lou Burnard

Authors

Daniel Greenstein
View author publications
You can also search for this author in PubMed Google Scholar
Lou Burnard
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Computer Science, Vassar College, Poughkeepsie, NY, USA
Nancy Ide
Laboratoire Parole et Langage, CNRS & Université de Provence, Aix-en-Provence, France
Jean Véronis

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Greenstein, D., Burnard, L. (1995). Speaking with One Voice: Encoding Standards and the Prospects for an Integrated Approach to Computing in History. In: Ide, N., Véronis, J. (eds) Text Encoding Initiative. Springer, Dordrecht. https://doi.org/10.1007/978-94-011-0325-1_11

Download citation

DOI: https://doi.org/10.1007/978-94-011-0325-1_11
Publisher Name: Springer, Dordrecht
Print ISBN: 978-0-7923-3704-1
Online ISBN: 978-94-011-0325-1
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics