Abstract
This paper focusses on the types of questions that are raised in the encoding of historical documents. Using the example of a 17th century Scottish Sasine, the authors show how TEI-based encoding can produce a text which will be of major value to a variety of future historical researchers. Firstly, they show how to produce a machine-readable transcription which would be comprehensible to a word-processor as a text stream filled with print and formatting instructions; to a text analysis package as compilation of named text segments of some known structure; and to a statistical package as a set of observations each of which comprises a number of defined and named variables. Secondly, they make provision for a machine-readable transcription where the encoder’s research agenda and assumptions are reversible or alterable by secondary analysts who will have access to a maximum amount of information contained in the original source.
Daniel I. Greenstein is a Senior Lecturer in Modern History at Glasgow University where he teaches and publishes in American urban and political history and culture, modern British education, and computer methods. Computing works include A Historian’s Guide to Computing (Oxford, 1994).
Lou Burnard is Director of the Oxford Text Archive at Oxford University Computing Services, with interests in electronic text and database technology. He is European Editor of the Text Encoding Initiative’s Guidelines.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Notes
See I. H. Kropac, “Medieval Documents”, in D. I. Greenstein, ed., Modelling Historical Data: Towards a Standard for Encoding and Exchanging Machine-Readable Texts (St Katharinen, 1991), 111–16.
Historians have made too little use of linguistic content analyses and stylistic measures which are so fruitfully exploited in other humanities disciplines. Fortunately there is at last some movement in this direction after a slow start. For an early discussion of how content analysis might be applied in history see T. F. Carney, “Content Analysis: A Review Essay”, Historical Methods Newsletter, 4 (1971), 52–61. For substantive work see R. L. Merritt, “The Emergence of American Nationalism: A Quantitative Approach”, American Quarterly, (1965), 319–35. More recent work includes P. Dawdler, “Les Déclarations des Droits de l’Homme: Une Approche Quantitative”, Centre National de la Recherche Scientifique, IVe Congrès, History and Computing, Talence, 14–16 Septembre 1989: Volume Des Actes (Bordeaux, 1990), 65–73
P. Tavernier, “L’héritage de 1789 et de 1848 dans la Déclaration universelle de 1948”, Les droits de l’homme et la conquête des libertés (Grenoble, 1988)
M. Olsen and L-G. Harvey, “Computers in Intellectual History: Lexical Statistics and the Analysis of Political Discourse”, Journal of Interdisciplinary History; 18 (1988), 456–8
D. I. Greenstein, A Historian’s Guide to Computing (Oxford, 1994), chapter 5.
Database and statistical processing is much more developed amongst historians than linguistic content analysis. For a brief review see L. D. Burnard, “Relational Theory, SQL and Historical Practice”, in P. R. Denley et al. eds., History and Computing II (Manchester, 1989), 63–71
L. D. Burnard, “The Principles of Database Design”, in Sebastian Rahtz, ed., Information Technology in the Humanities (Chichester, 1987), 54–68
D. I. Greenstein, “Multi-Sourced and Integrated Databases for the Prosopographer”, in E. Mawdsley et al. eds., History and Computing III: Historians, Computers and Data. Applications in Research and Teaching (Manchester, 1990), 60–6
D. I. Greenstein “A Source-Oriented Approach to History and Computing: The Relational Database”, Historical Social Research, 14:3 (1989), 9–16
P. Hartland and C. Harvey, “Information Engineering and Historical Databases”, in P. R. Denley et al. eds., History and Computing II, 44–62
C. Harvey and J. Press, “Relational Data Analysis: Value, Concepts and Methods”, History and Computing, 4 (1992), 98–109
S. Pasleau, Les Bases de Données en Sciences Humaines (Liège, 1988). For the use of relational databases in medieval history see C. Bourlet and J-L. Minel, “An Expert System Decision Support System for a Prosopographical Database”, in L. J. McCrank, ed., Databases in the Humanities and Social Sciences (Auburn, 1987)
D. I. Greenstein, A Historian’s Guide to Computing (Oxford, 1994), chapter 3.
Amongst the most obviously contentious interpretions made when encoding machine-readable data are those concerned with the classification of occupational data. See M. B. Katz, “Occupational Classification in History”, Journal of Interdisciplinary History, 3 (1972)
D. J. Treiman, “A Standard Occupational Prestige Scale for Use with Historical Data”, Journal of Interdisciplinary History, 7 (1976), 283–304
D. I. Greenstein, “Standard, Meta-Standard: A Framework for Coding Occupational Data”, Historical Social Research, 16 (1991), 3–22.
Coding notoriously fuzzy data including dates, place names, and currency measures is also problematic. See M. Thaller, “The Need for Standards: Data Modelling and Exchange”, 5, and “A Draft Proposal for the Coding of Machine Readable Sources”, 39, both in D. I. Greenstein, ed., Modelling Historical Data, pp. 1–64.
See J. M. Clubb, “Computer Technology and the Source Materials of Social History”, Social Science History, 10 (1986), 97–114
D. I. Greenstein, “Historians as Producers or Consumers of Standard-Conformant, Full-Text Datasets? Some Sources of Modern History as a Test Case”, in D. I. Greenstein ed., Modelling Historical Data, 179–94
R. W. Zweig, “Virtual Records and Real History”, History andComputing, 4 (1992), 174–82.
See I. H. Kropac, “Medieval Documents”.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 1995 Springer Science+Business Media Dordrecht
About this chapter
Cite this chapter
Greenstein, D., Burnard, L. (1995). Speaking with One Voice: Encoding Standards and the Prospects for an Integrated Approach to Computing in History. In: Ide, N., Véronis, J. (eds) Text Encoding Initiative. Springer, Dordrecht. https://doi.org/10.1007/978-94-011-0325-1_11
Download citation
DOI: https://doi.org/10.1007/978-94-011-0325-1_11
Publisher Name: Springer, Dordrecht
Print ISBN: 978-0-7923-3704-1
Online ISBN: 978-94-011-0325-1
eBook Packages: Springer Book Archive