Abstract
When we leave general style as a poor bet and pay attention to words, we find that Hamilton and Madison use certain words at quite different rates. Douglass Adair brought this spectacularly to our attention by pointing out their uses of while and whilst. In our work, we have used individual words as the principal basis for measuring likelihood of authorship. Early investigations convinced us that most single variables, carefully selected or not, have little discriminating value, and that a large pool of variables provides the greatest hope of success. Sentence length is a good example of a stylistic variable which had even been used effectively elsewhere, yet failed miserably here. Since the rate for each word can be regarded as a variable, words supply a pool of thousands of variables. Furthermore, words are easily recognized and effective for discrimination.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Author information
Authors and Affiliations
Rights and permissions
Copyright information
© 1984 Springer-Verlag New York Inc.
About this chapter
Cite this chapter
Mosteller, F., Wallace, D.L. (1984). Words and Their Distributions. In: Applied Bayesian and Classical Inference. Springer Series in Statistics. Springer, New York, NY. https://doi.org/10.1007/978-1-4612-5256-6_2
Download citation
DOI: https://doi.org/10.1007/978-1-4612-5256-6_2
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4612-9759-8
Online ISBN: 978-1-4612-5256-6
eBook Packages: Springer Book Archive