Words and Their Distributions

Mosteller, Frederick; Wallace, David L.

doi:10.1007/978-1-4612-5256-6_2

Part of the book series: Springer Series in Statistics ((SSS))

Abstract

When we leave general style as a poor bet and pay attention to words, we find that Hamilton and Madison use certain words at quite different rates. Douglass Adair brought this spectacularly to our attention by pointing out their uses of "while" and "whilst". In our work, we have used individual words as the principal basis for measuring likelihood of authorship. Early investigations convinced us that most single variables, carefully selected or not, have little discriminating value, and that a large pool of variables provides the greatest hope of success. Sentence length is a good example of a stylistic variable which had even been used effectively elsewhere, yet failed miserably here. Since the rate for each word can be regarded as a variable, words supply a pool of thousands of variables. Furthermore, words are easily recognized and effective for discrimination.
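
As a rough illustration of treating each word's rate as a variable, the sketch below computes occurrences per thousand running words for a pair of marker words. It is a minimal sketch and not the authors' procedure: the function name `word_rates`, the simple lowercase tokenizer, the per-thousand scaling, and the two toy sentences are all assumptions made for illustration, and the sentences are invented rather than drawn from any disputed paper.

```python
import re
from collections import Counter


def word_rates(text, words, per=1000):
    """Return occurrences of each word in `words` per `per` running words of `text`."""
    # Assumed tokenizer: lowercase the text and keep runs of ASCII letters.
    tokens = re.findall(r"[a-z]+", text.lower())
    counts = Counter(tokens)
    total = len(tokens)
    if total == 0:
        # An empty text has no defined rate; report zero for every word.
        return {w: 0.0 for w in words}
    return {w: per * counts[w] / total for w in words}


# Toy placeholder sentences, invented for illustration only.
sample_a = "While the council met, the members waited while the clerk read."
sample_b = "Whilst the council met, the members waited whilst the clerk read."

markers = ["while", "whilst"]
rates_a = word_rates(sample_a, markers)
rates_b = word_rates(sample_b, markers)

print(rates_a)
print(rates_b)
```

With real texts, each entry of the returned dictionary would be one variable in the pool. In keeping with the abstract, a single rate on its own would rarely be expected to settle a question of authorship.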

Author information

Authors and Affiliations

Frederick Mosteller: Department of Statistics, Harvard University, Cambridge, MA, 02138, USA
David L. Wallace: Department of Statistics, University of Chicago, Chicago, IL, 60637, USA

About this chapter

Cite this chapter

Mosteller, F., Wallace, D.L. (1984). Words and Their Distributions. In: Applied Bayesian and Classical Inference. Springer Series in Statistics. Springer, New York, NY. https://doi.org/10.1007/978-1-4612-5256-6_2

DOI: https://doi.org/10.1007/978-1-4612-5256-6_2
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4612-9759-8
Online ISBN: 978-1-4612-5256-6
eBook Packages: Springer Book Archive
