Introduction to Probability Theory and Statistics

Zheng, Gang; Yang, Yaning; Zhu, Xiaofeng; Elston, Robert C.

doi:10.1007/978-1-4614-2245-7_1

Gang Zheng⁵,
Yaning Yang⁶,
Xiaofeng Zhu⁷ &
…
Robert C. Elston⁷

Part of the book series: Statistics for Biology and Health ((SBH))

2751 Accesses

Abstract

Basic probability theory and statistical models and procedures for the analysis of genetic studies are covered in Chap. 1. This chapter starts with an introduction to basic distribution theory and common distributions that are used in the book, including the uniform, multinomial, normal, t-, F-, Beta, Gamma, chi-squared and hypergeometric distributions. The basic distributions for order statistics are also given. Several types of stochastic convergence used in the book are summarized. Maximum likelihood estimation and its large sample properties are discussed. Various tests, including the efficient Score test, likelihood ratio test and Wald test, are studied with or without nuisance parameters. Multiple testing issues related to testing association with multiple genetic markers and related to hypothesis testing with an unknown genetic model are briefly reviewed. This chapter also covers the Delta method, the EM algorithm, basic concepts of sample size and power calculations, and asymptotic relative efficiency.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 79.99; Price excludes VAT (USA)

Softcover Book: USD 99.99; Price excludes VAT (USA)

Hardcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Benjamini, Y., Hochberg, Y.: Controlling the false discovery rate: a practical and powerful approach to multiple testing. J. Roy. Stat. Soc. Ser. B 57, 289–300 (1995)
MATH MathSciNet Google Scholar
Benjamini, Y., Yekutieli, D.: The control of the false discovery rate in multiple testing under dependency. Ann. Stat. 29, 1165–1188 (2001)
Article MATH MathSciNet Google Scholar
Casella, G., Berger, R.L.: Statistical Inference. Duxbury Press, Belmont (1990)
MATH Google Scholar
Ceppellini, R., Siniscalco, M., Smith, C.A.B.: The estimation of gene frequencies in a random mating population. Ann. Hum. Genet. 20, 97–115 (1955)
Article MATH MathSciNet Google Scholar
Cox, D.R., Hinkley, D.V.: Theoretical Statistics. Chapman & Hall/CRC, Boca Raton (1974)
MATH Google Scholar
David, H.A., Nagaraja, H.N.: Order Statistics. 3rd edn. Wiley, Hoboken (2003)
Book MATH Google Scholar
Dempster, A., Laird, N.M., Rubin, D.: Maximum likelihood from incomplete data via the EM algorithm (with discussion). J. Roy. Stat. Soc. Ser. B 39, 1–38 (1977)
MATH MathSciNet Google Scholar
Dudoit, S., van der Laan, M.J.: Multiple Testing Procedures with Applications to Genomics. Springer, New York (2008)
Book MATH Google Scholar
Elston, R.C., Johnson, W.D.: Basic Biostatistics for Genetists and Epidemiologists. Wiley, West Sussex (2008)
Google Scholar
Evans, M., Hastings, N., Peacock, B.: Statistical Distributions. 3rd edn. Wiley, New York (2000)
MATH Google Scholar
Freidlin, B., Zheng, G., Li, Z., Gastwirth, J.L.: Trend tests for case-control studies of genetic markers: power, sample size and robustness. Hum. Hered. 53, 146–152 (2002) (Erratum 68, 220 (2009))
Article Google Scholar
Gastwirth, J.L.: On robust procedures. J. Am. Stat. Assoc. 61, 929–948 (1966)
Article MATH MathSciNet Google Scholar
Gastwirth, J.L.: The use of maximin efficiency robust tests in combining contingency tables and survival analysis. J. Am. Stat. Assoc. 80, 380–384 (1985)
Article MATH MathSciNet Google Scholar
Gastwirth, J.L., Freidlin, B.: On power and efficiency robust linkage tests for affected sibs. Ann. Hum. Genet. 64, 443–453 (2000)
Article Google Scholar
Lachin, J.M.: Biostatistical Methods: The Assessment of Relative Risks. Wiley, New York (2000)
Book MATH Google Scholar
Noether, G.E.: On a theorem of Pitman. Ann. Math. Stat. 26, 64–68 (1955)
Article MATH MathSciNet Google Scholar
Robert, C.P., Casella, G.: Monte Carlo Statistical Methods. Springer, New York (2004)
MATH Google Scholar
Storey, J.D.: A direct approach to false discovery rates. J. Roy. Stat. Soc. Ser. B 64, 479–498 (2002)
Article MATH MathSciNet Google Scholar
Storey, J.D.: The positive false discovery rate: A Bayesian interpretation and the q-value. Ann. Stat. 31, 2013–2035 (2003)
Article MATH MathSciNet Google Scholar
van der Vaart, A.W.: Asymptotic Statistics. Cambridge University Press, Cambridge (1998)
MATH Google Scholar
Zheng, G., Freidlin, B., Gastwirth, J.L.: Robust TDT-type candidate-gene association tests. Ann. Hum. Hered. 66, 145–155 (2002)
Google Scholar
Zheng, G., Freidlin, B., Gastwirth, J.L.: Comparison of robust tests for genetic association using case-control studies. In: Rojo, J. (ed.) Optimality: The Second Erich L. Lehmann Symposium. Lecture Notes–Monograph Series, vol. 49, pp. 320–336. Institute of Mathematical Statistics, Beachwood (2006)
Google Scholar

Download references

Author information

Authors and Affiliations

Bethesda, MD, USA
Gang Zheng
School of Management, Dept. Statistics & Finance, University of Science & Technology of China, Hefei, Anhui, People’s Republic of China
Yaning Yang
School of Medicine, Dept. Epidemiology & Biostatistics, Case Western Reserve University, Cleveland, OH, USA
Xiaofeng Zhu & Robert C. Elston

Authors

Gang Zheng
View author publications
You can also search for this author in PubMed Google Scholar
Yaning Yang
View author publications
You can also search for this author in PubMed Google Scholar
Xiaofeng Zhu
View author publications
You can also search for this author in PubMed Google Scholar
Robert C. Elston
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Gang Zheng .

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Zheng, G., Yang, Y., Zhu, X., Elston, R.C. (2012). Introduction to Probability Theory and Statistics. In: Analysis of Genetic Association Studies. Statistics for Biology and Health. Springer, Boston, MA. https://doi.org/10.1007/978-1-4614-2245-7_1

Download citation

DOI: https://doi.org/10.1007/978-1-4614-2245-7_1
Publisher Name: Springer, Boston, MA
Print ISBN: 978-1-4614-2244-0
Online ISBN: 978-1-4614-2245-7
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)

Publish with us

Policies and ethics