Skip to main content

Statistical Hypothesis Testing

  • Chapter
  • First Online:

Abstract

In this chapter, hypothesis testing, which is another important method of statistical inference, is introduced. In the initial sections, the general guidelines for constructing hypotheses tests and the logic behind them are outlined. Then, the most common tests for normal populations are described and exemplified. To this purpose, a few hydrologic variables that are approximately symmetric are used as examples. It is well known, however, that hydrologic variables are skewed, a fact that imposes the use of the nonparametric tests, which do not assume from the beginning a distributional shape for the data being tested. As such, some sections of this chapter are devoted to describing some nonparametric tests, with particular emphasis to those that are essential to test the hypotheses of randomness, independence, homogeneity, and nonstationarity, generally required for hydrologic frequency analysis. In the final sections of this chapter, the most relevant goodness-of-fit tests, namely, the chi-square, Kolmogorov–Smirnov, Anderson–Darling, and PPCC (Probability Plot Correlation Coefficient) are described and exemplified, as well as the Grubbs–Beck test to detect and identify possible outliers in a sample. As usual in this textbook, the choice of methods and tests has been exercised on the basis of their practical usefulness for Statistical Hydrology. The subject of statistical hypothesis testing, however, is much broader and covers topics that are not included in this chapter, such as test optimality, sequential and multiple-hypotheses tests. Another important topic that is not covered in this chapter, that of likelihood-ratio tests, is described and employed in Chap. 12, in the context of nonstationary frequency analysis. The reader interested in tests and methods not covered in this chapter should consult Hoel et al. (1971) or Mood et al. (1974) or Ramachandran and Tsokos (2009), for books written at an intermediate level of complexity, or more in-depth texts such as Bickel and Doksum (1977) or Casella and Berger (1990).

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   79.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   99.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD   139.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  • Ahmad MI, Sinclair CD, Spurr BD (1998) Assessment of flood frequency models using empirical distribution function statistics. Water Resour Res 24(8):1323–1328

    Article  Google Scholar 

  • Anderson TW, Darling DA (1954) A test of goodness of fit. J Am Stat Assoc 49:756–769

    Google Scholar 

  • Bickel PJ, Doksum K (1977) Mathematical statistics: basic ideas and selected topics. Prentice Hall, Englewood Cliffs, NJ

    Google Scholar 

  • Bobée B, Ashkar F (1991) The Gamma family and derived distributions applied in hydrology. Water Resources Publications, Littleton, CO

    Google Scholar 

  • Casella G, Berger R (1990) Statistical inference. Duxbury Press, Belmont, CA

    Google Scholar 

  • Chandra M, Singpurwalla MD, Stephens MA (1981) Kolmogorov statistics for tests of fit for the extreme value and Weibull distribution. J Am Stat Assoc 76(375):729–731

    Google Scholar 

  • Chowdhury JU, Stedinger JR, Lu L-H (1991) Goodness of fit tests for regional generalized extreme value flood distributions. Water Resour Res 27(7):1765–1776

    Article  Google Scholar 

  • Crutcher HL (1975) A note on the possible misuse of the Kolmogorov-Smirnov test. J Appl Meteorol 14:1600–1603

    Article  Google Scholar 

  • D’Agostino RB, Stephens M (1986) Goodness-of-fit techniques. Marcel Dekker, New York

    Google Scholar 

  • Heo J-H, Kho YW, Shin H, Kim and Kim T (2008) Regression equations of probability plot correlation coefficient test statistic from several probability distributions. J Hydrol 355:1–15

    Article  Google Scholar 

  • Heo J-H, Shin H, Nam W, Om J, Jeong C (2013) Approximation of modified Anderson-Darling test statistic for extreme value distributions with unknown shape parameter. J Hydrol 499:41–49

    Article  Google Scholar 

  • Hoel PG, Port SC, Stone CJ (1971) Introduction to statistical theory. Houghton Mifflin, Boston

    Google Scholar 

  • Hollander M, Wolfe DA (1973) Nonparametric statistical methods. Wiley, New York

    Google Scholar 

  • Gibbons JD (1971) Nonparametric statistical inference. McGraw-Hill, New York

    Google Scholar 

  • Grubbs FE, Beck G (1972) Extension of sample sizes and percentage points for significance tests of outlying observations. Technometrics 14(4):847–854

    Article  Google Scholar 

  • Grubbs FE (1969) Procedures for detecting outlying observations in samples. Technometrics 11(1):1–21

    Article  Google Scholar 

  • Grubbs FE (1950) Sample criteria for testing outlying observations. Ann Math Stat 21(1):27–58

    Article  Google Scholar 

  • Filliben JJ (1975) The probability plot correlation coefficient test for normality. Technometrics 17(1):111–117

    Article  Google Scholar 

  • Kim S, Kho Y, Shin H, Heo J-H (2008) Derivation of probability plot correlation coefficient test statistics for the generalized logistic and the generalized Pareto distributions. Proceedings of the world environmental and water resources congress 2008 Ahupua’a, 1:10. American ociety of Civil Engineers

    Google Scholar 

  • Kolmogorov A (1933) Sulla determinazione empirica di una legge di distribuzione. Giornale dell’ Istituto Italiano degli Attuari 4:83–91

    Google Scholar 

  • Kottegoda NT, Rosso R (1997) Statistics, probability, and reliability for civil and environmental engineers. McGraw-Hill, New York

    Google Scholar 

  • Lamontagne JR, Stedinger JR (2016) Examination of the Spencer-McCuen outlier detection test for log-Pearson type III distributed data. J Hydrol Eng 21(3):1–7, 04015069

    Article  Google Scholar 

  • Lamontagne JR, Stedinger JR, Berenbrock C, Veilleux AG, Ferris JC, Knifong DL (2012) Development of regional skew for selected flood durations for the Central Valley Region, California, based on data through water year 2008. Report 2012-5130. United States Geological Survey, Reston, VA

    Google Scholar 

  • Larsen RJ, Marx ML (1986) An introduction to mathematical statistics and its applications. Prentice-Hall, Englewood Cliffs, NJ

    Google Scholar 

  • Lilliefors HW (1967) On the Kolmogorov-Smirnov test for normality with mean and variance unknown. J Am Stat Assoc 62(318):399–402

    Article  Google Scholar 

  • Mann HB, Whitney DR (1947) On the test of whether one of two random variables is stochastically larger than the other. Ann Math Stat 18:50–60

    Article  Google Scholar 

  • Mauget S (2011) Time series analysis based on running Mann-Whitney Z statistics. J Time Series Anal 32:47–53

    Article  Google Scholar 

  • Mood AM, Graybill FA, Boes DC (1974) Introduction to the theory of statistics, international, 3rd edn. McGraw-Hill, Singapore

    Google Scholar 

  • NERC (1975) Flood studies report, vol 1. National Environmental Research Council, London

    Google Scholar 

  • Özmen T (1993) A modified Anderson-Darling goodness-of-fit test for the gamma distribution with unknown scale and location parameter. Thesis of MSc in Operation Research, Air Force Institute of Technology, Wright-Patterson Air Force Base, OH

    Google Scholar 

  • Pettitt AN (1979) A non-parametric approach to the change-point problem. Appl Stat 28(2):126

    Article  Google Scholar 

  • Ramachandran KM, Tsokos CP (2009) Mathematical statistics with applications. Elsevier Academic Press, Burlington, MA

    Google Scholar 

  • Siegel S (1956) Nonparametric statistics for the behavioral sciences. McGraw-Hill, New York

    Google Scholar 

  • Spencer CS, McCuen RH (1996) Detection of outliers in Pearson type III data. J Hydrol Eng 1(2):2–10

    Article  Google Scholar 

  • Stedinger JR, Vogel RM, Foufoula-Georgiou E (1993) Frequency analysis of extreme events. In: Maidment DR (ed) Chapter 18 in handbook of hydrology. McGraw-Hill, New York

    Google Scholar 

  • Stephens MA (1974) EDF statistics for goodness of fit and some comparisons. J Am Stat Assoc 69(347):730–737

    Article  Google Scholar 

  • Smirnov N (1948) Table for estimating the goodness-of-fit of empirical distributions. Ann Math Stat 19:279–281

    Article  Google Scholar 

  • Vlcek O, Huth R (2009) Is daily precipitation Gamma-distributed? Adverse effects of an incorrect use of the Kolmogorov-Smirnov test. Atmos Res 93:759–766

    Article  Google Scholar 

  • Vogel RM, Kroll CN (1989) Low flow frequency analysis using probability-plot correlation coefficients. J Water Resour Plann Manag 115(3):338–357

    Article  Google Scholar 

  • Vogel RM, McMartin DE (1991) Probability plot goodness-of-fit and skewness estimation procedures for the Pearson type 3 distribution. Water Resourc Res 27(12):3149–3158

    Article  Google Scholar 

  • Wald A, Wolfowitz J (1943) An exact test for randomness in the non-parametric case based on serial correlation. Ann Math Stat 14:378–388

    Article  Google Scholar 

  • Wang QJ (1998) Approximate goodness-of-fit tests of fitted generalized extreme value distributions using LH moments. Water Resour Res 34(12):3497–3502

    Article  Google Scholar 

  • WMO (2009) Guide to hydrological practices. WMO No. 168, vol II, 6th edn. World Meteorological Organization, Geneva

    Google Scholar 

  • WRC (1981) Guidelines for determining flood flow frequency—Bulletin 17B. United States Water Resources Council-Hydrology Committee, Washington

    Google Scholar 

  • Yule GY, Kendall MG (1950) An introduction to the theory of statistics. Charles Griffin & Co., London

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Mauro Naghettini .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer International Publishing Switzerland

About this chapter

Cite this chapter

Naghettini, M. (2017). Statistical Hypothesis Testing. In: Naghettini, M. (eds) Fundamentals of Statistical Hydrology. Springer, Cham. https://doi.org/10.1007/978-3-319-43561-9_7

Download citation

Publish with us

Policies and ethics