Skip to main content

Abstract

Among the most important techniques in statistics is hypothesis testing. A hypothesis is a supposition about a certain state of affairs. It does not spring from a sudden epiphany or a long-standing conviction; rather, it offers a testable explanation of a specific phenomenon. A hypothesis is something that we can accept (verify) or reject (falsify) based on empirical data.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 54.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 69.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 99.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    In Excel 2010, this can be reached by clicking File → Options → Add-ins → Go.

  2. 2.

    For a very good explanation of how to perform this test in SPSS, see https://www.youtube.com/watch?v=MJGk2sg4EZU on the how2stats YouTube channel.

  3. 3.

    For a very good explanation of how to perform this test in Stata, see https://www.youtube.com/watch?v=ajzMeANAMzI on the Stata Learner YouTube channel.

  4. 4.

    For a very good explanation of how to perform this test in Excel, see https://www.youtube.com/watch?v=wy8GVt7Ityk on the YouTube channel of https://alphabench.com

  5. 5.

    Most statistical software packages perform this step automatically.

  6. 6.

    For a very good explanation of how to do this in SPSS, see https://www.youtube.com/watch?v=dkobjvhxTro on the YouTube channel of Dr. Todd Grande.

  7. 7.

    For a very good explanation of how to do this in Stata, see https://www.youtube.com/watch?v=2oJxerMCwIE and https://www.youtube.com/watch?v=NIwtaZqNFs8 on the Stata Learner YouTube channel.

  8. 8.

    For a very good explanation of how to do this in Excel, see https://www.youtube.com/watch?v=xlgeta9FivI on the YouTube channel of Matthias Kullowatz and https://www.youtube.com/watch?v=mJtbhGETU88 on the YouTube channel of Dr. Todd Grande.

  9. 9.

    Based on the results of the Kolmogorov–Smirnov test and the Shapiro–Wilk test (see Sect. 9.6.2), we must reject the hypothesis of a normal distribution.

  10. 10.

    Based on the results of the Kolmogorov–Smirnov test and the Shapiro–Wilk test (see Sect. 9.6.2), we must reject the hypothesis of a normal distribution.

  11. 11.

    In Excel 2010 this can be reached by clicking File → Options → Add-ins → Go.

  12. 12.

    For a very good explanation of how to perform these steps using Excel, see https://www.youtube.com/watch?v=BlS11D2VL_U on the YouTube channel of Jim Grange or https://www.youtube.com/watch?v=X14z9r8FUKY on the YouTube channel of Dr. James Clark from the Kings College London (Essential Life Science Tutorials).

  13. 13.

    See Conover (1980). For more information on accurately calculating the approximated U test, see Bortz et al. (2000, p. 200).

  14. 14.

    Based on the results of the Kolmogorov–Smirnov test and the Shapiro–Wilk test, we must reject the hypothesis of a normal distribution.

  15. 15.

    See the file Chocopraline_colour_name_price.sav for SPSS and Chocopraline_colour_name_price.dta for Stata.

  16. 16.

    The result of a single-factor univariate ANOVA for a factor with two traits is the same as that of a t-test for independent samples.

  17. 17.

    Traditionally, the F-test (or its application to groups, Bartlett’s test) is used to measure equal variance. But these tests react very sensitively to deviations from the normal distribution, which is why the more robust Levene’s test is preferred. SPSS automatically performs Levene’s test for equal variance when performing ANOVA (choose Options → Homogeneity tests). Stata uses the one-way ANOVA to determine Bartlett’s test for equal variances. The Levene’s test calculation (w_0) is located under the heading Hypothesis Tests.

  18. 18.

    Strictly speaking, all the measures of regression diagnostics (heteroskedasticity, autocorrelation, multicollinearity, etc.) should be performed when carrying out an ANCOVA (see Sect. 10.10).

  19. 19.

    When using Excel 2010, select File → Options → Add-ins → GO instead.

  20. 20.

    For a very good explanation of ANOVA in Excel, see https://www.youtube.com/watch?v=tPGPV_XPw-o on the YouTube channel of Dr. James Clark from the Kings College London (Essential Life Science Tutorials) and https://www.youtube.com/watch?v=JfUf5DR2Azs on the YouTube channel of StatisticsHowTo.com

  21. 21.

    ti represents the respective number of rank scores for the value i. In our example, we have 4 rank scores of 2.5 for value 1, 3 rank scores of 6 for value 2, 8 rank scores of 11.5 for value 3, and 9 rank scores of 20 for value 4.

  22. 22.

    The data in titanic.sav (SPSS), titanic.dta (Stata), and titanic.xls (Excel) contain figures on the number of persons on board and the number of victims. The data is taken from the British Board of Trade Inquiry Report (1990), Report on the Loss of the Titanic′ (S.S.), Gloucester (reprint).

  23. 23.

    For a very good explanation of how to calculate the chi-square test of Independence using SPSS, see https://www.youtube.com/watch?v=wfIfEWMJY3s on the YouTube channel of ASK Brunel.

  24. 24.

    Syntax command: tabulate class survived, cchi2 cell chi2 clrchi2 column expected row V.

  25. 25.

    For a very good explanation of how to calculate the chi-square test of independence using Stata, see: https://www.youtube.com/watch?v=GZIi9zAlzIA on the StataCorp LLC YouTube channel.

  26. 26.

    For a very good explanation of how to calculate the chi-square test of independence using Excel, see https://www.youtube.com/watch?v=ODxEoDyF6RI on the YouTube channel of Ken Blake.

  27. 27.

    For a very good explanation of how to test for normal distribution using SPSS, see https://www.youtube.com/watch?v=dK-JNR3g_LU on the Dragonfly Statistics YouTube channel and https://www.youtube.com/watch?v=sQkB-AlJgPI on the HowToStats.com YouTube channel.

References

  • Backhaus, K., Erichson, B., Plinke, W., Weiber, R. (2016). Multivariate Analysemethoden. Eine anwendungsorientierte Einführung, 14th Edition. Berlin, Heidelberg: SpringerGabler.

    Book  Google Scholar 

  • Bortz, J. (1999). Statistik für Sozialwissenschaftler, 5th Edition. Berlin, Heidelberg: Springer.

    Google Scholar 

  • Bortz, J., Lienert, G. A., Boehnke, K. (2000). Verteilungsfreie Methoden der Biostatistik, 2nd Edition. Berlin, Heidelberg: Springer.

    Book  Google Scholar 

  • Brown, M. B., Forsythe, A. B. (1974). Robust tests for the equality of variances. Journal of the American Statistical Association, 69, 364–367.

    Article  Google Scholar 

  • Conover, W.J. (1980). Practical nonparametric statistics. New-York: Wiley.

    Google Scholar 

  • Dixon, W.J. (1954). Power under normality of several nonparametric tests. The Annals of Mathematical Statistics, 25: 610-614.

    Article  Google Scholar 

  • Field, A. (2005). Discovering Statistics Using SPSS. London: Sage.

    Google Scholar 

  • Fisher, R.A., Yates, F. (1963). Statistical tables for biological, agricultural, and medical research. London: Oliver and Boyed.

    Google Scholar 

  • Hair, J.F., Anderson, R.E., Tatham, R.L., Black, W.C. (1998). Multivariate Data Analysis, 5th Edition. London: Prentice Hall.

    Google Scholar 

  • Kolmogorov, A. N. (1933). Sulla determinazione empirica di una legge di distribuzione. Giornale dell’ Istituto Italiano degli Attuari, 4, 83–91.

    Google Scholar 

  • Kruskal, W. H., Wallis, W. A. (1952). Use of ranks in one-criterion variance analysis. Journal of the American Statistical Association, 47, 583–621.

    Article  Google Scholar 

  • Kruskal, W. H., Wallis, W. A. (1953). Errata: Use of ranks in one-criterion variance analysis. Journal of the American Statistical Association, 48, 907–911.

    Article  Google Scholar 

  • Mann, H.B., Whitney, D.R. (1947). On a test whether one of two random variables is stochastically larger than the other. The Annals of Mathematical Statistics, 18, 65-78.

    Article  Google Scholar 

  • Neyman, J. (1937). Outline of a theory of statistical estimation based on the classical theory of probability. Philosophical transactions of the Royal Society, Series A, 236.

    Google Scholar 

  • Neyman, J. & Pearson, E. S. (1928a). On the use and interpretation of certain test criteria for purposes of statistical inference, part i. Biometrika, 20A, 175–240.

    Google Scholar 

  • Neyman, J. & Pearson, E. S. (1928b). On the use and interpretation of certain test criteria for purposes of statistical inference, part ii. Biometrika, 20A, 263–294.

    Google Scholar 

  • Popper, K. (1934). Logik der Forschung. Tübingen: Mohr.

    Google Scholar 

  • Satterthwaite, F. E. (1946). An approximate distribution of estimates of variance components. Biometrics Bulletin 2: 110-114.

    Article  Google Scholar 

  • Scheffé, H. (1953). A method for judging all contrasts in the analysis of variance. Biometrika, 40, 87-104.

    Google Scholar 

  • Shapiro, S. S., and R. S. Francia (1972). An approximate analysis of variance test for normality. Journal of the American Statistical Association, 67, 215–216.

    Article  Google Scholar 

  • Shapiro, S. S., and M. B. Wilk (1965). An analysis of variance test for normality (complete samples). Biometrika, 52, 591–611.

    Article  Google Scholar 

  • Smirnov, N. V. (1933). Estimate of deviation between empirical distribution functions in two independent samples. Bulletin Moscow University, 2, 3–16.

    Google Scholar 

  • Spearman, C.E. (1904). The proof and measurement of association between two things. American Journal of Psychology, 15(1), 72–101.

    Article  Google Scholar 

  • Stevens, J. P. (1972). Four Methods of Analyzing between Variations for the k-Group MANOVA Problem, Multivariate Behaviorial Research, 7, 442-454.

    Google Scholar 

  • Welch, B. L. (1947). The generalization of Student’s problem when several different population variances are involved. Biometrika, 34, 28–35.

    Google Scholar 

  • Wilcoxon, F. (1945). Individual comparisons by ranking methods. Biometrics, 1, 80-83.

    Article  Google Scholar 

  • Wilcoxon, F. (1947). Probability tables for individual comparisons by ranking methods. Biometrics, 3, 119-122.

    Article  Google Scholar 

  • Witting, H. (1960). A generalized Pitman efficiency for nonparametric tests. The Annals of Mathematical Statistics, 31, 405-414.

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Switzerland AG

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Cleff, T. (2019). Hypothesis Testing. In: Applied Statistics and Multivariate Data Analysis for Business and Economics. Springer, Cham. https://doi.org/10.1007/978-3-030-17767-6_9

Download citation

Publish with us

Policies and ethics