Advertisement

Outliers

  • G. Barrie Wetherill
  • P. Duncombe
  • M. Kenward
  • J. Köllerström
  • S. R. Paul
  • B. J. Vowden
Chapter
Part of the Monographs on Statistics and Applied Probability book series (MSAP)

Abstract

We have emphasized in Chapter 2 that outlying observations may occur in a data set due to a variety of causes. There are two quite separate questions which arise and these must be carefully distinguished. One problem is to have some statistical techniques which may indicate outlying observations and so select them for special study. That is the problem which we discuss in the rest of this chapter. The second problem is what to do with these outliers, once they are located, and we make a few remarks on that problem now.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Andrews, D. F. (1974) A robust method for multiple linear regression. Technometrics, 16, 523–531.CrossRefGoogle Scholar
  2. Andrews, D. F. and Pregibon, D. (1978) Finding the outliers that matter. J. Roy. Statist. Soc. B, 40, 85–93.Google Scholar
  3. Atkinson, A. C. (1981) Two graphical displays for outlying and influential observations in regression. Biometrika, 68, 13–20.CrossRefGoogle Scholar
  4. Barnett, V. and Lewis, T. (1978) Outliers in Statistical Data. Wiley, New York.Google Scholar
  5. Beckman, R.J. and Cook, R. D. (1983) Outlier…s (with Discussion). Technometrics, 25, 119–163.Google Scholar
  6. Belsley, D. A., Kuh, E. and Welsch, R. E. (1980) Regression Diagnostics: Identifying Influential Data and Source Collinearity. Wiley, New York.CrossRefGoogle Scholar
  7. Bingham, C. (1977) Some identities useful in the analysis of residuals from linear regression. Technical Report No. 300, School of Statistics, University of Minnesota.Google Scholar
  8. Brownlee, K. A. (1965) Statistical Theory and Methodology in Science and Engineering, 2nd edn, Wiley, New York.Google Scholar
  9. Cook R. D. (1977) Detection of influential observations in linear regression. Technometrics, 19, 15–18.CrossRefGoogle Scholar
  10. Cook, R. D. (1979) Influential observations in linear regression. J. Amer. Statist. Assoc., 74, 169–174.CrossRefGoogle Scholar
  11. Cook, R. D. and Weisberg, S. (1980) Characterization of an influence function for detecting influential cases in regression. Technometrics, 22, 495–508.CrossRefGoogle Scholar
  12. Cook, R. D. and Weisberg, S. (1982) Residuals and Influence in Regression, Chapman and Hall, London.Google Scholar
  13. Cran, G. W., Martin, K. J. and Thomas, G. E. (1977) Remark ASR19 and Algorithm AS 109: a remark on Algorithm AS63: the incomplete integral, AS64: inverse of the incomplete beta function ratio. Appl. Statist., 26, 111–114.CrossRefGoogle Scholar
  14. Daniel, C. and Wood, F. S. (1971) Fitting Equations to Data. Wiley, New York.Google Scholar
  15. Dempster, A. D. and Gasko-Green, M. (1981) New tools for residual analysis. Ann. Statist. 9, 945–959.CrossRefGoogle Scholar
  16. Draper, N. R. and John, J. A. (1981) Influential observations and outliers in regression. Technometrics, 23, 21–26.CrossRefGoogle Scholar
  17. Draper, N. R. and Stoneman, D. M. (1966). Testing for the inclusion of variables in linear regression by a randomization technique. Technometrics, 8, 695–699.CrossRefGoogle Scholar
  18. Gentleman, J. F. and Wilk, M. B. (1975) Detecting outliers, II. Supplementing the direct analysis of residuals. Biometrics, 31, 387–410.CrossRefGoogle Scholar
  19. Hoaglin, D. C. and Welsch, R. E. (1978) The hat matrix in regression and ANOVA. Amer. Statist., 32, 17–22.Google Scholar
  20. Longley, J. W. (1967) An appraisal of least squares program for the electronic computer from the point of view of the user. J. Amer. Statist. Assoc., 62, 819–841.CrossRefGoogle Scholar
  21. Lund, R. E. (1975) Tables for an approximate test for outliers in linear models. Technometrics, 17, 473–476.CrossRefGoogle Scholar
  22. Mickey, M. R., Dunn, O. J., and Clark, V. (1967) Note on use of stepwise regression in detecting outliers. Comput. Biomed. Res., 1, 105–111.CrossRefGoogle Scholar
  23. Paul, S. R. (1983) Sequential detection of unusual points in regression. The Statistician, 32, 105–112.CrossRefGoogle Scholar
  24. Paul, S. R. (1985a) Critical values of ‘maximum studentized residual’ statistics in multiple linear regression. Biom. J., 26, 1–5.Google Scholar
  25. Prescott, P. (1975) An approximate test for outliers in linear models. Technometrics, 17, 127–132.CrossRefGoogle Scholar
  26. Rao, C. R. (1965) Linear Statistical Influence and Its Applications. Wiley, New York.Google Scholar
  27. Seber, G. A. F. (1977) Linear Regression Analysis. Wiley, New York.Google Scholar
  28. Srikantan, K. S. (1961) Testing for the single outlier in the regression model. Sankhya A, 23, 251–260.Google Scholar

Further Reading

  1. Paul, S. R. (1985b) A note on maximum likelihood ratio test of no outliers in regression models. Biom. J. (to appear).Google Scholar
  2. Wetherill, G. B. (1981) Intermediate Statistical Methods. Chapman and Hall, London.CrossRefGoogle Scholar

Copyright information

© G. Barrie Wetherill 1986

Authors and Affiliations

  • G. Barrie Wetherill
    • 1
  • P. Duncombe
    • 2
  • M. Kenward
    • 3
  • J. Köllerström
    • 3
  • S. R. Paul
    • 4
  • B. J. Vowden
    • 3
  1. 1.Department of StatisticsThe University of Newcastle upon TyneUK
  2. 2.Applied Statistics Research UnitUniversity of Kent at CanterburyUK
  3. 3.Mathematical InstituteUniversity of Kent at CanterburyUK
  4. 4.Department of Mathematics and StatisticsUniversity of WindsorCanada

Personalised recommendations