Skip to main content

Improving the Classification Performance of a Discriminant Rule by Dealing With Data Cases Having a Substantial Influence on Variable Selection

  • Conference paper
Between Data Science and Applied Data Analysis
  • 821 Accesses

Abstract

An important problem in discriminant analysis that has received little attention in the literature is the effect of outliers when variable selection forms part of the analysis. In this paper we argue that variable selection and outlier identification should not be done sequentially, but should rather be integrated. We investigate an integrated approach, and compare its classification performance to that of a sequential approach in a limited simulation study.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  • ANDREWS, D.F. and PREGIBON, D. (1978): Finding the outliers that matter. Journal of the Royal Statistical Society B, 40, 85–93.

    MATH  Google Scholar 

  • BREIMAN, L., FRIEDMAN, J.H., OLSHEN, R.A. and STONE, C.J. (1993): Classification and Regression Trees. Chapman and Hall, New York.

    Google Scholar 

  • FUNG, W.K. (1995): Diagnostics in linear discriminant analysis. Journal of the American Statistical Association, 90, 952–956.

    Article  MathSciNet  MATH  Google Scholar 

  • HAWKINS, D.M. and MCLACHLAN, G.J. (1997): High-breakdown linear discriminant analysis. Journal of the American Statistical Association, 92, 136–143.

    Article  MathSciNet  MATH  Google Scholar 

  • LE ROUX, N.J., STEEL, S.J. and LOUW, N. (1997): Variable selection and error rate estimation in discriminant analysis. Journal of Statistical Computation and Simulation, 59, 195–219.

    Article  MATH  Google Scholar 

  • RIANI, M. and ATKINSON, A.C. (2001): A unified approach to outliers, influence and transformations in discriminant analysis. Journal of Computational and Graphical Statistics, 10, 513–544.

    Article  MathSciNet  Google Scholar 

  • ROUSSEEUW, P.J. and LEROY, A. (1987): Robust regression and outlier detection. Wiley, New York.

    Book  MATH  Google Scholar 

  • STEEL, S.J. and LOUW, N. (2001): Variable selection in discriminant analysis: measuring the influence of individual cases. Computational Statistics and Data Analysis, 37, 249–260.

    Article  MathSciNet  MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2003 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Steel, S., Louw, N. (2003). Improving the Classification Performance of a Discriminant Rule by Dealing With Data Cases Having a Substantial Influence on Variable Selection. In: Schader, M., Gaul, W., Vichi, M. (eds) Between Data Science and Applied Data Analysis. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-18991-3_13

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-18991-3_13

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-40354-8

  • Online ISBN: 978-3-642-18991-3

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics