Abstract
An important problem in discriminant analysis that has received little attention in the literature is the effect of outliers when variable selection forms part of the analysis. In this paper we argue that variable selection and outlier identification should not be done sequentially, but should rather be integrated. We investigate an integrated approach, and compare its classification performance to that of a sequential approach in a limited simulation study.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
ANDREWS, D.F. and PREGIBON, D. (1978): Finding the outliers that matter. Journal of the Royal Statistical Society B, 40, 85–93.
BREIMAN, L., FRIEDMAN, J.H., OLSHEN, R.A. and STONE, C.J. (1993): Classification and Regression Trees. Chapman and Hall, New York.
FUNG, W.K. (1995): Diagnostics in linear discriminant analysis. Journal of the American Statistical Association, 90, 952–956.
HAWKINS, D.M. and MCLACHLAN, G.J. (1997): High-breakdown linear discriminant analysis. Journal of the American Statistical Association, 92, 136–143.
LE ROUX, N.J., STEEL, S.J. and LOUW, N. (1997): Variable selection and error rate estimation in discriminant analysis. Journal of Statistical Computation and Simulation, 59, 195–219.
RIANI, M. and ATKINSON, A.C. (2001): A unified approach to outliers, influence and transformations in discriminant analysis. Journal of Computational and Graphical Statistics, 10, 513–544.
ROUSSEEUW, P.J. and LEROY, A. (1987): Robust regression and outlier detection. Wiley, New York.
STEEL, S.J. and LOUW, N. (2001): Variable selection in discriminant analysis: measuring the influence of individual cases. Computational Statistics and Data Analysis, 37, 249–260.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2003 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Steel, S., Louw, N. (2003). Improving the Classification Performance of a Discriminant Rule by Dealing With Data Cases Having a Substantial Influence on Variable Selection. In: Schader, M., Gaul, W., Vichi, M. (eds) Between Data Science and Applied Data Analysis. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-18991-3_13
Download citation
DOI: https://doi.org/10.1007/978-3-642-18991-3_13
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-40354-8
Online ISBN: 978-3-642-18991-3
eBook Packages: Springer Book Archive