Abstract
Multivariate outliers are usually identified by means of robust distances. A statistically principled method for accurate outlier detection requires both availability of a good approximation to the finite-sample distribution of the robust distances and correction for the multiplicity implied by repeated testing of all the observations for outlyingness. These principles are not always met by the currently available methods. The goal of this paper is thus to provide data analysts with useful information about the practical behaviour of some popular competing techniques. Our conclusion is that the additional information provided by a data-driven level of trimming is an important bonus which ensures an often considerable gain in power.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
The Authors are grateful to Dr. Spyros Arsenis and Dr. Domenico Perrotta for pointing out this historical reference.
- 2.
In the RRCOV packege of the R software this option is called eff.shape
References
Atkinson, A. C., & Riani, M. (2000). Robust diagnostic regression analysis. New-York: Springer.
Atkinson, A. C., Riani, M., & Cerioli, A. (2004). Exploring multivariate data with the forward search. New York: Springer.
Cerioli, A. (2010). Multivariate outlier detection with high-breakdown estimators. Journal of the American Statistical Association, 105, 147–156.
Cerioli, A., & Farcomeni, A. (2011). Error rates for multivariate outlier detection. Computational Statistics and Data Analysis, 55, 544–553.
Cerioli, A., Riani, M., & Atkinson, A. C. (2009). Controlling the size of multivariate outlier tests with the MCD estimator of scatter. Statistics and Computing, 19, 341–353
Cerioli, A., Atkinson, A. C., & Riani, M. (2011a). Some perspectives on multivariate outlier detection. In S. Ingrassia, R. Rocci, & M. Vichi (Eds.), New perspectives in statistical modeling and data analysis (pp. 231–238). Berlin/Heidelberg: Springer.
Cerioli, A., Riani, M., & Torti, F. (2011b). Accurate and powerful multivariate outlier detection. 58th congress of ISI, Dublin.
Hadi, A. S., Rahmatullah Imon, A. H. M., & Werner, M. (2009). Detection of outliers. WIREs Computational Statistics, 1, 57–70.
Hubert, M., Rousseeuw, P. J., & Van aelst, S. (2008). High-breakdown robust multivariate methods. Statistical Science, 23, 92–119.
Maronna, R. A., Martin, D. G., & Yohai, V. J. (2006). Robust statistics. New York: Wiley.
Morgenthaler, S. (2006). A survey of robust statistics. Statistical Methods and Applications, 15, 271–293 (Erratum 16, 171–172).
Perrotta, D., Riani, M., & Torti, F. (2009). New robust dynamic plots for regression mixture detection. Advances in Data Analysis and Classification, 3, 263–279.
Pison, G., Van aelst, S., & Willems, G. (2002). Small sample corrections for LTS and MCD. Metrika, 55, 111–123.
Riani, M., Atkinson, A. C., & Cerioli, A. (2009). Finding an unknown number of multivariate outliers. Journal of the Royal Statistical Society B, 71, 447–466.
Riani, M., Torti, F., & Zani, S. (2011). Outliers and robustness for ordinal data. In R. S. Kennet & S. Salini (Eds.), Modern analysis of customer satisfaction surveys: with applications using R. Chichester: Wiley.
Riani, M., Cerioli, A., & Torti, F. (2012). A new look at consistency factors and efficiency of robust scale estimators. Submitted.
Rousseeuw, P. J., & Leroy, A. M. (1987). Robust regression and outlier detection. New York: Wiley.
Rousseeuw, P. J. & Van Driessen, K. (1999). A fast algorithm for the minimum covariance determinant estimator. Technometrics, 41, 212–223.
Salibian-barrera, M., Van Aelst, S., & Willems, G. (2006). Principal components analysis based on multivariate mm estimators with fast and robust bootstrap. Journal of the American Statistical Association, 101, 1198–1211.
Todorov, V., & Filzmoser, P. (2009). An object-oriented framework for robust multivariate analysis. Journal of Statistical Software, 32, 1–47.
Wilks, S. S. (1963). Multivariate statistical outliers. Sankhya A, 25, 407–426.
Acknowledgements
The authors thank the financial support of the project MIUR PRIN MISURA - Multivariate models for risk assessment.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer International Publishing Switzerland
About this paper
Cite this paper
Cerioli, A., Riani, M., Torti, F. (2013). Size and Power of Multivariate Outlier Detection Rules. In: Lausen, B., Van den Poel, D., Ultsch, A. (eds) Algorithms from and for Nature and Life. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Cham. https://doi.org/10.1007/978-3-319-00035-0_1
Download citation
DOI: https://doi.org/10.1007/978-3-319-00035-0_1
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-00034-3
Online ISBN: 978-3-319-00035-0
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)