Skip to main content

Mixture Models in Forward Search Methods for Outlier Detection

  • Conference paper
Book cover Data Analysis, Machine Learning and Applications

Abstract

Forward search (FS) methods have been shown to be usefully employed for detecting multiple outliers in continuous multivariate data (Hadi, (1994); Atkinson et al., (2004)). Starting from an outlier-free subset of observations, they iteratively enlarge this good subset using Mahalanobis distances based only on the good observations. In this paper, an alternative formulation of the FS paradigm is presented, that takes a mixture of K > 1 normal components as a null model. The proposal is developed according to both the graphical and the inferential approach to FS-based outlier detection. The performance of the method is shown on an illustrative example and evaluated on a simulation experiment in the multiple cluster setting.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  • ATKINSON, A.C. (1993): Stalactite plots and robust estimation for the detection of multivari-ate outliers. In: E. Ronchetti, E. Morgenthaler, and W. Stahel (Eds.): New Directions in Statistical Data Analysis and Robustenss., Birkhäuser, Basel.

    Google Scholar 

  • ATKINSON, A.C., RIANI, C. and CERIOLI A. (2004): Exploring Multivariate Data with the Forward Search. Springer, New York.

    MATH  Google Scholar 

  • FRALEY, C. and RAFTERY, A.E. (1998): How may clusters? Which clustering method? Answers via model-based cluster analysis. The Computer Journal, 41, 578-588.

    Article  MATH  Google Scholar 

  • HADI, A.S. (1994): A modification of a method for the detection of outliers in multivariate samples. J R Stat Soc, Ser B, 56, 393-396.

    MATH  Google Scholar 

  • HARDIN, J. and ROCKE D.M. (2004): Outlier detection in the multiple cluster setting us-ing the minimum covariance determinant estimator. Computational Statistics and Data Analysis, 44, 625-638.

    Article  MathSciNet  Google Scholar 

  • HENNIG, C. (2004): Breakdown point for maximum likelihood estimators of location-scale mixtures. The Annals of Statistics, 32, 1313-1340.

    Article  MATH  MathSciNet  Google Scholar 

  • MCLACHLAN, G.J. and BASFORD K.E. (1988): Mixture Models: Inference and Applica-tions to Clustering. Marcel Dekker, New York.

    Google Scholar 

  • MCLACHLAN, G.J. and PEEL, D. (2000): Finite Mixture Models. Wiley, New York.

    Book  MATH  Google Scholar 

  • WANG S. et al. (1997): A new test for outlier detection from a multivariate mixture distribu-tion, Journal of Computational and Graphical Statistics, 6, 285-299.

    Article  MathSciNet  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2008 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Calò, D.G. (2008). Mixture Models in Forward Search Methods for Outlier Detection. In: Preisach, C., Burkhardt, H., Schmidt-Thieme, L., Decker, R. (eds) Data Analysis, Machine Learning and Applications. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-78246-9_13

Download citation

Publish with us

Policies and ethics