Model Combination Methods for Outlier Ensembles

Aggarwal, Charu C.; Sathe, Saket

doi:10.1007/978-3-319-54765-7_5

Charu C. Aggarwal³ &
Saket Sathe³

2091 Accesses

Abstract

An important part of the process of creating outlier ensembles is to combine the outputs of different detectors. The precise method for model combination has a significant impact on the effectiveness of a particular outlier detection method because of the varying theoretical effects of different combination methods. For example, the impact of the scheme of averaging is quite different from that of maximization in terms of the bias and variance of the result. Therefore, the choice of model combination has a crucial effect on the results of the ensemble.

Only government can take perfectly good paper, cover it with perfectly good ink and make the combination worthless.

Milton Friedman

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Hardcover Book: USD 84.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
The original LOF paper recognized the problem of dilution from irrelevant ensemble components and therefore suggested the use of the maximization function.

References

C. C. Aggarwal. Outlier Ensembles: Position Paper, ACM SIGKDD Explorations, 14(2), pp. 49–58, December, 2012.
Google Scholar
C. C. Aggarwal. Recommender Systems: The Textbook, Springer, 2016.
Google Scholar
C. C. Aggarwal. Outlier Analysis, Second Edition, Springer, 2017.
Google Scholar
C. C. Aggarwal and S. Sathe. Theoretical Foundations and Algorithms for Outlier Ensembles, ACM SIGKDD Explorations, 17(1), June 2015.
Google Scholar
C. C. Aggarwal and P. S. Yu. Outlier Detection in High Dimensional Data, ACM SIGMOD Conference, 2001.
Google Scholar
C. C. Aggarwal and P. S. Yu. Outlier Detection in Graph Streams, IEEE ICDE Conference, 2011.
Google Scholar
D. Barbara, Y. Li, J. Couto, J.-L. Lin, and S. Jajodia. Bootstrapping a Data Mining Intrusion Detection System. Symposium on Applied Computing, 2003.
Google Scholar
M. Breunig, H.-P. Kriegel, R. Ng, and J. Sander. LOF: Identifying Density-based Local Outliers, ACM SIGMOD Conference, 2000.
Google Scholar
L. Brieman. Bagging Predictors. Machine Learning, 24(2), pp. 123–140, 1996.
Google Scholar
L. Brieman. Random Forests. Journal Machine Learning archive, 45(1), pp. 5–32, 2001.
Google Scholar
G. Brown, J. Wyatt, R. Harris, and X. Yao. Diversity creation methods: a survey and categorisation. Information Fusion, 6:5(20), 2005.
Google Scholar
P. Buhlmann. Bagging, subagging and bragging for improving some prediction algorithms, Recent advances and trends in nonparametric statistics, Elsevier, 2003.
Google Scholar
P. Buhlmann, B. Yu. Analyzing bagging. Annals of Statistics, pp. 927–961, 2002.
Google Scholar
A. Buja, W. Stuetzle. Observations on bagging. Statistica Sinica, 16(2), 323, 2006.
Google Scholar
J. Chen, S. Sathe, C. Aggarwal and D. Turaga. Outlier detection with ensembles of autoencoders. In preparation, 2017.
Google Scholar
J. Gao, P.-N. Tan. Converting output scores from outlier detection algorithms into probability estimates. ICDM Conference, 2006.
Google Scholar
Z. He, S. Deng and X. Xu. A Unified Subspace Outlier Ensemble Framework for Outlier Detection, Advances in Web Age Information Management, 2005.
Google Scholar
M. Kendall. A New Measure of Rank Correlation. Biometrika, 30(1/2), 81–93, 1938.
Google Scholar
A. Lazarevic, and V. Kumar. Feature Bagging for Outlier Detection, ACM KDD Conference, 2005.
Google Scholar
B. Micenkova, B. McWiliams, and I. Assent. Learning Outlier Ensembles: The Best of Both Worlds – Supervised and Unsupervised. Outlier Detection and Description Workshop, 2014. Extended version: http://arxiv.org/pdf/1507.08104v1.pdf.
S. Papadimitriou, H. Kitagawa, P. Gibbons, and C. Faloutsos, LOCI: Fast outlier detection using the local correlation integral, ICDE Conference, 2003.
Google Scholar
M. Shyu, S. Chen, K. Sarinnapakorn, L. Chang. A novel anomaly detection scheme based on principal component classifier. ICDMW, 2003.
Google Scholar
D. Wolpert. Stacked Generalization, Neural Networks, 5(2), pp. 241–259, 1992.
Google Scholar
A. Zimek, R. Campello, J. Sander. Ensembles for unsupervised outlier detection: Challenges and research questions, SIGKDD Explorations, 15(1), 2013.
Google Scholar

Download references

Author information

Authors and Affiliations

IBM T. J. Watson Research Center, Yorktown Heights, NY, USA
Charu C. Aggarwal & Saket Sathe

Authors

Charu C. Aggarwal
View author publications
You can also search for this author in PubMed Google Scholar
Saket Sathe
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Charu C. Aggarwal .

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Aggarwal, C.C., Sathe, S. (2017). Model Combination Methods for Outlier Ensembles. In: Outlier Ensembles. Springer, Cham. https://doi.org/10.1007/978-3-319-54765-7_5

Download citation

DOI: https://doi.org/10.1007/978-3-319-54765-7_5
Published: 07 April 2017
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-54764-0
Online ISBN: 978-3-319-54765-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics