Abstract
An important part of the process of creating outlier ensembles is to combine the outputs of different detectors. The precise method for model combination has a significant impact on the effectiveness of a particular outlier detection method because of the varying theoretical effects of different combination methods. For example, the impact of the scheme of averaging is quite different from that of maximization in terms of the bias and variance of the result. Therefore, the choice of model combination has a crucial effect on the results of the ensemble.
Keywords
- Receiver Operating Characteristic Curve
- Maximization Function
- Combination Method
- Variance Reduction
- Model Combination
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
Only government can take perfectly good paper, cover it with perfectly good ink and make the combination worthless.
Milton Friedman
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsNotes
- 1.
The original LOF paper recognized the problem of dilution from irrelevant ensemble components and therefore suggested the use of the maximization function.
References
C. C. Aggarwal. Outlier Ensembles: Position Paper, ACM SIGKDD Explorations, 14(2), pp. 49–58, December, 2012.
C. C. Aggarwal. Recommender Systems: The Textbook, Springer, 2016.
C. C. Aggarwal. Outlier Analysis, Second Edition, Springer, 2017.
C. C. Aggarwal and S. Sathe. Theoretical Foundations and Algorithms for Outlier Ensembles, ACM SIGKDD Explorations, 17(1), June 2015.
C. C. Aggarwal and P. S. Yu. Outlier Detection in High Dimensional Data, ACM SIGMOD Conference, 2001.
C. C. Aggarwal and P. S. Yu. Outlier Detection in Graph Streams, IEEE ICDE Conference, 2011.
D. Barbara, Y. Li, J. Couto, J.-L. Lin, and S. Jajodia. Bootstrapping a Data Mining Intrusion Detection System. Symposium on Applied Computing, 2003.
M. Breunig, H.-P. Kriegel, R. Ng, and J. Sander. LOF: Identifying Density-based Local Outliers, ACM SIGMOD Conference, 2000.
L. Brieman. Bagging Predictors. Machine Learning, 24(2), pp. 123–140, 1996.
L. Brieman. Random Forests. Journal Machine Learning archive, 45(1), pp. 5–32, 2001.
G. Brown, J. Wyatt, R. Harris, and X. Yao. Diversity creation methods: a survey and categorisation. Information Fusion, 6:5(20), 2005.
P. Buhlmann. Bagging, subagging and bragging for improving some prediction algorithms, Recent advances and trends in nonparametric statistics, Elsevier, 2003.
P. Buhlmann, B. Yu. Analyzing bagging. Annals of Statistics, pp. 927–961, 2002.
A. Buja, W. Stuetzle. Observations on bagging. Statistica Sinica, 16(2), 323, 2006.
J. Chen, S. Sathe, C. Aggarwal and D. Turaga. Outlier detection with ensembles of autoencoders. In preparation, 2017.
J. Gao, P.-N. Tan. Converting output scores from outlier detection algorithms into probability estimates. ICDM Conference, 2006.
Z. He, S. Deng and X. Xu. A Unified Subspace Outlier Ensemble Framework for Outlier Detection, Advances in Web Age Information Management, 2005.
M. Kendall. A New Measure of Rank Correlation. Biometrika, 30(1/2), 81–93, 1938.
A. Lazarevic, and V. Kumar. Feature Bagging for Outlier Detection, ACM KDD Conference, 2005.
B. Micenkova, B. McWiliams, and I. Assent. Learning Outlier Ensembles: The Best of Both Worlds – Supervised and Unsupervised. Outlier Detection and Description Workshop, 2014. Extended version: http://arxiv.org/pdf/1507.08104v1.pdf.
S. Papadimitriou, H. Kitagawa, P. Gibbons, and C. Faloutsos, LOCI: Fast outlier detection using the local correlation integral, ICDE Conference, 2003.
M. Shyu, S. Chen, K. Sarinnapakorn, L. Chang. A novel anomaly detection scheme based on principal component classifier. ICDMW, 2003.
D. Wolpert. Stacked Generalization, Neural Networks, 5(2), pp. 241–259, 1992.
A. Zimek, R. Campello, J. Sander. Ensembles for unsupervised outlier detection: Challenges and research questions, SIGKDD Explorations, 15(1), 2013.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this chapter
Cite this chapter
Aggarwal, C.C., Sathe, S. (2017). Model Combination Methods for Outlier Ensembles. In: Outlier Ensembles. Springer, Cham. https://doi.org/10.1007/978-3-319-54765-7_5
Download citation
DOI: https://doi.org/10.1007/978-3-319-54765-7_5
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-54764-0
Online ISBN: 978-3-319-54765-7
eBook Packages: Computer ScienceComputer Science (R0)