Advertisement

On Complementarity of Cluster and Outlier Detection Schemes

  • Zhixiang Chen
  • Ada Wai-Chee Fu
  • Jian Tang
Part of the Lecture Notes in Computer Science book series (LNCS, volume 2737)

Abstract

We are interested in the problem of outlier detection, which is the discovery of data that deviate a lot from other data patterns. Hawkins [7] characterizes an outlier in a quite intuitive way as follows: An outlier is an observation that deviates so much from other observations as to arouse suspicion that it was generated by a different mechanism.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Ankerst, M., Breunig, M., Kriegel, H.P., Sander, J.: OPTICS: Ordering points to identify the cluster structure. In: Proc. of ACM-SIGMOD Conf., pp. 49–60 (1999)Google Scholar
  2. 2.
    Arning, A., Agrawal, R., Raghavan, P.: A Linear Method for Deviation detection in Large Databases. In: Proc. of 2nd Intl. Conf. On Knowledge Discovery and Data Mining, pp. 164–169 (1996)Google Scholar
  3. 3.
    Barnett, V., Lewis, T.: Outliers in Statistical Data. John Wiley, Chichester (1994)zbMATHGoogle Scholar
  4. 4.
    Breuning, M., Kriegel, H.-P., Ng, R., Sander, J.: LOF: Identifying density-based Local Outliers. In: Proc. of the ACM SIGMOD Conf. (2000)Google Scholar
  5. 5.
    DuMouchel, W., Schonlau, M.: A Fast Computer Intrusion Detection Algorithm based on Hypothesis Testing of Command Transition Probabilities. In: Proc. of 4th Intl. Conf. On Knowledge Discovery and Data Mining, pp. 189–193 (1998)Google Scholar
  6. 6.
    Ester, M., Kriegel, H., Sander, J., Xu, X.: A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise. In: Proc. of 2nd Intl. Conf. On Knowledge Discovery and Data Mining, pp. 226–231 (1996)Google Scholar
  7. 7.
    Fawcett, T., Provost, F.: Adaptive Fraud Detection. Data Mining and Knowledge Discovery Journal 1(3), 291–316 (1997)CrossRefGoogle Scholar
  8. 8.
    Guha, S., Rastogi, R., Shim, K.: Cure: An Efficient Clustering Algorithm for Large Databases. In: Proc. of the ACM SIGMOD Conf., pp. 73–84 (1998)Google Scholar
  9. 9.
    Hawkins, D.: Identification of Outliers. Chapman and Hall, London (1980)zbMATHGoogle Scholar
  10. 10.
    Knorr, E., Ng, R.: Algorithms for Mining Distance-based Outliers in Large Datasets. In: Proc. of 24th Intl. Conf. On VLDB, pp. 392–403 (1998)Google Scholar
  11. 11.
    Knorr, E., Ng, R.: Finding Intensional Knowledge of Distance-based Outliers. In: Proc. of 25th Intl. Conf. On VLDB, pp. 211–222 (1999)Google Scholar
  12. 12.
    Ng, R., Han, J.: Efficient and Effective Clustering Methods for Spatial Data Mining. In: Proc. of 20th Intl. Conf. On Very Large Data Bases, pp. 144–155 (1994)Google Scholar
  13. 13.
    Ramaswamy, S., Rastogi, R., Kyuseok, S.: Efficient Algorithms for Mining Outliers from Large Data Sets. In: Proc. of ACM SIGMOD Conf., pp. 427–438 (2000)Google Scholar
  14. 14.
    Roussopoulos, N., Kelley, S., Vincent, F.: Nearest Neighbor Queries. In: Proc. of ACM SIGMOD Conf., pp. 71–79 (1995)Google Scholar
  15. 15.
    Sheikholeslami, G., Chatterjee, S., Zhang, A.: WaveCluster: A multi-Resolution Clustering Approach for Very Large Spatial Databases. In: Proc. of 24th Intl. Conf. On Very Large Data Bases, pp. 428–439 (1998)Google Scholar
  16. 16.
    Tang, J., Chen, Z., Fu, A., Cheung, D.: A Robust Outlier Detection Scheme in Large Data Sets. In: PAKDD (2002)Google Scholar
  17. 17.
    Zhang, T., Ramakrishnan, R., Linvy, M.: BIRCH: An Efficient Data Clustering Method for Very Large Databases. In: Proc. of ACM SIGMOD Intl. Conf., pp. 103–114 (1996)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2003

Authors and Affiliations

  • Zhixiang Chen
    • 1
  • Ada Wai-Chee Fu
    • 2
  • Jian Tang
    • 2
  1. 1.Department of Computer ScienceUniversity of Texas-Pan AmericanEdinburgUSA
  2. 2.Department of Computer Science and EngineeringChinese University of Hong KongShatin, N.T., Hong Kong

Personalised recommendations