Outlier Mining Process Model and Its Application

  • Huan Zhou
  • Lian Hu
  • Yi-mu Sun
Part of the Advances in Intelligent and Soft Computing book series (AINSC, volume 165)


As an active research field of Data Mining, outlier mining has important application in business. Previous studies focused more on Mining algorithm and its improvement, and less attention was paid on Outlier Mining process and its specifical application in industries. This paper presents an Outlier Mining Process Model based on the analysis of a data mining model process. In this paper, each step of the process is discussed in detail with an empirical application in security market for illustration.


Outlier Mining Process Data Mining Model Application 


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    Han, J., Kamber, M.: Data Mining: Concept and Techniques, pp. 254–259. Morgan Kaufmann Publishers, New York (2001)Google Scholar
  2. 2.
    Wang, H.-D., Tong, Y.-H., Tan, S.-H., Tan, S.-W., Yang, D.-Q.: Research progress on outlier mining. CAAI Transactions on Intelligent Systems 1, 67–73 (2006)zbMATHGoogle Scholar
  3. 3.
  4. 4.
    CRISP-DM Process Model(EB/OL),
  5. 5.
    KD nuggets. What Main Methodology Are You Using for Data Mining?(EB/OL),
  6. 6.
    Fayyad, U.M., Piatetsky-Shapiro, G., Smyth, P.: Adavance In Knowledge Dicovery And Data Mining, pp. 1–36. AAAI/MIT Press (1996)Google Scholar
  7. 7.
    John, G.H.: Enhancements to the Data Mining Process. Ph.D thesis of Stanford University (1997)Google Scholar
  8. 8.
    Brachman, R.J., Anand, T.: The Process of Knowledge Discovery In Databases: A Human-Centered Approach. In: Fayyad, U.M., Piatetsky-Shapiro, G., Smyth, P., Uthurusamy, R. (eds.) Advances In Knowledge Discovery And Data Mining, pp. 37–57. AAAI Press/The MIT Press, Menlo Park, CA (1996)Google Scholar
  9. 9.
    Rud, O.P.: Data Mining Cookbook: Modeling Data for Marketing, Risk, and Customer Relationship Management, pp. 3–19. John Wiley & Sons Inc. (2003)Google Scholar
  10. 10.
    Xie, B.-C.: Business Intelligence and Data Mining, pp. 13–15. China Machine Press, Beijing (2008)Google Scholar
  11. 11.
    Kantardzi, M.: Data Mining-Concepts, Models, Methods and Algorithms, pp. 5–8. Wiley-IEEE Press (2002)Google Scholar
  12. 12.
    Barnett, V., Lewis, T.: Outliers in Statistical Data. John Wiley & Sons, New York (1994)zbMATHGoogle Scholar
  13. 13.
    Knorr, E., Ng, R.: A unified notion of outliers: Properties and computation. In: Proc. 1997 Int. Conf. Knowledge Discovery and Data Mining (KDD 1997), Newport Beach, CA, pp. 219–222 (August 1997)Google Scholar
  14. 14.
    Knorr, E., Ng, R.: Algorithms for mining distance-based outliers in large datasets. In: Proc. 1998 Int. Conf. Very Large Data Bases (VLDB 1998), New York, pp. 392–403 (August 1998)Google Scholar
  15. 15.
    Arning, A., Agrawal, R., Raghavan, P.: A linear method for deviation detection in large databases. In: Proc. 1996 Int. Conf. Knowledge Discovery and Data Mining (KDD 1996), Portland, OR, pp. 164–169 (August 1996)Google Scholar
  16. 16.
    Breunig, M.M., Kriegel, H.-P., Ng, R.T., Sander, J.: LOF: Identifying density-based local outliers. In: Proc. 2000 ACM SIGMOD Int. Conf. Management of Data (SIGMOD 2000), Dallas Texa, pp. 93–104 (2000)Google Scholar
  17. 17.
    Aggarwal, C., Yu, P.: Outlier detection for high dimensional data. In: Proc. 2001 ACM SIGMOD Int. Conf. Management of data (SIGMOD 2000), New York, NY, USA, pp. 37–46 (2001)Google Scholar
  18. 18.
    Jiang, M.F., Tseng, S.S., Su, C.M.: Two-phase clustering process for outliers detection. Computational Statistics and Data Analysis 36, 351–382 (2001)MathSciNetCrossRefGoogle Scholar

Copyright information

© Springer-Verlag GmbH Berlin Heidelberg 2012

Authors and Affiliations

  • Huan Zhou
    • 1
  • Lian Hu
    • 1
  • Yi-mu Sun
    • 2
  1. 1.School of Information ManagementShanghai Finance UniversityShanghaiChina
  2. 2.School of Economics and ManagementTongji UniversityShanghaiChina

Personalised recommendations