Object-Oriented Database Mining: Use of Object Oriented Concepts for Improving Data Classification Technique

  • Kitsana Waiyamai
  • Chidchanok Songsiri
  • Thanawin Rakthanmanon
Part of the Lecture Notes in Computer Science book series (LNCS, volume 3036)

Abstract

Complex objects are organized into a class/subclass hierarchy in which each object attribute may itself be composed of other complex objects. Most existing work on complex data classification starts by generalizing objects to an appropriate abstraction level before the classification process. Generalizing prior to classification produces less accurate results than integrating generalization into the classification process. This paper proposes CO4.5, an approach for generating decision trees for complex objects. CO4.5 classifies complex objects directly, using the inheritance and composition relationships stored in object-oriented databases. Experimental results on large complex datasets show that CO4.5 yields better accuracy than traditional data classification techniques.
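As a rough illustration of what integrating generalization into the classification process could look like, the following Python sketch chooses the abstraction level of one attribute at split time by comparing C4.5-style gain ratios along a class hierarchy. It is not the CO4.5 algorithm, whose details the abstract does not give. The hierarchy in `PARENT`, the function names, and the sample data are all hypothetical.

```python
from collections import Counter
from math import log2

# Hypothetical class hierarchy (child -> parent); not taken from the paper.
PARENT = {
    "sedan": "car", "hatchback": "car",
    "road_bike": "bicycle", "mountain_bike": "bicycle",
    "car": "vehicle", "bicycle": "vehicle",
}

def generalize(cls, steps):
    """Climb up to `steps` levels of the hierarchy, stopping at the root."""
    for _ in range(steps):
        if cls not in PARENT:
            break
        cls = PARENT[cls]
    return cls

def entropy(items):
    """Shannon entropy (in bits) of the value distribution in `items`."""
    total = len(items)
    return -sum((n / total) * log2(n / total) for n in Counter(items).values())

def gain_ratio(values, labels):
    """C4.5-style gain ratio obtained by splitting `labels` on `values`."""
    total = len(labels)
    groups = {}
    for value, label in zip(values, labels):
        groups.setdefault(value, []).append(label)
    remainder = sum(len(g) / total * entropy(g) for g in groups.values())
    split_info = entropy(values)
    if split_info == 0:  # a single branch carries no information
        return 0.0
    return (entropy(labels) - remainder) / split_info

def best_abstraction_level(classes, labels, max_steps=2):
    """Return the number of generalization steps with the highest gain ratio."""
    def score(steps):
        return gain_ratio([generalize(c, steps) for c in classes], labels)
    return max(range(max_steps + 1), key=score)

# Hypothetical training data: object class and target label.
classes = ["sedan", "hatchback", "road_bike", "mountain_bike"]
labels = ["license", "license", "no_license", "no_license"]
level = best_abstraction_level(classes, labels)
```

With this toy data, splitting on the leaf classes separates the labels but over-fragments the data, while splitting one level up (`car` versus `bicycle`) separates them equally well with fewer branches, so the gain ratio favors the intermediate level. Generalizing everything to a fixed level beforehand would rule out that comparison.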

Copyright information

© Springer-Verlag Berlin Heidelberg 2004

Authors and Affiliations

  • Kitsana Waiyamai (1)
  • Chidchanok Songsiri (1)
  • Thanawin Rakthanmanon (1)
  1. Computer Engineering Department, Kasetsart University, Thailand
