Skip to main content

CAIG: Classification Based on Attribute-Value Pair Integrate Gain

  • Conference paper
Book cover Database Theory and Application, Bio-Science and Bio-Technology (BSBT 2011, DTA 2011)

Abstract

Many studies have shown that rule-based classifiers perform well in classifying categorical data. However, a limitation of many of them lies in the use of only one measure to select the best attribute-value pair. In this way, many attribute-value pairs have the same best values. Consequently, it is difficulty to distinguish which attribute-value pair is the best.On the other hand, these classifiers usually combine two best attribute-value pairs to generate rules, whether they bias toward the same class label or not. In this paper, we propose a new classification approach named CAIG (Classification based on Attribute-value pair Integrate Gain) which has a number of new features. First, it uses multi measures to select the best attribute-value pairs. Second, it divides attribute-value pairs into different groups according to the class label. Third, it arranges attribute-value pairs to the same groups if they bias to same class label. Fourth, it adopts a greedy algorithm to generate rules from the theses groups. Experimental results show that the method of multi measures is highly accuracy in comparison with the one measure.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Quinlan, J.R.: C4.5: Programs for machine learning. Morgan Kaufmann (1993)

    Google Scholar 

  2. Zhu, W., Wang, F.: Reduction and axiomization of covering generalized rough sets. Information Sciences 152(1), 217–230 (2003)

    Article  MathSciNet  MATH  Google Scholar 

  3. Zhu, W.: Generalized rough sets based on relations. Information Sciences 177(22), 4997–5011 (2007)

    Article  MathSciNet  MATH  Google Scholar 

  4. Min, F., Liu, Q., Fang, C.: Rough sets approach to symbolic value partition. International Journal of Approximate Reasoning 49, 689–700 (2008)

    Article  MathSciNet  MATH  Google Scholar 

  5. Zhu, W.: Relationship among basic concepts in covering-based rough sets. Information Sciences 179(14), 2478–2486 (2009)

    Article  MathSciNet  MATH  Google Scholar 

  6. Min, F., He, H., Qian, Y., Zhu, W.: Test-cost-sensitive attribute reduction. To Appear in Information Sciences (2011)

    Google Scholar 

  7. Min, F., Liu, Q.: A hierarchical model for test-cost-sensitive decision systems. Information Sciences 179(14), 2442–2452 (2009)

    Article  MathSciNet  MATH  Google Scholar 

  8. Yao, Y.Y.: Granular computing: basic issues and possible solutions. In: The 5th Joint Conference on Information Sciences, vol. 1, pp. 186–189 (2000)

    Google Scholar 

  9. Yao, J.T., Yao, Y.Y.: A granular computing approach to machine learning (2002), http://www2.cs.uregina.ca/jtyao/Pagers/GrcMining-1534.pdf

  10. Yao, J.T., Yao, Y.Y.: Induction of Classification Rules by Granular Computing. In: Alpigini, J.J., Peters, J.F., Skowron, A., Zhong, N. (eds.) RSCTC 2002. LNCS (LNAI), vol. 2475, p. 331. Springer, Heidelberg (2002)

    Chapter  Google Scholar 

  11. Liu, B., Hsu, W., Ma, Y.: Integrating classification and association rule mining. In: KDD, pp. 80–86 (1998)

    Google Scholar 

  12. Dong, G., Zhang, X., Wong, L., Li, J.: CAEP: Classification by Aggregating Emerging Patterns. In: Arikawa, S., Nakata, I. (eds.) DS 1999. LNCS (LNAI), vol. 1721, pp. 30–42. Springer, Heidelberg (1999)

    Chapter  Google Scholar 

  13. Li, W., Han, J., Pei, J.: CMAR: accurate and efficient classification based on multiple class-association rules. In: ICDM, pp. 369–376 (2001)

    Google Scholar 

  14. Cover, T., Hart, P.: Nearest neighbor pattern classification. IEEE Transactions on Information Theory 13, 21–27 (1967)

    Article  MATH  Google Scholar 

  15. Dasarathy, B.: Nearest Neighbor Pattern Classification Techniques. IEEE Computer Society Press (1990)

    Google Scholar 

  16. Bian, H.Y.: Fuzzy-rough nearest neighbor classification: an integrated framework. In: Proceedings of IASTED International Symposium on Artificial intelligence and Applications, pp. 160–164 (2002)

    Google Scholar 

  17. Guo, G., Wang, H., Bell, D.A., Bi, Y., Greer, K.: An kNN model-based approach and its application in text categorization. In: Gelbukh, A. (ed.) CICLing 2004. LNCS, vol. 2945, pp. 559–570. Springer, Heidelberg (2004)

    Chapter  Google Scholar 

  18. Neskovic, P., Cooper, L.N.: Improving nearest neighbor rule with a simple adaptive distance measure. Pattern Recognition 28(2), 21–27 (2007)

    Google Scholar 

  19. Yin, X., Han, J.: CPAR: classification based on predictive association rules. In: SDM, pp. 331–335 (2003)

    Google Scholar 

  20. Quinlan, J.R., Cameron-Jones, R.: FOIL: A Midterm Report. In: Brazdil, P.B. (ed.) ECML 1993. LNCS, vol. 667, pp. 3–20. Springer, Heidelberg (1993)

    Google Scholar 

  21. Cohen, W.: Fast effective rule induction. In: ICML, pp. 115–123 (1995)

    Google Scholar 

  22. Cendrowska, J.: PRISM: An algorithm for inducing modular rules (1988)

    Google Scholar 

  23. Clark, Boswell, R.: Rule induction with CN2:some recent improvements. In: EWSL, pp. 151–163 (1991)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2011 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

He, T., Zhou, Z., Huang, Z., Wang, X. (2011). CAIG: Classification Based on Attribute-Value Pair Integrate Gain. In: Kim, Th., et al. Database Theory and Application, Bio-Science and Bio-Technology. BSBT DTA 2011 2011. Communications in Computer and Information Science, vol 258. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-27157-1_4

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-27157-1_4

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-27156-4

  • Online ISBN: 978-3-642-27157-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics