Skip to main content

ELEM2: A learning system for more accurate classifications

  • Learning
  • Conference paper
  • First Online:

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 1418))

Abstract

We present ELEM2, a new method for inducing classification rules from a set of examples. The method employs several new strategies in the induction and classification processes to improve the predictive performance of induced rules. In particular, a new heuristic function for evaluating attribute-value pairs is proposed. The function is defined to reflect the degree of relevance of an attribute-value pair to a target concept and leads to selection of the most relevant pairs for formulating rules. Another feature of ELEM2 is that it handles inconsistent training data by defining an unlearnable region of a concept based on the probability distribution of that concept in the training data. To further deal with imperfect data, ELEM2 makes use of the post-pruning technique to remove unreliable portions of a generated rule. A new rule quality measure is proposed for the purpose of post-pruning. The measure is defined according to the relative distribution of a rule with respect to positive and negative examples. To show whether ELEM2 achieves its objective, we report experimental results which compare ELEM2 with C4.5 and CN2 on a number of datasets.

This is a preview of subscription content, log in via an institution.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. An, A. 1997. Analysis Methodologies for Integrated and Enhanced Problem Solving. Ph.D. Thesis, Dept. of Computer Science, University of Regina, Regina, Canada.

    Google Scholar 

  2. Brodley, C.E. 1993. “Addressing the Selective Superiority Problem: Automatic Algorithm/model Class Selection.” Proceedings of the 10th Machine Learning Conference. pp.17–24.

    Google Scholar 

  3. Cendrowska, J. 1988. “PRISM: An Algorithm for Inducing Modular Rules”. In Gaines, B. and Boose, J. (eds.): Knowledge Acquisition for Knowledge-Based Systems. Academic Press.

    Google Scholar 

  4. Clark, P. and Niblett, T. 1989. “The CN2 Induction Algorithm”. Machine Learning, 3, pp.261–283.

    Google Scholar 

  5. Cooper, W.S. 1973. “On Selecting a Measure of Retrieval Effectiveness.” Journal of the American Society for Information Science. Vo1.24.

    Google Scholar 

  6. Creecy, R.H., Masand, B.M., Smith, S.J. and Waltz, D.L. 1992. “Trading MIPS and Memory for Knowledge Engineering”. Communications of the ACM, 35, pp.48–64.

    Article  Google Scholar 

  7. Domingos, P. 1995. “Rule Induction and Instance-Based Learning: A Unified Approach.” IJCAI-95. Montreal, Canada. pp.1226–1232.

    Google Scholar 

  8. Grzymala-Busse, J.W. 1992. “LERS-A System for Learning From Examples Based on Rough Sets”, in Slowinski, R.(ed.): Intelligent Decision Support: Handbook of Applications and Advances of Rough Sets Theory, Kluwer Academic Publishers, pp.318.

    Google Scholar 

  9. Hamilton, H.J., Shan, N. and Cercone, N. 1996. “RIAC: A Rule Induction Algorithm Based on Approximate Classification”. Technical Report CS-96-06, University of Regina.

    Google Scholar 

  10. Holte, R., Acker, L. and Porter, B. 1989. “Concept Learning and the Problem of Small Disjuncts”. Proceedings of the Eleventh International Joint Conference on Artificial Intelligence, Detroit, Michigan.

    Google Scholar 

  11. Kerber, R. 1992. “ChiMerge: Discretization of Numeric Attributes”, Proceedings of the 10th National Conference on Artificial Intelligence AAAI-92, San Jose, CA.

    Google Scholar 

  12. Michalski, R.S., Mozetic, I., Hong, J. and Lavrac, N. 1986. “The Multi-Purpose Incremental Learning System AQl5 and Its Testing Application to Three Medical Domains”. Proceedings of AAAI 1986. pp.1041–1045.

    Google Scholar 

  13. Murphy, P.M. and Aha, D.W. 1994. UCI Repository of Machine Learning Databases. URL: http://www.ics.uci.edu/ mlearn/MLRepository.html. For information contact ml-repository@ics.uci.edu.

    Google Scholar 

  14. Quinlan, J.R. 1983. “Learning efficient classification procedures and their application to chess end games”. In Michalski, R.S., Carbonell, J.G. and Mitchell, T.M. (eds.): Machine Learning: An Artificial Intelligence Approach. Vol.1.

    Google Scholar 

  15. Quinlan, J.R. 1993. C4.5: Programs for Machine Learning. Morgan Kaufmann Publishers. San Mateo, CA.

    Google Scholar 

  16. Wnek, J., Sarma, J., Wahab, A. and Michalski, R. 1990. “Comparison Learning Paradigms via Diagrammatic Visualization: A Case Study in Single Concept Learning Using Symbolic, Neural Net and Genetic Algorithm Methods”. Technical Report, Computer Science Department, George Mason University.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Robert E. Mercer Eric Neufeld

Rights and permissions

Reprints and permissions

Copyright information

© 1998 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

An, A., Cercone, N. (1998). ELEM2: A learning system for more accurate classifications. In: Mercer, R.E., Neufeld, E. (eds) Advances in Artificial Intelligence. Canadian AI 1998. Lecture Notes in Computer Science, vol 1418. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-64575-6_68

Download citation

  • DOI: https://doi.org/10.1007/3-540-64575-6_68

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-64575-7

  • Online ISBN: 978-3-540-69349-9

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics