T3: A Classification Algorithm for Data Mining

Tjortjis, Christos; Keane, John

doi:10.1007/3-540-45675-9_9

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 2412)

Included in the following conference series:

International Conference on Intelligent Data Engineering and Automated Learning

Abstract

This paper describes and evaluates T3, an algorithm that builds trees of depth at most three, and results in high accuracy whilst keeping the size of the tree reasonably small. T3 is an improvement over T2 in that it builds larger trees and adopts a less greedy approach. T3 gave better results than both T2 and C4.5 when run against publicly available data sets: T3 decreased classification error on average by 47% and generalisation error by 29%, compared to T2; and T3 resulted in 46% smaller trees and 32% less classification error compared to C4.5. Due to its way of handling unknown values, T3 outperforms C4.5 in generalisation by 99% to 66%, on a specific medical dataset.
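The depth bound mentioned in the abstract can be illustrated with a minimal sketch. This is not the T3 algorithm itself, whose details are not given here: it is a plain greedy, error-minimising tree builder for categorical attributes, capped at a maximum depth of three. The function names (`build`, `predict`, `depth_of`, `error_rate`), the toy data, and the fallback to a node's majority class for unseen values are all assumptions made for illustration.

```python
from collections import Counter

def majority(labels):
    """Return the most common label in a non-empty list."""
    return Counter(labels).most_common(1)[0][0]

def build(rows, labels, max_depth=3, depth=0):
    """Build a depth-capped tree (illustrative only, not T3).

    A leaf is a bare label (labels must not be tuples); an internal
    node is (attribute_index, {value: subtree}, default_label).
    """
    default = majority(labels)
    if depth == max_depth or len(set(labels)) == 1:
        return default
    best = None  # (training errors after split, attribute index)
    for a in range(len(rows[0])):
        groups = {}
        for r, y in zip(rows, labels):
            groups.setdefault(r[a], []).append(y)
        if len(groups) < 2:
            continue  # attribute does not separate the rows
        errors = sum(len(ys) - Counter(ys).most_common(1)[0][1]
                     for ys in groups.values())
        if best is None or errors < best[0]:
            best = (errors, a)
    if best is None:
        return default
    a = best[1]
    children = {}
    for v in {r[a] for r in rows}:
        idx = [i for i, r in enumerate(rows) if r[a] == v]
        children[v] = build([rows[i] for i in idx],
                            [labels[i] for i in idx],
                            max_depth, depth + 1)
    return (a, children, default)

def predict(tree, row):
    """Classify one row; unseen values fall back to the node's majority
    class (an assumption, not T3's unknown-value handling)."""
    while isinstance(tree, tuple):
        a, children, default = tree
        if row[a] not in children:
            return default
        tree = children[row[a]]
    return tree

def depth_of(tree):
    """Number of tests on the longest root-to-leaf path."""
    if not isinstance(tree, tuple):
        return 0
    return 1 + max(depth_of(c) for c in tree[1].values())

def error_rate(tree, rows, labels):
    """Fraction of rows the tree misclassifies."""
    wrong = sum(predict(tree, r) != y for r, y in zip(rows, labels))
    return wrong / len(rows)

# Toy XOR-style data: no single attribute separates the classes.
rows = [(0, 0), (0, 1), (1, 0), (1, 1)]
labels = ["n", "y", "y", "n"]
tree = build(rows, labels, max_depth=3)
print(depth_of(tree), error_rate(tree, rows, labels))
```

On this toy data a depth-one tree cannot do better than 50% training error, while two levels suffice to classify every row, which shows in miniature why allowing a deeper (but still bounded) tree can lower classification error.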

Author information

Authors and Affiliations

UMIST, Department of Computation, P.O. Box 88, Manchester, M60 1QD, UK
Christos Tjortjis & John Keane

Editor information

Editors and Affiliations

Department of Electrical Engineering and Electronics, UMIST, Manchester, M60 1QD, UK
Hujun Yin, Nigel Allinson & Richard Freeman
Department of Computation, UMIST, Manchester, M60 1QD, UK
John Keane
Department of Biomolecular Science, UMIST, Manchester, M60 1QD, UK
Simon Hubbard

About this paper

Cite this paper

Tjortjis, C., Keane, J. (2002). T3: A Classification Algorithm for Data Mining. In: Yin, H., Allinson, N., Freeman, R., Keane, J., Hubbard, S. (eds) Intelligent Data Engineering and Automated Learning — IDEAL 2002. IDEAL 2002. Lecture Notes in Computer Science, vol 2412. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45675-9_9

DOI: https://doi.org/10.1007/3-540-45675-9_9
Published: 20 August 2002
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-44025-3
Online ISBN: 978-3-540-45675-9
eBook Packages: Springer Book Archive