Abstract
This section compares the performance of the information-theoretic network to leading classification methods of data mining. The comparison is based on public datasets and it includes the following performance criteria:
-
Dimensionality Reduction is measured by the portion of candidate-input attributes removed by the algorithm (excluded from the network) and by the size of the information-theoretic network vs. other predictive models.
-
Prediction Accuracy is the average accuracy of the network on validation cases vs. published accuracy of other classifiers.
-
Stability represents the ability of the algorithms to provide similar results from different random samples of the same dataset. The benchmark classification methods used for the comparison include:
-
Naive Bayes Classifier. This is a probabilistic method assuming conditional independence of all input attributes. See the details of the algorithm in (Mitchell, 1997).
-
C4.5. This is a state-of-the-art decision tree algorithm presented in (Quinlan, 1993). Today, most commercial tools for constructing decision trees are based on C4.5 or one of its modified versions.
-
CART ™ This is an earlier decision tree method (Breiman et al., 1984). It is used as the engine of a commercial tool, having the same name, which is available from Salford Systems ( http://www.salford-systems.com/ ).
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
Author information
Authors and Affiliations
Rights and permissions
Copyright information
© 2001 Springer Science+Business Media Dordrecht
About this chapter
Cite this chapter
Maimon, O., Last, M. (2001). Comparative Study. In: Knowledge Discovery and Data Mining. Massive Computing, vol 1. Springer, Boston, MA. https://doi.org/10.1007/978-1-4757-3296-2_7
Download citation
DOI: https://doi.org/10.1007/978-1-4757-3296-2_7
Publisher Name: Springer, Boston, MA
Print ISBN: 978-1-4419-4842-7
Online ISBN: 978-1-4757-3296-2
eBook Packages: Springer Book Archive