Abstract
One of the most important problems on rule induction methods is that they cannot extract rules, which plausibly represent experts' decision processes. On one hand, rule induction methods induce probabilistic rules, the description length of which is too short, compared with the experts' rules. On the other hand, construction of Bayesian networks generates too lengthy rules. In this paper, the characteristics of experts' rules are closely examined and a new approach to extract plausible rules is introduced, which consists of the following three procedures. First, the characterization of decision attributes (given classes) is extracted from databases and the classes are classified into several groups with respect to the characterization. Then, two kinds of sub-rules, characterization rules for each group and discrimination rules for each class in the group are induced. Finally, those two parts are integrated into one rule for each decision attribute. The proposed method is evaluated on medical databases, the experimental results of which show that induced rules correctly represent experts' decision processes.
Chapter PDF
Similar content being viewed by others
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
References
Clark, P., Niblett, T. (1989). The CN2 Induction Algorithm. Machine Learning, 3, 261–283.
Everitt. Cluster Analysis, 3rd Edition, 1996.
Michalski, R. S., Mozetic, I., Hong, J., and Lavrac, N. (1986). The Multi-Purpose Incremental Learning System AQ15 and its Testing Application to Three Medical Domains. Proceedings of the fifth National Conference on Artificial Intelligence, 1041–1045, AAAI Press, Palo Alto, CA.
Pawlak, Z. (1991). Rough Sets. Kluwer Academic Publishers, Dordrecht.
Quinlan, J.R. (1993). C4.5 — Programs for Machine Learning, Morgan Kaufmann, CA.
SWI-Prolog Version 2.0.9 Manual, (1995). University of Amsterdam.
Tsumoto, S. and Tanaka, H. PRIMEROSE: Probabilistic Rule Induction Method based on Rough Sets and Resampling Methods. Computational Intelligence, 11, 389–405, 1995.
Tsumoto, S. Empirical Induction of Medical Expert System Rules based on Rough Set Model. PhD dissertation, 1997 (in Japanese).
Ziarko, W. (1991). The Discovery, Analysis, and Representation of Data Dependencies in Databases. in: Shapiro, G. P. and Frawley, W. J. (eds), Knowledge Discovery in Databases, AAAI press, Palo Alto, CA, pp. 195–209.
Ziarko, W. (1993). Variable Precision Rough Set Model. Journal of Computer and System Sciences, 46, 39–59.
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 1997 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Tsumoto, S. (1997). Extraction of experts' decision process from clinical databases using rough set model. In: Komorowski, J., Zytkow, J. (eds) Principles of Data Mining and Knowledge Discovery. PKDD 1997. Lecture Notes in Computer Science, vol 1263. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-63223-9_106
Download citation
DOI: https://doi.org/10.1007/3-540-63223-9_106
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-63223-8
Online ISBN: 978-3-540-69236-2
eBook Packages: Springer Book Archive