Quality Measures and Semi-automatic Mining of Diagnostic Rule Bases

Atzmueller, Martin; Baumeister, Joachim; Puppe, Frank

doi:10.1007/11415763_5

Quality Measures and Semi-automatic Mining of Diagnostic Rule Bases

Martin Atzmueller²²,
Joachim Baumeister²² &
Frank Puppe²²

Conference paper

500 Accesses
7 Citations

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 3392))

Abstract

Semi-automatic data mining approaches often yield better results than plain automatic methods, due to the early integration of the user’s goals. For example in the medical domain, experts are likely to favor simpler models instead of more complex models. Then, the accuracy of discovered patterns is often not the only criterion to consider. Instead, the simplicity of the discovered knowledge is of prime importance, since this directly relates to the understandability and the interpretability of the learned knowledge.

In this paper, we present quality measures considering the understandability and the accuracy of (learned) rule bases. We describe a unifying quality measure, which can trade-off small losses concerning accuracy vs. an increased simplicity. Furthermore, we introduce a semi-automatic data mining method for learning understandable and accurate rule bases. The presented work is evaluated using cases from a real world application in the medical domain.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Ho, T., Saito, A., Kawasaki, S., Nguyen, D., Nguyen, T.: Failure and Success Experience in Mining Stomach Cancer Data. In: International Workshop Data Mining Lessons Learned, International Conf. Machine Learning, pp. 40–47 (2002)
Google Scholar
Gamberger, D., Lavrac, N.: Expert-Guided Subgroup Discovery: Methodology and Application. Journal of Artificial Intelligence Research 17, 501–527 (2002)
MATH Google Scholar
Huettig, M., Buscher, G., Menzel, T., Scheppach, W., Puppe, F., Buscher, H.P.: A Diagnostic Expert System for Structured Reports, Quality Assessment, and Training of Residents in Sonography. Medizinische Klinik 99, 117–122 (2004)
Article Google Scholar
Puppe, F., Ziegler, S., Martin, U., Hupp, J.: Wissensbasierte Diagnosesysteme im Service-Support (Diagnostic Knowledge Systems for the Service-Support). Springer, Heidelberg (2001)
Google Scholar
Ohmann, C., et al.: Clinical Benefit of a Diagnostic Score for Appendicitis: Results of a Prospective Interventional Study. Archives of Surgery 134, 993–996 (1999)
Article Google Scholar
Miller, R., Pople, H.E., Myers, J.: Internist-1, an Experimental Computer-Based Diagnostic Consultant for General Internal Medicine. NEJM 307, 468–476 (1982)
Article Google Scholar
Neumann, M., Baumeister, J., Liess, M., Schulz, R.: An Expert System to Estimate the Pesticide Contamination of Small Streams using Benthic Macroinvertebrates as Bioindicators, Part 2. Ecological Indicators 2, 391–401 (2003)
Article Google Scholar
Tuzhilin, A.: Usefulness, Novelty, and Integration of Interestingness Measures. In: Klösgen, Z. (ed.) Handbook of Data Mining and Knowledge Discovery, ch. 19.2.2. Oxford University Press, New York (2002)
Google Scholar
Freitas, A.A.: On Rule Interestingness Measures. Knowledge-Based Systems 12, 309–325 (1999)
Article Google Scholar
Lewis, D.D., Gale, W.A.: A Sequential Algorithm for Training Text Classifiers. In: Proc. of the 17th ACM International Conference on Research and Development in Information Retrieval (SIGIR 1994), London, pp. 3–12. Springer, Heidelberg (1994)
Google Scholar
Witten, I.H., Frank, E.: Data Mining: Practical Machine Learning Tools and Techniques with Java Implementations. Morgan Kaufmann, San Francisco (1999)
Google Scholar
Mitchell, T.: Machine Learning. McGraw-Hill Comp., New York (1997)
MATH Google Scholar
Yen, S.J., Chen, A.L.P.: An Efficient Algorithm for Deriving Compact Rules from Databases. In: Ling, M. (ed.) Proceedings of the 4th International Conference on Database Systems for Advanced Applications 1995, pp. 364–371. World Scientific, Singapore (1995)
Google Scholar
Baumeister, J., Atzmueller, M., Puppe, F.: Inductive Learning for Case-Based Diagnosis with Multiple Faults. In: Craw, S., Preece, A.D. (eds.) ECCBR 2002. LNCS (LNAI), vol. 2416, pp. 28–42. Springer, Heidelberg (2002)
Chapter Google Scholar
Han, J., Kamber, M.: Data Mining: Concepts and Techniques. Morgan Kaufmann Publishers, San Mateo (2000)
Google Scholar
Puppe, F.: Knowledge Reuse Among Diagnostic Problem-Solving Methods in the Shell-Kit D3. Int. J. Human-Computer Studies 49, 627–649 (1998)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science, University of Würzburg, 97074, Würzburg, Germany
Martin Atzmueller, Joachim Baumeister & Frank Puppe

Authors

Martin Atzmueller
View author publications
You can also search for this author in PubMed Google Scholar
Joachim Baumeister
View author publications
You can also search for this author in PubMed Google Scholar
Frank Puppe
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Universität Würzburg, Am Hubland, 97074, Würzburg, Germany
Dietmar Seipel
Institut für Informatik, CAU Kiel, Germany
Michael Hanus
Fraunhofer FIRST, Berlin
Ulrich Geske
IF Computer Japan, 5-28-2 Sendagi, Bunkyo-ku, 113-0022, Tokyo, Japan
Oskar Bartenstein

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Atzmueller, M., Baumeister, J., Puppe, F. (2005). Quality Measures and Semi-automatic Mining of Diagnostic Rule Bases. In: Seipel, D., Hanus, M., Geske, U., Bartenstein, O. (eds) Applications of Declarative Programming and Knowledge Management. INAP WLP 2004 2004. Lecture Notes in Computer Science(), vol 3392. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11415763_5

Download citation

DOI: https://doi.org/10.1007/11415763_5
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-25560-4
Online ISBN: 978-3-540-32124-8
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics