Automated Case Generation from Databases Using Similarity-Based Rough Approximation

Geng, Liqiang; Chan, Christine W.

doi:10.1007/3-540-46016-0_33

Liqiang Geng⁵ &
Christine W. Chan⁵

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 2313))

Included in the following conference series:

Mexican International Conference on Artificial Intelligence

640 Accesses
1 Citations

Abstract

Knowledge acquisition for a case-based reasoning system from domain experts is a bottleneck in the system development process. With the huge amounts of data that have become available, it would be useful to derive automatically representative cases from available databases rather than acquiring them from domain experts. This paper presents two algorithms using similarity-based rough set theory to derive cases automatically from available databases. The first algorithm SRS1 requires the user to decide the similarity thresholds for the objects in a database, while the second algorithm SRS2 can automatically select proper similarity thresholds. These algorithms require fewer parameters from domain experts than other case generation algorithms. Also they can tackle noise and inconsistent data in the database and select a reasonable number of the representative cases from the database. The experimental results were compared with those from well-known data mining systems, such as rule induction systems and neural network systems.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Aha, D.W., Kibler, D., Albert, M.K., Instance based learning algorithms. Machine Learning 6 37–66, 1991.
Google Scholar
Bradshaw, G., Learning about speech sounds: The NEXUS project. Proceedings of the Fourth International Workshop on Machine Learning 1–11 1987
Google Scholar
Cao, G., Shiu, S., and Wang, X., A fuzzy-rough approach for case base maintenance. Proceedings of the Forth International Conference on Case-Based Reasoning, 118–130, 2001.
Google Scholar
Chan, C., Chen, L., and Geng, L. Knowledge engineering for an intelligent case-based system for help desk operations. Expert System with Application, 18, 125–132, 2000.
Article Google Scholar
Cost, S. and SalzBerg, S. (1990) A weighted nearest neighbor algorithm for learning with symbolic features. Technical Report JHU-90/11. Baltimore, MD: The Johns Hopkins University, Department of Computer Science.
Google Scholar
Funakoshi, K. and Bao Ho, T. Rough set approach to information retrieval. Rough Sets in Knowledge Discovery, v2, Lech Polkowski, Andrzej Skowron (eds.). Heidelberg Publisher, New York: Physica-Verlag 166–177, 1998.
Google Scholar
Grzymala-Busse, J.W., LERS—a system for learning from examples based on rough sets. In Slowinski, R. (eds.), Intelligent Decision Support, Kluwer Academic Publishers, 3–18, 1992.
Google Scholar
Han, J. and Kamber, K. Data Mining: Concepts and Techniques. Morgan Kaufmann Publishers, 2000.
Google Scholar
Ketler, K. Case based reasoning: an introduction. Expert System with Application, 6, 3–8, 1993.
Article Google Scholar
Krzysztof Krawiec, Roman Slowinski, and Daniel Vanderpooten, Learining Decision Rules from Similarity Based Rough Approximations 2: Applications, Case Studies and Software Systems. Rough Sets in Knowledge Discovery, v2, Lech Polkowski, Andrzej Skowron (eds.), Heidelberg; New York: Physica-Verlag, 37–54, 1998.
Google Scholar
Merz, C.J., Murphy, P.M.,UCI Repository of machine learning databases. University of California, Department of Information and Computer Science, 1996.
Google Scholar
Mrozek, A. and Skabek, K. Rough sets in economic applications. Rough Sets in Knowledge Discovery, v2, Lech Polkowski, Andrzej Skowron (eds.). Heidelberg Publisher, New York: Physica-Verlag 238–271, 1998.
Google Scholar
Pal, S. K. and Mitra, P. Case generation: a rough-fuzzy approach. Proceedings of Workshop Program at the 4th International Conference on Case-Based Reasoning, 236–242, 2001.
Google Scholar
Pawalk, Z. Rough Sets: Theoretical Aspects of Reasoning about Data, Kluwer Academic Publishers, Dordrecht, 1991.
Google Scholar
Quinlan, J.R. C4.5: Programs for Machine Learning. Morgan Kaufmann Publishers, San Mateo CA, 1988.
Google Scholar
Rumelhart, D.E., Hinton, G. E., Williams, R.J. Learning internal representations by error propagation. In: Rumelhart, D.E, McClelland, J.L. and the PDP Research Group (eds.), Parallel distributed processing: Explorations in the microstructure of cognition, MIT Press, Cambridge MA, 318–362, 1986.
Google Scholar
Slowinski, R and Vanderpoonten D. Similarity relation as a basis for rough approximations. Advances in Machine Intelligence and Soft Computing 4, 17–33, 1997.
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science, University of Regina, S4S 0A2, Regina, Sask, Canada
Liqiang Geng & Christine W. Chan

Authors

Liqiang Geng
View author publications
You can also search for this author in PubMed Google Scholar
Christine W. Chan
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Computer Science Section, Electrical Engineering Department, CINVESTAV-IPN, Av. IPN 2508, Col. San Pedro Zacatenco, D.F. 07300, Mexico, Mexico
Carlos A. Coello Coello
Computer Science Department, ITESM-Mexico City, Calle del Puente 222, Tlalpan, D.F. 14380, Mexico, Mexico
Alvaro de Albornoz
Computer Science Department, ITESM-Cuernavaca, Reforma 182-A, Lomas de Cuernavaca, Temixco, 62589, Morelos, Mexico
Luis Enrique Sucar
Department of Computer Science, ITAM, Rio Hondo 1, Progreso Tizapan, D.F. 01000, Mexico, Mexico
Osvaldo Cairó Battistutti

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Geng, L., Chan, C.W. (2002). Automated Case Generation from Databases Using Similarity-Based Rough Approximation. In: Coello Coello, C.A., de Albornoz, A., Sucar, L.E., Battistutti, O.C. (eds) MICAI 2002: Advances in Artificial Intelligence. MICAI 2002. Lecture Notes in Computer Science(), vol 2313. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-46016-0_33

Download citation

DOI: https://doi.org/10.1007/3-540-46016-0_33
Published: 07 May 2002
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-43475-7
Online ISBN: 978-3-540-46016-9
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics