Abstract
A stimulus-response learning classifier system (LCS), EpiCS, was developed from the BOOLE and NEWBOOLE models to address the needs of knowledge discovery in databases used in clinical research. Two specific needs were investigated: the derivation of accurate estimates of disease risk, and the ability to deal with rare clinical outcomes. EpiCS was shown to have excellent classification accuracy, compared to logistic regression, when using risk estimates as the primary means for classification. This was especially true in data with low disease prevalence. EpiCS was designed to accommodate differential negative reinforcement when false positive or false negative decisions were made by the system. This feature was investigated to determine its effect on learning rate and classification accuracy. Tested across a range of disease prevalences, the learning rate improved when erroneous decisions were differentially negatively reinforced. However, classification accuracy was not affected by differential negative reinforcement
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Bonelli, P., Parodi, A., Sen, S., and Wilson, S.: NEWBOOLE: A fast GBML system. In International Conference on Machine Learning, pages 153–159, San Mateo, California, 1990. Morgan Kaufmann.
Bonelli, P. and Parodi, A.: An efficient classifier system and its experimental comparison with two representative learning methods on three medical domains. In R.K. Belew and L.B. Booker, editors, Proceedings of the Fourth International Conference on Genetic Algorithms (ICGA-4), pages 288–295. San Mateo, CA: Morgan Kaufmann, 1991.
Centor, R. and Keightley, G.E.: Receiver operating characteristic (ROC) curve area analysis using the ROC ANALYZER. System. In Kingsland, L.C., editor Proceedings of the Thirteenth Annual Symposium on Computer Applications in Medical Care, pages 222–226, Silver Spring, MD: IEEE Computer Society Press, 1989.
Dean, A.D.. Dean, J.A., Burton, J.H., and Dicker, R.C.: Epi Info, Version 5: a word processing, database, and statistics program for epidemiology on microcomputers. Centers for Disease Control, Atlanta, Georgia, 1990.
Fayyad, U.M., Piatetsky-Shapiro, G., Smyth, P., and Uthurusamy, R.: Advances in Knowledge Discovery and Data Mining. Menlo Park, CA: AAAI Press; 1996.
Goldberg, D.E.: Genetic Algorithms in Search, Optimization, and Machine Learning. Addison-Wesley, Reading, MA, 1989.
Good, W.F., Gur, D, Straub, W.H., and Feist, J.H.: Comparing imaging systems by ROC studies. Detection versus interpretation. Investigative Radiology. 24(11): 932–3, 1989.
Green, D.M. and Swets, J.A.: Signal Detection Theory and Psychophysics. New York: John Wiley Sons; 1966.
Hanley, J.A. and McNeil, B.J.: The meaning and use of the area under a receiver operating characteristic (ROC) curve. Radiology 143:29–36, 1982.
Hennekens, C.H., Buring, J.E., and Mayrent, S.L., editors: Epidemiology in Medicine. Boston: Little, Brown and Company; 1987.
Holmes, J.H.: A genetics-based machine learning approach to knowledge discovery in clinical data, Journal of the American Medical Informatics Association Supplement (1996) 883.
Holmes, J.H.: Discovering Risk of Disease with a Learning Classifier System. In Baeck, T., editor, Proceedings of the Seventh International Conference on Genetic Algorithms (ICGA-7). pages 426–433. San Francisco, CA: Morgan Kaufmann, 1997.
Holmes, J.H., Durbin D.R., and Winston F.K.: The Learning Classifier System: An evolutionary computation approach to knowledge discovery in epidemiologic surveillance. Artificial Intelligence in Medicine (Accepted for publication).
Hume, D.A.: A Treatise of Human Nature (1739). Second edition. Oxford: Clarendon Press; 1978.
Kelsey, J.L., Thompson, W.D., and Evans, A.S.: Methods in Observational Epidemiology. New York: Oxford University Press; 1986.
McNeil, B.J. and Hanley, J.A.: Statistical approaches to the analysis of receiver operating characteristic (ROC) curves. Medical Decision Making. 4:137–150, 1984.
McNichol, D.A.: Primer of Signal Detection Theory. London: George Allen and Unwin, Ltd.; 1972.
Metz, C.E., Shen, J.-H., and Kronman, H.B.: LabROC4: A program for maximum likelihood estimation of a binormal ROC curve and its associated parameters from a set of continuously-distributed data. University of Chicago; 1993.
Provost, F. and Fawcett, T.: Analysis and visualization of classifier performance: comparison under imprecise class and cost distributions. In Heckerman, D., Mannila, H., Pregibon, D., and Uthurusamy, R., editors, Proceedings of the Third International Conference on Knowledge Discovery and Data Mining, pages 43–48. Menlo Park, CA: AAAI Press, 1997.
Raska, K.: Epidemiologic surveillance in the control of infectious diseases. Review of Infectious Disease, 5: 1112–1117, 1983.
Robertson, G.G. and Riolo, R.L.: A tale of two classifier systems. Machine Learning 3:139–159, 1988.
Rothman, K.J.: Modern Epidemiology. Boston: Little, Brown and Company, 1986.
Sedbrook, T.A., Wright, H., and Wright, R.: Application of a genetic classifier for patient triage. In Belew, R.K. and Booker, L.B., editors, Proceedings of the Fourth International Conference on Genetic Algorithms (ICGA-4), pages 334–338. San Mateo, CA: Morgan Kaufmann, 1991.
Somoza, E., Soutullo-Esperon, L., and Mossman, D.: Evaluation and optimization of diagnostic tests using receiver operating characteristic analysis and information theory. International Journal of Biomedical Computing, 24(3): 153–89, 1989.
Wilson, S.W.: Knowledge growth in an artificial animal. In Grefenstette, J.J., editor, Proceedings of an International Conference on Genetic Algorithms and their Applications, pages 16–23, Pittsburgh, PA, July 1985. Lawrence Erlbaum Associates.
Wilson, S.W.: Classifier systems and the animat problem, Machine Learning, 2:199–228, 1987.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2000 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Holmes, J.H. (2000). Learning Classifier Systems Applied to Knowledge Discovery in Clinical Research Databases. In: Lanzi, P.L., Stolzmann, W., Wilson, S.W. (eds) Learning Classifier Systems. IWLCS 1999. Lecture Notes in Computer Science(), vol 1813. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45027-0_13
Download citation
DOI: https://doi.org/10.1007/3-540-45027-0_13
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-67729-1
Online ISBN: 978-3-540-45027-6
eBook Packages: Springer Book Archive