Learning Classifier Systems Applied to Knowledge Discovery in Clinical Research Databases

Holmes, John H.

doi:10.1007/3-540-45027-0_13

John H. Holmes⁵

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 1813))

Included in the following conference series:

International Workshop on Learning Classifier Systems

876 Accesses
15 Citations

Abstract

A stimulus-response learning classifier system (LCS), EpiCS, was developed from the BOOLE and NEWBOOLE models to address the needs of knowledge discovery in databases used in clinical research. Two specific needs were investigated: the derivation of accurate estimates of disease risk, and the ability to deal with rare clinical outcomes. EpiCS was shown to have excellent classification accuracy, compared to logistic regression, when using risk estimates as the primary means for classification. This was especially true in data with low disease prevalence. EpiCS was designed to accommodate differential negative reinforcement when false positive or false negative decisions were made by the system. This feature was investigated to determine its effect on learning rate and classification accuracy. Tested across a range of disease prevalences, the learning rate improved when erroneous decisions were differentially negatively reinforced. However, classification accuracy was not affected by differential negative reinforcement

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Bonelli, P., Parodi, A., Sen, S., and Wilson, S.: NEWBOOLE: A fast GBML system. In International Conference on Machine Learning, pages 153–159, San Mateo, California, 1990. Morgan Kaufmann.
Google Scholar
Bonelli, P. and Parodi, A.: An efficient classifier system and its experimental comparison with two representative learning methods on three medical domains. In R.K. Belew and L.B. Booker, editors, Proceedings of the Fourth International Conference on Genetic Algorithms (ICGA-4), pages 288–295. San Mateo, CA: Morgan Kaufmann, 1991.
Google Scholar
Centor, R. and Keightley, G.E.: Receiver operating characteristic (ROC) curve area analysis using the ROC ANALYZER. System. In Kingsland, L.C., editor Proceedings of the Thirteenth Annual Symposium on Computer Applications in Medical Care, pages 222–226, Silver Spring, MD: IEEE Computer Society Press, 1989.
Google Scholar
Dean, A.D.. Dean, J.A., Burton, J.H., and Dicker, R.C.: Epi Info, Version 5: a word processing, database, and statistics program for epidemiology on microcomputers. Centers for Disease Control, Atlanta, Georgia, 1990.
Google Scholar
Fayyad, U.M., Piatetsky-Shapiro, G., Smyth, P., and Uthurusamy, R.: Advances in Knowledge Discovery and Data Mining. Menlo Park, CA: AAAI Press; 1996.
Google Scholar
Goldberg, D.E.: Genetic Algorithms in Search, Optimization, and Machine Learning. Addison-Wesley, Reading, MA, 1989.
MATH Google Scholar
Good, W.F., Gur, D, Straub, W.H., and Feist, J.H.: Comparing imaging systems by ROC studies. Detection versus interpretation. Investigative Radiology. 24(11): 932–3, 1989.
Article Google Scholar
Green, D.M. and Swets, J.A.: Signal Detection Theory and Psychophysics. New York: John Wiley Sons; 1966.
Google Scholar
Hanley, J.A. and McNeil, B.J.: The meaning and use of the area under a receiver operating characteristic (ROC) curve. Radiology 143:29–36, 1982.
Google Scholar
Hennekens, C.H., Buring, J.E., and Mayrent, S.L., editors: Epidemiology in Medicine. Boston: Little, Brown and Company; 1987.
Google Scholar
Holmes, J.H.: A genetics-based machine learning approach to knowledge discovery in clinical data, Journal of the American Medical Informatics Association Supplement (1996) 883.
Google Scholar
Holmes, J.H.: Discovering Risk of Disease with a Learning Classifier System. In Baeck, T., editor, Proceedings of the Seventh International Conference on Genetic Algorithms (ICGA-7). pages 426–433. San Francisco, CA: Morgan Kaufmann, 1997.
Google Scholar
Holmes, J.H., Durbin D.R., and Winston F.K.: The Learning Classifier System: An evolutionary computation approach to knowledge discovery in epidemiologic surveillance. Artificial Intelligence in Medicine (Accepted for publication).
Google Scholar
Hume, D.A.: A Treatise of Human Nature (1739). Second edition. Oxford: Clarendon Press; 1978.
Google Scholar
Kelsey, J.L., Thompson, W.D., and Evans, A.S.: Methods in Observational Epidemiology. New York: Oxford University Press; 1986.
Google Scholar
McNeil, B.J. and Hanley, J.A.: Statistical approaches to the analysis of receiver operating characteristic (ROC) curves. Medical Decision Making. 4:137–150, 1984.
Article Google Scholar
McNichol, D.A.: Primer of Signal Detection Theory. London: George Allen and Unwin, Ltd.; 1972.
Google Scholar
Metz, C.E., Shen, J.-H., and Kronman, H.B.: LabROC4: A program for maximum likelihood estimation of a binormal ROC curve and its associated parameters from a set of continuously-distributed data. University of Chicago; 1993.
Google Scholar
Provost, F. and Fawcett, T.: Analysis and visualization of classifier performance: comparison under imprecise class and cost distributions. In Heckerman, D., Mannila, H., Pregibon, D., and Uthurusamy, R., editors, Proceedings of the Third International Conference on Knowledge Discovery and Data Mining, pages 43–48. Menlo Park, CA: AAAI Press, 1997.
Google Scholar
Raska, K.: Epidemiologic surveillance in the control of infectious diseases. Review of Infectious Disease, 5: 1112–1117, 1983.
Google Scholar
Robertson, G.G. and Riolo, R.L.: A tale of two classifier systems. Machine Learning 3:139–159, 1988.
Google Scholar
Rothman, K.J.: Modern Epidemiology. Boston: Little, Brown and Company, 1986.
Google Scholar
Sedbrook, T.A., Wright, H., and Wright, R.: Application of a genetic classifier for patient triage. In Belew, R.K. and Booker, L.B., editors, Proceedings of the Fourth International Conference on Genetic Algorithms (ICGA-4), pages 334–338. San Mateo, CA: Morgan Kaufmann, 1991.
Google Scholar
Somoza, E., Soutullo-Esperon, L., and Mossman, D.: Evaluation and optimization of diagnostic tests using receiver operating characteristic analysis and information theory. International Journal of Biomedical Computing, 24(3): 153–89, 1989.
Article Google Scholar
Wilson, S.W.: Knowledge growth in an artificial animal. In Grefenstette, J.J., editor, Proceedings of an International Conference on Genetic Algorithms and their Applications, pages 16–23, Pittsburgh, PA, July 1985. Lawrence Erlbaum Associates.
Google Scholar
Wilson, S.W.: Classifier systems and the animat problem, Machine Learning, 2:199–228, 1987.
Google Scholar

Download references

Author information

Authors and Affiliations

Center for Clinical Epidemiology and Biostatistics, University of Pennsylvania School of Medicine, USA
John H. Holmes

Authors

John H. Holmes
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Dipartimento di Elettronica ed Informatzione, Politecnico di Milano, Piazza Leonardo da Vinci n. 32, 20133, Milano, Italy
Pier Luca Lanzi
Institut für Psychologie III, Universität Würzburg, Röntgenring 11, 97070, Würzburg, Germany
Wolfgang Stolzmann
Prediction Dynamics, Concord, MA, 01742, USA
Stewart W. Wilson
Department of General Engineering, University of Illinois at Urbana-Champaign, USA
Stewart W. Wilson

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Holmes, J.H. (2000). Learning Classifier Systems Applied to Knowledge Discovery in Clinical Research Databases. In: Lanzi, P.L., Stolzmann, W., Wilson, S.W. (eds) Learning Classifier Systems. IWLCS 1999. Lecture Notes in Computer Science(), vol 1813. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45027-0_13

Download citation

DOI: https://doi.org/10.1007/3-540-45027-0_13
Published: 21 July 2000
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-67729-1
Online ISBN: 978-3-540-45027-6
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics