Abstract
Clinicians routinely collect extensive case histories on their patients and in certain medical domains this data may be supplemented with confirmed or “working” diagnosis obtained by patient follow-up. The possibility of using such datasets as a source of ‘knowledge’ for diagnostic systems has been a goal of many research studies.
This paper reports on the application of five approaches: rule induction, neural networks, statistically based diagnostic trees, Bayes discriminants and logistic models; to the construction of diagnostic aids based on a noisy, mainly categorical, medical dataset giving the clinical presentation of patients with either Multiple Sclerosis or Cerebrovascular(Vascular) Disease who have been referred for Magnetic Resonance Imaging.
The procedures investigated gave very similar results in terms of overall diagnostic performance although the ‘format’ of the resulting diagnostic aids was very different. The use of Multiple Correspondence Analysis as a preparatory technique, to remove noisy variables, proved very successful in identifying a smaller subset of items that were more amenable to ‘automated’ techniques such as neural networks/rule induction and also assisted in the selection of variables for statistical discrimination.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Balla JI, Elstein AS and Christensen C Obstacles to acceptance of clinical decision analysis, BMJ, 298 (1989), 579–82.
Ledley RS and Lusted 1.13 Reasoning foundations of medical diagnosis, Science, 130 (1959), 9–21.
de Dombal FT, Leaper DJ, Staniland JR, McCann AP and Horrocks JC: Human and computer-aided diagnosis of abdominal pain; tùrther report with emphasis on performance of clinicians, BMJ, I (1972), 376 - S0.
de Dombal FT, Clamp S, Softly A, Unwin B and Staniland JR: Prediction of individual patient prognosis value of computer aided systems, Medical Decision Making, 6 (1986), 1, 18–22.
Spiegelhalter DJ and Knill-Jones RP Statistical and knowledge based applications to clinical decision support systems with an application in gastroenterology, JRSS, 147 (1984)
Wills KM, du Boulay GH and Teather D: Initial findings in the computer aided diagnosis of cerebral tumours using CT scan results, Br.J.Radiol., 54 (1981), 948–952.
Teather D, Morton BA, du Boulay GH, Wills KM, Plummer D and Innocent PR: Computer assistance for CT scan interpretation and cerebral disease diagnosis, Statistics in Medicine, 4 (1985), 311–315.
du Boulay GH, Field B, Teather BA, Teather D and Plummer D: The extraction of expert knowledge for MR image acquisition from the published literature, Rivista di Neuroradiologia, 5 (1992), 473–482.
Enzmann DR and O’Donohue J: Optimising MR imaging for detecting small tumours in the cerehellopontine angle and internal auditory canal, A.INR, 8 (1987), 99–106.
Pojunas KW, Danials DL, Williams AL, Naughton VM MR imaging of prolactin secreting micro-adenomas, AJNR, 7 (I 986 ), 209–213
Ormerod IEC, Miller DII, McDonald WI et al The role of NMR imaging in the assessment of multiple sclerosis and isolated neurological lesions–a quantitative study, Brain, 110 (1987), 1579–1616.
Gregson M, John R, Teather BA and Thompson R Practical issues in the application of neural networks to the differential diagnosis of brain disease, Proc. IEE International Conference on Neural Networks and Expert Systems in Medicine and Healthcare, Plymouth (1994).
Goodman LA: The analysis of multidimensional contingency tables–stepwise procedures and direct estimation methods for building, models for multiple classifications, Technometrics, 13 (1971), 33–61.
Teather D: Diagnosis–methods and analysis, Bulletin of IMA, 10 (1974), 37–41.
Sturt E: Computerised construction in Fortran of a discriminant Function for categorical data, Applied Statistics, 30 (1981), 213–222.
Benzécri JP et al: L’Analyse des données Tome I - La taxinomie. Tonne 2 - L’Analyse des correspondances, Dunod, Paris
Greenacre MJ: Correspondence analysis in practice, Academic Press, London 1993
Greenacre MJ SimCA Version 2 - Personal Computer Software for Correspondence Analysis, User Manual, 1990.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 1995 Springer-Verlag Wien
About this paper
Cite this paper
Teather, B.A., Riccia, G.D., Teather, D. (1995). Learning from Noisy Medical Data: A Comparative Study Based on a Real Diagnostic Problem. In: Della Riccia, G., Kruse, R., Viertl, R. (eds) Proceedings of the ISSEK94 Workshop on Mathematical and Statistical Methods in Artificial Intelligence. International Centre for Mechanical Sciences, vol 363. Springer, Vienna. https://doi.org/10.1007/978-3-7091-2690-5_17
Download citation
DOI: https://doi.org/10.1007/978-3-7091-2690-5_17
Publisher Name: Springer, Vienna
Print ISBN: 978-3-211-82713-0
Online ISBN: 978-3-7091-2690-5
eBook Packages: Springer Book Archive