Learning from Noisy Medical Data: A Comparative Study Based on a Real Diagnostic Problem

  • B. A. Teather
  • G. Della Riccia
  • D. Teather
Conference paper
Part of the International Centre for Mechanical Sciences book series (CISM, volume 363)


Clinicians routinely collect extensive case histories on their patients and in certain medical domains this data may be supplemented with confirmed or “working” diagnosis obtained by patient follow-up. The possibility of using such datasets as a source of ‘knowledge’ for diagnostic systems has been a goal of many research studies.

This paper reports on the application of five approaches: rule induction, neural networks, statistically based diagnostic trees, Bayes discriminants and logistic models; to the construction of diagnostic aids based on a noisy, mainly categorical, medical dataset giving the clinical presentation of patients with either Multiple Sclerosis or Cerebrovascular(Vascular) Disease who have been referred for Magnetic Resonance Imaging.

The procedures investigated gave very similar results in terms of overall diagnostic performance although the ‘format’ of the resulting diagnostic aids was very different. The use of Multiple Correspondence Analysis as a preparatory technique, to remove noisy variables, proved very successful in identifying a smaller subset of items that were more amenable to ‘automated’ techniques such as neural networks/rule induction and also assisted in the selection of variables for statistical discrimination.


Multiple Sclerosis Multiple Correspondence Analysis Multiple Episode Rule Induction Internal Auditory Canal 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    Balla JI, Elstein AS and Christensen C Obstacles to acceptance of clinical decision analysis, BMJ, 298 (1989), 579–82.CrossRefGoogle Scholar
  2. 2.
    Ledley RS and Lusted 1.13 Reasoning foundations of medical diagnosis, Science, 130 (1959), 9–21.Google Scholar
  3. 3.
    de Dombal FT, Leaper DJ, Staniland JR, McCann AP and Horrocks JC: Human and computer-aided diagnosis of abdominal pain; tùrther report with emphasis on performance of clinicians, BMJ, I (1972), 376 - S0.Google Scholar
  4. 4.
    de Dombal FT, Clamp S, Softly A, Unwin B and Staniland JR: Prediction of individual patient prognosis value of computer aided systems, Medical Decision Making, 6 (1986), 1, 18–22.CrossRefGoogle Scholar
  5. 5.
    Spiegelhalter DJ and Knill-Jones RP Statistical and knowledge based applications to clinical decision support systems with an application in gastroenterology, JRSS, 147 (1984)Google Scholar
  6. 6.
    Wills KM, du Boulay GH and Teather D: Initial findings in the computer aided diagnosis of cerebral tumours using CT scan results, Br.J.Radiol., 54 (1981), 948–952.CrossRefGoogle Scholar
  7. 7.
    Teather D, Morton BA, du Boulay GH, Wills KM, Plummer D and Innocent PR: Computer assistance for CT scan interpretation and cerebral disease diagnosis, Statistics in Medicine, 4 (1985), 311–315.CrossRefGoogle Scholar
  8. 8.
    du Boulay GH, Field B, Teather BA, Teather D and Plummer D: The extraction of expert knowledge for MR image acquisition from the published literature, Rivista di Neuroradiologia, 5 (1992), 473–482.Google Scholar
  9. 9.
    Enzmann DR and O’Donohue J: Optimising MR imaging for detecting small tumours in the cerehellopontine angle and internal auditory canal, A.INR, 8 (1987), 99–106.Google Scholar
  10. 10.
    Pojunas KW, Danials DL, Williams AL, Naughton VM MR imaging of prolactin secreting micro-adenomas, AJNR, 7 (I 986 ), 209–213Google Scholar
  11. 11.
    Ormerod IEC, Miller DII, McDonald WI et al The role of NMR imaging in the assessment of multiple sclerosis and isolated neurological lesions–a quantitative study, Brain, 110 (1987), 1579–1616.CrossRefGoogle Scholar
  12. 12.
    Gregson M, John R, Teather BA and Thompson R Practical issues in the application of neural networks to the differential diagnosis of brain disease, Proc. IEE International Conference on Neural Networks and Expert Systems in Medicine and Healthcare, Plymouth (1994).Google Scholar
  13. 13.
    Goodman LA: The analysis of multidimensional contingency tables–stepwise procedures and direct estimation methods for building, models for multiple classifications, Technometrics, 13 (1971), 33–61.CrossRefMATHGoogle Scholar
  14. 14.
    Teather D: Diagnosis–methods and analysis, Bulletin of IMA, 10 (1974), 37–41.Google Scholar
  15. 15.
    Sturt E: Computerised construction in Fortran of a discriminant Function for categorical data, Applied Statistics, 30 (1981), 213–222.CrossRefGoogle Scholar
  16. 16.
    Benzécri JP et al: L’Analyse des données Tome I - La taxinomie. Tonne 2 - L’Analyse des correspondances, Dunod, ParisGoogle Scholar
  17. 17.
    Greenacre MJ: Correspondence analysis in practice, Academic Press, London 1993Google Scholar
  18. 18.
    Greenacre MJ SimCA Version 2 - Personal Computer Software for Correspondence Analysis, User Manual, 1990.Google Scholar

Copyright information

© Springer-Verlag Wien 1995

Authors and Affiliations

  • B. A. Teather
    • 1
  • G. Della Riccia
    • 2
  • D. Teather
    • 1
  1. 1.De Montford UniversityLeicesterUK
  2. 2.University of UdineUdineItaly

Personalised recommendations