Mining Data from a Knowledge Management Perspective: An Application to Outcome Prediction in Patients with Resectable Hepatocellular Carcinoma

  • Riccardo Bellazzi
  • Ivano Azzini
  • Gianna Toffolo
  • Stefano Bacchetti
  • Mario Lise
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 2101)


This paper presents the use of data mining tools to derive a prognostic model of the outcome of resectable hepatocellular carcinoma. The main goal of the study was to summarize the experience gained over more than 20 years by a surgical team. To this end, two decision trees have been induced from data: a model M1 that contains a full set of prognostic rules derived from the data on the basis of the 20 available factors, and a model M2 that considers only the two most relevant factors. M1 will be used to explicit the knowledge embedded in the data (externalization), while the model M2 will be used to extract operational rules (socialization). The models performance has been compared with the one of a Naive Bayes classifier and have been validated by the expert physicians. The paper concludes that a knowledge management perspective improves the validity of data mining techniques in presence of small data sets, coming from severe pathologies with relative low incidence. In these cases, it is more crucial the quality of the extracted knowledge than the predictive accuracy gained.


Liver Resection Prognostic Model Total Accuracy Decision Tree Induction Intelligent Data Analysis 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    Nonaka I., Takeuchi H., The Knowledge-creating company, University Press, Oxford UK 1995.Google Scholar
  2. 2.
    Pazzani M.J., Knowledge discovery from data?, IEEE Intelligent Systems, 10–13, March–April 2000.Google Scholar
  3. 3.
    Kukar M., Besic N., Konomenko I., Auersperg M., Robnik-Sikonia M., Prognosing the survival time of the patients with the anaplastic thyroid carcinoma with machine learning, in: Intelligent Data Analysis in Medicine and Pharmacology, N. Lavrac, E. Keravnou, B. Zupan, 116–129, Kluwer, Boston, 1997.Google Scholar
  4. 4.
    Lise M., Bacchetti S., Da Pian P.P., Nitti D., Pilati P.L., Pigato P., Prognostic factors affecting long term outcome after liver resection for hepatocellular carcinoma, Cancer 82 (1998) 1028–1036.CrossRefGoogle Scholar
  5. 5.
    Bacchetti S., Toffolo G., Volpato M., Da Pian P., Nitti D., Cobelli C., Lise M., Outcome prediction in patients with resectable hepatocellular carcinoma; comparison of a Multivariate score model and a Bayesian Model (submitted for publication).Google Scholar
  6. 6.
    Ramoni M., Sebastiani P., An introduction to the robust Bayesian classifier, KMI-TR-79, the Open University, UK, 1999.Google Scholar
  7. 7.
    Dash M., Liu H., Features Selection for Classification, Intelligent Data Analysis, 1997,
  8. 8.
    Quinlan, J.R.: C4.5 Program for Machine Learning, Morgan Kaufmann, San Mateo CA (1994)Google Scholar
  9. 9.
    See5 Release 1.13,
  10. 10.
    Robust Bayesian Classifier (ROC),
  11. 11.
    Duda R.O., Hart P.E., Pattern classification and scene analysis, Wiley, New York, 1973.zbMATHGoogle Scholar
  12. 12.
    Mitchell T.M.: Machine Learning, McGraw-Hill, New York, 1997zbMATHGoogle Scholar
  13. 13.
    Hamamoto I., Okada S., Hashimoto T., Wakabayashi H., Maeba T., Maeta H.: Prediction of the early prognosis of the hepatectomized patients with hepatocellular carcinoma with a neural network, Comput Biol Med 25 (1995) 49–59.CrossRefGoogle Scholar
  14. 14.
    Gamberger D., Lavrac N., Jovanoski V., High confidence association rules for medical diagnosis, Proc. IDAMAP’ 99 workshop, 42–51., 1999.Google Scholar
  15. 15.
    Mani S., Shankle W.R., Dick M.B., Pazzani M.J.: Two stage Machine Learning model for guideline development,, Artif Intell Med, 16 (1999) 51–71.CrossRefGoogle Scholar
  16. 16.
    Zupan B., Demsar J., Kattan M.W., Beck R.J., Bratko I: Machine Learning for survival analysis: a case of study on recurrence of prostate cancer, Artif Intell Med, 20 (2000) 59–75.CrossRefGoogle Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2001

Authors and Affiliations

  • Riccardo Bellazzi
    • 1
  • Ivano Azzini
    • 1
  • Gianna Toffolo
    • 2
  • Stefano Bacchetti
    • 3
  • Mario Lise
    • 3
  1. 1.Dipartimento di Informatica e SistemisticaUniversità di PaviaPaviaItaly
  2. 2.Dipartimento di Ingegneria Elettronica e InformaticaUniversità di PadovaPadovaItaly
  3. 3.Dipartimento di Scienze Oncologiche e Chirurgiche, Sez. Clinica ChirurgicaUniversità di PadovaPadovaItaly

Personalised recommendations