Formalizing Expert Knowledge Through Machine Learning

  • Tsuyoshi IdéEmail author
Part of the Service Science: Research and Innovations in the Service Economy book series (SSRI)


This chapter addresses one of the key questions in service science: how to formalize expert knowledge. While this question has been treated mainly as a task of formal language design, we use an alternative approach based on machine learning. Investigating the history of expert systems in artificial intelligence, we suggest that three criteria, generalizability, learnability, and actionability, are critical for extracted expert rules. We then conclude that machine learning is a promising tool to satisfy these criteria. As a real example, we perform a case study on a task of condition-based maintenance in the railway industry. We demonstrate that our proposed statistical outlier detection method achieves good performance for early anomaly detection in wheel axles, and thus in encoding expert knowledge.


Expert System Anomaly Detection Service Science Precision Matrix Graphical Gaussian Model 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


  1. Banerjee O, Ghaoui LE, Natsoulis G (2006) Convex optimization techniques for fitting sparse Gaussian graphical models. In: Proceedings of the international conference on machine learning. ACM Press, New York, pp 89–96Google Scholar
  2. Bishop CM (2006) Pattern recognition and machine learning. Springer, New YorkGoogle Scholar
  3. Buchanan BG, Shortliffe EH (1984) Rule based expert systems: the MYCIN experiments of the Stanford Heuristic Programming Project. Addison-Wesley, ReadingGoogle Scholar
  4. Dempster AP (1972) Covariance selection. Biometrics, 28(1):157–175CrossRefGoogle Scholar
  5. Drake S (1957) Discoveries and opinions of Galileo, 2nd edn. Anchor, New YorkGoogle Scholar
  6. Fayyad UM, Piatetsky-Shapiro G, Smyth P, Uthurusamy R (1996) Advances in knowledge discovery and data mining. AAAI Press, Menlo Park, CA. and
  7. Ferrucci JA, Brown EW, Chu-Carroll J, Fan J, Gondek D, Kalyanpur A, Lally A, Murdock JW, Nyberg E, Prager JM, Schlaefer N, Welty CA (2010) Building Watson: an overview of the DeepQA project. AI Mag 31(3):59–79Google Scholar
  8. Friedman J, Hastie T, Höfling H, Tibshirani R (2007) Pathwise coordinate optimization. Ann Appl Stat 1(2):302–332CrossRefGoogle Scholar
  9. Friedman J, Hastie T, Tibshirani R (2008) Sparse inverse covariance estimation with the graphical lasso. Biostatistics 9(3):432–441CrossRefGoogle Scholar
  10. Idé T, Lozano AC, Abe N, Liu Y (2009) Proximity-based anomaly detection using sparse structure learning. In: Proceedings of 2009 SIAM International Conference on Data Mining (SDM 09), pp 97–108Google Scholar
  11. Nakazawa S (1998) ASP News, No.55. (in Japanese)
  12. Vargoa SL, Magliob PP, Akakaa MA (2008) On value and value co-creation: a service systems and service logic perspective. Eur Manag J 26(3):145–152CrossRefGoogle Scholar
  13. Waterman DA (1985) A guide to expert systems. Addison-Wesley Longman Publishing Co., Inc., BostonGoogle Scholar
  14. Wikipedia (2012a) MYCIN.
  15. Wikipedia (2012b) Rail transport in Japan.

Copyright information

© Springer Science+Business Media New York 2016

Authors and Affiliations

  1. 1.IBM Research, T. J. Watson Research CenterYorktown HeightsUSA

Personalised recommendations