Abstract
Predicting the amount of money that can be claimed is critical to the effective running of an Hospital. In this paper we describe a case study of a Dutch Hospital where we use process mining to predict the cash flow of the Hospital. In order to predict the cost of a treatment, we use different data mining techniques to predict the sequence of treatments administered, the duration and the final ”care product” or diagnosis of the patient. While performing the data analysis we encountered three specific kinds of noise that we call sequence noise, human noise and duration noise. Studies in the past have discussed ways to reduce the noise in process data. However, it is not very clear what effect the noise has to different kinds of process analysis. In this paper we describe the combined effect of sequence noise, human noise and duration noise on the analysis of process data, by comparing the performance of several mining techniques on the data.
Chapter PDF
Similar content being viewed by others
References
Han, J., Kamber, M., Pei, J.: Data mining: concepts and techniques. Morgan Kaufmann Pub. (2011)
Quinlan, J.: Improved use of continuous attributes. Journal of Artificial Intelligence Research 4 (1996)
Alpaydın, E.: Introduction to Machine Learning. The MIT Press (2004)
Breiman, L.: Bagging predictors. Machine Learning 24, 123–140 (1996)
Freund, Y., Schapire, R.: Experiments with a new boosting algorithm. In: Machine Learning: Proceedings of the Thirteenth International Conference, pp. 148–156 (1996)
Breiman, L.: Random forests. Machine Learning 45(1), 5–32 (2001)
Curram, S., Mingers, J.: Neural networks, decision tree induction and discriminant analysis: an empirical comparison. Journal of the Operational Research Society, 440–450 (1994)
Alfaro, E., García, N., Gámez, M., Elizondo, D.: Bankruptcy forecasting: An empirical comparison of adaboost and neural networks. Decision Support Systems 45(1), 110–122 (2008)
Banfield, R., Hall, L., Bowyer, K., Kegelmeyer, W.: A comparison of decision tree ensemble creation techniques. IEEE Transactions on Pattern Analysis and Machine Intelligence 29(1), 173–180 (2007)
Gislason, P., Benediktsson, J., Sveinsson, J.: Random forests for land cover classification. Pattern Recognition Letters 27(4), 294–300 (2006)
Baase, S., Gelder, A.: Computer algorithms: introduction to design and analysis. Addison-Wesley (2000)
van der Aalst, W.M.P., van Dongen, B., Herbst, J., Maruster, L., Schimm, G., Weijters, A.J.M.M.: Workflow mining: a survey of issues and approaches. Data & Knowledge Engineering 47(2), 237–267 (2003)
van der Aalst, W.M.P., Weijters, A.J.M.M., Maruster, L.: Workflow mining: Discovering process models from event logs. IEEE Transactions on Knowledge and Data Engineering 16(9), 1128–1143 (2004)
van der Aalst, W.M.P., Günther, C.: Finding structure in unstructured processes: The case for process mining. In: Seventh International Conference on Application of Concurrency to System Design, ACSD 2007, pp. 3–12. IEEE (2007)
van der Aalst, W.M.P., Reijers, H., Weijters, A.J.M.M., van Dongen, B., de Medeiros, A.K.A., Song, M., Verbeek, H.: Business process mining: An industrial application. Information Systems 32, 713–732 (2007)
van der Aalst, W.M.P., Rubin, V., Verbeek, H.M.W., van Dongen, B., Kindler, E., Günther, C.: Process mining: a two-step approach to balance between underfitting and overfitting. Software and Systems Modeling 9(1), 87–111 (2010)
van der Aalst, W.M.P., Weijters, A.J.M.M.: Process mining. Process-Aware Information Systems, 235–255 (2011)
Mans, R., Schonenberg, M., Song, M., van der Aalst, W.M.P., Bakker, P.: Application of process mining in healthcare–a case study in a dutch hospital. Biomedical Engineering Systems and Technologies, 425–438 (2009)
Günther, C., Rinderle-Ma, S., Reichert, M., Van Der Aalst, W.M.P.: Using process mining to learn from process changes in evolutionary systems. International Journal of Business Process Integration and Management 3(1), 61–78 (2008)
Weijters, A.J.M.M., van der Aalst, W.M.P.: Rediscovering workflow models from event-based data using little thumb. Integrated Computer Aided Engineering 10(2), 151–162 (2003)
Herbst, J., Karagiannis, D.: Workflow mining with InWoLve. Computers in Industry 53, 245–264 (2004)
van Dongen, B.F., de Medeiros, A.K.A., Verbeek, H.M.W., Weijters, A.J.M.M., van der Aalst, W.M.P.: The prom framework: A new era in process mining tool support. In: Ciardo, G., Darondeau, P. (eds.) ICATPN 2005. LNCS, vol. 3536, pp. 444–454. Springer, Heidelberg (2005)
Kiepuszewski, B., ter Hofstede, A., van der Aalst, W.M.P.: Fundamentals of control flow in workflows. Acta Informatica 39, 143–209 (2003)
Datta, A.: Automating the discovery of as-is business process models: Probabilistic and algorithmic approaches. Information Systems Research 9(3), 275–301 (1998)
Weijters, A.J.M.M., van der Aalst, W.M.P.: Rediscovering workflow models from event-based data using little thumb. Integrated Computer Aided Engineering 10, 151–162 (2003)
Agrawal, R., Gunopulos, D., Leymann, F.: Mining process models from workflow logs. In: Schek, H.-J., Saltor, F., Ramos, I., Alonso, G. (eds.) EDBT 1998. LNCS, vol. 1377, pp. 469–483. Springer, Heidelberg (1998)
Hwang, S., Yang, W.: On the discovery of process models from their instances. Decision Support Systems 34(1), 41–57 (2002)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 IFIP International Federation for Information Processing
About this paper
Cite this paper
van der Spoel, S., van Keulen, M., Amrit, C. (2013). Process Prediction in Noisy Data Sets: A Case Study in a Dutch Hospital. In: Cudre-Mauroux, P., Ceravolo, P., Gašević, D. (eds) Data-Driven Process Discovery and Analysis. SIMPDA 2012. Lecture Notes in Business Information Processing, vol 162. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-40919-6_4
Download citation
DOI: https://doi.org/10.1007/978-3-642-40919-6_4
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-40918-9
Online ISBN: 978-3-642-40919-6
eBook Packages: Computer ScienceComputer Science (R0)