Abstractions in Process Mining: A Taxonomy of Patterns
Process mining refers to the extraction of process models from event logs. Real-life processes tend to be less structured and more flexible. Traditional process mining algorithms have problems dealing with such unstructured processes and generate spaghetti-like process models that are hard to comprehend. One reason for such a result can be attributed to constructing process models from raw traces without due pre-processing. In an event log, there can be instances where the system is subjected to similar execution patterns/behavior. Discovery of common patterns of invocation of activities in traces (beyond the immediate succession relation) can help in improving the discovery of process models and can assist in defining the conceptual relationship between the tasks/activities.
In this paper, we characterize and explore the manifestation of commonly used process model constructs in the event log and adopt pattern definitions that capture these manifestations, and propose a means to form abstractions over these patterns. We also propose an iterative method of transformation of traces which can be applied as a pre-processing step for most of today’s process mining techniques. The proposed approaches are shown to identify promising patterns and conceptually-valid abstractions on a real-life log. The patterns discussed in this paper have multiple applications such as trace clustering, fault diagnosis/anomaly detection besides being an enabler for hierarchical process discovery.
KeywordsMaximal Element Edit Distance Tandem Array Abstract Entity Process Instance
Unable to display preview. Download preview PDF.
- 2.de Medeiros, A.K.A., van der Aalst, W., Pedrinaci, C.: Semantic Process Mining Tools: Core Building Blocks. In: 16th European Conference on Information Systems, pp. 1953–1964 (2008)Google Scholar
- 4.Bose, R.P.J.C., van der Aalst, W.: Context Aware Trace Clustering: Towards Improving Process Mining Results. In: SIAM International Conference on Data Mining, pp. 401–412 (2009)Google Scholar
- 6.Kolpakov, K.: Finding Maximal Repetitions in a Word in Linear Time. In: IEEE Symposium on Foundations of Computer Science (FOCS), pp. 596–604 (1999)Google Scholar
- 11.Greco, G., Guzzo, A., Pontieri, L.: Mining Hierarchies of Models: From Abstract Views to Concrete Specifications. In: Business Process Management, pp. 32–47 (2005)Google Scholar
- 13.Polyvyanyy, A., Smirnov, S., Weske, M.: Process Model Abstraction: A Slider Approach. In: Enterprise Distributed Object Computing, pp. 325–331 (2008)Google Scholar
- 14.Günther, C.W., van der Aalst, W.M.P.: Fuzzy Mining - Adaptive Process Simplification Based on Multi-perspective Metrics. In: Business Process Management, pp. 328–343 (2007)Google Scholar