Abstractions in Process Mining: A Taxonomy of Patterns

Jagadeesh Chandra Bose, R. P.; van der Aalst, Wil M. P.

doi:10.1007/978-3-642-03848-8_12

R. P. Jagadeesh Chandra Bose^20,21 &
Wil M. P. van der Aalst²⁰

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 5701))

Included in the following conference series:

International Conference on Business Process Management

3303 Accesses
77 Citations

Abstract

Process mining refers to the extraction of process models from event logs. Real-life processes tend to be less structured and more flexible. Traditional process mining algorithms have problems dealing with such unstructured processes and generate spaghetti-like process models that are hard to comprehend. One reason for such a result can be attributed to constructing process models from raw traces without due pre-processing. In an event log, there can be instances where the system is subjected to similar execution patterns/behavior. Discovery of common patterns of invocation of activities in traces (beyond the immediate succession relation) can help in improving the discovery of process models and can assist in defining the conceptual relationship between the tasks/activities.

In this paper, we characterize and explore the manifestation of commonly used process model constructs in the event log and adopt pattern definitions that capture these manifestations, and propose a means to form abstractions over these patterns. We also propose an iterative method of transformation of traces which can be applied as a pre-processing step for most of today’s process mining techniques. The proposed approaches are shown to identify promising patterns and conceptually-valid abstractions on a real-life log. The patterns discussed in this paper have multiple applications such as trace clustering, fault diagnosis/anomaly detection besides being an enabler for hierarchical process discovery.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

van der Aalst, W., Weijters, A., Maruster, L.: Workflow Mining: Discovering Process Models from Event Logs. IEEE Trans. Knowl. Data Eng. 16(9), 1128–1142 (2004)
Article Google Scholar
de Medeiros, A.K.A., van der Aalst, W., Pedrinaci, C.: Semantic Process Mining Tools: Core Building Blocks. In: 16th European Conference on Information Systems, pp. 1953–1964 (2008)
Google Scholar
Ristad, E.S., Yianilos, P.N.: Learning String-Edit Distance. IEEE Trans. PAMI 20(5), 522–532 (1998)
Article Google Scholar
Bose, R.P.J.C., van der Aalst, W.: Context Aware Trace Clustering: Towards Improving Process Mining Results. In: SIAM International Conference on Data Mining, pp. 401–412 (2009)
Google Scholar
Gusfield, D.: Algorithms on Strings, Trees, and Sequences: Computer Science and Computational Biology. Cambridge University Press, Cambridge (1997)
Book MATH Google Scholar
Kolpakov, K.: Finding Maximal Repetitions in a Word in Linear Time. In: IEEE Symposium on Foundations of Computer Science (FOCS), pp. 596–604 (1999)
Google Scholar
Cheung, C.F., Yu, J.X., Lu, H.: Constructing Suffix Tree for Gigabyte Sequences with Megabyte Memory. IEEE Trans. Knowl. Data Eng. 17(1), 90–105 (2005)
Article Google Scholar
Gusfield, D., Stoye, J.: Linear Time Algorithms for Finding and Representing all the Tandem Repeats in a String. Journal of Computer and System Sciences 69, 525–546 (2004)
Article MathSciNet MATH Google Scholar
Sokol, D., Benson, G., Tojeira, J.: Tandem Repeats Over the Edit Distance. Bioinformatics 23(2), e30–e36 (2007)
Article Google Scholar
Ukkonen, E.: On-Line Construction of Suffix Trees. Algorithmica 14(3), 249–260 (1995)
Article MathSciNet MATH Google Scholar
Greco, G., Guzzo, A., Pontieri, L.: Mining Hierarchies of Models: From Abstract Views to Concrete Specifications. In: Business Process Management, pp. 32–47 (2005)
Google Scholar
Greco, G., Guzzo, A., Pontieri, L.: Mining Taxonomies of Process Models. Data Knowl. Eng. 67(1), 74–102 (2008)
Article Google Scholar
Polyvyanyy, A., Smirnov, S., Weske, M.: Process Model Abstraction: A Slider Approach. In: Enterprise Distributed Object Computing, pp. 325–331 (2008)
Google Scholar
Günther, C.W., van der Aalst, W.M.P.: Fuzzy Mining - Adaptive Process Simplification Based on Multi-perspective Metrics. In: Business Process Management, pp. 328–343 (2007)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Mathematics and Computer Science, University of Technology, Eindhoven, The Netherlands
R. P. Jagadeesh Chandra Bose & Wil M. P. van der Aalst
Philips Healthcare, Veenpluis 5-6, Best, The Netherlands
R. P. Jagadeesh Chandra Bose

Authors

R. P. Jagadeesh Chandra Bose
View author publications
You can also search for this author in PubMed Google Scholar
Wil M. P. van der Aalst
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Hewlett-Packard Laboratories, 1501 Page Mill Rd., 94304, Palo Alto, CA, USA
Umeshwar Dayal
Alps Adria University Klagenfurt, Universitätsstr. 65, 9020, Klagenfurt, Austria
Johann Eder
IBM Zurich Research Laboratory, Saeumerstr. 4, 8803, Rüschlikon, Switzerland
Jana Koehler
Eindhoven University of Technology, PO Box 513, 5600, Eindhoven, MB, The Netherlands
Hajo A. Reijers

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Jagadeesh Chandra Bose, R.P., van der Aalst, W.M.P. (2009). Abstractions in Process Mining: A Taxonomy of Patterns. In: Dayal, U., Eder, J., Koehler, J., Reijers, H.A. (eds) Business Process Management. BPM 2009. Lecture Notes in Computer Science, vol 5701. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-03848-8_12

Download citation

DOI: https://doi.org/10.1007/978-3-642-03848-8_12
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-03847-1
Online ISBN: 978-3-642-03848-8
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics