Abstract
The detection of structural similarities of process models is frequently discussed in the literature. The state-of-the-art approaches for structural similarities of process models presume a known subgraph that is searched in a larger graph, and utilize behavioral and textual semantics to achieve their goal. In this paper we propose an approach to detect reoccurring structures in a collection of BPMN 2.0 process models, without the knowledge of a subgraph to be searched, and by focusing solely on the structural characteristics of the process models. The proposed approach deals with the problems of subgraph isomorphism, frequent pattern discovery and maximum common subgraph isomorphism, which are mentioned as NP-hard in the literature. In this work we present a formal model and a novel algorithm for the detection of reoccurring structures in a collection of BPMN 2.0 process models. We then apply the algorithm to a collection of 1,806 real-world process models and provide a quantitative and qualitative analysis of the results.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
van der Aalst, W.M.P., et al.: Workflow patterns. Distrib. Parallel Databases 14(1), 5–51 (2003)
Breuker, D., Delfmann, P., Dietrich, H.A., Steinhorst, M.: Graph theory and model collection management: conceptual framework and runtime analysis of selected graph algorithms. Inf. Syst. e-Bus. Manag. 13(1), 69–106 (2014)
Burattin, A.: Process Mining Techniques in Business Environments. LNBIP, vol. 207. Springer, Heidelberg (2015)
Cordella, L.P., et al.: A (sub)graph isomorphism algorithm for matching large graphs. IEEE Trans. Pattern Anal. Mach. Intell. 26(10), 1367–1372 (2004)
Dijkman, R.M., et al.: Similarity of business process models: metrics and evaluation. Inf. Syst. 36(2), 498–516 (2011)
Dijkman, R.M., et al.: A short survey on process model similarity. In: Bubenko, J., Krogstie, J., Pastor, O., Pernici, B., Rolland, C., Sølvberg, A. (eds.) Seminal Contributions to Information Systems Engineering, pp. 421–427. Springer, Heidelberg (2013). doi:10.1007/978-3-642-36926-1_34
Dumas, M., et al.: Fast detection of exact clones in business process model repositories. Inf. Syst. 38(4), 619–633 (2013)
Ekanayake, C.C., Rosa, M., Hofstede, A.H.M., Fauvet, M.-C.: Fragment-based version management for repositories of business process models. In: Meersman, R., Dillon, T., Herrero, P., Kumar, A., Reichert, M., Qing, L., Ooi, B.-C., Damiani, E., Schmidt, D.C., White, J., Hauswirth, M., Hitzler, P., Mohania, M. (eds.) OTM 2011. LNCS, vol. 7044, pp. 20–37. Springer, Heidelberg (2011). doi:10.1007/978-3-642-25109-2_3
Ekanayake, C.C., Dumas, M., García-Bañuelos, L., Rosa, M., Hofstede, A.H.M.: Approximate clone detection in repositories of business process models. In: Barros, A., Gal, A., Kindler, E. (eds.) BPM 2012. LNCS, vol. 7481, pp. 302–318. Springer, Heidelberg (2012). doi:10.1007/978-3-642-32885-5_24
Feitelson, D.G.: Workload Modeling for Computer Systems Performance Evaluation. Cambridge University Press, New York (2015)
Ferme, V., et al.: A container-centric methodology for benchmarking workflow management systems. In: Proceedings of CLOSER 2016. Springer, Heidelberg (2016)
Grossmann, W., Rinderle-Ma, S.: Fundamentals of Business Intelligence. Data-Centric Systems and Applications. Springer, Heidelberg (2015)
Gupta, A., Nishimura, N.: The Complexity of Subgraph Isomorphism: Duality Results for Graphs of Bounded Path- and Tree-Width. Technical Report CS-95-14. Univ. of Waterloo, Comp. Sc. Dep., Waterloo, Ontario, Canada (1995)
Hertis, M., Juric, M.: An empirical analysis of business process execution language usage. IEEE Trans. Softw. Eng. 40(8), 738–757 (2014)
Ivanov, S., et al.: BPMNDiffViz: a tool for bpmn models comparison. In: BPM (Demos). CEUR-WS.org (2015)
Jordan, D., Evdemon, J.: Business Process Model And Notation (BPMN) version 2.0. Object Management Group, Inc. http://www.omg.org/spec/BPMN/2.0/
La Rosa, M., et al.: APROMORE: an advanced process model repository. Expert Syst. Appl. 38(6), 7029–7040 (2011)
Muehlen, M., Recker, J.: How much language is enough? theoretical and practical use of the business process modeling notation. In: Bellahsène, Z., Léonard, M. (eds.) CAiSE 2008. LNCS, vol. 5074, pp. 465–479. Springer, Heidelberg (2008). doi:10.1007/978-3-540-69534-9_35
Nijssen, S., Kok, J.N.: The gaston tool for frequent subgraph mining. Electron. Notes Theor. Comput. Sci. 127(1), 77–87 (2005)
OASIS: Web Services Business Process Execution Language Version 2.0 - OASIS Standard (2007)
Pietsch, P., Wenzel, S.: Comparison of BPMN2 diagrams. In: Mendling, J., Weidlich, M. (eds.) BPMN 2012. LNBIP, vol. 125, pp. 83–97. Springer, Heidelberg (2012). doi:10.1007/978-3-642-33155-8_7
Pittke, F., Leopold, H., Mendling, J., Tamm, G.: Enabling reuse of process models through the detection of similar process parts. In: Rosa, M., Soffer, P. (eds.) BPM 2012. LNBIP, vol. 132, pp. 586–597. Springer, Heidelberg (2013). doi:10.1007/978-3-642-36285-9_59
Polyvyanyy, A., Vanhatalo, J., Völzer, H.: Simplified computation and generalization of the refined process structure tree. In: Bravetti, M., Bultan, T. (eds.) WS-FM 2010. LNCS, vol. 6551, pp. 25–41. Springer, Heidelberg (2011). doi:10.1007/978-3-642-19589-1_2
Schumm, D., et al.: Integrating compliance into business processes: process fragments as reusable compliance controls. In: MKWI (2010)
Skouradaki, M., Leymann, F.: Detecting frequently recurring structures in BPMN 2.0 process models. In: SummerSOC. IBM (2015)
Skouradaki, M., et al.: Application of sub-graph isomorphism to extract reoccurring structures from BPMN 2.0 process models. In: Proceedings of SOSE 2015. IEEE (2015)
Skouradaki, M., et al.: On the road to benchmarking BPMN 2.0 workflow engines. In: IProceedings of ICPE 2015 (2015)
Ullmann, J.R.: An algorithm for subgraph isomorphism. ACM 23(1), 31–42 (1976)
Valiente, G.: Algorithms on Trees and Graphs. Springer, Berlin (2002)
Valiente, G., Martínez, C.: An algorithm for graph pattern-matching. In: South American Workshop on String Processing. Carleton University Press (1997)
Yan, X., Han, J.: gSpan: graph-based substructure pattern mining. In: Proceedings of ICDM 2002. Institute of Electrical & Electronics Engineers (IEEE) (2002)
Yan, Z., Dijkman, R., Grefen, P.: Generating synthetic process model collections with properties of labeled real-life models. In: Ouyang, C., Jung, J.-Y. (eds.) AP-BPM 2014. LNBIP, vol. 181, pp. 74–88. Springer, Heidelberg (2014). doi:10.1007/978-3-319-08222-6_6
Acknowledgments
This work is funded by the “BenchFlow – A Benchmark for Workflow Management Systems” project DACH Grant No. 200021E-145062/1 and by the “SmartOrchestra” (01MD16001F) project funded by the BMWi.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing AG
About this paper
Cite this paper
Skouradaki, M., Andrikopoulos, V., Kopp, O., Leymann, F. (2016). RoSE: Reoccurring Structures Detection in BPMN 2.0 Process Model Collections. In: Debruyne, C., et al. On the Move to Meaningful Internet Systems: OTM 2016 Conferences. OTM 2016. Lecture Notes in Computer Science(), vol 10033. Springer, Cham. https://doi.org/10.1007/978-3-319-48472-3_15
Download citation
DOI: https://doi.org/10.1007/978-3-319-48472-3_15
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-48471-6
Online ISBN: 978-3-319-48472-3
eBook Packages: Computer ScienceComputer Science (R0)