Skip to main content

Process Model Search Using Latent Semantic Analysis

  • Conference paper
  • First Online:

Part of the book series: Lecture Notes in Business Information Processing ((LNBIP,volume 281))

Abstract

Process model similarity measures can be utilized for searching process model collections, which is also called similarity-based search. While there are quite a lot of approaches, most of them base on an underlying alignment between the activities of the compared process models. Yet, according to the results of the process model matching contests conducted in recent years, such an alignment seems to be quite difficult to achieve. The Latent Semantic Analysis-based Similarity Search approach described in this paper circumvents the matching challenge by not requiring such a matching. Instead, it uses a Latent Semantic Analysis-based Similarity Measure to query model collections and retrieve similar models. An evaluation with a collection of 80 models resulted in very good results in terms of Precision, Recall, and F-Measure. The best F-Measure value obtained during the experiments was 0.92.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Notes

  1. 1.

    The JAVA API can be downloaded from butler.aifb.kit.edu/asc/LS3/ls3.html.

References

  1. Lau, C.K., Fournier, A.J., Xia, Y., Recker, J., Bernhard, E.: Process model repository governance at suncorp. Technical report, Queensland University of Technology (2011)

    Google Scholar 

  2. Song, L., Wang, J., Wen, L., Wang, W., Tan, S., Kong, H.: Querying process models based on the temporal relations between tasks. In: Proceedings of the 15th IEEE International EDOCW Workshops, pp. 213–222 (2011)

    Google Scholar 

  3. Dumas, M., García-Bañuelos, L., Dijkman, R.M.: Similarity search of business process models. IEEE Data Eng. Bull. 32(3), 23–28 (2009)

    Google Scholar 

  4. ter Hofstede, A.H.M., Ouyang, C., Rosa, M.L., Song, L., Wang, J., Polyvyanyy, A.: APQL: a process-model query language. In: Proceedings of the 1st AP-BPM Conference, pp. 23–38 (2013)

    Google Scholar 

  5. Becker, M., Laue, R.: A comparative survey of business process similarity measures. Comput. Ind. 63(2), 148–167 (2012)

    Article  Google Scholar 

  6. Antunes, G., Bakhshandeh, M., Borbinha, J., Cardoso, J., Dadashnia, S., Francescomarino, C.D., Dragoni, M., Fettke, P., Gal, A., Ghidini, C., Hake, P., Khiat, A., Klinkmüller, C., Kuss, E., Leopold, H., Loos, P., Meilicke, C., Niesen, T., Pesquita, C., Péus, T., Schoknecht, A., Sheetrit, E., Sonntag, A., Stuckenschmidt, H., Thaler, T., Weber, I., Weidlich, M.: The process model matching contest 2015. In: Proceedings of the 6th EMISA Workshop, pp. 127–155 (2015)

    Google Scholar 

  7. Reisig, W.: Understanding Petri Nets - Modeling Techniques, Analysis Methods, Case Studies. Springer, Heidelberg (2013)

    Book  MATH  Google Scholar 

  8. Deerwester, S.C., Dumais, S.T., Landauer, T.K., Furnas, G.W., Harshman, R.A.: Indexing by latent semantic analysis. J. Am. Soc. Inf. Sci. 41(6), 391–407 (1990)

    Article  Google Scholar 

  9. Dumais, S.T., Furnas, G.W., Landauer, T.K., Deerwester, S., Harshman, R.: Using latent semantic analysis to improve access to textual information. In: SIGCHI Conference on Human Factors in Computing Systems, pp. 281–285 (1988)

    Google Scholar 

  10. Landauer, T.K., Foltz, P.W., Laham, D.: An introduction to latent semantic analysis. Discourse Process. 25(2–3), 259–284 (1998)

    Article  Google Scholar 

  11. Dumais, S.T.: LSA and information retrieval: getting back to basics. In: Handbook of Latent Semantic Analysis, pp. 293–321. Lawrence Erlbaum Associates (2007)

    Google Scholar 

  12. Awad, A.: BPMN-Q: a language to query business processes. In: Proceedings of the 2nd EMISA Workshop, pp. 115–128 (2007)

    Google Scholar 

  13. Beeri, C., Eyal, A., Kamenkovich, S., Milo, T.: Querying business processes with BP-QL. Inf. Syst. 33(6), 477–507 (2008)

    Article  Google Scholar 

  14. Dijkman, R., Dumas, M., García-Bañuelos, L.: Graph matching algorithms for business process model similarity search. In: Dayal, U., Eder, J., Koehler, J., Reijers, H.A. (eds.) BPM 2009. LNCS, vol. 5701, pp. 48–63. Springer, Heidelberg (2009). doi:10.1007/978-3-642-03848-8_5

    Chapter  Google Scholar 

  15. Kunze, M., Weidlich, M., Weske, M.: Querying process models by behavior inclusion. Softw. Syst. Model. 14(3), 1105–1125 (2015)

    Article  Google Scholar 

  16. Gater, A., Grigori, D., Bouzeghoub, M.: Indexing process model flow dependencies for similarity search. In: Meersman, R., Panetto, H., Dillon, T., Rinderle-Ma, S., Dadam, P., Zhou, X., Pearson, S., Ferscha, A., Bergamaschi, S., Cruz, I.F. (eds.) OTM 2012. LNCS, vol. 7565, pp. 128–145. Springer, Heidelberg (2012). doi:10.1007/978-3-642-33606-5_9

    Chapter  Google Scholar 

  17. Yan, Z., Dijkman, R., Grefen, P.: Fast business process similarity search. Distrib. Parallel Databases 30(2), 105–144 (2012)

    Article  Google Scholar 

  18. Kastner, M., Wagdy Saleh, M., Wagner, S., Affenzeller, M., Jacak, W.: Heuristic methods for searching and clustering hierarchical workflows. In: Moreno-Díaz, R., Pichler, F., Quesada-Arencibia, A. (eds.) EUROCAST 2009. LNCS, vol. 5717, pp. 737–744. Springer, Heidelberg (2009). doi:10.1007/978-3-642-04772-5_95

    Chapter  Google Scholar 

  19. Awad, A., Polyvyanyy, A., Weske, M.: Semantic querying of business process models. In: Proceedings of the 12th EDOC Conference, pp. 85–94 (2008)

    Google Scholar 

  20. Qiao, M., Akkiraju, R., Rembert, A.J.: Towards efficient business process clustering and retrieval: combining language modeling and structure matching. In: Rinderle-Ma, S., Toumani, F., Wolf, K. (eds.) BPM 2011. LNCS, vol. 6896, pp. 199–214. Springer, Heidelberg (2011). doi:10.1007/978-3-642-23059-2_17

    Chapter  Google Scholar 

  21. Li, S., Cao, J.: A new similarity search approach on process models. In: Cao, J., Wen, L., Liu, X. (eds.) PAS 2014. CCIS, vol. 495, pp. 11–20. Springer, Heidelberg (2015). doi:10.1007/978-3-662-46170-9_2

    Google Scholar 

  22. Miller, G.A.: WordNet: a lexical database for English. Commun. ACM 38(11), 39–41 (1995)

    Article  Google Scholar 

  23. Malinova, M., Dijkman, R., Mendling, J.: Automatic extraction of process categories from process model collections. In: Lohmann, N., Song, M., Wohed, P. (eds.) BPM 2013. LNBIP, vol. 171, pp. 430–441. Springer, Cham (2014). doi:10.1007/978-3-319-06257-0_34

    Chapter  Google Scholar 

  24. Porter, M.F.: An algorithm for suffix stripping. Program 14(3), 130–137 (1980)

    Article  Google Scholar 

  25. Zaman, A.N.K., Brown, C.G.: Latent semantic indexing and large dataset: study of term-weighting schemes. In: Proceedings of the 5th ICDIM Conference, pp. 1–4 (2010)

    Google Scholar 

  26. van Rijsbergen, C.J.: Information Retrieval, 2nd edn. Butterworth-Heinemann, Newton (1979)

    MATH  Google Scholar 

  27. Turney, P.D., Pantel, P.: From frequency to meaning: vector space models of semantics. J. Artif. Intell. Res. (JAIR) 37, 141–188 (2010)

    MathSciNet  MATH  Google Scholar 

  28. Vogelaar, J.J.C.L., Verbeek, H.M.W., Luka, B., Aalst, W.M.P.: Comparing business processes to determine the feasibility of configurable models: a case study. In: Daniel, F., Barkaoui, K., Dustdar, S. (eds.) BPM 2011. LNBIP, vol. 100, pp. 50–61. Springer, Heidelberg (2012). doi:10.1007/978-3-642-28115-0_6

    Chapter  Google Scholar 

  29. Hu, X., Cai, Z., Wiemer-Hastings, P., Graesser, A.C., McNamara, D.S.: Strengths, limitations, and extensions of LSA. In: Handbook of Latent Semantic Analysis, pp. 401–426. Lawrence Erlbaum Associates (2007)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Andreas Schoknecht .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer International Publishing AG

About this paper

Cite this paper

Schoknecht, A., Fischer, N., Oberweis, A. (2017). Process Model Search Using Latent Semantic Analysis. In: Dumas, M., Fantinato, M. (eds) Business Process Management Workshops. BPM 2016. Lecture Notes in Business Information Processing, vol 281. Springer, Cham. https://doi.org/10.1007/978-3-319-58457-7_21

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-58457-7_21

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-58456-0

  • Online ISBN: 978-3-319-58457-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics