Skip to main content

Learning Actions in Complex Software Systems

  • Conference paper
Book cover Data Warehousing and Knowledge Discovery (DaWaK 2011)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 6862))

Included in the following conference series:

  • 1255 Accesses

Abstract

Administering service-oriented architecture (SOA) systems could require sophisticated rules to decide for instance whether to add or remove servers and when. Rule construction often necessitates experts to study patterns that contribute to changes or events. This is a time consuming and error-prone process for complex software systems. In this paper we test the feasibility of automating this process by mining historical data such as past service requests (in time series) and server change events that the administrator committed. We propose a new method to relate frequent patterns in a given time series to changes recorded in the event’s history. We implemented and tested our method on a simulation system for SOA applications. First, we use Euclidean distance, DTW, and FastDTW to identify frequent patterns in a time series that represents performance metric of a SOA simulation system. Then, we calculate the confidence and support of frequent patterns that contribute to changes to identify a set of rules for automating changes. We tested rules that are generated using the proposed method in a training set on a testing set. The average accuracy of generated rules for the change event “remove” exceeded 80% in our experiments.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Lin, J., Keogh, E., Patel, P., Lonardi, S.: Finding motifs in time series. In: 8th ACM International Conference on Knowledge Discovery and Data Mining, Edmonton, Canada, pp. 53–68 (2002)

    Google Scholar 

  2. Arita, D., Yoshimatsu, H., Taniguchi, R.: Frequent motion pattern extraction for motion recognition in real-time human proxy. In: JSAI Workshop on Conversational Informatics, pp. 25–30 (2005)

    Google Scholar 

  3. Guyet, T., Garbay, C., Dojat, M.: Knowledge construction from time series data using a collaborative exploration system. Journal of Biomedical Informatics 40, 672–687 (2007)

    Article  Google Scholar 

  4. Androulakis, I.P., Wu, J., Vitolo, J., Roth, C.: Selecting maximally informative genes to enable temporal expression profiling analysis. FoSystems Biology in Engineering (2005)

    Google Scholar 

  5. McGovern, A., Rosendahl, D., Kruger, A., Beaton, M., Brown, R., Droegemeier, K.: Understanding the formation of tornadoes through data mining. In: 5th Conference on Artificial Intelligence and its Applications to Environmental Sciences at the American Meteorological Society (2007)

    Google Scholar 

  6. Keogh, E., Kasetty, S.: On the Need for Time Series Data Mining Benchmarks: A Survey and Empirical Demonstration. In: 8th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Edmonton, Canada, pp. 102–111 (2002)

    Google Scholar 

  7. Post, A.R., Harrison, J.H.: Temporal Data Mining. Clinics in Laboratory Medicine 28, 83–100 (2008)

    Article  Google Scholar 

  8. Morchen, F.: Time Series Knowledge Mining, Dissertation: Philipps-University Marburg, Germany (2006)

    Google Scholar 

  9. Salvador, S., Chan, P.: FastDTW: Toward accurate dynamic time warping in linear time and space. In: 3rd Workshop on Mining Temporal and Sequential Data, ACM KDD 2004, Seattle, Washington, USA (2004)

    Google Scholar 

  10. Kruskall, J.B., Liberman, M.: The Symmetric Time Warping Problem: From Continuous to Discrete. In: Time Warps, String Edits and Macromolecules: The Theory and Practice of Sequence Comparison, pp. 125–161. Addison-Wesley Publishing Co., Reading (1983)

    Google Scholar 

  11. Smit, M., Nisbet, A., Stroulia, E., Iszlai, G., Edgar, A.: Toward a simulation-generated knowledge base of service performance. In: 4th International Workshop on Middleware for Service Oriented Computing, New York, USA, pp. 19–24 (2009)

    Google Scholar 

  12. Rockwell, G.: Tapor: Building a portal for text analysis. In: Siemens, R., Moorman, D. (eds.) Mind Technologies, Humanities Computing and the Canadian Academic Community, pp. 285–299. University of Calgary Press, Calgary (2006)

    Google Scholar 

  13. Sakoe, H., Chiba, S.: Dynamic programming algorithm optimization for spoken word recognition. IEEE Transactions Acoustics, Speech, and Signal Processing 26 (1978)

    Google Scholar 

  14. Itakura, S.: Minimum Prediction Residual Principle Applied to Speech Recognition. IEEE Transactions Acoustics, Speech, and Signal 23, 5–72 (1975)

    Google Scholar 

  15. Keogh, E., Pazzani, M.: Scaling up Dynamic Time Warping for Datamining Applications. In: 6th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Boston, Massachuseetts, pp. 285–289 (2000)

    Google Scholar 

  16. Kim, S., Park, S., Chu, W.: An Index-based Approach for Similarity Search Supporting Time Warping in Large Sequence Databases. In: 17th International Conference on Data Engineering, Heidelberg, Germany, pp. 607–614 (2001)

    Google Scholar 

  17. Agrawal, R., Psaila, G., Wimmers, E.L., Zait, M.: Querying Shapes of Histories. In: 21st International Conference on Very Large Databases, Zurich, Switzerland, pp. 502–514 (1995)

    Google Scholar 

  18. Keogh, E., Chakrabarti, K., Pazzani, M., Mehrotra, S.: Locally Adaptive Dimensionality Reduction for Indexing Large Time Series Databases. In: Proceedings of ACM SIGMOD Conference on Management of Data, Santa Barbara, CA, USA, pp. 151–162 (2001)

    Google Scholar 

  19. Kalpakis, K., Gada, D., Puttagunta, V.: Distance Measures for Effective Clustering of ARIMA Time Series. In: Proceedings of the 2001 IEEE International Conference on Data Mining, San Jose, CA, pp. 273–280 (2001)

    Google Scholar 

  20. Fayyad, U., Reina, C., Bradley, P.: Initialization of Iterative Refinement Clustering Algorithms. In: Proceedings of the 4th International Conference on Knowledge Discovery and Data Mining, New York, NY, pp. 194–198 (1998)

    Google Scholar 

  21. Hetland, L., Saetrom, P.: Evolutionary rule mining in time series databases. Journal of Machine Learning 58, 107–125 (2005)

    Article  MATH  Google Scholar 

  22. Freitas, A.: Data Mining and Knowledge Discovery with Evolutionary Algorithms. Springer, Heidelberg (2002)

    Book  MATH  Google Scholar 

  23. Mannila, H., Toivonen, H., Verkamo, A.I.: Discovery of frequent episodes in event sequences. Journal of Data Mining and Knowledge Discovery 1, 259–289 (1997)

    Article  Google Scholar 

  24. Höppner, F., Klawonn, F.: Finding informative rules in interval sequences. In: Hoffmann, F., Adams, N., Fisher, D., Guimarães, G., Hand, D.J. (eds.) IDA 2001. LNCS, vol. 2189, pp. 125–134. Springer, Heidelberg (2001)

    Chapter  Google Scholar 

  25. Weiss, G.M., Hirsh, H.: Learning to predict rare events in event sequences. In: Agrawal, R., Stolorz, P., Piatetsky-Shapiro, G. (eds.) 4th International Conference on Knowledge Discovery and Data Mining (KDD), Menlo Park, USA, pp. 359–363 (1998)

    Google Scholar 

  26. Hetland, L., Saetrom, P.: Temporal rule discovery using genetic programming and specialized hardware. In: 4th International Conference on Recent Advances in Soft Computing (RASC), pp. 182–188 (2002)

    Google Scholar 

  27. Hipp, J., Guntzer, U., Nakhaeizadeh, G.: Algorithms for association rule mining – a general survey and comparison. SIGKDD Explorations 2, 58–64 (2000)

    Article  Google Scholar 

  28. Agrawal, R., Srikant, R.: Mining sequential patterns. In: ICDE, Taiwan (1995)

    Google Scholar 

  29. Lo, D., Khoo, S.C., Liu, C.: Efficient mining of iterative patterns for software specification discovery. In: 13th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD), San Jose, USA, pp. 460–469 (2007)

    Google Scholar 

  30. Mannila, H., Toivonen, H., Verkamo, A.I.: Discovery of frequent episodes in event sequences. Journal of Data Mining and Knowledge Discovery (DMKD) 1, 259–289 (1997)

    Article  Google Scholar 

  31. Lo, D., Cheng, H., Han, J., Khoo, S.C.: Classification of software behaviors for failure detection: A discriminative pattern mining approach. In: 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, New York, NY, USA, pp. 557–566 (2009)

    Google Scholar 

  32. Hatonen, K., Klemettinen, M., Mannila, H., Ronkainen, P., Toivonen, H.: Knowledge discovery from telecomunication network alarm databases. In: 12th International Conference of Data Engineering, ICDE (1996)

    Google Scholar 

  33. Agrawal, R., Mannila, H., Srikant, R., Toivonen, H., Verkamo, A.I.: Fast discovery of association rules. In: Fayyad, U.M., Piatetsky-Shapiro, G., Smyth, P., Uthurusamy, R. (eds.) Advances in Knowledge Discovery and Data Mining, pp. 307–328. AAAI Press, MIT Press, Menlo Park, Cambridge (1996)

    Google Scholar 

  34. Savasere, Omiecinski, E., Navathe, S.: An efficient algorithm for mining association rules in large databases. In: VLDB Conference, Zurich, Switzerland (1995)

    Google Scholar 

  35. Zaki, M.J., Parthasarathy, S., Ogihara, M., Li, W.: New algorithms for fast discovery of association rules. In: Proceedings of 3rd International Conference on Knowledge Discovery and Data Mining (1997)

    Google Scholar 

  36. Keogh, E., Lonardi, S., Chiu, W.: Finding Surprising Patterns in a Time Series Database in Linear Time and Space. In: 8th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Edmonton, Canada, pp. 550–556 (2002)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2011 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Golmohammadi, K., Smit, M., Zaiane, O.R. (2011). Learning Actions in Complex Software Systems. In: Cuzzocrea, A., Dayal, U. (eds) Data Warehousing and Knowledge Discovery. DaWaK 2011. Lecture Notes in Computer Science, vol 6862. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-23544-3_28

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-23544-3_28

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-23543-6

  • Online ISBN: 978-3-642-23544-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics