Abstract
Learning to predict significant events from sequences of data with categorical features is an important problem in many application areas. We focus on events for system management, and formulate the problem of prediction as a classification problem. We perform co-occurrence analysis of events by means of Singular Value Decomposition (SVD) of the examples constructed from the data. This process is combined with Support Vector Machine (SVM) classification, to obtain efficient and accurate predictions. We conduct an analysis of statistical properties of event data, which explains why SVM classification is suitable for such data, and perform an empirical study using real data.
Chapter PDF
Similar content being viewed by others
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
References
Bi, Z., Faloutsos, C., Korn, F. (2001). The “DGX” Distribution for Mining Massive, Skewed Data. International Conference on Knowledge Discovery and Data Mining.
Brockwell, P. J., Davis, R. (1996). Introduction to Time-Series and Forecasting, Springer-Verlag.
Cauwenberghs, G., Poggio, T. (2000). Incremental and Decremental Support Vector Machine Learning. NIPS.
Chandrasekharan, S., Manjunath, B. S., Wang, Y. F., Winkeler, J., Zhang, H.(1997). An eigenspace update algorithm for image analysis. Journal of graphical models and image processing, 321–332, 59(5).
Deerwester, S., Dumais, S. T., Furnas, G. W., Landauer, T. K., Harshman, R. A. (1990). Indexing by latent semantic analysis, Journal of the American Society for Information Science, 391–407, 41(6).
Degroat, R., Roberts, R.(1990). Efficient numerically stabilized rank-one eigenstructure updating. IEEE transactions on acoustic and signal processing, 38(2).
Joachims, T.(1999). Making large-scale SVM learning practical. Advances in Kernel Methods — Support Vector Learning, MIT-Press.
Joachims, T.(2000). The maximum margin approach to learning text classifiers: Methods, theory, and algorithms. Doctoral dissertation, Universität Dortmund, Informatik, LS VIII.
Kanth, K. V. R., Agrawal, D., Singh, A.(1998). Dimensionality Reduction for Similarity Searching in Dynamic Databases. ACM SIGMOD.
Mannila, H., Toivonen, H., Verkamo, A. I.(1995). Discovering frequent episodes in sequences. International Conference on Knowledge Discovery and Data Mining.
Mitra, P., Murthy, C. A., Pal, S. K. (2000). Data Condensation in Large Databases by Incremental Learning with Support Vector Machines. International Conference on Pattern Recognition.
Platt, J. C.(1999). Fast Training of Support Vector Machines using Sequential Minimal Optimization. Advances in Kernel Methods, MITPress, 185–208.
Syed, N. A., Liu, H., Sung, K. K.(1999). Incremental Learning with Support Vector Machines. International Joint Conference on Artificial Intelligence.
Vilalta, R., Ma, S., Hellerstein, J.(2001). Rule induction of computer events. IEEE International Workshop on Distributed Systems: Operations & Management, Springer Verlag, Lecture Notes in Computer Science.
Weiss, G., Hirsh, H.(1998). Learning to predict rare events in event sequences. International Conference on Knowledge Discovery and Data Mining.
Zaki, M. J.(2001). Sequence mining in categorical domains. Sequence Learning: Paradigms, Algorithms, and Applications, pp. 162–187, Springer Verlag, Lecture Notes in Computer Science.
Zipf, G. K.(1949). Human behavior and the principle of least effort: An introduction to human ecology. Addison-Wesley Press.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2002 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Domeniconi, C., Perng, Cs., Vilalta, R., Ma, S. (2002). A Classification Approach for Prediction of Target Events in Temporal Sequences. In: Elomaa, T., Mannila, H., Toivonen, H. (eds) Principles of Data Mining and Knowledge Discovery. PKDD 2002. Lecture Notes in Computer Science, vol 2431. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45681-3_11
Download citation
DOI: https://doi.org/10.1007/3-540-45681-3_11
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-44037-6
Online ISBN: 978-3-540-45681-0
eBook Packages: Springer Book Archive