Abstract
Historical transaction data are collected in many applications, e.g., patient histories recorded by physicians and customer transactions collected by companies. An important question is the learning of models upon the primary objects (patients, customers) rather than the transactions, especially when these models are subjected to drift.
We address this problem by combining advances of online clustering on multivariate data with the trajectory mining paradigm. We model the measurements of each individual primary object (e.g. its transactions), taken at irregular time intervals, as a trajectory in a high-dimensional feature space. Then, we cluster individuals with similar trajectories to identify sub-populations that evolve similarly, e.g. groups of customers that evolve similarly or groups of employees that have similar careers.
We assume that the multivariate trajectories are generated by drifting Gaussian Mixture Models. We study (i) an EM-based approach that clusters these trajectories incrementally as a reference method that has access to all the data for learning, and propose (ii) an online algorithm based on a Kalman filter that efficiently tracks the trajectories of Gaussian clusters. We show that while both methods approximate the reference well, the algorithm based on a Kalman filter is faster by one order of magnitude compared to the EM-based approach.
Chapter PDF
Similar content being viewed by others
Keywords
References
Buzan, D., Sclaroff, S., Kollios, G.: Extraction and clustering of motion trajectories in video. In: Proceedings of the 17th International Conference on Pattern Recognition, ICPR 2004, vol. 2, pp. 521–524 (2004)
Cadez, I.V., Gaffney, S., Smyth, P.: A general probabilistic framework for clustering individuals and objects. In: KDD 2000, pp. 140–149. ACM, New York (2000)
Chudova, D., Gaffney, S., Mjolsness, E., Smyth, P.: Translation-invariant mixture models for curve clustering. In: KDD 2003, pp. 79–88. ACM, New York (2003)
Ellis, D., Sommerlade, E., Reid, I.D.: Modelling pedestrian trajectory patterns with gaussian processes. In: VS 2009, pp. 1229–1234 (2009)
Funk, N.: A study of the kalman filter applied to visual tracking. Report (2003)
Gaffney, S., Smyth, P.: Trajectory clustering with mixtures of regression models. In: KDD 1999, pp. 63–72. ACM, New York (1999)
Han, Y., de Veth, J., Boves, L.: Trajectory clustering for automatic speech recognition (2005)
Kalman, R.E.: A New Approach to Linear Filtering and Prediction Problems. Trans. of the ASME – Journal of Basic Engineering 82(series D), 35–45 (1960)
Li, X., Wang, K., Wang, W., Li, Y.: A multiple object tracking method using kalman filter. In: IEEE, ICIA 2010, pp. 1862–1866 (2010)
Manning, C.D., Raghavan, P., Schütze, H.: Introduction to Information Retrieval. Cambridge University Press, Cambridge (2008)
Medeiros, H., Park, J., Kak, A.: Distributed object tracking using a cluster-based kalman filter in wireless camera networks. IEEE TSP 2 (2008)
Pathan, S.S., Al-Hamadi, A., Michaelis, B.: OIF - an online inferential framework for multi-object tracking with kalman filter. In: Jiang, X., Petkov, N. (eds.) CAIP 2009. LNCS, vol. 5702, pp. 1087–1095. Springer, Heidelberg (2009)
Welch, G., Bishop, G.: An introduction to the kalman filter. Tech Report (1995)
Xiong, G., Feng, C., Ji, L.: Dynamical gaussian mixture model for tracking elliptical living objects. Pattern Recognition Letters 27, 838–842 (2006), doi:10.1016/j.patrec.2005.11.015
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Krempl, G., Siddiqui, Z.F., Spiliopoulou, M. (2011). Online Clustering of High-Dimensional Trajectories under Concept Drift. In: Gunopulos, D., Hofmann, T., Malerba, D., Vazirgiannis, M. (eds) Machine Learning and Knowledge Discovery in Databases. ECML PKDD 2011. Lecture Notes in Computer Science(), vol 6912. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-23783-6_17
Download citation
DOI: https://doi.org/10.1007/978-3-642-23783-6_17
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-23782-9
Online ISBN: 978-3-642-23783-6
eBook Packages: Computer ScienceComputer Science (R0)