Abstract
We describe an original implementation of k-Maximum Likelihood Estimator (k-MLE)[1], a fast algorithm for learning finite statistical mixtures of exponential families. Our version converges to a local maximum of the complete likelihood while guaranteeing not to have empty clusters. To initialize k-MLE, we propose a careful and greedy strategy inspired by k-means++ which selects automatically cluster centers and their number. The paper gives all details for using k-MLE with mixtures of Wishart (WMMs). Finally, we propose to use the Cauchy-Schwartz divergence as a comparison measure between two WMMs and give a general methodology for building a motion retrieval system.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Nielsen, F.: k-MLE: A fast algorithm for learning statistical mixture models. In: International Conference on Acoustics, Speech and Signal Processing, pp. 869–872 (2012)
McLachlan, G.J., Krishnan, T.: The EM Algorithm and Extensions, 2nd edn. Wiley Series in Probability and Statistics. Wiley-Interscience (2008)
Banerjee, A., Merugu, S., Dhillon, I.S., Ghosh, J.: Clustering with bregman divergences. Journal of Machine Learning Research (6), 1705–1749 (2005)
Wishart, J.: The generalised product moment distribution in samples from a Normal multivariate population. Biometrika 20A(1/2), 32–52 (1928)
Nielsen, F., Garcia, V.: Statistical exponential families: A digest with flash cards (November 2009), http://arxiv.org/abs/0911.4863
Ji, S., Krishnapuram, B., Carin, L.: Variational Bayes for continuous hidden Markov models and its application to active learning. IEEE Transactions on Pattern Analysis and Machine Intelligence 28(4), 522–532 (2006)
Hidot, S., Saint Jean, C.: An Expectation-Maximization algorithm for the Wishart mixture model: Application to movement clustering. Pattern Recognition Letters 31(14), 2318–2324 (2010)
Hartigan, J.A., Wong, M.A.: Algorithm AS 136: A k-means clustering algorithm. Journal of the Royal Statistical Society. Series C (Applied Statistics) 28(1), 100–108 (1979)
Telgarsky, M., Vattani, A.: Hartigan’s method: k-means clustering without Voronoi. In: Proc. of International Conference on Artificial Intelligence and Statistics (AISTATS), pp. 820–827 (2010)
Kulis, B., Jordan, M.I.: Revisiting k-means: New algorithms via Bayesian nonparametrics. In: International Conference on Machine Learning, ICML (2012)
Nielsen, F.: Closed-form information-theoretic divergences for statistical mixtures. In: International Conference on Pattern Recognition (ICPR), pp. 1723–1726 (2012)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Saint-Jean, C., Nielsen, F. (2013). A New Implementation of k-MLE for Mixture Modeling of Wishart Distributions. In: Nielsen, F., Barbaresco, F. (eds) Geometric Science of Information. GSI 2013. Lecture Notes in Computer Science, vol 8085. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-40020-9_26
Download citation
DOI: https://doi.org/10.1007/978-3-642-40020-9_26
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-40019-3
Online ISBN: 978-3-642-40020-9
eBook Packages: Computer ScienceComputer Science (R0)