A New Method for Random Initialization of the EM Algorithm for Multivariate Gaussian Mixture Learning
In the paper a new method for random initialization of the EM algorithm for multivariate Gaussian mixture models is proposed. In the method booth mean vector and covariance matrix of a mixture component are initialized randomly. The mean vector of the component is initialized by the feature vector, selected from a randomly chosen set of candidate feature vectors, located farthest from already initialized mixture components as measured by the Mahalanobis distance. In the experiments the EM algorithm was applied to the clustering problem. Our approach was compared to three well known EM initialization methods. The results of the experiments, performed on synthetic datasets, generated from the Gaussian mixtures with the varying degree of overlap between clusters, indicate that our method outperforms three others.
KeywordsFeature Vector Gaussian Mixture Model Mixture Component Random Initialization Initialization Method
Unable to display preview. Download preview PDF.
- 11.McQueen, J.: Some methods for classification and analysis of multivariate observations. In: Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probability, pp. 281–297 (1967)Google Scholar