Abstract
We present in this paper a new approach for unsupervised feature selection for non Gaussian data controlled by a finite mixture of generalized Dirichlet distributions. We model each feature by a mixture of two Beta distributions: one relevant and depends on component labels while the second distribution is uninformative for the clustering. The relevance of each feature is then quantified by the mixture weight associated to the relevant Beta distribution. Experiments in summarizing image collections have shown the merits of our approach.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Aha, D.W., Bankert, R.L.: A comparative evaluation of sequential feature selection algorithms. In: Fifth International Workshop on Artificial Intelligence and Statistics (1995)
Blum, A.L., Langley, P.: Selection of relevant features and examples in machine learning. Artificial Intelligence 97(1-2), 245–271 (1997)
Bouguila, N., Ziou, D.: A Hybrid SEM Algorithm for High-Dimensional Unsupervised Learning Using a Finite Generalized Dirichlet Mixture. IEEE Transactions on Image Processing 15(9), 1785–1803 (2006)
Bouguila, N., Ziou, D.: Unsupervised Selection of a Finite Dirichlet Mixture Model: An MML-Based Approach. IEEE Transactions on Knowledge and Data Engineering 18(8), 993–1009 (2006)
Bouguila, N., Ziou, D.: High-Dimensional Unsupervised Selection and Estimation of a Finite Generalized Dirichlet Mixture Model Based on Minimum Message Length. IEEE Trans. on PAMI (2007)
Bouguila, N., Ziou, D., Vaillancourt, J.: Unsupervised Learning of a Finite Mixture Model Based on the Dirichlet Distribution and its Applications. IEEE Transactions on Image Processing 13(11), 1533–1543 (2004)
Boutemedjet, S., Ziou, D.: Visual Aspect: A Unified Content-Based Collaborative Filtering Model for Visual Document Recommendation. In: Proceeding of International Conference on Image Analysis and Recognition, pp. 685–696 (2006)
Boutemedjet, S., Ziou, D.: Content-based collaborative filtering model for scalable visual document recommendation. In: IJCAI-2007 Workshop on Multimodal Information Retrieval, pp. 11–18 (2007)
Dy, J.G., Brodley, C.E.: Feature selection for unsupervised learning. Journal of Machine Learning Research 5, 845–889 (2004)
Fielitz, B.D., Myers, B.L.: Estimation of parameters in the beta distribution. Decision Sciences 6, 1–13 (1975)
Figueiredo, M.A.T., Jain, A.K.: Unsupervised learning of finite mixture models. IEEE Trans. on PAMI 24(3), 4–37 (2002)
Friedman, J.H., Meulman, J.J.: Clustering objects on subsets of attributes. Journal of Royal Statistical Society-B 66(Part-4), 1–25 (2004)
Graham, M.W., Miller, D.J.: Unsupervised learning of parsimonious mixtures on large spaces with integrated feature and component selection. IEEE Transactions on Signal Processing 54(4) (2006)
Guyon, I., Elisseeff, A.: An Introduction to Variable and Feature Selection. Journal of Machine Learning Research 3, 1157–1182 (2003)
Huang, J., Kumar, S.R., Mitra, M., Zhu, W.J., Zabih, R.: Image indexing using color correlograms. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, IEEE Computer Society Press, Los Alamitos (1997)
Jain, A., Vailaya, A.: Image retrieval using color and shape. Pattern Recognition 29(8), 1233–1244 (1996)
Law, M.H.C., Figueiredo, M.A.T., Jain, A.K.: Simultaneous feature selection and clustering using mixture models. IEEE Trans. on PAMI 26(9) (2004)
Liu, T., Liu, S., Chen, Z., Ma, W.Y.: An evaluation on feature selection for text clustering. In: Twentieth International Conference on Machine Learning (2003)
McLachlan, G., Peel, D.: Finite Mixture Models. John Wiley and Sons, Chichester (2000)
Rissanen, J.: The mpeg-7 visual standard for content description: an overview. IEEE Transactions on Circuits and Systems for Video Technology 11(6), 696–702 (2001)
Schwarz, G.: Estimating the dimension of a model. The Annals of Statistics 6(2), 461–464 (1978)
Wallace, C.: Statistical and Inductive Inference by Minimum Message Length. Information Science and Statistics. Springer, Heidelberg (2005)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Boutemedjet, S., Bouguila, N., Ziou, D. (2007). Unsupervised Feature and Model Selection for Generalized Dirichlet Mixture Models. In: Kamel, M., Campilho, A. (eds) Image Analysis and Recognition. ICIAR 2007. Lecture Notes in Computer Science, vol 4633. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-74260-9_30
Download citation
DOI: https://doi.org/10.1007/978-3-540-74260-9_30
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-74258-6
Online ISBN: 978-3-540-74260-9
eBook Packages: Computer ScienceComputer Science (R0)