Abstract
Random partition models are widely used for clustering because of their appealing properties. However, additional information about group properties is not straightforward to incorporate under this approach. To overcome this difficulty, a novel approach to inference on clustering is presented. By relaxing the symmetry property of random partition distributions, we are able to include group sizes in the computation of the probabilities. A Bayesian model is also given, together with a sampling scheme, and it is tested on simulated and real datasets.
Acknowledgements
I would like to thank two anonymous referees for many helpful comments made on a previous version of the paper.
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Martínez, A.F. (2019). Clustering via Nonsymmetric Partition Distributions. In: Antoniano-Villalobos, I., Mena, R., Mendoza, M., Naranjo, L., Nieto-Barajas, L. (eds) Selected Contributions on Statistics and Data Science in Latin America. FNE 2018. Springer Proceedings in Mathematics & Statistics, vol 301. Springer, Cham. https://doi.org/10.1007/978-3-030-31551-1_6
DOI: https://doi.org/10.1007/978-3-030-31551-1_6
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-31550-4
Online ISBN: 978-3-030-31551-1
eBook Packages: Mathematics and Statistics, Mathematics and Statistics (R0)