Abstract
Consider a population of individuals belonging to an infinity number of types, and assume that type proportions follow the Poisson-Dirichlet distribution with parameter α ∈ [0,1) and 𝜃 > −α. Given a sample of size n from the population, two important statistics are the number Kn of different types in the sample, and the number Ml,n of different types with frequency l in the sample. We establish moderate deviation principles for (Kn)n≥ 1 and (Ml,n)n≥ 1. Corresponding rate functions are explicitly identified, which help in revealing a critical scale and in understanding the exact role of the parameters α and 𝜃.
Similar content being viewed by others
References
Barbour, A. and Gnedin, A. (2009). Small counts in the infinite occupancy scheme. Electron. J. Probab. 14, 365–384.
Ben-Hamou, A., Boucheron, S. and Ohannessian, M.I. (2016). Concentration inequalities in the infinite urn scheme for occupancy counts and the missing mass, with applications. Bernoulli 23, 249–287.
Crane, H. (2016). The ubiquitous Ewens sampling formula. Statist. Sci. 31, 1–19.
Dembo, A. and Zeitouni, O. (1998). Large deviations techniques and applications. Springer, New York.
Favaro, S. and Feng, S. (2015). Large deviation principles for the Ewens-Pitman sampling model. Electron. J. Probab. 20, 1–27.
Favaro, S., Lijoi, A., Mena, R.H. and Prünster, I. (2009). Bayesian nonparametric inference for species variety with a two parameter Poisson-Dirichlet process prior. J. Roy. Statist. Soc. Ser. B 71, 993–1008.
Favaro, S., Lijoi, A. and Prünster, I. (2013). Conditional formulae for Gibbs-type exchangeable random partitions. Ann. Appl. Probab. 23, 1721–1754.
Feng, S. and Hoppe, F.M. (1998). Large deviation principles for some random combinatorial structures in population genetics and Brownian motion. Ann. Appl. Probab. 8, 975–994.
Gnedin, A., Hansen, B. and Pitman, J. (2007). Notes on the occupancy problem with infinitely many boxes: general asymptotic and power laws. Probab. Surv 4, 146–171.
Goncharov, V.L. (1944). Some facts from combinatorics. Izvestia Akad. Nauk. SSSR, Ser. Mat. 8, 3–48.
Hansen, J.C. (1990). A functional central limit theorem for the Ewens sampling formula. J. Appl. Probab 27, 28–43.
Hwang, H. and Janson, S. (2008). Local limit theorems for finite and infinite urn models. Ann. Probab 36, 992–1022.
Karlin, S. (1967). Central limit theorems for certain infinite urn schemes. J. Math. and Mech. 17, 373–401.
Kingman, J.F.C. (1975). Random discrete distributions. J. Roy. Statist. Soc. Ser. B 37, 1–22.
Korwar, R.M. and Hollander, M. (1973). Contributions to the theory of Dirichlet processes. Ann. Probab. 1, 705–711.
Lijoi, A., Prünster, I. and Walker, S.G. (2008). Bayesian nonparametric estimators derived from conditional Gibbs structures. Ann. Appl. Probab. 18, 1519–1547.
Perman, M., Pitman, J. and Yor, M. (1992). Size-biased sampling of Poisson point processes and excursions. Probab. Theory Related Fields 92, 21–39.
Pitman, J. (1995). Exchangeable and partially exchangeable random partitions. Probab. Theory Related Fields 102, 145–158.
Pitman, J. (1996). Some developments of the Blackwell-MacQueen urn scheme. In Statistics, probability and game theory. (T. S. Ferguson, L. S. Shapley and J. B. MacQueen, eds.). Institute of Mathematical Statistics, Hayward, pp. 245–267.
Pitman, J. and Yor, M. (1997). The two parameter Poisson-Dirichlet distribution derived from a stable subordinator. Ann. Probab. 25, 855–900.
Pitman, J. (2006). Combinatorial stochastic processes. Ecole d’eté de probabilités de Saint-Flour XXXII. Lecture notes in mathematics, N 1875. Springer, New York.
Acknowledgements
The authors are grateful to two anonymous Referees for valuable remarks. The authors acknowledge the Banff International Research Station (BIRS), Canada, where this project has been completed during the Research in Team Programme “Random partitions and Bayesian nonparametrics”. S. Favaro is supported by the European Research Council through StG N-BNP 306406. Shui Feng is supported by the Natural Sciences and Engineering Research Council of Canada.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Favaro, S., Feng, S. & Gao, F. Moderate Deviations for Ewens-Pitman Sampling Models. Sankhya A 80, 330–341 (2018). https://doi.org/10.1007/s13171-018-0124-z
Received:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s13171-018-0124-z