Skip to main content
Log in

Moderate Deviations for Ewens-Pitman Sampling Models

  • Published:
Sankhya A Aims and scope Submit manuscript

Abstract

Consider a population of individuals belonging to an infinity number of types, and assume that type proportions follow the Poisson-Dirichlet distribution with parameter α ∈ [0,1) and 𝜃 > −α. Given a sample of size n from the population, two important statistics are the number Kn of different types in the sample, and the number Ml,n of different types with frequency l in the sample. We establish moderate deviation principles for (Kn)n≥ 1 and (Ml,n)n≥ 1. Corresponding rate functions are explicitly identified, which help in revealing a critical scale and in understanding the exact role of the parameters α and 𝜃.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  • Barbour, A. and Gnedin, A. (2009). Small counts in the infinite occupancy scheme. Electron. J. Probab. 14, 365–384.

    Article  MathSciNet  MATH  Google Scholar 

  • Ben-Hamou, A., Boucheron, S. and Ohannessian, M.I. (2016). Concentration inequalities in the infinite urn scheme for occupancy counts and the missing mass, with applications. Bernoulli 23, 249–287.

    Article  MathSciNet  MATH  Google Scholar 

  • Crane, H. (2016). The ubiquitous Ewens sampling formula. Statist. Sci. 31, 1–19.

    Article  MathSciNet  Google Scholar 

  • Dembo, A. and Zeitouni, O. (1998). Large deviations techniques and applications. Springer, New York.

    Book  MATH  Google Scholar 

  • Favaro, S. and Feng, S. (2015). Large deviation principles for the Ewens-Pitman sampling model. Electron. J. Probab. 20, 1–27.

    Article  MathSciNet  MATH  Google Scholar 

  • Favaro, S., Lijoi, A., Mena, R.H. and Prünster, I. (2009). Bayesian nonparametric inference for species variety with a two parameter Poisson-Dirichlet process prior. J. Roy. Statist. Soc. Ser. B 71, 993–1008.

    Article  Google Scholar 

  • Favaro, S., Lijoi, A. and Prünster, I. (2013). Conditional formulae for Gibbs-type exchangeable random partitions. Ann. Appl. Probab. 23, 1721–1754.

    Article  MathSciNet  MATH  Google Scholar 

  • Feng, S. and Hoppe, F.M. (1998). Large deviation principles for some random combinatorial structures in population genetics and Brownian motion. Ann. Appl. Probab. 8, 975–994.

    Article  MathSciNet  MATH  Google Scholar 

  • Gnedin, A., Hansen, B. and Pitman, J. (2007). Notes on the occupancy problem with infinitely many boxes: general asymptotic and power laws. Probab. Surv 4, 146–171.

    Article  MathSciNet  MATH  Google Scholar 

  • Goncharov, V.L. (1944). Some facts from combinatorics. Izvestia Akad. Nauk. SSSR, Ser. Mat. 8, 3–48.

    Google Scholar 

  • Hansen, J.C. (1990). A functional central limit theorem for the Ewens sampling formula. J. Appl. Probab 27, 28–43.

    Article  MathSciNet  MATH  Google Scholar 

  • Hwang, H. and Janson, S. (2008). Local limit theorems for finite and infinite urn models. Ann. Probab 36, 992–1022.

    Article  MathSciNet  MATH  Google Scholar 

  • Karlin, S. (1967). Central limit theorems for certain infinite urn schemes. J. Math. and Mech. 17, 373–401.

    MathSciNet  MATH  Google Scholar 

  • Kingman, J.F.C. (1975). Random discrete distributions. J. Roy. Statist. Soc. Ser. B 37, 1–22.

    MathSciNet  MATH  Google Scholar 

  • Korwar, R.M. and Hollander, M. (1973). Contributions to the theory of Dirichlet processes. Ann. Probab. 1, 705–711.

    Article  MathSciNet  MATH  Google Scholar 

  • Lijoi, A., Prünster, I. and Walker, S.G. (2008). Bayesian nonparametric estimators derived from conditional Gibbs structures. Ann. Appl. Probab. 18, 1519–1547.

    Article  MathSciNet  MATH  Google Scholar 

  • Perman, M., Pitman, J. and Yor, M. (1992). Size-biased sampling of Poisson point processes and excursions. Probab. Theory Related Fields 92, 21–39.

    Article  MathSciNet  MATH  Google Scholar 

  • Pitman, J. (1995). Exchangeable and partially exchangeable random partitions. Probab. Theory Related Fields 102, 145–158.

    Article  MathSciNet  MATH  Google Scholar 

  • Pitman, J. (1996). Some developments of the Blackwell-MacQueen urn scheme. In Statistics, probability and game theory. (T. S. Ferguson, L. S. Shapley and J. B. MacQueen, eds.). Institute of Mathematical Statistics, Hayward, pp. 245–267.

    Chapter  Google Scholar 

  • Pitman, J. and Yor, M. (1997). The two parameter Poisson-Dirichlet distribution derived from a stable subordinator. Ann. Probab. 25, 855–900.

    Article  MathSciNet  MATH  Google Scholar 

  • Pitman, J. (2006). Combinatorial stochastic processes. Ecole d’eté de probabilités de Saint-Flour XXXII. Lecture notes in mathematics, N 1875. Springer, New York.

    MATH  Google Scholar 

Download references

Acknowledgements

The authors are grateful to two anonymous Referees for valuable remarks. The authors acknowledge the Banff International Research Station (BIRS), Canada, where this project has been completed during the Research in Team Programme “Random partitions and Bayesian nonparametrics”. S. Favaro is supported by the European Research Council through StG N-BNP 306406. Shui Feng is supported by the Natural Sciences and Engineering Research Council of Canada.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Stefano Favaro.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Favaro, S., Feng, S. & Gao, F. Moderate Deviations for Ewens-Pitman Sampling Models. Sankhya A 80, 330–341 (2018). https://doi.org/10.1007/s13171-018-0124-z

Download citation

  • Received:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s13171-018-0124-z

Keywords and phrases.

AMS (2000) subject classification.

Navigation