Determining the incidence of rare diseases

  • Matthew N. BainbridgeEmail author
Original Investigation


Extremely rare diseases are increasingly recognized due to wide-spread, inexpensive genomic sequencing. Understanding the incidence of rare disease is important for appreciating its health impact and allocating recourses for research. However, estimating incidence of rare disease is challenging because the individual contributory alleles are, themselves, extremely rare. We propose a new method to determine incidence of rare, severe, recessive disease in non-consanguineous populations that use known allele frequencies, estimate the combined allele frequency of observed alleles and estimate the number of causative alleles that are thus far unobserved in a disease cohort. Experiments on simulated and real data show that this approach is a feasible method to estimate the incidence of rare disease in European populations but due to several limitations in our ability to assess the full spectrum of pathogenic mutations serves as a useful tool to provide a lower threshold on disease incidence.


Rare disease Genetics Incidence Simulation 



The author would like to thank Drs. Mario Cleves, Charlotte Hobbs, Michelle Clark, Svasti Haricharan, David Dimmock and Sara Raskin for commenting on the manuscript. They would also like to thank the TESS foundation, A 501(c)(3) nonprofit corporation, ( for providing cohort data. This work was funded in part by gifts from the Liguori Family, John Motter and Effie Simanikas, Ernest and Evelyn Rady, and Rady Children’s Hospital San Diego.

Compliance with ethical standards

Conflict of interest

MNB is a member of the TESS scientific advisory board.


  1. Browning SR, Thompson EA (2012) Detecting rare variant associations by identity-by-descent mapping in case-control studies. Genetics 190:1521–1531. CrossRefPubMedPubMedCentralGoogle Scholar
  2. Cannizzo S, Lorenzoni V, Palla I et al (2018) Rare diseases under different levels of economic analysis: current activities, challenges and perspectives. RMD Open. CrossRefPubMedPubMedCentralGoogle Scholar
  3. Cohen AC Jr (1960) Estimating the parameters of a modified poisson distribution. J Am Stat Assoc 55:139–143. CrossRefGoogle Scholar
  4. Grier J, Hirano M, Karaa A et al (2018) Diagnostic odyssey of patients with mitochondrial disease: results of a survey. Neurol Genet 4:e230. CrossRefPubMedPubMedCentralGoogle Scholar
  5. Hardy GH (2003) Mendelian proportions in a mixed population. 1908. Yale J Biol Med 76:79–80PubMedPubMedCentralGoogle Scholar
  6. Karczewski KJ, Weisburd B, Thomas B et al (2017) The ExAC browser: displaying reference data information from over 60 000 exomes. Nucleic Acids Res 45:D840–D845. CrossRefPubMedGoogle Scholar
  7. Kobayashi Y, Yang S, Nykamp K et al (2017) Pathogenic variant burden in the ExAC database: an empirical approach to evaluating population data for clinical variant interpretation. Genome Med. CrossRefPubMedPubMedCentralGoogle Scholar
  8. Rode J (2005) Rare diseases: understanding this public health priority. EURORDIS, Paris, France.
  9. Schrodi SJ, DeBarber A, He M et al (2015) Prevalence estimation for monogenic autosomal recessive diseases using population-based genetic data. Hum Genet 134:659–669. CrossRefPubMedGoogle Scholar
  10. Updated Study Analyzes Use and Cost of Orphan Drugs (2018) In: NORD Natl Organ Rare Disord. Accessed 19 Nov 2019
  11. Valdez R, Ouyang L, Bolen J (2016) Public health and rare diseases: oxymoron no more. Prev Chronic Dis. CrossRefPubMedPubMedCentralGoogle Scholar
  12. Weinberg W (1909) Über Vererbungsgesetze beim Menschen. Z Für Indukt Abstamm- Vererbungslehre 2:276–330. CrossRefGoogle Scholar

Copyright information

© Springer-Verlag GmbH Germany, part of Springer Nature 2020

Authors and Affiliations

  1. 1.Rady Children’s Institute for Genomic MedicineSan DiegoUSA

Personalised recommendations