The role of de novo mutations in adult-onset neurodegenerative disorders
The genetic underpinnings of the most common adult-onset neurodegenerative disorders (AOND) are complex in majority of the cases. In some families, however, the disease can be inherited in a Mendelian fashion as an autosomal-dominant trait. Next to that, patients carrying mutations in the same disease genes have been reported despite a negative family history. Although challenging to demonstrate due to the late onset of the disease in most cases, the occurrence of de novo mutations can explain this sporadic presentation, as demonstrated for severe neurodevelopmental disorders. Exome or genome sequencing of patient–parent trios allows a hypothesis-free study of the role of de novo mutations in AOND and the discovery of novel disease genes. Another hypothesis that may explain a proportion of sporadic AOND cases is the occurrence of a de novo mutation after the fertilization of the oocyte (post-zygotic mutation) or even as a late-somatic mutation, restricted to the brain. Such somatic mutation hypothesis, that can be tested with the use of novel sequencing technologies, is fully compatible with the seeding and spreading mechanisms of the pathological proteins identified in most of these disorders. We review here the current knowledge and future perspectives on de novo mutations in known and novel candidate genes identified in the most common AONDs such as Alzheimer’s disease, Parkinson’s disease, the frontotemporal lobar degeneration spectrum and Prion disorders. Also, we review the first lessons learned from recent genomic studies of control and diseased brains and the challenges which remain to be addressed.
KeywordsAlzheimer Parkinson Frontotemporal dementia Somatic Mutation Mosaicism De novo
The etiology of most of the adult-onset neurodegenerative disorders (AOND) is considered multifactorial, including genetic and environmental factors. In certain proportions of patients, the disease can, however, be inherited as a Mendelian trait, i.e., monogenic forms (Box 1). An autosomal-dominant pattern of inheritance is the most frequently encountered, so that family history is often positive for the same disorder. Such monogenic forms may be associated with extreme phenotypes and early ages at onset, but this is not always the case. The existence of patients with an extreme/early onset of AOND and a negative family history indicates that this disease is not always transmitted in an autosomal-dominant fashion. Autosomal recessive inheritance explaining the disease in some of these patients has been described, for example, in patients with Parkinson’s disease , but in some patients, causal mutations were observed in autosomal-dominant genes known to cause AOND [62, 83, 116, 168]. The primary hypothesis is that these mutations appeared in the germline of these probands as de novo mutations (DNMs). To prove this, however, analysis of parental DNAs is required. As the disease onset even in these extreme cases occurs relatively late in life, one of the main practical challenges is access to parental biological samples .
Beyond germline mutations, there is now clear evidence that the human genome is mutable at any step of the development and during the entire life (Fig. 1b). In AOND, a putative role for somatic DNMs is currently being assessed [105, 112, 152]. The mechanisms of seeding and spreading of pathologic misfolded proteins are shared by multiple AONDs. It is thought to be a key mechanism explaining the irreversible progression of neurodegeneration throughout the brain . In theory, a small amount of pathologic seeds synthesized by a “colony” of neurons carrying a given somatic variant could be a source of spreading of a pathological protein throughout the brain. The hypothesis that a putatively causal DNM could have happened after the fertilization of the egg (post-zygotic mutation) or even later during the development (late-somatic mutation) has been evoked quite early in the recent history of human genetics . However, it remained difficult to assess until recently, because of technological limitations. Novel sequencing technologies now enable researchers to address this question accurately.
In this review, we report and discuss the existing evidence of de novo germline mutations identified in the most common AONDs as well as the increasing interest in the search for post-zygotic variants as candidate causal mechanisms in some patients with a sporadic presentation.
The later a Mendelian disorder manifests in life, the more challenging it is to provide evidence of a de novo occurrence of causal mutations, because of reduced availability of parental samples. However, the late onset also has another effect: a significant difference between AOND and severe early-onset neurodevelopmental disorders—which have been shown to be largely caused by DNMs—is that AOND causing mutations do not affect the ability of the carriers to have children. Hence, pathogenic DNMs can be transmitted to the offspring. As with all genetic variants, mutations segregating in families arose de novo at a given date in a given individual. Hence, founder effects can be identified, and, eventually, it may be possible to date the original (de novo) mutation. A few of the recurrent mutations causing autosomal-dominant Alzheimer’s disease (AD) have been subjected to the study of a putative founder effect. The most famous one is certainly the Colombian PSEN1 p.E280A mutation. Identity-by-descent analysis of the genomic sequence of 102 individuals originating from Antioquia provided the estimation of the DNM occurrence to 15 generations ago, back in the early 16th century . Another well-known example is the Parkinson’s disease-associated p.G2019S LRRK2 mutation, which is known to be present on different haplotypes suggesting different founders. One of them was estimated to have occurred 159 generations ago in a Berber founder . The identification of recurrent mutations in autosomal-dominant genes in the absence of a founder effect or evidence for multiple founders points to the regular occurrence of DNMs in these genes. For example, the PRNP p.P102L and the p.D178N mutations have been reported in multiple pedigrees on different haplotypes and indeed recently DNMs have been identified at these positions [81, 182] [8, 41]. Likewise, the recurrent SNCA p.A53T mutation has been shown to have occurred on different haplotypes as well as genuine de novo events [77, 134, 136]. In addition to point mutations, other genetic variations such as copy number variations (CNVs, genomic deletions or duplications) may also occur de novo. The existence of shared breakpoints mapping to short tandem repeats on chromosome 21 among different families carrying diverse sizes of APP duplications indicates that multiple recurrent de novo duplications can cause monogenic forms of AD . We can find similar evidence in synucleinopathies with SNCA copy gains (duplications and triplications) [147, 186].
Methods and strategies for the detection of de novo mutations in AOND
Main techniques for the detection of de novo mutations: pros and cons
Restricted to a few genes
Lack of sensitivity for mutations present in < 20% of cells
Targeted NGS (~ 100×)
Restricted to selected genes
Can be analyzed affordably and at high throughput
Restricted to selected genes, but able to sequence many samples affordably
Detection of mutations possible if present in > 10% cells
Standard WES (50–100×)
All coding regions, hypothesis free
Detection of mutations possible if present in > 10% cells
Standard WGS (15–30×)
All coding and non-coding regions, hypothesis free
Better sensitivity for structural variations
Detection of mutations possible if present in > 20% cells
Not much added value compared to standard depth NGS
Cost increasing with the number of genomic regions sequenced
Not affordable for WGS
The use of WGS or WES with a trio study design allows the detection of DNMs in a hypothesis-free way. DNMs can be detected in any gene, opening the way to the discovery of novel causal genes. Every individual genome contains around 4.1–5.0 million SNVs or indels as compared to the reference genome, of them, 20,000 on average map to the exons and can hence be detected by WES . Of these 20,000 exonic or splice site variants, around 1000 are considered rare variants as they occur in less than 1% of the normal population. Sequencing a single exome or genome then faces the need to filter and prioritize these rare variants with the hypothesis that one of them could be causative. In sporadic diseases where the hypothesis that the cause could be a DNM, the bioinformatics subtraction of all variants identified in a proband that are also present in the parents (inherited variants) unveils the proband-specific variants, i.e., the DNMs . On average, 40–80 DNMs can be identified per genome, of them 1–2 map to the exons (, Fig. 1a). This reduced number of candidate variants allows for a very effective follow-up in both research and diagnostics. For these reasons, trio-based patient–parent studies are now routinely carried out in medical genetics. Sequencing dozens of patient–parent trios may often turn out to be more powerful than sequencing hundreds of simplex cases of sporadic disease.
In clinical practice, the inclusion of a trio for a WES in the context of an AOND requires that (1) the parents are unaffected (2) there is no further family history in other generations, to reduce the risk of alternative mechanisms such as variants with reduced penetrance, (3) the parents are clinically accessible. This latter point is mandatory, both for checking the absence of disease and for DNA sampling after informed consent. In diseases with an onset after 50 years, it can be basically challenging to recruit unaffected parents with reduced mobility. Hence, only a small subset of patients and unaffected parents can be sampled, with a significant effort and organization being required to recruit each trio. However, one can hypothesize that the development of genomic medicine will be associated with a dramatic increase in the number of individuals undergoing WES or WGS during their lifetime, contributing to the success of future trio studies. Similarly, the development of large nationwide biobanks such as the UK biobank may in the near future also be very helpful for retrieving the genomic data of parents from tomorrow’s patients.
When a DNM occurs after the fertilization of the oocyte, this mutation is considered a post-zygotic DNM. When these post-zygotic DNMs occur early in the embryonic development, they may be present in majority of the individual’s cells, in tissues resulting from all three embryonic layers. Although post-zygotic and somatic mutations both refer to mutations being acquired during the lifespan of an individual , we will here use the term post-zygotic for mutations which have occurred during the early development, leading to their presence in most tissues from two or all three layers, and late-somatic mutations for those having been and being restricted to a single tissue, such as the brain. In addition, post-zygotic mutations may be transmitted to the offspring when present in the germline (germline or gonadal mosaicism) and explain how a germline heterozygous dominant de novo mutation may be present in affected sib pairs and appear to be absent in the parents. A combination of germline and somatic mosaicism (gonosomal mosaicism) can have clinical consequences or not in the carrier [23, 32].
WES and WGS technologies are based on the multiple sequencing of every base by multiple short reads in a paired-end sequencing manner, i.e., the sequencing of both extremities of a DNA fragment. The number of reads per position defines the depth of coverage. Most WES and WGS studies are performed with a depth of coverage allowing a high confidence in the detection of germline homozygous (~ 100% of alternate reads) and heterozygous (~ 50%) variants. When the alternate: reference allelic balance is different from these expected ratios, the existence of a post-zygotic DNM can be suspected . However, technical artifacts are the main source of such altered allelic balances, reducing the accuracy in the detection of post-zygotic variants. For these reasons, specific bioinformatics tools and independent molecular confirmations are required to confirm the presence of a post-zygotic variant instead of a sequencing artifact or a germline DNM. It has been shown that 6.5–7.5% of the DNMs detected in blood by WES actually occurred post-zygotically [1, 96]. The lower the true allelic ratio is, the more difficult it is to identify it reliably. Increasing the sequencing depth and the use of unique molecule identifiers (UMI, allowing the trimming of PCR errors) help to increase the confidence in low fraction post-zygotic or late-somatic mutations (Table 1) . The technical aspects and implications of neuronal post-zygotic mutations on aging have been recently reviewed [151, 177] as well as their putative role in neurodegenerative diseases . The sequencing of DNA isolated from the diseased tissue may help to identify late-somatic DNMs. In AOND, the access to CNS tissue is highly limited in vivo. Studies focusing on late-somatic DNMs must therefore rely on autopsies of patients, including the input of brain banks. For multiple reasons, despite the development of such brain banks in many Western countries, the proportion of patients undergoing full autopsy remains very low, limiting the use of such facilities. In addition, it becomes clear that brain banks should store not only brain tissue but also other tissues, to allow a better characterization of post-zygotic events.
In addition to the detection of putatively causal post-zygotic mutations, the same technologies can also be used to search for (1) “back mutations” that reverse or partially correct a phenotype or (2) second-hit mutations that trigger the disease in the presence of an additional inherited germline mutation. Although examples of both mechanisms have been described in other diseases such as skin diseases [124, 125], cortical dysplasia  or the neurocutaneous disorder neurofibromatosis type 1 , we could not find any example in AOND. It could be that these mechanisms play a more prominent role in rapidly dividing cells. Of note, the existence of an inherited genetic variant protecting against Prion infection seems to be in line with this theoretical hypothesis . In addition, we cannot exclude a two-hit mechanism in certain diseases. Of note, in the FTLD–ALS spectrum, several genes share common mechanisms of RNA metabolism perturbation . One could expect that second-hit post-zygotic mutations in the same pathways may increase such pathogenic processes. This would even be consistent with the hypothesis of a multistep process for the development of ALS . Likewise, in AD, any post-zygotic second mutation that would modify the production, aggregation, toxicity or clearance of the Aβ peptide could in theory influence the disease progression .
De novo mutations in known autosomal-dominant genes
Chromosomes and copy number variations
The major prevalence of Alzheimer’s disease and the presence of amyloid pathology observed very early in the life of patients with Down syndrome  suggested that an extra copy of the chromosome 21 might be sufficient to cause the disease. The identification of duplications of the APP locus (mapping to the chromosome 21) causing autosomal-dominant EOAD (early-onset Alzheimer’s disease, onset before 65 years) without Down syndrome  was of major importance to confirm the amyloid hypothesis stating that the Aβ peptide aggregation, produced following the processing of the APP protein, has a key role in AD pathophysiology. As most trisomy 21 cases occur de novo, it can be considered as the most common cause of DNM causing sporadic AD, although occurring in patients with a genomic disorder. The first de novo duplication of the APP locus, not encompassing the critical Down syndrome region, has been recently reported in a patient with neuropathologically confirmed sporadic EOAD .
The study of CNVs in AD has been performed with genome-wide screens by array CGH or SNP arrays in patients with or without a family history [65, 150]. For example, an array CGH screen performed in patients with EOAD identified a few candidate CNVs, some of which were identified in sporadic EOAD patients. Unfortunately, segregation of these CNVs could not be assessed in parents . More recently, a de novo partial deletion of intron 1 of the BACE2 gene was identified in a patient with sporadic EOAD . It has been hypothesized that this deletion of highly conserved non-coding genic DNA could lead to decreased non-amyloidogenic processing of APP, but the functional consequences remain to be determined. In a WES study of 522 EOAD cases including familial and sporadic cases and 584 controls, a 17q21.31 duplication encompassing the MAPT gene encoding the Tau protein was identified in 4 probands . In one of them, the duplication occurred de novo and the parents were indeed unaffected by any neurodegenerative disease. This observation, together with the segregation of the same duplication in one family with an EOAD clinical presentation and the absence in controls, was a strong argument suggesting causality. Interestingly, although the patients were selected for an accurate diagnosis of EOAD with clinico-biological arguments, this duplication may eventually cause a novel primary tauopathy, closely linked but neuropathologically distinct from AD.
Overall and despite the major role of CNVs in human genetics, only a few examples are known to cause autosomal-dominant AOND. Of them, duplications or triplications of the SNCA gene have been identified in families with Parkinson’s disease or Lewy body dementia. In three unrelated patients with Parkinson’s disease, increased dosage of the SNCA gene was identified as a DNM (Supplementary Table 4) [127, 128], present in 42–75% of oral mucosa cells of these patients, and absent or present at very low percentages in blood, indicating post-zygotic occurrence. However, given the complexity of distinguishing post-zygotic mutations from technical artifacts, it remains important to confirm these findings by independent technological approaches (see also below).
De novo point mutations: an earlier age of onset is associated with an increased DNM detection rate
Genetic analysis of sporadic patients presenting either an early-onset AD or a late onset of AD (the most common form) has been performed since the identification of the first causative genes (e.g., [40, 155, 185] or even before, as a candidate gene , but most of them were negative. The first proven DNM in AD was a PSEN1 pathogenic mutation identified in the blood of a patient with a disease onset at the age of 37 years, reported in the context of a screening of 13 sporadic AD patients with a very early onset (before 51 years) . Later on, two novel cases of PSEN1 DNMs were reported [56, 130] so that more than 10 years after the identification of the three known causal AOND genes APP, PSEN1 and PSEN2, only three pathogenic DNMs had been published (Table 1).
Summary statistics of causal de novo mutations reported in known autosomal-dominant genes of the most frequent adult-onset neurodegenerative disorders
Number of germline DNM
Number of post-zygotic DNM
Age of onset (average, range)
List of mutations and references
Supplementary Table 1
Supplementary Table 1
Supplementary Table 2
(PSP, n = 1; bvFTD, n = 2; EOAD-like: n = 1)
Supplementary Table 2
Supplementary Table 2
Supplementary Table 2
Supplementary Table 2
Supplementary Table 3
Synucleinopathies (Parkinson’s disease)
Supplementary Table 4
29.6 [11–52 ]
Supplementary Tables 1-4
Summary of whole exome or whole genome sequencing studies of trios of the most frequent adult-onset neurodegenerative disorders
Age at onset (mean, range)
No. of trios
Average non-synonymous DNM per proband
Analysis strategies and main results
Identification of novel candidate genes
Rovelet-Lecrux et al. 2015 
Early-onset Alzheimer’s disease
Diagnosis supported by biomarkers or post-mortem examination, no family history of dementia in first and second degree relatives, parents alive with normal cognitive score
WES (Agilent SureSelect Human All Exon, Illumina)
CGH array (n = 14) followed by WES of trios (n = 12). Network enrichment analysis Abeta network): significant enrichment in DNM in the network compared to the rest of the exome
VPS35, MARK4. In silico and in vitro functional evidence
Kun-Rodrigues et al. 2015 
Early-onset Parkinson’s disease
Age at onset < 40 years, typical presentation of PD with negative family history, absence of pathogenic mutations in any of the known PD genes
WES (Nextera Rapid Capture, Illumina)
Protein network analysis (STRING).Follow-up analysis of the most significant genes by searching for a recurrence among > 1200 exomes of PD patients
PTEN, VABP, ASNA1. One additional PTEN variant but mis-segregation. Two additional cases with a VAPB variant without segregation data. No functional analyses
Guo et al. 2018 
Early-onset Parkinson’s disease
Age at onset < 36 years, typical presentation of PD with negative family history, no genetic cause or environmental risk factor, no consanguinity
WES (NimbleGen capture, Illumna)
Selection of 12 candidate genes among the DNM, replication in two case–control datasets (rare non-synonymous variants)
Functional assessment in drosophila
One non-synonymous DNM in NUS1 in a proband, enrichment in rare non-synonymous variants in the case–control analysis
(OR = 11.13; p = 1.01 10−5), defects in climbing ability, reduction in dopaminergic neurons, in the brain dopamine levels in drosophila expressing siRNA against the fly NUS1 orthologue
Chesi et al. 2013 
Amyotrophic lateral sclerosis
Sporadic ALS, no C9ORF72 repeat expansion
32.1 years (28.5–51.4)
WES (Agilent SureSelect Human All Exon, Illumina)
Functional gene annotation (DAVID) enrichment in the chromatin regulators category. Follow-up on the SS18L1 gene and search for recurrence among 62 exomes of familial ALS, identification of a novel missense variant
SS108L1. In vitro functional evidence
Steinberg et al. 2015 
Amyotrophic lateral sclerosis
Sporadic ALS, no history of ALS, even if an ALS associated mutation was found in a known gene
WES (Roche NimbleGen SeqCap EZ Human Exome Library kits, Illumina)
Functional gene annotation meta-analysis with Chesi et al. (DAVID): enrichment in genes related to transcription regulation
Van Doormaal et al. 2017 
Amyotrophic lateral sclerosis
Sporadic ALS, prescreening of C9ORF72 ± SOD1, FUS, and TARDBP
WES (Roche NimbleGen SeqCap EZ Human Exome Library kits, Illumina, n = 61) or WGS (Complete Genomics, n = 21)
Despite the difficulty to assess the de novo occurrence of a pathogenic mutation detected in a Mendelian gene in a patient with a sporadic AOND, we found a total of 49 published examples (Table 3, supplementary Tables 1–4). The vast majority of them was reported in recent years, with 18 out of the 29 DNMs (62%) hitting “old” genes (APP, PSEN1, PSEN2, MAPT, SNCA, PRNP, SOD1) having been reported in the 2010s. This highlights an increasing interest in sporadic AOND genetics, but it also indicates that novel sequencing approaches and the trio-based study design are successfully being used. The highest numbers of proven DNM have been identified in sporadic early-onset ALS, especially in the FUS gene (all with a disease onset before 40 years). ALS is associated with a short life expectancy following the diagnosis and young patients often present with highly penetrant variants. This may have facilitated the prioritization of pathogenic variants as well as the collection of parental samples for demonstrating de novo occurrence. Similarly, all patients with a proven DNM in a known Mendelian AD gene had a disease onset before 50 years (range: 23–47 years). This may lead us to think that DNMs mainly cause early-onset sporadic AOND. This observation, however, is impacted by the fact that it is almost impossible to assess the role of DNMs in late-onset patients, because parental DNA samples will mostly be unavailable. The low mutation detection rates of known AOND causative genes in older sporadic patients do, however, suggest that this is not the only factor. It is well-known that early-onset severe disorders are more often monogenic and de novo in origin, whereas late-onset diseases show more complex inheritance .
Recurrence of DNMs
Among the AOND genes affected by DNMs, PSEN1 (AD), FUS (ALS) and PRNP (Prion disorders) are mostly reported (Table 2). Of the 11 missense PSEN1 DNMs reported, 8 mapped to genomic positions already identified by another or the same causative mutation in families, and two additional ones mapped to a same codon. Three DNMs in FUS were identified as recurrent DNMs: one missense (c.1574C > T, p.Pro525Leu in 8 patients) and two protein-truncating mutations (one nonsense and one frameshift indel, each observed twice as DNMs). Interestingly, four additional FUS DNMs affected genomic positions where the same change had been reported as pathogenic in sporadic cases or families. Although no parental data was available for those cases, it still does indicate that these positions are affected by recurrent DNMs. In addition, two FUS DNMs are also present in the gnomAD database consisting of genetic variations identified in controls free of severe pediatric disease, with a maximum allele count of 5 (out of ~ 138,000 individuals) . One of these was the recurrent nonsense DNM. This suggests that, although cases with a DNM in FUS were all young, the mutations may not be fully penetrant in other genomic contexts. Of the six reported PRNP DNMs, including one possibly post-zygotic mutation , all were reported in additional independent cases or families [34, 41, 63, 81, 154]. This may suggest the existence of hotspots or recurrent mechanisms such as octapeptide repeat expansions in the Prion gene.
Interestingly, no APP or PSEN2 SNV has ever been reported as occurring de novo in a patient with AD. There is no evidence of a higher mutability of the PSEN1 gene as a whole [95, 153]. However, PSEN1 pathogenic variants are more frequent in autosomal-dominant families and PSEN1 mutation carriers exhibit an earlier age at onset on average, compared to the other genes [88, 181]. This may help physicians to get access to parental samples at the time of their child’s diagnosis and introduce a bias in this analysis.
There are multiple examples of neurological disorders caused by repeat expansions. Intra-familial instability of such repeats has been identified in diseases such as Friedreich’s ataxia, DRPLA or Huntington’s disease (for review, see ). Such instability is a particular form of a DNM that can lead to anticipation of the age of onset with a few dramatic examples . In the main AOND, GGGGCC repeat expansions in the C9ORF72 gene are also subject both to intra-familial and somatic instability [53, 164]. However, the ranges of repeat sizes conferring instability that can be translated to a pathogenic expansion in the offspring or in other cells of the body remain to be identified. By analyzing blood and CNS samples of patients with ALS, it has been suggested that such instability is not likely to be of sufficient magnitude to result in a pathogenic expansion in the CNS from a normal C9ORF72 repeat length in blood . Similarly, ATXN2 intermediate repeat expansions have been shown to increase the risk of FTLD–ALS and have been reported as somatically instable [86, 111, 174].
Monogenic causes in sporadic patients: alternative hypotheses
Beyond the possibility of a DNM hitting a gene with an autosomal-dominant pattern of inheritance, several alternative hypotheses remain consistent with a monogenic cause for sporadic AOND patients. Most obviously, several forms of sporadic early-onset Parkinson’s disease (PD) cases are related to autosomal recessive inheritance—with early and possibly clinically distinct clinical presentation—as is the case for a few examples in FTLD–ALS or related, sometimes clinically undistinguishable disorders . In addition, in PD, the identification of an inherited pathogenic mutation in a dominant gene in a sporadic case is not uncommon . Although some of the presentations are clinically distinct including an early onset, reduced penetrance has been identified for a subset of the genes and pathogenic mutations. We could find only one detailed report of a likely DNM in a known autosomal-dominant PD gene  (Supplementary Table 4). Hence, sporadic presentations may be more frequently due to autosomal recessive inheritance or autosomal-dominant inherited mutations with reduced penetrance rather than DNM in PD, aside from complex forms. Although autosomal recessive inheritance is not uncommon for known Parkinson’s disease genes, it remains unclear if this will also be the case for the remainder of causal genes to be discovered. A recently published WES study indicated that this may not be the case . Similarly, the pathogenic C9ORF72 repeat expansion causing FTLD–ALS was reported in 5% of sporadic cases  and may be much more frequently inherited from an asymptomatic parent than occurring de novo . In AD, a few mutations have also been reported as associated with reduced penetrance (e.g., APP p.A713T, PSEN1 p.A79V ) and only one mutation in APP is considered causing AD in a homozygous state .
Of note, in addition to autosomal-dominant variants with reduced penetrance and autosomal recessive inheritance, sporadic occurrence might also be related to di-, oligogenic or a more complex inheritance. Some AOND such as AD and ALS exhibit a high heritability as measured by comparing concordant and discordant monozygotic and dizygotic twin pairs (58–79% depending on the models for AD  and 38–74% in ALS ), whereas this was more modest in Parkinson’s disease (34%) . In FTLD, it has been estimated that in most of the cases, frontotemporal dementia would be a genetic-based disease, even late-onset cases . The inheritance patterns may be more complex in some instances, as suggested by the observation of damaging TBK1 as well as OPTN mutations in a patient with FTLD-TDP . In AD, the high odds ratios of risk factors such as the common APOE4-4 genotype or rare variants in TREM2, SORL1 or ABCA7 in EOAD (for review, see ) suggest that they contribute significantly to the genetic component of EOAD in a complex determinism, including sporadic and familial presentations. Likewise, in Parkinson disease, high odds ratios associated to heterozygous GBA variants suggested oligogenic inheritance in some cases . Although a complex etiology might be relevant for a certain proportion of patients with sporadic AOND, it remains difficult to predict which patients, based on clinical criteria.
Looking for novel genes hit by de novo mutations: whole exome sequencing of trios
The evidence that highly penetrant mutations can arise de novo in known Mendelian genes was a proof of concept allowing the assessment of the de novo paradigm at the scale of the entire exome, looking for novel genes. As for the known autosomal-dominant genes, applying the trio study design requires the access to parental biological samples. To date, six trio studies have been published: one in AD, three in ALS, and two in PD, with a total of 247 trios included (Table 3).
Sporadic early-onset AD: enrichment of DNM in the Aβ network and in vitro characterization
After blood sampling of 14 patients with EOAD (mean age at onset: 50 years) and their unaffected parents and CNV screen by array CGH, WES of the 12 trios with no de novo CNV revealed 12 de novo non-synonymous variants in 6 patients, including one of the above-mentioned PSEN1 DNMs  (Table 3). A significant enrichment of non-synonymous DNMs was found in a network of genes centered on the Aβ peptide. Of particular interest, two novel genes affected by DNMs were further studied: VPS35 and MARK4. In silico and in vitro studies showed a deleterious effect of these DNMs with functional results being totally in line with AD pathophysiology. The VPS35 missense DNM was shown to result in reduced protein function and hence predicted to result in increased Aβ secretion. This mutation had striking different functional consequences than the previously reported Parkinson disease-causing VPS35 variant, even though both affected amino acids are located close to each other in the protein sequence [148, 178, 187]. The MARK4 variant increased the phosphorylation of Tau in a key position for Aβ-mediated toxicity. The trio strategy pointed here two novel genes, not previously related to AD genetics but whose gene products are key players in AD pathophysiology. One can hypothesize that they contributed significantly to the development of the disease in their carriers, but their penetrance cannot be assessed without segregation data in the offspring of the carriers or recurrence in unrelated individuals, currently limiting the impact for genetic counseling.
Two independent studies in ALS reveal one recurrently hit gene and pathway enrichment
In sporadic ALS, three trio studies using WES performed from blood samples were published. A first study reported 47 trios (mean age at onset: 32.1 years) and identified 25 genes hit by non-synonymous DNMs  (Table 3). After functional annotation, a significant enrichment in genes encoding chromatin regulators was found (5/25), including the neuronal chromatin remodeling complex component SS18L1 (also known as CREST). In a replication sample of 62 WES of families with ALS, a novel missense SS18L1 mutation was found in a proband. Both mutations inhibited activity-dependent neurite outgrowth in primary neurons, and CREST protein was associated with FUS in neurons. In an independent screen of SS18L1 in 87 patients with familial ALS, two novel mutations were identified in unrelated probands , although one latter patient also presented a potential pathogenic mutation in the OPTN gene (a gene associated with recessive or dominant FTLD–ALS). Although the role of SS18L1 and other genes hit by DNMs in this first WES study remains to determined, this study design unveiled good candidate genes to be studied further.
The second study enrolled 44 trios for WES (mean age at onset: 46.1 years) including one patient with FTLD–ALS (Table 3) . Importantly, patients carrying pathogenic mutations in known genes were not excluded, as two carried a C9ORF72 repeat expansion, one a SOD1 mutation and one a TARDBP mutation; these mutations were inherited from an asymptomatic parent. Of 54 DNMs identified, 17 non-synonymous or canonical splice site mutations were found in 12 trios (or 15 mutations in 11 trios after excluding a trio carrying a C9ORF72 repeat expansion with two non-synonymous mutations, ). Taking into account both WES trio studies together, one gene, CHRM1, was hit more than once, by a novel missense  and a start loss DNM . While very exciting, the interpretation of the role of CHRM1 DNMs in ALS pathophysiology remains to be assessed by functional analyses. In a similar in silico functional enrichment analysis using the DAVID bioinformatics tool as in the first study, the genes related to transcription regulation were highlighted as enriched in DNMs . These first two trio studies performed on sporadic ALS highlighted the genetic heterogeneity and the need to analyze multiple trios at the same time or in the context of meta-analyses.
Recently, 82 additional sporadic ALS trios were reported (WES, n = 61; WGS, n = 21) . No additional recurrence was identified at the gene level. The interpretation of results was conflicting with previous reports as global analysis of all genes hit by non-synonymous DNMs among the 173 published trios did not reveal any enrichment in the previously identified categories of genes (chromatin regulators, transcription regulation). Instead, an enrichment in the phosphoproteins category was identified, and no increased protein–protein interactions were identified as compared to the known interactions among known genes.
Protein–protein interaction and recurrence of variants in large cohorts of Parkinson disease patients
One WES study was performed on 21 trios of early-onset PD (onset before 40 years)  (Table 3). Twenty DNMs (19 non-synonymous) were identified for which in silico functional gene annotation and protein–protein interaction analyses were performed using the STRING bioinformatics tool. Three genes showed significant interactions with the known PD genes: PTEN, VABP, and ASNA1. Additional rare variants were identified from exome data of more than 1200 cases from the International Parkinson’s Disease Genomics Consortium (IPDGC). Although functional pathways link PTEN to PD pathophysiology, the putative role of the DNMs found in patients with PD remains to be elucidated. In particular, the DNM identified in the replication sample is predicted to result in a loss of function and hence supposed to cause Cowden syndrome. In addition, while this variant occurred de novo, the proband’s father was also affected with PD, showing non-co-segregation. VAPB missense mutations have been previously shown to cause autosomal-dominant ALS. The patient identified in the PD trio study presented an unknown in-frame deletion of one residue and two additional cases from the IPDGC dataset presented rare coding mutations in the same domain as the one reported to carry ALS-causative mutations. Whether or not these mutations have a functional effect and are related to PD pathophysiology remains to be determined.
Very recently, a trio-based WES study reported 39 early-onset PD patients of Han Chinese origin . After prioritization of DNMs based on protein–protein interactions and expression profiles in the brain, 12 genes were selected for case–control sequencing analyses. Among them, NUS1, which showed a splicing DNM in a proband, was enriched in non-synonymous rare variants with an OR of 11.3 and a p value approaching exome-wide significance (1.02 10−5) among 5089 cases and 4423 controls. In addition, a Drosophila model expressing an siRNA against the fly NUS1 orthologue showed decreased climbing ability, a reduction in dopaminergic neurons and a dramatic reduction in the brain dopamine levels. Of note, all coding variants identified in the replication case–control analysis were missense and their inheritance was unknown, but the DNM mutation introduced a splicing defect resulting in the frameshift deletion of 91 bp of exon 3. This could suggest a haploinsufficient mechanism. However, protein-truncating NUS1 DNMs have previously been reported in patients with global developmental delay and seizures , two of them also had a tremor. In addition, a 6q22.1 microdeletion, the critical region of which encompassed this gene, has been identified in patients with early-onset seizures  and a homozygous missense variant resulting in a protein loss of function was reported in two siblings with a congenital glycosylation disorder . Given the very early onset of neurological signs in the NUS1 DNM carrier in the PD study (16 years), the in-depth phenotypic description of patients carrying DNM will be of interest to better characterize the clinical expression of these mutations, in addition to the replication in other WES studies.
WES of trios in AOND: validation warranted
We found six trio studies applied to sporadic AOND (Table 3), all with limited samples sizes: one in AD (14 trios), three in ALS (total of 173 trios), and two in PD (total of 60 trios). Yet, the power of these strategies allowed the identification of at least two novel genes in EOAD genetics with a demonstrated functional effect (VPS35 and MARK4) , one in ALS (SS18L1) , and one recently in PD (NUS1) . Other variants, affecting genes not yet known to be involved in the Aβ network in AD, or the recurrently hit gene CHRM1 in ALS, might also be confirmed as truly involved in further studies.
Functional assessment of the impact of DNMs relies on the previous knowledge on the disease pathophysiology and mutated genes. It was possible to assess the role of VPS35 and MARK4 mutations mostly because the function of their gene products was well-documented and because AD pathophysiological pathways, although diverse, can all be linked to the amyloid cascade hypothesis . Regarding the SS18L1 gene, the known expression in motor neurons was a strong argument to justify its assessment, although the effects of the observed mutations might not be specific to ALS pathophysiology. In PD, although a direct functional link between NUS1 and PD was not known, a use of a clear biological read out in the Drosophila model (dopaminergic impairment) pointed to the relevance of this gene . It is, however, very challenging to perform a systematic functional assessment on all DNMs identified in a WES trio study and therefore in most cases only strong candidates are selected for functional validation.
The argument of recurrence, at the variant or at the gene level, in patients with the same phenotype, is a key argument in human genetics. A good example of this is the observation of recurrent DNMs in the CHRM1 gene in ALS by the combination of the first two studies. However, it is important to assess the a priori probability of a gene to be hit by multiple DNMs by chance, because the DNM rate varies per gene, depending on characteristics such as the gene size and the GC content . The statistical rigor required to demonstrate a significant enrichment of DNMs in a gene may require the analysis of many hundreds to even thousands of trios , which seems unrealistic in AOND. As discussed before, however, most of the AONDs are compatible with parenthood. As for PSEN1, APP, or FUS mutations, for example, one could expect to find pathogenic mutations in novel genes such as VPS35, MARK4, SS18L1, CHRM1 segregating in families if these mutations are highly penetrant. It may therefore be useful to perform replication studies in large cohorts, whether or not the patients present a positive or a negative family history, as performed in PD and ALS studies [37, 58, 84]. However, without sufficient segregation data as in the above-mentioned examples, no conclusion can be drawn.
The recurrence of rare variants at the gene level can also be assessed by association analysis in a case–control setting, as successfully applied for NUS1 rare variants in PD . WES or WGS data are rapidly increasing in terms of sample sizes, allowing for powerful association studies of rare variants. For example, about 5000 late-onset AD cases and 5000 controls have been exome sequenced by the US governmental AD Sequencing project (ADSP, ) and WES of 1779 AD cases and 1273 controls have been included in association studies in France . So far, genes affected by DNMs in trio studies have not been reported to be significantly associated with AD in these association studies. Notably, the major roles of TREM2, SORL1 and ABCA7 rare variants have been highlighted in AD [19, 73, 91, 114, 163]. As expected, an apparent inverse correlation was identified between the effect size at the gene level (measured by the odds ratios, OR) and the cumulative frequency of the variants in controls: the genes with the highest OR (TREM2 > SORL1 > ABCA7) were hit by damaging variants in the inverse order in terms of cumulative frequency . This observation, added to the full penetrance of autosomal-dominant pathogenic variants that are extremely rare in controls, is compatible with the paradigm reported by Manolio et al.  and McCarthy et al.  suggesting such inverse correlation for most of the association signals which can be detected using current methods. If DNMs identified in trio-based studies eventually carry a strong effect, they are likely to be extremely rare, so that association studies might fail in reaching the 2.5 10−6p value threshold required for exome-wide significance. Conversely, their extreme rarity is not sufficient to postulate on penetrance. The trio study design having an individual resolution, it remains possible that mutations in novel genes might be present only in a very limited number of cases in the world, precluding any confirmation by identifying recurrences leading to a significant association signal.
Lacking gene level evidence for most of the genes, WES/WGS trio studies attempted to find enrichment of DNMs at a larger scale by grouping genes into networks or pathways. Although this approach adds evidence to the study model, it does not allow the investigators to draw conclusions about the role of individual genes. Even in cases of a demonstrated functional effect, no genetic counseling can be provided to the carriers’ offspring and to putative unrelated carriers of rare variants in these genes.
Overall, when large series of patients with sporadic presentations of extreme phenotypes of AOND were screened for known Mendelian genes and by WES, a large proportion remained genetically unexplained. The application of WGS may further expand the spectrum of causal DNMs by allowing for a better coverage of the exome, the identification of non-coding DNMs as well as balanced and unbalanced structural variants . In addition, the recent development of more sensitive and specific next-generation sequencing applications has allowed researchers to test an alternative hypothesis: highly penetrant post-zygotic DNMs might explain part of these sporadic presentations.
Somatic mutations in adult-onset neurodegenerative disorders
The hypothesis that a genetic mutation present in only a proportion of the neuronal cells can cause a neurological disease has been formulated quite early in the history of AOND genetics [139, 180]. To date, most studies still focus on the analysis of germline mutations present in all cells by studying DNA isolated from a large proportion of blood cells. However, recent improvements in sequencing technologies have enabled the accurate identification of post-zygotic including late-somatic mutations present in subsets of cells or even in single cells . In different neurodevelopmental disorders, evidence has been provided that post-zygotic or late-somatic mutations can cause disease using a combination of technologies on different tissues [70, 141]. Some of these mutations were detected in blood samples, indicating that they occurred early during development. One can assume that post-zygotic mutations, if detected in multiple tissues or with high allelic ratios in blood, might be present in a significant proportion of brain cells. Hence, the level of causality between germline and post-zygotic functional variants should be comparable. It is much more complex to detect brain-specific somatic mutations. Autopsy of certain AOND cases sometimes reveal widespread neuropathological lesions, which would be more in line with germline causes of disease. A focal onset of disease, as seen in some types of primary progressive aphasia in the FTLD spectrum, on the other hand, suggests a role for late-somatic mutations. Overall, in neurodegenerative disorders eventually affecting a large part of the brain, one could assume that the causative mutations must be present in a high proportion of brain cells (neurons and/or glial cells). However, most of the AOND share mechanisms called seeding and spreading. These features, also referred to as Prion-like properties, are conferred by proteins that can transfer their pathogenic state into wild type, normally folded proteins (seeding) and then spread into the whole brain following neuronal connections (for review, see . This phenomenon has been studied first for the Prion protein itself. However, such properties are now being characterized for Tau, Aβ, TDP-43, α-synuclein, or even the Huntingtin protein even if they are not associated with a spontaneous infectious propensity. Similar to an external focal injection of pathologic proteins in animal models, one can hypothesize that a small proportion of cells carrying a somatic variant resulting in the production of a pathogenic misfolded protein could be the source of a cerebral neurodegenerative disorder. Low-level mosaics should therefore also be considered.
Lessons from control brains and clues for the interpretation of somatic mutations in AOND
Post-mitotic neurons exhibit an unexpectedly high burden of post-zygotic mutations. The use of single-cell genomic approaches including WGS in neurons from non-diseased brains unveiled an unsuspected amount of post-zygotic mutations, including about 1500 somatic SNV per neuron . Of them, some occurred during fetal development [15, 75], but a higher burden was detected in post-mortem adults . Although the number of single neurons that have been sequenced remains limited (dozens or hundreds), the elevated rate of post-zygotic SNV was quite unexpected as neurons are post-mitotic lifelong cells and hence are not subjected to errors during DNA replication, beyond the ones putatively acquired during the divisions of progenitors. Single neuron somatic SNV were mostly associated with transcriptional activity, i.e., neuronal activity, contrary to the variants shared by multiple cells in the brain or tissues with a high replication rate . In addition, neurons may accumulate late-somatic mutations during aging . Whatever the associated mechanisms, some of these events could result in the production of an abnormal/misfolded protein, which could represent a source of seeding and spreading in the brain, causing a neurodegenerative disease.
Bulk brain tissue is a combination of replicating and non-replicating, post-mitotic cells; sequencing genomic DNA isolated from bulk brain tissue does not allow the distinction between the cell types . The identification of very low-level mosaics from bulk brain tissue does not imply that different cell types carry the mutation of interest and that the mutated genes are expressed in the mutated cells, hence leading to the production of an abnormal protein.
Although the access to brain tissue is mandatory to assess the somatic variant hypothesis thoroughly, studying other tissues may be worth of interest. It has been shown that mutations present in more than 5–10% of the brain cells were generally also detected outside the brain, in tissues derived from all three embryonic layers, including the ectoderm, suggesting that these mutations occurred during early phases of embryonic development . Although this still requires replication, this is a strong argument for assessing other tissues, including ectodermal tissues and blood, when cerebral tissue is not available. The fact that some neuronal cells shared more common cellular ancestors with cells from other organs than the brain in one individual was also a surprising but promising finding for deep-sequencing studies performed on other tissues than the brain in AOND. However, every study performed from non-CNS tissue will be facing the non-representativeness of the allelic ratios eventually identified, and the lack of evidence that a putatively causal mutation with a low allelic ratio is really present in the affected neuronal cells. Importantly, recent results of single neuron WGS also raised the question of thresholds: if a putatively pathogenic mutation is found in a brain with an AOND, how many neurons should carry it, among the tens of billions of neurons in the brain , to be significant enough to cause a widespread neurodegenerative disease?
Data from mouse models suggest that seeds from peripheral tissues like blood  or intestine  can spread into the brain and cause a neurodegenerative disorder. The presence of a pathogenic mutation in the brain cells would hence not be mandatory to cause such a disease, although one can assume that a significant amount of pathological seeds should be produced to trigger an AOND.
The study of somatic aneuploidy and CNVs still requires technological improvements. Aneuploidy has been studied in AD brains for decades (for review see [9, 131, 145, 146], mainly thanks to slice-based cytometry and fluorescent in situ hybridization (e.g., [10, 106, 146]). The fraction of neurons containing extra chromosomes has been reported to be higher in AD brains than in controls . Controversial results on aneuploidy rates ranging from 1% or less to more than 50% percent have been reported, as recently reviewed together with other inconsistencies (see [145, 146]). After the introduction of single cell NGS, the fraction of aneuploid neurons has been reevaluated to be from 0 to 3% [30, 80, 103, 172] and the rate of neurons carrying post-zygotic CNVs has been evaluated to several dozen percents. Given the challenging assessment of germline CNVs using NGS in general, the interpretation of CNVs from single cell WGS may also require caution and improved techniques are required as proposed recently . In addition, it has been shown that DNA isolation protocols may significantly influence CNV detection . Similar to CNVs, a high burden of post-zygotic L1 insertions that can disrupt or deregulate the expression of genes has been reported, although there are differences of opinions on the numbers of post-zygotic L1 insertions in single neurons [50, 171]; some results may require technical validation.
The study of brains from AOND cases implies the analysis of patients with advanced disease, which can be associated with secondary DNA damage. Increased aneuploidy rates and/or of DNA content in AD neuronal cells could be related to errors during mitosis of neuronal progenitors. More likely, though, it can be caused by a reentry in the cell cycle as the result of AD pathophysiological processes. This has been suggested by the identification of neurons with 4n DNA content and a positive staining for Cyclin B1 . It is likely that, in the context of advanced neurotoxicity, such observation could basically be a simple consequence of neurodegeneration—nonspecific to a given AOND instead of a causative mechanism . In theory, the study of so-called preclinical AD brains could help tackle this issue. However, a proportion of neurons in preclinical AD brains might already be at a final stage of the pathophysiology. Studies of presymptomatic patients carrying APP, PSEN1, or PSEN2 mutations showed evidence of neuronal damages years before the first clinical signs, similar to what has been observed in animal models [21, 79, 131]. Even if a few studies had access to samples of patients with preclinical or early-stage AD, most research is performed on end-stage AD as it has been the main condition leading to the patient’s death. While many neurons already died, many other neurons have undergone stress and toxicity during years, before the first symptoms appeared. Some of them have accumulated DNA damage, including somatic SNVs . Oxidative stress and microtubule dysfunction in AD neurons are several of the causes leading to secondary damages in the DNA. In FTLD and ALS, it has been shown that secondary DNA damage can participate in the pathophysiological processes in C9ORF72 and FUS mutation carriers [51, 110]. Interpreting genetic results from AOND brains should therefore be done with great caution. Of note, similar caution should be provided for the interpretation of somatic mitochondrial DNA mutations which face the same issue of possible secondary mutations induced by neurodegeneration. In addition, caution is required when interpreting negative findings in sequencing studies performed on brain tissue. Indeed, the mutated cells may also be the more vulnerable ones and hence mutations may be undetectable because of cellular death.
Taken together, somatic mutations in the brain may result from (1) early embryonic events, (2) mutations occurring in neuronal progenitors during neurodevelopment as a result of replication errors, (3) mutations in replicating cells in the brain at any stage of life as a result of replication errors, (4) mutations in post-mitotic neurons as a result of transcriptional activity and (5) as a result of DNA damage in the context of cellular stress. Although advances in genomic and single cell technologies have provided novel information and promising hypotheses, the interpretation of sequencing data obtained from brains with AOND will be an even bigger challenge than the technology itself in the near future.
Somatic mutations in patients with AOND
Given the role of germline duplications of APP and SNCA, respectively, in autosomal-dominant EOAD and Parkinson’s disease, CNV studies have focused on these loci as well as chromosomal abnormalities. There is still debate on whether AD brains are enriched in neurons carrying extra copies of chromosome 21 containing the APP gene. Interestingly, Bushman et al.  recently reported increased copy numbers of the APP locus itself in AD brains. So far, however, this exciting result has not been replicated. Even more recently, the study of nigral dopaminergic neurons—the neurodegeneration of which causes Parkinson’s disease—revealed an average proportion of dopaminergic neurons with gains of SNCA copies in each nigra of 0.78% in Parkinson disease patients versus 0.45% in controls . Such enrichment was not found in non-dopaminergic neurons. Overall, among the 40 patients, 31 (77.5%) had at least one dopaminergic neuron showing a gain of SNCA copies as compared to 10/25 (40%) of the controls. These results suggest that late-somatic copy number gain of SNCA is not a rare event and suggest that a significant enrichment should be required to trigger the disease. The fact that picomolar concentrations of SNCA oligomers can induce disease-related pathways in cells in vitro seems to be in paradox with the latter study . If replicated, these results obtained on nigral neurons would question the hypothesis that low-level mosaics alone would be sufficient to trigger diffuse neurodegenerative disease in vivo. Other brain regions may also be studied as well as other mutations and their putative functional consequences.
In addition to CNVs affecting known disease genes, post-zygotic CNVs may also affect novel Parkinson’s disease genes. In phenotypically discordant monozygotic twin pairs with one of the twins exhibiting Parkinson’s disease, a few post-zygotic novel CNVs have been identified in the affected twins . Further research is needed to confidently link these genes to Parkinson’s disease.
The presence of brain-specific single nucleotide mutations has been assessed in 1988—before the identification of the first causative genes of autosomal-dominant EOAD. In an exploratory study, the sequence encoding the Aβ peptide was analyzed in cDNA isolated from three brains with sporadic AD but no mutation was found . The hypothesis that post-zygotic mutations could explain sporadic AD was reassessed after identification of APP, PSEN1, and PSEN2 germline pathogenic mutations in autosomal-dominant families. The analysis of DNA isolated from bulk brain pieces of 99 patients with sporadic AD revealed a PSEN1 mutation that was eventually confirmed to be present in the germline . This hypothesis was also assessed in Parkinson’s disease and ALS before the era of NGS, with negative results [121, 135]. Recently, WES was performed in hundreds of brains from patients with different types of AOND. While pathogenic mutations were detected in autosomal-dominant genes, parental DNA was not available for testing and the average depth of sequence coverage did not allow for the detection of low-level somatic mutations .
The use of deep-sequencing or blood–brain duo strategies has been applied only recently in sporadic AD. In a first study, a targeted deep-sequencing approach was used to analyze the genomic loci of APP, PSEN1, PSEN2 and MAPT in DNA isolated from the entorhinal cortex of 72 patients with sporadic AD and 58 controls . Custom capture and deep sequencing of the genomic regions of these four genes revealed 107 candidate post-zygotic mutations but only 3 could be confirmed by amplicon-based deep sequencing: two novel MAPT missense mutations of unknown significance in sporadic AD patients (variant allele frequencies of 1.0% and 1.1%) and one known PSEN2 likely benign missense mutation (variant allele frequency of 1.6–5.7%) in a control. Of note, among the 41 patients with an available age at onset, the median age of onset was 78 years (range: 46–92) and among the other 31 other patients, the median age at death was 79 years (range 57–96, the youngest carried a pathogenic PSEN1 germline variant p.H163R), suggesting that majority had a late disease onset. Another recent study focused on more technical aspects in the context of AD, but did not provide results directly relevant to AD itself . In a study including 17 sporadic AD patients, 2 controls, and 2 patients with vascular dementia, WES (mean depth of coverage: 60.8x) was performed on DNA isolated from blood as well as the hippocampus . This strategy did not allow the identification of low-level mosaics and no putatively pathogenic brain-specific mutation was identified. The average age at death was 86.8 years (range 73–94), suggesting that most of them, if not all, presented a late onset of AD.
With the hypothesis that, similar to germline DNM, post-zygotic, including late-somatic mutations causing sporadic AOND may be associated with early-onset forms, we recently performed a targeted deep-sequencing screen of 11 genes in 445 sporadic AD patients (355 blood samples, 100 brain samples), > 80% of which had an early onset . We used single molecule Molecular Inversion Probes (smMIPs) capture followed by deep sequencing and validation with independent ultra-deep sequencing, allowing for very high sensitivity and specificity. We identified nine post-zygotic mutations with allelic ratios ranging from 0.2% to 10.8%. Two of these mutations were predicted to alter the function of SORL1, which is currently considered as a strong risk factor for EOAD. However, no predicted pathogenic post-zygotic mutations in known autosomal-dominant genes could be identified in this large sample.
Even more recently, 102 genes were screened by targeted capture followed by deep sequencing in 173 samples from 54 human brains . Post-zygotic variants were validated by a technology including the use of UMI. Of them, 20 individuals presented with AD and 20 exhibited Parkinson’s disease or dementia with Lewy bodies. Despite the detection of 62 post-zygotic variants, no putatively pathogenic variant was identified.
The preliminary results of the above-mentioned studies do not immediately point to a significant role for post-zygotic mutations in known disease genes sporadic AOND. This may change with improvements in capture as well as sequencing technologies (including the analysis of single cells which is only just starting and is a tremendously promising field) as well as the analysis of many more brain samples, especially from patients with an early onset of disease and the more accurate detection of mosaic structural variations. Taking together all the positive and negative results obtained from control and diseased brains, the assessment of the somatic variant hypothesis in AOND has opened many novel questions. Among them, there is discussion about the minimum amount of pathological seeds, the timing of occurrence, and the regions where these seeds should appear to be sufficient to trigger neurodegenerative diseases. Experiments in animal models may help researchers to answer some of these questions, combined with the application of ultra-sensitive sequencing of multiple brain regions in AOND patients and controls.
Pathogenic DNMs, while not easy to identify in adult-onset diseases, clearly play an important role in sporadic AOND. Trio-based exome sequencing of patient–parent blood samples has pointed to numerous germline DNMs in both known disease genes as well as novel candidate genes. Larger sample sizes and functional follow-up studies are essential to validate the role of these candidate genes in AOND. Access to affected brain tissue and more sensitive and specific sequencing approaches will be crucial to investigate further the role of brain-specific mutations in AOND. Challenges lie ahead not only in the identification of these mutations but equally in the clinical interpretation of mutations that are present in only a proportion of all cells present.
In this review, we focused on the most common AOND, but examples of germline or post-zygotic DNM in known Mendelian genes causing other adult-onset neurological disorders have been reported (e.g., [46, 49, 115, 159]).
The in-depth genomic analysis of sporadic AOND is opening many exciting novel research directions, from the identification and characterization of novel disease genes and non-coding regions to the single cell analysis of somatic mutations as well as the analysis of seeding-spreading mechanisms leading to neurodegenerative disorders.
GN acknowledges Fondation Bettencourt-Schueller, Fondation Philippe Chatrier, Fondation Charles Nicolle, and Association Cerveau Progrès. This work was supported by a VICI grant from The Netherlands Organization for Scientific Research (918-15-667 to JAV).
Compliance with ethical standards
Conflicts of interest
The authors declare that they have no conflict of interest.
- 2.Acuna-Hidalgo R, Sengul H, Steehouwer M, van de Vorst M, Vermeulen SH, Kiemeney L et al (2017) Ultra-sensitive sequencing identifies high prevalence of clonal hematopoiesis-associated mutations throughout adult life. Am J Hum Genet 101:50–64. https://doi.org/10.1016/j.ajhg.2017.05.013 Google Scholar
- 8.Alzualde A, Moreno F, Martinez-Lage P, Ferrer I, Gorostidi A, Otaegui D et al (2010) Somatic mosaicism in a case of apparently sporadic Creutzfeldt-Jakob disease carrying a de novo D178N mutation in the PRNP gene. Am J Med Genet B Neuropsychiatr Genet 153B:1283–1291. https://doi.org/10.1002/ajmg.b.31099 Google Scholar
- 19.Bellenguez C, Charbonnier C, Grenier-Boley B, Quenez O, Le Guennec K, Nicolas G et al (2017) Contribution to Alzheimer’s disease risk of rare variants in TREM2, SORL1, and ABCA7 in 1779 cases and 1273 controls. Neurobiol Aging 59:220.e221–220.e229. https://doi.org/10.1016/j.neurobiolaging.2017.07.001 Google Scholar
- 22.Bianchin MM, Capella HM, Chaves DL, Steindel M, Grisard EC, Ganev GG et al (2004) Nasu-Hakola disease (polycystic lipomembranous osteodysplasia with sclerosing leukoencephalopathy–PLOSL): a dementia associated with bone cystic lesions. From clinical to genetic and molecular aspects. Cell Mol Neurobiol 24:1–24Google Scholar
- 24.Bis JC, Jian X, Kunkle BW, Chen Y, Hamilton-Nelson KL, Bush WS et al (2018) Whole exome sequencing study identifies novel rare and common Alzheimer’s-associated variants involved in immune response and transcriptional regulation. Molecular Psychiatry. https://doi.org/10.1038/s41380-018-0112-7 Google Scholar
- 25.Boeve BF, Tremont-Lukats IW, Waclawik AJ, Murrell JR, Hermann B, Jack CR Jr et al (2005) Longitudinal characterization of two siblings with frontotemporal dementia and parkinsonism linked to chromosome 17 associated with the S305N tau mutation. Brain 128:752–772. https://doi.org/10.1093/brain/awh356 Google Scholar
- 31.Calvo A, Moglia C, Canosa A, Brunetti M, Barberis M, Traynor BJ et al (2014) De novo nonsense mutation of the FUS gene in an apparently familial amyotrophic lateral sclerosis case. Neurobiol Aging 35(1513):e1511–e1517. https://doi.org/10.1016/j.neurobiolaging.2013.12.028 Google Scholar
- 38.Chio A, Calvo A, Moglia C, Ossola I, Brunetti M, Sbaiz L et al (2011) A de novo missense mutation of the FUS gene in a “true” sporadic ALS case. Neurobiol Aging 32(553):e523–e556. https://doi.org/10.1016/j.neurobiolaging.2010.05.016 Google Scholar
- 40.Cruts M, van Duijn CM, Backhovens H, Van den Broeck M, Wehnert A, Serneels S et al (1998) Estimation of the genetic contribution of presenilin-1 and -2 mutations in a population-based study of presenile Alzheimer disease. Hum Mol Genet 7:43–51Google Scholar
- 48.Dumanchin C, Brice A, Campion D, Hannequin D, Martin C, Moreau V et al (1998) De novo presenilin 1 mutations are rare in clinically sporadic, early onset Alzheimer’s disease cases. French Alzheimer’s Disease Study Group. J Med Genet 35:672–673Google Scholar
- 53.Gijselinck I, Van Mossevelde S, van der Zee J, Sieben A, Engelborghs S, De Bleecker J et al (2016) The C9orf72 repeat size correlates with onset age of disease, DNA methylation and transcriptional downregulation of the promoter. Mol Psychiatry 21:1112–1124. https://doi.org/10.1038/mp.2015.159 Google Scholar
- 55.Glenner GG, Wong CW (1984) Alzheimer’s disease and Down’s syndrome: sharing of a unique cerebrovascular amyloid fibril protein. Biochem Biophys Res Commun 122:1131–1135Google Scholar
- 58.Guo JF, Zhang L, Li K, Mei JP, Xue J, Chen J et al (2018) Coding mutations in NUS1 contribute to Parkinson’s disease. In: Proceedings of the National Academy of Sciences of the United States of America. https://doi.org/10.1073/pnas.1809969115
- 59.Guyant-Marechal I, Berger E, Laquerriere A, Rovelet-Lecrux A, Viennet G, Frebourg T (2008) Intrafamilial diversity of phenotype associated with app duplication. Neurology 71:1925–1926. https://doi.org/10.1212/01.wnl.0000339400.64213.56 Google Scholar
- 61.Hedjoudje A, Nicolas G, Goldenberg A, Vanhulle C, Dumant-Forrest C, Deverriere G et al (2018) Morphological features in juvenile Huntington disease associated with cerebellar atrophy—magnetic resonance imaging morphometric analysis. Pediatr Radiol. https://doi.org/10.1007/s00247-018-4167-z Google Scholar
- 67.Hubers A, Just W, Rosenbohm A, Muller K, Marroquin N, Goebel I et al (2015) De novo FUS mutations are the most frequent genetic cause in early-onset German ALS patients. Neurobiol Aging 36:3111–3116. https://doi.org/10.1016/j.neurobiolaging.2015.08.005 Google Scholar
- 78.Kim YE, Oh KW, Kwon MJ, Choi WJ, Oh SI, Ki CS et al (2015) De novo FUS mutations in 2 Korean patients with sporadic amyotrophic lateral sclerosis. Neurobiol Aging 36(1604):e1609–e1617. https://doi.org/10.1016/j.neurobiolaging.2014.10.002 Google Scholar
- 86.Laffita-Mesa JM, Velazquez-Perez LC, Santos Falcon N, Cruz-Marino T, Gonzalez Zaldivar Y, Vazquez Mojena Y et al (2012) Unexpanded and intermediate CAG polymorphisms at the SCA2 locus (ATXN2) in the Cuban population: evidence about the origin of expanded SCA2 alleles. Eur J Neurol 20:41–49. https://doi.org/10.1038/ejhg.2011.154 Google Scholar
- 99.Lou F, Luo X, Li M, Ren Y, He Z (2017) Very early-onset sporadic Alzheimer’s disease with a de novo mutation in the PSEN1 gene. Neurobiol Aging 53:191–193. https://doi.org/10.1016/j.neurobiolaging.2016.12.026 Google Scholar
- 109.Nacheva E, Mokretar K, Soenmez A, Pittman AM, Grace C, Valli R et al (2017) DNA isolation protocol effects on nuclear DNA analysis by microarrays, droplet digital PCR, and whole genome sequencing, and on mitochondrial DNA copy number estimation. PLoS One 12:e0180467. https://doi.org/10.1371/journal.pone.0180467 Google Scholar
- 110.Naumann M, Pal A, Goswami A, Lojewski X, Japtok J, Vehlow A, Naujock M, Gunther R, Jin M, Stanslowsky N et al (2018) Impaired DNA damage response signaling by FUS-NLS mutations leads to neurodegeneration and FUS aggregate formation. Nat Commun 9:335. https://doi.org/10.1038/s41467-017-02299-1 Google Scholar
- 118.Onyike CU, Diehl-Schmid J (2013) The epidemiology of frontotemporal dementia. Int Rev Psychiatry 25(2):130–137Google Scholar
- 130.Portet F, Dauvilliers Y, Campion D, Raux G, Hauw JJ, Lyon-Caen O et al (2003) Very early onset AD with a de novo mutation in the presenilin 1 gene (Met 233 Leu). Neurology 61:1136–1137Google Scholar
- 131.Potter H, Granic A, Caneus J (2016) Role of trisomy 21 mosaicism in sporadic and familial Alzheimer’s disease. Curr Alzheimer Res 13:7–17Google Scholar
- 132.Pottier C, Bieniek KF, Finch N, van de Vorst M, Baker M, Perkersen R et al (2015) Whole-genome sequencing reveals important role for TBK1 and OPTN mutations in frontotemporal lobar degeneration without motor neuron disease. Acta Neuropathol 130:77–92. https://doi.org/10.1007/s00401-015-1436-x Google Scholar
- 139.Reznik-Wolf H, Machado J, Haroutunian V, DeMarco L, Walter GF, Goldman B et al (1998) Somatic mutation analysis of the APP and Presenilin 1 and 2 genes in Alzheimer’s disease brains. J Neurogenet 12:55–65Google Scholar
- 151.Sala Frigerio C, Fiers M, Voet T, De Strooper B (2017) Identification of low allele frequency mosaic mutations in Alzheimer disease. Genom Mosaicism Neurons Cell Types: 361–378Google Scholar
- 155.Schellenberg GD, Anderson L, O’Dahl S, Wisjman EM, Sadovnick AD, Ball MJ et al (1991) APP717, APP693, and PRIP gene mutations are rare in Alzheimer disease. Am J Hum Genet 49:511–517Google Scholar
- 159.Shatunov A, Fridman EA, Pagan FI, Leib J, Singleton A, Hallett M et al (2004) Small de novo duplication in the repeat region of the TATA-box-binding protein gene manifest with a phenotype similar to variant Creutzfeldt-Jakob disease. Clin Genet 66:496–501. https://doi.org/10.1111/j.1399-0004.2004.00356.x Google Scholar
- 160.Sherrington R, Froelich S, Sorbi S, Campion D, Chi H, Rogaeva EA et al (1996) Alzheimer’s disease associated with mutations in presenilin 2 is rare and variably penetrant. Hum Mol Genet 5:985–988Google Scholar
- 164.Suh E, Lee EB, Neal D, Wood EM, Toledo JB, Rennert L et al (2015) Semi-automated quantification of C9orf72 expansion size reveals inverse correlation between hexanucleotide repeat number and disease duration in frontotemporal degeneration. Acta Neuropathol 130:363–372. https://doi.org/10.1007/s00401-015-1445-9 Google Scholar
- 169.Teyssou E, Vandenberghe N, Moigneu C, Boillee S, Couratier P, Meininger V et al (2014) Genetic analysis of SS18L1 in French amyotrophic lateral sclerosis. Neurobiol Aging 35:1231 e1212–1213 e1219. https://doi.org/10.1016/j.neurobiolaging.2013.11.023 Google Scholar
- 174.Van Langenhove T, van der Zee J, Engelborghs S, Vandenberghe R, Santens P, Van den Broeck M et al (2012) Ataxin-2 polyQ expansions in FTLD-ALS spectrum disorders in Flanders-Belgian cohorts. Neurobiol Aging 33(1004):e1017–e1020. https://doi.org/10.1016/j.neurobiolaging.2011.09.025 Google Scholar
- 180.Vitek MP, Rasool CG, de Sauvage F, Vitek SM, Bartus RT, Beer B et al (1988) Absence of mutation in the beta-amyloid cDNAs cloned from the brains of three patients with sporadic Alzheimer’s disease. Brain Res 464:121–131Google Scholar
- 181.Wallon D, Rousseau S, Rovelet-Lecrux A, Quillard-Muraine M, Guyant-Marechal L, Martinaud O et al (2012) The French series of autosomal dominant early onset Alzheimer’s disease cases: mutation spectrum and cerebrospinal fluid biomarkers. J Alzheimer’s Dis 30:847–856. https://doi.org/10.3233/JAD-2012-120172 Google Scholar
- 184.Wirdefeldt K, Gatz M, Reynolds CA, Prescott CA, Pedersen NL (2011) Heritability of Parkinson disease in Swedish twins: a longitudinal study. Neurobiol Aging 32(1923):e1921–e1928. https://doi.org/10.1016/j.neurobiolaging.2011.02.017 Google Scholar
- 185.Yoshioka K, Miki T, Katsuya T, Ogihara T, Sakaki Y (1991) The 717Val—Ile substitution in amyloid precursor protein is associated with familial Alzheimer’s disease regardless of ethnic groups. Biochem Biophys Res Commun 178:1141–1146Google Scholar
- 188.Zou ZY, Cui LY, Sun Q, Li XG, Liu MS, Xu Y et al (2013) De novo FUS gene mutations are associated with juvenile-onset sporadic amyotrophic lateral sclerosis in China. Neurobiol Aging 34(1312):e1311–e1318. https://doi.org/10.1016/j.neurobiolaging.2012.09.005 Google Scholar
Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.