Patterns of variation of mutation rates of mitochondrial and nuclear genes of gastropods

Abstract

Background

Although mitochondrial DNA (mtDNA) of many animals tends to mutate at higher rates than nuclear DNA (nuDNA), a recent survey of mutation rates of various animal groups found that the gastropod family Bradybaenidae (suborder Helicina) shows a nearly 40-fold difference in mutation rates of mtDNA (\(\mu\)m) and nuDNA (\(\mu\)n), while other gastropod taxa exhibit only two to five-fold differences. To determine if Bradybaenidae represents an outlier within Gastropoda, I compared estimated values of \(\mu\)m/\(\mu\)n of additional gastropod groups. In particular, I reconstructed mtDNA and nuDNA gene trees of 121 datasets that include members of various clades contained within the gastropod subclasses Caenogastropoda, Heterobranchia, Patellogastropoda, and Vetigastropoda and then used total branch length estimates of these gene trees to infer \(\mu\)m/\(\mu\)n.

Results

Estimated values of \(\mu\)m/\(\mu\)n range from 1.4 to 91.9. Datasets that exhibit relatively large values of \(\mu\)m/\(\mu\)n (i.e., > 20), however, show relatively lower estimates of \(\mu\)n (and not elevated \(\mu\)m) in comparison to groups with lower values. These datasets also tend to contain sequences of recently diverged species. In addition, datasets with low levels of phylogenetic breadth (i.e., contain members of single genera or families) exhibit higher values of \(\mu\)m/\(\mu\)n than those with high levels (i.e., those that contain representatives of single superfamilies or higher taxonomic ranks).

Conclusions

Gastropods exhibit considerable variation in estimates of \(\mu\)m/\(\mu\)n. Large values of \(\mu\)m/\(\mu\)n that have been calculated for Bradybaenidae and other gastropod taxa may be overestimated due to possible sampling artifacts or processes that depress estimates of total molecular divergence of nuDNA in groups that recently diversified.

Background

Information concerning patterns of variation of mutation rates of nuclear and organellar genomes is important for understanding the factors that influence these rates and how they impact interactions among genomes [1,2,3,4]. For most animal taxa, mutation rates (\(\mu\)m) of mitochondrial DNA sequences (mtDNA) are higher than mutation rates (\(\mu\)n) of nuclear DNA sequences (nuDNA) [5,6,7]. Nonetheless, invertebrates tend to show smaller differences between \(\mu\)m and \(\mu\)n (~ 2 to 10-fold difference) than vertebrates (~ 10 to 25-fold difference) [3, 4, 7,8,9,10,11]. A recent comparison of estimates of the ratio \(\mu\)m/\(\mu\)n of 122 animal taxa (one sponge, 78 vertebrates, 33 arthropods, and 10 molluscs) found that one mollusc group, members of the gastropod family Bradybaenidae (subclass Heterobranchia, order Stylommatophora, suborder Helicina), shows a nearly 40-fold difference in \(\mu\)m and \(\mu\)n, whereas other molluscs, including representatives of six other gastropod groups, exhibited only two to five-fold differences [4]. Is Bradybaenidae an exception within Mollusca or do other gastropods (e.g., other members of the species rich suborder Helicina) have exceptionally large values of \(\mu\)m/\(\mu\)n? Furthermore, what potential factors contribute to the large values of \(\mu\)m/\(\mu\)n that Bradybaenidae and possibly other gastropod groups exhibit?

To address the above questions, I estimated and compared \(\mu\)m/\(\mu\)n of additional gastropod groups, including members of four of the six subclasses of Gastropoda (Caenogastropoda, Heterobranchia, Patellogastropoda, and Vetigastropoda) and additional groups from the suborder Helicina. I aimed to determine if Bradybaenidae is an outlier within Gastropoda or if other gastropod taxa also exhibit such large values of \(\mu\)m/\(\mu\)n. I also evaluated whether variation in \(\mu\)m/\(\mu\)n among gastropod taxa reflects relative differences in \(\mu\)m or \(\mu\)n among groups exhibiting different values of \(\mu\)m/\(\mu\)n. In addition, I sought to determine if large values of \(\mu\)m/\(\mu\)n might be overestimated due to possible inclusion of recently diverged species by comparing ratios of \(\mu\)m/\(\mu\)n from datasets that include different levels of taxonomic breadth (i.e., those that include members of genera, families, superfamilies, or higher taxonomic categories). I largely followed the approach described in [4] of gathering published mtDNA and nuDNA gene sequences from corresponding sets of individuals/species, reconstructing gene trees, and then inferring the relationship between \(\mu\)m and \(\mu\)n by calculating the ratio of the total branch lengths of mtDNA and nuDNA gene trees. I estimated total branch lengths at third codon positions for coding regions (as implemented in [4]) and all positions of intron regions as a proxy for neutral divergence.

Results

Datasets

Searching for the term “Gastropoda [Organism]” in the NCBI PopSets database yielded 3692 datasets. More than two-thirds of the PopSets (N = 2593) contained mtDNA sequences and more than half of these (N = 1551) included COI sequences (Table 1). Most nuDNA sequences included sequences of various regions of the rRNA transcription unit (i.e., 5.8S, 18S, 28S, and internal transcribed spacers 1 and 2 (ITS1 and ITS2)) (N = 639) (Table 1). Otherwise, sequences of histone (mostly histone H3 (H3)) comprised the next most abundant nuDNA gene region included in these PopSets (Table 1).

Table 1 Identity of mtDNA and nuDNA gene regions that were included in the 3692 NCBI PopSets

I identified 118 sets of PopSets from the same author(s) that included both mtDNA and nuDNA data of protein-coding regions or introns of at least three species. All but four of the mtDNA sequence datasets included COI sequences; the four exceptions contained cytochrome b (cytb) sequences. All but six of the nuDNA sequence datasets included coding regions of histone H3 (H3); the six exceptions contained coding regions of actin, adenine nucleotide translocase (ANT), histone H4 (H4; two datasets), and a megalin-like lipoprotein (mlp) gene as well as sequences of an intron of a gamma glutamyl carboxylase gene.

Five sets of sequences included data from two mtDNA genes and one nuDNA gene (N = 2; COI and cytb in both cases) or from two nuDNA genes and one mtDNA gene (N = 3; H3 and H4 in two cases and H3 and ANT in the other). I also included the two sets of mtDNA and nuDNA sequences that were previously examined in [4] but not uploaded originally to NCBI as PopSets (i.e., COI and H3 sequences of Bradybaenidae and Aglajidae from [12, 13]). I divided mtDNA and nuDNA sequences of one set of PopSets into three sets of individual alignments that each contained sequences of single superfamilies of the subclass Vetigastropoda (i.e., Fissurelloidea and Lepetelloidea from the order Lepetellida, and Trochoidea from the order Trochida); this was performed to enable comparison of \(\mu\)m/\(\mu\)n from these lower taxonomic categories. I also combined two sets of PopSets from the same author(s) given that the different PopSets included sequences from members of the same family. Hence, in total I examined 121 sets of mtDNA and nuDNA data. The datasets contain sequences of members of four gastropod subclasses, including Caenogastropoda (N = 31), Heterobranchia (N = 81), Patellogastropoda (N = 1), and Vetigastropoda (N = 8), as well as a considerable breadth of the superfamilies (N = 34) contained within these clades (Additional file 1: Table S1). While most datasets contained representatives of single genera (N = 54), others included members of single families (N = 34), superfamilies (N = 18), or higher level ranks (N = 15).

Sequence analyses

I reconstructed mtDNA and nuDNA gene trees using maximum likelihood approaches and then calculated total branch lengths (TBL) of these trees at third positions of codons of coding regions or all positions of intron sequences with MEGAX v.10.1.8 [14]. I then calculated the ratio of these values (i.e., \(\mu\)m/\(\mu\)n) for PopSet pairs that included the same species (Additional file 1: Table S1).

Estimates of \(\mu\)m/\(\mu\)n of individual datasets ranged from a minimum of 1.4 (Vetigastropoda; Lepetellida; Fissurelloidea) to a maximum of 91.9 (Heterobranchia; Tectipleura; Helicina; Clausilioidea; Clausiliidae) with most taxa showing mean ratios that are less than 20 and median ratios that are less than ten (Fig. 1, Additional file 1: Table S1). Gene trees of nuDNA sequences of two PopSets exhibited a TBL of zero (PopSet UID 1735180796, genus Viviparus, family Viviparidae, order Architaenioglossa; PopSet UID 1125137272, genus Acroloxus, family Acroloxidae, superorder Hygrophila); results from analyses of these datasets were excluded from those that utilize ratios of \(\mu\)m/\(\mu\)n or log-transformed values of TBL of nuDNA given that these values are undefined.
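As an illustration only, the ratio calculation and the exclusion of datasets with a nuDNA TBL of zero can be sketched in Python as follows. The function name and the TBL values are hypothetical and are not taken from Additional file 1: Table S1.

```python
from typing import Optional


def mutation_rate_ratio(tbl_mt: float, tbl_nu: float) -> Optional[float]:
    """Return TBL(mtDNA) / TBL(nuDNA), or None when the nuDNA TBL is zero."""
    if tbl_nu == 0:
        # The ratio is undefined, so the dataset is left out of
        # ratio-based analyses rather than assigned an arbitrary value.
        return None
    return tbl_mt / tbl_nu


# Hypothetical total branch lengths (mtDNA, nuDNA) for two datasets.
datasets = {
    "example_A": (3.0, 0.125),
    "example_B": (1.5, 0.0),  # nuDNA gene tree with no divergence
}

ratios = {name: mutation_rate_ratio(mt, nu) for name, (mt, nu) in datasets.items()}
usable = {name: r for name, r in ratios.items() if r is not None}
```

Returning `None` instead of infinity keeps undefined ratios from entering means, log transformations, or boxplots by accident.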

Fig. 1 Boxplots of mutation rate ratios (\(\mu\)m/\(\mu\)n) of datasets of members of gastropod clades from the subclasses Caenogastropoda, Heterobranchia, Patellogastropoda, and Vetigastropoda. Median values (wide horizontal line in box of taxa with more than one dataset), interquartile ranges (boxes), minimum and maximum range (ends of whiskers that extend 1.5 times the interquartile range), means (solid square symbols), and outlier values (open circle symbols) are indicated

Results from an ANOVA that compared \(\mu\)m/\(\mu\)n values of datasets from members of higher level taxonomic groups (i.e., superorders, orders, and suborders) revealed significant differences in \(\mu\)m/\(\mu\)n among these groups (P = 0.00137). Based on results from a Tukey test, mean values of \(\mu\)m/\(\mu\)n differ significantly between Helicina (20.1) and Sacoglossa (3.3) and between Cerithioidea (18.6) and Sacoglossa (Fig. 1). In addition, several groups include datasets that exhibit values of \(\mu\)m/\(\mu\)n that represent outliers in boxplots (Fig. 1). These include datasets from Architaenioglossa, Littorinimorpha, and Neogastropoda in Caenogastropoda; Nudibranchia, Helicina, and Sacoglossa in Heterobranchia; and Lepetellida in Vetigastropoda (Fig. 1, Table 2).

Table 2 Datasets that exhibit outlier values of \(\mu\)m/\(\mu\)n (Fig. 1)

The number of species (N) included in mtDNA and nuDNA gene trees and TBL of these trees exhibit strong positive associations (Fig. 2). Relationships between N and TBL of mtDNA gene trees are not significantly different for datasets with relatively low values of \(\mu\)m/\(\mu\)n (i.e., < 20) and those with relatively high values (i.e., > 20) (P = 0.643) (Fig. 2a, b). On the other hand, relationships between N and TBL are significantly different for nuDNA data from low ratio and high ratio datasets (P < 2.2 × \(10^{-16}\)), with TBL of high ratio datasets exhibiting a much lower rate of increase with increasing N in comparison to low ratio datasets (Fig. 2c, d).

Fig. 2 Relationships between number of species and total branch lengths of gene trees. a mtDNA datasets with \(\mu\)m/\(\mu\)n < 20, \(R^2\) = 0.934, P < 2.2 × \(10^{-16}\); b mtDNA datasets with \(\mu\)m/\(\mu\)n > 20, \(R^2\) = 0.778, P < 7.8 × \(10^{-8}\); c nuDNA datasets with \(\mu\)m/\(\mu\)n < 20, \(R^2\) = 0.705, P < 2.2 × \(10^{-16}\); d nuDNA datasets with \(\mu\)m/\(\mu\)n > 20, \(R^2\) = 0.652, P < 5.8 × \(10^{-6}\)

Datasets that include representatives of genera and families exhibit significantly different values of \(\mu\)m/\(\mu\)n based on ANOVA and Tukey tests (P = 0.000055). In particular, while the mean \(\mu\)m/\(\mu\)n of datasets that contained members of genera and families were 17.9 (SD = 19.3, N = 52) and 12.7 (SD = 11.4, N = 34), respectively, mean values of \(\mu\)m/\(\mu\)n of datasets that included members of superfamilies (5.5, SD = 4.2, N = 19) and higher taxonomic categories (4.2, SD = 4.2, N = 15) were lower (Fig. 3). Also, although all four categories exhibit outliers in boxplots, only outliers from genera and families exhibited values of \(\mu\)m/\(\mu\)n that were greater than 20 (Fig. 3).

Fig. 3 Boxplots of mutation rate ratios (\(\mu\)m/\(\mu\)n) of datasets that include different levels of taxonomic breadth (i.e., genera, families, superfamilies, and high taxonomic ranks). Median values (wide horizontal line in box), interquartile ranges (boxes), minimum and maximum range (ends of whiskers that extend 1.5 times the interquartile range), means (solid square symbols), and outlier values (open circle symbols) are indicated

Discussion

A previous survey that examined relationships of \(\mu\) of mtDNA and nuDNA of 122 animal taxa found that the gastropod family Bradybaenidae (subclass Heterobranchia, order Stylommatophora, suborder Helicina) exhibits a nearly 40-fold difference in \(\mu\)m and \(\mu\)n [4]. Bradybaenidae appeared to be an outlier among molluscs and most other invertebrates which exhibit much smaller differences between \(\mu\)m and \(\mu\)n (i.e., generally less than ten-fold differences). Results from my analysis of 121 mtDNA and nuDNA datasets of various gastropod clades show that Bradybaenidae is not an outlier within Gastropoda. Indeed, groups of species from most of the subclasses examined exhibited values of \(\mu\)m/\(\mu\)n that are relatively large (i.e., > 20). Moreover, several groups have values of \(\mu\)m/\(\mu\)n that exceed the value previously estimated for Bradybaenidae (Fig. 1). Based on analysis of patterns of divergence of mtDNA and nuDNA, differences in \(\mu\)m/\(\mu\)n among gastropod taxa appear to reflect differences in \(\mu\)n and not \(\mu\)m (Fig. 2). Nonetheless, given that datasets with exceptionally large values of \(\mu\)m/\(\mu\)n include low levels of taxonomic breadth, the relatively low values of \(\mu\)n that were estimated for these datasets may be small because they contain recently diverged taxa that exhibit very few if any fixed differences at nuDNA.

Datasets from 11 groups are identified as outliers given that they exhibit values of \(\mu\)m/\(\mu\)n that are considerably larger (N = 10) or smaller (N = 1) than values from related groups (Fig. 1, Table 2). The species contained in the ten datasets that have relatively large values of \(\mu\)m/\(\mu\)n exclusively represent members of single genera that have radiated recently [15,16,17,18,19,20,21,22,23]. For example, the Conus dataset that represents an outlier (\(\mu\)m/\(\mu\)n = 56.7) includes 44 members of the Cape Verde species flock, a group of species that may have radiated explosively during the past few million years [15, 24, 25]. An additional dataset that contains members of the superfamily Conoidea, including six Conus species (but not any from the Cape Verde species flock), represented an outlier because its value of \(\mu\)m/\(\mu\)n (5.9) was less than those of related taxa (Fig. 1). Excluding all but these six Conus species yields a value of 9.0 for \(\mu\)m/\(\mu\)n. Although this value is larger than the value exhibited by members of the entire superfamily, it is still much less than \(\mu\)m/\(\mu\)n of the Cape Verde Conus dataset. Moreover, another dataset that includes sequences of three other Conus species (and none from the Cape Verde species flock) also has a relatively small value of \(\mu\)m/\(\mu\)n (11.2). Hence, while some Conus species exhibit a relatively modest value of \(\mu\)m/\(\mu\)n, species from the Cape Verde species flock, which underwent a recent radiation, show an exceptionally large one.

Datasets with relatively large values of \(\mu\)m/\(\mu\)n show smaller values of \(\mu\)n (i.e., TBL of nuDNA gene trees) relative to the number of species included in the tree compared to datasets with relatively small values of \(\mu\)m/\(\mu\)n (Fig. 2). These results suggest that differences in \(\mu\)m/\(\mu\)n among groups reflect differences in \(\mu\)n. Given that (i) all of the datasets that exhibit depressed values of \(\mu\)n include low levels of taxonomic breadth (i.e., only include members of single genera or families) (Fig. 2) and (ii) datasets with little breadth exhibit larger values of \(\mu\)m/\(\mu\)n than those with high levels of taxonomic breadth (i.e., include members of superfamilies and higher taxonomic categories) (Fig. 3), \(\mu\)n appears to be associated with the taxonomic breadth of species included in datasets. Recently diverged taxa may be more likely to exhibit elevated values of \(\mu\)m/\(\mu\)n because they show very few if any fixed differences at nuDNA. Otherwise, some other processes or sampling artifacts may be responsible for depressing estimates of \(\mu\)n in recently radiated taxa. The Bradybaenidae data contain sequences of many recently diverged species of several genera that do not show reciprocal monophyly in molecular phylogenies [12]. Furthermore, the largest value of \(\mu\)m/\(\mu\)n that was reported in [4] (79.2) is for a group of recently diverged amphibian species (genus Bufo) [26]. While it was hypothesized that the extreme value estimated for this group may be due to sampling error related to the small sample size of the datasets examined [4], the value instead may have been overestimated owing to the recent divergence of the species included in the dataset.

Although most of the datasets examined included coding regions of COI for the mitochondrial gene and H3 for the nuclear gene, four of the outlier datasets included sequences of coding regions of the mitochondrial gene cytb and sequences of coding regions of three additional nuclear genes (a megalin-like lipoprotein gene, an adenine nucleotide translocase gene, and H4). Although it will be important to perform broader surveys of genes and gene regions (e.g., introns and intergenic regions) to further validate this pattern, it is not limited to the same mitochondrial and nuclear gene pairs and hence appears to be reasonably robust to gene sampling.

Conclusions

Members of Gastropoda appear to show considerable variation in \(\mu\)m and \(\mu\)n, but overall tend to exhibit lower values of \(\mu\)m/\(\mu\)n than vertebrates [4]. Nonetheless, some of the values reported herein may reflect overestimates of \(\mu\)m/\(\mu\)n due to the inclusion of species that show low levels of total molecular divergence at nuDNA possibly due to their recent divergence. Although comparing TBL of mtDNA and nuDNA gene trees is an effective means for determining relationships among \(\mu\)m and \(\mu\)n, the approach may give overestimates of \(\mu\)m/\(\mu\)n when datasets include a number of recently diverged species. Nonetheless, for groups in which fossil calibrations are not available, estimating \(\mu\)m/\(\mu\)n could be useful for identifying clades that have radiated recently.

Methods

Datasets

To estimate \(\mu\)m/\(\mu\)n, I gathered mtDNA and nuDNA sequence data that were uploaded to GenBank as ‘PopSets’ or collections of sequence data (as opposed to individual sequence submissions). I utilized this strategy in an effort to ensure that different authors’ views on the identity of species did not affect estimates of \(\mu\)m/\(\mu\)n. I searched the NCBI PopSet database (https://www.ncbi.nlm.nih.gov/popset) using the term “Gastropoda [Organism]” in the search field (accessed on 18-May-2020). I downloaded search results as an XML file and parsed the data to extract various information such as PopSet title, author(s), and unique identifier; publication info; taxa represented in the PopSet; and gene name, gene source (i.e., mtDNA or nuDNA), and number of sequences. I then sorted the resultant data by gene source and PopSet author(s) to identify PopSets from the same author(s) that included both mtDNA and nuDNA sequences from at least three gastropod species. I selected PopSets that exclusively included intron or coding regions (but not both) and for which sequence data were available for both mtDNA and nuDNA. The final list of prospective PopSets included all but two of the gastropod datasets that were examined in [4]; these latter datasets included sequences of cytochrome oxidase subunit I (COI), a mtDNA gene, and histone H3 (H3), a nuDNA gene of Bradybaenidae [12] and Aglajidae [13], a member of the order Cephalaspidea.
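The sorting step described above can be sketched as follows. This is a minimal illustration, assuming the parsed search results have already been reduced to simple records; the field names (`uid`, `authors`, `source`, `n_species`) and the example records are hypothetical and do not reflect the NCBI XML schema or any real PopSet.

```python
from collections import defaultdict

# Hypothetical parsed records, one per PopSet.
records = [
    {"uid": "1", "authors": "Smith,Lee", "source": "mtDNA", "gene": "COI", "n_species": 12},
    {"uid": "2", "authors": "Smith,Lee", "source": "nuDNA", "gene": "H3", "n_species": 12},
    {"uid": "3", "authors": "Garcia", "source": "mtDNA", "gene": "COI", "n_species": 5},
    {"uid": "4", "authors": "Chen", "source": "mtDNA", "gene": "COI", "n_species": 2},
    {"uid": "5", "authors": "Chen", "source": "nuDNA", "gene": "H3", "n_species": 2},
]


def candidate_pairs(records, min_species=3):
    """Group PopSets by author string and keep authors with both genome sources."""
    by_author = defaultdict(lambda: {"mtDNA": [], "nuDNA": []})
    for rec in records:
        # Assumption of this sketch: the species count is checked per PopSet.
        if rec["n_species"] >= min_species:
            by_author[rec["authors"]][rec["source"]].append(rec["uid"])
    return {
        author: sources
        for author, sources in by_author.items()
        if sources["mtDNA"] and sources["nuDNA"]
    }


pairs = candidate_pairs(records)
```

Matching on an exact author string is a simplification; in practice author lists may be formatted inconsistently, so candidate pairs would still need to be checked by hand.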

I downloaded fasta files of PopSets or individual sequences (i.e., for the two datasets that were examined in [4] but not uploaded as PopSets) from ‘PopSet’ or ‘Nucleotide’ databases at NCBI (https://www.ncbi.nlm.nih.gov/). I aligned each set of sequences using MUSCLE [27] in Seqotron v1.0.1 [28]. I evaluated sequence datasets by eye in Seqotron to ensure that alignments were robust. This included adjustments of out of frame insertions so that they occurred in the proper reading frame and elimination of ends of sequences that appeared to be misaligned (possibly due to base call errors) because they contained insertions that affected reading frames (i.e., did not occur in multiples of three). I then compared species and individuals present in the corresponding alignments of mtDNA and nuDNA sequence data and removed species and individuals from one alignment if they were not present in the other. I also eliminated all but one representative of each species in alignments and retained sequences of individuals that were represented in both datasets and/or that were the most complete; in cases where sequences from more than one individual satisfied these criteria, I retained the individual that was listed first.
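The species-matching step can be illustrated with a short sketch. It assumes each alignment is available as a list of `(species, sequence)` tuples in file order, and it simplifies the criteria above by ignoring whether the same individual appears in both alignments. All function names are hypothetical.

```python
def defined_length(seq):
    """Count unambiguous bases, used here as a simple measure of completeness."""
    return sum(1 for base in seq.upper() if base in "ACGT")


def one_per_species(records):
    """Keep the most complete sequence per species; ties keep the first listed."""
    best = {}
    for species, seq in records:
        if species not in best or defined_length(seq) > defined_length(best[species]):
            best[species] = seq
    return best


def match_species(mt_records, nu_records):
    """Reduce both alignments to the species present in each of them."""
    mt = one_per_species(mt_records)
    nu = one_per_species(nu_records)
    shared = [sp for sp in mt if sp in nu]
    return {sp: mt[sp] for sp in shared}, {sp: nu[sp] for sp in shared}


# Hypothetical input: sp3 lacks nuDNA data and is dropped from both sets.
mt_records = [("sp1", "ACGT--"), ("sp1", "ACGTAC"), ("sp2", "ACGTAC"), ("sp3", "ACG")]
nu_records = [("sp1", "AAA"), ("sp1", "CCC"), ("sp2", "GGG")]
mt_matched, nu_matched = match_species(mt_records, nu_records)
```

The strict `>` comparison is what implements the "listed first" tie-break: a later sequence replaces an earlier one only if it is more complete.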

I utilized taxonomy information presented in the PopSet or GenBank files to specify the genera, families, superfamilies and higher level taxonomic categories of species included in datasets. I then reconciled this information with the hierarchical classification of gastropods presented in MolluscaBase [29].

Sequence analyses

Total branch lengths (TBL) of mtDNA and nuDNA gene trees at putative neutral sites can be used to estimate relative differences in \(\mu\)m and \(\mu\)n based on calculation of the ratio \(\mu\)m/\(\mu\)n [4]. As described in [4], total molecular divergence (i.e., TBL of gene trees) at neutral sites is a function of neutral mutation rates and divergence times [30, 31]. Given that divergence times of species represented in mtDNA and nuDNA gene trees should be the same, the ratio of the TBL of these trees (at neutral sites) provides an estimate of \(\mu\)m/\(\mu\)n [4]. I used estimates of divergence (i.e., TBL) at third codon positions for coding regions (as implemented in [4]) and all positions of intron regions as a proxy for neutral divergence.
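The reasoning above can be written out explicitly. The following is a sketch under simplifying assumptions (strict neutrality at the sites examined, the same branch durations in both gene trees, rates that are constant across branches, and no saturation); the symbols \(t_i\) (duration of branch \(i\)) and \(B\) (number of branches) are introduced here for illustration only.

```latex
\begin{align}
\mathrm{TBL}_m &\approx \sum_{i=1}^{B} \mu_m t_i = \mu_m \sum_{i=1}^{B} t_i, \\
\mathrm{TBL}_n &\approx \sum_{i=1}^{B} \mu_n t_i = \mu_n \sum_{i=1}^{B} t_i, \\
\frac{\mathrm{TBL}_m}{\mathrm{TBL}_n} &\approx \frac{\mu_m}{\mu_n}.
\end{align}
```

Because the shared term \(\sum_i t_i\) cancels, no absolute divergence times are needed. The ratio becomes unstable, however, when \(\mathrm{TBL}_n\) is close to zero.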

I used MEGAX v.10.1.8 [14] to construct gene trees and estimate branch lengths of individual datasets. I constructed individual phylograms for each locus to limit the effect of having discordant gene trees that could result in overestimates of TBL due to incomplete lineage sorting (see [4]). I specified the genetic code and examined alignments to set the appropriate codon start position. I reconstructed gene trees using the General Time Reversible model with maximum likelihood. I eliminated sites that were not defined in 80% of sequences; otherwise, all other positions were utilized in tree building. I examined gene trees in MEGA to ensure that the phylogenies did not contain any long branches that could be due to any alignment errors. I then used maximum likelihood to estimate the TBL of gene trees at third positions of codons of coding regions or all positions of intron sequences.
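For readers who wish to reproduce the site selection outside MEGA, the following sketch shows one way to extract third codon positions and apply an 80% coverage cutoff. It is not MEGA's implementation; the function names, the order of the two steps, and the toy alignment are assumptions made for illustration.

```python
def third_positions(alignment, codon_start=0):
    """Return every third site, starting at the third base of the first codon."""
    return [seq[codon_start + 2::3] for seq in alignment]


def filter_sites(alignment, min_coverage=0.8):
    """Drop columns in which fewer than min_coverage of sequences are defined."""
    n_seqs = len(alignment)
    keep = [
        i
        for i in range(len(alignment[0]))
        if sum(seq[i].upper() in "ACGT" for seq in alignment) / n_seqs >= min_coverage
    ]
    return ["".join(seq[i] for i in keep) for seq in alignment]


# Toy in-frame alignment of five sequences (hypothetical data).
alignment = ["ATGGCCAAA", "ATGGCTAA-", "ATGGC-AAN", "ATGGCGAAA", "ATGGCAAAG"]

# Third positions are taken first so that removing columns cannot
# shift the reading frame.
thirds = third_positions(alignment)
filtered = filter_sites(thirds)
```

In this example the last third-position column is defined in only three of five sequences and is therefore removed, whereas the middle column (four of five) is retained.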

I performed all statistical tests in R [32]. I compared values of \(\mu\)m/\(\mu\)n for sequence datasets that included species from the following higher level taxonomic groups: the orders Architaenioglossa, Littorinimorpha, and Neogastropoda, and the superfamilies Abyssochrysoidea and Cerithioidea from the subclass Caenogastropoda; the orders Nudibranchia, Pleurobranchida, Aplysiida, Cephalaspidea, Ellobiida, and Runcinida, suborders Achatinina and Helicina, and superorder Hygrophila of the subclass Heterobranchia; the subclass Patellogastropoda; and the orders Lepetellida and Trochida from the subclass Vetigastropoda. I used average values of \(\mu\)m/\(\mu\)n for datasets that included more than one mtDNA or nuDNA gene region. I compared \(\mu\)m/\(\mu\)n (using log-transformed values) among these groups with ANOVA and used a Tukey test to identify groups with significant differences in \(\mu\)m/\(\mu\)n. I utilized boxplots to visualize patterns of variation of \(\mu\)m/\(\mu\)n among and within gastropod taxa and identify outlier datasets.
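The analyses themselves were run in R; the sketch below only illustrates, in plain Python, what a one-way ANOVA on log-transformed ratios computes. The function name and the example ratios are hypothetical, and the sketch stops at the F statistic (it does not compute P values or perform the Tukey test).

```python
import math


def one_way_anova_f(groups):
    """Return (F, df_between, df_within) for a one-way ANOVA."""
    all_values = [v for g in groups for v in g]
    grand_mean = sum(all_values) / len(all_values)
    means = [sum(g) / len(g) for g in groups]
    ss_between = sum(len(g) * (m - grand_mean) ** 2 for g, m in zip(groups, means))
    ss_within = sum((v - m) ** 2 for g, m in zip(groups, means) for v in g)
    df_between = len(groups) - 1
    df_within = len(all_values) - len(groups)
    f_stat = (ss_between / df_between) / (ss_within / df_within)
    return f_stat, df_between, df_within


# Hypothetical ratios for two taxonomic groups.
ratios_by_group = {"group_A": [15.0, 22.0, 30.0], "group_B": [3.0, 4.0, 2.5]}

# Log transformation is applied before the comparison, as in the text.
log_groups = [[math.log(v) for v in values] for values in ratios_by_group.values()]
f_stat, df_between, df_within = one_way_anova_f(log_groups)
```

Datasets with an undefined ratio would need to be removed before the log transformation, since the logarithm is undefined for them.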

To evaluate whether differences in \(\mu\)m/\(\mu\)n ratios reflect relative increases in \(\mu\)m or decreases in \(\mu\)n, I compared TBL of mtDNA and nuDNA gene trees to the number of species included in these trees. Measures of TBL should increase proportionally to the number of species examined [33]. I specifically compared TBL of mtDNA and nuDNA gene trees among datasets that exhibited different relative values of \(\mu\)m/\(\mu\)n (i.e., less than and greater than 20) and determined levels of significance with an ANOVA based on comparison of log-transformed values of TBL that were standardized to the number of species included in the tree.
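One possible reading of this standardization is sketched below: TBL is divided by the number of species and then log-transformed, and datasets are split at a ratio of 20. The exact transformation used in the R analysis is not specified beyond the description above, so the function names, the formula, and the example values are assumptions.

```python
import math


def log_tbl_per_species(tbl, n_species):
    """Return log(TBL / number of species), or None when TBL is zero."""
    if tbl <= 0:
        return None
    return math.log(tbl / n_species)


def split_by_ratio(datasets, threshold=20.0):
    """Group standardized nuDNA TBL values by low vs. high ratio datasets."""
    groups = {"low": [], "high": []}
    for d in datasets:
        if d["tbl_nu"] == 0:
            continue  # ratio and log(TBL) are undefined
        ratio = d["tbl_mt"] / d["tbl_nu"]
        # A ratio of exactly 20 is placed in the low group in this sketch.
        key = "high" if ratio > threshold else "low"
        groups[key].append(log_tbl_per_species(d["tbl_nu"], d["n_species"]))
    return groups


# Hypothetical datasets with equal species counts and mtDNA TBL.
datasets = [
    {"n_species": 10, "tbl_mt": 5.0, "tbl_nu": 1.0},
    {"n_species": 10, "tbl_mt": 5.0, "tbl_nu": 0.1},
    {"n_species": 10, "tbl_mt": 5.0, "tbl_nu": 0.0},
]
groups = split_by_ratio(datasets)
```

The two resulting groups could then be compared with an ANOVA, separately for mtDNA and nuDNA TBL.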

If values of \(\mu\)m/\(\mu\)n are overestimated because they include recently diverged species, \(\mu\)m/\(\mu\)n values that are calculated from datasets that include little taxonomic breadth (e.g., those including representatives of genera and families) will be greater than values from datasets that include more taxonomic breadth (e.g., those including members of superfamilies, suborders, etc.). To determine if this is the case, I compared estimates of \(\mu\)m/\(\mu\)n among datasets that include different levels of taxonomic breadth. While some PopSets only included members of single genera, others included various members of families, superfamilies, suborders, orders, and subclasses. I utilized an ANOVA to compare log-transformed values of \(\mu\)m/\(\mu\)n among datasets representing genera, families, superfamilies and combined higher taxonomic categories; I used a Tukey test to determine which samples exhibit significantly different values. I also utilized boxplots to visualize patterns of variation among datasets that included different levels of taxonomic breadth.
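Assigning a dataset to a breadth category amounts to finding the lowest rank at which all included species belong to a single taxon. A minimal sketch follows, assuming each species is annotated with its genus, family, and superfamily; the function name and the placeholder taxon names are hypothetical.

```python
# Ranks ordered from narrowest to broadest.
RANKS = ["genus", "family", "superfamily"]


def taxonomic_breadth(species_taxonomy):
    """Return the lowest rank shared by all species, or 'higher' if none is."""
    for rank in RANKS:
        if len({taxon[rank] for taxon in species_taxonomy}) == 1:
            return rank
    return "higher"


# Hypothetical dataset: two genera within one family.
dataset = [
    {"genus": "GenusA", "family": "FamilyX", "superfamily": "SuperfamilyP"},
    {"genus": "GenusB", "family": "FamilyX", "superfamily": "SuperfamilyP"},
]
breadth = taxonomic_breadth(dataset)
```

Here the dataset is classified at the family level because its species span two genera but a single family.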

Availability of data and materials

The data underlying this article are available in the GenBank PopSet database (https://www.ncbi.nlm.nih.gov/popset). UID numbers of PopSets and additional information on data analyzed are available in Additional file 1: Table S1.

Abbreviations

ANT: Adenine nucleotide translocase

COI: Cytochrome oxidase subunit I

COII: Cytochrome oxidase subunit II

EF1α: Elongation factor 1-alpha

GTR: General Time Reversible

H3: Histone H3

ITS1: Internal transcribed spacer 1

ITS2: Internal transcribed spacer 2

Mlp: Megalin-like lipoprotein

mtDNA: Mitochondrial deoxyribonucleic acid

NADH: Nicotinamide adenine dinucleotide

nuDNA: Nuclear deoxyribonucleic acid

TBL: Total branch length

\(\mu\)m: Mutation rates of mitochondrial deoxyribonucleic acid

\(\mu\)n: Mutation rates of nuclear deoxyribonucleic acid

References

  1. 1.

    Sloan DB, Havird JC, Sharbrough J. The on-again, off-again relationship between mitochondrial genomes and species boundaries. Mol Ecol. 2017;26:2212–36.

    Article  Google Scholar 

  2. 2.

    Yan Z, Ye G, Werren JH. Evolutionary rate correlation between mitochondrial-encoded and mitochondria-associated nuclear-encoded proteins in insects. Mol Biol Evol. 2019;36:1022–36.

    CAS  Article  Google Scholar 

  3. 3.

    Nabholz B, Glemin S, Galtier N. Strong variations of mitochondrial mutation rate across mammals—the longevity hypothesis. Mol Biol Evol. 2007;25:120–30.

    Article  Google Scholar 

  4. 4.

    Allio R, Donega S, Galtier N, Nabholz B. Large variation in the ratio of mitochondrial to nuclear mutation rate across animals: implications for genetic diversity and the use of mitochondrial DNA as a molecular marker. Mol Biol Evol. 2017;34:2762–72.

    CAS  Article  Google Scholar 

  5. 5.

    Brown WM, George M, Wilson AC. Rapid evolution of animal mitochondrial DNA. Proc Natl Acad Sci. 1979;76:1967–71.

    CAS  Article  Google Scholar 

  6. 6.

    Ballard JWO, Whitlock MC. The incomplete natural history of mitochondria. Mol Ecol. 2004;13:729–44.

    Article  Google Scholar 

  7. 7.

    Lynch M. Mutation pressure and the evolution of organelle genomic architecture. Science. 2006;311:1727–30.

    CAS  Article  Google Scholar 

  8. 8.

    Vawter L, Brown W. Nuclear and mitochondrial DNA comparisons reveal extreme rate variation in the molecular clock. Science. 1986;234:194–6.

    CAS  Article  Google Scholar 

  9. 9.

    Martin AP, Naylor GJP, Palumbi SR. Rates of mitochondrial DNA evolution in sharks are slow compared with mammals. Nature. 1992;357:153–5.

    CAS  Article  Google Scholar 

  10. 10.

    Metz EC, Robles-Sikisaka R, Vacquier VD. Nonsynonymous substitution in abalone sperm fertilization genes exceeds substitution in introns and mitochondrial DNA. Proc Natl Acad Sci. 1998;95:10676–81.

    CAS  Article  Google Scholar 

  11. 11.

    Shearer TL, van Oppen MJH, Romano SL, Wörheide G. Slow mitochondrial DNA sequence evolution in the Anthozoa (Cnidaria): anthozoan mtDNA evolution. Mol Ecol. 2002;11:2475–87.

    CAS  Article  Google Scholar 

  12. 12.

    Hirano T, Kameda Y, Kimura K, Chiba S. Substantial incongruence among the morphology, taxonomy, and molecular phylogeny of the land snails Aegista, Landouria, Trishoplita, and Pseudobuliminus (Pulmonata: Bradybaenidae) occurring in East Asia. Mol Phylogenet Evol. 2014;70:171–81.

    Article  Google Scholar 

  13. 13.

    Camacho-García YE, Ornelas-Gatdula E, Gosliner TM, Valdés Á. Phylogeny of the family Aglajidae (Pilsbry, 1895) (Heterobranchia: Cephalaspidea) inferred from mtDNA and nDNA. Mol Phylogenet Evol. 2014;71:113–26.

    Article  Google Scholar 

  14. 14.

    Kumar S, Stecher G, Li M, Knyaz C, Tamura K. MEGA X: Molecular evolutionary genetics analysis across computing platforms. Mol Biol Evol. 2018;35:1547–9.

    CAS  Article  Google Scholar 

  15. 15.

    Cunha RL, Castilho R, Rüber L, Zardoya R. Patterns of cladogenesis in the venomous marine gastropod genus Conus from the Cape Verde islands. Syst Biol. 2005;54:634–50.

    Article  Google Scholar 

  16. 16.

    Hoover C, Lindsay T, Goddard JHR, Valdés Á. Seeing double: pseudocryptic diversity in the Doriopsilla albopunctataDoriopsilla gemela species complex of the north-eastern Pacific. Zool Scr. 2015;44:612–31.

    Article  Google Scholar 

  17. 17.

    Huelsken T, Wägele H, Peters B, Mather A, Hollmann M. Molecular analysis of adults and egg masses reveals two independent lineages within the infaunal gastropod Naticarius onca (Röding, 1798) (Caenogastropoda: Naticidae). Molluscan Res. 2014;31:141–51.

    Google Scholar 

  18. 18.

    Layton KKS, Rouse GW, Wilson NG. A newly discovered radiation of endoparasitic gastropods and their coevolution with asteroid hosts in Antarctica. BMC Evol Biol. 2019;19:180.

    Article  Google Scholar 

  19. 19.

    Layton KKS, Gosliner TM, Wilson NG. Flexible colour patterns obscure identification and mimicry in Indo-Pacific Chromodoris nudibranchs (Gastropoda: Chromodorididae). Mol Phylogenet Evol. 2018;124:27–36.

    Article  Google Scholar 

  20. 20.

    McCarthy JB, Krug PJ, Valdés Á. Integrative systematics of Placida cremoniana (Trinchese, 1892) (Gastropoda, Heterobranchia, Sacoglossa) reveals multiple pseudocryptic species. Mar Biodivers. 2019;49:357–71.

    Article  Google Scholar 

  21. 21.

    Páll-Gergely B, Szekeres M, Fehér Z, Asami T, Harl J. Evolution of a dextral lineage by left-right reversal in Cristataria (Gastropoda, Pulmonata, Clausiliidae). J Zool Syst Evol Res. 2019;57:520–6.

    Article  Google Scholar 

  22. 22.

    Schultheiß R, Van Bocxlaer B, Riedel F, von Rintelen T, Albrecht C. Disjunct distributions of freshwater snails testify to a central role of the Congo system in shaping biogeographical patterns in Africa. BMC Evol Biol. 2014;14:42.

  23.

    Tibiriçá Y, Pola M, Ortigosa D, Cervera JL. Systematic review of the “Chromodoris quadricolor group” of East Africa, with descriptions of two new species of the genus Chromodoris Alder & Hancock, 1855 (Heterobranchia, Nudibranchia). J Zool Syst Evol Res. 2020;58:230–61.

  24.

    Abalde S, Tenorio MJ, Afonso CML, Uribe JE, Echeverry AM, Zardoya R. Phylogenetic relationships of cone snails endemic to Cabo Verde based on mitochondrial genomes. BMC Evol Biol. 2017;17:231.

  25.

    Duda TF, Rolán E. Explosive radiation of Cape Verde Conus, a marine species flock. Mol Ecol. 2005;14:267–72.

  26.

    Recuero E, Canestrelli D, Vörös J, Szabó K, Poyarkov NA, Arntzen JW, et al. Multilocus species tree analyses resolve the radiation of the widespread Bufo bufo species group (Anura, Bufonidae). Mol Phylogenet Evol. 2012;62:71–86.

  27.

    Edgar RC. MUSCLE: a multiple sequence alignment method with reduced time and space complexity. BMC Bioinform. 2004;5:1–19.

  28.

    Fourment M, Holmes EC. Seqotron: a user-friendly sequence editor for Mac OS X. BMC Res Notes. 2016;9:106.

  29.

    MolluscaBase eds. MolluscaBase. http://www.molluscabase.org. 2020. Accessed 28 July 2020.

  30.

    Kimura M. Evolutionary rate at the molecular level. Nature. 1968;217:624–6.

  31.

    Birky CW, Walsh JB. Effects of linkage on rates of molecular evolution. Proc Natl Acad Sci. 1988;85:6414–8.

  32.

    R Core Team. R: a language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. 2020. http://www.R-project.org/.

  33.

    Nipperess DA. The rarefaction of phylogenetic diversity: formulation, extension and application. In: Pellens R, Grandcolas P, editors. Biodiversity conservation and phylogenetic systematics. Cham: Springer International Publishing; 2016. p. 197–217. https://doi.org/10.1007/978-3-319-22461-9_10.

Acknowledgements

I am grateful to all of the workers who contributed the PopSets that were analyzed herein. I also thank two anonymous reviewers as well as Taehwan Lee and Peter Cerda for their criticisms and comments on earlier drafts of this manuscript.

Funding

Not applicable.

Author information

Contributions

The author read and approved the final manuscript.

Corresponding author

Correspondence to Thomas F. Duda Jr.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The author declares that he has no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1: Table S1.

Information on PopSet data analyzed in this study.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

About this article

Cite this article

Duda, T.F. Patterns of variation of mutation rates of mitochondrial and nuclear genes of gastropods. BMC Ecol Evo 21, 13 (2021). https://doi.org/10.1186/s12862-021-01748-2

Keywords

  • Gastropoda
  • Mutation rates
  • Mitochondrial DNA
  • Nuclear DNA