Genomes reveal marked differences in the adaptive evolution between orangutan species

Mattle-Greminger, Maja P.; Bilgin Sonay, Tugce; Nater, Alexander; Pybus, Marc; Desai, Tariq; de Valles, Guillem; Casals, Ferran; Scally, Aylwyn; Bertranpetit, Jaume; Marques-Bonet, Tomas; van Schaik, Carel P.; Anisimova, Maria; Krützen, Michael

doi:10.1186/s13059-018-1562-6

Genomes reveal marked differences in the adaptive evolution between orangutan species

Research
Open access
Published: 15 November 2018

Volume 19, article number 193, (2018)
Cite this article

Download PDF

You have full access to this open access article

Genome Biology Aims and scope Submit manuscript

Genomes reveal marked differences in the adaptive evolution between orangutan species

Download PDF

Maja P. Mattle-Greminger^1,2^na1,
Tugce Bilgin Sonay^1,3^na1,
Alexander Nater^1,2,4^na1,
Marc Pybus⁵,
Tariq Desai⁶,
Guillem de Valles⁵,
Ferran Casals⁷,
Aylwyn Scally⁶,
Jaume Bertranpetit⁵,
Tomas Marques-Bonet^8,9,10,11,
Carel P. van Schaik¹,
Maria Anisimova^12,3 &
…
Michael Krützen¹

8939 Accesses
13 Citations
12 Altmetric
Explore all metrics

Abstract

Background

Integrating demography and adaptive evolution is pivotal to understanding the evolutionary history and conservation of great apes. However, little is known about the adaptive evolution of our closest relatives, in particular if and to what extent adaptions to environmental differences have occurred. Here, we used whole-genome sequencing data from critically endangered orangutans from North Sumatra (Pongo abelii) and Borneo (P. pygmaeus) to investigate adaptive responses of each species to environmental differences during the Pleistocene.

Results

Taking into account the markedly disparate demographic histories of each species after their split ~ 1 Ma ago, we show that persistent environmental differences on each island had a strong impact on the adaptive evolution of the genus Pongo. Across a range of tests for positive selection, we find a consistent pattern of between-island and species differences. In the more productive Sumatran environment, the most notable signals of positive selection involve genes linked to brain and neuronal development, learning, and glucose metabolism. On Borneo, however, positive selection comprised genes involved in lipid metabolism, as well as cardiac and muscle activities.

Conclusions

We find strikingly different sets of genes appearing to have evolved under strong positive selection in each species. In Sumatran orangutans, selection patterns were congruent with well-documented cognitive and behavioral differences between the species, such as a larger and more complex cultural repertoire and higher degrees of sociality. However, in Bornean orangutans, selective responses to fluctuating environmental conditions appear to have produced physiological adaptations to generally lower and temporally more unpredictable food supplies.

Population Genomics in the Great Apes

Comparative genomic analyses provide new insights into evolutionary history and conservation genomics of gorillas

Article Open access 26 January 2024

Genomic basis of insularity and ecological divergence in barn owls (Tyto alba) of the Canary Islands

Article Open access 29 September 2022

Background

Local adaptations are expected when species colonize novel environments, or within the context of heterogeneous environments and population structure [1]. In humans, paleoclimatic changes may have triggered the expansion out of Africa into novel environments (e.g., [2, 3]). These migrations through the Late Pleistocene and the Neolithic exposed our ancestors to novel selective pressures, leading to numerous local adaptations (reviewed in [4]). Examples for such adaptations are thermoregulation [5] and lighter skin pigmentation in response to living at higher latitudes [6, 7], tolerance to hypoxia in higher altitudes [8,9,10], and complex traits like body size [11]. In the Neolithic, with the switch to agricultural subsistence, cattle-breeding, higher population densities, and increased infectious-disease loads lead to changes in phenotypes linked to immune responses [12,13,14]. Compared to humans, however, little is known about the adaptive evolution in non-human great apes [15,16,17,18] and whether or to what extent adaptations to environmental changes have occurred. Identifying genetic signatures of adaptive evolution in our closest relatives will not only enhance our understanding of how adaptation has shaped genetic variation in extant great ape populations, but may also help us to understand the differences between human and non-human great apes.

Orangutans (genus: Pongo) are the only great apes in Asia and phylogenetically the most distantly related to humans. Once widely distributed throughout Southeast Asia [19], their range is currently restricted to increasingly isolated forest patches on northern Sumatra (P. abelii and P. tapanuliensis) and Borneo (P. pygmaeus) (Fig. 1) [20]. During most of the Pleistocene, orangutans on the two islands have been subjected to markedly different environmental conditions. On northern Sumatra, where all extant Sumatran orangutans occur, habitat conditions have been generally more stable and productive due to volcanism-enriched soils, lower levels of cloud cover, and more regular precipitation patterns [21, 22]. On Borneo, however, fluctuations of rainforest coverage during glacial cycles were more pronounced [23, 24]. Furthermore, particularly in the northeastern part of Borneo, orangutans (P. p. morio) have been exposed to marked temporal fluctuations in fruit availability, including mast fruiting events with short periods of overabundance of fruit, followed by extended periods of low fruit production [22, 25,26,27]. Moreover, severe and unpredictable droughts and forest fires due to El Niño events have been causing prolonged periods of extreme food scarcity, posing major threats to orangutan survival in this region [28, 29]. As a consequence of these differences in forest productivity and temporal stability of food supply, population densities are generally higher on northern Sumatra compared to Borneo and particularly low in the northeast of Borneo [30, 31].

In line with these long-term environmental differences, orangutans on Borneo and Sumatra show well-documented geographic variation in their biology, including morphology, physiology, life history, and cultural repertoire [32]. Orangutans therefore provide a unique opportunity to investigate the interplay between environmental processes, demography, and adaptive evolution in two closely related, yet phenotypically distinct great ape species.

To identify molecular targets of adaptive evolution, we used data from 35 complete orangutan genomes from nine geographically distinct sampling areas (Fig. 1) across the range of Bornean (P. pygmaeus) and North Sumatran (P. abelii) orangutans. We did not include the recently described Tapanuli orangutan (P. tapanuliensis) [33], as the low sample size (n = 2) of available genomes impeded such analyses.

We carried out multiple tests to detect different molecular patterns associated with positive selection, such as distortion of the site-frequency spectrum [34, 35], extended haplotype structure [36, 37], and substitution patterns in coding regions [38, 39]. Detecting signatures of past and ongoing selective processes on the background of complex demographic histories is challenging [40, 41], as population size changes and structure can produce patterns similar to the ones expected under non-neutral scenarios [34]. To avoid the detection of spurious signatures of selection, reliable identification of the targets of selection requires detailed knowledge about the demographic history [42]. We therefore used a detailed demographic model [33] to account for outliers in genome scans caused by neutral demographic processes.

Results

Demographic history of the genus Pongo

We investigated the demographic history of the Bornean and Sumatran orangutan populations using the pairwise sequentially Markovian coalescent (PSMC) [43] and multiple sequentially Markovian coalescent (MSMC2) [44] methods. While PSMC (no phasing required) infers variation in effective population size (N_e) over time based on single diploid genomes, MSMC2 (phasing required) can additionally infer temporal patterns of gene flow by comparing haplotype pairs across populations. However, inaccurate phasing of autosomal data will break up long haplotype segments, biasing split time estimates and masking signatures of recent gene flow [45]. We aimed to avoid such issues by focusing on male haploid X chromosomes. Our analyses suggested that the ancestors of Bornean and Sumatran orangutans began to diverge ~ 1 Ma (Fig. 2), and that gene flow between the diverged populations ceased between 1 Ma and 400 ka (Fig. 3a), with no evidence of subsequent gene flow. However, as X chromosomes in great apes have been found to be affected by widespread selection [46, 47], potential biases in coalescent-based estimates of N_e could arise using this approach. We therefore repeated the gene flow analyses using entire phased autosomal genomes, resulting in highly congruent estimates of the divergence time between orangutan species (Additional file 1: Figure S7A).

To further corroborate the gene flow estimates and assess the impact of population structure on the PSMC and MSMC2 analyses, we fitted a demographic model to the empirical multi-dimensional site-frequency spectrum using the program momi2 [48] (Additional file 1: Figure S8). Most parameter estimates were consistent with the findings from PSMC and MSMC2 (Additional file 1: Table S4). We obtained a split time between Borneo and Sumatra of ~ 731 ka (95% CI 716–741 ka), which falls right into the time window when the cross-population N_e estimates from MSMC2 increased towards infinity (Additional file 1: Figure S7A). However, while there was no indication of gene flow between Sumatra and Borneo more recent than ~ 400 ka in the MSMC2 analysis, our momi2 analysis suggested a more recent phase of admixture during the last glacial period (~ 44 ka, 40–48 ka). This admixture pulse was asymmetrical, with ~ 4.2% (4.0–4.4%) of Sumatran lineages being derived from Borneo while only ~ 1.5% (1.3–1.7%) of the Bornean lineages exhibit recent Sumatran ancestry.

After their initial separation in the Middle Pleistocene, orangutan populations on Borneo and Sumatra exhibited strongly divergent demographic trajectories (Fig. 2). While Sumatran orangutans went through a continuous increase in N_e until ~ 70 ka ago, Borneans experienced a steady decline in N_e. The N_e of Bornean orangutans was reduced from ~ 30,000 to ~ 5000 between 300 and 100 ka ago (Fig. 2). Low cross-population N_e estimates of below 5000 for all population comparisons on Borneo at this time (Fig. 3b) indicate high levels of gene flow among Bornean populations and suggest the possibility of a single common refugium during this period. This was followed by a rapid increase of cross-population N_e at 20–50 ka, indicating increasing genetic isolation of most populations during the late Pleistocene (Fig. 3b). This result is in good agreement with the momi2 analysis, indicating a split time among Bornean population of ~ 25 ka (23–27 ka; Additional file 1: Table S4). However, the increase in cross-population N_e was not accompanied by an increase of N_e within populations, which remained low on Borneo (3000–10,000; Fig. 2, Additional file 1: Table S4). In contrast, on Sumatra, high N_e during the Middle Pleistocene ended with a rapid decline of about an order of magnitude at ~ 70–80 ka (Fig. 2). Similar to the pattern in Borneo, N_e in Sumatran orangutans did not recover (Fig. 2) and the site-frequency spectrum-based demographic model resulted in current N_e estimates as low as 80–230 for the populations on Sumatra (Additional file 1: Table S4).

The divergent demographic histories of the orangutans on Borneo and Sumatra, along with persistent environmental differences on both islands [21, 23, 24], may have had a strong impact on adaptive evolution of this genus, since the efficiency of selection and the adaptive potential depend on N_e [49]. Considering this, we explored potential genomic adaptions over different evolutionary time scales.

Selection during more distant time scales

We detected selection over more distant evolutionary time scales based on non-synonymous to synonymous substitution rate ratios within protein-coding genes estimated by Markov codon models (MCM; Additional file 1: SI Section 7). In this highly conservative approach, we applied the branch-site tests for positive selection by Zhang et al. [50] on a set of 34,379 aligned exons of four primate species based on the reference sequences of human, chimpanzee, and rhesus macaque, along with the orangutan samples. We used samples from North Sumatran orangutans (P. abelii, LK/NA and WA), and Northeast Bornean orangutans (P. pygmaeus morio, populations SK/NK and EK), ignoring the recently established population structure (Additional file 1: Figure S8).

Our MCM tests revealed 46 genes with evidence of positive selection in the Northeast Bornean branch and 33 genes in the North Sumatran branch (Additional file 1: Tables S6 and S7). To gain a better understanding of the functions of these genes, we generated interaction networks, where the nodes (genes) were colored based on functional classes according to the presence of specific keywords (Additional file 1: Table S9) in the GeneCards encyclopedia [51] (Fig. 4). The two resulting networks revealed that the functions of the candidate genes differed markedly between the two islands.

Most of the genes under positive selection in Northeast Bornean orangutans were involved either in muscle and cardiac activities, in reproductive pathways, or in the lipid metabolism (Fig. 4a). In line with this observation, gene ontology (GO) enrichment analysis identified 11 GO terms that were significantly enriched for the candidate genes (Table 1), including multiple GO terms involved in fatty acid metabolism and muscle activities, one GO term involved in endocardial cell development, and another in the regulation of fertilization.

Table 1 Significantly enriched gene ontology (GO) terms based on MCM

Full size table

In North Sumatran orangutans, a striking number of positively selected genes were associated with brain and nervous system development as well as energy metabolism pathways (Fig. 4b). Gene ontology enrichment analyses of the these genes (Table 1) revealed a significant enrichment of GO terms related to ATP metabolism, cerebral cortex development, and two GO terms involved in brain-derived neurotrophic factor binding, all of which mediate higher cognitive abilities [52].

Selection during more recent time scales

To uncover signatures of more recent or ongoing local adaptations, we applied two complementary methods for detecting selective sweeps on the population level (Additional file 1: SI Section 9): (1) haplotype-based statistics (iHS) in windowed whole-genome scans [37], and (2) composite likelihood ratio (CLR) test implemented in SweeD [35]. We split our dataset to account for current population structure [33], subdividing Sumatran samples into the two clusters Northeast Alas (LK/NA) and West Alas (WA), and the Bornean samples into the clusters Northeast Borneo (SK/NK and EK) and Central/West Kalimantan (CK/WK). In order to assess thresholds of significance and minimize the occurrence of false positives due to confounding effects of demography [42], we performed a set of neutral simulations based on a complex demographic model estimated through ABC modeling [33] (Additional file 1: SI Section 9.4). We then used GO enrichment analyses to unravel differences in biological functions of genes found to be under positive selection.

(1)
Haplotype-based selection test (iHS)

We identified 961 annotated protein-coding genes within genomic candidate regions of positive selection (North Sumatran orangutans: n = 605, Bornean orangutans: n = 410, both: n = 54; Additional file 1: Table S11). Detailed information on all candidate genes, including summaries of their potential function and disease associations in humans and other animals, are provided in Additional file 2: Table S17. In Bornean orangutans, many candidate genes were associated with lipid and glucose metabolism as well as insulin and cholesterol regulation (Additional file 2: Table S18). After correction for multiple testing, GO enrichment analysis revealed one significantly enriched GO term within the Northeast Bornean orangutans which was related to immune system-related cell signaling (Table 2, Additional file 2: Tables S20 and S21).

Table 2 Significantly enriched gene ontology (GO) terms based on iHS

Full size table

In North Sumatran orangutans, a striking number of positively selected genes were associated with crucial functions in learning and memory (Additional file 2: Table S19). In line with this, GO analyses showed a significant enrichment of terms related to neurological processes, synaptic transmission, learning, adult behavior, and cerebral cortex development (Table 1, Additional file 2: Tables S22 and S23).

(2)
Composite likelihood ratio test (SweeD)

We identified 1027 protein-coding genes within genomic candidate regions of positive selection (North Sumatran orangutans: n = 658, Bornean orangutans: n = 489, both: n = 120; Additional file 1: Table S12). After correction for multiple testing, GO enrichment analysis revealed eight significantly enriched GO terms within the Sumatran West Alas population (Table 3). No other orangutan population showed significantly enriched GO terms after correction for multiple testing (Additional file 2: Tables S23–27). Among the significantly enriched GO terms in the West Alas population were such involved in glutamate receptor signaling, regulation of postsynaptic membrane potential, and cell projection organization. Glutamate receptors mediate postsynaptic excitation of neuronal cells and are important for neural communication, memory, and learning [53].

Table 3 Significantly enriched gene ontology (GO) terms based on SweeD

Full size table

Discussion

Our demographic analyses highlighted the deep divergence of orangutans inhabiting the Southeast Asian islands of Sumatra and Borneo. Despite land bridges occurring between these islands as recently as ~ 10 ka ago [19], we found no evidence of substantial gene flow more recently than 500 ka in our gene flow analyses using either haploid male X chromosomes or entire phased autosomal genomes. This finding is also supported by deep divergence of island-specific mitochondrial haplotypes, even though Y-chromosomal data indicate the presence of low levels of gene flow during the Middle Pleistocene [33].

In contrast, fitting a detailed demographic model to our genome-wide SNP data set, we found evidence for more recent admixture at ~ 44 ka. This recent admixture event falls into the last glacial period (115–12 ka), when sea levels were up to 120 m below the current sea level and the exposed Sunda shelf opened up land bridges in between the islands [54]. However, this recent admixture appears to have affected only small fractions of individual genomes, which might explain why we did not detect it as a drop in cross-population N_e in the MSMC2 analysis.

A more pervasive genetic exchange over the exposed Sunda shelf might have been prevented by the increasingly drier and seasonal climate of Late Pleistocene glacial periods [24]. A savannah corridor [55], large river systems dissecting the exposed shelf [19], or most likely both might have imposed impassable dispersal barriers for orangutans. This early separation of gene pools had put Sumatran and Bornean orangutans on largely independent evolutionary trajectories, which is apparent from their divergent demographic histories. Most strikingly, while Sumatran orangutans appear to have expanded considerably in N_e during the Middle Pleistocene, Bornean orangutans experienced a pronounced decline in N_e following their isolation from the ancestral population.

The Early–Middle Pleistocene transition 1.2–0.5 Ma was characterized by a fundamental change in the Earth’s climate [56], resulting in harsher glacial periods, affecting vegetation patterns in the tropics [57]. In Sundaland, the drier and more seasonal climate during glacial periods seems to have severely impacted Bornean orangutans, probably through contraction of rainforest coverage [24, 58]. Furthermore, El Niño events have likely caused prolonged lean periods in the eastern parts of Borneo [28, 29]. On Sumatra, however, substantial areas are thought to have remained forested during glacial periods, probably facilitated by sustained high levels of precipitation due to convergent rainfall along the Barisan Mountain range [58], which runs along the western side of the entire island. Moreover, the younger geological age of Sumatra compared to Borneo as well as the mineral-rich volcanic soil resulted in higher rainforest productivity [21, 22], potentially alleviating negative impacts of glacial cycles on Sumatran orangutans.

The decline of Bornean orangutans culminated in a very small N_e during the last glacial period. The combination of small N_e within populations and high levels of genetic exchange among populations is suggestive of a single refugium on Borneo, possibly linked to a severe contraction of rain forests during the last glacial period [23, 24]. The finding of strong gene flow between Bornean populations until less than 50 ka suggests that the genetic differentiation underlying the phenotypic differences between orangutan populations from central and northeast Borneo have emerged only recently.

In contrast to the long-term negative population dynamics on Borneo, Sumatran orangutans retained a large N_e until 70–80 ka, when they experienced a severe reduction in N_e. It is notable that this sudden signal coincides with the Toba supereruption ~ 73 ka [59]. The effects of this eruption, however, were probably limited in their geographic extent and likely of relatively short duration [60], as the mineral-rich deposits of Toba provided fertile ground for recovery of the highly productive Sumatran rainforests [22]. Nevertheless, as on Borneo, N_e did not recover in Sumatran orangutans. It is possible that this reflects hunting by early human colonizers, who expanded rapidly into insular Southeast Asia in the Late Pleistocene [61]; however, other environmental factors may also have been responsible.

The divergent demographic trajectories of orangutans on Borneo and Sumatra, together with persistent environmental differences until the present [22], may have had an important impact on adaptive evolution of this genus, and hence on the biology of these two species [32]. Across a range of selection tests, we found a consistent pattern of between-island (and thus between-species) differences.

In line with environmental constraints on Borneo and frequent episodes of extreme food scarcity [22], selection in Bornean orangutans appears to have manifested itself predominantly in physiological adaptations to the harsher environmental conditions on the island. P. p. morio showed an enrichment of genes under positive selection involved in cardiac activity, whose remodeling is known to enhance efficient usage of restricted energy resources [62]. Adaptive changes in cardiovascular activities have also been shown for others in extreme environments such as for humans in oxygen-deprived high altitudes (reviewed in [63]), polar bears in cold waters [64], and lizards facing seasonal starvation [65]. We also found an enrichment of genes under positive selection involved in lipid metabolism and energy storage. Such metabolic changes might allow for physiological buffering against starvation [25, 28, 66, 67]. This idea is supported by studies of physiology in wild orangutans and observations from captive animals, indicating that Bornean orangutans are better at storing fat in adipose depots than Sumatran orangutans [26, 28, 68]. It is worthwhile to note that in human populations inhabiting tropical rainforest and thus facing limitation and instability in their food supply, genes involved in lipid metabolism and muscle function were also found to be under selection [69].

In contrast to Bornean orangutans, energy budgets of their North Sumatran counterparts are more favorable due to generally higher and more stable food supplies [22]. In agreement with this, and the documented differences in phenotypic traits, we identified a very different set of candidate genes under positive selection, which were mostly linked to the brain and nervous system development, learning and memory processes, and glucose metabolism. It is conceivable that these selective changes have contributed to the development of extended behavioral plasticity via individual and social learning in North Sumatran orangutans, who surpass their Bornean counterparts in the size and complexity of cultural repertoire [32, 70, 71] and the ability to solve cognitive problems [72].

Among the most striking examples of the broad phenotypic variation in the genus Pongo are the higher sociability and social tolerance of the North Sumatran orangutans [73,74,75,76,77]. Current population densities are higher on northern Sumatra than on Borneo [20], a situation which may have existed throughout most of the Pleistocene, given the higher estimates of historical N_e in P. abelii. Such higher densities may have provided more opportunities for social learning in North Sumatran orangutans, which in conjunction with selection on genes underlying cognitive abilities and prosocial behavior then facilitated the emergence and persistence of larger cultural repertoires compared to their Bornean counterparts. While not empirically evident, our results nevertheless suggest an intriguing link between the demographic history during the Pleistocene glaciations and the cognitive and cultural evolution in orangutans.

Methods

Sampling and data generation

Our sample set included whole-genome data from 35 orangutans, representing the entire geographic range of Bornean and North Sumatran orangutans (Additional file 1: Figure S1, Additional file 1: Table S1). All individuals were wild-born, except for five orangutans which were first-generation offspring of wild-born parents of the same species (Additional file 1: Table S2). We obtained whole-genome sequencing data for the study individuals from three different sources (Additional file 1: Tables S1 and S2): (i) Nater et al. [33] (n = 16, effective quality-filtered sequencing coverage: 13.7–31.1x), (ii) Prado-Martinez et al. [78] (n = 10, 20.5–27.4x), and (iii) Locke et al. [79] (n = 9, 4.8–12.2x).

Read mapping, variant calling, and phasing

We followed identical bioinformatics procedures for all 35 study individuals. Read mapping and variant calling is described in Nater et al. [33] and briefly summarized in Additional file 1: SI Section 2.

Haplotype phasing

We phased genotype data from P. abelii and P. pygmaeus employing SHAPEIT v2.0 [80]. We used a stringent high-quality subset of genotype data from the original SNP-calling dataset for each species, for which we extracted only biallelic and polymorphic SNPs without missing genotype data. We ran the algorithm at chromosome level to generate a haplotype graph, which we used to assess phasing uncertainty and to extract the most likely haplotypes per individual.

Neighbor-net

We visualized the genetic relationships among the whole-genome sequences by building a neighbor-net using SplitsTree v4.14.6 [81]. To obtain the underlying distance matrix, we calculated pairwise identity-by-state proportions with PLINK v1.90 [82] using only biallelic SNPs with no missing genotypes across all 35 individuals.

PSMC and MSMC

We inferred orangutan population size history with the pairwise sequentially Markovian coalescent (PSMC) model [43]. PSMC is implemented as a hidden Markov model and allows inferring historical changes in N_e using the distribution of pairwise coalescent times within a single diploid genome. We applied the PSMC model to each sample, grouping individuals by sequencing coverage to avoid coverage-related biases (Fig. 2a, Additional file 1: Figures S1-S3). We ran PSMC with the parameter settings that were found to be suitable for great apes and have been applied to orangutans previously [43, 78]. We scaled the resulting plots using an autosomal mutation rate of 1.5 × 10⁻⁸ mutations per base pair per generation and a generation time of 25 years [83].

We inferred ancestral gene flow between islands and between populations within islands using the multiple sequentially Markovian coalescent (MSMC2) model [44]. Whereas PSMC uses information from inter-chromosomal genetic differences within a single diploid genome to infer ancestral N_e, MSMC2 extends this approach to multiple haplotypes from different populations. Since the model requires phased data to pair haploid genome sequences across populations, high phasing accuracy is paramount. To avoid issues with phasing uncertainty, we focused solely on male X chromosomes. We scaled the results using an X-chromosomal mutation rate of 1.17 × 10⁻⁸ mutations per base pair per generation and a generation time of 25 years [83].

Codon models

To identify protein-coding genes under positive selection, we used likelihood ratio tests (LRTs) based on pairs of nested Markov models of codon substitutions as implemented in PAML v1.3.1 [84]. Selection on the protein level was measured by the ω-ratio of non-synonymous to synonymous substitution rates [85], where higher ω-ratio indicates recurrent positive selection.

We used a single consensus FASTA genome sequence with randomly resolved haplotype phase for each individual. We identified the genomic positions of exons by retrieving a list of all unique protein-coding exons with Ensembl transcript IDs for the orangutan reference genome ponAbe2 from the UCSC Table Browser [86], only considering exons that (i) were at least 60 nucleotides long, (ii) had a number of nucleotides with multiples of 3, and (iii) contained at least one non-synonymous substitution in one of the individuals. As outgroups, we used human (hg18), chimpanzee (panTro4), and rhesus macaque (rheMac3) reference genomes from the UCSC Genome Browser [87]. Multiple sequence alignments for each exon were constructed using the multiple alignment format files from the UCSC Genome Browser, applying AlignIO and MafIO tools of Biopython v1.60+ [88] with default parameters.

Gene trees were inferred separately for each protein-coding exon based on codon model M0 with one ω-ratio per gene as implemented in CodonPhyML v1.0. [89] using the initial species topology in Additional file 1: Figure S14. We applied a test for variability of ω among sites (comparing models M0 vs. M3). For exons with significant variation of ω in their sequences, we fitted the nested site models (M8 vs. M8a) to identify exons with evidence for positive selection. For this subset of exons, we then performed a LRT for positive selection based on branch-site models, where we compared the modified model A to the corresponding null model with ω fixed to 1 at the “foreground” branches. Here, we defined either the North Sumatran or the Northeast Bornean (all P. pygmaeus morio individuals) lineages as foreground.

Genome scans

To uncover signatures of recent or ongoing selective sweeps, we applied two complementary tests of selection in genome-wide windows: (1) a haplotype-based selection test to identify extended linkage disequilibrium patterns surrounding recently selected alleles (iHS) [36, 37] and (2) a composite likelihood ratio (CLR) test to detect shifts in local site-frequency spectra relative to the genomic background. We used the R package rEHH [90] with default parameters to obtain absolute iHS scores for each variant position in each population and averaged iHS scores in sliding 25-kb windows with 12.5 kb step size. To calculate the CLR test statistic, we used the program SweeD v3.3.2 [35], applying a spacing of grid points of 12.5 kb and using the unfolded site-frequency spectrum. To prepare the input files, we used the phased genotype data described above and additionally determined the ancestral state for each SNP by comparing the alleles at each variant position with human and chimpanzee outgroup genotypes. We excluded sites that were not biallelic, for which the ancestral state could not be determined, or which were monomorphic at the population level. To identify significant departure from patterns expected under neutrality, we ran coalescent simulations under a detailed demographic model estimated earlier [33], generating 10,000 replicate sequences of 100 kb for comparison with the empirical data.

Gene ontology enrichment analysis

For the MCM analyses, we retrieved gene ontology (GO) terms associated with the positively selected orangutan genes using the BioMart data mining tool from Ensembl v82 [91]. We identified significantly enriched GO terms by computing the χ² statistic for each GO term in a given set of positively selected genes per analysis against the set of all genes considered in that analysis, applying FDR multiple testing correction [92]. For the iHS and SweeD analyses, we performed GO enrichment analyses using the R package “gProfileR” of the g:Profiler toolkit [93, 94]. Significance was assessed by comparing the candidate genes with a background list of all possible genes, i.e., all protein-coding genes (n = 12,866) located within any window with sufficient coverage for calculation of the iHS and CLR statistics. We applied the Benjamini-Hochberg method [92] for computing multiple testing correction for P values gained from GO enrichment analysis.

Detailed methods and materials are provided in Additional file 1: Supplementary Information.

Conclusions

We describe the interplay between environmental processes, demography, and adaptive evolution in a non-human great ape, the orangutan. We found that in the relatively stable environment of Sumatra, selection acted primarily on genes linked to brain development and learning, consistent with well-documented cognitive and cultural differences between species. On Borneo, however, selection manifested itself in physiological adaptations to harsher and more fluctuating environmental conditions. These findings may have important implications for the study of hominid evolution, as they suggest a link between cognitive and cultural evolution in great apes and the presence of high habitat productivity and demographic stability. Thus, our study provides a framework from which to develop and test more complex hypotheses about adaptive evolution in non-human great apes and to explore differences in adaptive evolution between our own species and our closest relatives.

References

Reznick DN, Ghalambor CK. The population ecology of contemporary adaptations: what empirical studies reveal about the conditions that promote adaptive evolution. Genetica. 2001;112:183–98.
Article PubMed Google Scholar
Armitage SJ, Jasim SA, Marks AE, Parker AG, Usik VI, Uerpmann HP. The southern route “out of Africa”: evidence for an early expansion of modern humans into Arabia. Science. 2011;331(6016):453–6.
Article CAS PubMed Google Scholar
Carto SL, Weaver AJ, Hetherington R, Lam Y, Wiebe EC. Out of Africa and into an ice age: on the role of global climate change in the late Pleistocene migration of early modern humans out of Africa. J Hum Evol. 2009;56(2):139–51.
Article PubMed Google Scholar
Scheinfeldt LB, Tishkoff SA. Recent human adaptation: genomic approaches, interpretation and insights. Nat Rev Genet. 2013;14(10):692–702.
Article PubMed PubMed Central Google Scholar
Mäkinen TM. Human cold exposure, adaptation, and performance in high latitude environments. Am J Hum Biol. 2007;19(2):155–64.
Article PubMed Google Scholar
Jablonski NG, Chaplin G. Human skin pigmentation as an adaptation to UV radiation. Proceedings Natl Acad Sci. 2010;107(Supplement 2):8962–8.
Article CAS Google Scholar
Deng L, Xu S. Adaptation of human skin color in various populations. Hereditas. 2017;155(1):1.
Article PubMed PubMed Central Google Scholar
Simonson TS, Yang Y, Huff CD, Yun H, Qin G, Witherspoon DJ, Bai Z, Lorenzo FR, Xing J, Jorde LB, et al. Genetic evidence for high-altitude adaptation in Tibet. Science. 2010;329(5987):72–5.
Article CAS PubMed Google Scholar
Yi X, Liang Y, Huerta-Sanchez E, Jin X, Cuo ZXP, Pool JE, Xu X, Jiang H, Vinckenbosch N, Korneliussen TS, et al. Sequencing of 50 human exomes reveals adaptation to high altitude. Science. 2010;329(5987):75–8.
Article CAS PubMed PubMed Central Google Scholar
Beall CM, Cavalleri GL, Deng L, Elston RC, Gao Y, Knight J, Li C, Li JC, Liang Y, McCormack M. Natural selection on EPAS1 (HIF2α) associated with low hemoglobin concentration in Tibetan highlanders. Proc Natl Acad Sci. 2010;107(25):11459–64.
Article CAS PubMed PubMed Central Google Scholar
Mondal M, Casals F, Xu T, Dall'Olio GM, Pybus M, Netea MG, Comas D, Laayouni H, Li Q, Majumder PP, et al. Genomic analysis of Andamanese provides insights into ancient human migration into Asia and adaptation. Nat Genet. 2016;48(9):1066–70.
Article CAS PubMed Google Scholar
Karlsson EK, Kwiatkowski DP, Sabeti PC. Natural selection and infectious disease in human populations. Nat Rev Genet. 2014;15(6):379–93.
Article CAS PubMed PubMed Central Google Scholar
Comas I, Coscolla M, Luo T, Borrell S, Holt KE, Kato-Maeda M, Parkhill J, Malla B, Berg S, Thwaites G, et al. Out-of-Africa migration and Neolithic coexpansion of Mycobacterium tuberculosis with modern humans. Nat Genet. 2013;45(10):1176–82.
Article CAS PubMed PubMed Central Google Scholar
Gagneux S. Host–pathogen coevolution in human tuberculosis. Philos Trans Royal Soc B Biol Sci. 2012;367(1590):850–9.
Article CAS Google Scholar
Scally A, Dutheil JY, Hillier LW, Jordan GE, Goodhead I, Herrero J, Hobolth A, Lappalainen T, Mailund T, Marques-Bonet T, et al. Insights into hominid evolution from the gorilla genome sequence. Nature. 2012;483(7388):169–75.
Article CAS PubMed PubMed Central Google Scholar
Xue YL, Prado-Martinez J, Sudmant PH, Narasimhan V, Ayub Q, Szpak M, Frandsen P, Chen Y, Yngvadottir B, Cooper DN, et al. Mountain gorilla genomes reveal the impact of long-term population decline and inbreeding. Science. 2015;348(6231):242–5.
Article CAS PubMed PubMed Central Google Scholar
McManus KF, Kelley JL, Song S, Veeramah KR, Woerner AE, Stevison LS, Ryder OA, Kidd JM, Wall JD, Bustamante CD, et al. Inference of gorilla demographic and selective history from whole-genome sequence data. Mol Biol Evol. 2015;32(3):600–12.
Article CAS PubMed PubMed Central Google Scholar
Cagan A, Theunert C, Laayouni H, Santpere G, Pybus M, Casals F, Prüfer K, Navarro A, Marques-Bonet T, Bertranpetit J. Natural selection in the great apes. Mol Biol Evol. 2016;33(12):3268–83.
Article CAS PubMed PubMed Central Google Scholar
Rijksen HD, Meijaard E. Our vanishing relative: the status of wild orang-utans at the close of the twentieth century: Springer Netherlands; 1999.
Google Scholar
Wich SA, Meijaard E, Marshall AJ, Husson S, Ancrenaz M, Lacy RC, van Schaik CP, Sugardjito J, Simorangkir T, Traylor-Holzer K, et al. Distribution and conservation status of the orang-utan (Pongo spp.) on Borneo and Sumatra: how many remain? Oryx. 2008;42(3):329–39.
Article Google Scholar
Whitten T, Damanik SJ, Anwar J, Hisyam N. The ecology of Sumatra, vol. 1. Hong Kong: Periplus Editions Ltd.; 2000.
Google Scholar
Wich SA, Vogel ER, Larsen MD, Fredriksson G, Leighton M, Yeager CP, Brearley FQ, van Schaik CP, Marshall AJ. Forest fruit production is higher on Sumatra than on Borneo. PLoS One. 2011;6(6):e21278.
Article CAS PubMed PubMed Central Google Scholar
Flenley JR. Tropical forests under the climates of the last 30,000 years. Clim Chang. 1998;39(2–3):177–97.
Article Google Scholar
Morley RJ. Origin and evolution of tropical rain forests. Chichester: Wiley; 2000.
Google Scholar
Morrogh-Bernard HC, Husson SJ, Knott CD, Wich SA, van Schaik CP, van Noordwijk MA, Lackman-Ancrenaz I, Marshall AJ, Kanamori T, Kuze N. Orangutan activity budgets and diet. In: Wich SA, Utami Atmoko SS, Mitra Setia T, van Schaik CP, editors. Orangutans geographic variation in behavioral ecology and conservation. New York: Oxford University Press; 2009. p. 119–34.
Chapter Google Scholar
Wich S, Utami-Atmoko S, Mitra Setia T, Djoyosudharmo S, Geurts M. Dietary and energetic responses of Pongo abelii to fruit availability fluctuations. Int J Primatol. 2006;27(6):1535–50.
Article Google Scholar
Kanamori T, Kuze N, Bernard H, Malim TP, Kohshima S. Feeding ecology of Bornean orangutans (Pongo pygmaeus morio) in Danum Valley, Sabah, Malaysia: a 3-year record including two mast fruitings. Am J Primatol. 2010;72(9):820–40.
Article PubMed Google Scholar
Knott CD. Changes in orangutan caloric intake, energy balance, and ketones in response to fluctuating fruit availability. Int J Primatol. 1998;19(6):1061–79.
Article Google Scholar
MacKinnon K, Hatta G, Halim H, Mangalik A, MacKinnon K, Hatta G, Halim H, Mangalik A. The island of Borneo. In: The ecology of Kalimantan. Singapore: Periplus Editions (HK) Ltd.; 1996. p. 9–68.
Google Scholar
Marshall AJ, Ancrenaz M, Brearley FQ, Fredriksson GM, Ghaffar N, Heydon M, Husson SJ, Leighton M, McConkey KR, Morrogh-Bernard HC, et al. The effects of forest phenology and floristics on populations of Bornean and Sumatran orangutans. In: Wich SA, Utami Atmoko SS, Mitra Setia T, van Schaik CP, editors. Orangutans geographic variation in behavioral ecology and conservation. New York: Oxford University Press; 2009. p. 311–26.
Chapter Google Scholar
Husson SJ, Wich SA, Marshall AJ, Dennis RD, Ancrenaz M, Brassey R, Gumal M, Hearn AJ, Meijaard E, Simorangkir T, et al. Orangutan distribution, density, abundance and impacts of disturbance. In: Wich SA, Utami Atmoko SS, Mitra Setia T, van Schaik CP, editors. Orangutans: geographic variation in behavioral ecology and conservation; 2009. p. 77–96.
Google Scholar
Wich S, Utami Atmoko S, Mitra Setia T, van Schaik C. Orangutans - geographic variation in behavioral ecology and conservation. New York: Oxford University Press; 2009.
Google Scholar
Nater A, Mattle-Greminger MP, Nurcahyo A, Nowak M, de Manuel M, Desai T, Groves C, Pybus M, Bilgin Sonay T, Roos C, et al. Morphometric, behavioral, and genomic evidence for a new orangutan species. Curr Biol. 2017;27(22):3487–98.
Article CAS PubMed Google Scholar
Nielsen R, Williamson S, Kim Y, Hubisz MJ, Clark AG, Bustamante C. Genomic scans for selective sweeps using SNP data. Genome Res. 2005;15(11):1566–75.
Article CAS PubMed PubMed Central Google Scholar
Pavlidis P, Zivkovic D, Stamatakis A, Alachiotis N. SweeD: likelihood-based detection of selective sweeps in thousands of genomes. Mol Biol Evol. 2013;30(9):2224–34.
Article CAS PubMed PubMed Central Google Scholar
Sabeti PC, Schaffner SF, Fry B, Lohmueller J, Varilly P, Shamovsky O, Palma A, Mikkelsen TS, Altshuler D, Lander ES. Positive natural selection in the human lineage. Science. 2006;312(5780):1614–20.
Article CAS PubMed Google Scholar
Voight BF, Kudaravalli S, Wen X, Pritchard JK. A map of recent positive selection in the human genome. PLoS Biol. 2006;4(3):e72.
Article PubMed PubMed Central Google Scholar
Anisimova M, Kosiol C. Investigating protein-coding sequence evolution with probabilistic codon substitution models. Mol Biol Evol. 2009;26(2):255–71.
Article CAS PubMed Google Scholar
Yang ZH, Nielsen R, Goldman N, Pedersen AMK. Codon-substitution models for heterogeneous selection pressure at amino acid sites. Genetics. 2000;155(1):431–49.
CAS PubMed PubMed Central Google Scholar
Jensen JD, Kim Y, DuMont VB, Aquadro CF, Bustamante CD. Distinguishing between selective sweeps and demography using DNA polymorphism data. Genetics. 2005;170(3):1401–10.
Article CAS PubMed PubMed Central Google Scholar
Crisci JL, Poh YP, Mahajan S, Jensen JD. The impact of equilibrium assumptions on tests of selection. Front Genet. 2013;4:235.
Article PubMed PubMed Central CAS Google Scholar
Freedman AH, Schweizer RM, Ortega-Del Vecchyo D, Han E, Davis BW, Gronau I, Silva PM, Galaverni M, Fan Z, Marx P, et al. Demographically-based evaluation of genomic regions under selection in domestic dogs. PLoS Genet. 2016;12(3):e1005851.
Article PubMed PubMed Central CAS Google Scholar
Li H, Durbin R. Inference of human population history from individual whole-genome sequences. Nature. 2011;475(7357):493–6.
Article CAS PubMed PubMed Central Google Scholar
Schiffels S, Durbin R. Inferring human population size and separation history from multiple genome sequences. Nat Genet. 2014;46(8):919–25.
Article CAS PubMed PubMed Central Google Scholar
Song S, Sliwerska E, Emery S, Kidd JM. Modeling human population separation history using physically phased genomes. Genetics. 2016;205(1):385–95.
Dutheil JY, Munch K, Nam K, Mailund T, Schierup MH. Strong selective sweeps on the X chromosome in the human-chimpanzee ancestor explain its low divergence. PLoS Genet. 2015;11(8):e1005451.
Article PubMed PubMed Central CAS Google Scholar
Nam K, Munch K, Hobolth A, Dutheil JY, Veeramah KR, Woerner AE, Hammer MF, Mailund T, Schierup MH. Extreme selective sweeps independently targeted the X chromosomes of the great apes. Proceed Natl Acad Sci. 2015;112(20):6413.
Article CAS Google Scholar
Kamm JA, Terhorst J, Durbin R, Song YS. Efficiently inferring the demographic history of many populations with allele count data. bioRxiv. 2018:287268.
Allendorf FW, Luikart G, Aitken SN. Conservation and the genetics of populations. 2nd ed. Hoboken: John Wiley & Sons; 2013.
Google Scholar
Zhang JZ, Nielsen R, Yang ZH. Evaluation of an improved branch-site likelihood method for detecting positive selection at the molecular level. Mol Biol Evol. 2005;22(12):2472–9.
Article CAS PubMed Google Scholar
Safran M, Dalah I, Alexander J, Rosen N, Stein TI, Shmoish M, Nativ N, Bahir I, Doniger T, Krug H. GeneCards version 3: the human gene integrator. Database. 2010;2010:baq020.
Article PubMed PubMed Central Google Scholar
Yamada K, Nabeshima T. Brain-derived neurotrophic factor/TrkB signaling in memory processes. J Pharmacol Sci. 2003;91(4):267–70.
Article CAS PubMed Google Scholar
Riedel G, Platt B, Micheau J. Glutamate receptor function in learning and memory. Behav Brain Res. 2003;140(1):1–47.
Article CAS PubMed Google Scholar
Voris HK. Maps of Pleistocene sea levels in Southeast Asia: shorelines, river systems and time durations. J Biogeogr. 2000;27(5):1153–67.
Article Google Scholar
Bird MI, Taylor D, Hunt C. Environments of insular Southeast Asia during the last glacial period: a savanna corridor in Sundaland? Quat Sci Rev. 2005;24(20–21):2228–42.
Article Google Scholar
Elderfield H, Ferretti P, Greaves M, Crowhurst S, McCave IN, Hodell D, Piotrowski AM. Evolution of ocean temperature and ice volume through the mid-Pleistocene climate transition. Science. 2012;337(6095):704–9.
Article CAS PubMed Google Scholar
Dupont LM, Bonner B, Schneider R, Wefer G. Mid-Pleistocene environmental change in tropical Africa began as early as 1.05 Ma. Geology. 2001;29(3):195–8.
Article CAS Google Scholar
Gathorne-Hardy F, Davies R, Eggleton P, Jones D. Quaternary rainforest refugia in south-east Asia: using termites (Isoptera) as indicators. Biol J Linn Soc. 2002;75(4):453–66.
Article Google Scholar
Chesner CA, Rose WI, Deino A, Drake R, Westgate JA. Eruptive history of Earths largest Quaternary caldera (Toba, Indonesia) clarified. Geology. 1991;19(3):200–3.
Article Google Scholar
Gathorne-Hardy FJ, Harcourt-Smith WEH. The super-eruption of Toba, did it cause a human bottleneck? J Hum Evol. 2003;45(3):227–30.
Article CAS PubMed Google Scholar
Mellars P, Gori KC, Carr M, Soares PA, Richards MB. Genetic and archaeological perspectives on the initial modern human colonization of southern Asia. Proc Natl Acad Sci U S A. 2013;110(26):10699–704.
Article CAS PubMed PubMed Central Google Scholar
Han X, Cheng H, Mancuso DJ, Gross RW. Caloric restriction results in phospholipid depletion, membrane remodeling, and triacylglycerol accumulation in murine myocardium. Biochemistry. 2004;43(49):15584–94.
Article CAS PubMed Google Scholar
Moore LG. Human genetic adaptation to high altitude. High Alt Med Biol. 2001;2(2):257–79.
Article CAS PubMed Google Scholar
Liu SP, Lorenzen ED, Fumagalli M, Li B, Harris K, Xiong ZJ, Zhou L, Korneliussen TS, Somel M, Babbitt C, et al. Population genomics reveal recent speciation and rapid evolutionary adaptation in polar bears. Cell. 2014;157(4):785–94.
Article CAS PubMed PubMed Central Google Scholar
da Silveira LC, do Nascimento LFR, Colquhoun A, Abe AS, de Souza SCR. Cardiac hypertrophy and structural and metabolic remodeling related to seasonal dormancy in the first annual cycle in tegu lizards. Comp Biochem Physiol A Mol Integr Physiol. 2013;165(3):371–80.
Article PubMed CAS Google Scholar
van Schaik CP, Marshall AJ, Wich SA. Geographic variation in orangutan behavior and biology. In: Wich SA, Utami Atmoko SS, Mitra Setia T, van Schaik CP, editors. Orangutans - geographic variation in behavioral ecology and conservation. New York: Oxford University Press; 2009.
Isler K. Adipose tissue in evolution. In: Fantuzzi G, Braunschweig C, editors. Adipose tissue and adipokines in health and disease. New York: Humana Press; 2014. p. 3–13.
Chapter Google Scholar
Dierenfeld E. Orangutan nutrition. In: Orangutan SSP husbandry manual. Brookfield: Orangutan SSP and Brookfield Zoo; 1997.
Google Scholar
Amorim CEG, Daub JT, Salzano FM, Foll M, Excoffier L. Detection of convergent genome-wide signals of adaptation to tropical forests in humans. PLoS One. 2015;10(4):e0121557.
Article PubMed PubMed Central CAS Google Scholar
Krützen M, Willems EP, van Schaik CP. Culture and geographic variation in orangutan behavior. Curr Biol. 2011;21(21):1808–12.
Article PubMed CAS Google Scholar
van Schaik CP, Ancrenaz M, Borgen G, Galdikas B, Knott CD, Singleton I, Suzuki A, Utami SS, Merrill M. Orangutan cultures and the evolution of material culture. Science. 2003;299(5603):102–5.
Article CAS PubMed Google Scholar
Forss SIF, Willems E, Call J, van Schaik CP. Cognitive differences between orang-utan species: a test of the cultural intelligence hypothesis. Sci Rep. 2016;6:30516.
Article CAS PubMed PubMed Central Google Scholar
Mitra Setia T, Delgado R, Utami Atmoko S, Singleton I, Van Schaik C. Social organization and male-female relationships. In: Wich SA, Utami Atmoko SS, Mitra Setia T, van Schaik CP, editors. Orangutans, geographic variations in behavioral ecology and conservation; 2009. p. 245–53.
Google Scholar
van Schaik CP. The socioecology of fission-fusion sociality in orangutans. Primates. 1999;40(1):69–86.
Article CAS PubMed Google Scholar
van Schaik CP. Among orangutans : red apes and the rise of human culture. Cambridge: Belknap of Harvard University Press; 2004.
Google Scholar
Knott C, Beaudrot L, Snaith T, White S, Tschauner H, Planansky G. Female-female competition in Bornean orangutans. Int J Primatol. 2008;29(4):975–97.
Article Google Scholar
Weingrill T, Willems EP, Zimmermann N, Steinmetz H, Heistermann M. Species-specific patterns in fecal glucocorticoid and androgen levels in zoo-living orangutans (Pongo spp.). Gen Comp Endocrinol. 2011;172(3):446–57.
Article CAS PubMed Google Scholar
Prado-Martinez J, Sudmant PH, Kidd JM, Li H, Kelley JL, Lorente-Galdos B, Veeramah KR, Woerner AE, O/’Connor TD, Santpere G, et al. Great ape genetic diversity and population history. Nature. 2013;499(7459):471–5.
Article CAS PubMed PubMed Central Google Scholar
Locke DP, Hillier LW, Warren WC, Worley KC, Nazareth LV, Muzny DM, Yang S-P, Wang Z, Chinwalla AT, Minx P, et al. Comparative and demographic analysis of orang-utan genomes. Nature. 2011;469(7331):529–33.
Article CAS PubMed PubMed Central Google Scholar
Delaneau O, Marchini J, Zagury JF. A linear complexity phasing method for thousands of genomes. Nat Methods. 2012;9(2):179–81.
Article CAS Google Scholar
Huson DH, Bryant D. Application of Phylogenetic Networks in Evolutionary Studies, Molecular Biology and Evolution. 2006;23(2):254–67. https://doi.org/10.1093/molbev/msj030.
Article CAS PubMed Google Scholar
Chang CC, Chow CC, Tellier LC, Vattikuti S, Purcell SM, Lee JJ. Second-generation PLINK: rising to the challenge of larger and richer datasets. Gigascience. 2015;4(1):7.
Article PubMed PubMed Central CAS Google Scholar
Wich S, De Vries H, Ancrenaz M, Perkins L, Shumaker R, Suzuki A, Van Schaik C. Orangutan life history variation. In: Wich SA, Utami Atmoko SS, Mitra Setia T, van Schaik CP, editors. Orangutans - geographic variation in behavioral ecology and conservation. New York: Oxford University Press; 2009. p. 65–75.
Chapter Google Scholar
Yang Z. PAML 4: phylogenetic analysis by maximum likelihood. Mol Biol Evol. 2007;24(8):1586–91.
Article CAS PubMed Google Scholar
Kimura M. Preponderance of synonymous changes as evidence for the neutral theory of molecular evolution. Nature. 1977;267(5608):275–6.
Article CAS PubMed Google Scholar
Karolchik D, Baertsch R, Diekhans M, Furey TS, Hinrichs A, Lu YT, Roskin KM, Schwartz M, Sugnet CW, Thomas DJ, et al. The UCSC genome browser database. Nucleic Acids Res. 2003;31(1):51–4.
Article CAS PubMed PubMed Central Google Scholar
Kent WJ, Sugnet CW, Furey TS, Roskin KM, Pringle TH, Zahler AM, Haussler D. The human genome browser at UCSC. Genome Res. 2002;12(6):996–1006.
Article CAS PubMed PubMed Central Google Scholar
Cock PJA, Antao T, Chang JT, Chapman BA, Cox CJ, Dalke A, Friedberg I, Hamelryck T, Kauff F, Wilczynski B, et al. Biopython: freely available Python tools for computational molecular biology and bioinformatics. Bioinformatics. 2009;25(11):1422–3.
Article CAS PubMed PubMed Central Google Scholar
Gil M, Zanetti MS, Zoller S, Anisimova M. CodonPhyML: fast maximum likelihood phylogeny estimation under codon substitution models. Mol Biol Evol. 2013;30(6):1270–80.
Article CAS PubMed PubMed Central Google Scholar
Gautier M, Vitalis R. rehh: an R package to detect footprints of selection in genome-wide SNP data from haplotype structure. Bioinformatics. 2012;28(8):1176–7.
Article CAS PubMed Google Scholar
Kinsella RJ, Kahari A, Haider S, Zamora J, Proctor G, Spudich G, Almeida-King J, Staines D, Derwent P, Kerhornou A, et al. Ensembl BioMarts: a hub for data retrieval across taxonomic space. Database J Biol Databases Curation. 2011.
Benjamini Y, Hochberg Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J Royal Stat Soc Ser B (Methodological). 1995. p. 289–300.
Reimand J, Kull M, Peterson H, Hansen J, Vilo J. g: Profiler—a web-based toolset for functional profiling of gene lists from large-scale experiments. Nucleic Acids Res. 2007;35(suppl 2):W193–200.
Article PubMed PubMed Central Google Scholar
Reimand J, Arak T, Vilo J. g: Profiler—a web server for functional interpretation of gene lists (2011 update). Nucleic Acids Res. 2011;39(suppl 2):W307–15.
Article CAS PubMed PubMed Central Google Scholar
Nater A. Custom scripts used in “Genomes reveal marked differences in the adaptive evolution between orangutan species”. Github. https://github.com/alexnater/GB_D-18-00117.
Nater A. Custom scripts used in “Genomes reveal marked differences in the adaptive evolution between orangutan species”. Zenodo. https://doi.org/10.5281/zenodo.1435045.
Shannon P, Markiel A, Ozier O, Baliga NS, Wang JT, Ramage D, Amin N, Schwikowski B, Ideker T. Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res. 2003;13(11):2498–504.
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

We thank the following institutions for supporting our research: Sabah Wildlife Department, Indonesian State Ministry for Research and Technology, Indonesian Institute of Sciences, Leuser International Foundation, Taman National Gunung Leuser, and Borneo Orangutan Survival Foundation. Mario Barbato, Anna K. Lindholm, and Serge Wich provided comments on analyses and earlier drafts of this manuscript.

Funding

This work was financially supported by University of Zurich Forschungskredit grants FK-10 (MPMG), FK-15-103 (AN), and FK-14-094 (TBS), Swiss National Science Foundation grants 3100A-116848 (MK, CvS) and P300PA_177852 (AN), Leakey Foundation (MPMG), UZH Research Priority Program “Evolution in Action” (MK), MINECO/FEDER UE BFU2017-86471-P (TMB), U01 MH106874 (TMB), Howard Hughes International Early Career (TMB), Obra Social “La Caixa” and Secretaria d’Universitats i Recerca and CERCA Programme del Departament d’Economia i Coneixement de la Generalitat de Catalunya (TMB), Julius–Klaus Foundation (MK), Ministerio de Economía y Competitividad grant BFU2013-43726-P (JB, MP), Gates Cambridge Trust (TD), A.H. Schultz Foundation grants (MK, MPMG), and the Department of Anthropology at the University of Zurich.

Availability of data and materials

The datasets supporting the conclusions of this article are included within the article and its additional files. Raw whole-genome sequencing data was obtained from three different sources, available in Sequence Read Archive under the following accession IDs: PRJNA189439 [78], PRJNA74653, and PRJNA20869 [79], as well as in European Nucleotide Archive: PRJEB19688 [33]. Custom analysis scripts are available from GitHub https://github.com/alexnater/GB_D-18-00117 [95] or from the zenodo research data repository https://doi.org/10.5281/zenodo.1435045 [96].

Author information

Maja P. Mattle-Greminger, Tugce Bilgin Sonay and Alexander Nater contributed equally to this work.

Authors and Affiliations

Evolutionary Genetics Group, Department of Anthropology, University of Zurich, Winterthurerstrasse 190, 8057, Zürich, Switzerland
Maja P. Mattle-Greminger, Tugce Bilgin Sonay, Alexander Nater, Carel P. van Schaik & Michael Krützen
Department of Evolutionary Biology and Environmental Studies, University of Zurich, Winterthurerstrasse 190, 8057, Zürich, Switzerland
Maja P. Mattle-Greminger & Alexander Nater
Swiss Institute of Bioinformatics, Quartier Sorge - Batiment Genopode, 1015, Lausanne, Switzerland
Tugce Bilgin Sonay & Maria Anisimova
Lehrstuhl für Zoologie und Evolutionsbiologie, Department of Biology, University of Konstanz, Universitätsstrasse 10, 78457, Konstanz, Germany
Alexander Nater
Institut de Biologia Evolutiva (UPF-CSIC), Universitat Pompeu Fabra, Doctor Aiguader 88, 08003, Barcelona, Spain
Marc Pybus, Guillem de Valles & Jaume Bertranpetit
Department of Genetics, University of Cambridge, Downing Street, Cambridge, CB2 3EH, UK
Tariq Desai & Aylwyn Scally
Servei de Genòmica, Universitat Pompeu Fabra, Doctor Aiguader 88, 08003, Barcelona, Spain
Ferran Casals
Institute of Evolutionary Biology (UPF-CSIC), Universitat Pompeu Fabra, Doctor Aiguader 88, Barcelona, Spain
Tomas Marques-Bonet
Catalan Institution of Research and Advanced Studies (ICREA), Passeig de Lluís Companys 23, Barcelona, Spain
Tomas Marques-Bonet
CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Baldiri i Reixac 4, Barcelona, Spain
Tomas Marques-Bonet
Institut Català de Paleontologia Miquel Crusafont, Universitat Autònoma de Barcelona, Edifici ICTA-ICP, c/ Columnes s/n, Cerdanyola del Vallès, Barcelona, Spain
Tomas Marques-Bonet
Institute of Applied Simulations, School of Life Sciences and Facility Management, Zurich University of Applied Sciences ZHAW, Einsiedlerstrasse 31a, 8820, Wädenswil, Switzerland
Maria Anisimova

Authors

Maja P. Mattle-Greminger
View author publications
You can also search for this author in PubMed Google Scholar
Tugce Bilgin Sonay
View author publications
You can also search for this author in PubMed Google Scholar
Alexander Nater
View author publications
You can also search for this author in PubMed Google Scholar
Marc Pybus
View author publications
You can also search for this author in PubMed Google Scholar
Tariq Desai
View author publications
You can also search for this author in PubMed Google Scholar
Guillem de Valles
View author publications
You can also search for this author in PubMed Google Scholar
Ferran Casals
View author publications
You can also search for this author in PubMed Google Scholar
Aylwyn Scally
View author publications
You can also search for this author in PubMed Google Scholar
Jaume Bertranpetit
View author publications
You can also search for this author in PubMed Google Scholar
Tomas Marques-Bonet
View author publications
You can also search for this author in PubMed Google Scholar
Carel P. van Schaik
View author publications
You can also search for this author in PubMed Google Scholar
Maria Anisimova
View author publications
You can also search for this author in PubMed Google Scholar
Michael Krützen
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

MPMG, AN, TBS, and MK conceived the study and wrote the paper. MA also conceived the study. AS, JB, MA, MP, FC, TMB, and CPvS provided comments on the manuscript. MPMG, AN, TBS, MP, AS, JB, TD, GdV, FC, MA, and TMB carried out statistical analyses and contributed to the supplementary information. All authors read and approved the final manuscript.

Corresponding authors

Correspondence to Maja P. Mattle-Greminger or Michael Krützen.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Additional files

Additional file 1:

Supplementary Information. (PDF 1813 kb)

Additional file 2:

Table S13. List of SNPs completely fixed for different alleles in Bornean and North Sumatran orangutans. Table S14. Functional consequences of SNPs completely fixed between North Sumatran and Bornean orangutans. Table S15. Results of the GO enrichment analysis of fixed SNPs. Table S16. Detailed information on all genes containing fixed non-synonymous SNPs between North Sumatran and Bornean orangutans. Table S17. Detailed information on all candidate genes of the iHS analysis. Table S18. Selection of candidate genes of the iHS analysis in Bornean orangutans. Table S19. Selection of candidate genes of the iHS analysis in North Sumatran orangutans. Table S20. Results of GO enrichment analysis of SK/NK/EK for the iHS selection test. Table S21. Results of GO enrichment analysis of CK/WK for the iHS selection test. Table S22. Results of GO enrichment analysis of LK/NA for the iHS selection test. Table S23. Results of GO enrichment analysis of WA for the iHS selection test. Table S24. Results of GO enrichment analysis of CK/WK for the SweeD selection test. Table S25. Results of GO enrichment analysis of SK/NK/EK for the SweeD selection test. Table S26. Results of GO enrichment analysis of LK/NA for the SweeD selection test. Table S27. Results of GO enrichment analysis of WA for the SweeD selection test. (XLSX 13976 kb)

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Cite this article

Mattle-Greminger, M.P., Bilgin Sonay, T., Nater, A. et al. Genomes reveal marked differences in the adaptive evolution between orangutan species. Genome Biol 19, 193 (2018). https://doi.org/10.1186/s13059-018-1562-6

Download citation

Received: 12 January 2018
Accepted: 09 October 2018
Published: 15 November 2018
DOI: https://doi.org/10.1186/s13059-018-1562-6

Genomes reveal marked differences in the adaptive evolution between orangutan species

Abstract

Background

Results

Conclusions

Similar content being viewed by others

Population Genomics in the Great Apes

Comparative genomic analyses provide new insights into evolutionary history and conservation genomics of gorillas

Genomic basis of insularity and ecological divergence in barn owls (Tyto alba) of the Canary Islands

Background

Results

Demographic history of the genus Pongo

Selection during more distant time scales

Selection during more recent time scales

Discussion

Methods

Sampling and data generation

Read mapping, variant calling, and phasing

Haplotype phasing

Neighbor-net

PSMC and MSMC

Codon models

Genome scans

Gene ontology enrichment analysis

Conclusions

References

Acknowledgements

Funding

Availability of data and materials

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Ethics approval and consent to participate

Consent for publication

Competing interests

Publisher’s Note

Additional files

Additional file 1:

Additional file 2:

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation