Microbial assembly, interaction, functioning, activity and diversification: a review derived from community compositional data
- 519 Downloads
Microorganisms play crucial roles in maintaining ecosystem stability. The last two decades have witnessed an upsurge in studies on marine microbial community composition using high-throughput sequencing methods. Extensive mining of the compositional data has provided exciting new insights into marine microbial ecology from a number of perspectives. Both deterministic and stochastic processes contribute to microbial community assembly but their relative importance in structuring subcommunities, that are categorized by traits such as abundance, functional type and activity, differs. Through correlation-based network analysis, significant progress has been made in unraveling microbial co-occurrence patterns and dynamics in response to environmental changes. Prediction of ecosystem functioning, based on microbial data, is receiving increasing attention, as closely related microbes often share similar ecological traits and microbial diversity often exhibits significant correlations to ecosystem functioning. The ecosystem functioning is likely executed not by the whole community, but rather by an active fraction of a community, which can be inferred from the marker gene transcription level of community members. Furthermore, the huge amount of microbial community data has significantly expanded the tree of life and illuminated microbial phylogenetic divergence and evolutionary history. This review summarizes important findings in microbial assembly, interaction, functioning, activity and diversification, highlighting the interacting roles of different aspects, derived from community compositional data.
KeywordsrRNA gene Microbial community Assembly Interaction Ecosystem functioning
Microorganisms play key roles in biogeochemical cycling that are fundamental in maintaining climate and ecosystem stability. The structure of microbial communities is closely associated with environmental conditions and therefore is likely to evolve in the context of global change (Gutknecht et al. 2012). In the marine environment, frequent natural events and increasing human activity dramatically influence microbial community dynamics, which will change the balance of biogeochemical cycles and alter ecosystem functioning (Hutchins and Fu 2017). One of the major concerns associated with global changes is how to effectively predict variations in ecosystem functioning. Microorganisms, as major drivers of many biogeochemical processes, provide a linkage between ecosystem functioning and environments (Singh et al. 2010).
Marine microbial communities are significantly affected by environmental changes. Sanger and high-throughput sequencing in recent decades have provided an enormous amount of sequence data of molecular marker genes, including the ribosomal RNA (rRNA) gene. These data have helped to provide insights into marine microbial community dynamics (Liu et al. 2019; Needham et al. 2017; Reji et al. 2019), which are driven by environmental factors, such as salinity (Lozupone and Knight 2007) and temperature (Sunagawa et al. 2015). However, the association between environments and microbial dynamics (termed as deterministic processes) can be confounded by the effects of random events (termed as stochastic processes that include ecological drift and dispersal) (Mo et al. 2018; Wang et al. 2019; Zhou and Ning 2017). Deterministic and stochastic processes, which jointly determine microbial biogeography, vary in their relative contribution to community assembly over different temporal and spatial scales (Zhou and Ning 2017). Differences in microbial distribution impact ecosystem functioning. Their relationships can mostly be explained by the observed correlations between microbial phylogeny and functional traits and between microbial diversity and ecosystem functioning. There are ways to predict microbial functional potential based on taxonomy (Aßhauer et al. 2015; Langille et al. 2013; Louca et al. 2016). Therefore, it has become increasingly common to improve the fitness of ecosystem functioning prediction by including microbial data.
Marine microbes are highly diverse and encompass taxonomically and functionally different lineages. Complex interactions occur among microbial taxa, which underpin community stability and functioning. However, elucidation of microbial interactions is challenging and is largely dependent on correlation-based network analysis (Liu et al. 2014b; Milici et al. 2016; Zhang et al. 2014; Zhou et al. 2018). Diverse microbial communities can be divided into subcommunities, based on different criteria, such as abundance (abundant and rare taxa), functional type (e.g., autotrophic and heterotrophic taxa) and activity (active and dormant taxa). Accumulating evidence is showing that microbial subcommunities differ in their environmental sensitivity, interaction and distribution patterns (Wu et al. 2017; Zhang et al. 2014). Thus, different subcommunities may represent different consortiums and differ in their roles in ecosystem functioning prediction. Currently, the ecology of microbial subcommunities is less understood than the whole community, raising the need for a resolved community-based classification for future analysis.
By implementing rRNA gene-based amplicon sequencing, numerous previously unknown microbial lineages, even at the phylum level, have been described from the marine environment (Brown et al. 2015; DeLong 1992; Inagaki et al. 2003). These, together with those identified from terrestrial habitats, dramatically expand the tree of life (Hug et al. 2016). Refined phylogenetic analysis of molecular marker genes further demonstrates the occurrence of habitat-specific ecotypes within a lineage (Ivars-Martinez et al. 2008; Liu et al. 2014a). The diversification of microorganisms can be attributed to a joint effect of genetic and environmental variabilities, which dictate the specific evolutionary history of a taxon.
In this review, five aspects are presented (microbial assembly, interaction, functioning, activity and diversification) to show how microbial community data contribute to the understanding of marine microbial ecology. They are organized along a stepwise understanding of the role microbial communities play in marine ecosystems. A synthetic view of these aspects will provide novel insights into their interactions and complementarity, which will in turn help to stimulate new ideas on the interpretation of community compositional data and the perception of new microbial community studies.
Processes of microbial community assembly
One long debated question in community ecology is which processes determine an ecological community to assemble (Preston 1948). The current, well-established theories are primarily derived from research on animals and plants. Studies on microorganisms are scarce because of the assumption that microorganisms do not have distribution patterns due to their large numbers and small size (Baas-Becking 1934). However, using advanced sequencing technologies and statistical methods, microbial distribution patterns have been discovered in many natural environments including seawater and marine sediments (Liu et al. 2014b, 2015a; Lozupone and Knight 2007; Martiny et al. 2006). The spatial turnover of microbial communities always reflects a distance-decay relationship and/or a taxa-area relationship, which are the two most well established patterns depicting increasing community dissimilarity with spatial distance (Nekola and White 1999) and increasing taxa richness with area size (Horner-Devine et al. 2004), respectively.
Comparison of the niche and neutral theories
The niche theory
The neutral theory
1959 (A general description of niche concept)
Coexistence by niche differentiation
Random events occurring on individual taxon
Zero-sum multinomial distribution
Abiotic environmental selection and biotic interspecific interactions
Ecological drift and dispersal
Feature of process
Examples of studies examining the relative role of deterministic and stochastic processes in structuring microbial community
Arbuscular mycorrhizal fungi
< 50 m
Species relative abundance and spatial distance
Dumbrell et al. (2010)
Caruso et al. (2011)
~ 130 km in maximum
Neutral community model
Roguet et al. (2015)
Soil spanning 105 years succession
Newly designed model
Community when succession proceeded
Community at initial stage
Dini-Andreote et al. (2015)
Bacteria and archaea
Graham et al. (2017)
Coastal water and sediment
~ 20 km in maximum
Neutral community model and variation partitioning
Chen et al. (2017)
~ 200 km
Dai et al. (2017)
Bacteria and protist
Stegen’s framework and variation partitioning
Protist community (bottom water)
Protist (surface and deep chlorophyll maximum waters) and bacterial (all three layers) communities
Wu et al. (2018)
~ 1300 km in maximum
Neutral model and variation partitioning
Both abundant and rare communities
Mo et al. (2018)
~ 200 km
Stegen’s framework and variation partitioning
Wang et al. (2019)
Deterministic and stochastic processes jointly govern the assembly of microbial communities (Chave 2004). However, their relative importance varies across different spatial and temporal scales (Table 2), depending on the strength of environmental gradients and the sensitivity of the microbes to environmental changes. If the extent of environmental variation is greater than the threshold a microbe can endure, dispersal will be prevented (Wang et al. 2013), leading to the predominance of determinism. Thus, the mechanisms underlying microbial community assembly would alter over a seasonal or longer term period with changes in the magnitude of environmental heterogeneity (Dini-Andreote et al. 2015; Langenheder et al. 2012). These conclusions are mostly based on investigations of terrestrial microbial communities. By comparison, there have been very few studies on the relative roles of deterministic and stochastic processes in the marine environment. However, there is a general perception that stochastic processes have a greater effect on the assembly of planktonic bacterial and archaeal communities than deterministic processes (Table 2). This can be explained by marine prokaryotes having evolved strong adaptation capabilities to environmental changes and by spatial connectivity and seawater movement homogenizing environmental conditions. Another explanation is that the environmental factors analyzed to represent deterministic processes may be not the most relevant ones affecting community variations. Further studies are needed to confirm such a hypothesis and to compare the assembling processes between different habitats, such as coastal water vs open ocean and water vs sediment.
Different types of marine organisms differ in their responses to deterministic and stochastic processes. Wu et al. (2018) reported that determinism had a stronger effect on planktonic protist communities than on bacterial communities, which may relate to differences in their environmental sensitivity. Such different responses between bacteria and micro-eukaryotes have also been observed in soil (Powell et al. 2015a) and freshwater (Logares et al. 2018) habitats. Additionally, subcommunities that are divided by abundance, activity, functional trait or occupancy, can also undergo different ecological processes (Fig. 1). A microbial community is usually made up of a few abundant taxa and a long tail of rarer ones (Pedrós-Alió 2006, 2012). The rare taxa account for a great proportion of the microbial diversity and have been shown to assemble non-randomly and display similar distribution patterns to the abundant taxa (Galand et al. 2009; Gong et al. 2015; Liu et al. 2015b; Mo et al. 2018). Nevertheless, the abundant and rare taxa have both been observed to be differently affected by stochastic and deterministic processes (Liu et al. 2015b; Mo et al. 2018). Mo et al. (2018) reported that the rare bacterioplankton in coastal seawater had a weaker response to environmental factors than abundant taxa; this may be due to the small population size of the rare taxa, making them more susceptible to ecological drift (Nemergut et al. 2013). By contrast, a survey of bacterial communities in freshwater lakes and reservoirs revealed a greater influence of environmental changes on the rare than the abundant taxa (Liu et al. 2015b). These findings suggest complicated microbial ecological responses across distinct ecosystems. In this context, further studies are needed to gain an insight into the assembling processes of abundant and rare taxa in different environments. The most urgent need is to propose a common definition for rare taxa, facilitating a parallel comparison across studies (Jia et al. 2018). Subcommunities divided by functional traits, activity and occupancy receive less attention and have mainly been analyzed in terrestrial environments. For example, in deserts, the phototrophic community was mainly affected by stochastic processes, whereas the heterotrophic community displayed patterns mainly driven by environmental stresses (Caruso et al. 2011). The assembly of generalists and specialists in plateau lakes, however, was driven by stochastic and deterministic processes, respectively (Liao et al. 2016). While these studies provide novel and accurate information about the distribution patterns and assembling processes of microbial communities, there is an urgent need to investigate different subcommunities in the marine environment. It should be noticed that when evaluating the relative role of determinism and stochasticity, the estimated contribution from determinism is largely affected by the set of environmental factors measured, since they are not necessarily the most relevant parameters that provide the best explanatory power for the community variations.
Patterns of microbial co-occurrence
In niche theory, the microbe-microbe interactions, although being ecologically important, are less understood compared to the microbe-environment relationships (Chase and Leibold 2003). Inclusion of interactions to explain microbial distribution patterns is a great challenge, largely due to the difficulty in obtaining microbial co-cultures. An alternative way of elucidating microbial interactions is to apply correlation-based network analysis (Barberán et al. 2012; Layeghifard et al. 2017; Weiss et al. 2016), which is enhanced by the increase of community compositional data and the development of statistical tools. The most popular method used for constructing a correlation-based network is to calculate the Spearman’s rank correlation coefficients between taxa (Barberán et al. 2012; Table 2). Other methods are also available, including SPIEC-EASI, CCLasso, REBACCA, CoNet, SparCC, WGCNA, Molecular Ecological Networks Analysis, Local Similarity Analysis, Maximal Information Coefficient, etc. These methods, however, vary in their sensitivity and precision (Layeghifard et al. 2017; Weiss et al. 2016).
Nodes and edges are fundamental components of a network, representing taxa and correlations, respectively. Edge thickness often denotes the degree of a correlation, with a thicker edge representing a higher correlation coefficient. On the basis of nodes and edges, a number of parameters can be calculated to represent the topological structure of a network, including degree, density, betweenness centrality, network diameter and clustering coefficient (Newman 2003). The degree of a node describes its connectivity to other nodes, with a higher value indicating a wider correlation. The betweenness centrality of a node describes the number of shortest paths between any two nodes going through it. The nodes with high degree and low betweenness centrality potentially represent the keystone taxa of a community (Berry and Widder 2014; Liang et al. 2016). The keystone taxa are the cornerstone and initial components for a community to assemble (Berry and Widder 2014) and have recently been defined as “highly connected taxa that individually or in a guild exert a considerable influence on microbiome structure and functioning irrespective of their abundance across space and time” (Banerjee et al. 2018). A group of densely connected nodes with weak correlations to other nodes forms a module. Modular analysis can help to simplify the processes of identifying keystone taxa and/or exploring the effect of environmental factors on microbe-microbe interactions.
Examples of network analysis conducted in the marine ecosystem
Network within a single system
Bacteria and archaea
Spearman’s rank correlations
Liu et al. (2014b)
Whittaker’s index of associations
Buttigieg and Ramette (2015)
Bryant et al. (2016)
Spearman’s rank correlations
Qiao et al. (2018)
Bacteria, archaea and microbial eukaryotes
Spearman’s rank correlations
Zhou et al. (2018)
Network across different systems
Spatially resolved network
Extended local similarity analysis
Cram et al. (2015)
Milici et al. (2016)
Bacteria and archaea
Water and sediment
Spearman’s rank correlations
Wei et al. (2016)
Spearman’s rank correlations
Cui et al. (2019)
Temporally resolved network
Bacteria and archaea
Seawater (deep chlorophyll maximum)
Local similarity analysis
Beman et al. (2011)
Extended local similarity analysis
Cram et al. (2015)
Bacteria, myoviruses and phytoplankton
Daily to weekly
Local similarity analysis
Needham et al. (2017)
Daily to weekly
Extended local similarity analysis
Berdjeb et al. (2018)
Bacteria and archaea
Chafee et al. (2018)
Several studies have attempted to use networks to infer potential functional couplings between microbes. For example, Thaumarchaeota Marine Group I (MG-I), the most abundant archaeal clade in the marine environment, capable of ammonia oxidization, has been found to co-occur with Nitrospina (Reji et al. 2019) and/or with Nitrospira when Nitrospina is absent or in low abundance (Wang et al. 2019), both of which are nitrite oxidizers. Their co-occurrence in seawater is supported by substrate feeding (nitrite produced by MG-I is the substrate of Nitrospina/Nitrospira) and facilitates the complete nitrification process. Previous efforts to explore co-occurrence patterns between functional bacteria in marine sediments have demonstrated significant correlations between sulfate-reducing bacteria and sulfur-oxidizing bacteria, and between sulfate-reducing bacteria and nitrite-oxidizing bacteria (Liu et al. 2014b). Elucidation of co-occurrence patterns with functional gene abundance derived from GeoChip and metagenome may facilitate a more direct inference. However, the obtained co-occurrence patterns should be treated with caution when used to infer functional couplings, since they are not necessarily reflecting real interactions.
Classically, a network describes co-occurrence patterns between taxa. However, environmental variables can also be included to explore microbe-environment relationships. Additionally, considering the natural complexity of inter taxa relationships in an ecosystem, pairwise microbe-microbe correlations, derived from most current network analyses, need to be expanded to a higher order, such as three- or four-way correlations. The high-order microbial co-occurrence patterns may involve possible disruption or enhancement of another taxon to a pairwise relationship (Bairey et al. 2016). Such high-order co-occurrence patterns can also be unraveled by analyzing compositional data, as long as new and proper statistical tools are developed (Bairey et al. 2016). Although co-occurrence patterns are not appropriate to imply accurate microbial interactions, their spatiotemporal dynamics hold the potential to affect the assembling processes and ecological roles of microbial communities.
Microbial biogeochemical roles and ecosystem functioning
Phylogeny and functional traits
To better understand microbial ecosystem functioning, creating links between an individual taxon and a specific function is required. However, it is infeasible to trace all the diverse biogeochemical processes and to relate them to taxa. This problem may be solved from the point of view of the microorganisms, as microbial functional capabilities have been found to connect strongly with phylogeny (Martiny et al. 2013; Zimmerman et al. 2013). For example, the presence of functional genes related to oxygenic photosynthesis, methane oxidation and sulfate reduction has been found to be highly phylogenetically conserved (Martiny et al. 2013). In coastal seawater, recent studies have shown that microorganisms that assimilated organic matter, including starch and glucose, were phylogenetically clustered (Bryson et al. 2017; Mayali and Weber 2018), reflecting phylogenetically conserved resource partitioning in the coastal microbial loop (Bryson et al. 2017). Such phylogenetic conservation in substrate utilization supports similar distribution patterns (even at a broader taxonomic level; Philippot et al. 2010; Schmidt et al. 2016) and similar lifestyles among microbial relatives. Salazar et al. (2015) found that the particle-associated and free-living populations in the deep ocean had different phylogenetic origins. These observations enhance the possibility of inferring microbial functional traits with phylogenetic information.
However, growing evidence now suggests that variations in functional traits occur within closely related microbes, even at the loosely defined species level (Larkin and Martiny 2017). Prochlorococcus, the most abundant genus of photosynthetic organisms, diverges into high- and low-light-adapted ecotypes, which display different light-harvesting strategies (Bibby et al. 2003). Likewise, Alteromonas macleodii, a typical copiotrophic r-strategist, contains both surface and deep-sea ecotypes (Ivars-Martinez et al. 2008), which have been shown to differ substantially in their capacity to degrade algal polysaccharides (Neumann et al. 2015). On the other hand, microorganisms performing similar metabolic functions can be only distantly related (Martiny et al. 2015), the basic principle of functional redundancy. Louca et al. (2016, 2017) found high functional redundancy in both marine and plant-associated microbial communities, implying that microbial functional traits are widely spread among microbial lineages.
Diversity and ecosystem functioning
There is growing evidence of a positive relationship between microbial diversity and ecosystem functioning (Cardinale et al. 2012; Delgado-Baquerizo et al. 2016b; Schnyder et al. 2018), although negative or no relationships have also been reported (Becker et al. 2012). Such relationships are derived primarily from studies on the terrestrial environment and have rarely been assessed for marine microbial communities. A study of microbial diversity-ecosystem functioning (DEF) relationship in marine surface water also supported a positive correlation, showing that a more phylogenetically diverse bacterial community had a greater level of ecosystem functioning (heterotrophic productivity measured by leucine incorporation; Galand et al. 2015). The enhancement of ecosystem functioning by increased biodiversity is thought to result from complementarity (minimal overlap) in resource use by functionally distinct taxa (Petchey and Gaston 2002) and/or through inter taxa facilitation (Hooper et al. 2005). Therefore, the relationship between diversity and ecosystem functioning is controlled by the niche-based mechanisms: differentiation in resource niche and selection effect (Krause et al. 2014).
The few studies that investigated the shape of the positive relationship between microbial diversity and ecosystem functioning have frequently uncovered a more linear relationship (Delgado-Baquerizo et al. 2016a) than the approaching-flat relationship seen for plants and animals (Cardinale et al. 2011). Such a linear relationship implies an indefinite increase of ecosystem functioning with increasing microbial diversity, challenging the idea of functional redundancy as mentioned above. In fact, Galand et al. (2018) provide evidence against the hypothesis of functional redundancy by showing a strong link between marine microbial community compositions and functional attributes using all the set of metagenomic reads. The authors emphasize the need to consider all functional aspects rather than relying only on known genes in investigating microbial DEF relationships. In addition, different processing rates seen in the same functional trait (Morrissey et al. 2016) may also provide opposing evidence against functional redundancy. However, these findings do not rule out the possibility for a partial functional redundancy, implicating that different types of functional traits may have different levels of redundancy. The idea of functional redundancy on the one hand can help to explain the high level of marine microbial diversity (different taxa are supported by a limited range of resources and conduct the same set of metabolic processes; Allison and Martiny 2008), while on the other hand can limit the extent of ecosystem functioning.
Diversity is composed of different components, including richness (taxonomic diversity), phylogeny (phylogenetic diversity) and function (functional diversity) (Fig. 2b). Different types of diversity can inform distinct microbial DEF relationships. However, taxonomic diversity is the more frequently used proxy in inferring DEF relationships, compared to functional diversity and phylogenetic diversity. It has been reported that taxonomic diversity has relatively little impact on ecosystem functioning (Nielsen et al. 2011), while functional diversity was more correlated, mostly likely by determining ecological niches and inter taxa interactions (Hooper et al. 2005; Krause et al. 2014). Nevertheless, functional diversity is always difficult to measure (functional activity) and/or requires additional sequencing efforts to analyze (functional genes). Thus, phylogenetic diversity is increasingly implemented as a proxy of functional diversity, with the thought that many functional traits are phylogenetically conserved. Indeed, a positive correlation has been found between marine surface bacterial productivity and phylogenetic diversity of the active community; no similar association was found when taxonomic diversity (Shannon index) was analyzed (Galand et al. 2015). The findings of Galand et al. (2015) highlighted that ecosystem functioning is related to the active rather than the total community that contains dormant taxa. This provides an explanation for the more frequently observed negative and/or no relationships between phylogenetic diversity of the total community and ecosystem functioning (Goberna and Verdu 2018; Pérez-Valera et al. 2015; Severin et al. 2013). The relationship between phylogenetic diversity and ecosystem functioning also relates to taxon-specific functional capability and evolutionary history (Gravel et al. 2012). Under which conditions phylogenetic diversity can be used as a representative of functional diversity should be characterized further.
The microbial DEF relationship can be confounded by environmental variations, as environmental factors can exert influences on both diversity and ecosystem functioning. Orland et al. (2018) demonstrated that pH and organic matter quantity and quality explained as much variation in CO2 production as did taxonomic diversity in lake sediments; these environmental factors exerted direct influences on ecosystem functioning due to their unrelatedness to taxonomic diversity. Comparatively, Delgado-Baquerizo et al. (2016b) found in a global set of soil samples that the DEF relationship was maintained when accounting for edaphic factors, which suggests that taxonomic diversity can exert influences on ecosystem functioning independently of environmental variabilities. A global survey of microbiome in seawater showed a decoupling of taxonomy and function, with the latter being more susceptible to environmental changes (Louca et al. 2016). Environmental conditions determine the availability of electronic donors/acceptors to microbes and shape the process of biogeochemical reactions. In addition to the environment, stochastic processes may also affect diversity and influence ecosystem functioning (Orland et al. 2018; Zhou et al. 2013). Overall, the positive DEF relationship in microorganisms is supportive of phylogenetic conservation in functional traits. More effort is needed to discern the role of different diversity components and the role of deterministic and stochastic processes in determining ecosystem functioning.
Ecosystem functioning prediction
The abovementioned relationships have stimulated great interest in using microbial data to predict ecosystem functioning (statistical simulation instead of direct measurement) (Graham et al. 2016; Powell et al. 2015b) (Fig. 2b). Here, we focus on the interactions between community (diversity and abundance) and ecosystem functioning, although physiological properties can also be related (Wieder et al. 2013). Graham et al. (2016) synthesized 82 global datasets from different ecosystems to improve the predictive power of carbon and nitrogen processing rates by the inclusion of the microbial community data. They found that the addition of both compositional and diversity data could strengthen the predictive power, although this was not applicable to all datasets. Andersson et al. (2014) demonstrated, via structural equation models, that the model that included total bacterial abundance explained 54% of the variation in nitrogenase activity in coastal sediments and by replacing total bacterial abundance with cyanobacterial biomass it could increase the predictive power.
The explanatory power of microbial data in predictive models is always lower than that of abiotic factors (Graham et al. 2014, 2016; Powell et al. 2015b), consistent with the notion that environmental factors have direct impacts on ecosystem functioning. Moreover, under different environmental conditions (e.g., temperature, Dolan et al. 2017), the explanatory power of microbial data may change. However, this does not decrease the importance of microbial data in functional prediction. Recently, Zhang et al. (2018) found that in coastal sediments adding different copy number ratios of functional and rRNA genes into stepwise regression models substantially increased the predictive power of denitrification and anammox rates, although alpha diversity and gene abundance of involved bacteria were poorly correlated to the function potentials. In addition to abundance and diversity, it is also important to include microbial interactions to the predictive model in future studies (Fig. 2b). In summary, knowledge on the distribution patterns of microbial communities is indispensable for understanding their biogeochemical and ecological functions.
Inference of microbial activity
Mounting evidence suggests that there is a significant difference between the total (resident) and active fractions within a microbial community. Several abundant taxa are less active in the RNA pool, whereas some highly active taxa show low abundance or are almost absent in the DNA pool (Baldrian et al. 2012; Richa et al. 2017; Romanowicz et al. 2016; Sebastián et al. 2018). For example, Cyanobacteria and the SAR11 clade of Alphaproteobacteria are the most abundant microbes in the global surface ocean; whereas the former is always disproportionately active (16S rRNA:rDNA > 1), the latter tends to be less active (Campbell and Kirchman 2013; Hunt et al. 2013; Zhang et al. 2014). Further, a refined phylogenetic analysis showed that different ecotypes of the SAR11 clade varied in their 16S rRNA:rDNA ratios (Salter et al. 2015). A similar phenomenon was also shown for different ecotypes of MG-I (Hugoni et al. 2013). Within a microbial community, activity can also vary between rare and abundant taxa (Campbell et al. 2011; Richa et al. 2017). Richa et al. (2017) reported that more than 70% of the rare taxa in coastal seawater of the Mediterranean Sea had high 16S rRNA:rDNA ratios. To explain this decoupling of abundance and activity, Campbell et al. (2011) proposed that a substantial proportion of bacteria became active when their abundance decreased, indicating that high abundance may be a constraint factor for activity or that top-down processes i.e., grazing and virus lysis could stimulate activity. Seasonality (Hugoni et al. 2013), environmental factors, such as salinity (Campbell and Kirchman 2013), and lifestyle modes (free-living and particle-attached; Li et al. 2018) have also been reported as drivers for 16S rRNA:rDNA ratio variations.
The active and total microbial communities have been found to display contrasting biogeographic patterns and respond differently to environmental factors in seawater (Zhang et al. 2014). Environmental changes would generate uncomfortable conditions for active microbes, and the adaptation and successful establishment of active microbes in a new environment are difficult (Hanson et al. 2012). By contrast, the growth of several dormant microbes can be stimulated by changing environments, contributing to microbial community succession. Considering this, the active community is likely to display a stronger distance-decay relationship than the total community (Zhang et al. 2014). Nevertheless, our understanding of the assembling processes (relative role of deterministic and stochastic processes) of the total and active communities is limited. Zhang et al. (2014) also found that the active (dominated by negative correlations) and total (dominated by positive correlations) bacterial communities exhibited different co-occurrence patterns. Frequent transitions of microbes between an active or dormant status under changing environments would lead to variations in co-occurrence patterns, causing significant alterations in ecosystem functioning (Fig. 3). As mentioned above, active microbes are directly linked to ecosystem functioning by actively carrying out biogeochemical reactions (Nannipieri et al. 2003). Thus, distinguishing the active fraction from the whole community would provide novel insights into patterns of microbial assembly, co-occurrence and DEF relationship. Moreover, the finding of a higher environmental sensitivity of active heterotrophs than active autotrophs in seawater (Zhang et al. 2014) further raises the need to treat functionally different microbial groups separately.
Another possible utilization of the rRNA:rDNA ratio is to indicate potential growth rate (Campbell et al. 2011; Lankiewicz et al. 2016), as numerous microorganisms are yet to be cultivated and their growth rates are not able to be measured (Kirchman 2016). A high rRNA:rDNA ratio may imply a high growth rate. Thus, the lower proportion of SAR11 in the RNA than in the DNA pool, as mentioned above, may indicate a slow growth mode, although it may also be due to the low number of ribosomes per cell, given its small cell size. The slow growth rate of SAR11, however, is further verified by culture-based analyses (Lankiewicz et al. 2016). In comparison, copiotrophic taxa from Alteromonas and the Rosebacter clade often display higher growth rates (Hunt et al. 2013; Lankiewicz et al. 2016). Noticeably, the rRNA:rDNA ratio has been reported to be not always effective in quantifying microbial growth rates, and protein synthesis potential has been proposed to be a more suitable interpretation (Blazewicz et al. 2013).
Insight into microbial diversification
The ribosomal gene sequences obtained from the marine environments using culture-independent methods have significantly expanded the microbial phylogenetic tree and public rRNA gene databases (the most popular are SILVA, Greengenes and RDP databases) (Brown et al. 2015; Delong 1992; Hug et al. 2016; Kubo et al. 2012). The microbial groups that have no close isolates at the time of discovering their rRNA gene sequences are often indicated by informal nomenclature, such as SAR11, MG-I and Miscellaneous Crenarchaeota Group (MCG). Further, in-depth phylogenetic analyses of the rRNA gene sequences have clustered many microbial groups, such as MG-I and MCG, into subclades (Kubo et al. 2012; Liu et al. 2014a). These subclades have been shown to respond differently to environmental changes and exhibit different habitat preferences (Lazar et al. 2016; Liu et al. 2014a). Similar niche differentiations are also shown for marine bacteria sharing high 16S rRNA gene similarities (e.g., Alteromonas macleodii as mentioned above). These observations suggest that environmental selection plays a crucial role in the process of microbial evolution. In addition to rRNA genes, functional genes can be used to investigate the diversification of functional groups. For example, Alves et al. (2018) recently elucidated the global frequency, phylogenetic diversity and habitat specificity of ammonia-oxidizing archaea using amoA, the gene encoding the active-site subunit of ammonia monooxygenase. These results highlight the importance of using community data derived from molecular marker genes to investigate the phylogenetic relationships and to infer the evolutionary histories of different microbial taxa.
However, a single marker gene is not always effective in differentiating phylogenetically close relatives. In this context, the multilocus sequence typing (MLST) analysis, which takes the phylogenetic information of several (usually 5–7) housekeeping genes, has been shown to provide a better resolution (Enright and Spratt 1999). In recent years, the increasing number of public metagenomic/genomic sequences has provided an excellent resource for determining phylogenetic relatedness among microbes with a high confidence level (Brown et al. 2015; Hug et al. 2016). These phylogenomic analyses have enabled the designation of many novel microbial phyla, including the well-known Thaumarchaeota (containing MG-I) and Bathyarchaeota (MCG) (Adam et al. 2017). Spang et al. (2015) proposed a new archaeal phylum Lokiarchaeota through phylogenomic inference. This phylum contains many eukaryotic signatures, significantly contributing to our understanding of the evolution of life.
The prevalence of studies on microbial communities in the marine environment benefits from the development of new sequencing technologies. The resulting increase in sequencing depth facilities the obtaining of accurate insights into microbial community structure, and in particular enhances the capability of distinguishing rare subcommunities from sequencing errors. In the era of big data, the development of sequencing technologies calls for a simultaneous introduction of new statistical methods and tools, which would help to discern microbial assembling processes and unravel high-order (three- or four-way) microbe-microbe interactions that most likely occur in natural environments. Although microbe-microbe interactions, to date derived mainly from co-occurrence networks, are important in structuring microbial communities and mediating ecological functions, such linkages are still largely elusive. Suitable analytic tools are required to construct a model that bridges the gap between microbial interactions and ecosystem functioning.
Owing to the difficulty in measuring microbial functions, inclusion of community data in the predictive modeling of ecosystem functioning is receiving increasing attention. This raises associated questions such as whether different functional types exhibit similar sensitivity to community data and vice versa, and whether relationships between community data and ecosystem functioning vary spatially and temporally. In addition, the decoupling of phylogeny and function may act as an obstacle for such modeling, which calls for a resolved phylogenetic clustering and taxon-specific functional characterization. Omics, such as metagenome, single-cell genome and metatranscriptome, in parallel with advanced analytic tools, provide great opportunities to link a community, and further to link an individual to function (Fig. 2b).
In this review, we highlight the significance of separating a whole microbial community into subgroups according to different standards. The abundance-dependent grouping has led to the realization that rare subcommunities, the members of which often show high activity, have the potential to increase the power of ecosystem functioning prediction. By contrast, functional-, and other traits-dependent grouping have received less attention. After it has been established what exists, it is then necessary to determine what is alive and whether the active taxa follow patterns seen for the total community, and how active taxa contribute to ecosystem functioning. We also need to know if functional traits of the active community are phylogenetically organized, because such information is important for constructing a direct linkage between community and ecosystem functioning. In summary, subdivision of the whole community will provide accurate and novel insights into relationships between microbial diversity, interaction and ecosystem functioning.
This work was supported by the National Natural Science Foundation of China (Grants Nos. 41976101, 41506154 and 41730530) and the Fundamental Research Funds for the Central Universities (Grants No. 201762017).
XZ conceived, provided the idea to this work and edited the manuscript; JL, ZM and XL wrote the manuscript; JL prepared the figures and tables. All authors approved the final manuscript.
Compliance with ethical standards
Conflict of interest
The authors declare that they have no conflict of interest.
Animal and human rights statement
This article does not contain any studies with human participants or animals performed by any of the authors.
- Baas-Becking LGM (1934) Geobiologie of inleiding tot de milieukunde. W.P. van Stockum & Zoon, Den HaagGoogle Scholar
- Baldrian P, Kolarik M, Stursova M, Kopecky J, Valaskova V, Vetrovsky T, Zifcakova L, Snajdr J, Ridl J, Vlcek C, Voriskova J (2012) Active and total microbial communities in forest soil are largely different and highly stratified during decomposition. ISME J 6:248–258PubMedCrossRefPubMedCentralGoogle Scholar
- Graham EB, Knelman JE, Schindlbacher A, Siciliano S, Breulmann M, Yannarell A, Bemans JM, Abell G, Philippot L, Prosser J, Foulquier A, Yuste JC, Glanville HC, Jones DL, Angel F, Salminen J, Newton RJ, Burgmann H, Ingram LJ, Hamer U et al (2016) Microbes as engines of ecosystem function: when does community structure enhance predictions of ecosystem processes? Front Microbiol 7:214PubMedPubMedCentralGoogle Scholar
- Hubbell SP (2001) The unified neutral theory of biodiversity and biogeography (MPB-32), vol 32. Princeton University Press, PrincetonGoogle Scholar
- Jeraldo P, Sipos M, Chia N, Brulc JM, Dhillon AS, Konkel ME, Larson CL, Nelson KE, Qu A, Schook LB, Yang F, White BA, Goldenfeld N (2012) Quantification of the relative roles of niche and neutral processes in structuring gastrointestinal microbiomes. Proc Natl Acad Sci USA 109:9692–9698PubMedCrossRefGoogle Scholar
- Langille MGI, Zaneveld J, Caporaso JG, McDonald D, Knights D, Reyes JA, Clemente JC, Burkepile DE, Thurber RLV, Knight R, Beiko RG, Huttenhower C (2013) Predictive functional profiling of microbial communities using 16S rRNA marker gene sequences. Nat Biotechnol 31:814–821PubMedPubMedCentralCrossRefGoogle Scholar
- Lima-Mendez G, Faust K, Henry N, Decelle J, Colin S, Carcillo F, Chaffron S, Ignacio-Espinosa JC, Roux S, Vincent F, Bittner L, Darzi Y, Wang J, Audic S, Berline L, Bontempi G, Cabello AM, Coppola L, Cornejo-Castillo FM, d’Ovidio F et al (2015) Ocean plankton. Determinants of community structure in the global plankton interactome. Science 348:1262073PubMedCrossRefPubMedCentralGoogle Scholar
- Liu J, Zheng Y, Lin H, Wang X, Li M, Liu Y, Yu M, Zhao M, Pedentchouk N, Lea-Smith DJ, Todd JD, Magill CR, Zhang WJ, Zhou S, Song D, Zhong H, Xin Y, Yu M, Tian J, Zhang X-H (2019) Proliferation of hydrocarbon-degrading microbes at the bottom of the Mariana Trench. Microbiome 7:47PubMedPubMedCentralCrossRefGoogle Scholar
- Martiny JBH, Bohannan BJM, Brown JH, Colwell RK, Fuhrman JA, Green JL, Horner-Devine MC, Kane M, Krumins JA, Kuske CR, Morin PJ, Naeem S, Ovreas L, Reysenbach AL, Smith VH, Staley JT (2006) Microbial biogeography: putting microorganisms on the map. Nat Rev Microbiol 4:102–112PubMedCrossRefPubMedCentralGoogle Scholar
- Milici M, Deng ZL, Tomasch J, Decelle J, Wos-Oxley ML, Wang H, Jauregui R, Plumeier I, Giebel HA, Badewien TH, Wurst M, Pieper DH, Simon M, Wagner-Dobler I (2016) Co-occurrence analysis of microbial taxa in the Atlantic Ocean reveals high connectivity in the free-living bacterioplankton. Front Microbiol 7:649PubMedPubMedCentralGoogle Scholar
- Richa K, Balestra C, Piredda R, Benes V, Borra M, Passarelli A, Margiotta F, Saggiomo M, Biffali E, Sanges R, Scanlan DJ, Casotti R (2017) Distribution, community composition, and potential metabolic activity of bacterioplankton in an urbanized mediterranean sea coastal zone. Appl Environ Microbiol 83:e00494–17PubMedPubMedCentralCrossRefGoogle Scholar
- Sunagawa S, Coelho LP, Chaffron S, Kultima JR, Labadie K, Salazar G, Djahanschiri B, Zeller G, Mende DR, Alberti A, Cornejo-Castillo FM, Costea PI, Cruaud C, d’Ovidio F, Engelen S, Ferrera I, Gasol JM, Guidi L, Hildebrand F, Kokoszka F et al (2015) Ocean plankton. Structure and function of the global ocean microbiome. Science 348:1261359PubMedCrossRefPubMedCentralGoogle Scholar
- Weiss S, Van Treuren W, Lozupone C, Faust K, Friedman J, Deng Y, Xia LC, Xu ZZ, Ursell L, Alm EJ, Birmingham A, Cram JA, Fuhrman JA, Raes J, Sun FZ, Zhou JZ, Knight R (2016) Correlation detection strategies in microbial data sets vary widely in sensitivity and precision. ISME J 10:1669–1681PubMedPubMedCentralCrossRefGoogle Scholar
Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.