Summarized datasheet for multi-omics response of three Exaiptasia strains to heat stress: a new way to process omics data
Corals, the building blocks of reef ecosystems, have been severely threatened by climate change. Coral bleaching, the loss of the coral’s endosymbiotic algae, occurs as a consequence of increasing ocean temperature. To understand mechanisms of stress tolerance in symbiotic cnidarians, the sea anemone Exaiptasia pallida from different regions was heat stressed. The three strains originated from the Red Sea, Hawaii and North Carolina, each with different temperature profiles, enabling a comparative study of local adaptation strategies.
Whole transcriptome and proteome data were collected from all anemones at control and stress condition. As part of the analysis of this large, multi-omic data, we wrote a script that creates a tabular datasheet that summarized the transcriptomic and proteomic changes for every gene. It facilitates the search of individual genes, or a group of genes, their up- or downregulation during stress and whether this change in expression was statistically significant. Furthermore, it enables examining if changes in RNA correspond to those in proteins. The datasheet can be used for future comparisons, as well as search and development of biomarkers.
KeywordsExaiptasia Transcriptomics Proteomics Model organism Biomarkers Coral bleaching Thermotolerance Climate change Coral reefs
Anemones originating from North Carolina
Anemones originating from Hawaii
Anemones originating from the Red Sea
Corals live in a symbiotic relationship with the algae Symbiodiniacea, which lives inside their tissue and provides corals with the majority of their energy demand. However, this relationship is fragile; particularly temperature stress can lead to the breakdown of this relationship, known as coral bleaching. Interestingly, a range of temperature tolerances can be found between and within species individuals, leading to some individuals being more susceptible to temperature increase than others. Particularly the habitat from which a coral originates can have an impact on its stress tolerance .
To understand what cellular mechanisms drive thermotolerance, how different genotypes have adapted to temperature and whether origin influences the stress response of symbiotic cnidarians, we conducted full transcriptome and proteome analysis of the coral-symbiosis model organisms the anemone Exaiptasia. Comprehensive analysis of the data and experimental details are described in Cziesielski et al. .
We created a datasheet that summarized all of our gene expression response on both transcriptomic and proteomic level. The spreadsheet eases data discovery, discern common patterns as well as differences in thermotolerance, thus aiding in hypothesis generation. While the raw data is freely accessible, it is far easier to access information summarized in this datasheet, especially for inter-study response comparisons, validation and biomarkers development. Through simply filtering columns for content, anyone can obtain entire transcriptome and proteome responses in a simple, yet informative, format. By making this datasheet available, we hope to contribute to facilitating collaborative progress in coral research, specifically regarding Exaiptasia, for researchers and educators alike.
We realized that this data format could be a useful tool to anyone working on large-omic datasets, as it condenses an extensive amount of sequencing information into an easy to use spreadsheet. In hopes of facilitating—omics data analysis across biological disciplines, we also provide the script used to generate the spreadsheet.
Anemones originating from thermally different environments [North Carolina (CC7), Hawaii (H2) and the Red Sea (RS)] were maintained for over a year at control conditions (25 °C). For thermal stress, population subsets were gradually taken up to 32° and kept there for 24 h. Transcriptomes and proteomes were sequenced  and analyzed for stress response changes, as per Cziesielski et al. .
Information on data files
Name of data file
Data repository identifier (DOI or accession number)
Data file 1
Summary datasheet of Exaiptasia heat stress response
MS Excel file (.xlsx)
Data file 2
Code and raw data used to produce summary datasheet
gzip-compressed tarball (.tar.gz)
Furthermore, we provide the code used to generate this summary sheet, with the hope that future studies will find value in creating summary sheets as presented here . The script, implemented in Python 3, first reads in raw transcriptomic results (the comma-separated *.csv files in Data file 2) and raw proteomic results (the tab-separated “prot.fold_changes.tsv” in Data file 2). We noticed that quite a number of Exaiptasia gene models were duplicated—while this is biologically feasible, these are most likely a result of assembly artefacts. The inclusion of duplicate gene models, which would have identical functional annotations, could potentially bias downstream functional enrichment analyses. To remove this bias, our script reads in a set of whitelisted gene IDs generated in Cziesielski et al. , and removes genes outside this list. The custom script presented here is written to integrate two sets of—omics data.
From a technical point, the in-depth insight into transcriptome and proteome allows investigation into previously suggested biomarkers as well as evaluating new candidates. Many factors need to be kept in consideration and what works for one strain may not necessarily be the correct indicator in another, a factor rarely addressed in biomarker development . Besides transcriptome–proteome interactions, developing and validating biomarkers need to consider that gene homologs respond differently to stress within and across genotypes. This can be observed using the datasheet, for example: glutathione peroxidase, commonly used as a biomarker in heat stress, has at least two homologs that significantly respond in all strains. However, both are significantly regulated in opposite directions (AIPGENE513, AIPGENE5657). Additionally, a gene that responds strongly in one genotype may not have a significant response in others. These limitations can inhibit the accuracy of data interpretation. By considering homolog and genotype response, the datasheet provides a source to make more informed decisions in biomarker usage.
This datasheet was made as a tool in order to utilize previously published data. As such, there are no major limitations. However, it should be kept under consideration that sequencing depth of the proteome is less than that of the transcriptome. While technology and analytical tools are quickly progressing, proteomic tools still do not keep up with sequencing efficiency of transcriptomics . Sequencing depth is critical for correlation studies and comprehensive analysis of the cell. Low proteome coverage is often a result of detecting only abundant proteins and peptides, while low abundant proteins are not detected . Furthermore, proteome changes are naturally time-dependent, and in light of protein misfolding due to heat stress likely further delayed , we cannot exclude time-lag as a potential factor for the absence in significant fold changes. Thus, we were unfortunately only able to sequence 12% of the proteome of Exaiptasia and could not find any significant differences in protein abundances in response to heat stress.
MJC and MA conceived idea for experiment. MJC and YJL conceived idea for datasheet tool. YJL wrote code for datasheet. MJC wrote the manuscript. All authors read and approved the final manuscript.
We thank all of our co-authors from the original study for their help in gathering the data for this datasheet: Guoxin Cui, Sebastian Schmidt-Roach, Sara Campana and Claudius Marondedze.
The authors declare that they have no competing interests.
Availability of data materials
Consent to publish
Ethics approval and consent to participate
Animal maintenance and experimental procedures complied with King Abdullah University of Science and Technology Institutional guidelines.
Research reported in this study was supported by funding from King Abdullah University of Science and Technology. The University also provided laboratory and sequencing facilities.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
- 2.Cziesielski MJ, Liew YJ, Cui G, Schmidt-Roach S, Campana S, Marondedze C, et al. Multi-omics analysis of thermal stress response in a zooxanthellate cnidarian reveals the importance of associating with thermotolerant symbionts. Proc Biol Sci. 2018;285:20172654. https://doi.org/10.1098/rspb.2017.2654.CrossRefPubMedPubMedCentralGoogle Scholar
- 3.Transcriptomic response to heat stress in multiple Exaiptasia pallida strains: H2, CC7, RS, and CC7 (SSB01). Accession: PRJNA406873. 2018. https://www.ncbi.nlm.nih.gov/sra/?term=PRJNA406873. Accessed 12 Dec 2018.
- 4.Liew YJ. lyijin/exaiptasia_datasheet: 1.0. 2018. https://doi.org/10.5281/zenodo.1469124. Accessed 12 Dec 2018.
Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.