Whole-genome discovery of miRNAs and their targets in wheat (Triticum aestivum L.)
MicroRNAs (miRNAs) are small, non-coding RNAs playing essential roles in plant growth, development, and stress responses. Sequencing of small RNAs is a starting point for understanding their number, diversity, expression and possible roles in plants.
In this study, we conducted a genome-wide survey of wheat miRNAs from 11 tissues, characterizing a total of 323 novel miRNAs belonging to 276 families in wheat. A miRNA conservation analysis identified 191 wheat-specific miRNAs, 2 monocot-specific miRNAs, and 30 wheat-specific variants from 9 highly conserved miRNA families. To understand possible roles of wheat miRNAs, we determined 524 potential targets for 124 miRNA families through degradome sequencing, and cleavage of a subset of them was validated via 5′ RACE. Based on the genome-wide identification and characterization of miRNAs and their associated target genes, we further identified 64 miRNAs preferentially expressing in developing or germinating grains, which could play important roles in grain development.
We discovered 323 wheat novel miRNAs and 524 target genes for 124 miRNA families in a genome-wide level, and our data will serve as a foundation for future research into the functional roles of miRNAs in wheat.
KeywordsmiRNA Family Wheat Genome Conserve miRNA Family Degradome Sequencing Degradome Library
Small RNAs, including small interfering RNAs (siRNAs) and microRNAs (miRNAs), are involved in both the transcriptional and posttranscriptional control pathways within nearly every crucial gene cascade in eukaryotic cells [1, 2]. MiRNAs are single-stranded non-coding RNAs with sizes most often ranging from 20–22 nucleotides (nt) . MiRNA loci are transcribed by RNA polymerase II into primary miRNA transcripts (pri-miRNAs) that are processed by nuclear RNase III-like enzymes, such as Dicer and Drosha in animals  and DICER-LIKE proteins (for example, DCL1) in plants . After being transported to the cytoplasm, miRNAs are incorporated into the RNA-induced silencing complex (RISC) to exert their regulatory functions through cleavage or translation inhibition based on the nearly complementary binding of an mRNA target [6, 7]. The examination of miRNAs from various plant species has revealed their possible involvement in organ development, cell differentiation, hormone signaling, biotic and abiotic stress responses, genome maintenance and integrity, and diverse physiological processes .
Sequencing of small RNAs is a starting point for understanding their number, diversity, expression and possible roles in plants. Published reports as well as publicly accessible miRNA datasets from different plant species suggest that plant miRNAs are highly complex and abundant. As of June 2013, release 20.0 of the miRBase database contained 7,385 plant miRNA entries, including 337 from Arabidopsis, 713 from rice, 321 from maize, 241 from sorghum, 69 from barley, and 401 from populous (http://www.mirbase.org/) . Sequencing of small RNA populations in plants has established the existence of 16 highly conserved miRNA families with abundant expression, which overwhelmingly regulate the expression of transcription factors that are critical for development or stress responses. Recently developed deep sequencing technologies are uncovering an increasing number of lineage-specific or species-specific miRNAs exhibiting low or tissue-specific expression, which target diverse genes with specialized functions. For example, the Brassicaceae family-specific miR824 regulates the expression of AGAMOUS-LIKE 16, which plays a role in controlling stomatal density and development in leaves . Therefore, the identification of miRNAs in diverse species has been a major focus in recent years.
Based on miRNA discovery, the miRNAs putatively related to certain tissues development have been identified by deep sequencing technologies. For example, cotton miRNAs show a trend of repression during ovule and fiber development, and this rapid and dynamic change may contribute to ovule and fiber development in allotetraploid cotton . A diverse set of miRNAs and miRNA-like small RNAs have been identified from developing rice grains, some of which are differentially expressed during seed development . Sequencing of sRNA populations from soybean seeds and vegetative tissues has also revealed tissue-preferential expression for certain miRNAs . Interestingly, the recently evolved miR163 displays differences in spatial expression between Arabidopsis thaliana and Arabidopsis arenosa and in their allotetraploids . These data suggest that species-specific miRNAs and the spatio-temporal regulation of conserved miRNAs play important roles in shaping morphological and developmental variation among related species during evolution [14, 15, 16].
MiRNA binding to complementary sequences in target mRNAs regulates eukaryotic gene expression at the post-transcriptional level through mRNA degradation or translational repression [17, 18]. Most plant miRNAs induce the degradation of their mRNA targets through precisely cleaving the target sequence between the tenth and eleventh nt from the 5′ end of the miRNA binding site . With the emergence of high-throughput sequencing technologies, degradome analysis or PARE (parallel analysis of RNA ends), which can globally collect 3′ fragments of mRNA targets, is the current choice for validating miRNA targets that are cleaved . Using this method, a large number of target genes have been successfully identified in Arabidopsis [20, 21], rice , soybean  and wheat . These validated targets include transcription factors that play key roles in development and genes involved in a variety of other physiological processes. In addition, miRNA-guided cleavage initiates the entry of primary transcripts into the phase-siRNA biogenesis pathway. For example, Arabidopsis thaliana ta-siRNAs form from primary transcripts that are initially targeted and cleaved by the AGO1–miR173 (TAS1 and TAS2), AGO1–miR828 (TAS4) or AGO7–miR390 (TAS3) complex [25, 26, 27, 28]. The tomato 22 nt miR4376 triggers the formation of phase-siRNA from its target ACA10 gene and may function as a novel layer of a molecular mechanism underlying tomato reproductive growth .
Hexaploid wheat, Triticum aestivum L. (2n = 6× = 42; genomes AABBDD) is one of the most widely cultivated crops globally due to its high yield and nutritional and processing qualities, providing 20% of the calories consumed by humans (FAO 2011). Previous studies attempted to identify miRNAs associated with development and stress response in wheat by sequencing small RNA population [24, 30, 31, 32, 33, 34, 35, 36] or by computational strategies [37, 38, 39]. For example, our group and Wei et al. identified 43 and 48 wheat miRNA families by sequencing pooled RNAs from leaves, stems, roots and spikes [32, 35]. Li et al. also constructed small RNA and degradome libraries leading to identification of 32 miRNAs and their targets from wheat seedlings . From developing grains, around 540 miRNAs putatively associated with grain development were identified . Only a small scale of miRNAs was determined spactial-temporal expression pattern along wheat development, and majority of detected miRNAs were preferentially expressed in certain tissues. However, no whole genome scale miRNA identification and expression comparison among multiple tissues types or developmental stages has been done. In this study, we selected 11 tissues throughout the wheat growth to discover wheat miRNAs in whole genome scale. Moreover, previous study on wheat miRNAs prediction relied on extremely limited wheat genome sequences, and given the larger genome size of wheat, there may be additional miRNAs that have not been identified. Recently, based on the whole-genome shotgun strategy, draft genomes for bread wheat , its A-genome progenitor Triticum urartu (2n = 14; AA)  and its D-genome progenitor Aegilops tauschii (2n = 14; DD)  have been reported. Furthermore, next-generation sequencing data of flow-sorted individual chromosome arms of wheat were also partly available, provided by International Wheat Genome Sequencing Consortium (IWGSC) (http://www.wheatgenome.org/). Indeed, a recent study predicted miRNAs on wheat chromosome 1AL, 6B and 5D [39, 43, 44]. In this study, in order to discover wheat miRNAs in whole genome scale by experimental approach, we identified 689 miRNAs from multiple wheat tissues of different developmental stages based on all of the genome sequences available.
Hexaploid wheat is one of the most widely cultivated crops globally due to its high yield and nutritional and processing qualities, providing 20% of the calories consumed by humans (FAO 2011). Despite its agricultural importance of wheat grains, research on the molecular basis of development of wheat grains is limited. Some topics that have been studied include expression profiles of metabolic proteins in endosperm  and of mRNA in whole grain . The role of miRNAs during grain development is still unknown, and identification of grain development associated miRNAs could accelerate the progress of wheat improvement and potentially increase its production. In this study, we further screened the miRNAs that were preferentially expressed in wheat grains, which might play important roles in grain development.
Distribution of small RNA populations in multiple wheat tissues of different developmental stages
Summary of 11 wheat small RNA libraries
20-24 nt/Total (%)
Genome-wide discovery of wheat miRNAs
Around 539 wheat miRNAs putatively associated with development and stress response have been identified by sequencing small RNA population [24, 30, 31, 32, 33, 34, 35, 36] (Additional file 1: Table S1). Firstly, we confirmed the presence of a total of 366 known miRNAs from 260 families sharing the exactly same sequences with reported wheat miRNAs in our small RNA sequencing dataset (Additional file 1: Table S1). In order to find novel variants of known miRNAs, we searched small RNAs with 1 or 2 mismatches to known miRNAs in our small RNA sequencing data libraries by use of homolog analysis, which leading to 119 novel variants belonging to known miRNA families (Additional file 2: Table S2).
Identification of wheat-specific miRNAs and wheat-specific variants for conserved miRNAs
To date, highly and moderately conserved miRNAs have been identified from eudicots to basal plants [53, 54]. In this study, we found that all 16 known highly conserved miRNA families were detected in our dataset. Moreover, we also identified 30 wheat-specific variants from 9 highly conserved miRNA families, including miR159, miR160, miR167, miR169, miR171, miR172, miR393, miR396 and miR398 families (Additional file 3: Table S3). These findings indicated that different members of the same miRNA family might evolve at different rates within the same plant species, or most likely associated with the polyploid nature of wheat. To screen the wheat specific miRNAs, we explored the presence of 323 novel wheat miRNAs across Arabidopsis, soybean, rice, maize, sorghum, barley and Brachypodium. Among them, 191 were wheat-specific, while orthologs were found for the remainder in other species (Additional file 3: Table S3). We also found 2 monocot specific miRNAs including tae-miR3014b and tae-miR3075 which were conserved among all of the monocots we examined. We analyzed the origin of wheat miRNAs including known and novel miRNAs along the grass evolution, and the results indicated that 55 miRNAs were shared by all monocots indicating their origin from ancient ancestors and 87 miRNAs diverged and retained in wheat, barley, rice and Brachypodium after divergence of maize and sorghum from rice. A total of 106 wheat miRNAs were shared with barley but loss in Brachypodium and rice (Additional file 3: Table S3). These results indicated that a large number of wheat miRNAs were born at divergence of barley and wheat from rice.
Transcriptome-wide identification of miRNA targets in wheat through degradome sequencing
To gain insight into the functions of known and novel miRNAs in wheat, miRNA target genes were identified through a degradome sequencing approach. Four libraries, prepared from germinating seeds, seedling leaves, seedling roots and grains collected 8 DAP were constructed for degradome sequencing, and more than 10 M high quality reads were obtained from each library. Because 24 nt miRNAs mainly mediate DNA methylation, only the identified miRNAs with sizes of 20–23 nt were subjected to further target gene analysis in this study.
Based on degradome sequencing, a total of 524 potential targets were identified for 124 wheat miRNA families (Additional file 4: Table S4). The number of predicted targets per miRNA (4.2) was higher in wheat as compared to Arabidopsis (2.9)  and rice (2.8) , suggesting the existence of additional paralogous and homoeologous genes in this hexaploid species. Among these target genes, 44.7% and 45.8% were regulated by miRNAs at the ORF and 3′ UTR, respectively, and only 9.5% of the genes were targeted in the 5′ UTR. Notably, the cleavage analyses revealed a total of 20 target transcripts that were targeted by more than two distinct miRNAs. For instance, the unigene encoding ATP-sulfurylase 3 was targeted by miR395 within the coding region and by Ta-miR2041, Ta-miR2047 and tae-miR3020 in the 3′ UTR. Although ATP-sulfurylase is similarly targeted by miR395 within its coding region in rice, we did not find similar miRNAs targeting its 3′ UTR, indicating miR395 combined with other miRNAs can target ATP-sulfurylase in a potential wheat-specific pathway.
Identification of miRNAs that are preferentially expressed in wheat grains
Wheat genome contains a huge set of conserved and wheat-specific miRNAs
Previous studies have reported identification of 510 miRNAs associated with development and stress response in wheat by sequencing small RNA population [24, 30, 31, 32, 33, 34, 35, 36]. These known miRNAs were identified from certain tissues such as seedlings or developing grains, or from mixed tissues including stems, leaves, roots and spikes. In this study, we extended the identification of wheat miRNAs to 689 and broaden the knowledge of tissues that some miRNAs preferentially expressed. The present study represents the first whole genome scale identification of wheat miRNAs from diverse tissues and the first large scale expression comparison among various tissues. Without a sequenced genome for wheat, it is difficult to map miRNAs to wheat genome sequences and predict potential foldback structures; therefore, these studies have provided only a partial understanding of wheat miRNA population. Recently, based on the whole-genome shotgun strategy, draft genomes for bread wheat , its A-genome progenitor Triticum urartu (2n = 14; AA)  and its D-genome progenitor Aegilops tauschii (2n = 14; DD)  have been reported, which will facilitate genome-scale miRNA analyses in wheat. In the present study, we systematically annotated a total of 689 miRNAs belonging to 536 families in 11 different tissues based on the draft wheat genome sequences, identifying 69 highly conserved miRNAs and 191 wheat-specific miRNAs at a genome-wide scale, thus significantly increasing the number of known miRNA genes in wheat. This extremely large set of miRNAs is likely associated with the polyploid nature of wheat, which is reasonably consistent with the higher gene numbers, ranging from 94,000 to 96,000, reported in bread wheat than its diploid progenitor and other species such as rice, maize and Arabidopsis. For a given miRNA locus in a diploid, there are three loci in a hexaploid, or more if the locus was duplicated in the diploid or tetraploid progenitor species or duplicated after the allopolyploid event, which might lead to a particularly large set of miRNAs at the genome scale.
Origins and evolution of wheat miRNAs
Many canonical miRNAs are conserved among moss, eudicots, and monocots, and some regulate conserved targets and display conserved functions among land plants. Our observations regarding 69 highly conserved miRNAs from 16 families provided evidence that these miRNAs are evolutionarily conserved in the plant kingdom. However, we also found multiple wheat-specific variants of conserved miRNAs exhibiting nucleotide substitutions as compared to other species. The divergence of these variants within highly conserved miRNA families might suggest that they have evolved at different rates. Furthermore, through degradome sequencing, we identified and validated a large set of non-conserved targets for the conserved miRNAs, in addition to the conserved target genes. However, the fact that these variations may or may not affect target specificity in wheat although they are wheat-specific, raising the question whether these conserved miRNA variants evolved independently to acquire wheat-specific functions.
The wheat-specific miRNAs identified in this study are particularly interesting because they may function in a species-specific manner in wheat growth and development. In plants, a minority of the annotated miRNA gene families are conserved between plant families, while the majority are family or species specific, suggesting that most known miRNA genes arose relatively recently in evolutionary time . In the present study, several observations indicated that unlike highly conserved miRNAs, species-specific miRNAs are often weakly expressed, processed imprecisely and lack targets. First, we compared the average abundances of highly conserved and wheat-specific miRNAs, which indicated that the average number of reads for highly conserved miRNAs reached 71188.7 RP10M, while that for wheat-specific miRNAs was 3583.6. Box plots show that the normalized expression of highly conserved miRNAs was significantly higher than that of wheat-specific miRNAs (T test, P < 0.01) (Additional file 7: Figure S2). Second, we identified 122 target genes for 17 conserved miRNAs and 71 target genes for 17 wheat-specific miRNAs, but we were unable to obtain putative targets for the remainder of the 174 wheat-specific miRNAs, suggesting a lack of or low expression of these target genes. Third, wheat-specific miRNAs regulated fewer targets (4 target genes per miRNA) on average compared to the highly conserved miRNAs (7 target genes per miRNA), further suggesting a lack of targets for the wheat-specific miRNAs.
Target genes of conserved and wheat-specific miRNAs
The integration of miRNAs in diverse biological networks relies on the conformation of their target genes. Therefore, degradome sequencing has been broadly applied to understand the roles of target gene degradation in transcriptional regulation. In this study, a total of 524 potential targets were identified for 124 putative miRNA families through degradome sequencing, and the cleavage of 19 genes was validated through 5′ RACE. For highly conserved miRNAs, 122 target genes were identified, including 92 targets that are conserved among other species and 30 non-homologous target genes. These non-conserved targets of miRNAs might evolve specific properties and display unique functions in wheat growth and development. In addition, it remains unclear whether the non-conserved targets and conserved targets shared by conserved miRNAs are related to a common biochemical pathway, although they are not homologous genes. It will be worthy to determine whether the non-conserved targets of conserved miRNAs are biologically relevant or merely represent a neutral, accidental event.
There are two outstanding characteristics related to the potential biological functions of the target genes of wheat miRNAs. First, the genes targeted by these miRNAs showed a strong tendency toward displaying transcription factors or transcription regulatory activity. The majority of these targets belongs to conserved ones regulated by highly conserved miRNAs and is involved in diverse aspects of plant growth and development. A large number of target genes were also found to be involved in protein metabolism, among which 38% and 34% were involved in protein degradation and synthesis, respectively. Enrichment of protein synthesis and degradation functions was observed in transition-stage SAMs, and protein synthesis, turnover and balance are required to establish a shoot meristem [59, 60, 61]. Therefore, our results might indicate that meristem development is also subject to miRNA regulation through the regulation of target genes responding to protein synthesis and degradation.
Highly or specifically expressed miRNAs during wheat grain development
The miRNAs found to be highly or specifically expressed during wheat grain development in this study are particularly interesting because wheat grains provide approximately 55% of the carbohydrates consumed by humans . To investigate the roles of small RNAs in grain development and to identify potentially seed-specific small RNAs, several groups have employed high-throughput sequencing technology to sequence small RNA populations from developing seeds in rice [12, 50, 63], barley , maize  and soybean [13, 23, 65]. The obtained sequencing data suggest that rice miR1428e_3p is highly expressed in grains and cleaves two SbRK1b kinases, which play a role in regulating starch accumulation based on their expression in the endosperm and aleurone . In addition, Arabidopsis miRNAs including miR160, miR166, and miR319 inhibit the expression of differentiation-promoting transcription factors such as ARF17, CNA, PHB, PHV, and TCP4 to enable proper embryonic patterning . The presence of a large set of miRNA molecules in the developing seeds from various species provides some indication that many processes that occur during seed development are under the control of miRNA regulation.
In this study, a total of 51 miRNAs from 36 families were found to be specifically expressed in developing grains, among which 28 miRNAs were wheat specific, indicating these miRNAs might be involved in wheat grain development in a wheat-specific manner. We also found a number of genes associated with grain development that serve as miRNA targets, such as gamma-gliadin and late embryogenesis abundant protein. We found a number of grain-abundant miRNAs specifically expressed in the embryo or endosperm during grain development (Figure 7C). For example, miR156a specifically accumulated in the embryo and gradually increased from 15 to 28 DAP. In Arabidopsis embryos, miR156 delays the production of maturation transcripts by directing the repression of SPL10/11. Therefore, wheat miR156 might also be involved in late embryo maturation during wheat grain development.
The 22 nt miRNA regulation pathway in seed development
The present study showed that 22 nt miRNAs displayed markedly higher expression level in seed tissues, including in dry grains, germinating seed embryos, and grains 8 and 15 days after pollination, compared to other tissues, which was quite contrary to the expression pattern of 21 nt miRNAs (Figure 2B). It has been reported that 22 nt miRNAs and siRNAs are associated with AGO1, which recruits the RNA-dependent RNA polymerase RDR6 to generate double-stranded RNA from 3′ cleavage fragments, initiating the production of a second wave of siRNAs, referred to as secondary or “transitive” siRNAs . We speculate that 22 nt miRNAs might involve in directing the generation of phased siRNAs during wheat seed germination and maturation. Enrichment of 22 nt sRNAs in grain has been reported in maize [28, 47, 48] and soybean , but not in rice [50, 51]. The available data suggest that a different selection of 22 nt siRNAs/miRNAs involved in seed development might have arisen during the evolution of dicotyledon and monocotyledon plants. The accumulation of 22 nt miRNAs might be optimized to simultaneously silence multiple members of a gene clade, and ta-siRNAs triggered by 22 nt miRNAs might serve as a means to extend the targeting range of the primary miRNA . In this study, northern blotting analysis revealed that two 22 nt miRNAs, tae-miR021b and tae-miR2003a, showed preferential accumulation in the embryo rather than the endosperm, suggesting important roles for miRNA-mediated gene regulation in wheat grain.
We conducted a genome-wide survey of wheat miRNAs from multiple wheat tissues of different developmental stages. The results indicated that a total of 323 novel miRNAs were characterized and 366 previously reported miRNAs were confirmed in our dataset. Furthermore, 524 potential targets for 124 miRNA families were determined through degradome sequencing. Based on the genome-wide identification and characterization of miRNAs and their associated target genes, we further identified 64 miRNAs preferentially expressing in developing or germinating grains, which could play important roles in grain development.
Eleven tissues of the hexaploid Chinese Spring wheat (Triticum aestivum L.) cultivar were employed as a source for generating small RNA libraries. Dry grains were used without any treatment. Embryos of germinating seeds and shoots were dissected from seeds soaked in a Petri dish covered with a layer of filter paper saturated with water for 12 hours and approximately 3 days, when leaves were just at the coleoptile tip. Seedling leaves and roots were obtained from seedlings growing in a growth chamber under a relative humidity of 75% and 26/20°C day/night temperatures, with a light intensity of 3000 lx when the third leaf was at least 50% emerged. To collect stems at the jointing stage as well as young spikelets, flag leaves and developing grains, plants were grown in field conditions. Young spikelets were collected when they reached 0–5 mm and 10–15 mm in length, and grains were collected at 8 and 15 days after pollination (DAP). Flag leaves were cut, and spikes were labeled at the beginning of flowering during the principal flowering stage.
RNA extraction, small RNA cloning and degradome library construction
Total RNA was isolated from frozen leaves using the TRIzol reagent (Invitrogen, USA) according to the manufacturer’s instructions. Low molecular weight RNA was enriched through precipitation with 0.5 M NaCl and 10% PEG8000. Approximately 100 μg of low molecular weight RNA was separated on a denaturing 15% polyacrylamide gel. RNA fragments with lengths between 18 and 26 nt was excised, purified from the gel, ligated to adaptors, reverse transcribed and subjected to PCR amplification. Approximately 100 μg of total RNA isolated from germinating seed embryos, seedling leaves, seedling roots and grains 8 days after pollination was used for degradome library construction, as described previously [20, 21]. Small RNA and degradome libraries were sequenced using the Illumina GA IIx platform (BGI at Shenzhen).
Identification of wheat miRNAs
The workflow for wheat miRNA identification (Additional file 8: Figure S3): The adaptor sequences were trimmed from the Illumina reads using ‘vector strip’ in the EMBOSS package. Reads with a length of 18–26 nt were mapped to the all of the available genome sequences including 454 reads with a 5X depth of coverage in the hexaploid wheat genome (http://www.cerealsdb.uk.net/CerealsDB/Documents/DOC_CerealsDB.php) , next-generation sequencing data of flow-sorted individual chromosome arms, provided by International Wheat Genome Sequencing Consortium (IWGSC) (http://www.wheatgenome.org/) and to wheat ESTs from the NCBI database and ESTs and cDNAs from the wheat genetic resources database (http://www.shigen.nig.ac.jp/wheat/komugi/ests/tissueBrowse.jsp;jsessionid=DD38CC8D511C04ADC414B40E0907544D.lb1) using the Bowtie package, version one . Only perfectly matched sRNAs were used for further analysis.
The wheat genome is estimated to be composed of approximately 80% repeats, and the degradation of larger RNA molecules, such as rRNAs, would contaminate the sRNA libraries. Therefore, to remove the sRNAs originating from sequences such as repeats, rRNAs, or tRNAs, any sequences with matching hit counts in the 5X coverage wheat genome over 500 as well as those that mapped perfectly to non-coding RNAs in the Rfam database (http://rfam.sanger.ac.uk/) and repetitive sequences stored in Plant Repeat Databases (http://plantrepeats.plantbiology.msu.edu/downloads.html) were considered repeat-, rRNA- or tRNA-associated siRNAs . The remaining clean sRNAs were subjected to miRNA identification using the modified package miReap, version 0.2. In the original version, minimum matched base pairs should be 14, which was revised to that mismatched base pairs was less than 4. MiRNA candidates with lengths of 20 nt to 24 nt and more than 20 reads in one library were used for the following analysis. The following key criteria were used for miRNA prediction: 1) the miRNA and miRNA* were derived from opposite stem-arms such that they formed a duplex with two-nucleotide 3′ overhangs; 2) the base-pairing between the miRNA and the other arm of the hairpin, which included the miRNA*, was extensive, such that there were typically four or fewer mismatched miRNA bases; 3) any asymmetric bulges were minimal in size (one or two bases) and frequency (typically one or less), especially within the miRNA/miRNA* duplex ; 4) the main miRNA sequence tag must cover at least 70% of all reads surrounding the miRNA start site from 20 nt upstream to 20 nt downstream of the site; and 5) the number of miRNA reads should be greater than 5 in either library.
The genome locus of miRNA precursors were determined through next-generation sequencing data of flow-sorted individual chromosome arms, provided by International Wheat Genome Sequencing Consortium (IWGSC) (http://www.wheatgenome.org/).
Wheat miRNA evolution and conservation analysis
All wheat mature miRNAs were searched against genome sequences of other species to check whether these miRNAs exist in other species, which included the Arabidopsis genome, version 10.0 (http://www.Arabidopsis.org/), Rice genome, version 7.0 (http://rice.plantbiology.msu.edu/), maize genome 5b.60 (http://www.maizesequence.org/index.html), Brachypodium distachyon genome (ftp://brachypodium.org/brachypodium.org/Assembly/), barley genome (http://188.8.131.52/gbrowse_new/), soybean genome and sorghum genome (http://www.phytozome.net/) using bowtie version 1. The sequences surrounding a miRNAmatching site (200 bp upstream and downstream) from the other species were extracted and checked using a modified version of miReap0.2 (http://mireap.sourceforge.net). The miRNAs for which no precursor could be found in any other genome were considered wheat-specific miRNAs.
Wheat miRNA target identification
We merged all of the wheat ESTs into a single wheat transcript dataset from NCBI, NCBI GEO database (GSE38344), EBI (ERP001415). A modified version of CleaveLand2  was used to find the potential targets of all of the wheat miRNAs supported by our sRNA libraries with an alignment score of no more than 4.5 and at least 5 degradome reads validating the miRNA-induced cleavage site in the transcript. Additionally, to examine the locations of cleavage sites, GETORF  was used to find all ORFs longer than 70 amino acids. The locations of the cleavage sites were determined according to the relationship of the cleavage site with the start and end positions of the ORF. The target genes were classified into MapMan functional categories after searching for homologs among the MapMan categories found in TAIR9 using blastn. To map the cleavage sites of the target transcripts, we performed RNA ligation-mediated (RLM) rapid amplification of 5′cDNA ends using a modified GeneRacer kit protocol (Invitrogen).
MiRNA expression analysis
Low molecular weight RNA (30 μg) was loaded into the lanes of a denaturing 15% polyacrylamide gel, resolved, and then transferred electrophoretically to Hybond-N + membranes (Amersham Biosciences, Buckinghamshire, UK). The membranes were UV crosslinked and baked for 2 hours at 80°C. DNA oligonucleotides complementary to miRNA sequences were end-labeled with γ-32P-ATP using T4 polynucleotide kinase (TaKaRa, Dalian, China). The membranes were prehybridized for more than 8 hours and then hybridized overnight in Church buffer at 38°C. Next, the blots were washed three times (two times with 2 × SSC + 1% SDS and one time with 1 × SSC + 0.5% SDS) at 50°C. Finally, the membranes were briefly air dried and then exposed to X-ray film for autography at -80°C. Images were acquired by scanning the films with FluorChem™ (Alpha Innotech, San Leandro, CA, USA).
SYBR® PrimeScript miRNA RT-PCR Kit were performed as manufacture’s instruction (TaKaRa). Briefly, 1 μg of total RNA was incubated with 10 μL of 2 × miRNA Reaction Buffer Mix (for Real Time), 2 μL of 0.1% BSA, 2 μL of miRNA PrimeScript RT Enzyme Mix, and 5 μL of RNase Free dH2O in a 20-μL reaction mixture. The temperature program was adjusted to run for 60 min at 37°C, 5 s at 85°C, and then 4°C forever. qRT-PCR was conducted on a Bio-Rad CFX96TM Real-Time System. Each reaction included 2 μL of product from the diluted RT reactions, 1.0 μL of miRNA primer (10 μM), 12.5 μL of SYBR Premix Ex Taq II(2×), and 8.5 μL of RNase Free water. The reactions were incubated in a 96-well plate at 95°C for 30 s, followed by 40 cycles of 95°C for 5 s, 59°C for 30 s, and 72°C for 30 s. All reactions were run in three replicates for each sample. The actin gene (GB#: AB181991) was used as the endogenous control. All of the probes and primers used in these analyses are listed in Additional file 9: Table S6.
Availability of supporting data
The data set including the raw sequencing data of 11 small RNA libraries and 4 degradome libraries in our study are available in SNBI SRA database under accession no (accession: SRP040143) (http://www.ncbi.nlm.nih.gov/sra/?term=SRP040143). The sequences of mature miRNA, miRNA* and precursors, as well as the precursor genome location and secondary structure are available in Additional file 2: Table S2. The target genes obtained in this study are available in Additional file 4: Table S4. MiRNA expression by high throughput sequencing along 11 tissues is available in Additional file 6: Table S5.
This work was financially supported by, Major Program of the National Natural Science Foundation of China (31322041, 31290210), the 863 Project of China (2012AA10A309) and the National Natural Science Foundation of China (30930058, 30600392, and 30871529).
- 11.Pang M, Woodward AW, Agarwal V, Guan X, Ha M, Ramachandran V, Chen X, Triplett BA, Stelly DM, Chen ZJ: Genome-wide analysis reveals rapid and dynamic changes in miRNA and siRNA sequence and expression during ovule and fiber development in allotetraploid cotton (Gossypium hirsutum L.). Genome Biol. 2009, 10 (11): R122-10.1186/gb-2009-10-11-r122.PubMedCentralCrossRefPubMedGoogle Scholar
- 13.Zabala G, Campos E, Varala KK, Bloomfield S, Jones SI, Win H, Tuteja JH, Calla B, Clough SJ, Hudson M, Vodkin LO: Divergent patterns of endogenous small RNA populations from seed and vegetative tissues of Glycine max. BMC Plant Biol. 2012, 12: 177-10.1186/1471-2229-12-177.PubMedCentralCrossRefPubMedGoogle Scholar
- 14.Ng DW, Zhang C, Miller M, Palmer G, Whiteley M, Tholl D, Chen ZJ: cis- and trans-Regulation of miR163 and target genes confers natural variation of secondary metabolites in two Arabidopsis species and their allopolyploids. Plant Cell. 2011, 23 (5): 1729-1740. 10.1105/tpc.111.083915.PubMedCentralCrossRefPubMedGoogle Scholar
- 20.German MA, Pillay M, Jeong DH, Hetawal A, Luo S, Janardhanan P, Kannan V, Rymarquis LA, Nobuta K, German R, De Paoli E, Lu C, Schroth G, Meyers BC, Green PJ: Global identification of microRNA-target RNA pairs by parallel analysis of RNA ends. Nat Biotechnol. 2008, 26 (8): 941-946. 10.1038/nbt1417.CrossRefPubMedGoogle Scholar
- 25.Montgomery TA, Yoo SJ, Fahlgren N, Gilbert SD, Howell MD, Sullivan CM, Alexander A, Nguyen G, Allen E, Ahn JH, Carrington JC: AGO1-miR173 complex initiates phased siRNA formation in plants. Proc Natl Acad Sci U S A. 2008, 105 (51): 20055-20062. 10.1073/pnas.0810241105.PubMedCentralCrossRefPubMedGoogle Scholar
- 26.Cuperus JT, Carbonell A, Fahlgren N, Garcia-Ruiz H, Burke RT, Takeda A, Sullivan CM, Gilbert SD, Montgomery TA, Carrington JC: Unique functionality of 22-nt miRNAs in triggering RDR6-dependent siRNA biogenesis from target transcripts in Arabidopsis. Nat Struct Mol Biol. 2010, 17 (8): 997-1003. 10.1038/nsmb.1866.PubMedCentralCrossRefPubMedGoogle Scholar
- 29.Wang Y, Itaya A, Zhong X, Wu Y, Zhang J, van der Knaap E, Olmstead R, Qi Y, Ding B: Function and evolution of a MicroRNA that regulates a Ca2 + -ATPase and triggers the formation of phased small interfering RNAs in tomato reproductive growth. Plant Cell. 2011, 23 (9): 3185-3203. 10.1105/tpc.111.088013.PubMedCentralCrossRefPubMedGoogle Scholar
- 35.Wei B, Cai T, Zhang R, Li A, Huo N, Li S, Gu YQ, Vogel J, Jia J, Qi Y, Mao L: Novel microRNAs uncovered by deep sequencing of small RNA transcriptomes in bread wheat (Triticum aestivum L.) and Brachypodium distachyon (L.) Beauv. Funct Integr Genomics. 2009, 9 (4): 499-511. 10.1007/s10142-009-0128-9.CrossRefPubMedGoogle Scholar
- 40.Brenchley R, Spannagl M, Pfeifer M, Barker GL, D'Amore R, Allen AM, McKenzie N, Kramer M, Kerhornou A, Bolser D, Kay S, Waite D, Trick M, Bancroft I, Gu Y, Huo N, Luo MC, Sehgal S, Gill B, Kianian S, Anderson O, Kersey P, Dvorak J, McCombie WR, Hall A, Mayer KF, Edwards KJ, Bevan MW, Hall N: Analysis of the bread wheat genome using whole-genome shotgun sequencing. Nature. 2012, 491 (7426): 705-710. 10.1038/nature11650.PubMedCentralCrossRefPubMedGoogle Scholar
- 41.Ling HQ, Zhao S, Liu D, Wang J, Sun H, Zhang C, Fan H, Li D, Dong L, Tao Y, Gao C, Wu H, Li Y, Cui Y, Guo X, Zheng S, Wang B, Yu K, Liang Q, Yang W, Lou X, Chen J, Feng M, Jian J, Zhang X, Luo G, Jiang Y, Liu J, Wang Z, Sha Y, et al: Draft genome of the wheat A-genome progenitor Triticum urartu. Nature. 2013, 496 (7443): 87-90. 10.1038/nature11997.CrossRefPubMedGoogle Scholar
- 42.Jia J, Zhao S, Kong X, Li Y, Zhao G, He W, Appels R, Pfeifer M, Tao Y, Zhang X, Jing R, Zhang C, Ma Y, Gao L, Gao C, Spannagl M, Mayer KF, Li D, Pan S, Zheng F, Hu Q, Xia X, Li J, Liang Q, Chen J, Wicker T, Gou C, Kuang H, He G, Luo Y, et al: Aegilops tauschii draft genome sequence reveals a gene repertoire for wheat adaptation. Nature. 2013, 496 (7443): 91-95. 10.1038/nature12028.CrossRefPubMedGoogle Scholar
- 43.Tanaka T, Kobayashi F, Joshi GP, Onuki R, Sakai H, Kanamori H, Wu J, Simkova H, Nasuda S, Endo TR, Hayakawa K, Dolezel J, Ogihara Y, Itoh T, Matsumoto T, Handa H: Next-generation survey sequencing and the molecular organization of wheat chromosome 6B. DNA Res. 2013Google Scholar
- 55.Thimm O, Blasing O, Gibon Y, Nagel A, Meyer S, Kruger P, Selbig J, Muller LA, Rhee SY, Stitt M: MAPMAN: a user-driven tool to display genomics data sets onto diagrams of metabolic pathways and other biological processes. Plant J Cell Mol Biol. 2004, 37 (6): 914-939. 10.1111/j.1365-313X.2004.02016.x.CrossRefGoogle Scholar
- 62.Gill BS, Appels R, Botha-Oberholster AM, Buell CR, Bennetzen JL, Chalhoub B, Chumley F, Dvorak J, Iwanaga M, Keller B, Li W, McCombie WR, Ogihara Y, Quetier F, Sasaki T: A workshop report on wheat genome sequencing: international genome research on wheat consortium. Genetics. 2004, 168 (2): 1087-1096. 10.1534/genetics.104.034769.PubMedCentralCrossRefPubMedGoogle Scholar
- 69.Meyers BC, Axtell MJ, Bartel B, Bartel DP, Baulcombe D, Bowman JL, Cao X, Carrington JC, Chen X, Green PJ, Griffiths-Jones S, Jacobsen SE, Mallory AC, Martienssen RA, Poethig RS, Qi Y, Vaucheret H, Voinnet O, Watanabe Y, Weigel D, Zhu JK: Criteria for annotation of plant MicroRNAs. Plant cell. 2008, 20 (12): 3186-3190. 10.1105/tpc.108.064311.PubMedCentralCrossRefPubMedGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.