Introduction

Rice is a staple crop, of which increases in yield by genetic improvement are of great significance for world food security, as it provides roughly 50 % of the calories consumed by the world. In contrast to the swift expansion of the world’s population, slow or stagnant rice yield increases would have profound implications for world food security (Ray et al. 2012). Rice improvement can be divided into two categories based on the gene resources: endogenous genes and alien genes. Increasing evidence has documented that alien genes from other plants can be employed for rice improvement. Previous reports show that a gene encoding the large subunit of AGPase as a rate-limiting enzyme in starch synthesis, maize shrunken-2, can increase rice yield by 23 % (Hannah et al. 2012). Zmhdz10, a drought-induced HD-Zip I transcription factor overexpressed in both rice and Arabidopsis leads to enhanced drought and salt tolerance (Zhao et al. 2014). Transgenic rice plants overexpressing maize C4 pepc gene, which encodes phosphoenolpyruvate carboxylase (PEPC), enhance drought tolerance (Qian et al. 2015).

Sorghum, a Poaceae member related to sugar cane and maize, is the world’s fifth most important cereal crop (Mall et al. 2011). A genetic analysis revealed that about one-third of the Sorghum genome is similar to rice in gene order and density (Paterson et al. 2009). Genetic variation in the partitioning of carbon into sugar stores versus cell wall mass, and in perenniality and associated features such as tillering and stalk reserve retention, make Sorghum an attractive system for the study of traits important in perennial cellulosic biomass crops. Drought tolerance and C4 photosynthesis, biochemical and morphological specializations, make Sorghum especially important in dry regions. These specializations increase net carbon assimilation at high temperatures, disease and insect resistance and high yield make an excellent gene resource for the genetic improvement of rice (Wang et al. 2013).

Recently, 27 overexpressing transgenes from Arabidopsis were examined in Arabidopsis that led to the identification of dominant estradiol-dependent stress tolerance genes (Papdi et al. 2008). Because of the efficacy and high-throughput of rice transformation, the improvement of rice using a Sorghum cDNA library would be practical and significant.

In the present study, we constructed a Sorghum overexpression cDNA library, and generated thousands of transgenes. Agronomic characteristics revealed that many transformants exhibited a change in phenotype, of which nineteen lines showed high yield by acquiring either more effective panicles, bigger seeds, or higher grain numbers per panicle than the wild type, implying that this method is effective and practically useful for rice improvement.

Materials and methods

Plant materials

Sorghum TX430 plants (Sorghum bicolor L.) from Professor Lijia Li’s lab (Wuhan University, China) were grown in the greenhouse in Wuhan, Hubei province in China. Oryz. Sativa L. spp. Japonica, var ‘Nipponbare’ was used for transformation. T0 generation and T2 generation lines were planted in Wuhan and T1 generation were planted in Hainan in China.

Sorghum cDNA expression library construction

Total RNA (A260/A280 = 2.07, total mass was 0.87 mg) was extracted from Sorghum bicolor (L.) young panicles about 8–12 cm before the heading stage using TRIzol reagent (Invitrogen, Carlsbad, CA). mRNA (A260/A280 = 2.23, total mass is 2.87 μg) was isolated from the total RNA using the PolyATtract® mRNA Isolation System IV (without Magnetic Stand) Cat. #Z5310 (Promega). A cDNA library was constructed by using CloneMiner™ II cDNA Library Construction Kit (Invitrogen, USA).

Gateway technology (Invitrogen, USA) contains two steps: BP recombination reaction (PCR fragment + entry vector) and LR recombination reaction (entry vector after BP recombination reaction + destination vector). After LR recombination reaction, the cDNAs were recombined into overexpression vector pH7WG2D (Invitrogen, USA), and transformed using Agrobacterium-mediated transformation.

Rice transformation

Seed-embryos were induced to generate calli, which were then transformed based on the protocol (Hiei and Komari 2008) without heat treatment at 43 °C for 30 min, and the calli only growed on N6 medium (Chu 1978) and MS medium (Murashige and Skoog 1962). Seeds were surface-sterilized with 75 % ethanol for 1 min and 1.5 % HgCl2 for 10 min, washed with sterile ddH2O, and cultured on N6 medium (N6 + 2.5 mg/L 2,4-D) for about 45 days and subcultured twice. The calli were harvested and precultured on preculture medium (N6 + 20 mg/L acetosyringone) for 3 days, then incubated for 15 min in Agrobacterium tumefaciens resuspended (OD600 = 0.1) in Agrobacterium suspension medium (1/2N6 + 20 mg/L acetosyringone + 10 g/L Glucose). They were then dried on sterile filter paper and cocultivated with A. tumefaciens for 3 days at 21 °C on cocultivation medium (the same as preculture medium). After cocultivation, the rice calli were rinsed thoroughly (nine times) in sterile water containing 500 mg/L Cefotaxime Sodium Salt and transferred to selective medium (N6 + 50 mg/L hygromycin-B, 500 mg/L cefotaxime and 2.5 mg/L 2, 4-D) for 5 weeks at 28 °C. Deep yellow calli were transferred to selection medium for 2 weeks, then vigorously growing calli were regenerated on regeneration medium (MS + 2 ml/L 6-BA + 2 ml/L kinetin + 0.2 ml/L IAA + 0.2 ml/L NAA) for 3 weeks at 28 °C under light. Seedlings from regeneration medium were transferred to rooting media (1/2 MS) for 2 weeks, then adapted to the external environment for 1 week, and then transferred to soil.

Positive identification and expression analysis of transgenic lines

Primers for hygromycin B resistance gene (HYP) (Supplementary Table 1) were designed for the identification of positive regeneration plantlets. The amplification program consisted of 10 min at 95 °C, 35 cycles of 45 s at 95 °C, 45 s at 68 °C and 1 min at 72 °C, followed by 10 min at 72 °C.

For semi-quantitative RT-PCR, first-strand cDNA synthesis was performed with GoScript™ Reverse Transcription System (Promega, USA) using the total RNA extracted from young leaves. Semi-qPCR was performed with 1 μL of template cDNA in a 20-μL reaction system with Taq DNA Polymerase for 1 min at 94 °C, followed by 36 cycles of 45 s at 94 °C, 45 s at 55 °C, and 1 min at 72 °C. All semi-qPCR reactions were amplified using actin (AK061988) as internal control, and the PCR products were resolved by electrophoresis. The RT-PCR primers are listed in Supplementary Table 1.

Agronomic traits of 965 transgenic lines

Plant height and effective panicles of five plants per line for 965 lines were tested, and A rice panicle which bears more than 5 grains was called as an effective panicle. Heading dates of six plants per line were also recorded. Three plants per line were randomly chosen to test flag leaf length and leaf width, and 3 flag leaves per plant were recorded.

Agronomic traits of 93 transgenic lines

Five plants per line for 93 lines were chosen to test seed setting rate, effective spike, grain number per plant, filled grain number per plant, grains per panicle, grain weight per unit area, and yield per plant.

Tail-PCR

According to the Tail-PCR method (Liu et al. 2005), we designed three specific primers: RB0B, RB1B and RB2B at the right border of the gateway vector pH7WG2D (Supplementary Table 1).

Results

Construction of Sorghum bicolor (L.) cDNA overexpression library

Alien introgressions provide a rich gene resource for crop improvement (Gill et al. 2011), and alien gene expression by transformation is a popular and practical approach to improve crop plants (Hu and Lubberstedt 2015). In order to improve rice using alien genes from other plants especially the grass family of C4 plants, a Sorghum cDNA library was constructed using the young inflorescences (about 8–12 cm length) of Sorghum. The titer of the destination cDNA library reached to 5.2 × 106 cfu and the insert size ranged from 0.7–3.1 kb size (Fig. 1). Furthermore, we investigated the completeness of the cDNA orfs in the library, 37 clones were randomly selected and sequenced, of which, 36 clones had complete ORFs, and 20 were completely consistent with the full-length Sorghum cDNAs in NCBI (Supplementary Table 2). 10 novel genes were not present in the current NCBI database (Supplementary DNA sequences). These parameters suggest that the cDNA overexpression library was of high enough quality to be used for transformation.

Fig. 1
figure 1

Construction of Sorghum cDNA library. Recombined cDNA size in E. coli destination library was evaluated by restriction endonuclease BsrGI. M1 is 1 kb ladder

Generation of transgenic rice lines

1518 independent transgenic lines were generated through Agrobacterium transformation and regeneration. Of which, 1153 positive lines were confirmed by PCR analysis (Figure S1) with a transformation efficiency of 76 %. In order to accurately evaluate the genetic effects of the transgenesis the copy number of the transgenes were investigated with genomic PCR (Ichikawa et al. 2006). Most lines harboured only one alien gene, the copy number of alien genes was about 1.26 on average in the 586 investigated transgenic lines (Fig. 2b). Furthermore, the completeness of the inserted cDNA in the transgenic lines was analyzed. In the 45 randomly selected transgenic lines, 42 cDNAs had complete ORFs, 21 cDNAs were consistent with the full-length cDNAs of Sorghum deposited in the NCBI database, and insertion sizes ranged from 437 to 2661 bp (Supplementary Table 3). RT-PCR analysis revealed that the alien genes were expressed in transgenic lines (Fig. 2a). These investigations suggest that the altered characteristics in the transformed rice lines were likely conferred by the inserted Sorghum genes.

Fig. 2
figure 2

Molecular identification of the transgenic rice population. Note: a Expression analysis of Sorghum genes in the rice transgenic lines by semi-quantitative PCR. Actin was used as internal reference gene. b The copy number in transgenic rice lines

Observation on the agronomic traits of the transgenic lines

To further evaluate the genetic effects of Sorghum cDNA inserted on rice development, the agricultural characteristics of transgenic lines were investigated carefully. The 965 fertile transgenic lines were tested based on the agronomical traits including effective panicle, plant height, leaf length, leaf width, and heading date (Fig. 3), although some plants exhibited changes in multiple traits.

Fig. 3
figure 3

Phenotype categorization of transgenic lines. a Leaf color; b Plant height; c Panicle architecture; d Effective panicle, e Leaf shape, scale bar = 7.33 cm. f Grain length, scale bar = 0.38 cm. g Grain width, scale bar = 0.40 cm

890 lines had longer leaves, and 563 lines showed wider or narrower leaves compared to the wild type. For heading date, 300 lines were delayed, while 169 lines were earlier than the wild type. 9 lines were taller, and 874 lines were shorter than the wild type. For the effective panicle, 807 lines were increased, and only three lines were lower than the wild type. These data suggest that the phenotype of gross morphology were changed in over 92 % percent of the transgenic lines (Table 1), which means that overexpression of Sorghum cDNA in rice caused a great morphology variation.

Table 1 Categories of the 965 transgenic lines based on agronomic traits

Characterization of the high yield transgenic lines

Grain yield is the most important characteristic for rice improvement, and is a great focus for breeders around the world. In the present study, we investigated lines with obvious yield improvements, and yield-related traits such as effective panicles, grain size and grain number. In total, 93 lines were selected and were investigated further in four yield-determined traits containing seed setting rate, effective panicle, 1000-grain weight and grain number per panicle. Of these lines yield was increased in 19 lines relative to the wildtype (Table 2), 29 lines had reduced yield, and another 45 lines lacked an apparent change (Supplementary Table 4).

Table 2 Agronomic traits of 19 higher grain yield transgenic lines

When we investigated the 19 yield-increased lines, effective panicles significantly increased in 18 lines (from 11.33 to 21.67) compared to the wildtype (7.67). 9 lines increased 1000-grain weight from (24.69 to 27.65 g) than in the wild type (23.72 g). One line had increased grain number (155.04) than the wild type (89.39). Then, we analyzed the insert fragments of the 19 high yield transgenic lines to explore the potential function of these genes in plants. 14 genes were separately cloned from 14 high yield transgenic lines, and sequence analysis showed that CDSs (coding sequence) of 13 genes were completely consistent with genes in the NCBI database, with the exception of line 0402, which had a 170 bp deletion at the 5′ end of the CDS (Supplementary Table 5).

Genetic validation of phenotypes in transgenic lines

We investigated a possible mutation in transgenic line 0752, which exhibited dwarf plants, smaller grains, dense and erect panicles (Fig. 4). Tail-PCR showed that the insert gene Sb01g013740 was located at 7523486 bp on chromosome 3, which is 4.4 kb upstream of the Os03g0242300 (LOC_Os03g13870.1, 1386 bp), and did not affect other genes. Sb01g013740 encodes a patatin-like phospholipase domain containing protein (Marchler-Bauer et al. 2015) which is similar to DEP3 in rice. Phenotypic analysis showed that a rice dep3 mutant with a deletion in the last 47 bp of the third exon exhibited a very similar phenotype to line 0752, with the exception of plant height (Qiao et al. 2011). This implies that Sb01g013740 in Sorghum may have the same function as DEP3 in rice. Furthermore, we transformed the Sb01g013740 into rice again to confirm the phenotype. The positive T0 transgene rice lines were tested with PCR (Figure S2), and demonstrated a similar phenotype to the 0752 line, with the exception of lacking dense and erect panicles (Fig. 4 and Supplementary Table 6). This discrepancy could be because the T0 generation transgenic lines might be too weak to keep the panicles erect or perhaps the transgenic lines were not homozygous.

Fig. 4
figure 4

Retransformation verification of Sb01g013740 from the 0752 line. a Gross phenotype of WT and transgene line L1, scale bar = 7.33 cm. b Panicles of WT, 0752 line and the retransformed lines of L1, L2 and L3, scale bar = 0.94 cm. c Grain size of WT, 0752 line and the retransformed lines of L1, L2 and L3, scale bar = 0.26 cm

Discussion

Our experimental results are very successful, and the reason can be from the following two aspects. First, It is feasible to use gateway technology to build the cDNA overexpression library, 50 % of full-length cDNAs of 82 (37 + 45) cDNAs in our library were between 43 % of 14 cDNAs (LeClere and Bartel 2001) and 60 % of 27 cDNAs (Papdi et al. 2008), so the data was more convincing; considering the high GC content of cDNAs in Sorghum, we thought it was not easy to get cDNAs of such a high integrity; in addition, when we done homologous gene alignment we found 10 new genes and new gene ratio was as high as 10/(37 + 45 + 14). Second, the method of cDNA overexpression library had its own advantages compared with activation-tagged method, FOX-hunting system (Full-length cDNA Over2-eXpressing gene hunting system) and BIBAC (Binary Bacterial artificial chromosome). Using the activation-tagged method, it may be not easy to analyse gene-phenotype relationship, because multiple genes at a distance of as much as 10.7 kb from the insertion site could be activated simultaneously (Jeong et al. 2006). FOX-hunting system was first built in Arabidopsis (Ichikawa et al. 2006), but the collection of full-length cDNAs is time-consuming and needs a huge amount of work in genome-wide (Kondou et al. 2010) and requires prior knowledge of genome sequences and gene annotations; and to our knowledge, this system only utilized full length-cDNAs of both rice and Arabidopsis which were collected by FAIS and RIKEN(Hakata et al. 2010; Ichikawa et al. 2006). BIBAC (Wang et al. 2015) has a transgenic bottleneck, which has not been broken to massively transform large DNA fragments into another species, and analysis of subsequent gene function is difficult, too.

Heterologous expression of alien genes has been a practical an effective approach for engineering improvements into crop plants (Pratap and Kumar 2014). Usually plant genes that confer an important function are employed to improve the same agricultural character in related crop plants (Datta et al. 2012; Jacobs et al. 2011; Li et al. 2012; Oliva et al. 2014; Pasquali et al. 2008; Shin et al. 2006; Zhao et al. 2015; Zheng et al. 2015). Unfortunately, It is inefficient to improve rice by transforming one gene at a time, but high-throughput overexpression of cDNA libraries from other plants can surpass this limitation, because foreknowledge of gene function is unnecessary. In our research, over 90 % of transgenic lines altered an agricultural trait, which shows that alien expression of Sorghum cDNA can greatly affect plant phenotype, although correlating genes with function requires further study.

We believe that the high yield of 19 strains were caused by the overexpression of these genes. The majority of copy number was single copy by genomic PCR, combined with the identification results of semi-quantitative RT-PCR, the expressed genes showed the dominant effect; the redundancy and repeated genes of rice accounted for 29 % (International Rice Genome Sequencing 2005), that is to say, loss of function genes in the insertion site may not result in phenotype change. In this research, 19 plants exhibited a higher grain yield than the wild type, of which, 16 lines increased over 50 % mainly as a result of increasing effective panicles (Table 1). Interestingly, of the 14 Sorghum genes identified from the high grain yield plants, seven genes have proposed functions related to stress response through homologous analysis (Table 2). Of which, XM_002452726.1 from the line 0067, a homologue of Zea mays dhn2 (Campbell et al. 1998), is a member of dehydrin family and is related to drought tolerance. XM_008661925.1 from the line 0815, is homologous to ERD15 (EARLY RESPONSIVE TO DEHYDRATION 15) in Zea mays, and is related to ABA response and stress tolerance in Arabidopsis thaliana (Aalto et al. 2012). XM_002452812.1 from the line 0887 is homologous to ascorbate-specific transmembrane electron transporter 1 (Cytochrome b561-1) in Zea mays, and to cytochrome b561 proteins, which play an important role in plant growth, development, and the prevention of damage to plants (Opiyo and Moriyama 2010). Manipulation of cytochrome b6/f complex can potentially improve grain yield (Yamori et al. 2015). As reported in an article, the yield increase is thus achieved by increasing the ability of stress (Mao et al. 2015). Considering that Sorghum has a strong adaptability to the environment, these analyses imply that Sorghum genes related to drought tolerance, salt tolerance, cytochrome, dehydrin, high temperature tolerance, disease and insect resistance can functionally enhance rice yield. This reveals that some evolutionarily homologous genes may show different functions when alien genes were expressed in different plant species (Gururani et al. 2015; Hong et al. 2014), and provide a novel approach to explore gene resources for plant improvement.

Retransformation was necessary, because the phenotype is not only from the transgene itself but also other factors such as gene disruption at the insertion site, and the mutation during tissue culture. Phenotypes caused by culture mutations in both rice and Arabidopsis transformation is 5–10 % (Larmande et al. 2008). So a careful comprehensive analysis of each mutant line is necessary to ensure a connection between a transgene and phenotype (Nishimura et al. 2006). In view of this situation, we put forward an idea which was introducing Ac/Ds transposon elements into overexpression transgenic vector and then the vectors were transformed into rice, similar to large-scale Ac/Ds transposon T-DNA insertion mutant library built by China National Rice Research Institute. At last, we mainly study effect of the transgene, analyse the phenotype of the transgenic progeny population for determining the relation between gene and phenotype, eliminating the need for transgenic tedious steps (especially when indica rice is not suitable to create large-scale mutant library).