Introduction

Cotton bunchy top (CBT) was first observed in Australian cotton fields in 1998–99 where it caused significant economic losses (Reddall et al. 2004). Symptoms include reductions in plant height, leaf surface area, petiole and internode length (Ali et al. 2007; Reddall et al. 2004) resulting in significant loss of fiber yield. Leaves also often display pale green angular patterns at the leaf margins and dark green centers with a leathery texture. The disease is spread by the cotton aphid (Aphis gossypii Glover) (Ali et al. 2007) that can acquire CBT within 5 min of feeding on infected plants, and viruliferous aphids can inoculate healthy plants after only an hour (Ali et al. 2007). The causal agent of CBT was recently identified as the cotton bunchy top virus (CBTV) belonging to the genus Polerovirus of the family Luteoviridae (Ellis et al. 2013). Viruses of the Polerovirus genus are aphid transmitted and contain non-enveloped single stranded positive-sense RNA genomes that contain six open reading frames (ORF0 to ORF5) (Mayo and ZieglerGraff 1996).

CBT symptoms and CBTV shares some similarities with another Polerovirus disease; cotton blue disease (CBD) and its causal agent cotton leafroll dwarf virus (CLRDV) (Pupim et al. 2008) that is widespread in Asia and South America. Partial genome sequence of CBTV revealed that its RNA-dependent-RNA polymerase (RdRP, ORF2) shared 90 % amino acid sequence identity to the RdRP of CLRDV, although other ORFs have lower homology. The majority of Gossypium hirsutum L. cultivars surveyed are susceptible to CBD, but resistant cultivars (mainly of African descent) have been identified (Royo et al. 2003), including the Australian cultivar ‘Delta Opal’ used in this study. Resistance to CBD has been shown to be controlled by a single dominant locus (Fang et al. 2010; Pupim et al. 2008).

CBT appears to be endemic to Australia and is currently managed by controlling its aphid vector with the use of insecticides, as most existing commercial cultivars are susceptible. Robust plant resistance is required for long term management of the disease. A source of CBT plant resistance in G. hirsutum has been identified in ‘Delta Opal’, but progress in introducing this trait into modern cultivars has been slowed by the difficulty of reliably assaying for the disease on large segregating breeding populations.

In this study the inheritance of resistance to CBT was analyzed in an F2 and F2:3 population from an intraspecific G. hirsutum cross between the resistant cultivar ‘Delta Opal’ and the susceptible ‘Sicot 70’. CBT resistance in the population was consistent with the segregation of a single dominant gene, Cbt that was mapped to a 1.7 cM region on chromosome A10. The flanking markers identified in this study will provide useful tools for breeders for the marker-assisted selection of CBT resistant varieties to replace current difficult and labor-intensive phenotypic screening.

Materials and methods

Plant materials

A cross was made between a single plant each of the G. hirsutum cultivars ‘Delta Opal’ (CBT resistant) and ‘Sicot 70’ (CBT susceptible). After selfing, a population of 206 F2 plants were evaluated for CBT resistance via infestation with viruliferous aphids (as outlined in CBT resistance assay). All F2 plants were allowed to self-fertilize after the CBT assay, and F3 seeds collected. For the F2 assay the resistant control plants consisted of CSX180, a previously identified homozygous CBT resistant line from an earlier cross of ‘Delta Opal’ to ‘Sicot 70’. For the F2:3 assays the resistant control was a previously assayed breeding line designated CSX0895, a backcross derived line of CSX180 containing the genetically modified traits MON531 and MON15985. A panel of 25 mostly Australian varieties with known CBT resistance (W. Stiller pers communication) were selected for SNP genotyping using the CottonSNP63 K array (Table S1).

CBT resistance assay

CBT resistance assays were performed at CSIRO Agriculture, Narrabri NSW (Australia). Seeds were directly sown in 8L pots in a glasshouse with the temperature maintained at 32 °C during the day and 18 °C during the night. For the experiments on individual F2 plants, three seeds per pot were sown and after establishment, thinned to one plant per pot. One resistant (either CSX180 for F2 and CSX0895 for F2:3 assays) and one susceptible control (parent ‘Sicot 70’) were included for every 22 test lines. When plants reached four true leaves, they were infested with aphids that had fed on cotton plants already expressing CBT symptoms. This was done by placing a minimum of 20 aphids on each test plant. The experiments were managed so that the plants were not lacking for nutrients or water. When present, aphid parasitoids were controlled by applying a solution of 0.3 g/L of the selective insecticide Spinosad. Four weeks after infestation, all aphids were killed with an application of 0.75 g/L Diafenthiuron. Plants were then monitored on a weekly basis for expression of CBT symptoms such as short internodes and mosaic like leaf patterning, and when symptoms were clearly visible on the susceptible controls, all plants were rated for symptoms on a weekly basis, generally over a 4 week period. Any plant with CBT like symptoms was considered susceptible to the disease. For subsequent F2:3 experiments, 10 seeds were sown in each pot and thinned to five plants per pot after establishment. A randomized complete block design was used, comprising one pot replicated four times, with a minimum of 16 plants assayed per F2:3 family. In total, over all CBT F2 and F2:3 assays performed during this study, 271 susceptible control plants were assayed and CBT symptoms were found to develop on an average of 87 % per assay.

DNA extraction

Leaf tissue was collected for all plants in each trial, immediately frozen in liquid nitrogen and stored at −80 °C then freeze dried and stored with silica gel until DNA was extracted. Total genomic DNA was extracted using the DNeasy® Plant Kit (QIAGEN GmbH-Hilden-Germany) as indicated by the manufacturer.

AFLP genotyping

AFLP detection was performed using a protocol adapted from Vos et al. (1995), with minor modifications for cotton (Lopez-Lavalle et al. 2012). One microgram of genomic DNA was digested with one of three sets of enzyme combinations: PstI and MseI, EcoRI and MseI, or ApaI and MseI to create different DNA libraries. Each library was amplified with 16 sets of primers containing two or three selective bases (Table S2). A molecular weight standard (ABI GeneScan-500 LIZ #4322682) was added at 1 % in Hi-Di Formamide (ABI #4311320) and separated on an ABI 3130xl capillary sequencer. Traces were analysed using ABI GeneMapper software v4.0 (Thermo Fisher Scientific, Waltham, MA).

AFLP analysis was performed on the parent lines and a subset of 40 F2:3 lines (14 homozygous resistant, 7 heterozygotes, and 19 susceptible). Approximately 45 bands were scored per reaction, and in total over 2000 fragments were surveyed for linkage to Cbt. A single band of 213 bp (designated CL213) derived from PstI-AT: MseI-GAC amplification was found to be linked to CBT resistance. The AFLP fragment was scored as a co-dominant marker by using the height of nearby non-polymorphic peaks as a reference to identify heterozygous alleles based on peak height (Figure S1). To identify its sequence, the complexity of the amplified fragments was reduced by amplifying with an additional selective base on the PstI-AT(A/C/G/T) primer. It was found that the correct nucleotide at the third position was T and so the fragment was amplified using PstI-ATT:MseI-GAC and isolated based on size. The DNA was eluted from the gel in water, and re-amplified using the PstI-ATT:MseI-GAC primers for Sanger DNA sequencing.

TaqMan genotyping

The SNP associated with markers NG0203481, NG0204310 and NG0211495 (DC20027) were genotyped using Taqman technology using primer sequences previously used to map CBD (Fang et al. 2010). The 7-μL TaqMan assay reaction contained 10 μM each of primers, 0.2 μM each of probes, 5 ng genomic DNA, and 1X TaqMan universal PCR Master Mix (Thermo Fisher Scientific, Waltham, MA). Amplification was carried out using either an ABI7900HT real-time PCR machine or an Eppendorf EP384 cycler, according to the cycling conditions recommended by Applied Biosystems, except the number of cycles was increased to 60. Pre-and post-PCR fluorescence were measured on an ABI7900HT and the results analyzed using the endpoint genotyping module of the ABI7900HT software package (Applied Biosystems Inc.).

Kompetitive Allele Specific PCR (KASP) genotyping

For KASP genotyping, DNA concentration of each sample was standardized to 15 ng/µL. The amplification was performed as for Zhu et al. (2015), in an 8 µL reaction, including 1 µL of DNA, 4 µL of 2 × KASP master mix (LGC Group), 0.11 µL of primer mix (12 µM of each allele-specific primer and 30 µM of common primer) and 2.89 µL of H2O, according to the manufacturer’s instructions. Assays were performed in 384-well format. PCR cycling was performed on an Eppendorf Mastercycler ep384 using the following protocol: hotstart at 95 °C for 15 min, followed by ten touchdown cycles (95 °C for 20 s; touchdown 65–57 °C, 0.8 °C per cycle, 60 s), then followed by 31 cycles of amplification (94 °C for 20 s; 57 °C for 60 s). Plates were read on the ViiA7 Real-Time PCR System (Life Technologies) at ambient temperature and analyzed using the Applied Biosystems ViiA7 software (v1.2.2) (Thermo Fisher Scientific, Waltham, MA). Oligonucleotides used in KASP analyses are shown in Table S3.

Illumina CottonSNP63K array

For genotyping using the Illumina CottonSNP63K array (Illumina, San Diego CA), DNA was standardized at 50 ng/µL for each of the G. hirsutum accessions. DNA was processed according to Illumina protocols and hybridized to the CottonSNP63K array at CSIRO Agriculture (Brisbane, Australia) according to the manufacturer’s instructions. Single-base extension was performed and the chips were scanned using the Illumina iScan (Illumina, San Diego CA). Image files were saved and analyzed using the GenomeStudio Genotyping Module (v 1.9.4, Illumina). Genotype calls for each SNP were based on the cluster file generated specifically for the CottonSNP63K array (Hulse-Kemp et al. 2015).

Linkage analysis

JoinMap 4.0 software was used for linkage analysis and map construction (Stam 1993). Kosambi mapping function was used to convert recombination frequencies into map distances (centimorgan, cM) with LOD score ≥5.0. For the F2 disease assays the CBT phenotypes were entered as a dominant trait in the initial F2 genetic map. The phenotype data from 73 F2:3 families was subsequently entered as a co-dominant trait for the final F2 genetic map (see Table S4 for all F2:3 genotype and phenotype data). The TM-1 (G. hirsutum) genome of Zhang et al. (2015) was used for DNA sequence comparisons and all genome nucleotide positions listed for markers are based on this publicly available sequence.

Phylogenetic analysis

Cluster analysis to reveal phylogenetic relationships between the 27 cultivars was conducted using R package ape and hclust function (R Development Core Team 2011) based on the simple matching distance of each pair of cultivars calculated using the same SNP set. The reliability of the dendrogram topology was confirmed with 1000 bootstraps with replacement.

Results

Inheritance of CBT resistance

A population consisting of 206 F2 lines from the cross between Delta Opal and Sicot70 and eight replicates of a CBT susceptible (‘Sicot 70’) and resistant (CS × 180) lines were tested for CBT resistance. Seven of the eight susceptible ‘Sicot 70’ parental plants displayed CBT related symptoms, whereas none of the resistant lines showed any sign of disease. This indicates false CBT negatives can occur in the assay. A total of 61 F2 plants were identified as susceptible to CBT disease. This is consistent with a ratio of 3:1 (resistant: susceptible, X2 = 0.15). All individual F2 plants were selfed to generate F2:3 seeds. To improve the accuracy of our CBT phenotype scores, approximately half (73 out of 145) of the F2 plants that did not display CBT symptoms, were evaluated for CBT resistance using F2:3 seeds. If the CBT resistance in ‘Delta Opal’ is controlled by one single dominant gene, heterozygotes to homozygous resistant families should be identified at a 2:1 ratio in these F3 families. F3 CBT assays identified 47 heterozygous to 24 homozygous resistant families (X2 = 1, with two families found to be homozygous susceptible), consistent with a single dominant CBT resistance gene designated Cbt.

Genetic mapping of Cbt

AFLP analysis was performed on the parent lines and a subset of 40 F2:3 lines (14 homozygous resistant, 7 heterozygotes, and 19 susceptible). Over 2000 AFLP fragments were surveyed for linkage to Cbt and only a single band of 213 bp amplified using PstI-AT: MseI-GAC primers (designated CL213) was found to be linked to the resistance locus in this population (2.5 cM). Genotyping CL213 on the full 206 F2 lines determined that CL213 was 4.0 cM from Cbt.

The CL213 DNA fragment amplified by the primer combination PstI-ATT and MseI-GAC, was gel isolated and sequenced, and found to be 191 nt long when primer adaptor sequences were removed. Sequence analysis of region from the parent lines revealed that the AFLP polymorphism was due to an A to T transversion that abolished the PstI restriction site in ‘Sicot 70’ (Figure S2). CL213 did not completely match any region of the draft TM-1 (G. hirsutum) genome sequence (Li et al. 2015; Zhang et al. 2015) but was very similar to a region located on chromosome A10 (~98,755,400). Fang et al. (2010), previously mapped the CBD resistance gene to a 0.8 cM region on Chr A10 flanked by SNP markers NG0204310 (Chr A10_ 99,881,319 nt) and NG0211495 (DC20027, Chr A10_ 99,894,811 nt). Since these markers are in close proximity to the likely location of CL213, three of the SNP markers developed by Fang et al. (2010) for the mapping of Cbd, (another close SNP marker NG0203671, was not polymorphic in our population) and four SNP markers (GhCSIRO_2461, GhCSIRO2551, GhCSIRO_2637 and GhCSIRO_2637, Table S3), previously identified between ‘Delta Opal’ and ‘Sicot 70’ (Zhu et al. 2014) that were in the same region, were mapped onto the F2 population (Table S4). All markers were found to be tightly linked to Cbt, confirming its location on chromosome A10 (Fig. 1). Initial analysis of the marker data indicated that three F2 lines (#120, #182 and #208) that had not displayed CBT symptoms in the F2 assay, were ‘Sicot 70’-like in genotype across the entire interval. F2:3 families of those lines were assayed for CBT resistance and found to be homozygous susceptible, indicating the original F2 plants were false negatives for symptom development. Updating the CBT resistance status of these three lines allowed Cbt to be mapped to a 1.7 cM region flanked by NG0203481 (A10_ nt 98,715,931) and GhCSIRO_2637 (A10_nt 99,614,176) on Chr A10 (Fig. 1). Based on the draft G. hirsutum genome from Zhang et al. (2015) there are 25 annotated genes (Gh_A10G2062 to Gh_A10G2086) in this region including 11 putative disease resistance like NB-LRR genes (Table S5).

Fig. 1
figure 1

Genetic map of Cbt compared to the genetic map of the Cbd locus (Fang et al. 2010) on chromosome A10. Arrows indicate markers in common between the two genetic maps. All distances in cM

Targeted association analysis

The recent development and availability of the Cotton SNP63K array enabled SNP analysis of a range of G. hirsutum cultivars through targeted association analysis, as was recently used to narrow the genetic interval underlying the Okra leaf shape in cotton and identify the causal gene (Zhu et al. 2015). Twenty seven cultivars of mainly Australian origin with known CBT phenotype (4 resistant and 23 susceptible, Table S1) were genotyped using the CottonSNP63K array (Hulse-Kemp et al. 2015). Phylogenic analysis indicated that genetic distances (Nei 1978) among the cultivars were generally small, averaging 0.137, and ranging from 0.007 between TamcotSP37 and TamcotSP37H, to 0.251 between SicotF1 and TamcotSP37H (Figure S3). The cultivars could be separated into three major clusters with the four CBT resistant lines distributed across two clusters (Figure S3). Close examination of the region around Chr A10 where the Cbt resistance trait had been initially mapped, indicated forty SNPs that were completely associated with either the resistant or susceptible phenotype, and these spanned a region of approximately 1,220,000 nts, from A10 nt 98,718,917 to nt 99,938,987, based on the G. hirsutum genome from Zhang et al. (2015) (Table 1, Table S6). This region is slightly larger, but overlaps the genetically mapped region spanned by the markers NG0204310 (A10_ nt 98715931) and GhCSIRO_2637 (A10_nt 99614176), establishing that targeted SNP association analysis in cotton cultivars can provide an alternative means of mapping traits in cotton, even from a relatively small number of cultivars.

Table 1 SNPs associated with CBT resistance

Discussion

Aphid transmitted diseases such as CBT are very difficult and time consuming to assay. Despite the assay being performed under optimal conditions for aphid growth there were a significant number of plants that initially appeared without CBT symptoms, but were later found to be susceptible (false negatives for symptom development). This was most clearly observed in the infection rate of our susceptible controls, as on average only 87 % of control plants (271 in total) displayed obvious symptoms in each set of assays. False negatives are most likely due to the failure of aphids to become infected when feeding on infected host, or failing to thrive and pass on the virus to their new host. The inability to reliably identify susceptible individuals makes it essential to screen multiple individuals from the same family in order to confidently determine CBT resistance, but this makes large scale screening of genotypes in the field or glasshouse prohibitive.

CBT resistance segregated as a single dominant locus and genetically mapped to a 1.7 cM region on chromosome A10. The use of the close flanking markers NG0204310 and GhCSIRO_2637 in breeding will enable rapid selection of CBT resistant germplasm with high accuracy, and have already been adopted in the CSIRO breeding program. As these markers do not underlie the causal gene for resistance (perfect markers), disease assay validation would still be required, but only on the reduced number of lines pre-selected using those markers. Based on the published G. hirsutum genome as a reference (Zhang et al. 2015), the flanking markers NG0204310 and GhCSIRO_2637 are estimated to be 898,000 nt apart, although the genome is only a draft sequence and so physical distances are only a guide. Targeted SNP association analysis on 27 cultivars with known CBT phenotype, located the CBT resistance to a slightly larger region 1,220,070 nt, that almost completely overlaps the genetically mapped region (overlap of 895,259 nt). This indicates that as long as a trait is genetically simple and the gene underlying the trait in the population surveyed is the same, targeted association analysis on a relatively small number of cultivars can provide a quick method for finding linked markers for a trait that will be amenable to marker-assisted breeding. The linked SNPs we have identified could be used to survey for CBT resistance from CottonSNP63K data in other cotton varieties, possibly identifying other backgrounds with CBT resistance that could be useful for breeding.

It has been previously noted that CBT resistant cultivars, such as ‘Sicala 3-2’, ‘Delta Opal’, ‘Guazuncho II’, ‘Reba’ and others that share similar pedigrees, are also resistance to CBD (Ellis et al. 2013). Genetic mapping of CBT resistance confirms that the gene is near the CBD resistance gene mapped by Fang et al. (2010) using three populations of ‘Delta Opal’ crossed to the susceptible cultivars ‘DP388’, ‘DP5305’ and ‘SG747’. Our mapping suggests that Cbt is located in an adjacent region but is not likely to be the same gene conferring CBD resistance (Fig. 1). However, the genetic distance and marker order between the flanking Cbd markers NG0204310 and NG0211495 (DC20027) are different in our population compared to Fang et al. (2010) possibly due a chromosomal rearrangement, making direct comparisons difficult (Fig. 1). The difficulties in reliably assaying for either CBD or CBT resistance means that we cannot rule out the possibility that the same gene is responsible for resistance to both viruses without testing our population for resistance to CBD, which is difficult as this disease does not occur in Australia. It will require identification of the underlying resistance gene before this matter can be resolved. Resistance to multiple viral diseases by the same resistance gene is not uncommon, for example, the tomato Sw-5 gene confers broad spectrum resistance against tomato spotted wilt virus, and two related viruses; groundnut ringspot virus and tomato chlorotic spot virus (Brommonschenkel et al. 2000). The high identity between some of the viral proteins of CBTV and CLRDV could mean that a similar viral product is detected by the host defence systems of resistant cultivars upon infection by either pathogen.

To date no plant resistance gene has been cloned that acts against Poleroviruses. The majority of single dominant anti-viral resistance traits in plants have been found to encode nucleotide binding leucine rich repeat proteins (NB-LRR) (de Ronde et al. 2014). Analysis of the G. hirsutum genome sequence between NG0204310 and GhCSIRO_2637 reveals eleven putative NB-LRR genes (A10G2067, A10G2068, A102070, A102072, A102073, A102074, A102076, A102077, A102078, A102080 and A102081) that could be potential candidates for Cbt (Table S5). Identification of the Cbt gene will require either fine mapping of the region and/or testing of each of the eleven potential NB-LRR candidates isolated from ‘Delta Opal’ transformed into a susceptible variety.

In summary, long term management of CBT in Australia requires the transfer of host plant resistance against CBTV to all current and future commercial cotton varieties. Phenotypical assays for CBT resistance have proven to be difficult and somewhat unreliable and as yet no new CSIRO CBT resistant germplasm has been released in Australia. The SNP markers identified in this study will significantly improve the speed and accuracy of screening for CBT resistant germplasm and that should lead to a more rapid deployment of resistant varieties in the future.