Introduction

Rice is the most important staple crop for half the world’s population. Improvements in rice yield and quality beyond the benefits of the green revolution of 30 years ago are required to meet the demands of an increasing global population. At the beginning of the 21st century, with the hope of finding creative solutions to the problems of food production, nutrition and transportation, nearly the entire nucleotide sequence of the rice genome, Oryza sativa L. cv. Nipponbare, was determined through a world-wide collaborative effort (International Rice Genome Sequencing Project 2005). The sequence data provide an important resource for promoting the discovery of important genes for crop improvement. Currently, a Rice Annotation Project (RAP) using nucleotide sequences of full-length cDNAs (The Rice Full-Length cDNA Consortium 2003) is in progress to position functional genes on the rice genome map (Ohyanagi et al. 2006); however, many genes remain “unknown” due to lack of experimental evidence or sufficient similarity with characterized genes from other organisms. The next great challenge after completion of genome sequencing is the functional characterization of these genes and discovery of genes that affect vital developmental, agronomic or biochemical plant functions (Hirochika et al. 2004).

Gene annotation is mostly based on the sequence similarity to known genes from other species. The limitations of this method are that every organism may have unique genes that do not have homologues even in closely related species, and assignment of a protein’s function based on similarity may only give a partial description. For example, a gene may contain a domain that is conserved among protein kinases, but the actual substrate of the enzyme would be difficult or impossible to determine without experimental evidence. Categorization using cladistic associations (i.e. a phylogenetic tree) is more sensitive and is able to detect BLAST mis-hits or false positives (Sjolander 2004). However, characteristics supported by experimental data, e.g., correlation between the phenotype and the function of the protein kinase, are indispensable to the exact annotation, because all of the associative algorithms ultimately depend on experimental data.

Mutational analysis through gene disruption is one of the most efficient methods for identifying gene function. The related approaches of interrupting gene expression with RNAi (Miki and Shimamoto 2004), and overexpression are also effective because the functions of genes can be determined by the correlation of disrupted genes and their associated phenotypes. Currently, more than 130,000 T-DNA insertion lines of Arabidopsis thaliana have been created and are publicly available (Alonso et al. 2003). Phenotyping of Ds insertion lines in Arabidopsis is also in progress (Kuromori et al. 2006). For characterizing monocot genomes, we have produced more than 50,000 disruption lines of Oryza sativa cv. Nipponbare (Japonica), using the endogenous retrotransposon Tos17, which has a “copy and paste” type of transposition activity (Miyao et al. 2003). There are two native copies of Tos17 in the Nipponbare genome that are activated specifically in cultured cells (Hirochika et al. 1996). On average, ten new copies of Tos17 are transposed in each cell during 5 months in culture. When plants have been regenerated from the cultured cells, Tos17 retrotransposition is immediately inactivated, and Tos17 copies become fixed and segregate in a Mendelian fashion in the next generation.

There are many advantages to the Tos17 disruption system for mutational analyses. Because the flanking regions of Tos17 insertion points are easily amplified by TAIL-PCR or a suppression PCR method using a 3′-end primer of Tos17, insertion sites in the rice genome can be easily determined. Furthermore, mutants can be screened by PCR using the Tos17 end primer and a primer from any desired genomic sequence. The distribution of transposed Tos17s is not random, and a large number of transpositional “hot spots” are detected throughout the rice genome. The insertion frequency of Tos17 into genic regions is three-fold higher than that into non-coding regions. Owing to this polarization, Tos17 insertion lines have great advantages for the functional analysis of rice genes (Miyao et al. 2003). Because Tos17 is an endogenous retrotransposon, regenerated lines can be grown in the field, and seeds can be exported without regulations associated with genetically modified organisms such as rice lines that have undergone “transgenic-” insertion of T-DNA (Jeon et al. 2000; Sallaud et al. 2003; Wu et al. 2003), Ds (Greco et al. 2003; Kim et al. 2004), or En/Spm (Kumar et al. 2005).

Mutant lines generated by Tos17 retrotransposition have already been used for the functional analysis of rice genes. For example, Tos17 insertion mutants with a dwarf or viviparous phenotype were used to identify and analyze genes for gibberellin and abscisic acid metabolism (Sakamoto et al. 2004; Agrawal et al. 2001). Phenotypic characteristics of most of the Tos17 insertion lines, however, remain to be described. A large number of plant scientists working with rice or other monocotyledonous species could benefit from a systematic phenotypic analysis of many Tos17 insertion lines and the creation of a public database. To promote the functional analysis of rice genes, the phenotypes of all of our mutant lines have been observed in rice fields through the collaboration of seven laboratories. Collected phenotypic data are useful for predicting the function of a disrupted gene. In this paper, we report the phenotypic statistics of a Tos17 insertion mutant population for the discovery of agronomically important genes.

Materials and methods

Plant materials

Nipponbare calli derived from embryos were grown in N6 liquid medium (Otsuki 1990) containing 1 mg/l 2,4-D for 5 months. In total, about 50,000 plants were regenerated. Seeds of the M2 generation were harvested from each M1 plant. To check the activity of Tos17 under various hormone conditions, N6 liquid media containing 1, 2, 5, 10 or 20 mg/l 2,4-D with or without 0.1 mg/l BA were used.

Phenotyping

Ten to twenty-five seeds were planted per line. Germination rates and seedling phenotypes were observed in the nursery. After 1 month from seeding, the seedlings were transferred to the paddy field. Phenotypes in the field were observed at the vegetative stage, near the heading stage, at the seed maturation stage, and at harvest. Lines segregating abnormal plants at about 25% frequency were digitally photographed and assigned a phenotype ID.

Database

All phenotype data were stored into a relational database on the PostgreSQL relational database managing system (http://www.postgresql.org/) with a FreeBSD 5.5 operating system (http://www.freebsd.org/). FreeBSD 6.1 was used for the statistical analysis. Tabular structure for phenotypic description consists of the line name, plant serial number, observation date, observing person, a link to the photograph file, phenotype ID, and detailed descriptions. Additional data for each line are inserted as a new row. Data can be modified only by the observing person. With this table structure, observation logs for each line by many persons over the course of many years can be stored without conflict. For counting lines of each mutant phenotype, Perl script, which can connect with the PostgreSQL server with a Pg.pm module, reads the phenotype table, stores line names and phenotype IDs into an associative array (“hash”), to convert from a redundant number of phenotype IDs to a unique number for each line. The number of lines showing each phenotype ID in the hash was counted. Flanking sequence data of Tos17 insertion sites are also stored in the same database. Loci of Tos17 insertions are determined by BLASTN searches against rice genome sequences of the International Rice Genome Sequencing Project (IRGSP) Build3. Loci of annotated genes are also stored in the relational database.

SOM analysis

Software for SOM analysis was downloaded from the Neural Networks Research Center of Helsinki University of Technology (http://www.cis.hut.fi/research/som_pak/). For the initialization program, randinit, parameters: 24 for x dimension, 16 for y dimension, hexa for topology, bubble for neighborhood function, and 123 for seeds were used. For the map training program, vsom, parameters for first learning: 1000 for learning length in training, 0.05 for initial learning rate, 10 for initial radius of the training area in som-algorithm, parameters for second learning: 10000 for learning length, 0.02 for initial learning rate, 3 for initial radius. The analyzed data were visualized using the umat program.

Results

Flow of phenotypic analysis

Nipponbare calli from 92 seeds were independently cultured for 5 months, and about 50,000 plants were regenerated. Seeds (M2 generation) from the regenerated plants were independently harvested and labeled with a line name. Ten to twenty-five M2 plants of each line were observed over a full developmental cycle in the nursery and paddy.

Each line was designated by “N” for “Nipponbare” followed by a letter A-G that indicates the yearly lot, and four figures, e.g. NA1234. Lines named with “NA” and “NB” were used in a small-scale pilot study and “NC” to “NG” were used in large-scale studies.

To evaluate the effect of hormone (auxin and cytokinin) concentration in the medium on Tos17 retrotransposition activity, the NC line was subcultured in media containing 1, 2, 5, 10, or 20 mg/l of 2,4-dichlorophenoxyacetic acid (2,4-D) with or without 0.1 mg/l benzyl adenine (BA). There was no significant difference in observed transposition events among lines derived from the subcultures. In lots, ND, NE, NF, and NG, calli were induced in a medium containing 2 mg/l 2,4-D, and maintained in a medium supplemented with 1 mg/l 2,4-D for 5 months. The cultural conditions for each line are available with the phenotypic description list from the mutant panel database.

Co-segregation of mutant phenotypes with Tos17 insertion can be detected by DNA blot hybridization, which will help the further analysis of gene function. If co-segregation is detected, the flanking region of co-segregated Tos17 can be isolated by TAIL-PCR or suppression PCR (Miyao et al. 1998).

Classification of phenotypes

To provide a classification system and a database useful for most rice scientists, phenotype scoring was limited to 53 phenotype descriptors belonging to 12 classes. Because environmental conditions such as day length, temperature and soil conditions differ significantly among the seven fields in which this project was conducted, some variability in traits such as heading date or leaf color may have occurred between different fields. Thus, some of the present data may vary with environmental conditions, although most of mutant phenotypes are stably expressed. Phenotype classifications and a summary of observations are shown in Table 1. Each subclass has a phenotype ID code to enable data entry as a barcode with a portable recording device and to enable data compilation from all seven laboratories.

  1. 1.

    Germination. This trait was evaluated by measuring germination rate under defined conditions. Since wild type (cv. Nipponbare) showed a germination rate higher than 95%, lines that showed germination rates less than 75% were recorded in the present project. Since several laboratories measured germination rates of all lines, all primary data are also stored in the database. In total, 3489 lines showed a low germination rate, and 525 lines showed germination rates less than 50%. Some of the lines with low germination rates may be embryo mutants.

  2. 2.

    Growth. Growth was observed at the seedling stage in the rice nursery or at an early stage after transfer to the field. “Weak” refers to mutants that formed slim seedlings with retarded growth, probably caused by a deficiency in some housekeeping gene product (Fig. 1, NG0352). Some of the “Weak” lines were reclassified eventually to “Lethal”. An example of “Abnormal shoot” is shown in Fig. 1 (NG0356). Another example is represented by NE3024. This seedling is smaller than wild type and its leaves are short and wide, resembling mutants defective in genes associated with gibberellin biosynthesis or signaling pathways (Sakamoto et al. 2004; Uegchi-Tanaka et al. 2005).

  3. 3.

    Leaf color. Frequently appearing pigmentation phenotypes are “Albino” (Fig. 1, NG1048) and “Virescent” (Fig. 1, NE1517). Completely white or yellow (Fig. 1, NG1469) seedlings died within 3 weeks after seeding. If green and white segments coexisted on leaves, a condition called virescent, the seedlings survived in the field. Zebra mutants that show repetition of a white or a pale green band and a green band in the longitudinal direction were also observed (Fig. 1, NF6044). However, in most lines, the “Zebra” phenotype was limited to young stages. The “Stripe” phenotype (Fig. 1, NE4001) often showed extremely biased segregation, e.g., only one plant in 25, and often was not stably inherited.

  4. 4.

    Leaf shape. Many different kinds of abnormally shaped leaves were observed. “Short Leaf” and some “Wide leaf” phenotypes were eventually reclassified as “Dwarf” or “Severely dwarf” phenotypes (Fig. 1, NE8114). The “Short leaf” phenotype is similar to mutants defective in gibberellin biosynthesis or signal transduction. The line NE5022 could not develop a normal, flat leaf blade and eventually died (Fig. 1). Several lines showed pleiotropic phenotypes of the shoot. The line NE8329 did not develop tillers, showed dwarfism and formed rolled or twisted leaves (Fig. 1). Since these phenotypes are difficult to characterize exactly by only phenotype codes, additional remarks were presented in the comment column of the database. Frequency distribution of the leaf width mutants was biased toward the narrow type (Fig. 1, NG0754), rather than the wide type.

  5. 5.

    Culm shape. Dwarfism is the most abundant mutant phenotype, along with sterility. The “Semi-dwarf” condition is characterized by plant heights that are 70–80% of wild type values. “Dwarf” describes plants with heights smaller than “Semi-dwarf” but larger than “Severely dwarf”. Plants are classified “Severely dwarf” when the plant height is smaller than 30 cm at maturity. Some dwarf phenotypes often co-segregated with other abnormal phenotypes such as “Fine leaf”, “Wide leaf”, “Spiral leaf”, or “Abnormal panicle shape”. The dwarf mutants accompanying other shoot/panicle abnormalities are expected to be involved in hormonal signaling or synthesis pathways. Other culm phenotypes such as “Thick culm” (Fig. 1, NG9874) appeared infrequently.

  6. 6.

    Spotted leaf/lesion mimic. Various types of “Spotted leaf/lesion mimic” phenotypes, e.g., small and scattered spots, large and dispersed spots, were observed. Heavy lesions caused the early death of leaves (Fig. 1, NG0752). Mutants with large white lesions were also observed. Brown spots were more frequently observed than white ones.

  7. 7.

    Tillering. In the tillering class, “Low tillering” mutants were abundant. The “High tillering” phenotype tends to co-segregate with the “Fine leaf” phenotype. The “Lazy” (Fig. 1, NG0667) mutants have open and recumbent tillers. A quite unusual phenotype among “High tillering” lines is NG2552 that produced tillers reiteratively from upper internodes and failed to produce panicles.

  8. 8.

    Heading date. In this category, lines whose heading date deviated more than 7 days from normal were classified as heading date mutants. This category includes “Early heading”, “Late heading”, and “Non-heading?” mutants. A small number of lines segregated as “Non-heading?” mutants that did not form panicles even at harvest (five to 6 months after sowing). The number of “Early heading” lines (1797) is a little larger than that of “Late heading” lines (1244).

  9. 9.

    Spikelet. Each spikelet of rice is composed of two rudimentary glumes, two empty glumes and one floret comprised of one lemma, one palea and three kinds of floral organs (two lodicules, six stamens and one pistil). “Abnormal hull” refers to any glume abnormality and “Abnormal floral organ” refers to any abnormality in floral organs (e.g. an abnormal pistil). In the “Abnormal hull” class, there is a mutant that failed to close hulls after flower opening. On the other hand, mutants that could not open flowers due to underdeveloped stamens were often re-classified as “Completely sterile” mutants. Another mutant produced an extra glume. Abnormal flower phenotypes showing shoots growing from floral organs as in Fig. 1 (NC7672), were also detected at relatively low frequency.

  10. 10.

    Panicle. The most frequently observed abnormal panicle phenotype was the precocious germination of seeds while still attached to the maturing panicle, or “Viviparous”. As for panicle shape, “Dense panicle” and “Short panicle” mutants (Fig. 1, NE6778) were also frequently observed, and “Long panicle” and “Lax panicle” were somewhat rare. Incomplete emergence of the panicle from the flag leaf sheath was categorized as “Neck leaf”.

  11. 11.

    Sterility. “Sterile” is the third most frequent mutant in this Tos17 insertion mutant population after low seed fertility and dwarf mutants. The “Sterile” and “Low fertility” conditions correspond to lines with less than 2% and ca. 50% seed fertility, respectively.

  12. 12.

    Seed. Only “Large grain”, “Small grain”, and “Slender grain” were distinguished in this category. The frequency of “Small grain” was two and four times higher than the frequency of “Large grain” and “Slender grain”, respectively. Among the “Others”, white or dull kernel phenotypes were often observed.

Table 1 Summary of phenotype data
Fig. 1
figure 1

Representative phenotypes of Tos17 insertional mutants. Phenotype and line designation are indicated on each photograph

Correlation between phenotypes

To understand how the 53 phenotypic descriptors are related, lines showing two or more abnormal phenotypes were selected, and a matrix with 53 rows and 53 columns of phenotypes with a number of lines showing each pair of phenotypes was devised. Values for the number of respective phenotypes were changed to “x” in the matrix, (“x” is ignored by the SOM program), and data were subjected to self-organizing map (SOM) analysis (Kohonen 1995). The SOM algorithm is used for visualization of multidimensional complex data using an unsupervised learning method based on a grid of artificial neurons. The 53-dimensional correlation data of phenotypes was reflected in a two-dimensional map (Fig. 2).

Fig. 2
figure 2

Self-organizing map for correlation of phenotypes. Lines that showed two or more abnormal phenotypes were analyzed by SOM. Distances between phenotypes indicate the magnitude of correlation of phenotypes. Col_Oth., Color Others; Lg_lf, Long leaf; L_joint, Abnormal lamina joint angle; Lg_pa, Long panicle; Ab_floral_organ, Abnormal floral organ; Sl_grain, Slender grain; Thick_cm, Thick culm. Grayscale levels of each node represents the distance between references

Topologies (not distances) of phenotypes on the SOM coordinates coincide with the correlation between phenotypes. Distance between two phenotypes is indicated by grayscale color on the SOM. For example, assume that phenotype A, B, and C appeared in 5000, 50, and 10 lines, respectively. Seven lines showed phenotype A and C. Three-lines showed phenotype B and C. The color between phenotype A and C should be darker than the color between phenotype B and C, because the line number of phenotype A lines is much greater than that of phenotype B lines, although the phenotype correlation of C with A is stronger than that with B. It is difficult to explain multidimensional data completely on a two dimensional map, but a SOM map indicates generally that neighboring phenotypes have relatively tight correlations. The SOM analysis in Fig. 2 shows that there are many pairs of phenotypes that are apparently correlated. That is, among 53 phenotypes, several phenotypes have a high probability to emerge simultaneously with other specific phenotypes. This phenotypic correlation could be caused by pleitropic expression of a single gene, or could reflect developmental causality of the two abnormalities. For example, “Long leaf” and “Long panicle” (lg_lf, lg_pa) located in the upper left corner on the SOM map in Fig. 2 have a very strong correlation. This result suggests that the gene involved in leaf length also affects panicle length. On the contrary, long leaf and dwarfism, at the opposite corner of the map, do not have any correlation. Both dwarf and semi-dwarf phenotypes have a relatively strong correlation with sterile and low fertile phenotypes. Since this correlation is not so strong, a part of “Dwarf” and “Semi-dwarf” mutants do not affect seed fertility, but other “Dwarf” and “Semi-dwarf” mutants may be regulated by genes associated with some housekeeping functions.

Correlations between mutant phenotype and Tos17 insertion

If allelic insertion lines show the same phenotype, it is highly probable that the mutant phenotype is caused by disruption of the target gene by a Tos17 insertion. We searched for genes that were disrupted by Tos17 in at least two lines. A total of 391 genes were detected based on the Rice Annotation Project (Ohyanagi et al. 2006). Table 2 is a partial list of such loci and their mutant phenotypes. For example, five lines have insertions of Tos17 in the Os06g027500 gene (Fig. 3A), of which two lines have the Tos17 insertions in exons and show an early heading phenotype. On the other hand, the other three lines have Tos17 insertions in introns and are not early heading. (see http://tos.nias.affrc.go.jp/). The Os06g027500 gene is Hd1, a key gene for determining heading date (Yano et al. 2000). Another example is the magnesium chelatase subunit gene, Os07g0656500 in Fig. 3B (Jung et al. 2003). Tos17 insertion into this gene was correlated with an albino phenotype, because mutants defective in this gene lack active chlorophyll and eventually become albino. All correlations between phenotypes and disrupted genes by Tos17 insertion are displayed on the mutant panel database website (http://tos.nias.affrc.go.jp/).

Table 2 List of loci that have Tos17 insertions in exons in at least two lines
Fig. 3
figure 3

Examples of genes that have insertions of Tos17. (A) Os06g0275000, Hd1 gene. (B) Os07g0656500, Magnesium chelatase subunit gene. The structure and insertion points of these genes are drawn on demand from our web site (http://tos.nias.affrc.go.jp/). Coding regions are indicated with broad lines. Positions of Tos17 insertions are indicated by flags. The direction of the flag is the direction of the Tos17 insertion. Detailed descriptions of insertion points, sequence names, accession numbers, and line names will appear on the web page. When a sequence name is clicked, the figure will be re-drawn with a flag labeling the sequence name to distinguish closely inserted Tos17s

A large-scale phenotypic characterization of an M2 population generated by Tos17 retrotransposition has revealed that this population contains a large number of mutants covering many easily scored phenotypes. In addition, linkage between the mutant phenotype and a specific Tos17 insertion facilitates greatly the isolation of the causal genes and the elucidation of the gene functions.

Discussion

Three-years of collaboration among seven laboratories has led to a large-scale phenotypic characterization of about 50,000 M2 plants generated by Tos17 retrotransposition. We have examined 53 kinds of abnormal phenotypes in rice from the seedling to harvest stages that are easily evaluated in the field. This project has revealed that this population contains a large number of mutants covering a wide range of phenotypes. Although several mutant phenotypes may be environment-sensitive and their expression may be unstable, most of the mutant phenotypes were stable. We also deposited additional information about certain phenotypes in a comment field within the phenotype description table, if this information was provided. A text search box on the phenotype list web page is envisioned to facilitate use of the resource.

Because M2 plants were observed, inserted Tos17s segregate among individuals. An average of 10 new copies of Tos17 are inserted into each regenerated plant. If the phenotype co-segregates with an insertion of Tos17, the phenotype is likely to be caused by disruption of the target gene. Of course, there are lines whose mutant phenotypes are not correlated with Tos17 insertion. These mutants might be caused by insertions of other native transposons, by chromosomal aberrations, or by other mutations during tissue culture. In this case, their causal genes could be isolated by usual positional cloning methods.

There were some difficulties in clustering phenotypes using standard relational algorithms, because each of the 53 phenotypes has as many as 52 kinds of frequencies against other phenotypes, respectively. SOM analysis is a kind of clustering method suitable for such multi-dimensional non-linear data (Kohonen 1995). SOM analysis confirmed some associations due to hormonal or developmental constraint, but some additional intriguing correlations were also detected, e.g., between leaf color/shape and heading date, between dwarfism and sterility. In Fig. 2, the “Long leaf” phenotype is strongly correlated with the “Long panicle” phenotype. The “Weak” phenotype at the seedling stage has a relatively strong correlation with “Lethal”, “Abnormal shoot”, “Pale green” Leaf and “Spotted leaf/lesion mimic” phenotypes, indicating that the “Weak” plants at the seedling stage likely show “Lethal”, “Abnormal shoot”, “Pale green” and/or “Spotted leaf/lesion mimic” phenotypes at later stages. “Dwarf”, “Semi-dwarf”, “Sterile”, “Low fertile” and “Low tillering” phenotypes showed relatively high correlations. The SOM map is quite useful for the overview of phenotype correlations. This method of analysis will be useful not only for considering the major and side-effects of gene disruption but also for re-categorization of phenotypes. The advantage of SOM analysis is the ability to analyze many different kinds of data simultaneously. Currently, metabolome data have been obtained from Arabidopsis (Hirai et al. 2004), tomato (Schauer et al. 2006) and other plants. It is reasonable to predict that the combinatorial analysis of phenotype and metabolome data of rice will reveal correlations between metabolic pathways and phenotypes without additional genetic information.

Phenotype databases of rice have been developed in many countries, e.g., Oryzabase (Kurata and Yamazaki 2006), RMD (Zhang et al. 2006), and IRRS (Wu et al. 2005). Our system includes both phenotypic and insertion data on the rice genome in a relational database. The number of photographs on file is more than 58,000, and the number of phenotype description records is more than 158,000. Our database structure enables direct access of Perl script to the database and the extraction of many kinds of correlations. Flanking sequences of insertion points and phenotypes of the insertion lines provide a provisional assignment of the function of a disrupted gene. We have already sequenced more than 25,000 flanking sequences from 20% of the insertion lines, and the number of flanking sequences is still increasing. Of the 27,448 total annotated loci based on RAP1/IRGSP Build3, 391 loci have more than two insertions in exons. When two or more lines have Tos17 insertions in a common gene, they usually exhibited similar phenotypes. These data are a strong indication of the function of a disrupted gene. However, several phenotypes such as “Dwarf” and “Low fertile” were observed in many lines. It might be difficult to assign a gene function based solely on correlation for these frequently observed phenotypes, and a complementation test would be required to confirm the correlation. If flanking sequence data from all insertion lines can be obtained, this correlation will be more useful for the annotation of genes. We are continuing to sequence the flanking regions of all our mutant lines.

In this study, we collected a large amount of phenotypic data in seven fields under natural conditions. If phenotypes are observed under other conditions such as drought or temperature stress conditions or under pathogen pressure, phenotypic description of this population would be much more enriched. In addition, the present study evaluated only a limited number of traits easily scored on above-ground organs. Thus, other traits such as roots and seed storage composition remain to be investigated in the future. Expansion of this study to include new traits would enable investigators to find new correlations with disrupted genes. Furthermore, integration of phenotype data with those of microarray experiments, metabolic profiling, and other approaches will be a powerful tool for revealing new aspects of plant physiology.

All phenotype and flanking sequence data can be obtained via http://tos.nias.affrc.go.jp/. At this site, all annotated rice genes and locations of Tos17 insertions are shown on a clickable chromosome map. Details containing illustrated gene structures with insertion points, phenotypes of corresponding lines, nucleotide sequences flanking the disrupted region, candidate primer sequences for segregation analysis, and annotation from RAP data corresponding to the selected position are displayed. Mutant seeds are available for scientific use from the Genome Resource Center at NIAS (http://www.rgrc.dna.affrc.go.jp/).