|Home | About | Journals | Submit | Contact Us | Français|
With the completion of rice genome sequencing, large collection of expression data and the great efforts in annotating rice genomes, the next challenge is to systematically assign functions to all predicted genes in the genome. The generations and collections of mutants at the genome-wide level form technological platform of functional genomics. In this study, we have reviewed currently employed tools to generate such mutant populations. These tools include natural, physical, chemical, tissue culture, T-DNA, transposon or gene silencing based mutagenesis. We also reviewed how these tools were used to generate a large collection of mutants and how these mutants can be screened and detected for functional analysis of a gene. The data suggested that the current population of mutants might be large enough to tag all predicted genes. However, the collection of flanking sequencing tags (FSTs) is limited due to the relatively higher cost. Thus, we have proposed a new strategy to generate gene-silencing mutants at the genome-wide level. Due to the large collection of insertion mutants, the next step to rice functional genomics should be focusing on functional characterization of tagged genes by detailed survey of corresponding mutants. Additionally, we also evaluated the utilization of these mutants as valuable resources for molecular breeding.
Functional genomics is the branch of genomics that determines the biological function of genes and their products. Both Arabidopsis and rice plants have been regarded as model organisms for dicots and monocots, respectively 1. Now both japonica and indica rice genomes have been completely sequenced 2-4. With the sequencing of rice genome, gene prediction /annotation has been carried out. Various annotation databases were set up and were freely available for public researchers. One of these databases is RiceGAAS (http://ricegaas.dna.affrc.go.jp/; 5). The second one is the Rice Annotation Project Database RAP-DB (http://rapdb.dna.affrc.go.jp/; 6). The third is the TIGR rice genome annotation database (now moved to Michigan State University (MSU); http://rice.plantbiology.msu.edu/; 7, 8). The releasing of these databases has been significantly contributing to the research of rice functional genomics. As a result of ongoing annotation efforts, predicted gene numbers continue to be changed 9. More than 50,000 genes were predicted upon publication of its draft sequence 2, 3. Subsequently, 40,612 non-transposable element-related genes were predicted by the MSU rice genome annotation project. However, 37,544 genes were predicted to be protein-coding genes 4. Now only 30,000 or less protein-coding genes were estimated 10. The large differences in the annotated gene numbers suggest the necessary to further validate these annotated genes by various experimental approaches. Such necessity was strengthened by the fact that at least 40% of predicted Arabidopsis genes were wrongly annotated based on subsequent validation by experiments 11. Besides the efforts in genome-wide gene prediction, many rice gene families were also annotated by individual researchers. For example, we have identified and characterized members from 5 gene families at the genome-wide level. These included 14 rice myosin gene family members 12, 114 pollen-allergen-like genes 13, 49 rice cyclin genes 14, 111 small GTPase genes and 85 genes encoding small GTPase activating proteins 15 and 17 GRAM domain containing proteins 16. These works together with other community annotation of rice gene families (http://rice.plantbiology.msu.edu/ca/rice_ca.shtml) significantly contributed to and complemented genome-wide gene annotations.
Additionally, large amount of rice Expressed Sequence Tags (EST) data are available in public databases including the MSU (http://rice.plantbiology.msu.edu/), NCBI (http://www.ncbi.nlm.nih.gov/dbEST/index.html), Gramene (http://www.gramene.org/Oryza_sativa_japonica/index.html) databases and so on. For example, until October 29, 2009, total of 1,249,001 EST sequences have been released into NCBI database (http://www.ncbi.nlm.nih.gov/dbEST/dbEST_summary.html). In addition to these, more than 32,000 full-length cDNA clones from japonica rice have been sequenced 17 (http://cdna01.dna.affrc.go.jp/cDNA/) and 10,096 indica full-length cDNA clones were also released 18 (http://www.ncgr.Ac.cn/cDNA/indexe.html).
With the completion of rice genome sequencing, large collection of expression data and great efforts in annotating rice genomes, the next challenge is to systematically assign functions to all predicted genes in the genome. To broadly assign functions to unknown genes, various old approaches are improved and new methods are developed. The different methodologies have been developed to form their own fields within the functional genomics technological platform and are termed transcriptomics, proteomics, metabolomics and phenomics 19. However, all tools to identify functions of genes are based on the analyses in phenotypic variations between wild type and its mutant. Therefore, the generations and collections of mutants at the genome-wide level form the technological platform of functional genomics.
On the other hand, during long breeding history, farmers and breeders have been selecting new rice varieties with better agronomic traits such as higher yield, improved resistance to various diseases and better quality of grains and so on. These varieties were developed by altering the genetic makeup of the crop. Therefore, genetic variation is the basis of breeding selection. The variation may be produced by natural and artificial mutations as well as sexual crossing. Among the hundreds and thousands of variations, elite germplasms may be developed, which form the important resource for rice breeding. The evidence has shown that a breakthrough might be achieved when such germplasms have been found and used for rice breeding practice. For example, the rice yield has been greatly improved by the utilization of dwarf germplasm 20. Similarly, other important germplasms such as cytoplasmic male sterile lines and photoperiod/temperature-sensitive male sterile lines have led to the development of various hybrid rice combinations, which have further improved crop yields by 20-30% when compared with the conventional varieties 21. Therefore, it is very important for us to collect, to generate, and to evaluate rice germplasms for better serving rice breeding.
In this review, we will focus on the collections and characterizations of large rice mutants generated from various methods of mutagenesis such as maize two-element Ac/Ds system and T-DNA insertion mutagenesis and so on. We also review the applications of these tools and mutants in identifying gene functions and in rice breeding.
Natural mutants were generated during species evolution. Generally, the ratio of natural mutation is very low at only 10-5-10-8 in higher plants. However, a large collection is still available during long evolutionary history. Some of such mutants were harmful or neutral and might be lost during evolution. Others might exhibit higher resistance to various abiotic / biotic stresses or have some specific agricultural traits, which were valuable germplasm resources for rice breeding. One example is the utilization of dwarf germplasm Dee-geo-woo-gen from China and release of rice variety IR8, which was developed from the dwarf line 22. Another example is the application of cytoplasmic male sterile (CMS) and photoperiod-sensitive genic male sterile rice lines, which are widely utilized to develop hybrid rice seeds for commercial release 23.
Map-based cloning is a widely-used method to isolate genes using such mutants. Genetic analysis is the first step to use these mutants for identifying gene functions, by which we know that how many genetic loci control the mutated phenotype. The next step is to finely map these loci to rice genome and then to clone the mutated gene confirmed by genetic complementation experiments. In fact, many genes have been isolated and functionally characterized by using natural mutant lines. For example, the rice Xa21 and Xa27 gene, which confers resistance to Xanthomonas oryzae pv. oryzae race 6, was isolated by map-based cloning from a natural mutated rice variety 24, 25. Another such example is the isolation of rice semidwarf gene sd-1, which encodes a gibberellin 20-oxidase 26. Up to now, at least 67 rice genes have been isolated and functionally characterized by the map-based cloning (Table (Table11).
In 1930, Muller observed that mutation could be induced by X-rays 27. Subsequent researches found that the most efficient mutagenesis was mediated by fast neutron bombardment 28. A short deletion of DNA fragment was usually observed following the bombardment. Thus, a truncated gene might be detected by genomic subtraction and its functions could be identified by corresponding mutant phenotype. One example is the isolation and characterization of Arabidopsis ga1-3 gene 29. Currently, a new reverse genetics method has been developed to identify and isolate such mutants 30, 31. This method was named as Deleteagene. In this system, DNA samples were extracted from the fast neutron-treated plants and were used to screen for deletion mutants by polymerase chain reaction (PCR) using specific primers flanking the targeted genes. Li et al. (2001) has generated an Arabidopsis population of 51,840 lines by fast neutron mutagenesis 30. This library was then used for screening deletion mutants of 25 gene loci, among which deletion mutants were obtained for 21 (84%) gene loci. Similarly, they also generated a rice fast neutron mutant pool with 24,660 lines and similar method was successfully used for identification and isolation of targeted genes. Evidence showed that this method can be efficiently used for the identification of small genes or tandemly arrayed genes 30. Wu et al. (2005) reported the generation of around 10,000 rice mutant lines by the fast neutron bombardment and around 20,000 lines by γ-ray 32. Since the establishment of the method, many genes have been isolated and functionally characterized including the phytochrome family gene PHYC 33 and phytochrome-interacting transcription factor PIF3 34, 3 genes encoding TGA transcription factors TGA2, TGA5, TGA6 35 and so on.
Chemical mutagenesis is mediated by certain chemical reagents. One of the most frequently used reagents is ethyl methane sulfonic acid (EMS). This alkylating agent can efficiently induce chemical modification of nucleotides, which results in various point mutations including nonsense, missense and silent mutations, among which silent mutations can not generate any modification in phenotype and thus can not be used for mutagenesis. In Arabidopsis, EMS mainly induces C to T changes resulting in C/G to T/A substitutions and at a low frequency, EMS generates G/C to C/G or G/C to T/A transversions by 7-ethylguanine hydrolysis or A/T to G/C transition by 3-ethyladenine pairing errors 36. Based on codon usage in Arabidopsis, the frequency of EMS-induced stop codon and missense mutations has been calculated to be ~5% and ~65%, respectively 37.
In Arabidopsis, at least 125,000 M1 lines should be generated in order to achieve saturation of EMS mutagenesis 38. However, it is not difficult to produce such a population since viable seeds can be used for EMS treatment. The difficulty is how to detect single-nucleotide polymorphisms or substitutions in these mutation lines in a large scale. Based on the technology in detecting single-nucleotide polymorphisms 39, 40, McCallum et al (2000) established a new detecting method named as TILLING (Targeting Induced Local Lessions In Genomes) 37, 41 complemented with denaturing high-performance liquid chromatography (DHPLC). These technologies allow chemically induced mutant pools to be used for reverse genetics. With help of automation, robust and rapid detection makes it possible to screen a wide range of mutant pools in a short time and to avoid the laborious process of forward genetic screening 42, 43. Now the technology has been used in various species including animals and plants and some improved methods were also provided 44-51.
In rice, around 18,000 and 9,000 mutants were generated from diepoxybutane and EMS mutagenesis, respectively 32. Total of 10 genes were screened using TILLING and independent mutations were detected in two genes: pp2A4 encoding serine/threonine protein phosphatase catalytic subunit and cal7 encoding callose synthase, suggesting the feasibility of this screen method in chemical mutagenesis. In another report, they screened 10 genes including Os1433 (LOC_Os02g36974), OsBZIP (LOC_Os01g64000), OsCALS8R (LOC_Os01g55040), OsDREB (LOC_Os01g07120), OsEXTE (LOC_Os10g33970), OsMAPK (LOC_Os07g38530) OsPITA (LOC_Os12g18360), OsR1A (LOC_Os05g41290), OsRPLD1 (LOC_Os01g07760) and OsTPS1 (LOC_Os02g44230). Independent mutants were detected for all 10 genes 52. They also found that multiple nucleotide changes can be detected in each gene 52, suggesting that they have developed a useful method for more reliable and exact functional identification of a gene.
With the development of high efficient Agrobacterium-mediated transformation of rice 53, T-DNA mutagenesis has become a major method to generate a large collection of insertion mutants. Generally, T-DNA can be randomly and stably inserted into plant genome, which made it possible to generate a population saturated with insertions, i.e. having at least one insertion in each gene 54. In Arabidopsis, at least 225,000 independent T-DNA insertion lines have been created that represent near saturation of the gene space; the precise locations were determined for more than 88,000 T-DNA insertions, which resulted in the identification of mutations in more than 21,700 of the approximately 29,454 predicted Arabidopsis genes 55.
In rice, several research groups have contributed to the generation of T-DNA insertion lines. For example, An's group has generated approximately 100,000 insertion lines 56, 57. Around 42,000 T-DNA insertion lines have been generated by Zhang and Wu's group 58, 59. Recently, Hsing et al. (2007) have reported the generation of 55,000 T-DNA insertion lines 60. Several other groups also independently produced T-DNA insertional mutant lines in rice 61-63. According to the previous reports, the average copy number of T-DNA inserts per line is 1.4-2.0 64. Thus, more than 450,000 T-DNA tags have now been generated in rice (Fig. (Fig.1A).1A). Recent progresses on the generation of T-DNA insertion lines have been reviewed by several researchers 65-67. If there are only 30,000 or less protein-coding genes in rice genome 10, these populations are large enough to find a knockout in a given gene, assuming that T-DNA is randomly inserted into a chromosome. This suggestion was strengthened by the fact that T-DNA have been observed to insert preferentially in gene-rich regions 58-59, 68-70.
After T-DNA insertion, various phenotypes have been observed including changed growth rates, different plant statues, pollen and seed fertility and so on 67, 71-74. Those visible differences in phenotypes could significantly contribute to the identification of gene functions. Since the establishment of T-DNA insertion populations, at least 43 genes have been functionally characterized by T-DNA insertion mutants (Table (Table2).2). For example, a knockout line of OSMADS3 by T-DNA insertion shows homeotic transformation of stamens into lodicules and ectopic development of lodicules in the second whorl near the palea where lodicules do not form in the wild type but carpels develop almost normally 75. Their data show that this gene plays a crucial role in regulating stamen identity.
Maize transposon Ac and Ds elements have been successfully used as insertion mutagens for rice insertion mutagenesis. For obtaining stable insertion lines, a two-element Ac/Ds tagging system has been established. In this system, two different transgenic parental lines were used for sexual crossing. One parental line contains Ac element, in which Ac is immobilized and provides only Ac transposase under the control of 35S promoter. Another parental line is transgenic Ds plant, in which Ds is also non-autonomous element and provides only two wings of Ds element (5' Ds and 3' Ds). Thus, in both Ac and Ds parental lines transposon Ac or Ds can not mobilize by themselves. However, after crossing between Ac and Ds plants, Ds element will be mobilized and inserted into different genome positions under the presence of Ac transposase. In the next generation, these lines containing only Ds element and without Ac transposase were selected. Therefore these Ds insertional lines will be stable since the plants contained no Ac transposase. Besides Ac and Ds, other transposons such as En and Spm were also used to generate transposon insertion mutants 76.
Currently, multiple research groups have been greatly contributed to the large collection of transposon insertion mutants and various databases have been set up for better utilization of these resources (Fig. (Fig.1B;1B; 65-67, 76-84). Totally, more than 153,000 transposon insertion lines have been generated, providing valuable resources for the survey of functional genomics.
Since a large collection of transposon insertion lines were generated during 2000's, not so many genes have been functionally identified by such insertion mutants. Currently, at least 9 genes have been functionally characterized by Ds insertion mutants including ANTHER INDEHISCENCE1, BRANCHED FLORETLESS 1, CSL1, OSH6, OsKS1, OSMYOXIB, OsNAC2, OSNOP and OsPE (Table (Table3).3). These data suggested the feasibility and potential of transposon insertion lines as a tool to decipher gene functions.
In our lab, we have used a two-element Ac/Ds gene trap system to tag rice genes. In this system, an immobilized version of Ac, in which the transposase gene is under the control of cauliflower mosaic virus (CaMV) 35S promoter was used. The non-autonomous Ds element carries the bar gene encoding phosphinothricin acetyltransferase conferring resistance to phosphinothricin (herbicide Basta), which serves as a positive selection marker and a modified promoterless gusA gene encoding β-glucuronidase as a reporter gene. The gusA gene used in the Ds construct has the intron and triple splice acceptor sequences upstream of the ATG codon to trap the expression of tagged genes at 3' Ds. The synthetic green fluorescence protein (sGFP) was used under maize ubiquitin promoter as negative selection markers within both the Ac and the Ds T-DNA constructs as a negative selection marker.
These two constructs were then introduced into rice genome by Agrobacterium-mediated transformation. Transgenic Ac and Ds rice plants were used as parent lines for sexual crossing. In next generation, stable Ds insertion lines were obtained by selecting Basta positive and GFP negative plants. Homozygous Ds insertion lines were obtained after the fifth/sixth generations by self-crossing. These homozygous lines were used for phenotype investigation.
In this system, the germinal transposition frequency of Ds was estimated as an average of 51% by analyzing 4413 families. Study of Ds transposition pattern in siblings revealed that 79% had at least two different insertions, suggesting late transposition during rice development. Analysis of 2057 Ds flanking sequences showed that 88% of them were unique, whereas the rest within T-DNA. The insertions were distributed randomly throughout the genome; however, there was a bias toward chromosomes 4 and 7, which had two times as many insertions as that expected. A hot spot for Ds insertions was identified on chromosome 7 within a 40-kbp region. One-third of Ds flanking sequences was homologous to either proteins or rice ESTs, confirming a preference for Ds transposition into coding regions. Analysis of 200 Ds lines on chromosome 1 revealed that 72% insertions were found in genic region. Anchoring of more than 800 insertions to yeast artificial chromosome (YAC)-based EST map showed that Ds transposes preferentially into regions rich in expressed sequences. High germinal transposition frequency and independent transpositions among siblings show that the efficiency of this system is suitable for large-scale transposon mutagenesis in rice 78.
Additionally, we have performed a systematic analysis to survey the transposition activities of Ac/Ds parent lines in the following generations. We found that high somatic and germinal transposition frequencies were maintained as late as T4 and T5 generations; thus the propagation of parental lines did not induce transposon silencing. Moreover, the stably transposed Ds element was active even at the F5 generation, since Ac could remobilize the Ds element as indicated by the footprint analysis of several revertants. Strikingly, substantial transgenic silencing was not observed in any of the generations tested. We analyzed the timing of transposition during rice development and provide evidence that Ds was transposed late after tiller formation. Our study validates the Ac/Ds system as a tool for large-scale mutagenesis in rice, since the Ds elements were active in the starter and insertion lines even in the later generations 85.
Not all genes can be functionally identified by mutagenesis. One of the reasons is that many genes are functionally redundant and mutation of such genes may not lead to an easily recognizable phenotype. Another reason is that many genes function at multiple stages of development and mutations of these genes may lead to early lethality. Therefore, it is necessary to develop a system to monitor gene expression patterns to better understand functions of these genes. Gene trap, promoter trap or enhancer trap is a system that allows gene activity to be monitored by creating gene fusions with a reporter gene. In an enhancer trap, a reporter gene has a minimal promoter that is only expressed when inserted near cis-acting chromosomal enhancers. Reporter genes in gene trap and promoter trap have no promoter, so that reporter genes expression can occur only when the reporter gene inserts within a transcribed chromosomal gene, creating a transcriptional fusion. Expression of a promoter trap reporter gene requires that it be inserted into an exon, leading to a transcriptional fusion. In contrast, gene trap constructs contain one or more splice acceptor sequences preceding the reporter gene. Thus reporter genes can be detected even if insertion occurs in an intron since splicing from the splice donor sites to the splice acceptor sites in the reporter gene results in fusion of upstream exon sequences to the reporter gene 86. Currently, this system has been widely used for T-DNA or transposon mutagenesis. The GUS reporter gene is the mostly used gene for various trap systems in rice. In an enhancer trap Ds insertion population, around 8% of the lines were detected with GUS expression in panicles 81. For T-DNA promoter trap lines, histochemical GUS assays were carried out in the leaves and roots from 5353 lines, mature flowers from 7026 lines, and developing seeds from 1948 lines. The data revealed that 1.6-2.1% of tested organs were GUS-positive and that their GUS expression patterns were organ- or tissue-specific or ubiquitous in all parts of the plant 56. In our lab, 2852 Ds lines were subjected to GUS assay and the result showed that around 8.1% of the lines were with GUS activities 87. Some of the examples are shown in Figure Figure22.
These analyses suggested that Ds-tagged genes exhibited different expression patterns due to the Ds insertion into different genomic positions. Furthermore, the multimerized transcriptional enhancers from the cauliflower mosaic virus 35S promoter were positioned next to the left border of the T-DNA to make activation tagged lines 57. Histochemical GUS assays have revealed that the GUS-staining frequency from those lines is about twice as high as that from lines without the enhancer element. This result suggests that the enhancer sequence presented in the T-DNA improves the GUS-tagging efficiency 57. In another report, a CaMV35S enhancer was cloned in eight tandem repeats. This octamer configuration may serve as more potent activator than the traditional tetramer, as gene distances as far as 12.5 kb from the ATG start codon led to gene activation 60. Thus, most insertions of the CaMV35S enhancers into the rice genome (excluding insertions in exons and introns that lead to gene knockout) have the potential to activate at least one native gene based on the average gene density of one gene per 9.9 kb in the rice genome 4. Recently, a versatile transposon-based activation tag vector system was used for functional genomics in cereals and other monocot plants to further enhance rice gene expression 88. All these data showed that gene trap, promoter trap and enhancer trap in T-DNA or transposon mutagenesis can be used as efficient tools to trap gene expression and to analyze their functions.
Besides the activation tagging system, gain-of-function type mutants may also be obtained from over-expression of individual rice genes. Both over-expression and gene silencing (see below) have been widely used for the annotation of gene functions. Currently, many rice genes have been functionally characterized by the over-expression of their genes. For example, the biological function of ETHYLENE RESPONSE2 (ETR2) was annotated by comparing the difference between gain-of-function and knockout mutants 89. To investigate the over-expression mutants at the genome-wide level, special binary vectors have been designed to globally over-express all genes in an organism 90-92. Up to now, at least 45,000 FOX hunting rice lines have been generated 93, 94.
Tissue culture is also an efficient tool to induce various mutations, which is called somaclonal variations 95. Tissue culture mutagenesis formed the important resources for rice breeding 96, 97. However, little is known about the application of this technique in functional genomics until that some transposon elements in maize can be activated during tissue culture, indicating that some tissue culture-derived genetic variability may be the result of insertion or excision of transposable elements, or both 98. Subsequent studies showed that active DNA transposon elements were also observed during rice tissue culture 99-101. In addition to transposons, active retrotransposons were also detected during rice tissue culture 102. Differentiated from transposons, retrotransposons are mobile genetic elements that transpose through reverse transcription of an RNA intermediate. One of these retrotransposons was named as Tos17, a widely utilized retrotransposon in rice 102. One to five copies of Tos17 elements can be detected in normal growth conditions, varying with different rice varieties. For example, two copies of Tos17 were detected in japonica variety Nipponbare. These Tos17 elements have usually no activity in normal growth conditions. However, Tos17 will be activated during tissue culture and its copy number will increase to 5-30 102. For example, at least 5 new insertions of Tos17 were induced during 3- to 9- month tissue culture. Although Tos17 is actively transcribed during tissue culture, no transcript of Tos17 was detected in plants regenerated from tissue culture 102-104, suggesting that transposition is active only under tissue culture conditions. This result indicated that Tos17 could be used for mutagenesis to generate stabilized insertion lines. Subsequent studies showed that insertion sites were mostly found in genic regions and preferably in coding sequences 105, 106. In 2001, 32,000 rice lines were generated from Tos17, containing 256,000 insertions 103. Now they have produced around 50,000 insertion lines (Fig. (Fig.1C;1C; 107). Phenotypic investigation of these insertion lines indicated that nearly half of the lines showed more than one mutant phenotype; the most frequently observed phenotype was low fertility, followed by dwarfism 107. These phenotype data have been submitted into Tos17 mutant database with a dataset of sequences flanking Tos17 insertion points in rice genome (http://tos.nias.affrc.go.jp/).
Since the identification and characterization of Tos17, many genes have been isolated and functionally characterized through Tos17 insertion lines. Currently, at least 24 genes have been characterized by Tos17 insertion (Table (Table4).4). For example, oshkt2;1 is the first mutant that greatly diminishes sodium influx into plant roots. Further investigator showed that OsHKT2;1 is the central transporter for nutritional Na+ uptake into K+ starved rice roots 108.
Silencing a gene is also an efficient tool to determine its functions. Several methods can be used to silence a gene. For example, anti-sense or co-suppression was frequently observed in transgenic plants 109, 110. However, in some cases, only partial functions can be suppressed by anti-sense or co-suppression. Among RNA silencing methods, RNA interference (RNAi) is now widely used for gene silencing. During silencing, double-stranded RNA (dsRNA) is processed into 20- to 25-nt short interfering RNA (siRNA) and microRNA (miRNA) by RNaseIII-like enzymes called Dicers1. siRNAs and miRNAs guide RNA-induced silencing complexes (RISCs) to suppress gene expression at the level of transcription, RNA stability or translation. siRNAs are 21-23 nucleotide double-stranded RNA molecules. Once incorporated into RISC they facilitate the cleavage and degradation of their recognized mRNA. MicroRNAs are single-stranded RNAs of 22-nucleotides that are processed from ~70-nucleotide hairpin RNA precursors by Dicers. Similar to siRNAs, miRNAs can silence gene activity through destruction of homologous mRNA or blocking its translation in plants.
The key to utilize the RNAi silencing in plants is how to transfer double strand RNAs into cells. In animals, RNAi can be initiated by injecting or feeding dsRNA into cells 111. However, these methods can not be used in plants. Currently, at least two types of constructs have been used for RNAi-mediated silencing in plants. One is by virus-based vectors to transfer dsRNAs into plant cells. However, the virus-based silencing method can not be genetically inherited. Another one is by hairpin RNA (hpRNA)-mediated gene silencing. In this method, the target gene is cloned as an inverted repeat spaced with an unrelated sequence and is driven by a strong promoter, such as the 35S CaMV promoter for dicots or the maize ubiquitin 1 promoter for monocots. When an intron is used as the spacer, the efficiency becomes very high: almost 100% of transgenic plants show gene silencing 112. However, this technique cannot be applied to genes whose silencing may block plant regeneration or result in embryo lethality. To obviate these potential problems, a chemical-inducible Cre/loxP (CLX) recombination system was used to trigger the expression of an intron-containing inverted-repeat RNA (RNAi) in plants 113. In addition, a vector for high-throughput cloning of target genes as inverted repeats has been constructed for genome-wide analysis of gene functions 114. Another such RNAi vector is based on the spreading of RNA targeting (transitive RNAi) from an inverted repeat of a heterologous 3'UTR 115. Thus, the RNAi-mediated gene silencing can be used as a tool not only to analyze gene functions of a single gene or a gene family but also for genome-wide analysis of gene functions.
Now, this RNAi-mediated gene silencing has been successfully used for identification of gene functions in rice plants. In less than 7 years from 2003, at least 71 genes were functionally characterized by the RNAi silencing (Table (Table5).5). These genes included genes encoding various transcription factors, flowering-related proteins, pathogen/membrane-related proteins, various protein kinases and cell division-related proteins and so on. These results suggested that various genes can be silenced by RNAi and this method may be universally employed for characterizing various rice genes.
In order to use this RNAi-mediated gene silencing at the genome-wide level, we have designed a binary vector to globally silence rice genes (Fig. (Fig.3).3). Currently, in plants, genome-wide RNAi technology has been developed only in Arabidopsis 116. However, this method was based on whole genome sequence information and was costly. The new system is based on the construction of a cDNA library and subsequent RNAi constructs followed by transformation; thus, it is relatively cheap and can be used for other plants since no sequence information is required to generate such system. On the other hand, gene silence-based phenotypes can be investigated during the first or second generation; thus, it is a time-saved system. In this system, 300-500 bp 5'-UTR fragments are generated from a normalized cDNA library and are then cloned into the Gateway pENTR vector, which carry two recombination sites (attL1 and attL2) for LR clonase reactions (Fig. (Fig.3).3). These cloned UTR fragments are then transferred into a pANDA destination vector, which was developed by Miki and Shimamoto (2004) 117, through recombinase reactions. Thus, a RNAi silencing vector library is constructed, where UTR fragments are inserted into two regions flanked by two inverted repeats. This library can be used for transformation to develop a collection of transgenic rice plants integrated with different RNAi silencing T-DNAs.
In summary, we have reviewed 7 different methods for mutagenesis including natural, physical, chemical, tissue culture, T-DNA, transposon and gene silencing mutagenesis (Fig. (Fig.4).4). Natural mutagenesis has been widely used to identify gene functions but this method can not be used for genome-wide analysis due to its low frequency of mutation and its difficulty in identifying mutated genes by map-based cloning. Chemical mutagenesis can be efficiently used to produce a large number of point mutants in a short period and the induced mutants can be detected by TILLING. However, multiple point mutations were sometimes observed in one mutant. Thus, it is necessary to genetically segregate these point mutants. Similarly, Physical mutagenesis can be applied for producing a large number of deletion-based mutants in a short period, and the deletion mutants can be screened by Deleteagene. However, it is also very difficult to identify a deletion mutant and its phenotype when the induced deletion occurs covering multiple genes or within an intron. Insertional mutagenesis based on T-DNAs, transposons and retrotransposons is becoming a major approach to produce a saturated mutant pool. A large number of T-DNA insertion lines have been produced in rice; but T-DNA insertional mutagenesis can be used only for those rice varieties with highly efficient transformation. The retrotransposon Tos17 has been successfully applied for rice functional genomics. But it is also difficult to identify a mutant related to Tos17 because there are multiple copies of Tos17 in a mutant and only about 10 percent of mutants are tagged by Tos17 insertion. Theoretically, Ac/Ds two-element system is regarded as a best approach for rice insertional mutagenesis because more than 95% of Ds insertion lines have only single copy of Ds insertion. An additional advantage is that Ds can be remobilized from a tagged gene in the presence of Ac transposase, resulting in phenotypic reversion to the wild type or giving rise to alleles with weaker phenotypes. However, it is also difficult to identify a mutant when there are Ds excision footprints in the mutant caused by multiple Ds excision-insertion events in the presence of Ac transposase. RNAi can efficiently silence a gene, but not all genes can be silenced. In addition, RNAi can interfere in genes with redundant and overlapping functions or gene families with high homolog in sequence, making it difficult to identify a silenced gene. So it is obvious that each method has its advantage and disadvantage and different methods should be combined to produce a saturated mutant population.
Large-scale phenotype investigation has been carried out in rice using several mutant resources. Chern et al (2007) reported 11 categories of the visible phenotypes including growth condition, leaf color, leaf morphology, plant morphology, mimic response, tiller, heading date, flower, panicle, seed fertility and seed morphology, which were subdivided into 65 subcategories 71. Miyao et al (2007) also investigated around 50,000 Tos17 insertion lines in their phenotype variation including germination, growth, leaf color, leaf shape, culm shape, spotted leaf/lesion mimic, tillering, heading date, spikelet, panicle, sterility and seed 107. Park et al (2009) have analyzed 115,000 Ds insertion lines in their phenotype variation and 437 mutants from 12,162 Ds-tagged lines were catalogued in their agronomic traits including tillers, panicles, leaves, flowers, seed, chlorophyll content, and plant height 83. Furthermore, several rice mutant phenotype databases are now established including Tos17 insertion lines 107, T-DNA-tagged lines 72, and chemical- and irradiation-induced lines 32. Kuromori et al (2009) have reviewed a detail phenome analysis 67. However, they have not discussed their application in rice breeding. Recently, Mochida and Shinozaki (2010) have summarized the genomics and bioinformatics resources for crop improvement 118.
We have subjected around 20,000 Ds insertion lines to phenotypic and abiotic stress screens and evaluated these lines with respect to their seed yields and other agronomic traits as well as their tolerance to drought, salinity and cold. Based on this evaluation, we observed that random Ds insertions into rice genome have led to diverse variations including a range of morphological phenotypes. We have observed various variations in these Ds insertion lines including the differences in plant height, growth vigor, growth period of duration and stigma and so on (Fig. (Fig.5).5). Among the various phenotypes identified, some Ds lines showed significantly higher grain yield compared to wild-type plants under normal growth conditions indicating that rice could be improved in grain yield by disrupting certain endogenous genes 87. In addition, several thousands of Ds lines were subjected to abiotic stresses to identify conditional mutants. Subsequent to these screens, over 800 lines responsive to drought, salinity or cold stress were obtained, suggesting that rice has the genetic potential to survive under abiotic stresses when appropriate endogenous genes were suppressed. The mutant lines that have higher seed yielding potential or display higher tolerance to abiotic stresses may be used for rice breeding by conventional backcrossing combining with molecular marker-assisted selection. In addition, by exploiting the behavior of Ds to leave footprints upon remobilization, we have shown an alternative strategy to develop new rice varieties without foreign DNA sequences in their genome 87.
Phenotype screens of Ds insertion lines have identified two male sterile mutants. One is Orysa sativa no pollen (Osnop) mutant with a pollen-less phenotype at the flowering stage. The mutant phenotype showed linkage to Ds insertion into OSNOP gene region. This mutant contained a deletion of 65 kb chromosomal region at the site of Ds insertion containing 14 predicted genes. Among them, delegen 14 was expressed only in late stage of pollen development with the highest expression at the stage of pollen release and germination by RT-PCR, Northern blotting, in situ hybridization, and promoter-GUS transgenic plants. Thus, this gene is the best candidate for OSNOP. Since this gene encoded C2 and GRAM domains, it can be assumed that this gene cross-links both calcium and phosphoinositide signaling pathways. This is the first report to suggest possible functions for this gene in plant development 119.
Another one is the myosin mutant osmyoXIB. This mutant showed male sterility under short day length (SD) conditions and fertility under long day length (LD) conditions. Under both SD and LD conditions, the OSMYOXIB transcript was detected in whole anthers. However, under SD conditions, the OSMYOXIB-GUS fusion protein was localized only in the epidermal layer of anthers due to the lack of 3'-untranslated region (3'-UTR) and to dilute (DIL) domain sequences following the Ds insertion. As a result, mutant pollen development was affected, leading to male sterility. By contrast, under LD conditions, the fusion protein was localized normally in anthers. Despite normal localization, the protein was only partially functional due to the lack of DIL domain sequences, resulting in limited recovery of pollen fertility 120. Since this mutant is a photoperiod sensitive male sterile line, it can be a candidate line to develop new male sterile lines for producing two-line hybrid rice.
We thank Drs. Ildiko Szeverenyi and Tatiana Kolesnik for their help in generation of transposant lines and Dr. Doris Bachmann for her help in phenotyping and GUS staining. We take this opportunity to thank Zhigang Ma and Hongfen Luan for their technical assistance.