Rice is not only a major food staple for the world's population but it also is a model species for a major group of flowering plants, the monocotyledonous plants. Draft genomic sequence of two subspecies of rice, Oryza sativa spp. japonica and indica ssp. are publicly available. To provide the community with a resource to data-mine the rice genome, we have constructed an annotation resource for rice (http://www.tigr.org/tdb/e2k1/osa1/). In this resource, we have annotated the rice genome for gene content, identified motifs/domains within the predicted genes, constructed a rice repeat database, identified related sequences in other plant species, and identified syntenic sequences between rice and maize. All of the data is available through web-based interfaces, FTP downloads, and a Distributed Annotation System.
The Oryza sativa L. indica subspecies is the most widely cultivated rice. During the last few years, we have collected over 20,000 putative full-length cDNAs and over 40,000 ESTs isolated from various cDNA libraries of two indica varieties Guangluai 4 and Minghui 63. A database of the rice indica cDNAs was therefore built to provide a comprehensive web data source for searching and retrieving the indica cDNA clones.
Rice Indica cDNA Database (RICD) is an online MySQL-PHP driven database with a user-friendly web interface. It allows investigators to query the cDNA clones by keyword, genome position, nucleotide or protein sequence, and putative function. It also provides a series of information, including sequences, protein domain annotations, similarity search results, SNPs and InDels information, and hyperlinks to gene annotation in both The Rice Annotation Project Database (RAP-DB) and The TIGR Rice Genome Annotation Resource, expression atlas in RiceGE and variation report in Gramene of each cDNA.
The online rice indica cDNA database provides cDNA resource with comprehensive information to researchers for functional analysis of indica subspecies and for comparative genomics. The RICD database is available through our website .
The concurrent release of rice genome sequences for two subspecies (Oryza sativa L. ssp. japonica and Oryza sativa L. ssp. indica) facilitates rice studies at the whole genome level. Since the advent of high-throughput analysis, huge amounts of functional genomics data have been delivered rapidly, making an integrated online genome browser indispensable for scientists to visualize and analyze these data. Based on next-generation web technologies and high-throughput experimental data, we have developed Rice-Map, a novel genome browser for researchers to navigate, analyze and annotate rice genome interactively.
More than one hundred annotation tracks (81 for japonica and 82 for indica) have been compiled and loaded into Rice-Map. These pre-computed annotations cover gene models, transcript evidences, expression profiling, epigenetic modifications, inter-species and intra-species homologies, genetic markers and other genomic features. In addition to these pre-computed tracks, registered users can interactively add comments and research notes to Rice-Map as User-Defined Annotation entries. By smoothly scrolling, dragging and zooming, users can browse various genomic features simultaneously at multiple scales. On-the-fly analysis for selected entries could be performed through dedicated bioinformatic analysis platforms such as WebLab and Galaxy. Furthermore, a BioMart-powered data warehouse "Rice Mart" is offered for advanced users to fetch bulk datasets based on complex criteria.
Rice-Map delivers abundant up-to-date japonica and indica annotations, providing a valuable resource for both computational and bench biologists. Rice-Map is publicly accessible at http://www.ricemap.org/, with all data available for free downloading.
A transcriptome analysis of chromosome 10 of 2 rice subspecies identifies 549 new gene models and gives experimental evidence for around 75% of the previously unsupported predicted genes.
Sequencing and annotation of the genome of rice (Oryza sativa) have generated gene models in numbers that top all other fully sequenced species, with many lacking recognizable sequence homology to known genes. Experimental evaluation of these gene models and identification of new models will facilitate rice genome annotation and the application of this knowledge to other more complex cereal genomes.
We report here an analysis of the chromosome 10 transcriptome of the two major rice subspecies, japonica and indica, using oligonucleotide tiling microarrays. This analysis detected expression of approximately three-quarters of the gene models without previous experimental evidence in both subspecies. Cloning and sequence analysis of the previously unsupported models suggests that the predicted gene structure of nearly half of those models needs improvement. Coupled with comparative gene model mapping, the tiling microarray analysis identified 549 new models for the japonica chromosome, representing an 18% increase in the annotated protein-coding capacity. Furthermore, an asymmetric distribution of genome elements along the chromosome was found that coincides with the cytological definition of the heterochromatin and euchromatin domains. The heterochromatin domain appears to associate with distinct chromosome level transcriptional activities under normal and stress conditions.
These results demonstrated the utility of genome tiling microarray in evaluating annotated rice gene models and in identifying novel transcriptional units. The tiling microarray sanalysis further revealed a chromosome-wide transcription pattern that suggests a role for transposable element-enriched heterochromatin in shaping global transcription in response to environmental changes in rice.
Genealogical patterns in different genomic regions may be different due to the joint influence of gene flow and selection. The existence of two subspecies of cultivated rice provides a unique opportunity for analyzing these effects during domestication. We chose 66 accessions from the three rice taxa (about 22 each from Oryza sativa indica, O. sativa japonica, and O. rufipogon) for whole-genome sequencing. In the search for the signature of selection, we focus on low diversity regions (LDRs) shared by both cultivars. We found that the genealogical histories of these overlapping LDRs are distinct from the genomic background. While indica and japonica genomes generally appear to be of independent origin, many overlapping LDRs may have originated only once, as a result of selection and subsequent introgression. Interestingly, many such LDRs contain only one candidate gene of rice domestication, and several known domestication genes have indeed been “rediscovered” by this approach. In summary, we identified 13 additional candidate genes of domestication.
The origin of two cultivated rice Oryza sativa indica and O. sativa japonica has been an interesting topic in evolutionary biology. Through whole-genome sequencing, we show that the rice genome embodies two different evolutionary trajectories. Overall genome-wide pattern supports a history of independent origin of two cultivars from their wild population. However, genomic segments bearing important agronomic traits originated only once in one population and spread across all cultivars through introgression and human selection. Population genetic analysis allows us to pinpoint 13 additional candidate domestication genes.
Genome tiling microarray studies have consistently documented rich transcriptional activity beyond the annotated genes. However, systematic characterization and transcriptional profiling of the putative novel transcripts on the genome scale are still lacking. We report here the identification of 25,352 and 27,744 transcriptionally active regions (TARs) not encoded by annotated exons in the rice (Oryza. sativa) subspecies japonica and indica, respectively. The non-exonic TARs account for approximately two thirds of the total TARs detected by tiling arrays and represent transcripts likely conserved between japonica and indica. Transcription of 21,018 (83%) japonica non-exonic TARs was verified through expression profiling in 10 tissue types using a re-array in which annotated genes and TARs were each represented by five independent probes. Subsequent analyses indicate that about 80% of the japonica TARs that were not assigned to annotated exons can be assigned to various putatively functional or structural elements of the rice genome, including splice variants, uncharacterized portions of incompletely annotated genes, antisense transcripts, duplicated gene fragments, and potential non-coding RNAs. These results provide a systematic characterization of non-exonic transcripts in rice and thus expand the current view of the complexity and dynamics of the rice transcriptome.
Rice is a very important food staple that feeds more than half the world's population. Two major Asian cultivated rice (Oryza sativa L.) subspecies, japonica and indica, show significant phenotypic variation in their stress responses. However, the molecular mechanisms underlying this phenotypic variation are still largely unknown. A common link among different stresses is that they produce an oxidative burst and result in an increase of reactive oxygen species (ROS). In this study, methyl viologen (MV) as a ROS agent was applied to investigate the rice oxidative stress response. We observed that 93-11 (indica) seedlings exhibited leaf senescence with severe lesions under MV treatment compared to Nipponbare (japonica). Whole-genome microarray experiments were conducted, and 1,062 probe sets were identified with gene expression level polymorphisms between the two rice cultivars in addition to differential expression under MV treatment, which were assigned as Core Intersectional Probesets (CIPs). These CIPs were analyzed by gene ontology (GO) and highlighted with enrichment GO terms related to toxin and oxidative stress responses as well as other responses. These GO term-enriched genes of the CIPs include glutathine S-transferases (GSTs), P450, plant defense genes, and secondary metabolism related genes such as chalcone synthase (CHS). Further insertion/deletion (InDel) and regulatory element analyses for these identified CIPs suggested that there may be some eQTL hotspots related to oxidative stress in the rice genome, such as GST genes encoded on chromosome 10. In addition, we identified a group of marker genes individuating the japonica and indica subspecies. In summary, we developed a new strategy combining biological experiments and data mining to study the possible molecular mechanism of phenotypic variation during oxidative stress between Nipponbare and 93-11. This study will aid in the analysis of the molecular basis of quantitative traits.
Over the past 10 years, genomes of cultivated rice cultivars and their wild counterparts have been sequenced although most efforts are focused on genome assembly and annotation of two major cultivated rice (Oryza sativa L.) subspecies, 93-11 (indica) and Nipponbare (japonica). To integrate information from genome assemblies and annotations for better analysis and application, we now introduce a comparative rice genome database, the Rice Genome Knowledgebase (RGKbase, http://rgkbase.big.ac.cn/RGKbase/). RGKbase is built to have three major components: (i) integrated data curation for rice genomics and molecular biology, which includes genome sequence assemblies, transcriptomic and epigenomic data, genetic variations, quantitative trait loci (QTLs) and the relevant literature; (ii) User-friendly viewers, such as Gbrowse, GeneBrowse and Circos, for genome annotations and evolutionary dynamics and (iii) Bioinformatic tools for compositional and synteny analyses, gene family classifications, gene ontology terms and pathways and gene co-expression networks. RGKbase current includes data from five rice cultivars and species: Nipponbare (japonica), 93-11 (indica), PA64s (indica), the African rice (Oryza glaberrima) and a wild rice species (Oryza brachyantha). We are also constantly introducing new datasets from variety of public efforts, such as two recent releases—sequence data from ∼1000 rice varieties, which are mapped into the reference genome, yielding ample high-quality single-nucleotide polymorphisms and insertions–deletions.
Calculating the metabolome size of species by genome-guided reconstruction of metabolic pathways misses all products from orphan genes and from enzymes lacking annotated genes. Hence, metabolomes need to be determined experimentally. Annotations by mass spectrometry would greatly benefit if peer-reviewed public databases could be queried to compile target lists of structures that already have been reported for a given species. We detail current obstacles to compile such a knowledge base of metabolites.
As an example, results are presented for rice. Two rice (oryza sativa) subspecies have been fully sequenced, oryza japonica and oryza indica. Several major small molecule databases were compared for listing known rice metabolites comprising PubChem, Chemical Abstracts, Beilstein, Patent databases, Dictionary of Natural Products, SetupX/BinBase, KNApSAcK DB, and finally those databases which were obtained by computational approaches, i.e. RiceCyc, KEGG, and Reactome. More than 5,000 small molecules were retrieved when searching these databases. Unfortunately, most often, genuine rice metabolites were retrieved together with non-metabolite database entries such as pesticides. Overlaps from database compound lists were very difficult to compare because structures were either not encoded in machine-readable format or because compound identifiers were not cross-referenced between databases.
We conclude that present databases are not capable of comprehensively retrieving all known metabolites. Metabolome lists are yet mostly restricted to genome-reconstructed pathways. We suggest that providers of (bio)chemical databases enrich their database identifiers to PubChem IDs and InChIKeys to enable cross-database queries. In addition, peer-reviewed journal repositories need to mandate submission of structures and spectra in machine readable format to allow automated semantic annotation of articles containing chemical structures. Such changes in publication standards and database architectures will enable researchers to compile current knowledge about the metabolome of species, which may extend to derived information such as spectral libraries, organ-specific metabolites, and cross-study comparisons.
Here we report that the change from the red seeds of wild rice to the white seeds of cultivated rice (Oryza sativa) resulted from the strong selective sweep of a single mutation, a frame-shift deletion within the Rc gene that is found in 97.9% of white rice varieties today. A second mutation, also within Rc, is present in less than 3% of white accessions surveyed. Haplotype analysis revealed that the predominant mutation originated in the japonica subspecies and crossed both geographic and sterility barriers to move into the indica subspecies. A little less than one Mb of japonica DNA hitchhiked with the rc allele into most indica varieties, suggesting that other linked domestication alleles may have been transferred from japonica to indica along with white pericarp color. Our finding provides evidence of active cultural exchange among ancient farmers over the course of rice domestication coupled with very strong, positive selection for a single white allele in both subspecies of O. sativa.
Understanding the history and origin of genetic mutations that have changed wild plants into crops can help us understand the history of the people who cultivated these plants. Rice is one of the oldest crops grown in Asia and it contains two different subspecies that are believed to have been domesticated in different locations by different people. Surprisingly, some of the genetic mutations responsible for domestication are common in all rice. We here show that a mutation in the Rc gene that changed the red seed of wild rice into the white seeds of modern rice is shared by a large majority of all rice varieties, regardless of subspecies. This transfer of genes requires contact among rice types and implies contact among the people who cultivated the different subspecies. We have traced the origin of the mutation in Rc to the japonica subspecies. As additional domestication genes are cloned and their evolutionary history described, we will see how many times and in how many directions such gene transfers have occurred.
Rice has been found in archaeological sites dating to 8000 bc, although the date of rice domestication is a matter of continuing debate. Two species of domesticated rice, Oryza sativa (Asian) and Oryza glaberrima (African) are grown globally. Numerous traits separate wild and domesticated rices including changes in: pericarp colour, dormancy, shattering, panicle architecture, tiller number, mating type and number and size of seeds.
Genetic studies using diverse methodologies have uncovered a deep population structure within domesticated rice. Two main groups, the indica and japonica subspecies, have been identified with several subpopulations existing within each group. The antiquity of the divide has been estimated at more than 100 000 years ago. This date far precedes domestication, supporting independent domestications of indica and japonica from pre-differentiated pools of the wild ancestor. Crosses between subspecies display sterility and segregate for domestication traits, indicating that different populations are fixed for different networks of alleles conditioning these traits. Numerous domestication QTLs have been identified in crosses between the subspecies and in crosses between wild and domesticated accessions of rice. Many of the QTLs cluster in the same genomic regions, suggesting that a single gene with pleiotropic effects or that closely linked clusters of genes underlie these QTL. Recently, several domestication loci have been cloned from rice, including the gene controlling pericarp colour and two loci for shattering. The distribution and evolutionary history of these genes gives insight into the domestication process and the relationship between the subspecies.
The evolutionary history of rice is complex, but recent work has shed light on the genetics of the transition from wild (O. rufipogon and O. nivara) to domesticated (O. sativa) rice. The types of genes involved and the geographic and genetic distribution of alleles will allow scientists to better understand our ancestors and breed better rice for our descendents.
Oryza sativa; domestication; shattering; pericarp colour; QTL; subpopulation structure; subspecies
Heat stress transcription factors (Hsfs) are the central regulators of defense response to heat stress. We identified a total of 25 rice Hsf genes by genome-wide analysis of rice (Oryza sativa L.) genome, including the subspecies of O. japonica and O. indica. Proteins encoded by OsHsfs were divided into three classes according to their structures. Digital Northern analysis showed that OsHsfs were expressed constitutively. The expressions of these OsHsfs in response to heat stress and oxidative stress differed among the members of the gene family. Promoter analysis identified a number of stress-related cis-elements in the promoter regions of these OsHsfs. No significant correlation, however, was found between the heat-shock responses of genes and their cis-elements. Overall, our results provide a foundation for future research of OsHsfs function.
Heat shock; Transcription factors; Rice; Protein structure; Expression analysis
Etoposide (epipodophyllotoxin) is a chemical commonly used as an anti-cancer drug which inhibits DNA synthesis by blocking topoisomerase II activity. Previous studies in animal cells have demonstrated that etoposide constitutes a genotoxic stress which may induce genomic instability including mobilization of normally quiescent transposable elements (TEs). However, it remained unknown whether similar genetically mutagenic effects could be imposed by etoposide in plant cells. Also, no information is available with regard to whether the drug may cause a perturbation of epigenetic stability in any organism.
To investigate whether etoposide could generate genetic and/or epigenetic instability in plant cells, we applied etoposide to germinating seeds of six cultivated rice (Oryza sativa L.) genotypes including both subspecies, japonica and indica. Based on the methylation-sensitive gel-blotting results, epigenetic changes in DNA methylation of three TEs (Tos17, Osr23 and Osr36) and two protein-encoding genes (Homeobox and CDPK-related genes) were detected in the etoposide-treated plants (S0 generation) in four of the six studied japonica cultivars, Nipponbare, RZ1, RZ2, and RZ35, but not in the rest japonica cultivar (Matsumae) and the indica cultivar (93-11). DNA methylation changes in the etoposide-treated S0 rice plants were validated by bisulfite sequencing at both of two analyzed loci (Tos17 and Osr36). Transpositional activity was tested for eight TEs endogenous to the rice genome in both the S0 plants and their selfed progenies (S1 and S2) of one of the cultivars, RZ1, which manifested heritable phenotypic variations. Results indicated that no transposition occurred in the etoposide-treated S0 plants for any of the TEs. Nonetheless, a MITE transposon, mPing, showed rampant mobilization in the S1 and S2 progenies descended from the drug-treated S0 plants.
Our results demonstrate that etoposide imposes a similar genotoxic stress on plant cells as it does on animal and human cells, which may induce transgenerational genomic instability by instigating transpositional activation of otherwise dormant TEs. In addition, we show for the first time that etoposide may induce epigenetic instability in the form of altered DNA methylation patterns in eukaryotes. However, penetrance of the genotoxic effects of etoposide on plant cells, as being reflected as genetic and epigenetic instability, appears to be in a strictly genotype- and/or generation-dependent manner.
Rice (Oryza sativa L.) is a model organism for the functional genomics of monocotyledonous plants since the genome size is considerably smaller than those of other monocotyledonous plants. Although highly accurate genome sequences of indica and japonica rice are available, additional resources such as full-length complementary DNA (FL-cDNA) sequences are also indispensable for comprehensive analyses of gene structure and function. We cross-referenced 28.5K individual loci in the rice genome defined by mapping of 578K FL-cDNA clones with the 56K loci predicted in the TIGR genome assembly. Based on the annotation status and the presence of corresponding cDNA clones, genes were classified into 23K annotated expressed (AE) genes, 33K annotated non-expressed (ANE) genes, and 5.5K non-annotated expressed (NAE) genes. We developed a 60mer oligo-array for analysis of gene expression from each locus. Analysis of gene structures and expression levels revealed that the general features of gene structure and expression of NAE and ANE genes were considerably different from those of AE genes. The results also suggested that the cloning efficiency of rice FL-cDNA is associated with the transcription activity of the corresponding genetic locus, although other factors may also have an effect. Comparison of the coverage of FL-cDNA among gene families suggested that FL-cDNA from genes encoding rice- or eukaryote-specific domains, and those involved in regulatory functions were difficult to produce in bacterial cells. Collectively, these results indicate that rice genes can be divided into distinct groups based on transcription activity and gene structure, and that the coverage bias of FL-cDNA clones exists due to the incompatibility of certain eukaryotic genes in bacteria.
Rice is a major food staple for the world’s population and serves as a model species in cereal genome research. The Beijing Genomics Institute (BGI) has long been devoting itself to sequencing, information analysis and biological research of the rice and other crop genomes. In order to facilitate the application of the rice genomic information and to provide a foundation for functional and evolutionary studies of other important cereal crops, we implemented our Rice Information System (BGI-RIS), the most up-to-date integrated information resource as well as a workbench for comparative genomic analysis. In addition to comprehensive data from Oryza sativa L. ssp. indica sequenced by BGI, BGI-RIS also hosts carefully curated genome information from Oryza sativa L. ssp. japonica and EST sequences available from other cereal crops. In this resource, sequence contigs of indica (93-11) have been further assembled into Mbp-sized scaffolds and anchored onto the rice chromosomes referenced to physical/genetic markers, cDNAs and BAC-end sequences. We have annotated the rice genomes for gene content, repetitive elements, gene duplications (tandem and segmental) and single nucleotide polymorphisms between rice subspecies. Designed as a basic platform, BGI-RIS presents the sequenced genomes and related information in systematic and graphical ways for the convenience of in-depth comparative studies (http://rise.genomics.org.cn/).
The MIPS Rice (Oryza sativa) database (MOsDB; http://mips.gsf.de/proj/rice) provides a comprehensive data collection dedicated to the genome information of rice. Rice (O. sativa L.) is one of the most important food crops for over half the world's population and serves as a major model system in cereal genome research. MOsDB integrates data from two publicly available rice genomic sequences, O. sativa L. ssp. indica and O. sativa L. ssp. japonica. Besides regularly updated rice genome sequence information, MOsDB provides an integrated resource for associated analysis data, e.g. internal and external annotation information as well as a complex characterization of all annotated rice genes. The MOsDB web interface supports various search options and allows browsing the database content. MOsDB is continuously expanding to include an increasing range of data type and the growing amount of information on the rice genome.
Rice is staple food for more than half of the world’s population including two billion Asians, who obtain 60-70% of their energy intake from rice and its derivatives. To meet the growing demand from human population, rice varieties with higher yield potential and greater yield stability need to be developed. The favourable alleles for yield and yield contributing traits are distributed among two subspecies i.e., indica and japonica of cultivated rice (Oryza sativa L.). Identification of novel favourable alleles in indica/japonica will pave way to marker-assisted mobilization of these alleles in to a genetic background to break genetic barriers to yield.
A new plant type (NPT) based mapping population of 310 recombinant inbred lines (RILs) was used to map novel genomic regions and QTL hotspots influencing yield and eleven yield component traits. We identified major quantitative trait loci (QTLs) for days to 50% flowering (R2 = 25%, LOD = 14.3), panicles per plant (R2 = 19%, LOD = 9.74), flag leaf length (R2 = 22%, LOD = 3.05), flag leaf width (R2 = 53%, LOD = 46.5), spikelets per panicle (R2 = 16%, LOD = 13.8), filled grains per panicle (R2 = 22%, LOD = 15.3), percent spikelet sterility (R2 = 18%, LOD = 14.24), thousand grain weight (R2 = 25%, LOD = 12.9) and spikelet setting density (R2 = 23%, LOD = 15) expressing over two or more locations by using composite interval mapping. The phenotypic variation (R2) ranged from 8 to 53% for eleven QTLs expressing across all three locations. 19 novel QTLs were contributed by the NPT parent, Pusa1266. 15 QTL hotpots on eight chromosomes were identified for the correlated traits. Six epistatic QTLs effecting five traits at two locations were identified. A marker interval (RM3276-RM5709) on chromosome 4 harboring major QTLs for four traits was identified.
The present study reveals that favourable alleles for yield and yield contributing traits were distributed among two subspecies of rice and QTLs were co-localized in different genomic regions. QTL hotspots will be useful for understanding the common genetic control mechanism of the co-localized traits and selection for beneficial allele at these loci will result in a cumulative increase in yield due to the integrative positive effect of various QTLs. The information generated in the present study will be useful to fine map and to identify the genes underlying major robust QTLs and to transfer all favourable QTLs to one genetic background to break genetic barriers to yield for sustained food security.
The domestication of Asian rice (Oryza sativa) was a complex process punctuated by episodes of introgressive hybridization among and between subpopulations. Deep genetic divergence between the two main varietal groups (Indica and Japonica) suggests domestication from at least two distinct wild populations. However, genetic uniformity surrounding key domestication genes across divergent subpopulations suggests cultural exchange of genetic material among ancient farmers.
In this study, we utilize a novel 1,536 SNP panel genotyped across 395 diverse accessions of O. sativa to study genome-wide patterns of polymorphism, to characterize population structure, and to infer the introgression history of domesticated Asian rice. Our population structure analyses support the existence of five major subpopulations (indica, aus, tropical japonica, temperate japonica and GroupV) consistent with previous analyses. Our introgression analysis shows that most accessions exhibit some degree of admixture, with many individuals within a population sharing the same introgressed segment due to artificial selection. Admixture mapping and association analysis of amylose content and grain length illustrate the potential for dissecting the genetic basis of complex traits in domesticated plant populations.
Genes in these regions control a myriad of traits including plant stature, blast resistance, and amylose content. These analyses highlight the power of population genomics in agricultural systems to identify functionally important regions of the genome and to decipher the role of human-directed breeding in refashioning the genomes of a domesticated species.
In Asian cultivated rice (Oryza sativa L.), aroma is one of the most valuable traits in grain quality and 2-ACP is the main volatile compound contributing to the characteristic popcorn-like odour of aromatic rices. Although the major locus for grain fragrance (frg gene) has been described recently in Basmati rice, this gene has not been characterised in true japonica varieties and molecular information available on the genetic diversity and evolutionary origin of this gene among the different varieties is still limited. Here we report on characterisation of the frg gene in the Azucena variety, one of the few aromatic japonica cultivars. We used a RIL population from a cross between Azucena and IR64, a non-aromatic indica, the reference genomic sequence of Nipponbare (japonica) and 93–11 (indica) as well as an Azucena BAC library, to identify the major fragance gene in Azucena. We thus identified a betaine aldehyde dehydrogenase gene, badh2, as the candidate locus responsible for aroma, which presented exactly the same mutation as that identified in Basmati and Jasmine-like rices. Comparative genomic analyses showed very high sequence conservation between Azucena and Nipponbare BADH2, and a MITE was identified in the promotor region of the BADH2 allele in 93–11. The badh2 mutation and MITE were surveyed in a representative rice collection, including traditional aromatic and non-aromatic rice varieties, and strongly suggested a monophylogenetic origin of this badh2 mutation in Asian cultivated rices. Altogether these new data are discussed here in the light of current hypotheses on the origin of rice genetic diversity.
Miniature inverted–repeat transposable elements (MITEs) are predicted to play important roles on genome evolution. We developed a BLASTN-based approach for de novo identification of MITEs and systematically analyzed MITEs in rice genome. The genome of rice cultivar Nipponbare (Oryza sativa ssp. japonica) harbors 178,533 MITE-related sequences classified into 338 families. Pairwise nucleotide diversity and phylogenetic tree analysis indicated that individual MITE families were resulted from one or multiple rounds of amplification bursts. The timing of amplification burst varied considerably between different MITE families or subfamilies. MITEs are associated with 23,623 (58.2%) genes in rice genome. At least 7,887 MITEs are transcribed and more than 3,463 were transcribed with rice genes. The MITE sequences transcribed with rice coding genes form 1,130 pairs of potential natural sense/antisense transcripts. MITEs generate 23.5% (183,837 of 781,885) of all small RNAs identified from rice. Some MITE families generated small RNAs mainly from the terminals, while other families generated small RNAs predominantly from the central region. More than half (51.8%) of the MITE-derived small RNAs were generated exclusively by MITEs located away from genes. Genome-wide analysis showed that genes associated with MITEs have significantly lower expression than genes away from MITEs. Approximately 14.8% of loci with full-length MITEs have presence/absence polymorphism between rice cultivars 93-11 (O. sativa ssp. indica) and Nipponbare. Considering that different sets of genes may be regulated by MITE-derived small RNAs in different genotypes, MITEs provide considerable diversity for O. sativa.
MITEs; amplification; gene expression; evolution; small RNA
DNA methylation plays important biological roles in plants and animals. To examine the rice genomic methylation landscape and assess its functional significance, we generated single-base resolution DNA methylome maps for Asian cultivated rice Oryza sativa ssp. japonica, indica and their wild relatives, Oryza rufipogon and Oryza nivara.
The overall methylation level of rice genomes is four times higher than that of Arabidopsis. Consistent with the results reported for Arabidopsis, methylation in promoters represses gene expression while gene-body methylation generally appears to be positively associated with gene expression. Interestingly, we discovered that methylation in gene transcriptional termination regions (TTRs) can significantly repress gene expression, and the effect is even stronger than that of promoter methylation. Through integrated analysis of genomic, DNA methylomic and transcriptomic differences between cultivated and wild rice, we found that primary DNA sequence divergence is the major determinant of methylational differences at the whole genome level, but DNA methylational difference alone can only account for limited gene expression variation between the cultivated and wild rice. Furthermore, we identified a number of genes with significant difference in methylation level between the wild and cultivated rice.
The single-base resolution methylomes of rice obtained in this study have not only broadened our understanding of the mechanism and function of DNA methylation in plant genomes, but also provided valuable data for future studies of rice epigenetics and the epigenetic differentiation between wild and cultivated rice.
Cultivated and wild rice; Methylomes; Transcriptional termination regions (TTRs); Gene expression
Oryza sativa or Asian cultivated rice is one of the major cereal grass species domesticated for human food use during the Neolithic. Domestication of this species from the wild grass Oryza rufipogon was accompanied by changes in several traits, including seed shattering, percent seed set, tillering, grain weight, and flowering time. Quantitative trait locus (QTL) mapping has identified three genomic regions in chromosome 3 that appear to be associated with these traits. We would like to study whether these regions show signatures of selection and whether the same genetic basis underlies the domestication of different rice varieties. Fragments of 88 genes spanning these three genomic regions were sequenced from multiple accessions of two major varietal groups in O. sativa—indica and tropical japonica—as well as the ancestral wild rice species O. rufipogon. In tropical japonica, the levels of nucleotide variation in these three QTL regions are significantly lower compared to genome-wide levels, and coalescent simulations based on a complex demographic model of rice domestication indicate that these patterns are consistent with selection. In contrast, there is no significant reduction in nucleotide diversity in the homologous regions in indica rice. These results suggest that there are differences in the genetic and selective basis for domestication between these two Asian rice varietal groups.
China is rich of germplasm resources of common wild rice (Oryza rufipogon Griff.) and Asian cultivated rice (O. sativa L.) which consists of two subspecies, indica and japonica. Previous studies have shown that China is one of the domestication centers of O. sativa. However, the geographic origin and the domestication times of O. sativa in China are still under debate. To settle these disputes, six chloroplast loci and four mitochondrial loci were selected to examine the relationships between 50 accessions of Asian cultivated rice and 119 accessions of common wild rice from China based on DNA sequence analysis in the present study. The results indicated that Southern China is the genetic diversity center of O. rufipogon and it might be the primary domestication region of O. sativa. Molecular dating suggested that the two subspecies had diverged 0.1 million years ago, much earlier than the beginning of rice domestication. Genetic differentiations and phylogeography analyses indicated that indica was domesticated from tropical O. rufipogon while japonica was domesticated from O. rufipogon which located in higher latitude. These results provided molecular evidences for the hypotheses of (i) Southern China is the origin center of O. sativa in China and (ii) the two subspecies of O. sativa were domesticated multiple times.
Emerging evidence supports that cooked rice (Oryza sativa L.) contains metabolites with biomedical activities, yet little is known about the genetic diversity that is responsible for metabolite variation and differences in health traits. Metabolites from ten diverse varieties of cooked rice were detected using ultra performance liquid chromatography coupled to mass spectrometry. A total of 3,097 compounds were detected, of which 25% differed among the ten varieties. Multivariate analyses of the metabolite profiles showed that the chemical diversity among the varieties cluster according to their defined subspecies classifications: indica, japonica, and aus. Metabolite-specific genetic diversity in rice was investigated by analyzing a collection of single nucleotide polymorphisms (SNPs) in genes from biochemical pathways of nutritional importance. Two classes of bioactive compounds, phenolics and vitamin E, contained nonsynonymous SNPs and SNPs in the 5′ and 3′ untranslated regions for genes in their biosynthesis pathways. Total phenolics and tocopherol concentrations were determined to examine the effect of the genetic diversity among the ten varieties. Per gram of cooked rice, total phenolics ranged from 113.7 to 392.6 µg (gallic acid equivalents), and total tocopherols ranged between 7.2 and 20.9 µg. The variation in the cooked rice metabolome and quantities of bioactive components supports that the SNP-based genetic diversity influenced nutritional components in rice, and that this approach may guide rice improvement strategies for plant and human health.
The effects of viral infection involve concomitant plant gene variations and cellular changes. A simple system is required to assess the complexity of host responses to viral infection. The genome of the Rice yellow mottle virus (RYMV) is a single-stranded RNA with a simple organisation. It is the most well-known monocotyledon virus model. Several studies on its biology, structure and phylogeography have provided a suitable background for further genetic studies. 12 rice chromosome sequences are now available and provide strong support for genomic studies, particularly physical mapping and gene identification.
The present data, obtained through the cDNA-AFLP technique, demonstrate differential responses to RYMV of two different rice cultivars, i.e. susceptible IR64 (Oryza sativa indica), and partially resistant Azucena (O. s. japonica). This RNA profiling provides a new original dataset that will enable us to gain greater insight into the RYMV/rice interaction and the specificity of the host response. Using the SIM4 subroutine, we took the intron/exon structure of the gene into account and mapped 281 RYMV stress responsive (RSR) transcripts on 12 rice chromosomes corresponding to 234 RSR genes. We also mapped previously identified deregulated proteins and genes involved in partial resistance and thus constructed the first global physical map of the RYMV/rice interaction. RSR transcripts on rice chromosomes 4 and 10 were found to be not randomly distributed. Seven genes were identified in the susceptible and partially resistant cultivars, and transcripts were colocalized for these seven genes in both cultivars. During virus infection, many concomitant plant gene expression changes may be associated with host changes caused by the infection process, general stress or defence responses. We noted that some genes (e.g. ABC transporters) were regulated throughout the kinetics of infection and differentiated susceptible and partially resistant hosts.
We enhanced the first RYMV/rice interaction map by combining information from the present study and previous studies on proteins and ESTs regulated during RYMV infection, thus providing a more comprehensive view on genes related to plant responses. This combined map provides a new tool for exploring molecular mechanisms underlying the RYMV/rice interaction.