Chinese cabbage (Brassica rapa ssp. pekinensis) is a member of one of the most important leaf vegetables grown worldwide, which has experienced thousands of years in cultivation and artificial selection. The entire Chinese cabbage genome sequence, and more than forty thousand proteins have been obtained to date. The genome has undergone triplication events since its divergence from Arabidopsis thaliana (13 to 17 Mya), however a high degree of sequence similarity and conserved genome structure remain between the two species. Arabidopsis is therefore a viable reference species for comparative genomics studies. Variation in the number of members in gene families due to genome triplication may contribute to the broad range of phenotypic plasticity, and increased tolerance to environmental extremes observed in Brassica species. Transcription factors are important regulators involved in plant developmental and physiological processes. The AP2/ERF proteins, one of the most important families of transcriptional regulators, play a crucial role in plant growth, and in response to biotic and abiotic stressors. Our analysis will provide resources for understanding the tolerance mechanisms in Brassica rapa ssp. pekinensis.
In the present study, 291 putative AP2/ERF transcription factor proteins were identified from the Chinese cabbage genome database, and compared with proteins from 15 additional species. The Chinese cabbage AP2/ERF superfamily was classified into four families, including AP2, ERF, RAV, and Soloist. The ERF family was further divided into DREB and ERF subfamilies. The AP2/ERF superfamily was subsequently divided into 15 groups. The identification, classification, phylogenetic reconstruction, conserved motifs, chromosome distribution, functional annotation, expression patterns, and interaction networks of the AP2/ERF transcription factor superfamily were predicted and analyzed. Distribution mapping results showed AP2/ERF superfamily genes were localized on the 10 Chinese cabbage chromosomes. AP2/ERF transcription factor expression levels exhibited differences among six tissue types based on expressed sequence tags (ESTs). In the AP2/ERF superfamily, 214 orthologous genes were identified between Chinese cabbage and Arabidopsis. Orthologous gene interaction networks were constructed, and included seven CBF and four AP2 genes, primarily involved in cold regulatory pathways and ovule development, respectively.
The evolution of the AP2/ERF transcription factor superfamily in Chinese cabbage resulted from genome triplication and tandem duplications. A comprehensive analysis of the physiological functions and biological roles of AP2/ERF superfamily genes in Chinese cabbage is required to fully elucidate AP2/ERF, which provides us with rich resources and opportunities to understand crop stress tolerance mechanisms.
Chinese cabbage; AP2/ERF; Stress tolerance; Gene expression; Interaction network; Protein annotation
Brassica rapa is an economically important crop and a model plant for studies concerning polyploidization and the evolution of extreme morphology. The multinational B. rapa Genome Sequencing Project (BrGSP) was launched in 2003. In 2008, next generation sequencing technology was used to sequence the B. rapa genome. Several maps concerning B. rapa pseudochromosome assembly have been published but their coverage of the genome is incomplete, anchoring approximately 73.6% of the scaffolds on to chromosomes. Therefore, a new genetic map to aid pseudochromosome assembly is required.
This study concerns the construction of a reference genetic linkage map for Brassica rapa, forming the backbone for anchoring sequence scaffolds of the B. rapa genome resulting from recent sequencing efforts. One hundred and nineteen doubled haploid (DH) lines derived from microspore cultures of an F1 cross between a Chinese cabbage (B. rapa ssp. pekinensis) DH line (Z16) and a rapid cycling inbred line (L144) were used to construct the linkage map. PCR-based insertion/deletion (InDel) markers were developed by re-sequencing the two parental lines. The map comprises a total of 507 markers including 415 InDels and 92 SSRs. Alignment and orientation using SSR markers in common with existing B. rapa linkage maps allowed ten linkage groups to be identified, designated A01-A10. The total length of the linkage map was 1234.2 cM, with an average distance of 2.43 cM between adjacent marker loci. The lengths of linkage groups ranged from 71.5 cM to 188.5 cM for A08 and A09, respectively. Using the developed linkage map, 152 scaffolds were anchored on to the chromosomes, encompassing more than 82.9% of the B. rapa genome. Taken together with the previously available linkage maps, 183 scaffolds were anchored on to the chromosomes and the total coverage of the genome was 88.9%.
The development of this linkage map is vital for the integration of genome sequences and genetic information, and provides a useful resource for the international Brassica research community.
Growth regulating factors (GRFs) have been shown to play important roles in plant growth and development. GRF genes represent a large multigene family in plants. Recently, genome-wide structural and evolutionary analyses of the GRF gene families in Arabidopsis, rice, and maize have been reported. Chinese cabbage (Brassica rapa L. ssp. pekinensis) is one of the most important vegetables for agricultural production, and a full genome assembly for this plant has recently been released. However, to our knowledge, the GRF gene family from Chinese cabbage has not been characterized in detail.
In this study, genome-wide analysis was carried out to identify all the GRF genes in Chinese cabbage. Based on the complete Chinese cabbage genome sequence, 17 nonredundant GRF genes, named BrGRFs, were identified and classified into six groups. Phylogenetic analysis of the translated GRF protein sequences from Chinese cabbage, Arabidopsis, and rice indicated that the Chinese cabbage GRF proteins were more closely related to the GRF proteins of Arabidopsis than to those of rice. Expression profile analysis showed that the BrGRF genes had uneven transcript levels in different organs or tissues, and the transcription of most BrGRF genes was induced by gibberellic acid (GA3) treatment. Additionally, over-expression of BrGRF8 in transgenic Arabidopsis plants increased the sizes of the leaves and other organs by regulation of cell proliferation.
The data obtained from this investigation will contribute to a better understanding of the characteristics of the GRF gene family in Chinese cabbage, and provide a basis for further studies to investigate GRF protein function during development as well as for Chinese cabbage-breeding programs to improve yield and/or head size.
Electronic supplementary material
The online version of this article (doi:10.1186/1471-2164-15-807) contains supplementary material, which is available to authorized users.
Chinese cabbage; Expression; Gene family; GRF; Transgenic lines
The Brassicaceae family includes the model plant Arabidopsis thaliana as well as a number of agronomically important species such as oilseed crops (in particular Brassica napus, B. juncea and B. rapa) and vegetables (eg. B. rapa and B. oleracea).
Separated by only 10-20 million years, Brassica species and Arabidopsis thaliana are closely related, and it is expected that knowledge obtained relating to Arabidopsis growth and development can be translated into Brassicas for crop improvement. Moreover, certain aspects of plant development are sufficiently different between Brassica and Arabidopsis to warrant studies to be carried out directly in the crop species. However, mutating individual genes in the amphidiploid Brassicas such as B. napus and B. juncea may, on the other hand, not give rise to expected phenotypes as the genomes of these species can contain up to six orthologues per single-copy Arabidopsis gene. In order to elucidate and possibly exploit the function of redundant genes for oilseed rape crop improvement, it may therefore be more efficient to study the effects in one of the diploid Brassica species such as B. rapa. Moreover, the ongoing sequencing of the B. rapa genome makes this species a highly attractive model for Brassica research and genetic resource development.
Seeds from the diploid Brassica A genome species, B. rapa were treated with ethyl methane sulfonate (EMS) to produce a TILLING (Targeting Induced Local Lesions In Genomes) population for reverse genetics studies. We used the B. rapa genotype, R-o-18, which has a similar developmental ontogeny to an oilseed rape crop. Hence this resource is expected to be well suited for studying traits with relevance to yield and quality of oilseed rape. DNA was isolated from a total of 9,216 M2 plants and pooled to form the basis of the TILLING platform. Analysis of six genes revealed a high level of mutations with a density of about one per 60 kb. This analysis also demonstrated that screening a 1 kb amplicon in just one third of the population (3072 M2 plants) will provide an average of 68 mutations and a 97% probability of obtaining a stop-codon mutation resulting in a truncated protein. We furthermore calculated that each plant contains on average ~10,000 mutations and due to the large number of plants, it is predicted that mutations in approximately half of the GC base pairs in the genome exist within this population.
We have developed the first EMS TILLING resource in the diploid Brassica species, B. rapa. The mutation density in this population is ~1 per 60 kb, which makes it the most densely mutated diploid organism for which a TILLING population has been published. This resource is publicly available through the RevGenUK reverse genetics platform http://revgenuk.jic.ac.uk.
The western flower thrips (Frankliniella occidentalis [Pergande]) is one of the most important insect herbivores of cultivated plants. However, no pesticide provides complete control of this species, and insecticide resistance has emerged around the world. We previously reported the important role of jasmonate (JA) in the plant's immediate response to thrips feeding by using an Arabidopsis leaf disc system. In this study, as the first step toward practical use of JA in thrips control, we analyzed the effect of JA-regulated Arabidopsis defense at the whole plant level on thrips behavior and life cycle at the population level over an extended period. We also studied the effectiveness of JA-regulated plant defense on thrips damage in Chinese cabbage (Brassica rapa subsp. pekinensis).
Thrips oviposited more on Arabidopsis JA-insensitive coi1-1 mutants than on WT plants, and the population density of the following thrips generation increased on coi1-1 mutants. Moreover, thrips preferred coi1-1 mutants more than WT plants. Application of JA to WT plants before thrips attack decreased the thrips population. To analyze these important functions of JA in a brassica crop plant, we analyzed the expression of marker genes for JA response in B. rapa. Thrips feeding induced expression of these marker genes and significantly increased the JA content in B. rapa. Application of JA to B. rapa enhanced plant resistance to thrips, restricted oviposition, and reduced the population density of the following generation.
Our results indicate that the JA-regulated plant defense restricts thrips performance and preference, and plays an important role in the resistance of Arabidopsis and B. rapa to thrips damage.
The cabbage, Brassica oleracea var. capitata L., has a distinguishable phenotype within the genus Brassica. Despite the economic and genetic importance of cabbage, there is little genomic data for cabbage, and most studies of Brassica are focused on other species or other B. oleracea subspecies. The lack of genomic data for cabbage, a non-model organism, hinders research on its molecular biology. Hence, the construction of reliable transcriptomic data based on high-throughput sequencing technologies is needed to enhance our understanding of cabbage and provide genomic information for future work.
We constructed cDNAs from total RNA isolated from the roots, leaves, flowers, seedlings, and calcium-limited seedling tissues of two cabbage genotypes: 102043 and 107140. We sequenced a total of six different samples using the Illumina HiSeq platform, producing 40.5 Gbp of sequence data comprising 401,454,986 short reads. We assembled 205,046 transcripts (≥ 200 bp) using the Velvet and Oases assembler and predicted 53,562 loci from the transcripts. We annotated 35,274 of the loci with 55,916 plant peptides in the Phytozome database. The average length of the annotated loci was 1,419 bp. We confirmed the reliability of the sequencing assembly using reverse-transcriptase PCR to identify tissue-specific gene candidates among the annotated loci.
Our study provides valuable transcriptome sequence data for B. oleracea var. capitata L., offering a new resource for studying B. oleracea and closely related species. Our transcriptomic sequences will enhance the quality of gene annotation and functional analysis of the cabbage genome and serve as a material basis for future genomic research on cabbage. The sequencing data from this study can be used to develop molecular markers and to identify the extreme differences among the phenotypes of different species in the genus Brassica.
The genomes of non-heading Chinese cabbage (Brassica rapa ssp. chinensis), heading Chinese cabbage (Brassica rapa ssp. pekinensis) and their close relative Arabidopsis thaliana have provided important resources for studying the evolution and genetic improvement of cruciferous plants. Natural growing conditions present these plants with a variety of physiological challenges for which they have a repertoire of genes that ensure adaptability and normal growth. We investigated the differential expressions of genes that control adaptability and development in plants growing in the natural environment to study underlying mechanisms of their expression.
Using digital gene expression tag profiling, we constructed an expression profile to identify genes related to important agronomic traits under natural growing conditions. Among three non-heading Chinese cabbage cultivars, we found thousands of genes that exhibited significant differences in expression levels at five developmental stages. Through comparative analysis and previous reports, we identified several candidate genes associated with late flowering, cold tolerance, self-incompatibility, and leaf color. Two genes related to cold tolerance were verified using quantitative real-time PCR.
We identified a large number of genes associated with important agronomic traits of non-heading Chinese cabbage. This analysis will provide a wealth of resources for molecular-assisted breeding of cabbage. The raw data and detailed results of this analysis are available at the website http://nhccdata.njau.edu.cn.
Non-heading Chinese cabbage; Expression profile; Differentially expressed genes; Protein function annotation; Chromosome distribution; Agronomic traits
CYP83A1 and CYP83B1 are two key synthesis genes in the glucosinolate biosynthesis pathway. CYP83A1 mainly metabolizes the aliphatic oximes to form aliphatic glucosinolate and CYP83B1 mostly catalyzes aromatic oximes to synthesis corresponding substrates for aromatic and indolic glucosinolates. In this study, two CYP83A1 genes named BcCYP83A1-1 (JQ289997), BcCYP83A1-2 (JQ289996) respectively and one CYP83B1 (BcCYP83B1, HM347235) gene were cloned from the leaves of pak choi (Brassica rapa L. ssp. chinensis var. communis (N. Tsen & S.H. Lee) Hanelt) “Hangzhou You Dong Er” cultivar. Their ORFs were 1506, 1509 and 1500 bp in length, encoding 501, 502 and 499 amino acids, respectively. The predicted amino acid sequences of CYP83A1-1, CYP83A1-2 and CYP83B1 shared high sequence identity of 87.65, 86.48 and 95.59% to the corresponding ones in Arabidopsis, and 98.80, 98.61 and 98.80% to the corresponding ones in Brassica pekinensis (Chinese cabbage), respectively. Quantitative real-time PCR analysis indicated that both CYP83A1 and CYP83B1 expressed in roots, leaves and petioles of pak choi, while the transcript abundances of CYP83A1 were higher in leaves than in petioles and roots, whereas CYP83B1 showed higher abundances in roots. The expression levels of glucosinolate biosynthetic genes were consistent with the glucosinolate profile accumulation in shoots of seven cultivars and three organs. The isolation and characterization of the glucosinolate synthesis genes in pak choi would promote the way for further development of agronomic traits via genetic engineering.
pak choi; glucosinolate; phylogenetic analysis; quantitative real-time PCR
Brassica rapa, which is closely related to
Arabidopsis thaliana, is an important crop and a
model plant for studying genome evolution via
polyploidization. We report the current understanding of the
genome structure of B. rapa and efforts for the
whole-genome sequencing of the species. The tribe
Brassicaceae, which comprises ca. 240 species,
descended from a common hexaploid ancestor with a basic genome
similar to that of Arabidopsis. Chromosome
rearrangements, including fusions and/or fissions, resulted in
the present-day “diploid” Brassica
species with variation in chromosome number and phenotype.
Triplicated genomic segments of B. rapa are
collinear to those of A. thaliana with InDels.
The genome triplication has led to an approximately 1.7-fold
increase in the B. rapa gene number compared to
that of A. thaliana. Repetitive DNA of B.
rapa has also been extensively amplified and has
diverged from that of A. thaliana. For its
whole-genome sequencing, the Brassica rapa Genome
Sequencing Project (BrGSP) consortium has developed suitable
genomic resources and constructed genetic and physical maps.
Ten chromosomes of B. rapa are being allocated to
BrGSP consortium participants, and each chromosome will be
sequenced by a BAC-by-BAC approach. Genome sequencing of
B. rapa will offer a new perspective for plant
biology and evolution in the context of polyploidization.
The species Brassica rapa includes various vegetable crops. Production of these vegetable crops is usually impaired by heat stress. Some microRNAs (miRNAs) in Arabidopsis have been considered to mediate gene silencing in plant response to abiotic stress. However, it remains unknown whether or what miRNAs play a role in heat resistance of B. rapa. To identify genomewide conserved and novel miRNAs that are responsive to heat stress in B. rapa, we defined temperature thresholds of non-heading Chinese cabbage (B. rapa ssp. chinensis) and constructed small RNA libraries from the seedlings that had been exposed to high temperature (46 °C) for 1 h. By deep sequencing and data analysis, we selected a series of conserved and novel miRNAs that responded to heat stress. In total, Chinese cabbage shares at least 35 conserved miRNA families with Arabidopsis thaliana. Among them, five miRNA families were responsive to heat stress. Northern hybridization and real-time PCR showed that the conserved miRNAs bra-miR398a and bra-miR398b were heat-inhibitive and guided heat response of their target gene, BracCSD1; and bra-miR156h and bra-miR156g were heat-induced and its putative target BracSPL2 was down-regulated. According to the criteria of miRNA and miRNA* that form a duplex, 21 novel miRNAs belonging to 19 miRNA families were predicted. Of these, four were identified to be heat-responsive by Northern blotting and/or expression analysis of the putative targets. The two novel miRNAs bra-miR1885b.3 and bra-miR5718 negatively regulated their putative target genes. 5′-Rapid amplification of cDNA ends PCR indicated that three novel miRNAs cleaved the transcripts of their target genes where their precursors may have evolved from. These results broaden our perspective on the important role of miRNA in plant responses to heat.
Brassica rapa; heat response; miRNA; small RNA
In view of the immense value of Brassica rapa in the fields of agriculture and molecular biology, the multinational Brassica rapa Genome Sequencing Project (BrGSP) was launched in 2003 by five countries. The developing BrGSP has valuable resources for the community, including a reference genetic map and seed BAC sequences. Although the initial B. rapa linkage map served as a reference for the BrGSP, there was ambiguity in reconciling the linkage groups with the ten chromosomes of B. rapa. Consequently, the BrGSP assigned each of the linkage groups to the project members as chromosome substitutes for sequencing.
We identified simple sequence repeat (SSR) motifs in the B. rapa genome with the sequences of seed BACs used for the BrGSP. By testing 749 amplicons containing SSR motifs, we identified polymorphisms that enabled the anchoring of 188 BACs onto the B. rapa reference linkage map consisting of 719 loci in the 10 linkage groups with an average distance of 1.6 cM between adjacent loci. The anchored BAC sequences enabled the identification of 30 blocks of conserved synteny, totaling 534.9 cM in length, between the genomes of B. rapa and Arabidopsis thaliana. Most of these were consistent with previously reported duplication and rearrangement events that differentiate these genomes. However, we were able to identify the collinear regions for seven additional previously uncharacterized sections of the A genome. Integration of the linkage map with the B. rapa cytogenetic map was accomplished by FISH with probes representing 20 BAC clones, along with probes for rDNA and centromeric repeat sequences. This integration enabled unambiguous alignment and orientation of the maps representing the 10 B. rapa chromosomes.
We developed a second generation reference linkage map for B. rapa, which was aligned unambiguously to the B. rapa cytogenetic map. Furthermore, using our data, we confirmed and extended the comparative genome analysis between B. rapa and A. thaliana. This work will serve as a basis for integrating the genetic, physical, and chromosome maps of the BrGSP, as well as for studies on polyploidization, speciation, and genome duplication in the genus Brassica.
Chinese cabbage is an important leaf vegetable that experienced long-term cultivation and artificial selection. Dof (DNA-binding One Zinc Finger) transcription factors, with a highly conserved Dof domain, are members of a major plant-specific transcription factor family that play important roles in many plant biological processes. The Dof family transcription factors, one of the most important families of transcriptional regulators in higher plants, are involved in massive aspects of plant growth, development, and response to abiotic stresses. Our study will supply resources for understanding how Dof transcription factors respond to abiotic stress and the interaction network of these genes in tolerance mechanism.
In this study, we performed a comprehensive analysis of Dof family factors in Chinese cabbage. In total, 76 genes encoding BraDof family transcription factor were identified from Chinese cabbage, and those BraDof factors were divided into nine classes. Fifteen motifs were found based on Dof amino acid sequence alignments. Chromosome locations and gene duplications of BraDof family genes were also analyzed. Ten duplicate events of BraDof genes were discovered in Chinese cabbage chromosomes. The uneven distribution of BraDof genes in Brassica chromosomes may cause the expansion of BraDof genes. In the Dof family, 37 and 7 orthologous genes were identified between Chinese cabbage and Arabidopsis and between Chinese cabbage and Oryza sativa, respectively. The interaction networks of Dof factors in Chinese cabbage were also constructed. Expression profiles of nine selected genes from different nine classes subjected to four abiotic stresses (cold, heat, salt and drought) were further investigated by quantitative real-time PCR to obtain a better understanding of the functions and regulation mechanisms of BraDof family transcription factors in two Chinese cabbage varieties, ‘Lubaisanhao’ and ‘Qingdao 87-114’.
Dof-family transcription factors were analyzed in genome of Chinese cabbage. Chromosomal locations showed that duplication might result in expansion. Response to abiotic stresses was elucidated in Chinese cabbage varieties. The results provide novel insights into the stress responses of BraDof genes and promote a better understanding of the construction and function of Dofs in Chinese cabbage.
Electronic supplementary material
The online version of this article (doi:10.1186/s12864-015-1242-9) contains supplementary material, which is available to authorized users.
Dof; Transcription factor; Chromosomal location; Gene duplication; qRT-PCR; Chinese cabbage
Nectaries are the floral organs responsible for the synthesis and secretion of nectar. Despite their central roles in pollination biology, very little is understood about the molecular mechanisms underlying nectar production. This project was undertaken to identify genes potentially involved in mediating nectary form and function in Brassica rapa.
Methodology and Principal Findings
Four cDNA libraries were created using RNA isolated from the median and lateral nectaries of B. rapa flowers, with one normalized and one non-normalized library being generated from each tissue. Approximately 3,000 clones from each library were randomly sequenced from the 5′ end to generate a total of 11,101 high quality expressed sequence tags (ESTs). Sequence assembly of all ESTs together allowed the identification of 1,453 contigs and 4,403 singleton sequences, with the Basic Localized Alignment Search Tool (BLAST) being used to identify 4,138 presumptive orthologs to Arabidopsis thaliana genes. Several genes differentially expressed between median and lateral nectaries were initially identified based upon the number of BLAST hits represented by independent ESTs, and later confirmed via reverse transcription polymerase chain reaction (RT PCR). RT PCR was also used to verify the expression patterns of eight putative orthologs to known Arabidopsis nectary-enriched genes.
This work provided a snapshot of gene expression in actively secreting B. rapa nectaries, and also allowed the identification of differential gene expression between median and lateral nectaries. Moreover, 207 orthologs to known nectary-enriched genes from Arabidopsis were identified through this analysis. The results suggest that genes involved in nectar production are conserved amongst the Brassicaceae, and also supply clones and sequence information that can be used to probe nectary function in B. rapa.
A complete genome sequence provides unlimited information in the sequenced organism
as well as in related taxa. According to the guidance of the Multinational Brassica
Genome Project (MBGP), the Korea Brassica Genome Project (KBGP) is sequencing
chromosome 1 (cytogenetically oriented chromosome #1) of Brassica rapa. We
have selected 48 seed BACs on chromosome 1 using EST genetic markers and FISH
analyses. Among them, 30 BAC clones have been sequenced and 18 are on the way.
Comparative genome analyses of the EST sequences and sequenced BAC clones from
Brassica chromosome 1 revealed their homeologous partner regions on the Arabidopsis
genome and a syntenic comparative map between Brassica chromosome 1 and
Arabidopsis chromosomes. In silico chromosome walking and clone validation have
been successfully applied to extending sequence contigs based on the comparative
map and BAC end sequences. In addition, we have defined the (peri)centromeric
heterochromatin blocks with centromeric tandem repeats, rDNA and centromeric
retrotransposons. In-depth sequence analyses of five homeologous BAC clones and
an Arabidopsis chromosomal region reveal overall co-linearity, with 82% sequence
similarity. The data indicate that the Brassica genome has undergone triplication and
subsequent gene losses after the divergence of Arabidopsis and Brassica. Based on in-depth
comparative genome analyses, we propose a comparative genomics approach
for conquering the Brassica genome. In 2005 we intend to construct an integrated
physical map, including sequence information from 500 BAC clones and integration
of fingerprinting data and end sequence data of more than 100 000 BAC clones.
The sequences have been submitted to GenBank with accession numbers: 10 204
BAC ends of the KBrH library (CW978640–CW988843); KBrH138P04, AC155338;
KBrH117N09, AC155337; KBrH097M21, AC155348; KBrH093K03, AC155347;
KBrH081N08, AC155346; KBrH080L24, AC155345; KBrH077A05, AC155343;
KBrH020D15, AC155340; KBrH015H17, AC155339; KBrH001H24, AC155335;
KBrH080A08, AC155344; KBrH004D11, AC155341; KBrH117M18, AC146875;
Plasmodiophora brassicae, the causal agent of clubroot disease of the Brassica crops, is widespread in the world. Quantitative trait loci (QTLs) for partial resistance to 4 different isolates of P. brassicae (Pb2, Pb4, Pb7, and Pb10) were investigated using a BC1F1 population from a cross between two subspecies of Brassica rapa, i.e. Chinese cabbage inbred line C59-1 as a susceptible recurrent parent and turnip inbred line ECD04 as a resistant donor parent. The BC1F2 families were assessed for resistance under controlled conditions. A linkage map constructed with simple sequence repeats (SSR), unigene-derived microsatellite (UGMS) markers, and specific markers linked to published clubroot resistance (CR) genes of B. rapa was used to perform QTL mapping. A total of 6 QTLs residing in 5 CR QTL regions of the B. rapa chromosomes A01, A03, and A08 were identified to account for 12.2 to 35.2% of the phenotypic variance. Two QTL regions were found to be novel except for 3 QTLs in the respective regions of previously identified Crr1, Crr2, and Crr3. QTL mapping results indicated that 1 QTL region was common for partial resistance to the 2 isolates of Pb2 and Pb7, whereas the others were specific for each isolate. Additionally, synteny analysis between B. rapa and Arabidopsis thaliana revealed that all CR QTL regions were aligned to a single conserved crucifer blocks (U, F, and R) on 3 Arabidopsis chromosomes where 2 CR QTLs were detected in A. thaliana. These results suggest that some common ancestral genomic regions were involved in the evolution of CR genes in B. rapa.
Brassica oleracea is a morphologically diverse species in the family Brassicaceae and contains a group of nutrition-rich vegetable crops, including common heading cabbage, cauliflower, broccoli, kohlrabi, kale, Brussels sprouts. This diversity along with its phylogenetic membership in a group of three diploid and three tetraploid species, and the recent availability of genome sequences within Brassica provide an unprecedented opportunity to study intra- and inter-species divergence and evolution in this species and its close relatives.
We have developed a comprehensive database, Bolbase, which provides access to the B. oleracea genome data and comparative genomics information. The whole genome of B. oleracea is available, including nine fully assembled chromosomes and 1,848 scaffolds, with 45,758 predicted genes, 13,382 transposable elements, and 3,581 non-coding RNAs. Comparative genomics information is available, including syntenic regions among B. oleracea, Brassica rapa and Arabidopsis thaliana, synonymous (Ks) and non-synonymous (Ka) substitution rates between orthologous gene pairs, gene families or clusters, and differences in quantity, category, and distribution of transposable elements on chromosomes. Bolbase provides useful search and data mining tools, including a keyword search, a local BLAST server, and a customized GBrowse tool, which can be used to extract annotations of genome components, identify similar sequences and visualize syntenic regions among species. Users can download all genomic data and explore comparative genomics in a highly visual setting.
Bolbase is the first resource platform for the B. oleracea genome and for genomic comparisons with its relatives, and thus it will help the research community to better study the function and evolution of Brassica genomes as well as enhance molecular breeding research. This database will be updated regularly with new features, improvements to genome annotation, and new genomic sequences as they become available. Bolbase is freely available at http://ocri-genomics.org/bolbase.
Brassica oleracea; Database; Genome sequence; Synteny; Comparative genomics
The MYB superfamily is one of the most abundant transcription factor (TF) families in plants. MYB proteins include highly conserved N-terminal MYB repeats (1R, R2R3, 3R, and atypical) and various C-terminal sequences that confer extensive functions. However, the functions of most MYB genes are unknown, and have been little studied in Chinese cabbage.
Here, we analyzed 256 (55.2% of total MYBs) R2R3-MYB genes from Chinese cabbage (Brassica rapa ssp. pekinensis) and anchored them onto the 10 chromosomes and three subgenomes. The R2R3-, 3R- and atypical MYB proteins in Chinese cabbage formed 45 subgroups based on domain similarity and phylogenetic topology. Organization and syntenic analysis revealed the genomic distribution and collinear relationships of the R2R3-BrMYBs. Synonymous nucleotide substitution (Ka/Ks) analysis showed that the Chinese cabbage MYB DNA-binding domain is under strong purifying selection. Moreover, RNA-seq data revealed tissue-specific and distinct R2R3-BrMYB expression profiles, and quantitative real-time PCR (qPCR) analysis in leaves showed stress responsive expression and crosstalk with ABA-auxin signaling cascades.
In this study, we identified the largest MYB gene family in plants to date. Our results indicate that members of this superfamily may be involved in plant development, stress responses and leaf senescence, highlighting their functional diversity.
Electronic supplementary material
The online version of this article (doi:10.1186/s12864-015-1216-y) contains supplementary material, which is available to authorized users.
Genome-wide analysis; R2R3-MYB transcription factor; Stress responses; Hormone signals; Chinese cabbage
Brassica rapa is an economically important crop species. During its long breeding history, a large number of morphotypes have been generated, including leafy vegetables such as Chinese cabbage and pakchoi, turnip tuber crops and oil crops.
To investigate the genetic variation underlying this morphological variation, we re-sequenced, assembled and annotated the genomes of two B. rapa subspecies, turnip crops (turnip) and a rapid cycling. We then analysed the two resulting genomes together with the Chinese cabbage Chiifu reference genome to obtain an impression of the B. rapa pan-genome. The number of genes with protein-coding changes between the three genotypes was lower than that among different accessions of Arabidopsis thaliana, which can be explained by the smaller effective population size of B. rapa due to its domestication. Based on orthology to a number of non-brassica species, we estimated the date of divergence among the three B. rapa morphotypes at approximately 250,000 YA, far predating Brassica domestication (5,000-10,000 YA).
By analysing genes unique to turnip we found evidence for copy number differences in peroxidases, pointing to a role for the phenylpropanoid biosynthesis pathway in the generation of morphological variation. The estimated date of divergence among three B. rapa morphotypes implies that prior to domestication there was already considerably divergence among B. rapa genotypes. Our study thus provides two new B. rapa reference genomes, delivers a set of computer tools to analyse the resulting pan-genome and uses these to shed light on genetic drivers behind the rich morphological variation found in B. rapa.
Brassica species include both vegetable and oilseed crops, which are very important to the daily life of common human beings. Meanwhile, the Brassica species represent an excellent system for studying numerous aspects of plant biology, specifically for the analysis of genome evolution following polyploidy, so it is also very important for scientific research. Now, the genome of Brassica rapa has already been assembled, it is the time to do deep mining of the genome data.
BRAD, the Brassica database, is a web-based resource focusing on genome scale genetic and genomic data for important Brassica crops. BRAD was built based on the first whole genome sequence and on further data analysis of the Brassica A genome species, Brassica rapa (Chiifu-401-42). It provides datasets, such as the complete genome sequence of B. rapa, which was de novo assembled from Illumina GA II short reads and from BAC clone sequences, predicted genes and associated annotations, non coding RNAs, transposable elements (TE), B. rapa genes' orthologous to those in A. thaliana, as well as genetic markers and linkage maps. BRAD offers useful searching and data mining tools, including search across annotation datasets, search for syntenic or non-syntenic orthologs, and to search the flanking regions of a certain target, as well as the tools of BLAST and Gbrowse. BRAD allows users to enter almost any kind of information, such as a B. rapa or A. thaliana gene ID, physical position or genetic marker.
BRAD, a new database which focuses on the genetics and genomics of the Brassica plants has been developed, it aims at helping scientists and breeders to fully and efficiently use the information of genome data of Brassica plants. BRAD will be continuously updated and can be accessed through http://brassicadb.org.
The Brassicaceae family is an exemplary model for studying plant polyploidy. The Brassicaceae knowledge-base includes the well-annotated Arabidopsis thaliana reference sequence; well-established evidence for three rounds of whole genome duplication (WGD); and the conservation of genomic structure, with 24 conserved genomic blocks (GBs). The recently released Brassica rapa draft genome provides an ideal opportunity to update our knowledge of the conserved genomic structures in Brassica, and to study evolutionary innovations of the mesohexaploid plant, B. rapa.
Three chronological B. rapa genomes (recent, young, and old) were reconstructed with sequence divergences, revealing a trace of recursive WGD events. A total of 636 fast evolving genes were unevenly distributed throughout the recent and young genomes. The representative Gene Ontology (GO) terms for these genes were ‘stress response’ and ‘development’ both through a change in protein modification or signaling, rather than by enhancing signal recognition. In retention patterns analysis, 98% of B. rapa genes were retained as collinear gene pairs; 77% of those were singly-retained in recent or young genomes resulting from death of the ancestral copies, while others were multi-retained as long retention genes. GO enrichments indicated that single retention genes mainly function in the interpretation of genetic information, whereas, multi-retention genes were biased toward signal response, especially regarding development and defense. In the recent genome, 13,302, 5,790, and 20 gene pairs were multi-retained following Brassica whole genome triplication (WGT) events with 2, 3, and 4 homoeologous copies, respectively. Enriched GO-slim terms from B. rapa homomoelogues imply that a major effect of the B. rapa WGT may have been to acquire environmental adaptability or to change the course of development. These homoeologues seem to more frequently undergo subfunctionalization with spatial expression patterns compared with other possible events including nonfunctionalization and neofunctionalization.
We refined Brassicaceae GB information using the latest genomic resources, and distinguished three chronologically ordered B. rapa genomes. B. rapa genes were categorized into fast evolving, single- and multi-retention genes, and long retention genes by their substitution rates and retention patterns. Representative functions of the categorized genes were elucidated, providing better understanding of B. rapa evolution and the Brassica genus.
Electronic supplementary material
The online version of this article (doi:10.1186/1471-2164-15-606) contains supplementary material, which is available to authorized users.
Brassica rapa; Chronological genomes; Fast-evolving genes; Single-retention genes; Multi-retention genes
To identify genes associated with genic male sterility (GMS) that could be useful for hybrid breeding in Chinese cabbage (Brassicarapa ssp. pekinensis), floral bud transcriptome analysis was carried out using a B. rapa microarray with 300,000 probes (Br300K). Among 47,548 clones deposited on a Br300K microarray with seven probes of 60 nt length within the 3' 150 bp region, a total of 10,622 genes were differentially expressed between fertile and sterile floral buds; 4,774 and 5,848 genes were up-regulated over 2-fold in fertile and sterile buds, respectively. However, the expression of 1,413 and 199 genes showed fertile and sterile bud-specific features, respectively. Genes expressed specifically in fertile buds, possibly GMS-related genes, included homologs of several Arabidopsis male sterility-related genes, genes associated with the cell wall and synthesis of its surface proteins, pollen wall and coat components, signaling components, and nutrient supplies. However, most early genes for pollen development, genes for primexine and callose formation, and genes for pollen maturation and anther dehiscence showed no difference in expression between fertile and sterile buds. Some of the known genes associated with Arabidopsis pollen development showed similar expression patterns to those seen in this study, while others did not. BrbHLH89 and BrMYP99 are putative GMS genes. Additionally, 17 novel genes identified only in B. rapa were specifically and highly expressed only in fertile buds, implying the possible involvement in male fertility. All data suggest that Chinese cabbage GMS might be controlled by genes acting in post-meiotic tapetal development that are different from those known to be associated with Arabidopsis male sterility.
The genus Brassica includes the most extensively cultivated vegetable crops worldwide. Investigation of the Brassica genome presents excellent challenges to study plant genome evolution and divergence of gene function associated with polyploidy and genome hybridization. A physical map of the B. rapa genome is a fundamental tool for analysis of Brassica "A" genome structure. Integration of a physical map with an existing genetic map by linking genetic markers and BAC clones in the sequencing pipeline provides a crucial resource for the ongoing genome sequencing effort and assembly of whole genome sequences.
A genome-wide physical map of the B. rapa genome was constructed by the capillary electrophoresis-based fingerprinting of 67,468 Bacterial Artificial Chromosome (BAC) clones using the five restriction enzyme SNaPshot technique. The clones were assembled into contigs by means of FPC v8.5.3. After contig validation and manual editing, the resulting contig assembly consists of 1,428 contigs and is estimated to span 717 Mb in physical length. This map provides 242 anchored contigs on 10 linkage groups to be served as seed points from which to continue bidirectional chromosome extension for genome sequencing.
The map reported here is the first physical map for Brassica "A" genome based on the High Information Content Fingerprinting (HICF) technique. This physical map will serve as a fundamental genomic resource for accelerating genome sequencing, assembly of BAC sequences, and comparative genomics between Brassica genomes. The current build of the B. rapa physical map is available at the B. rapa Genome Project website for the user community.
Plant disease resistance (R) genes with the nucleotide binding site (NBS) play an important role in offering resistance to pathogens. The availability of complete genome sequences of Brassica oleracea and Brassica rapa provides an important opportunity for researchers to identify and characterize NBS-encoding R genes in Brassica species and to compare with analogues in Arabidopsis thaliana based on a comparative genomics approach. However, little is known about the evolutionary fate of NBS-encoding genes in the Brassica lineage after split from A. thaliana.
Here we present genome-wide analysis of NBS-encoding genes in B. oleracea, B. rapa and A. thaliana. Through the employment of HMM search and manual curation, we identified 157, 206 and 167 NBS-encoding genes in B. oleracea, B. rapa and A. thaliana genomes, respectively. Phylogenetic analysis among 3 species classified NBS-encoding genes into 6 subgroups. Tandem duplication and whole genome triplication (WGT) analyses revealed that after WGT of the Brassica ancestor, NBS-encoding homologous gene pairs on triplicated regions in Brassica ancestor were deleted or lost quickly, but NBS-encoding genes in Brassica species experienced species-specific gene amplification by tandem duplication after divergence of B. rapa and B. oleracea. Expression profiling of NBS-encoding orthologous gene pairs indicated the differential expression pattern of retained orthologous gene copies in B. oleracea and B. rapa. Furthermore, evolutionary analysis of CNL type NBS-encoding orthologous gene pairs among 3 species suggested that orthologous genes in B. rapa species have undergone stronger negative selection than those in B .oleracea species. But for TNL type, there are no significant differences in the orthologous gene pairs between the two species.
This study is first identification and characterization of NBS-encoding genes in B. rapa and B. oleracea based on whole genome sequences. Through tandem duplication and whole genome triplication analysis in B. oleracea, B. rapa and A. thaliana genomes, our study provides insight into the evolutionary history of NBS-encoding genes after divergence of A. thaliana and the Brassica lineage. These results together with expression pattern analysis of NBS-encoding orthologous genes provide useful resource for functional characterization of these genes and genetic improvement of relevant crops.
Brassica species; Disease resistance gene; Nucleotide binding site; Tandem duplication; Whole genome duplication
The Multinational Brassica rapa Genome Sequencing Project (BrGSP) has developed valuable genomic resources, including BAC libraries, BAC-end sequences, genetic and physical maps, and seed BAC sequences for Brassica rapa. An integrated linkage map between the amphidiploid B. napus and diploid B. rapa will facilitate the rapid transfer of these valuable resources from B. rapa to B. napus (Oilseed rape, Canola).
In this study, we identified over 23,000 simple sequence repeats (SSRs) from 536 sequenced BACs. 890 SSR markers (designated as BrGMS) were developed and used for the construction of an integrated linkage map for the A genome in B. rapa and B. napus. Two hundred and nineteen BrGMS markers were integrated to an existing B. napus linkage map (BnaNZDH). Among these mapped BrGMS markers, 168 were only distributed on the A genome linkage groups (LGs), 18 distrubuted both on the A and C genome LGs, and 33 only distributed on the C genome LGs. Most of the A genome LGs in B. napus were collinear with the homoeologous LGs in B. rapa, although minor inversions or rearrangements occurred on A2 and A9. The mapping of these BAC-specific SSR markers enabled assignment of 161 sequenced B. rapa BACs, as well as the associated BAC contigs to the A genome LGs of B. napus.
The genetic mapping of SSR markers derived from sequenced BACs in B. rapa enabled direct links to be established between the B. napus linkage map and a B. rapa physical map, and thus the assignment of B. rapa BACs and the associated BAC contigs to the B. napus linkage map. This integrated genetic linkage map will facilitate exploitation of the B. rapa annotated genomic resources for gene tagging and map-based cloning in B. napus, and for comparative analysis of the A genome within Brassica species.
MITE, TRIM and SINEs are miniature form transposable elements (mTEs) that are ubiquitous and dispersed throughout entire plant genomes. Tens of thousands of members cause insertion polymorphism at both the inter- and intra- species level. Therefore, mTEs are valuable targets and resources for development of markers that can be utilized for breeding, genetic diversity and genome evolution studies. Taking advantage of the completely sequenced genomes of Brassica rapa and B. oleracea, characterization of mTEs and building a curated database are prerequisite to extending their utilization for genomics and applied fields in Brassica crops.
We have developed BrassicaTED as a unique web portal containing detailed characterization information for mTEs of Brassica species. At present, BrassicaTED has datasets for 41 mTE families, including 5894 and 6026 members from 20 MITE families, 1393 and 1639 members from 5 TRIM families, 1270 and 2364 members from 16 SINE families in B. rapa and B. oleracea, respectively. BrassicaTED offers different sections to browse structural and positional characteristics for every mTE family. In addition, we have added data on 289 MITE insertion polymorphisms from a survey of seven Brassica relatives. Genes with internal mTE insertions are shown with detailed gene annotation and microarray-based comparative gene expression data in comparison with their paralogs in the triplicated B. rapa genome. This database also includes a novel tool, K BLAST (Karyotype BLAST), for clear visualization of the locations for each member in the B. rapa and B. oleracea pseudo-genome sequences.
BrassicaTED is a newly developed database of information regarding the characteristics and potential utility of mTEs including MITE, TRIM and SINEs in B. rapa and B. oleracea. The database will promote the development of desirable mTE-based markers, which can be utilized for genomics and breeding in Brassica species. BrassicaTED will be a valuable repository for scientists and breeders, promoting efficient research on Brassica species. BrassicaTED can be accessed at http://im-crop.snu.ac.kr/BrassicaTED/index.php.
Brassica; Miniature inverted-repeat transposable element (MITE); Terminal-repeat retrotransposon in miniature (TRIM); Miniature form transposable elements (mTEs); Short interspersed elements (SINEs); TE Database