Peanut (Arachis hypogaea L.) causes one of the most serious food allergies. Peanut seed proteins, Arah1, Arah2, and Arah3, are considered to be among the most important peanut allergens. To gain insights into genome organization and evolution of allergen-encoding genes, approximately 617 kb from the genome of cultivated peanut and 215 kb from a wild relative were sequenced including three Arah1, one Arah2, eight Arah3, and two Arah6 gene family members. To assign polarity to differences between homoeologous regions in peanut, we used as outgroups the single orthologous regions in Medicago, Lotus, common bean, chickpea, and pigeonpea, which diverged from peanut about 50 Ma and have not undergone subsequent polyploidy. These regions were also compared with orthologs in many additional dicot plant species to help clarify the timing of evolutionary events. The lack of conservation of allergenic epitopes between species, and the fact that many different proteins can be allergenic, makes the identification of allergens across species by comparative studies difficult. The peanut allergen genes are interspersed with low-copy genes and transposable elements. Phylogenetic analyses revealed lineage-specific expansion and loss of low-copy genes between species and homoeologs. Arah1 syntenic regions are conserved in soybean, pigeonpea, tomato, grape, Lotus, and Arabidopsis, whereas Arah3 syntenic regions show genome rearrangements. We infer that tandem and segmental duplications led to the establishment of the Arah3 gene family. Our analysis indicates differences in conserved motifs in allergen proteins and in the promoter regions of the allergen-encoding genes. Phylogenetic analysis and genomic organization studies provide new insights into the evolution of the major peanut allergen-encoding genes.
Arachis hypogaea L.; allergens; gene synteny; genome organization; homologs; evolution
Background and Aims
Peanut (Arachis hypogaea) is an allotetraploid (AABB-type genome) of recent origin, with a genome of about 2·8 Gb and a high repetitive content. This study reports an analysis of the repetitive component of the peanut A genome using bacterial artificial chromosome (BAC) clones from A. duranensis, the most probable A genome donor, and the probable consequences of the activity of these elements since the divergence of the peanut A and B genomes.
The repetitive content of the A genome was analysed by using A. duranensis BAC clones as probes for fluorescence in situ hybridization (BAC-FISH), and by sequencing and characterization of 12 genomic regions. For the analysis of the evolutionary dynamics, two A genome regions are compared with their B genome homeologues.
BAC-FISH using 27 A. duranensis BAC clones as probes gave dispersed and repetitive DNA characteristic signals, predominantly in interstitial regions of the peanut A chromosomes. The sequences of 14 BAC clones showed complete and truncated copies of ten abundant long terminal repeat (LTR) retrotransposons, characterized here. Almost all dateable transposition events occurred <3·5 million years ago, the estimated date of the divergence of A and B genomes. The most abundant retrotransposon is Feral, apparently parasitic on the retrotransposon FIDEL, followed by Pipa, also non-autonomous and probably parasitic on a retrotransposon we named Pipoka. The comparison of the A and B genome homeologous regions showed conserved segments of high sequence identity, punctuated by predominantly indel regions without significant similarity.
A substantial proportion of the highly repetitive component of the peanut A genome appears to be accounted for by relatively few LTR retrotransposons and their truncated copies or solo LTRs. The most abundant of the retrotransposons are non-autonomous. The activity of these retrotransposons has been a very significant driver of genome evolution since the evolutionary divergence of the A and B genomes.
Arachis hypogaea; A. duranensis; peanut; groundnut; BAC-FISH; BAC sequencing; retrotransposons; genome evolution; phylogeny; homeology
Background and Aims
The genus Arachis contains 80 described species. Section Arachis is of particular interest because it includes cultivated peanut, an allotetraploid, and closely related wild species, most of which are diploids. This study aimed to analyse the genetic relationships of multiple accessions of section Arachis species using two complementary methods. Microsatellites allowed the analysis of inter- and intraspecific variability. Intron sequences from single-copy genes allowed phylogenetic analysis including the separation of the allotetraploid genome components.
Intron sequences and microsatellite markers were used to reconstruct phylogenetic relationships in section Arachis through maximum parsimony and genetic distance analyses.
Although high intraspecific variability was evident, there was good support for most species. However, some problems were revealed, notably a probable polyphyletic origin for A. kuhlmannii. The validity of the genome groups was well supported. The F, K and D genomes grouped close to the A genome group. The 2n = 18 species grouped closer to the B genome group. The phylogenetic tree based on the intron data strongly indicated that A. duranensis and A. ipaënsis are the ancestors of A. hypogaea and A. monticola. Intron nucleotide substitutions allowed the ages of divergences of the main genome groups to be estimated at a relatively recent 2·3–2·9 million years ago. This age and the number of species described indicate a much higher speciation rate for section Arachis than for legumes in general.
The analyses revealed relationships between the species and genome groups and showed a generally high level of intraspecific genetic diversity. The improved knowledge of species relationships should facilitate the utilization of wild species for peanut improvement. The estimates of speciation rates in section Arachis are high, but not unprecedented. We suggest these high rates may be linked to the peculiar reproductive biology of Arachis.
Arachis; peanut; groundnut; intron sequences; single-copy genes; molecular phylogeny; microsatellites; genetic relationships; speciation rates; genome donors; molecular dating
Single nucleotide polymorphic markers (SNPs) are attractive for use in genetic mapping and marker-assisted breeding because they can be scored in parallel assays at favorable costs. However, scoring SNP markers in polyploid plants like the peanut is problematic because of interfering signal generated from the DNA bases that are homeologous to those being assayed. The present study used a previously constructed 1536 GoldenGate SNP assay developed using SNPs identified between two A. duranensis accessions. In this study, the performance of this assay was tested on two RIL mapping populations, one diploid (A. duranensis × A. stenosperma) and one tetraploid [A. hypogaea cv. Runner IAC 886 × synthetic tetraploid (A. ipaënsis × A. duranensis)4×]. The scoring was performed using the software GenomeStudio version 2011.1. For the diploid, polymorphic markers provided excellent genotyping scores with default software parameters. In the tetraploid, as expected, most of the polymorphic markers provided signal intensity plots that were distorted compared to diploid patterns and that were incorrectly scored using default parameters. However, these scorings were easily corrected using the GenomeStudio software. The degree of distortion was highly variable. Of the polymorphic markers, approximately 10% showed no distortion at all behaving as expected for single-dose markers, and another 30% showed low distortion and could be considered high-quality. The genotyped markers were incorporated into diploid and tetraploid genetic maps of Arachis and, in the latter case, were located almost entirely on A genome linkage groups.
Arachis; breeding; genotyping; markers; wild
The complex, tetraploid genome structure of peanut (Arachis hypogaea) has obstructed advances in genetics and genomics in the species. The aim of this study is to understand the genome structure of Arachis by developing a high-density integrated consensus map. Three recombinant inbred line populations derived from crosses between the A genome diploid species, Arachis duranensis and Arachis stenosperma; the B genome diploid species, Arachis ipaënsis and Arachis magna; and between the AB genome tetraploids, A. hypogaea and an artificial amphidiploid (A. ipaënsis × A. duranensis)4×, were used to construct genetic linkage maps: 10 linkage groups (LGs) of 544 cM with 597 loci for the A genome; 10 LGs of 461 cM with 798 loci for the B genome; and 20 LGs of 1442 cM with 1469 loci for the AB genome. The resultant maps plus 13 published maps were integrated into a consensus map covering 2651 cM with 3693 marker loci which was anchored to 20 consensus LGs corresponding to the A and B genomes. The comparative genomics with genome sequences of Cajanus cajan, Glycine max, Lotus japonicus, and Medicago truncatula revealed that the Arachis genome has segmented synteny relationship to the other legumes. The comparative maps in legumes, integrated tetraploid consensus maps, and genome-specific diploid maps will increase the genetic and genomic understanding of Arachis and should facilitate molecular breeding.
Arachis spp.; comparative genomics; genetic linkage map; integrated consensus map; legume genome
Chromosome segment substitution lines (CSSLs) are powerful QTL mapping populations that have been used to elucidate the molecular basis of interesting traits of wild species. Cultivated peanut is an allotetraploid with limited genetic diversity. Capturing the genetic diversity from peanut wild relatives is an important objective in many peanut breeding programs. In this study, we used a marker-assisted backcrossing strategy to produce a population of 122 CSSLs from the cross between the wild synthetic allotetraploid (A. ipaënsis×A. duranensis)4x and the cultivated Fleur11 variety. The 122 CSSLs offered a broad coverage of the peanut genome, with target wild chromosome segments averaging 39.2 cM in length. As a demonstration of the utility of these lines, four traits were evaluated in a subset of 80 CSSLs. A total of 28 lines showed significant differences from Fleur11. The line×trait significant associations were assigned to 42 QTLs: 14 for plant growth habit, 15 for height of the main stem, 12 for plant spread and one for flower color. Among the 42 QTLs, 37 were assigned to genomic regions and three QTL positions were considered putative. One important finding arising from this QTL analysis is that peanut growth habit is a complex trait that is governed by several QTLs with different effects. The CSSL population developed in this study has proved efficient for deciphering the molecular basis of trait variations and will be useful to the peanut scientific community for future QTL mapping studies.
Cultivated peanut (Arachis hypogaea) is one of the most widely grown grain legumes in the world, being valued for its high protein and unsaturated oil contents. Worldwide, the major constraints to peanut production are drought and fungal diseases. Wild Arachis species, which are exclusively South American in origin, have high genetic diversity and have been selected during evolution in a range of environments and biotic stresses, constituting a rich source of allele diversity. Arachis stenosperma harbors resistances to a number of pests, including fungal diseases, whilst A. duranensis has shown improved tolerance to water limited stress. In this study, these species were used for the creation of an extensive databank of wild Arachis transcripts under stress which will constitute a rich source for gene discovery and molecular markers development.
Transcriptome analysis of cDNA collections from A. stenosperma challenged with Cercosporidium personatum (Berk. and M.A. Curtis) Deighton, and A. duranensis submitted to gradual water limited stress was conducted using 454 GS FLX Titanium generating a total of 7.4 x 105 raw sequence reads covering 211 Mbp of both genomes. High quality reads were assembled to 7,723 contigs for A. stenosperma and 12,792 for A. duranensis and functional annotation indicated that 95% of the contigs in both species could be appointed to GO annotation categories. A number of transcription factors families and defense related genes were identified in both species. Additionally, the expression of five A. stenosperma Resistance Gene Analogs (RGAs) and four retrotransposon (FIDEL-related) sequences were analyzed by qRT-PCR. This data set was used to design a total of 2,325 EST-SSRs, of which a subset of 584 amplified in both species and 214 were shown to be polymorphic using ePCR.
This study comprises one of the largest unigene dataset for wild Arachis species and will help to elucidate genes involved in responses to biological processes such as fungal diseases and water limited stress. Moreover, it will also facilitate basic and applied research on the genetics of peanut through the development of new molecular markers and the study of adaptive variation across the genus.
Only a few genetic maps based on recombinant inbred line (RIL) and backcross (BC) populations have been developed for tetraploid groundnut. The marker density, however, is not very satisfactory especially in the context of large genome size (2800 Mb/1C) and 20 linkage groups (LGs). Therefore, using marker segregation data for 10 RILs and one BC population from the international groundnut community, with the help of common markers across different populations, a reference consensus genetic map has been developed. This map is comprised of 897 marker loci including 895 simple sequence repeat (SSR) and 2 cleaved amplified polymorphic sequence (CAPS) loci distributed on 20 LGs (a01–a10 and b01–b10) spanning a map distance of 3, 863.6 cM with an average map density of 4.4 cM. The highest numbers of markers (70) were integrated on a01 and the least number of markers (21) on b09. The marker density, however, was lowest (6.4 cM) on a08 and highest (2.5 cM) on a01. The reference consensus map has been divided into 20 cM long 203 BINs. These BINs carry 1 (a10_02, a10_08 and a10_09) to 20 (a10_04) loci with an average of 4 marker loci per BIN. Although the polymorphism information content (PIC) value was available for 526 markers in 190 BINs, 36 and 111 BINs have at least one marker with >0.70 and >0.50 PIC values, respectively. This information will be useful for selecting highly informative and uniformly distributed markers for developing new genetic maps, background selection and diversity analysis. Most importantly, this reference consensus map will serve as a reliable reference for aligning new genetic and physical maps, performing QTL analysis in a multi-populations design, evaluating the genetic background effect on QTL expression, and serving other genetic and molecular breeding activities in groundnut.
Polyploidy can result in genetic bottlenecks, especially for species of monophyletic origin. Cultivated peanut is an allotetraploid harbouring limited genetic diversity, likely resulting from the combined effects of its single origin and domestication. Peanut wild relatives represent an important source of novel alleles that could be used to broaden the genetic basis of the cultigen. Using an advanced backcross population developed with a synthetic amphidiploid as donor of wild alleles, under two water regimes, we conducted a detailed QTL study for several traits involved in peanut productivity and adaptation as well as domestication.
A total of 95 QTLs were mapped in the two water treatments. About half of the QTL positive effects were associated with alleles of the wild parent and several QTLs involved in yield components were specific to the water-limited treatment. QTLs detected for the same trait mapped to non-homeologous genomic regions, suggesting differential control in subgenomes as a consequence of polyploidization. The noteworthy clustering of QTLs for traits involved in seed and pod size and in plant and pod morphology suggests, as in many crops, that a small number of loci have contributed to peanut domestication.
In our study, we have identified QTLs that differentiated cultivated peanut from its wild relatives as well as wild alleles that contributed positive variation to several traits involved in peanut productivity and adaptation. These findings offer novel opportunities for peanut improvement using wild relatives.
Peanut (Arachis hypogaea L.) is a crop of economic and social importance, mainly in tropical areas, and developing countries. Its molecular breeding has been hindered by a shortage of polymorphic genetic markers due to a very narrow genetic base. Microsatellites (SSRs) are markers of choice in peanut because they are co-dominant, highly transferrable between species and easily applicable in the allotetraploid genome. In spite of substantial effort over the last few years by a number of research groups, the number of SSRs that are polymorphic for A. hypogaea is still limiting for routine application, creating the demand for the discovery of more markers polymorphic within cultivated germplasm.
A plasmid genomic library enriched for TC/AG repeats was constructed and 1401 clones sequenced. From the sequences obtained 146 primer pairs flanking mostly TC microsatellites were developed. The average number of repeat motifs amplified was 23. These 146 markers were characterized on 22 genotypes of cultivated peanut. In total 78 of the markers were polymorphic within cultivated germplasm. Most of those 78 markers were highly informative with an average of 5.4 alleles per locus being amplified. Average gene diversity index (GD) was 0.6, and 66 markers showed a GD of more than 0.5. Genetic relationship analysis was performed and corroborated the current taxonomical classification of A. hypogaea subspecies and varieties.
The microsatellite markers described here are a useful resource for genetics and genomics in Arachis. In particular, the 66 markers that are highly polymorphic in cultivated peanut are a significant step towards routine genetic mapping and marker-assisted selection for the crop.
Cultivated peanut (Arachis hypogaea L.) is an important crop worldwide, valued for its edible oil and digestible protein. It has a very narrow genetic base that may well derive from a relatively recent single polyploidization event. Accordingly molecular markers have low levels of polymorphism and the number of polymorphic molecular markers available for cultivated peanut is still limiting.
Here, we report a large set of BAC-end sequences (BES), use them for developing SSR (BES-SSR) markers, and apply them in genetic linkage mapping. The majority of BESs had no detectable homology to known genes (49.5%) followed by sequences with similarity to known genes (44.3%), and miscellaneous sequences (6.2%) such as transposable element, retroelement, and organelle sequences. A total of 1,424 SSRs were identified from 36,435 BESs. Among these identified SSRs, dinucleotide (47.4%) and trinucleotide (37.1%) SSRs were predominant. The new set of 1,152 SSRs as well as about 4,000 published or unpublished SSRs were screened against two parents of a mapping population, generating 385 polymorphic loci. A genetic linkage map was constructed, consisting of 318 loci onto 21 linkage groups and covering a total of 1,674.4 cM, with an average distance of 5.3 cM between adjacent loci. Two markers related to resistance gene homologs (RGH) were mapped to two different groups, thus anchoring 1 RGH-BAC contig and 1 singleton.
The SSRs mined from BESs will be of use in further molecular analysis of the peanut genome, providing a novel set of markers, genetically anchoring BAC clones, and incorporating gene sequences into a linkage map. This will aid in the identification of markers linked to genes of interest and map-based cloning.
Wild peanut species (Arachis spp.) are a rich source of new alleles for peanut improvement. Plant transcriptome analysis under specific experimental conditions helps the understanding of cellular processes related, for instance, to development, stress response, and crop yield. The validation of these studies has been generally accomplished by quantitative reverse transcription-polymerase chain reaction (qRT-PCR) which requires normalization of mRNA levels among samples. This can be achieved by comparing the expression ratio between a gene of interest and a reference gene which is constitutively expressed. Nowadays there is a lack of appropriate reference genes for both wild and cultivated Arachis. The identification of such genes would allow a consistent analysis of qRT-PCR data and speed up candidate gene validation in peanut.
A set of ten reference genes were analyzed in four Arachis species (A. magna; A. duranensis; A. stenosperma and A. hypogaea) subjected to biotic (root-knot nematode and leaf spot fungus) and abiotic (drought) stresses, in two distinct plant organs (roots and leaves). By the use of three programs (GeNorm, NormFinder and BestKeeper) and taking into account the entire dataset, five of these ten genes, ACT1 (actin depolymerizing factor-like protein), UBI1 (polyubiquitin), GAPDH (glyceraldehyde-3-phosphate dehydrogenase), 60S (60S ribosomal protein L10) and UBI2 (ubiquitin/ribosomal protein S27a) emerged as top reference genes, with their stability varying in eight subsets. The former three genes were the most stable across all species, organs and treatments studied.
This first in-depth study of reference genes validation in wild Arachis species will allow the use of specific combinations of secure and stable reference genes in qRT-PCR assays. The use of these appropriate references characterized here should improve the accuracy and reliability of gene expression analysis in both wild and cultivated Arachis and contribute for the better understanding of gene expression in, for instance, stress tolerance/resistance mechanisms in plants.
Worldwide, diseases are important reducers of peanut (Arachis hypogaea) yield. Sources of resistance against many diseases are available in cultivated peanut genotypes, although often not in farmer preferred varieties. Wild species generally harbor greater levels of resistance and even apparent immunity, although the linkage of agronomically un-adapted wild alleles with wild disease resistance genes is inevitable. Marker-assisted selection has the potential to facilitate the combination of both cultivated and wild resistance loci with agronomically adapted alleles. However, in peanut there is an almost complete lack of knowledge of the regions of the Arachis genome that control disease resistance.
In this work we identified candidate genome regions that control disease resistance. For this we placed candidate disease resistance genes and QTLs against late leaf spot disease on the genetic map of the A-genome of Arachis, which is based on microsatellite markers and legume anchor markers. These marker types are transferable within the genus Arachis and to other legumes respectively, enabling this map to be aligned to other Arachis maps and to maps of other legume crops including those with sequenced genomes. In total, 34 sequence-confirmed candidate disease resistance genes and five QTLs were mapped.
Candidate genes and QTLs were distributed on all linkage groups except for the smallest, but the distribution was not even. Groupings of candidate genes and QTLs for late leaf spot resistance were apparent on the upper region of linkage group 4 and the lower region of linkage group 2, indicating that these regions are likely to control disease resistance.
Peanut (Arachis hypogaea L.) is widely used as a food and cash crop around the world. It is considered to be an allotetraploid (2n = 4x = 40) originated from a single hybridization event between two wild diploids. The most probable hypothesis gave A. duranensis as the wild donor of the A genome and A. ipaënsis as the wild donor of the B genome. A low level of molecular polymorphism is found in cultivated germplasm and up to date few genetic linkage maps have been published. The utilization of wild germplasm in breeding programs has received little attention due to the reproductive barriers between wild and cultivated species and to the technical difficulties encountered in making large number of crosses. We report here the development of a SSR based genetic map and the analysis of genome-wide segment introgressions into the background of a cultivated variety through the utilization of a synthetic amphidiploid between A. duranensis and A. ipaënsis.
Two hundred ninety eight (298) loci were mapped in 21 linkage groups (LGs), spanning a total map distance of 1843.7 cM with an average distance of 6.1 cM between adjacent markers. The level of polymorphism observed between the parent of the amphidiploid and the cultivated variety is consistent with A. duranensis and A. ipaënsis being the most probable donor of the A and B genomes respectively. The synteny analysis between the A and B genomes revealed an overall good collinearity of the homeologous LGs. The comparison with the diploid and tetraploid maps shed new light on the evolutionary forces that contributed to the divergence of the A and B genome species and raised the question of the classification of the B genome species. Structural modifications such as chromosomal segment inversions and a major translocation event prior to the tetraploidisation of the cultivated species were revealed. Marker assisted selection of BC1F1 and then BC2F1 lines carrying the desirable donor segment with the best possible return to the background of the cultivated variety provided a set of lines offering an optimal distribution of the wild introgressions.
The genetic map developed, allowed the synteny analysis of the A and B genomes, the comparison with diploid and tetraploid maps and the analysis of the introgression segments from the wild synthetic into the background of a cultivated variety. The material we have produced in this study should facilitate the development of advanced backcross and CSSL breeding populations for the improvement of cultivated peanut.
Arachis hypogaea (peanut) is an important crop worldwide, being mostly used for edible oil production, direct consumption and animal feed. Cultivated peanut is an allotetraploid species with two different genome components, A and B. Genetic linkage maps can greatly assist molecular breeding and genomic studies. However, the development of linkage maps for A. hypogaea is difficult because it has very low levels of polymorphism. This can be overcome by the utilization of wild species of Arachis, which present the A- and B-genomes in the diploid state, and show high levels of genetic variability.
In this work, we constructed a B-genome linkage map, which will complement the previously published map for the A-genome of Arachis, and produced an entire framework for the tetraploid genome. This map is based on an F2 population of 93 individuals obtained from the cross between the diploid A. ipaënsis (K30076) and the closely related A. magna (K30097), the former species being the most probable B genome donor to cultivated peanut. In spite of being classified as different species, the parents showed high crossability and relatively low polymorphism (22.3%), compared to other interspecific crosses. The map has 10 linkage groups, with 149 loci spanning a total map distance of 1,294 cM. The microsatellite markers utilized, developed for other Arachis species, showed high transferability (81.7%). Segregation distortion was 21.5%. This B-genome map was compared to the A-genome map using 51 common markers, revealing a high degree of synteny between both genomes.
The development of genetic maps for Arachis diploid wild species with A- and B-genomes effectively provides a genetic map for the tetraploid cultivated peanut in two separate diploid components and is a significant advance towards the construction of a transferable reference map for Arachis. Additionally, we were able to identify affinities of some Arachis linkage groups with Medicago truncatula, which will allow the transfer of information from the nearly-complete genome sequences of this model legume to the peanut crop.
Most agriculturally important legumes fall within two sub-clades of the Papilionoid legumes: the Phaseoloids and Galegoids, which diverged about 50 Mya. The Phaseoloids are mostly tropical and include crops such as common bean and soybean. The Galegoids are mostly temperate and include clover, fava bean and the model legumes Lotus and Medicago (both with substantially sequenced genomes). In contrast, peanut (Arachis hypogaea) falls in the Dalbergioid clade which is more basal in its divergence within the Papilionoids. The aim of this work was to integrate the genetic map of Arachis with Lotus and Medicago and improve our understanding of the Arachis genome and legume genomes in general. To do this we placed on the Arachis map, comparative anchor markers defined using a previously described bioinformatics pipeline. Also we investigated the possible role of transposons in the patterns of synteny that were observed.
The Arachis genetic map was substantially aligned with Lotus and Medicago with most synteny blocks presenting a single main affinity to each genome. This indicates that the last common whole genome duplication within the Papilionoid legumes predated the divergence of Arachis from the Galegoids and Phaseoloids sufficiently that the common ancestral genome was substantially diploidized. The Arachis and model legume genomes comparison made here, together with a previously published comparison of Lotus and Medicago allowed all possible Arachis-Lotus-Medicago species by species comparisons to be made and genome syntenies observed. Distinct conserved synteny blocks and non-conserved regions were present in all genome comparisons, implying that certain legume genomic regions are consistently more stable during evolution than others. We found that in Medicago and possibly also in Lotus, retrotransposons tend to be more frequent in the variable regions. Furthermore, while these variable regions generally have lower densities of single copy genes than the more conserved regions, some harbor high densities of the fast evolving disease resistance genes.
We suggest that gene space in Papilionoids may be divided into two broadly defined components: more conserved regions which tend to have low retrotransposon densities and are relatively stable during evolution; and variable regions that tend to have high retrotransposon densities, and whose frequent restructuring may fuel the evolution of some gene families.
Many commercial banana varieties lack sources of resistance to pests and diseases, as a consequence of sterility and narrow genetic background. Fertile wild relatives, by contrast, possess greater variability and represent potential sources of disease resistance genes (R-genes). The largest known family of plant R-genes encode proteins with nucleotide-binding site (NBS) and C-terminal leucine-rich repeat (LRR) domains. Conserved motifs in such genes in diverse plant species offer a means for isolation of candidate genes in banana which may be involved in plant defence.
A computational strategy was developed for unbiased conserved motif discovery in NBS and LRR domains in R-genes and homologues in monocotyledonous plant species. Degenerate PCR primers targeting conserved motifs were tested on the wild cultivar Musa acuminata subsp. burmannicoides, var. Calcutta 4, which is resistant to a number of fungal pathogens and nematodes. One hundred and seventy four resistance gene analogs (RGAs) were amplified and assembled into 52 contiguous sequences. Motifs present were typical of the non-TIR NBS-LRR RGA subfamily. A phylogenetic analysis of deduced amino-acid sequences for 33 RGAs with contiguous open reading frames (ORFs), together with RGAs from Arabidopsis thaliana and Oryza sativa, grouped most Musa RGAs within monocotyledon-specific clades. RFLP-RGA markers were developed, with 12 displaying distinct polymorphisms in parentals and F1 progeny of a diploid M. acuminata mapping population. Eighty eight BAC clones were identified in M. acuminata Calcutta 4, M. acuminata Grande Naine, and M. balbisiana Pisang Klutuk Wulung BAC libraries when hybridized to two RGA probes. Multiple copy RGAs were common within BAC clones, potentially representing variation reservoirs for evolution of new R-gene specificities.
This is the first large scale analysis of NBS-LRR RGAs in M. acuminata Calcutta 4. Contig sequences were deposited in GenBank and assigned numbers ER935972 – ER936023. RGA sequences and isolated BACs are a valuable resource for R-gene discovery, and in future applications will provide insight into the organization and evolution of NBS-LRR R-genes in the Musa A and B genome. The developed RFLP-RGA markers are applicable for genetic map development and marker assisted selection for defined traits such as pest and disease resistance.
Cultivated peanut, Arachis hypogaea is an allotetraploid of recent origin, with an AABB genome. In common with many other polyploids, it seems that a severe genetic bottle-neck was imposed at the species origin, via hybridisation of two wild species and spontaneous chromosome duplication. Therefore, the study of the genome of peanut is hampered both by the crop's low genetic diversity and its polyploidy. In contrast to cultivated peanut, most wild Arachis species are diploid with high genetic diversity. The study of diploid Arachis genomes is therefore attractive, both to simplify the construction of genetic and physical maps, and for the isolation and characterization of wild alleles. The most probable wild ancestors of cultivated peanut are A. duranensis and A. ipaënsis with genome types AA and BB respectively.
We constructed and characterized two large-insert libraries in Bacterial Artificial Chromosome (BAC) vector, one for each of the diploid ancestral species. The libraries (AA and BB) are respectively c. 7.4 and c. 5.3 genome equivalents with low organelle contamination and average insert sizes of 110 and 100 kb. Both libraries were used for the isolation of clones containing genetically mapped legume anchor markers (single copy genes), and resistance gene analogues.
These diploid BAC libraries are important tools for the isolation of wild alleles conferring resistances to biotic stresses, comparisons of orthologous regions of the AA and BB genomes with each other and with other legume species, and will facilitate the construction of a physical map.
Due to its origin, peanut has a very narrow genetic background. Wild relatives can be a source of genetic variability for cultivated peanut. In this study, the transcriptome of the wild species Arachis stenosperma accession V10309 was analyzed.
ESTs were produced from four cDNA libraries of RNAs extracted from leaves and roots of A. stenosperma. Randomly selected cDNA clones were sequenced to generate 8,785 ESTs, of which 6,264 (71.3%) had high quality, with 3,500 clusters: 963 contigs and 2537 singlets. Only 55.9% matched homologous sequences of known genes. ESTs were classified into 23 different categories according to putative protein functions. Numerous sequences related to disease resistance, drought tolerance and human health were identified. Two hundred and six microsatellites were found and markers have been developed for 188 of these. The microsatellite profile was analyzed and compared to other transcribed and genomic sequence data.
This is, to date, the first report on the analysis of transcriptome of a wild relative of peanut. The ESTs produced in this study are a valuable resource for gene discovery, the characterization of new wild alleles, and for marker development. The ESTs were released in the [GenBank:EH041934 to EH048197].