Cultivated peanut (Arachis hypogaea) is an allotetraploid species whose ancestral genomes are most likely derived from the A-genome species, A. duranensis, and the B-genome species, A. ipaensis. The very recent (several millennia) evolutionary origin of A. hypogaea has imposed a bottleneck for allelic and phenotypic diversity within the cultigen. However, wild diploid relatives are a rich source of alleles that could be used for crop improvement and their simpler genomes can be more easily analyzed while providing insight into the structure of the allotetraploid peanut genome. The objective of this research was to establish a high-density genetic map of the diploid species A. duranensis based on de novo generated EST databases. Arachis duranensis was chosen for mapping because it is the A-genome progenitor of cultivated peanut and also in order to circumvent the confounding effects of gene duplication associated with allopolyploidy in A. hypogaea.
More than one million expressed sequence tag (EST) sequences generated from normalized cDNA libraries of A. duranensis were assembled into 81,116 unique transcripts. Mining this dataset, 1236 EST-SNP markers were developed between two A. duranensis accessions, PI 475887 and Grif 15036. An additional 300 SNP markers also were developed from genomic sequences representing conserved legume orthologs. Of the 1536 SNP markers, 1054 were placed on a genetic map. In addition, 598 EST-SSR markers identified in A. hypogaea assemblies were included in the map along with 37 disease resistance gene candidate (RGC) and 35 other previously published markers. In total, 1724 markers spanning 1081.3 cM over 10 linkage groups were mapped. Gene sequences that provided mapped markers were annotated using similarity searches in three different databases, and gene ontology descriptions were determined using the Medicago Gene Atlas and TAIR databases. Synteny analysis between A. duranensis, Medicago and Glycine revealed significant stretches of conserved gene clusters spread across the peanut genome. A higher level of colinearity was detected between A. duranensis and Glycine than with Medicago.
The first high-density, gene-based linkage map for A. duranensis was generated that can serve as a reference map for both wild and cultivated Arachis species. The markers developed here are valuable resources for the peanut, and more broadly, to the legume research community. The A-genome map will have utility for fine mapping in other peanut species and has already had application for mapping a nematode resistance gene that was introgressed into A. hypogaea from A. cardenasii.
The genus Arachis is native to a region that includes Central Brazil and neighboring countries. Little is known about the genetic variability of the Brazilian cultivated peanut (Arachis hypogaea, genome AABB) germplasm collection at the DNA level. The understanding of the genetic diversity of cultivated and wild species of peanut (Arachis spp.) is essential to develop strategies of collection, conservation and use of the germplasm in variety development. The identity of the ancestor progenitor species of cultivated peanut has also been of great interest. Several species have been suggested as putative AA and BB genome donors to allotetraploid A. hypogaea. Microsatellite or SSR (Simple Sequence Repeat) markers are co-dominant, multiallelic, and highly polymorphic genetic markers, appropriate for genetic diversity studies. Microsatellite markers may also, to some extent, support phylogenetic inferences. Here we report the use of a set of microsatellite markers, including newly developed ones, for phylogenetic inferences and the analysis of genetic variation of accessions of A. hypogea and its wild relatives.
A total of 67 new microsatellite markers (mainly TTG motif) were developed for Arachis. Only three of these markers, however, were polymorphic in cultivated peanut. These three new markers plus five other markers characterized previously were evaluated for number of alleles per locus and gene diversity using 60 accessions of A. hypogaea. Genetic relationships among these 60 accessions and a sample of 36 wild accessions representative of section Arachis were estimated using allelic variation observed in a selected set of 12 SSR markers. Results showed that the Brazilian peanut germplasm collection has considerable levels of genetic diversity detected by SSR markers. Similarity groups for A. hypogaea accessions were established, which is a useful criteria for selecting parental plants for crop improvement. Microsatellite marker transferability was up to 76% for species of the section Arachis, but only 45% for species from the other eight Arachis sections tested. A new marker (Ah-041) presented a 100% transferability and could be used to classify the peanut accessions in AA and non-AA genome carriers.
The level of polymorphism observed among accessions of A. hypogaea analyzed with newly developed microsatellite markers was low, corroborating the accumulated data which show that cultivated peanut presents a relatively reduced variation at the DNA level. A selected panel of SSR markers allowed the classification of A. hypogaea accessions into two major groups. The identification of similarity groups will be useful for the selection of parental plants to be used in breeding programs. Marker transferability is relatively high between accessions of section Arachis. The possibility of using microsatellite markers developed for one species in genetic evaluation of other species greatly reduces the cost of the analysis, since the development of microsatellite markers is still expensive and time consuming. The SSR markers developed in this study could be very useful for genetic analysis of wild species of Arachis, including comparative genome mapping, population genetic structure and phylogenetic inferences among species.
The genus Arachis, originated in South America, is divided into nine taxonomical sections comprising of 80 species. Most of the Arachis species are diploids (2n = 2x = 20) and the tetraploid species (2n = 2x = 40) are found in sections Arachis, Extranervosae and Rhizomatosae. Diploid species have great potential to be used as resistance sources for agronomic traits like pests and diseases, drought related traits and different life cycle spans. Understanding of genetic relationships among wild species and between wild and cultivated species will be useful for enhanced utilization of wild species in improving cultivated germplasm. The present study was undertaken to evaluate genetic relationships among species (96 accessions) belonging to seven sections of Arachis by using simple sequence repeat (SSR) markers developed from Arachis hypogaea genomic library and gene sequences from related genera of Arachis.
The average transferability rate of 101 SSR markers tested to section Arachis and six other sections was 81% and 59% respectively. Five markers (IPAHM 164, IPAHM 165, IPAHM 407a, IPAHM 409, and IPAHM 659) showed 100% transferability. Cluster analysis of allelic data from a subset of 32 SSR markers on 85 wild and 11 cultivated accessions grouped accessions according to their genome composition, sections and species to which they belong. A total of 109 species specific alleles were detected in different wild species, Arachis pusilla exhibited largest number of species specific alleles (15). Based on genetic distance analysis, the A-genome accession ICG 8200 (A. duranensis) and the B-genome accession ICG 8206 (A. ipaënsis) were found most closely related to A. hypogaea.
A set of cross species and cross section transferable SSR markers has been identified that will be useful for genetic studies of wild species of Arachis, including comparative genome mapping, germplasm analysis, population genetic structure and phylogenetic inferences among species. The present study provides strong support based on both genomic and genic markers, probably for the first time, on relationships of A. monticola and A. hypogaea as well as on the most probable donor of A and B-genomes of cultivated groundnut.
Peanut (Arachis hypogaea) is an autogamous allotetraploid legume (2n = 4x = 40) that is widely cultivated as a food and oil crop. More than 6,000 DNA markers have been developed in Arachis spp., but high-density linkage maps useful for genetics, genomics, and breeding have not been constructed due to extremely low genetic diversity. Polymorphic marker loci are useful for the construction of such high-density linkage maps. The present study used in silico analysis to develop simple sequence repeat-based and transposon-based markers.
The use of in silico analysis increased the efficiency of polymorphic marker development by more than 3-fold. In total, 926 (34.2%) of 2,702 markers showed polymorphisms between parental lines of the mapping population. Linkage analysis of the 926 markers along with 253 polymorphic markers selected from 4,449 published markers generated 21 linkage groups covering 2,166.4 cM with 1,114 loci. Based on the map thus produced, 23 quantitative trait loci (QTLs) for 15 agronomical traits were detected. Another linkage map with 326 loci was also constructed and revealed a relationship between the genotypes of the FAD2 genes and the ratio of oleic/linoleic acid in peanut seed.
In silico analysis of polymorphisms increased the efficiency of polymorphic marker development, and contributed to the construction of high-density linkage maps in cultivated peanut. The resultant maps were applicable to QTL analysis. Marker subsets and linkage maps developed in this study should be useful for genetics, genomics, and breeding in Arachis. The data are available at the Kazusa DNA Marker Database (http://marker.kazusa.or.jp).
DNA marker; Genetic linkage map; Peanut (Arachis hypogaea); QTL analysis; Ratio of oleic/linoleic acid (O/L ratio)
Arachis hypogaea (peanut) is an important crop worldwide, being mostly used for edible oil production, direct consumption and animal feed. Cultivated peanut is an allotetraploid species with two different genome components, A and B. Genetic linkage maps can greatly assist molecular breeding and genomic studies. However, the development of linkage maps for A. hypogaea is difficult because it has very low levels of polymorphism. This can be overcome by the utilization of wild species of Arachis, which present the A- and B-genomes in the diploid state, and show high levels of genetic variability.
In this work, we constructed a B-genome linkage map, which will complement the previously published map for the A-genome of Arachis, and produced an entire framework for the tetraploid genome. This map is based on an F2 population of 93 individuals obtained from the cross between the diploid A. ipaënsis (K30076) and the closely related A. magna (K30097), the former species being the most probable B genome donor to cultivated peanut. In spite of being classified as different species, the parents showed high crossability and relatively low polymorphism (22.3%), compared to other interspecific crosses. The map has 10 linkage groups, with 149 loci spanning a total map distance of 1,294 cM. The microsatellite markers utilized, developed for other Arachis species, showed high transferability (81.7%). Segregation distortion was 21.5%. This B-genome map was compared to the A-genome map using 51 common markers, revealing a high degree of synteny between both genomes.
The development of genetic maps for Arachis diploid wild species with A- and B-genomes effectively provides a genetic map for the tetraploid cultivated peanut in two separate diploid components and is a significant advance towards the construction of a transferable reference map for Arachis. Additionally, we were able to identify affinities of some Arachis linkage groups with Medicago truncatula, which will allow the transfer of information from the nearly-complete genome sequences of this model legume to the peanut crop.
The peanut (Arachis hypogaea) is an important oil crop. Breeding for high oil content is becoming increasingly important. Wild Arachis species have been reported to harbor genes for many valuable traits that may enable the improvement of cultivated Arachis hypogaea, such as resistance to pests and disease. However, only limited information is available on variation in oil content. In the present study, a collection of 72 wild Arachis accessions representing 19 species and 3 cultivated peanut accessions were genotyped using 136 genome-wide SSR markers and phenotyped for oil content over three growing seasons. The wild Arachis accessions showed abundant diversity across the 19 species. A. duranensis exhibited the highest diversity, with a Shannon-Weaver diversity index of 0.35. A total of 129 unique alleles were detected in the species studied. A. rigonii exhibited the largest number of unique alleles (75), indicating that this species is highly differentiated. AMOVA and genetic distance analyses confirmed the genetic differentiation between the wild Arachis species. The majority of SSR alleles were detected exclusively in the wild species and not in A. hypogaea, indicating that directional selection or the hitchhiking effect has played an important role in the domestication of the cultivated peanut. The 75 accessions were grouped into three clusters based on population structure and phylogenic analysis, consistent with their taxonomic sections, species and genome types. A. villosa and A. batizocoi were grouped with A. hypogaea, suggesting the close relationship between these two diploid wild species and the cultivated peanut. Considerable phenotypic variation in oil content was observed among different sections and species. Nine alleles were identified as associated with oil content based on association analysis, of these, three alleles were associated with higher oil content but were absent in the cultivated peanut. The results demonstrated that there is great potential to increase the oil content in A. hypogaea by using the wild Arachis germplasm.
Most agriculturally important legumes fall within two sub-clades of the Papilionoid legumes: the Phaseoloids and Galegoids, which diverged about 50 Mya. The Phaseoloids are mostly tropical and include crops such as common bean and soybean. The Galegoids are mostly temperate and include clover, fava bean and the model legumes Lotus and Medicago (both with substantially sequenced genomes). In contrast, peanut (Arachis hypogaea) falls in the Dalbergioid clade which is more basal in its divergence within the Papilionoids. The aim of this work was to integrate the genetic map of Arachis with Lotus and Medicago and improve our understanding of the Arachis genome and legume genomes in general. To do this we placed on the Arachis map, comparative anchor markers defined using a previously described bioinformatics pipeline. Also we investigated the possible role of transposons in the patterns of synteny that were observed.
The Arachis genetic map was substantially aligned with Lotus and Medicago with most synteny blocks presenting a single main affinity to each genome. This indicates that the last common whole genome duplication within the Papilionoid legumes predated the divergence of Arachis from the Galegoids and Phaseoloids sufficiently that the common ancestral genome was substantially diploidized. The Arachis and model legume genomes comparison made here, together with a previously published comparison of Lotus and Medicago allowed all possible Arachis-Lotus-Medicago species by species comparisons to be made and genome syntenies observed. Distinct conserved synteny blocks and non-conserved regions were present in all genome comparisons, implying that certain legume genomic regions are consistently more stable during evolution than others. We found that in Medicago and possibly also in Lotus, retrotransposons tend to be more frequent in the variable regions. Furthermore, while these variable regions generally have lower densities of single copy genes than the more conserved regions, some harbor high densities of the fast evolving disease resistance genes.
We suggest that gene space in Papilionoids may be divided into two broadly defined components: more conserved regions which tend to have low retrotransposon densities and are relatively stable during evolution; and variable regions that tend to have high retrotransposon densities, and whose frequent restructuring may fuel the evolution of some gene families.
Late leaf spot (LLS) and rust have the greatest impact on yield losses worldwide in groundnut (Arachis hypogaea L.). With the objective of identifying tightly linked markers to these diseases, a total of 3,097 simple sequence repeats (SSRs) were screened on the parents of two recombinant inbred line (RIL) populations, namely TAG 24 × GPBD 4 (RIL-4) and TG 26 × GPBD 4 (RIL-5), and segregation data were obtained for 209 marker loci for each of the mapping populations. Linkage map analysis of the 209 loci resulted in the mapping of 188 and 181 loci in RIL-4 and RIL-5 respectively. Using 143 markers common to the two maps, a consensus map with 225 SSR loci and total map distance of 1,152.9 cM was developed. Comprehensive quantitative trait locus (QTL) analysis detected a total of 28 QTL for LLS and 15 QTL for rust. A major QTL for LLS, namely QTLLLS01 (GM1573/GM1009-pPGPseq8D09), with 10.27–62.34% phenotypic variance explained (PVE) was detected in all the six environments in the RIL-4 population. In the case of rust resistance, in addition to marker IPAHM103 identified earlier, four new markers (GM2009, GM1536, GM2301 and GM2079) showed significant association with the major QTL (82.96% PVE). Localization of 42 QTL for LLS and rust on the consensus map identified two candidate genomic regions conferring resistance to LLS and rust. One region present on linkage group AhXV contained three QTL each for LLS (up to 67.98% PVE) and rust (up to 82.96% PVE). The second candidate genomic region contained the major QTL with up to 62.34% PVE for LLS. Molecular markers associated with the major QTL for resistance to LLS and rust can be deployed in molecular breeding for developing groundnut varieties with enhanced resistance to foliar diseases.
Electronic supplementary material
The online version of this article (doi:10.1007/s11032-011-9661-z) contains supplementary material, which is available to authorized users.
Genetic linkage map; Rust resistance; Late leaf spot resistance; Molecular breeding; Molecular markers
Cultivated peanut (Arachis hypogaea) is one of the most widely grown grain legumes in the world, being valued for its high protein and unsaturated oil contents. Worldwide, the major constraints to peanut production are drought and fungal diseases. Wild Arachis species, which are exclusively South American in origin, have high genetic diversity and have been selected during evolution in a range of environments and biotic stresses, constituting a rich source of allele diversity. Arachis stenosperma harbors resistances to a number of pests, including fungal diseases, whilst A. duranensis has shown improved tolerance to water limited stress. In this study, these species were used for the creation of an extensive databank of wild Arachis transcripts under stress which will constitute a rich source for gene discovery and molecular markers development.
Transcriptome analysis of cDNA collections from A. stenosperma challenged with Cercosporidium personatum (Berk. and M.A. Curtis) Deighton, and A. duranensis submitted to gradual water limited stress was conducted using 454 GS FLX Titanium generating a total of 7.4 x 105 raw sequence reads covering 211 Mbp of both genomes. High quality reads were assembled to 7,723 contigs for A. stenosperma and 12,792 for A. duranensis and functional annotation indicated that 95% of the contigs in both species could be appointed to GO annotation categories. A number of transcription factors families and defense related genes were identified in both species. Additionally, the expression of five A. stenosperma Resistance Gene Analogs (RGAs) and four retrotransposon (FIDEL-related) sequences were analyzed by qRT-PCR. This data set was used to design a total of 2,325 EST-SSRs, of which a subset of 584 amplified in both species and 214 were shown to be polymorphic using ePCR.
This study comprises one of the largest unigene dataset for wild Arachis species and will help to elucidate genes involved in responses to biological processes such as fungal diseases and water limited stress. Moreover, it will also facilitate basic and applied research on the genetics of peanut through the development of new molecular markers and the study of adaptive variation across the genus.
Cultivated peanut, Arachis hypogaea is an allotetraploid of recent origin, with an AABB genome. In common with many other polyploids, it seems that a severe genetic bottle-neck was imposed at the species origin, via hybridisation of two wild species and spontaneous chromosome duplication. Therefore, the study of the genome of peanut is hampered both by the crop's low genetic diversity and its polyploidy. In contrast to cultivated peanut, most wild Arachis species are diploid with high genetic diversity. The study of diploid Arachis genomes is therefore attractive, both to simplify the construction of genetic and physical maps, and for the isolation and characterization of wild alleles. The most probable wild ancestors of cultivated peanut are A. duranensis and A. ipaënsis with genome types AA and BB respectively.
We constructed and characterized two large-insert libraries in Bacterial Artificial Chromosome (BAC) vector, one for each of the diploid ancestral species. The libraries (AA and BB) are respectively c. 7.4 and c. 5.3 genome equivalents with low organelle contamination and average insert sizes of 110 and 100 kb. Both libraries were used for the isolation of clones containing genetically mapped legume anchor markers (single copy genes), and resistance gene analogues.
These diploid BAC libraries are important tools for the isolation of wild alleles conferring resistances to biotic stresses, comparisons of orthologous regions of the AA and BB genomes with each other and with other legume species, and will facilitate the construction of a physical map.
The construction of genetic linkage maps for cultivated peanut (Arachis hypogaea L.) has and continues to be an important research goal to facilitate quantitative trait locus (QTL) analysis and gene tagging for use in a marker-assisted selection in breeding. Even though a few maps have been developed, they were constructed using diploid or interspecific tetraploid populations. The most recently published intra-specific map was constructed from the cross of cultivated peanuts, in which only 135 simple sequence repeat (SSR) markers were sparsely populated in 22 linkage groups. The more detailed linkage map with sufficient markers is necessary to be feasible for QTL identification and marker-assisted selection. The objective of this study was to construct a genetic linkage map of cultivated peanut using simple sequence repeat (SSR) markers derived primarily from peanut genomic sequences, expressed sequence tags (ESTs), and by "data mining" sequences released in GenBank.
Three recombinant inbred lines (RILs) populations were constructed from three crosses with one common female parental line Yueyou 13, a high yielding Spanish market type. The four parents were screened with 1044 primer pairs designed to amplify SSRs and 901 primer pairs produced clear PCR products. Of the 901 primer pairs, 146, 124 and 64 primer pairs (markers) were polymorphic in these populations, respectively, and used in genotyping these RIL populations. Individual linkage maps were constructed from each of the three populations and a composite map based on 93 common loci were created using JoinMap. The composite linkage maps consist of 22 composite linkage groups (LG) with 175 SSR markers (including 47 SSRs on the published AA genome maps), representing the 20 chromosomes of A. hypogaea. The total composite map length is 885.4 cM, with an average marker density of 5.8 cM. Segregation distortion in the 3 populations was 23.0%, 13.5% and 7.8% of the markers, respectively. These distorted loci tended to cluster on LG1, LG3, LG4 and LG5. There were only 15 EST-SSR markers mapped due to low polymorphism. By comparison, there were potential synteny, collinear order of some markers and conservation of collinear linkage groups among the maps and with the AA genome but not fully conservative.
A composite linkage map was constructed from three individual mapping populations with 175 SSR markers in 22 composite linkage groups. This composite genetic linkage map is among the first "true" tetraploid peanut maps produced. This map also consists of 47 SSRs that have been used in the published AA genome maps, and could be used in comparative mapping studies. The primers described in this study are PCR-based markers, which are easy to share for genetic mapping in peanuts. All 1044 primer pairs are provided as additional files and the three RIL populations will be made available to public upon request for quantitative trait loci (QTL) analysis and linkage map improvement.
Cultivated peanut or groundnut (Arachis hypogaea L.) is an important oilseed crop with an allotetraploid genome (AABB, 2n = 4x = 40). Both the low level of genetic variation within the cultivated gene pool and its polyploid nature limit the utilization of molecular markers to explore genome structure and facilitate genetic improvement. Nevertheless, a wealth of genetic diversity exists in diploid Arachis species (2n = 2x = 20), which represent a valuable gene pool for cultivated peanut improvement. Interspecific populations have been used widely for genetic mapping in diploid species of Arachis. However, an intraspecific mapping strategy was essential to detect chromosomal rearrangements among species that could be obscured by mapping in interspecific populations. To develop intraspecific reference linkage maps and gain insights into karyotypic evolution within the genus, we comparatively mapped the A- and B-genome diploid species using intraspecific F2 populations. Exploring genome organization among diploid peanut species by comparative mapping will enhance our understanding of the cultivated tetraploid peanut genome. Moreover, new sources of molecular markers that are highly transferable between species and developed from expressed genes will be required to construct saturated genetic maps for peanut.
A total of 2,138 EST-SSR (expressed sequence tag-simple sequence repeat) markers were developed by mining a tetraploid peanut EST assembly including 101,132 unigenes (37,916 contigs and 63,216 singletons) derived from 70,771 long-read (Sanger) and 270,957 short-read (454) sequences. A set of 97 SSR markers were also developed by mining 9,517 genomic survey sequences of Arachis. An SSR-based intraspecific linkage map was constructed using an F2 population derived from a cross between K 9484 (PI 298639) and GKBSPSc 30081 (PI 468327) in the B-genome species A. batizocoi. A high degree of macrosynteny was observed when comparing the homoeologous linkage groups between A (A. duranensis) and B (A. batizocoi) genomes. Comparison of the A- and B-genome genetic linkage maps also showed a total of five inversions and one major reciprocal translocation between two pairs of chromosomes under our current mapping resolution.
Our findings will contribute to understanding tetraploid peanut genome origin and evolution and eventually promote its genetic improvement. The newly developed EST-SSR markers will enrich current molecular marker resources in peanut.
Peanut (Arachis hypogaea); SSR; Genetic linkage map; Intraspecific cross; EST
Wild peanut species (Arachis spp.) are a rich source of new alleles for peanut improvement. Plant transcriptome analysis under specific experimental conditions helps the understanding of cellular processes related, for instance, to development, stress response, and crop yield. The validation of these studies has been generally accomplished by quantitative reverse transcription-polymerase chain reaction (qRT-PCR) which requires normalization of mRNA levels among samples. This can be achieved by comparing the expression ratio between a gene of interest and a reference gene which is constitutively expressed. Nowadays there is a lack of appropriate reference genes for both wild and cultivated Arachis. The identification of such genes would allow a consistent analysis of qRT-PCR data and speed up candidate gene validation in peanut.
A set of ten reference genes were analyzed in four Arachis species (A. magna; A. duranensis; A. stenosperma and A. hypogaea) subjected to biotic (root-knot nematode and leaf spot fungus) and abiotic (drought) stresses, in two distinct plant organs (roots and leaves). By the use of three programs (GeNorm, NormFinder and BestKeeper) and taking into account the entire dataset, five of these ten genes, ACT1 (actin depolymerizing factor-like protein), UBI1 (polyubiquitin), GAPDH (glyceraldehyde-3-phosphate dehydrogenase), 60S (60S ribosomal protein L10) and UBI2 (ubiquitin/ribosomal protein S27a) emerged as top reference genes, with their stability varying in eight subsets. The former three genes were the most stable across all species, organs and treatments studied.
This first in-depth study of reference genes validation in wild Arachis species will allow the use of specific combinations of secure and stable reference genes in qRT-PCR assays. The use of these appropriate references characterized here should improve the accuracy and reliability of gene expression analysis in both wild and cultivated Arachis and contribute for the better understanding of gene expression in, for instance, stress tolerance/resistance mechanisms in plants.
The complex, tetraploid genome structure of peanut (Arachis hypogaea) has obstructed advances in genetics and genomics in the species. The aim of this study is to understand the genome structure of Arachis by developing a high-density integrated consensus map. Three recombinant inbred line populations derived from crosses between the A genome diploid species, Arachis duranensis and Arachis stenosperma; the B genome diploid species, Arachis ipaënsis and Arachis magna; and between the AB genome tetraploids, A. hypogaea and an artificial amphidiploid (A. ipaënsis × A. duranensis)4×, were used to construct genetic linkage maps: 10 linkage groups (LGs) of 544 cM with 597 loci for the A genome; 10 LGs of 461 cM with 798 loci for the B genome; and 20 LGs of 1442 cM with 1469 loci for the AB genome. The resultant maps plus 13 published maps were integrated into a consensus map covering 2651 cM with 3693 marker loci which was anchored to 20 consensus LGs corresponding to the A and B genomes. The comparative genomics with genome sequences of Cajanus cajan, Glycine max, Lotus japonicus, and Medicago truncatula revealed that the Arachis genome has segmented synteny relationship to the other legumes. The comparative maps in legumes, integrated tetraploid consensus maps, and genome-specific diploid maps will increase the genetic and genomic understanding of Arachis and should facilitate molecular breeding.
Arachis spp.; comparative genomics; genetic linkage map; integrated consensus map; legume genome
Background and Aims
The genus Arachis contains 80 described species. Section Arachis is of particular interest because it includes cultivated peanut, an allotetraploid, and closely related wild species, most of which are diploids. This study aimed to analyse the genetic relationships of multiple accessions of section Arachis species using two complementary methods. Microsatellites allowed the analysis of inter- and intraspecific variability. Intron sequences from single-copy genes allowed phylogenetic analysis including the separation of the allotetraploid genome components.
Intron sequences and microsatellite markers were used to reconstruct phylogenetic relationships in section Arachis through maximum parsimony and genetic distance analyses.
Although high intraspecific variability was evident, there was good support for most species. However, some problems were revealed, notably a probable polyphyletic origin for A. kuhlmannii. The validity of the genome groups was well supported. The F, K and D genomes grouped close to the A genome group. The 2n = 18 species grouped closer to the B genome group. The phylogenetic tree based on the intron data strongly indicated that A. duranensis and A. ipaënsis are the ancestors of A. hypogaea and A. monticola. Intron nucleotide substitutions allowed the ages of divergences of the main genome groups to be estimated at a relatively recent 2·3–2·9 million years ago. This age and the number of species described indicate a much higher speciation rate for section Arachis than for legumes in general.
The analyses revealed relationships between the species and genome groups and showed a generally high level of intraspecific genetic diversity. The improved knowledge of species relationships should facilitate the utilization of wild species for peanut improvement. The estimates of speciation rates in section Arachis are high, but not unprecedented. We suggest these high rates may be linked to the peculiar reproductive biology of Arachis.
Arachis; peanut; groundnut; intron sequences; single-copy genes; molecular phylogeny; microsatellites; genetic relationships; speciation rates; genome donors; molecular dating
Late leaf spot (LLS) and rust are two major foliar diseases of groundnut (Arachis hypogaea L.) that often occur together leading to 50–70% yield loss in the crop. A total of 268 recombinant inbred lines of a mapping population TAG 24 × GPBD 4 segregating for LLS and rust were used to undertake quantitative trait locus (QTL) analysis. Phenotyping of the population was carried out under artificial disease epiphytotics. Positive correlations between different stages, high to very high heritability and independent nature of inheritance between both the diseases were observed. Parental genotypes were screened with 1,089 simple sequence repeat (SSR) markers, of which 67 (6.15%) were found polymorphic. Segregation data obtained for these markers facilitated development of partial linkage map (14 linkage groups) with 56 SSR loci. Composite interval mapping (CIM) undertaken on genotyping and phenotyping data yielded 11 QTLs for LLS (explaining 1.70–6.50% phenotypic variation) in three environments and 12 QTLs for rust (explaining 1.70–55.20% phenotypic variation). Interestingly a major QTL associated with rust (QTLrust01), contributing 6.90–55.20% variation, was identified by both CIM and single marker analysis (SMA). A candidate SSR marker (IPAHM 103) linked with this QTL was validated using a wide range of resistant/susceptible breeding lines as well as progeny lines of another mapping population (TG 26 × GPBD 4). Therefore, this marker should be useful for introgressing the major QTL for rust in desired lines/varieties of groundnut through marker-assisted backcrossing.
Electronic supplementary material
The online version of this article (doi:10.1007/s00122-010-1366-x) contains supplementary material, which is available to authorized users.
Cultivated groundnut or peanut (Arachis hypogaea L.), an allotetraploid (2n = 4x = 40), is a self pollinated and widely grown crop in the semi-arid regions of the world. Improvement of drought tolerance is an important area of research for groundnut breeding programmes. Therefore, for the identification of candidate QTLs for drought tolerance, a comprehensive and refined genetic map containing 191 SSR loci based on a single mapping population (TAG 24 × ICGV 86031), segregating for drought and surrogate traits was developed. Genotyping data and phenotyping data collected for more than ten drought related traits in 2–3 seasons were analyzed in detail for identification of main effect QTLs (M-QTLs) and epistatic QTLs (E-QTLs) using QTL Cartographer, QTLNetwork and Genotype Matrix Mapping (GMM) programmes. A total of 105 M-QTLs with 3.48–33.36% phenotypic variation explained (PVE) were identified using QTL Cartographer, while only 65 M-QTLs with 1.3–15.01% PVE were identified using QTLNetwork. A total of 53 M-QTLs were such which were identified using both programmes. On the other hand, GMM identified 186 (8.54–44.72% PVE) and 63 (7.11–21.13% PVE), three and two loci interactions, whereas only 8 E-QTL interactions with 1.7–8.34% PVE were identified through QTLNetwork. Interestingly a number of co-localized QTLs controlling 2–9 traits were also identified. The identification of few major, many minor M-QTLs and QTL × QTL interactions during the present study confirmed the complex and quantitative nature of drought tolerance in groundnut. This study suggests deployment of modern approaches like marker-assisted recurrent selection or genomic selection instead of marker-assisted backcrossing approach for breeding for drought tolerance in groundnut.
Electronic supplementary material
The online version of this article (doi:10.1007/s00122-010-1517-0) contains supplementary material, which is available to authorized users.
Peanut; Drought tolerance; Genetic map; Molecular markers; Main-effect QTLs; Epistatic QTLs; Molecular breeding
Successful introgression of a major QTL for rust resistance, through marker-assisted backcrossing, in three popular Indian peanut cultivars generated several promising introgression lines with enhanced rust resistance and higher yield.
Leaf rust, caused by Puccinia arachidis Speg, is one of the major devastating diseases in peanut (Arachis hypogaea L.). One QTL region on linkage group AhXV explaining upto 82.62 % phenotypic variation for rust resistance was validated and introgressed from cultivar ‘GPBD 4’ into three rust susceptible varieties (‘ICGV 91114’, ‘JL 24’ and ‘TAG 24’) through marker-assisted backcrossing (MABC). The MABC approach employed a total of four markers including one dominant (IPAHM103) and three co-dominant (GM2079, GM1536, GM2301) markers present in the QTL region. After 2–3 backcrosses and selfing, 200 introgression lines (ILs) were developed from all the three crosses. Field evaluation identified 81 ILs with improved rust resistance. Those ILs had significantly increased pod yields (56–96 %) in infested environments compared to the susceptible parents. Screening of selected 43 promising ILs with 13 markers present on linkage group AhXV showed introgression of the target QTL region from the resistant parent in 11 ILs. Multi-location field evaluation of these ILs should lead to the release of improved varieties. The linked markers may be used in improving rust resistance in peanut breeding programmes.
Electronic supplementary material
The online version of this article (doi:10.1007/s00122-014-2338-3) contains supplementary material, which is available to authorized users.
Infectious pancreatic necrosis (IPN) is one of the most prevalent and economically devastating diseases in Atlantic salmon (Salmo salar) farming worldwide. The disease causes large mortalities at both the fry- and post-smolt stages. Family selection for increased IPN resistance is performed through the use of controlled challenge tests, where survival rates of sib-groups are recorded. However, since challenge-tested animals cannot be used as breeding candidates, within-family selection is not performed and only half of the genetic variation for IPN resistance is being exploited. DNA markers linked to quantitative trait loci (QTL) affecting IPN resistance would therefore be a powerful selection tool. The aim of this study was to identify and fine-map QTL for IPN-resistance in Atlantic salmon, for use in marker-assisted selection to increase the rate of genetic improvement for this trait.
A genome scan was carried out using 10 large full-sib families of challenge-tested Atlantic salmon post-smolts and microsatellite markers distributed across the genome. One major QTL for IPN-resistance was detected, explaining 29% and 83% of the phenotypic and genetic variances, respectively. This QTL mapped to the same location as a QTL recently detected in a Scottish Atlantic salmon population. The QTL was found to be segregating in 10 out of 20 mapping parents, and subsequent fine-mapping with additional markers narrowed the QTL peak to a 4 cM region on linkage group 21. Challenge-tested fry were used to show that the QTL had the same effect on fry as on post-smolt, with the confidence interval for QTL position in fry overlapping the confidence interval found in post-smolts. A total of 178 parents were tested for segregation of the QTL, identifying 72 QTL-heterozygous parents. Genotypes at QTL-heterozygous parents were used to determine linkage phases between alleles at the underlying DNA polymorphism and alleles at single markers or multi-marker haplotypes. One four-marker haplotype was found to be the best predictor of QTL alleles, and was successfully used to deduce genotypes of the underlying polymorphism in 72% of the parents of the next generation within a breeding nucleus. A highly significant population-level correlation was found between deduced alleles at the underlying polymorphism and survival of offspring groups in the fry challenge test, parents with the three deduced genotypes (QQ, Qq, qq) having mean offspring mortality rates of 0.13, 0.32, and 0.49, respectively. The frequency of the high-resistance allele (Q) in the population was estimated to be 0.30. Apart from this major QTL, one other experiment-wise significant QTL for IPN-resistance was detected, located on linkage group 4.
The QTL confirmed in this study represents a case of a major gene explaining the bulk of genetic variation for a presumed complex trait. QTL genotypes were deduced within most parents of the 2005 generation of a major breeding company, providing a solid framework for linkage-based MAS within the whole population in subsequent generations. Since haplotype-trait associations valid at the population level were found, there is also a potential for MAS based on linkage disequilibrium (LD). However, in order to use MAS across many generations without reassessment of linkage phases between markers and the underlying polymorphism, the QTL needs to be positioned with even greater accuracy. This will require higher marker densities than are currently available.
Peanut (Arachis hypogaea L) is one of the widely cultivated and leading oilseed crops of the world and its yields are greatly affected by various biotic and abiotic stresses. Arachis diogoi, a wild relative of peanut, is an important source of genes for resistance against various stresses that affect peanut. In our previous study a thaumatin-like protein gene was found to be upregulated in a differential expression reverse transcription PCR (DDRT-PCR) study using the conidial spray of the late leaf spot pathogen, Phaeoisariopsis personata. In the present study, the corresponding full length cDNA was cloned using RACE-PCR and has been designated as AdTLP. It carried an open reading frame of 726 bp potentially capable of encoding a polypeptide of 241 amino acids with 16 conserved cysteine residues. The semi-quantitative RT-PCR analysis showed that the transcript level of AdTLP increased upon treatment with the late leaf spot pathogen of peanut, P. personata and various hormone treatments indicating its involvement in both, biotic and abiotic stresses. The antifungal activity of the purified recombinant protein was checked against different fungal pathogens, which showed enhanced anti-fungal activity compared to many other reported TLP proteins. The recombinant AdTLP-GFP fusion protein was found to be predominantly localized to extracellular spaces. Transgenic tobacco plants ectopically expressing AdTLP showed enhanced resistance to fungal pathogen, Rhizoctonia solani. The seedling assays showed enhanced tolerance of AdTLP transgenic plants against salt and oxidative stress. The transcript analysis of various defense related genes highlighted constitutively higher level expression of PR1a, PI-I and PI-II genes in transgenic plants. These results suggest that the AdTLP is a good candidate gene for enhancing stress resistance in crop plants.
Groundnut (Arachis hypogaea L.), a self-pollinated legume is an important crop cultivated in 24 million ha world over for extraction of edible oil and food uses. The kernels are rich in oil (48–50%) and protein (25–28%), and are source of several vitamins, minerals, antioxidants, biologically active polyphenols, flavonoids, and isoflavones. Improved varieties of groundnut with high yield potential were developed and released for cultivation world over. The improved varieties belong to different maturity durations and possess resistance to diseases, tolerance to drought, enhanced oil content, and improved quality traits for food uses. Conventional breeding procedures along with the tools for phenotyping were largely used in groundnut improvement programs. Mutations were used to induce variability and wide hybridization was attempted to tap variability from wild species. Low genetic variability has been a bottleneck for groundnut improvement. The vast potential of wild species, reservoir of new alleles remains under-utilized. Development of linkage maps of groundnut during the last decade was followed by identification of markers and quantitative trait loci for the target traits. Consequently, the last decade has witnessed the deployment of molecular breeding approaches to complement the ongoing groundnut improvement programs in USA, China, India, and Japan. The other potential advantages of molecular breeding are the feasibility to target multiple traits for improvement and provide tools to tap new alleles from wild species. The first groundnut variety developed through marker-assisted back-crossing is a root-knot nematode-resistant variety, NemaTAM in USA. The uptake of molecular breeding approaches in groundnut improvement programs by NARS partners in India and many African countries is slow or needs to be initiated in part due to inadequate infrastructure, high genotyping costs, and human capacities. Availability of draft genome sequence for diploid (AA and BB) and tetraploid, AABB genome species of Arachis in coming years is expected to bring low-cost genotyping to the groundnut community that will facilitate use of modern genetics and breeding approaches such as genome-wide association studies for trait mapping and genomic selection for crop improvement.
Arachis hypogaea; genetic variability; pedigree; disease resistance; phenotyping; QTLs; molecular breeding; genomic selection
Lack of sufficient molecular markers hinders current genetic research in peanuts (Arachis hypogaea L.). It is necessary to develop more molecular markers for potential use in peanut genetic research. With the development of peanut EST projects, a vast amount of available EST sequence data has been generated. These data offered an opportunity to identify SSR in ESTs by data mining.
In this study, we investigated 24,238 ESTs for the identification and development of SSR markers. In total, 881 SSRs were identified from 780 SSR-containing unique ESTs. On an average, one SSR was found per 7.3 kb of EST sequence with tri-nucleotide motifs (63.9%) being the most abundant followed by di- (32.7%), tetra- (1.7%), hexa- (1.0%) and penta-nucleotide (0.7%) repeat types. The top six motifs included AG/TC (27.7%), AAG/TTC (17.4%), AAT/TTA (11.9%), ACC/TGG (7.72%), ACT/TGA (7.26%) and AT/TA (6.3%). Based on the 780 SSR-containing ESTs, a total of 290 primer pairs were successfully designed and used for validation of the amplification and assessment of the polymorphism among 22 genotypes of cultivated peanuts and 16 accessions of wild species. The results showed that 251 primer pairs yielded amplification products, of which 26 and 221 primer pairs exhibited polymorphism among the cultivated and wild species examined, respectively. Two to four alleles were found in cultivated peanuts, while 3–8 alleles presented in wild species. The apparent broad polymorphism was further confirmed by cloning and sequencing of amplified alleles. Sequence analysis of selected amplified alleles revealed that allelic diversity could be attributed mainly to differences in repeat type and length in the microsatellite regions. In addition, a few single base mutations were observed in the microsatellite flanking regions.
This study gives an insight into the frequency, type and distribution of peanut EST-SSRs and demonstrates successful development of EST-SSR markers in cultivated peanut. These EST-SSR markers could enrich the current resource of molecular markers for the peanut community and would be useful for qualitative and quantitative trait mapping, marker-assisted selection, and genetic diversity studies in cultivated peanut as well as related Arachis species. All of the 251 working primer pairs with names, motifs, repeat types, primer sequences, and alleles tested in cultivated and wild species are listed in Additional File 1.
In this paper, we describe a Ty3-gypsy retrotransposon from allotetraploid peanut (Arachis hypogaea) and its putative diploid ancestors Arachis duranensis (A-genome) and Arachis ipaënsis (B-genome). The consensus sequence is 11,223 bp. The element, named FIDEL (Fairly long Inter-Dispersed Euchromatic LTR retrotransposon), is more frequent in the A- than in the B-genome, with copy numbers of about 3,000 (±950, A. duranensis), 820 (±480, A. ipaënsis), and 3,900 (±1,500, A. hypogaea) per haploid genome. Phylogenetic analysis of reverse transcriptase sequences showed distinct evolution of FIDEL in the ancestor species. Fluorescent in situ hybridization revealed disperse distribution in euchromatin and absence from centromeres, telomeric regions, and the nucleolar organizer region. Using paired sequences from bacterial artificial chromosomes, we showed that elements appear less likely to insert near conserved ancestral genes than near the fast evolving disease resistance gene homologs. Within the Ty3-gypsy elements, FIDEL is most closely related with the Athila/Calypso group of retrovirus-like retrotransposons. Putative transmembrane domains were identified, supporting the presence of a vestigial envelope gene. The results emphasize the importance of FIDEL in the evolution and divergence of different Arachis genomes and also may serve as an example of the role of retrotransposons in the evolution of legume genomes in general.
Electronic supplementary material
The online version of this article (doi:10.1007/s10577-009-9109-z) contains supplementary material, which is available to authorized users.
peanut; Arachis; retrotransposon; retrovirus-like; fluorescent in situ hybridization
Peanut (Arachis hypogaea L.) is an important crop economically and nutritionally, and is one of the most susceptible host crops to colonization of Aspergillus parasiticus and subsequent aflatoxin contamination. Knowledge from molecular genetic studies could help to devise strategies in alleviating this problem; however, few peanut DNA sequences are available in the public database. In order to understand the molecular basis of host resistance to aflatoxin contamination, a large-scale project was conducted to generate expressed sequence tags (ESTs) from developing seeds to identify resistance-related genes involved in defense response against Aspergillus infection and subsequent aflatoxin contamination.
We constructed six different cDNA libraries derived from developing peanut seeds at three reproduction stages (R5, R6 and R7) from a resistant and a susceptible cultivated peanut genotypes, 'Tifrunner' (susceptible to Aspergillus infection with higher aflatoxin contamination and resistant to TSWV) and 'GT-C20' (resistant to Aspergillus with reduced aflatoxin contamination and susceptible to TSWV). The developing peanut seed tissues were challenged by A. parasiticus and drought stress in the field. A total of 24,192 randomly selected cDNA clones from six libraries were sequenced. After removing vector sequences and quality trimming, 21,777 high-quality EST sequences were generated. Sequence clustering and assembling resulted in 8,689 unique EST sequences with 1,741 tentative consensus EST sequences (TCs) and 6,948 singleton ESTs. Functional classification was performed according to MIPS functional catalogue criteria. The unique EST sequences were divided into twenty-two categories. A similarity search against the non-redundant protein database available from NCBI indicated that 84.78% of total ESTs showed significant similarity to known proteins, of which 165 genes had been previously reported in peanuts. There were differences in overall expression patterns in different libraries and genotypes. A number of sequences were expressed throughout all of the libraries, representing constitutive expressed sequences. In order to identify resistance-related genes with significantly differential expression, a statistical analysis to estimate the relative abundance (R) was used to compare the relative abundance of each gene transcripts in each cDNA library. Thirty six and forty seven unique EST sequences with threshold of R > 4 from libraries of 'GT-C20' and 'Tifrunner', respectively, were selected for examination of temporal gene expression patterns according to EST frequencies. Nine and eight resistance-related genes with significant up-regulation were obtained in 'GT-C20' and 'Tifrunner' libraries, respectively. Among them, three genes were common in both genotypes. Furthermore, a comparison of our EST sequences with other plant sequences in the TIGR Gene Indices libraries showed that the percentage of peanut EST matched to Arabidopsis thaliana, maize (Zea mays), Medicago truncatula, rapeseed (Brassica napus), rice (Oryza sativa), soybean (Glycine max) and wheat (Triticum aestivum) ESTs ranged from 33.84% to 79.46% with the sequence identity ≥ 80%. These results revealed that peanut ESTs are more closely related to legume species than to cereal crops, and more homologous to dicot than to monocot plant species.
The developed ESTs can be used to discover novel sequences or genes, to identify resistance-related genes and to detect the differences among alleles or markers between these resistant and susceptible peanut genotypes. Additionally, this large collection of cultivated peanut EST sequences will make it possible to construct microarrays for gene expression studies and for further characterization of host resistance mechanisms. It will be a valuable genomic resource for the peanut community. The 21,777 ESTs have been deposited to the NCBI GenBank database with accession numbers ES702769 to ES724546.
Cultivated peanut, or groundnut (Arachis hypogaea L.), is an important oilseed crop with an allotetraploid genome (AABB, 2n = 4x = 40). In recent years, many efforts have been made to construct linkage maps in cultivated peanut, but almost all of these maps were constructed using low-throughput molecular markers, and most show a low density, directly influencing the value of their applications. With advances in next-generation sequencing (NGS) technology, the construction of high-density genetic maps has become more achievable in a cost-effective and rapid manner. The objective of this study was to establish a high-density single nucleotide polymorphism (SNP)-based genetic map for cultivated peanut by analyzing next-generation double-digest restriction-site-associated DNA sequencing (ddRADseq) reads.
We constructed reduced representation libraries (RRLs) for two A. hypogaea lines and 166 of their recombinant inbred line (RIL) progenies using the ddRADseq technique. Approximately 175 gigabases of data containing 952,679,665 paired-end reads were obtained following Solexa sequencing. Mining this dataset, 53,257 SNPs were detected between the parents, of which 14,663 SNPs were also detected in the population, and 1,765 of the obtained polymorphic markers met the requirements for use in the construction of a genetic map. Among 50 randomly selected in silico SNPs, 47 were able to be successfully validated. One linkage map was constructed, which was comprised of 1,685 marker loci, including 1,621 SNPs and 64 simple sequence repeat (SSR) markers. The map displayed a distribution of the markers into 20 linkage groups (LGs A01–A10 and B01–B10), spanning a distance of 1,446.7 cM. The alignment of the LGs from this map was shown in comparison with a previously integrated consensus map from peanut.
This study showed that the ddRAD library combined with NGS allowed the rapid discovery of a large number of SNPs in the cultivated peanut. The first high density SNP-based linkage map for A. hypogaea was generated that can serve as a reference map for cultivated Arachis species and will be useful in genetic mapping. Our results contribute to the available molecular marker resources and to the assembly of a reference genome sequence for the peanut.
Electronic supplementary material
The online version of this article (doi:10.1186/1471-2164-15-351) contains supplementary material, which is available to authorized users.
Cultivated peanut; Linkage map; SNP; ddRADseq