Field pea (Pisum sativum L.) and faba bean (Vicia faba L.) are cool-season grain legume species that provide rich sources of food for humans and fodder for livestock. To date, both species have been relative 'genomic orphans' due to limited availability of genetic and genomic information. A significant enrichment of genomic resources is consequently required in order to understand the genetic architecture of important agronomic traits, and to support germplasm enhancement, genetic diversity, population structure and demographic studies.
cDNA samples obtained from various tissue types of specific field pea and faba bean genotypes were sequenced using 454 Roche GS FLX Titanium technology. A total of 720,324 and 304,680 reads for field pea and faba bean, respectively, were de novo assembled to generate sets of 70,682 and 60,440 unigenes. Consensus sequences were compared against the genome of the model legume species Medicago truncatula Gaertn., as well as that of the more distantly related, but better-characterised genome of Arabidopsis thaliana L.. In comparison to M. truncatula coding sequences, 11,737 and 10,179 unique hits were obtained from field pea and faba bean. Totals of 22,057 field pea and 18,052 faba bean unigenes were subsequently annotated from GenBank. Comparison to the genome of soybean (Glycine max L.) resulted in 19,451 unique hits for field pea and 16,497 unique hits for faba bean, corresponding to c. 35% and 30% of the known gene space, respectively. Simple sequence repeat (SSR)-containing expressed sequence tags (ESTs) were identified from consensus sequences, and totals of 2,397 and 802 primer pairs were designed for field pea and faba bean. Subsets of 96 EST-SSR markers were screened for validation across modest panels of field pea and faba bean cultivars, as well as related non-domesticated species. For field pea, 86 primer pairs successfully obtained amplification products from one or more template genotypes, of which 59% revealed polymorphism between 6 genotypes. In the case of faba bean, 81 primer pairs displayed successful amplification, of which 48% detected polymorphism.
The generation of EST datasets for field pea and faba bean has permitted effective unigene identification and functional sequence annotation. EST-SSR loci were detected at incidences of 14-17%, permitting design of comprehensive sets of primer pairs. The subsets from these primer pairs proved highly useful for polymorphism detection within Pisum and Vicia germplasm.
Field pea (Pisum sativum L.) is a self-pollinating, diploid, cool-season food legume. Crop production is constrained by multiple biotic and abiotic stress factors, including salinity, that cause reduced growth and yield. Recent advances in genomics have permitted the development of low-cost high-throughput genotyping systems, allowing the construction of saturated genetic linkage maps for identification of quantitative trait loci (QTLs) associated with traits of interest. Genetic markers in close linkage with the relevant genomic regions may then be implemented in varietal improvement programs.
In this study, single nucleotide polymorphism (SNP) markers associated with expressed sequence tags (ESTs) were developed and used to generate comprehensive linkage maps for field pea. From a set of 36,188 variant nucleotide positions detected through in silico analysis, 768 were selected for genotyping of a recombinant inbred line (RIL) population. A total of 705 SNPs (91.7%) successfully detected segregating polymorphisms. In addition to SNPs, genomic and EST-derived simple sequence repeats (SSRs) were assigned to the genetic map in order to obtain an evenly distributed genome-wide coverage. Sequences associated with the mapped molecular markers were used for comparative genomic analysis with other legume species. Higher levels of conserved synteny were observed with the genomes of Medicago truncatula Gaertn. and chickpea (Cicer arietinum L.) than with soybean (Glycine max [L.] Merr.), Lotus japonicus L. and pigeon pea (Cajanus cajan [L.] Millsp.). Parents and RIL progeny were screened at the seedling growth stage for responses to salinity stress, imposed by addition of NaCl in the watering solution at a concentration of 18 dS m-1. Salinity-induced symptoms showed normal distribution, and the severity of the symptoms increased over time. QTLs for salinity tolerance were identified on linkage groups Ps III and VII, with flanking SNP markers suitable for selection of resistant cultivars. Comparison of sequences underpinning these SNP markers to the M. truncatula genome defined genomic regions containing candidate genes associated with saline stress tolerance.
The SNP assays and associated genetic linkage maps developed in this study permitted identification of salinity tolerance QTLs and candidate genes. This constitutes an important set of tools for marker-assisted selection (MAS) programs aimed at performance enhancement of field pea cultivars.
Grain legume; Genetic marker; Trait dissection; Comparative genomics; Abiotic stress; Breeding
Gene-based SNPs were identified and mapped in pea using five recombinant inbred line populations segregating for traits of agronomic importance.
Pea (Pisum sativum L.) is one of the world’s oldest domesticated crops and has been a model system in plant biology and genetics since the work of Gregor Mendel. Pea is the second most widely grown pulse crop in the world following common bean. The importance of pea as a food crop is growing due to its combination of moderate protein concentration, slowly digestible starch, high dietary fiber concentration, and its richness in micronutrients; however, pea has lagged behind other major crops in harnessing recent advances in molecular biology, genomics and bioinformatics, partly due to its large genome size with a large proportion of repetitive sequence, and to the relatively limited investment in research in this crop globally. The objective of this research was the development of a genome-wide transcriptome-based pea single-nucleotide polymorphism (SNP) marker platform using next-generation sequencing technology. A total of 1,536 polymorphic SNP loci selected from over 20,000 non-redundant SNPs identified using deep transcriptome sequencing of eight diverse Pisum accessions were used for genotyping in five RIL populations using an Illumina GoldenGate assay. The first high-density pea SNP map defining all seven linkage groups was generated by integrating with previously published anchor markers. Syntenic relationships of this map with the model legume Medicago truncatula and lentil (Lens culinaris Medik.) maps were established. The genic SNP map establishes a foundation for future molecular breeding efforts by enabling both the identification and tracking of introgression of genomic regions harbouring QTLs related to agronomic and seed quality traits.
Electronic supplementary material
The online version of this article (doi:10.1007/s00122-014-2375-y) contains supplementary material, which is available to authorized users.
Pea has a complex genome of 4.3 Gb for which only limited genomic resources are available to date. Although SNP markers are now highly valuable for research and modern breeding, only a few are described and used in pea for genetic diversity and linkage analysis.
We developed a large resource by cDNA sequencing of 8 genotypes representative of modern breeding material using the Roche 454 technology, combining both long reads (400 bp) and high coverage (3.8 million reads, reaching a total of 1,369 megabases). Sequencing data were assembled and generated a 68 K unigene set, from which 41 K were annotated from their best blast hit against the model species Medicago truncatula. Annotated contigs showed an even distribution along M. truncatula pseudochromosomes, suggesting a good representation of the pea genome. 10 K pea contigs were found to be polymorphic among the genetic material surveyed, corresponding to 35 K SNPs.
We validated a subset of 1538 SNPs through the GoldenGate assay, proving their ability to structure a diversity panel of breeding germplasm. Among them, 1340 were genetically mapped and used to build a new consensus map comprising a total of 2070 markers. Based on blast analysis, we could establish 1252 bridges between our pea consensus map and the pseudochromosomes of M. truncatula, which provides new insight on synteny between the two species.
Our approach created significant new resources in pea, i.e. the most comprehensive genetic map to date tightly linked to the model species M. truncatula and a large SNP resource for both academic research and breeding.
Pisum sativum; Medicago truncatula; Next generation sequencing; Genetic diversity; Composite genetic map; Synteny; Marker assisted selection
Development of durable plant genetic resistance to pathogens through strategies of QTL pyramiding and diversification requires in depth knowledge of polygenic resistance within the available germplasm. Polygenic partial resistance to Aphanomyces root rot, caused by Aphanomyces euteiches, one of the most damaging pathogens of pea worldwide, was previously dissected in individual mapping populations. However, there are no data available regarding the diversity of the resistance QTL across a broader collection of pea germplasm. In this study, we performed a meta-analysis of Aphanomyces root rot resistance QTL in the four main sources of resistance in pea and compared their genomic localization with genes/QTL controlling morphological or phenological traits and with putative candidate genes.
Meta-analysis, conducted using 244 individual QTL reported previously in three mapping populations (Puget x 90–2079, Baccara x PI180693 and Baccara x 552) and in a fourth mapping population in this study (DSP x 90–2131), resulted in the identification of 27 meta-QTL for resistance to A. euteiches. Confidence intervals of meta-QTL were, on average, reduced four-fold compared to mean confidence intervals of individual QTL. Eleven consistent meta-QTL, which highlight seven highly consistent genomic regions, were identified. Few meta-QTL specificities were observed among mapping populations, suggesting that sources of resistance are not independent. Seven resistance meta-QTL, including six of the highly consistent genomic regions, co-localized with six of the meta-QTL identified in this study for earliness and plant height and with three morphological genes (Af, A, R). Alleles contributing to the resistance were often associated with undesirable alleles for dry pea breeding. Candidate genes underlying six main meta-QTL regions were identified using colinearity between the pea and Medicago truncatula genomes.
QTL meta-analysis provided an overview of the moderately low diversity of loci controlling partial resistance to A. euteiches in four main sources of resistance in pea. Seven highly consistent genomic regions with potential use in marker-assisted-selection were identified. Confidence intervals at several main QTL regions were reduced and co-segregation among resistance and morphological/phenological alleles was identified. Further work will be required to identify the best combinations of QTL for durably increasing partial resistance to A. euteiches.
Aphanomyces root rot; Pisum sativum; Meta-QTL; QTL diversity; Consistent genomic regions; Resistance alleles; Undesirable alleles
White mold, caused by Sclerotinia sclerotiorum, is one of the most important diseases of pea (Pisum sativum L.), however, little is known about the genetics and biochemistry of this interaction. Identification of genes underlying resistance in the host or pathogenicity and virulence factors in the pathogen will increase our knowledge of the pea-S. sclerotiorum interaction and facilitate the introgression of new resistance genes into commercial pea varieties. Although the S. sclerotiorum genome sequence is available, no pea genome is available, due in part to its large genome size (~3500 Mb) and extensive repeated motifs. Here we present an EST data set specific to the interaction between S. sclerotiorum and pea, and a method to distinguish pathogen and host sequences without a species-specific reference genome.
10,158 contigs were obtained by de novo assembly of 128,720 high-quality reads generated by 454 pyrosequencing of the pea-S. sclerotiorum interactome. A method based on the tBLASTx program was modified to distinguish pea and S. sclerotiorum ESTs. To test this strategy, a mixture of known ESTs (18,490 pea and 17,198 S. sclerotiorum ESTs) from public databases were pooled and parsed; the tBLASTx method successfully separated 90.1% of the artificial EST mix with 99.9% accuracy. The tBLASTx method successfully parsed 89.4% of the 454-derived EST contigs, as validated by PCR, into pea (6,299 contigs) and S. sclerotiorum (2,780 contigs) categories. Two thousand eight hundred and forty pea ESTs and 996 S. sclerotiorum ESTs were predicted to be expressed specifically during the pea-S. sclerotiorum interaction as determined by homology search against 81,449 pea ESTs (from flowers, leaves, cotyledons, epi- and hypocotyl, and etiolated and light treated etiolated seedlings) and 57,751 S. sclerotiorum ESTs (from mycelia at neutral pH, developing apothecia and developing sclerotia). Among those ESTs specifically expressed, 277 (9.8%) pea ESTs were predicted to be involved in plant defense and response to biotic or abiotic stress, and 93 (9.3%) S. sclerotiorum ESTs were predicted to be involved in pathogenicity/virulence. Additionally, 142 S. sclerotiorum ESTs were identified as secretory/signal peptides of which only 21 were previously reported.
We present and characterize an EST resource specific to the pea-S. sclerotiorum interaction. Additionally, the tBLASTx method used to parse S. sclerotiorum and pea ESTs was demonstrated to be a reliable and accurate method to distinguish ESTs without a reference genome.
Pisum sativum; Sclerotinia sclerotiorum; Transcriptome; Parsing of host-pathogen sequences; Non-model organism
Ascochyta blight, caused by Mycosphaerella pinodes is one of the most important pea pathogens. However, little is known about the genes and mechanisms of resistance acting against M. pinodes in pea. Resistance identified so far to this pathogen is incomplete, polygenic and scarce in pea, being most common in Pisum relatives. The identification of the genes underlying resistance would increase our knowledge about M. pinodes-pea interaction and would facilitate the introgression of resistance into pea varieties. In the present study differentially expressed genes in the resistant P. sativum ssp. syriacum accession P665 comparing to the susceptible pea cv. Messire after inoculation with M. pinodes have been identified using a M. truncatula microarray.
Of the 16,470 sequences analysed, 346 were differentially regulated. Differentially regulated genes belonged to almost all functional categories and included genes involved in defense such as genes involved in cell wall reinforcement, phenylpropanoid and phytoalexins metabolism, pathogenesis- related (PR) proteins and detoxification processes. Genes associated with jasmonic acid (JA) and ethylene signal transduction pathways were induced suggesting that the response to M. pinodes in pea is regulated via JA and ET pathways. Expression levels of ten differentially regulated genes were validated in inoculated and control plants using qRT-PCR showing that the P665 accession shows constitutively an increased expression of the defense related genes as peroxidases, disease resistance response protein 39 (DRR230-b), glutathione S-transferase (GST) and 6a-hydroxymaackiain methyltransferase.
Through this study a global view of genes expressed during resistance to M. pinodes has been obtained, giving relevant information about the mechanisms and pathways conferring resistance to this important disease. In addition, the M. truncatula microarray represents an efficient tool to identify candidate genes controlling resistance to M. pinodes in pea.
Grain aphid (Sitobion avenae F) and pea aphid (Acyrthosiphon pisum) are two agriculturally important pest species, which cause significant yield losses to crop plants each year by inflicting damage both through the direct effects of feeding and by vectoring debilitating plant viruses. Although a close phylogenetic relationship between grain aphid and pea aphid was proposed, the biological variations between these two aphid species are obvious. While the host ranges of grain aphid is restricted to cereal crops and in particular wheat, that of pea aphid is wider, mainly colonizing leguminous plant species. Until now, the genetic factors underlying the divergence between grain aphid and pea aphid still remain unclear due to the limited genomic data of grain aphid available in public databases.
Based on a set of transcriptome data of grain aphid generated by using Roche 454 GS-FLX pyrosequencing, comparative analysis between this set of transcriptome data of grain aphid and mRNA sequences of pea aphid available in the public databases was performed. Compared with mRNA sequences of pea aphid, 4,857 unigenes were found to be specifically presented in the transcriptome of grain aphid under the rearing conditions described in this study. Furthermore, 3,368 orthologous pairs which could be calculated with both nonsynonymous (Ka) and synonymous (Ks) substitutions were used to infer their sequence divergences. The average differences in the coding, 5′ and 3′ untranslated regions of these orthologs were 10.53%, 21.29% and 18.96%, respectively. Moreover, of 340 orthologs which were identified to have evolved in response to positive selection based on the rates of Ka and Ks substitutions, 186 were predicted to be involved in secondary metabolism and xenobiotic metabolisms which might contribute to the divergence of these two aphid species.
The comprehensive transcriptome divergent sequence analysis between grain aphid and pea aphid provides an invaluable resource for the investigation of genes involved in host plant adaptation and evolution. Moreover, the demonstration of divergent transcriptome sequences between grain aphid and pea aphid pave the way for the investigation of the molecular mechanisms underpinning the biological variations of these two agriculturally important aphid species.
Electronic supplementary material
The online version of this article (doi:10.1186/1471-2164-15-1023) contains supplementary material, which is available to authorized users.
Grain aphid; Pea aphid; Transcriptome; Comparative transcriptomic analysis; Divergence
Rose-comb, a classical monogenic trait of chickens, is characterized by a drastically altered comb morphology compared to the single-combed wild-type. Here we show that Rose-comb is caused by a 7.4 Mb inversion on chromosome 7 and that a second Rose-comb allele arose by unequal crossing over between a Rose-comb and wild-type chromosome. The comb phenotype is caused by the relocalization of the MNR2 homeodomain protein gene leading to transient ectopic expression of MNR2 during comb development. We also provide a molecular explanation for the first example of epistatic interaction reported by Bateson and Punnett 104 years ago, namely that walnut-comb is caused by the combined effects of the Rose-comb and Pea-comb alleles. Transient ectopic expression of MNR2 and SOX5 (causing the Pea-comb phenotype) occurs in the same population of mesenchymal cells and with at least partially overlapping expression in individual cells in the comb primordium. Rose-comb has pleiotropic effects, as homozygosity in males has been associated with poor sperm motility. We postulate that this is caused by the disruption of the CCDC108 gene located at one of the inversion breakpoints. CCDC108 is a poorly characterized protein, but it contains a MSP (major sperm protein) domain and is expressed in testis. The study illustrates several characteristic features of the genetic diversity present in domestic animals, including the evolution of alleles by two or more consecutive mutations and the fact that structural changes have contributed to fast phenotypic evolution.
Comb morphology is a trait that shows considerable variability among domestic chickens. The Rose-comb mutation causes a drastically altered shape of the comb, whereas the Pea-comb mutation leads to a considerable reduction in the size of the comb. The combined effect of Rose-comb and Pea-comb is a comb shaped like a walnut, and the phenotype is consequently named walnut-comb. Both Pea-comb and Rose-comb are caused by structural changes in the genome leading to altered expression of important transcription factors. In a previous study we showed that Pea-comb is caused by misexpression of SOX5 during the development of the comb. In this study we report that Rose-comb is caused by a large inversion on chicken chromosome 7. The inversion moves the MNR2 gene to a new genomic location. This leads to misexpression of MNR2 during comb development, similar to the defect causing Pea-comb. Roosters that are homozygous for the Rose-comb inversion show poor sperm motility, and our results suggest that this is caused by the disruption of the CCDC108 gene that is located at one of the inversion breakpoints. CCDC108 is well conserved between chickens and humans, and this study establishes CCDC108 as a candidate gene for sperm motility disorders in humans.
The development of genetic markers is complex and costly in species with little pre-existing genomic information. Faba bean possesses one of the largest and least studied genomes among cultivated crop plants and no gene-based genetic maps exist. Gene-based orthologous markers allow chromosomal regions and levels of synteny to be characterised between species, reveal phylogenetic relationships and chromosomal evolution, and enable targeted identification of markers for crop breeding. In this study orthologous codominant cross-species markers have been deployed to produce the first exclusively gene-based genetic linkage map of faba bean (Vicia faba), using an F6 population developed from a cross between the lines Vf6 (equina type) and Vf27 (paucijuga type).
Of 796 intron-targeted amplified polymorphic (ITAP) markers screened, 151 markers could be used to construct a comparative genetic map. Linkage analysis revealed seven major and five small linkage groups (LGs), one pair and 12 unlinked markers. Each LG was comprised of three to 30 markers and varied in length from 23.6 cM to 324.8 cM. The map spanned a total length of 1685.8 cM. A simple and direct macrosyntenic relationship between faba bean and Medicago truncatula was evident, while faba bean and lentil shared a common rearrangement relative to M. truncatula. One hundred and four of the 127 mapped markers in the 12 LGs, which were previously assigned to M. truncatula genetic and physical maps, were found in regions syntenic between the faba bean and M. truncatula genomes. However chromosomal rearrangements were observed that could explain the difference in chromosome numbers between these three legume species. These rearrangements suggested high conservation of M. truncatula chromosomes 1, 5 and 8; moderate conservation of chromosomes 2, 3, 4 and 7 and no conservation with M. truncatula chromosome 6. Multiple PCR amplicons and comparative mapping were suggestive of small-scale duplication events in faba bean. This study also provides a preliminary indication for finer scale macrosynteny between M. truncatula, lentil and faba bean. Markers originally designed from genes on the same M. truncatula BACs were found to be grouped together in corresponding syntenic areas in lentil and faba bean.
Despite the large size of the faba bean genome, comparative mapping did not reveal evidence for polyploidisation, segmental duplication, or significant rearrangements compared to M. truncatula, although a bias in the use of single locus markers may have limited the detection of duplications. Non-coding repetitive DNA or transposable element content provides a possible explanation for the difference in genome sizes. Similar patterns of rearrangements in faba bean and lentil compared to M. truncatula support phylogenetic studies dividing these species into the tribes Viceae and Trifoliae. However, substantial macrosynteny was apparent between faba bean and M. truncatula, with the exception of chromosome 6 where no orthologous markers were found, confirming previous investigations suggesting chromosome 6 is atypical. The composite map, anchored with orthologous markers mapped in M. truncatula, provides a central reference map for future use of genomic and genetic information in faba bean genetic analysis and breeding.
The genetic regulation of flower color has been widely studied, notably as a character used by Mendel and his predecessors in the study of inheritance in pea.
We used the genome sequence of model legumes, together with their known synteny to the pea genome to identify candidate genes for the A and A2 loci in pea. We then used a combination of genetic mapping, fast neutron mutant analysis, allelic diversity, transcript quantification and transient expression complementation studies to confirm the identity of the candidates.
We have identified the pea genes A and A2. A is the factor determining anthocyanin pigmentation in pea that was used by Gregor Mendel 150 years ago in his study of inheritance. The A gene encodes a bHLH transcription factor. The white flowered mutant allele most likely used by Mendel is a simple G to A transition in a splice donor site that leads to a mis-spliced mRNA with a premature stop codon, and we have identified a second rare mutant allele. The A2 gene encodes a WD40 protein that is part of an evolutionarily conserved regulatory complex.
Highly polymorphic and transferable microsatellites (SSRs) are important for comparative genomics, genome analysis and
phylogenetic studies. Development of novel species-specific microsatellite markers remains a costly and labor-intensive project.
Therefore, interest has been shifted from genomic to genic markers owing to their high inter-species transferability as they are
developed from conserved coding regions of the genome. This study concentrates on comparative analysis of genic microsatellites
in nine important legume (Arachis hypogaea, Cajanus cajan, Cicer arietinum, Glycine max, Lotus japonicus, Medicago truncatula, Phaseolus
vulgaris, Pisum sativum and Vigna unguiculata) and two model plant species (Oryza sativa and Arabidopsis thaliana). Screening of a
total of 228090 putative unique sequences spanning 219610522 bp using a microsatellite search tool, MISA, identified 12.18% of the
unigenes containing 36248 microsatellite motifs excluding mononucleotide repeats. Frequency of legume unigene-derived SSRs
was one SSR in every 6.0 kb of analyzed sequences. The trinucleotide repeats were predominant in all the unigenes with the
exception of C. cajan, which showed prevalence of dinucleotide repeats over trinucleotide repeats. Dinucleotide repeats along with
trinucleotides counted for more than 90% of the total microsatellites. Among dinucleotide and trinucleotide repeats, AG and AAG
motifs, respectively, were the most frequent. Microsatellite positive chickpea unigenes were assigned Gene Ontology (GO) terms to
identify the possible role of unigenes in various molecular and biological functions. These unigene based microsatellite markers
will prove valuable for recording allelic variance across germplasm collections, gene tagging and searching for putative candidate
Microsatellites; SSRs; Unigenes; Legumes; Functional annotation
The locations of the two replication origins in pea chloroplast DNA (ctDNA) have been mapped by electron microscopic analysis of restriction digests of supercoiled ctDNA cross-linked with trioxalen. Both origins of replication, identified as displacement loops (D-loops), were present in the 44-kilobase-pair (kbp) SalI A fragment. The first D-loop was located at 9.0 kbp from the closest SalI restriction site. The average size of this D-loop was about 0.7 kbp. The second D-loop started 14.2 kbp in from the same restriction site and ended at about 15.5 kbp, giving it a size of about 1.3 kbp. The orientation of these two D-loops on the restriction map of pea ctDNA was determined by analyzing SmaI, PstI, and SalI-SmaI restriction digests of pea ctDNA. One D-loop has been mapped in the spacer region between the 16S and 23S rRNA genes. The second D-loop was located downstream of the 23S rRNA gene. Denaturation mapping of recombinants pCP 12-7 and pCB 1-12, which contain both D-loops, confirmed the location of the D-loops in the restriction map of pea ctDNA. Denaturation-mapping studies also showed that the two D-loops had different base compositions; the one closest to a SalI restriction site denatured readily compared with the other D-loop. The recombinants pCP 12-7 and pCB 1-12 were found to be highly active in DNA synthesis when used as templates in a partially purified replication system from pea chloroplasts. Analysis of in vitro-synthesized DNA with either of these recombinants showed that full-length template DNA was synthesized. Recombinants from other regions of the pea chloroplast genome showed no significant DNA synthesis activity in vitro.
Cowpea, Vigna unguiculata (L.) Walp., is one of the most important food and forage legumes in the semi-arid tropics because of its drought tolerance and ability to grow on poor quality soils. Approximately 80% of cowpea production takes place in the dry savannahs of tropical West and Central Africa, mostly by poor subsistence farmers. Despite its economic and social importance in the developing world, cowpea remains to a large extent an underexploited crop. Among the major goals of cowpea breeding and improvement programs is the stacking of desirable agronomic traits, such as disease and pest resistance and response to abiotic stresses. Implementation of marker-assisted selection and breeding programs is severely limited by a paucity of trait-linked markers and a general lack of information on gene structure and organization. With a nuclear genome size estimated at ~620 Mb, the cowpea genome is an ideal target for reduced representation sequencing.
We report here the sequencing and analysis of the gene-rich, hypomethylated portion of the cowpea genome selectively cloned by methylation filtration (MF) technology. Over 250,000 gene-space sequence reads (GSRs) with an average length of 610 bp were generated, yielding ~160 Mb of sequence information. The GSRs were assembled, annotated by BLAST homology searches of four public protein annotation databases and four plant proteomes (A. thaliana, M. truncatula, O. sativa, and P. trichocarpa), and analyzed using various domain and gene modeling tools. A total of 41,260 GSR assemblies and singletons were annotated, of which 19,786 have unique GenBank accession numbers. Within the GSR dataset, 29% of the sequences were annotated using the Arabidopsis Gene Ontology (GO) with the largest categories of assigned function being catalytic activity and metabolic processes, groups that include the majority of cellular enzymes and components of amino acid, carbohydrate and lipid metabolism. A total of 5,888 GSRs had homology to genes encoding transcription factors (TFs) and transcription associated factors (TAFs) representing about 5% of the total annotated sequences in the dataset. Sixty-two (62) of the 64 well-characterized plant transcription factor (TF) gene families are represented in the cowpea GSRs, and these families are of similar size and phylogenetic organization to those characterized in other plants. The cowpea GSRs also provides a rich source of genes involved in photoperiodic control, symbiosis, and defense-related responses. Comparisons to available databases revealed that about 74% of cowpea ESTs and 70% of all legume ESTs were represented in the GSR dataset. As approximately 12% of all GSRs contain an identifiable simple-sequence repeat, the dataset is a powerful resource for the design of microsatellite markers.
The availability of extensive publicly available genomic data for cowpea, a non-model legume with significant importance in the developing world, represents a significant step forward in legume research. Not only does the gene space sequence enable the detailed analysis of gene structure, gene family organization and phylogenetic relationships within cowpea, but it also facilitates the characterization of syntenic relationships with other cultivated and model legumes, and will contribute to determining patterns of chromosomal evolution in the Leguminosae. The micro and macrosyntenic relationships detected between cowpea and other cultivated and model legumes should simplify the identification of informative markers for marker-assisted trait selection and map-based gene isolation necessary for cowpea improvement.
The eukaryotic translation initiation factor 4E was shown to be involved in resistance against several potyviruses in plants, including pea. We combined our knowledge of pea germplasm diversity with that of the eIF4E gene to identify novel genetic diversity.
Germplasm of 2803 pea accessions was screened for eIF4E intron 3 length polymorphism, resulting in the detection of four eIF4EA-B-C-S variants, whose distribution was geographically structured. The eIF4EA variant conferring resistance to the P1 PSbMV pathotype was found in 53 accessions (1.9%), of which 15 were landraces from India, Afghanistan, Nepal, and 7 were from Ethiopia. A newly discovered variant, eIF4EB, was present in 328 accessions (11.7%) from Ethiopia (29%), Afghanistan (23%), India (20%), Israel (25%) and China (39%). The eIF4EC variant was detected in 91 accessions (3.2% of total) from India (20%), Afghanistan (33%), the Iberian Peninsula (22%) and the Balkans (9.3%). The eIF4ES variant for susceptibility predominated as the wild type. Sequencing of 73 samples, identified 34 alleles at the whole gene, 26 at cDNA and 19 protein variants, respectively. Fifteen alleles were virologically tested and 9 alleles (eIF4EA-1-2-3-4-5-6-7, eIF4EB-1, eIF4EC-2) conferred resistance to the P1 PSbMV pathotype.
This work identified novel eIF4E alleles within geographically structured pea germplasm and indicated their independent evolution from the susceptible eIF4ES1 allele. Despite high variation present in wild Pisum accessions, none of them possessed resistance alleles, supporting a hypothesis of distinct mode of evolution of resistance in wild as opposed to crop species. The Highlands of Central Asia, the northern regions of the Indian subcontinent, Eastern Africa and China were identified as important centers of pea diversity that correspond with the diversity of the pathogen. The series of alleles identified in this study provides the basis to study the co-evolution of potyviruses and the pea host.
Women with triple-negative breast cancer have the worst prognosis, frequently present with metastatic tumors and have few targeted therapy options. Notch-1 and Notch-4 are potent breast oncogenes that are overexpressed in triple-negative and other subtypes of breast cancer. PEA3, an ETS transcription factor, is also overexpressed in triple-negative and other breast cancer subtypes. We investigated whether PEA3 could be the critical transcriptional activator of Notch receptors in MDA-MB-231 and other breast cancer cells.
Real-time PCR and Western blot analysis were performed to detect Notch-1, Notch-2, Notch-3 and Notch-4 receptor expression in breast cancer cells when PEA3 was knocked down by siRNA. Chromatin immunoprecipitation was performed to identify promoter regions for Notch genes that recruited PEA3. TAM-67 and c-Jun siRNA were used to identify that c-Jun was necessary for PEA3 enrichment on the Notch-4 promoter. A Notch-4 luciferase reporter was used to confirm that endogenous PEA3 or AP-1 activated the Notch-4 promoter region. Cell cycle analysis, trypan blue exclusion, annexin V flow cytometry, colony formation assay and an in vivo xenograft study were performed to determine the biological significance of targeting PEA3 via siRNA, Notch signaling via a γ-secretase inhibitor, or both.
Herein we provide new evidence for transcriptional regulation of Notch by PEA3 in breast cancer. PEA3 activates Notch-1 transcription in MCF-7, MDA-MB-231 and SKBr3 breast cancer cells. PEA3 activates Notch-4 transcription in MDA-MB-231 cells where PEA3 levels are endogenously high. In SKBr3 and BT474 breast cancer cells where PEA3 levels are low, overexpression of PEA3 increases Notch-4 transcripts. Chromatin immunoprecipitation confirmed the enrichment of PEA3 on Notch-1 and Notch-4 promoters in MDA-MB-231 cells. PEA3 recruitment to Notch-1 was AP-1-independent, whereas PEA3 recruitment to Notch-4 was c-JUN-dependent. Importantly, the combined inhibition of Notch signaling via a γ-secretase inhibitor (MRK-003 GSI) and knockdown of PEA3 arrested growth in the G1 phase, decreased both anchorage-dependent and anchorage-independent growth and significantly increased apoptotic cells in vitro. Moreover, either PEA3 knockdown or MRK-003 GSI treatment significantly reduced tumor growth of MDA-MB-231 xenografts in vivo.
Taken together, the results from this study demonstrate for the first time that Notch-1 and Notch-4 are novel transcriptional targets of PEA3 in breast cancer cells. Targeting of PEA3 and/or Notch pathways might provide a new therapeutic strategy for triple-negative and possibly other breast cancer subtypes.
Cowpea [Vigna unguiculata (L.) Walp] exhibits a considerable variation in leaf shape. Although cowpea is mostly utilized as a dry grain and animal fodder crop, cowpea leaves are also used as a high-protein pot herb in many countries of Africa.
Leaf morphology was studied in the cowpea RIL population, Sanzi (sub-globose leaf shape) x Vita 7 (hastate leaf shape). A QTL for leaf shape, Hls (hastate leaf shape), was identified on the Sanzi x Vita 7 genetic map spanning from 56.54 cM to 67.54 cM distance on linkage group 15. SNP marker 1_0910 was the most significant over the two experiments, accounting for 74.7% phenotypic variance (LOD 33.82) in a greenhouse experiment and 71.5% phenotypic variance (LOD 30.89) in a field experiment. The corresponding Hls locus was positioned on the cowpea consensus genetic map on linkage group 4, spanning from 25.57 to 35.96 cM. A marker-trait association of the Hls region identified SNP marker 1_0349 alleles co-segregating with either the hastate or sub-globose leaf phenotype. High co-linearity was observed for the syntenic Hls region in Medicago truncatula and Glycine max. One syntenic locus for Hls was identified on Medicago chromosome 7 while syntenic regions for Hls were identified on two soybean chromosomes, 3 and 19. In all three syntenic loci, an ortholog for the EZA1/SWINGER (AT4G02020.1) gene was observed and is the candidate gene for the Hls locus. The Hls locus was identified on the cowpea physical map via SNP markers 1_0910, 1_1013 and 1_0992 which were identified in three BAC contigs; contig926, contig821 and contig25.
This study has demonstrated how integrated genomic resources can be utilized for a candidate gene approach. Identification of genes which control leaf morphology may be utilized to improve the quality of cowpea leaves for vegetable and or forage markets as well as contribute to more fundamental research understanding the control of leaf shape in legumes.
QTL analysis; Leaf morphology; Genomics; Genetics; Physical map; Synteny; Candidate genes; Cowpea; Legumes; EZA1/SWINGER
Lentil (Lens culinaris Medik.) is a global food crop with increasing importance for food security in south Asia and other regions. Lens ervoides, a wild relative of cultivated lentil, is an important source of agronomic trait variation. Lens is a member of the galegoid clade of the Papilionoideae family, which includes other important dietary legumes such as chickpea (Cicer arietinum) and pea (Pisum sativum), and the sequenced model legume Medicago truncatula. Understanding the genetic structure of Lens spp. in relation to more fully sequenced legumes would allow leveraging of genomic resources. A set of 1107 TOG-based amplicons were identified in L. ervoides and a subset thereof used to design SNP markers for mapping. A map of L. ervoides consisting of 377 SNP markers spread across seven linkage groups was developed using a GoldenGate genotyping array and single SNP marker assays. Comparison with maps of M. truncatula and L. culinaris documented considerable shared synteny and led to the identification of a few major translocations and a major inversion that distinguish Lens from M. truncatula, as well as a translocation that distinguishes L. culinaris from L. ervoides. The identification of chromosome-level differences among Lens spp. will aid in the understanding of introgression of genes from L. ervoides into cultivated L. culinaris, furthering genetic research and breeding applications in lentil.
wild lentil; legume tentative orthologs; mapping; translocation; synteny; Medicago
Extraordinary size variation of higher plant nuclear genomes is in large part caused by differences in accumulation of repetitive DNA. This makes repetitive DNA of great interest for studying the molecular mechanisms shaping architecture and function of complex plant genomes. However, due to methodological constraints of conventional cloning and sequencing, a global description of repeat composition is available for only a very limited number of higher plants. In order to provide further data required for investigating evolutionary patterns of repeated DNA within and between species, we used a novel approach based on massive parallel sequencing which allowed a comprehensive repeat characterization in our model species, garden pea (Pisum sativum).
Analysis of 33.3 Mb sequence data resulted in quantification and partial sequence reconstruction of major repeat families occurring in the pea genome with at least thousands of copies. Our results showed that the pea genome is dominated by LTR-retrotransposons, estimated at 140,000 copies/1C. Ty3/gypsy elements are less diverse and accumulated to higher copy numbers than Ty1/copia. This is in part due to a large population of Ogre-like retrotransposons which alone make up over 20% of the genome. In addition to numerous types of mobile elements, we have discovered a set of novel satellite repeats and two additional variants of telomeric sequences. Comparative genome analysis revealed that there are only a few repeat sequences conserved between pea and soybean genomes. On the other hand, all major families of pea mobile elements are well represented in M. truncatula.
We have demonstrated that even in a species with a relatively large genome like pea, where a single 454-sequencing run provided only 0.77% coverage, the generated sequences were sufficient to reconstruct and analyze major repeat families corresponding to a total of 35–48% of the genome. These data provide a starting point for further investigations of legume plant genomes based on their global comparative analysis and for the development of more sophisticated approaches for data mining.
Studying the genetics of host shifts and range expansions in phytophagous insects contributes to our understanding of the evolution of host plant adaptation. We investigated the recent host range expansion to pea, in the pea-adapted strain (P-strain) of the crucifer-specialist diamondback moth, Plutella xylostella (Lepidoptera: Plutellidae). Larval survivorship on the novel host plant pea and a typical crucifer host (kale) was measured in reciprocal F1, F2 and backcrosses between the P-strain and a strain reared only on crucifers (C-strain). Reciprocal F1 hybrids differed: offspring from P-strain mothers survived better on pea, indicating a maternal effect. However, no evidence for sex-linkage was found. Backcrosses to the P-strain produced higher survivorship on pea than C-strain backcrosses, suggesting recessive inheritance. In a linkage analysis with amplified fragment length polymorphism markers using P-strain backcrosses, two, four and five linkage groups contributing to survival on pea were identified in three different families respectively, indicating oligogenic inheritance. Thus, the newly evolved ability to survive on pea has a complex genetic basis, and the P-strain is still genetically heterogeneous and not yet fixed for all the alleles enabling it to survive on pea. Survivorship on kale was variable, but not related to survivorship on pea. This pattern may characterize the genetic inheritance of early host plant adaptation in oligophagous insect species.
host range expansion; diamondback moth; AFLP analysis; linkage mapping; inheritance
Legumes, as protein-rich crops, are widely used for human food, animal feed and vegetable oil production. Over the past decade, two legume species, Medicago truncatula and Lotus japonicus, have been adopted as model legumes for genomics and physiological studies. The tobacco transposable element, Tnt1, is a powerful tool for insertional mutagenesis and gene inactivation in plants. A large collection of Tnt1-tagged lines of M. truncatula cv. Jemalong was generated during the course of the project ‘GLIP’: Grain Legumes Integrated Project, funded by the European Union (www.eugrainlegumes.org). In the project ‘IFCOSMO’: Integrated Functional and COmparative genomics Studies on the MOdel Legumes Medicago truncatula and Lotus japonicus, supported by a grant from the Ministry of Education, Youth and Science, Bulgaria, these lines are used for development of functional genomics platform of legumes in Bulgaria. This review presents recent advances in the evaluation of the M. truncatula Tnt1 mutant collection and outlines the steps that are taken in using the Tnt1-tagging for generation of a mutant collection of the second model legume L. japonicus. Both collections will provide a number of legume-specific mutants and serve as a resource for functional and comparative genomics research on legumes. Genomics technologies are expected to advance genetics and breeding of important legume crops (pea, faba bean, alfalfa and clover) in Bulgaria and worldwide.
Insertional mutagenesis; legume genomics; Medicago truncatula; Lotus japonicus; phenotyping; Tnt1 mutants.
White clover (Trifolium repens L.) is a temperate forage legume with an allotetraploid genome (2n=4×=32) estimated at 1093 Mb. Several linkage maps of various sizes, marker sources and completeness are available, however, no integrated map and marker set has explored consistency of linkage analysis among unrelated mapping populations. Such integrative analysis requires tools for homoeologue matching among populations. Development of these tools provides for a consistent framework map of the white clover genome, and facilitates in silico alignment with the model forage legume, Medicago truncatula.
This is the first report of integration of independent linkage maps in white clover, and adds to the literature on methyl filtered GeneThresher®-derived microsatellite (simple sequence repeat; SSR) markers for linkage mapping. Gene-targeted SSR markers were discovered in a GeneThresher® (TrGT) methyl-filtered database of 364,539 sequences, which yielded 15,647 SSR arrays. Primers were designed for 4,038 arrays and of these, 465 TrGT-SSR markers were used for parental consensus genetic linkage analysis in an F1 mapping population (MP2). This was merged with an EST-SSR consensus genetic map of an independent population (MP1), using markers to match homoeologues and develop a multi-population integrated map of the white clover genome. This integrated map (IM) includes 1109 loci based on 804 SSRs over 1274 cM, covering 97% of the genome at a moderate density of one locus per 1.2 cM. Eighteen candidate genes and one morphological marker were also placed on the IM. Despite being derived from disparate populations and marker sources, the component maps and the derived IM had consistent representations of the white clover genome for marker order and genetic length. In silico analysis at an E-value threshold of 1e-20 revealed substantial co-linearity with the Medicago truncatula genome, and indicates a translocation between T. repens groups 2 and 6 relative to M. truncatula.
This integrated genetic linkage analysis provides a consistent and comprehensive linkage analysis of the white clover genome, with alignment to a model forage legume. Associated marker locus information, particularly the homoeologue-specific markers, offers a new resource for forage legume research to enable genetic analysis and improvement of this forage and grassland species.
Legume; Trifolium repens; Linkage; Microsatellite; SSR; GeneThresher®; Allotetraploid; Homoeologue; Medicago truncatula; Synteny
The garden pea, Pisum sativum, is among the best-investigated legume plants and of significant agro-commercial relevance. Pisum sativum has a large and complex genome and accordingly few comprehensive genomic resources exist.
We analyzed the pea transcriptome at the highest possible amount of accuracy by current technology. We used next generation sequencing with the Roche/454 platform and evaluated and compared a variety of approaches, including diverse tissue libraries, normalization, alternative sequencing technologies, saturation estimation and diverse assembly strategies. We generated libraries from flowers, leaves, cotyledons, epi- and hypocotyl, and etiolated and light treated etiolated seedlings, comprising a total of 450 megabases. Libraries were assembled into 324,428 unigenes in a first pass assembly.
A second pass assembly reduced the amount to 81,449 unigenes but caused a significant number of chimeras. Analyses of the assemblies identified the assembly step as a major possibility for improvement. By recording frequencies of Arabidopsis orthologs hit by randomly drawn reads and fitting parameters of the saturation curve we concluded that sequencing was exhaustive. For leaf libraries we found normalization allows partial recovery of expression strength aside the desired effect of increased coverage. Based on theoretical and biological considerations we concluded that the sequence reads in the database tagged the vast majority of transcripts in the aerial tissues. A pathway representation analysis showed the merits of sampling multiple aerial tissues to increase the number of tagged genes. All results have been made available as a fully annotated database in fasta format.
We conclude that the approach taken resulted in a high quality - dataset which serves well as a first comprehensive reference set for the model legume pea. We suggest future deep sequencing transcriptome projects of species lacking a genomics backbone will need to concentrate mainly on resolving the issues of redundancy and paralogy during transcriptome assembly.
Recent genome sequencing enables mega-base scale comparisons between related genomes. Comparisons between animals, plants, fungi, and bacteria demonstrate extensive synteny tempered by rearrangements. Within the legume plant family, glimpses of synteny have also been observed. Characterizing syntenic relationships in legumes is important in transferring knowledge from model legumes to crops that are important sources of protein, fixed nitrogen, and health-promoting compounds.
We have uncovered two large soybean regions exhibiting synteny with M. truncatula and with a network of segmentally duplicated regions in Arabidopsis. In all, syntenic regions comprise over 500 predicted genes spanning 3 Mb. Up to 75% of soybean genes are colinear with M. truncatula, including one region in which 33 of 35 soybean predicted genes with database support are colinear to M. truncatula. In some regions, 60% of soybean genes share colinearity with a network of A. thaliana duplications. One region is especially interesting because this 500 kbp segment of soybean is syntenic to two paralogous regions in M. truncatula on different chromosomes. Phylogenetic analysis of individual genes within these regions demonstrates that one is orthologous to the soybean region, with which it also shows substantially denser synteny and significantly lower levels of synonymous nucleotide substitutions. The other M. truncatula region is inferred to be paralogous, presumably resulting from a duplication event preceding speciation.
The presence of well-defined M. truncatula segments showing orthologous and paralogous relationships with soybean allows us to explore the evolution of contiguous genomic regions in the context of ancient genome duplication and speciation events.
Medicago truncatula is a model legume forage crop native to the arid and semi-arid environments of the Mediterranean. Given its drought-adapted nature, it is an ideal candidate to study the molecular and biochemical mechanisms conferring drought resistance in plants. Medicago plants were subjected to a progressive drought stress over 14 d of water withholding followed by rewatering under controlled environmental conditions. Based on physiological measurements of plant water status and changes in morphology, plants experienced mild, moderate and severe water stress before rehydration. Transcriptome analysis of roots and shoots from control, mildly, moderately and severely stressed, and rewatered plants, identified many thousands of genes that were altered in expression in response to drought. Many genes with expression tightly coupled to the plant water potential (i.e. drought intensity) were identified suggesting an involvement in Medicago drought adaptation responses. Metabolite profiling of drought-stressed plants revealed the presence of 135 polar and 165 non-polar compounds in roots and shoots. Combining Medicago metabolomic data with transcriptomic data yielded insight into the regulation of metabolic pathways operating under drought stress. Among the metabolites detected in drought-stressed Medicago plants, myo-inositol and proline had striking regulatory profiles indicating involvement in Medicago drought tolerance.
Global transcriptional and metabolic responses to drought and rewatering were investigated in Medicago truncatula, a naturally drought-adapted model legume species. Integration of metabolomic and transcriptomic data yielded insights into the regulation of metabolic pathways underlying drought-stress adaptation. Many genes and metabolites with expression tightly coupled to drought intensity were identified, suggesting active involvement in Medicago drought resistance. These could prove useful targets for future translational approaches to improve closely related crop plants such as common bean, soybean and pea.
legumes; drought stress; resistance; tolerance; microarray; gene profiling; transcriptomics; metabolomics; proline; ABA