Knowing the extent and structure of genetic variation in germplasm collections is essential for the conservation and utilization of biodiversity in cultivated plants. Cucumber is the fourth most important vegetable crop worldwide and is a model system for other Cucurbitaceae, a family that also includes melon, watermelon, pumpkin and squash. Previous isozyme studies revealed a low genetic diversity in cucumber, but detailed insights into the crop's genetic structure and diversity are largely missing. We have fingerprinted 3,342 accessions from the Chinese, Dutch and U.S. cucumber collections with 23 highly polymorphic Simple Sequence Repeat (SSR) markers evenly distributed in the genome. The data reveal three distinct populations, largely corresponding to three geographic regions. Population 1 corresponds to germplasm from China, except for the unique semi-wild landraces found in Xishuangbanna in Southwest China and East Asia; population 2 to Europe, America, and Central and West Asia; and population 3 to India and Xishuangbanna. Admixtures were also detected, reflecting hybridization and migration events between the populations. The genetic background of the Indian germplasm is heterogeneous, indicating that the Indian cucumbers maintain a large proportion of the genetic diversity and that only a small fraction was introduced to other parts of the world. Subsequently, we defined a core collection consisting of 115 accessions and capturing over 77% of the SSR alleles. Insight into the genetic structure of cucumber will help developing appropriate conservation strategies and provides a basis for population-level genome sequencing in cucumber.
Cucumber, Cucumis sativus L. (2n = 2 × = 14) and melon, C. melo L. (2n = 2 × = 24) are two important vegetable species in the genus Cucumis (family Cucurbitaceae). Both species have an Asian origin that diverged approximately nine million years ago. Cucumber is believed to have evolved from melon through chromosome fusion, but the details of this process are largely unknown. In this study, comparative genetic mapping between cucumber and melon was conducted to examine syntenic relationships of their chromosomes.
Using two melon mapping populations, 154 and 127 cucumber SSR markers were added onto previously reported F2- and RIL-based genetic maps, respectively. A consensus melon linkage map was developed through map integration, which contained 401 co-dominant markers in 12 linkage groups including 199 markers derived from the cucumber genome. Syntenic relationships between melon and cucumber chromosomes were inferred based on associations between markers on the consensus melon map and cucumber draft genome scaffolds. It was determined that cucumber Chromosome 7 was syntenic to melon Chromosome I. Cucumber Chromosomes 2 and 6 each contained genomic regions that were syntenic with melon chromosomes III+V+XI and III+VIII+XI, respectively. Likewise, cucumber Chromosomes 1, 3, 4, and 5 each was syntenic with genomic regions of two melon chromosomes previously designated as II+XII, IV+VI, VII+VIII, and IX+X, respectively. However, the marker orders in several syntenic blocks on these consensus linkage maps were not co-linear suggesting that more complicated structural changes beyond simple chromosome fusion events have occurred during the evolution of cucumber.
Comparative mapping conducted herein supported the hypothesis that cucumber chromosomes may be the result of chromosome fusion from a 24-chromosome progenitor species. Except for a possible inversion, cucumber Chromosome 7 has largely remained intact in the past nine million years since its divergence from melon. Meanwhile, many structural changes may have occurred during the evolution of the remaining six cucumber chromosomes. Further characterization of the genomic nature of Cucumis species closely related to cucumber and melon might provide a better understanding of the evolutionary history leading to modern cucumber.
Cucumber; Melon; Cucumis; Microsatellite; Comparative mapping; Chromosome evolution
Melon (Cucumis melo), an economically important vegetable crop, belongs to the Cucurbitaceae family which includes several other important crops such as watermelon, cucumber, and pumpkin. It has served as a model system for sex determination and vascular biology studies. However, genomic resources currently available for melon are limited.
We constructed eleven full-length enriched and four standard cDNA libraries from fruits, flowers, leaves, roots, cotyledons, and calluses of four different melon genotypes, and generated 71,577 and 22,179 ESTs from full-length enriched and standard cDNA libraries, respectively. These ESTs, together with ~35,000 ESTs available in public domains, were assembled into 24,444 unigenes, which were extensively annotated by comparing their sequences to different protein and functional domain databases, assigning them Gene Ontology (GO) terms, and mapping them onto metabolic pathways. Comparative analysis of melon unigenes and other plant genomes revealed that 75% to 85% of melon unigenes had homologs in other dicot plants, while approximately 70% had homologs in monocot plants. The analysis also identified 6,972 gene families that were conserved across dicot and monocot plants, and 181, 1,192, and 220 gene families specific to fleshy fruit-bearing plants, the Cucurbitaceae family, and melon, respectively. Digital expression analysis identified a total of 175 tissue-specific genes, which provides a valuable gene sequence resource for future genomics and functional studies. Furthermore, we identified 4,068 simple sequence repeats (SSRs) and 3,073 single nucleotide polymorphisms (SNPs) in the melon EST collection. Finally, we obtained a total of 1,382 melon full-length transcripts through the analysis of full-length enriched cDNA clones that were sequenced from both ends. Analysis of these full-length transcripts indicated that sizes of melon 5' and 3' UTRs were similar to those of tomato, but longer than many other dicot plants. Codon usages of melon full-length transcripts were largely similar to those of Arabidopsis coding sequences.
The collection of melon ESTs generated from full-length enriched and standard cDNA libraries is expected to play significant roles in annotating the melon genome. The ESTs and associated analysis results will be useful resources for gene discovery, functional analysis, marker-assisted breeding of melon and closely related species, comparative genomic studies and for gaining insights into gene expression patterns.
The melon belongs to the Cucurbitaceae family, whose economic importance among vegetable crops is second only to Solanaceae. The melon has a small genome size (454 Mb), which makes it suitable for molecular and genetic studies. Despite similar nuclear and chloroplast genome sizes, cucurbits show great variation when their mitochondrial genomes are compared. The melon possesses the largest plant mitochondrial genome, as much as eight times larger than that of other cucurbits.
The nucleotide sequences of the melon chloroplast and mitochondrial genomes were determined. The chloroplast genome (156,017 bp) included 132 genes, with 98 single-copy genes dispersed between the small (SSC) and large (LSC) single-copy regions and 17 duplicated genes in the inverted repeat regions (IRa and IRb). A comparison of the cucumber and melon chloroplast genomes showed differences in only approximately 5% of nucleotides, mainly due to short indels and SNPs. Additionally, 2.74 Mb of mitochondrial sequence, accounting for 95% of the estimated mitochondrial genome size, were assembled into five scaffolds and four additional unscaffolded contigs. An 84% of the mitochondrial genome is contained in a single scaffold. The gene-coding region accounted for 1.7% (45,926 bp) of the total sequence, including 51 protein-coding genes, 4 conserved ORFs, 3 rRNA genes and 24 tRNA genes. Despite the differences observed in the mitochondrial genome sizes of cucurbit species, Citrullus lanatus (379 kb), Cucurbita pepo (983 kb) and Cucumis melo (2,740 kb) share 120 kb of sequence, including the predicted protein-coding regions. Nevertheless, melon contained a high number of repetitive sequences and a high content of DNA of nuclear origin, which represented 42% and 47% of the total sequence, respectively.
Whereas the size and gene organisation of chloroplast genomes are similar among the cucurbit species, mitochondrial genomes show a wide variety of sizes, with a non-conserved structure both in gene number and organisation, as well as in the features of the noncoding DNA. The transfer of nuclear DNA to the melon mitochondrial genome and the high proportion of repetitive DNA appear to explain the size of the largest mitochondrial genome reported so far.
Integration of molecular, genetic and cytological maps is still a challenge for most plant species. Recent progress in molecular and cytogenetic studies created a basis for developing integrated maps in cucumber (Cucumis sativus L.).
In this study, eleven fosmid clones and three plasmids containing 45S rDNA, the centromeric satellite repeat Type III and the pericentriomeric repeat CsRP1 sequences respectively were hybridized to cucumber metaphase chromosomes to assign their cytological location on chromosome 2. Moreover, an integrated molecular cytogenetic map of cucumber chromosomes 2 was constructed by fluorescence in situ hybridization (FISH) mapping of 11 fosmid clones together with the cucumber centromere-specific Type III sequence on meiotic pachytene chromosomes. The cytogenetic map was fully integrated with genetic linkage map since each fosmid clone was anchored by a genetically mapped simple sequence repeat marker (SSR). The relationship between the genetic and physical distances along chromosome was analyzed.
Recombination was not evenly distributed along the physical length of chromosome 2. Suppression of recombination was found in centromeric and pericentromeric regions. Our results also indicated that the molecular markers composing the linkage map for chromosome 2 provided excellent coverage of the chromosome.
There are few genomic tools available in melon (Cucumis melo L.), a member of the Cucurbitaceae, despite its importance as a crop. Among these tools, genetic maps have been constructed mainly using marker types such as simple sequence repeats (SSR), restriction fragment length polymorphisms (RFLP) and amplified fragment length polymorphisms (AFLP) in different mapping populations. There is a growing need for saturating the genetic map with single nucleotide polymorphisms (SNP), more amenable for high throughput analysis, especially if these markers are located in gene coding regions, to provide functional markers. Expressed sequence tags (ESTs) from melon are available in public databases, and resequencing ESTs or validating SNPs detected in silico are excellent ways to discover SNPs.
EST-based SNPs were discovered after resequencing ESTs between the parental lines of the PI 161375 (SC) × 'Piel de sapo' (PS) genetic map or using in silico SNP information from EST databases. In total 200 EST-based SNPs were mapped in the melon genetic map using a bin-mapping strategy, increasing the map density to 2.35 cM/marker. A subset of 45 SNPs was used to study variation in a panel of 48 melon accessions covering a wide range of the genetic diversity of the species. SNP analysis correctly reflected the genetic relationships compared with other marker systems, being able to distinguish all the accessions and cultivars.
This is the first example of a genetic map in a cucurbit species that includes a major set of SNP markers discovered using ESTs. The PI 161375 × 'Piel de sapo' melon genetic map has around 700 markers, of which more than 500 are gene-based markers (SNP, RFLP and SSR). This genetic map will be a central tool for the construction of the melon physical map, the step prior to sequencing the complete genome. Using the set of SNP markers, it was possible to define the genetic relationships within a collection of forty-eight melon accessions as efficiently as with SSR markers, and these markers may also be useful for cultivar identification in Occidental melon varieties.
Cucumis melo (melon) belongs to the Cucurbitaceae family, whose economic importance among horticulture crops is second only to Solanaceae. Melon has high intra-specific genetic variation, morphologic diversity and a small genome size (450 Mb), which make this species suitable for a great variety of molecular and genetic studies that can lead to the development of tools for breeding varieties of the species. A number of genetic and genomic resources have already been developed, such as several genetic maps and BAC genomic libraries. These tools are essential for the construction of a physical map, a valuable resource for map-based cloning, comparative genomics and assembly of whole genome sequencing data. However, no physical map of any Cucurbitaceae has yet been developed. A project has recently been started to sequence the complete melon genome following a whole-genome shotgun strategy, which makes use of massive sequencing data. A BAC-based melon physical map will be a useful tool to help assemble and refine the draft genome data that is being produced.
A melon physical map was constructed using a 5.7 × BAC library and a genetic map previously developed in our laboratories. High-information-content fingerprinting (HICF) was carried out on 23,040 BAC clones, digesting with five restriction enzymes and SNaPshot labeling, followed by contig assembly with FPC software. The physical map has 1,355 contigs and 441 singletons, with an estimated physical length of 407 Mb (0.9 × coverage of the genome) and the longest contig being 3.2 Mb. The anchoring of 845 BAC clones to 178 genetic markers (100 RFLPs, 76 SNPs and 2 SSRs) also allowed the genetic positioning of 183 physical map contigs/singletons, representing 55 Mb (12%) of the melon genome, to individual chromosomal loci. The melon FPC database is available for download at http://melonomics.upv.es/static/files/public/physical_map/.
Here we report the construction of the first physical map of a Cucurbitaceae species described so far. The physical map was integrated with the genetic map so that a number of physical contigs, representing 12% of the melon genome, could be anchored to known genetic positions. The data presented is already helping to improve the quality of the melon genomic sequence available as a result of a project currently being carried out in Spain, adopting a whole genome shotgun approach based on 454 sequencing data.
Despite the great advances in genomic technology observed in several crop species, the availability of molecular tools such as microsatellite markers has been limited in melon (Cucumis melo L.) and cucurbit species. The development of microsatellite markers will have a major impact on genetic analysis and breeding of melon, especially on the generation of marker saturated genetic maps and implementation of marker assisted breeding programs. Genomic microsatellite enriched libraries can be an efficient alternative for marker development in such species.
Seven hundred clones containing microsatellite sequences from a Tsp-AG/TC microsatellite enriched library were identified and one-hundred and forty-four primer pairs designed and synthesized. When 67 microsatellite markers were tested on a panel of melon and other cucurbit accessions, 65 revealed DNA polymorphisms among the melon accessions. For some cucurbit species, such as Cucumis sativus, up to 50% of the melon microsatellite markers could be readily used for DNA polymophism assessment, representing a significant reduction of marker development costs. A random sample of 25 microsatellite markers was extracted from the new microsatellite marker set and characterized on 40 accessions of melon, generating an allelic frequency database for the species. The average expected heterozygosity was 0.52, varying from 0.45 to 0.70, indicating that a small set of selected markers should be sufficient to solve questions regarding genotype identity and variety protection. Genetic distances based on microsatellite polymorphism were congruent with data obtained from RAPD marker analysis. Mapping analysis was initiated with 55 newly developed markers and most primers showed segregation according to Mendelian expectations. Linkage analysis detected linkage between 56% of the markers, distributed in nine linkage groups.
Genomic library microsatellite enrichment is an efficient procedure for marker development in melon. One-hundred and forty-four new markers were developed from Tsp-AG/TC genomic library. This is the first reported attempt of successfully using enriched library for microsatellite marker development in the species. A sample of the microsatellite markers tested proved efficient for genetic analysis of melon, including genetic distance estimates and identity tests. Linkage analysis indicated that the markers developed are dispersed throughout the genome and should be very useful for genetic analysis of melon.
Cotton, with a large genome, is an important crop throughout the world. A high-density genetic linkage map is the prerequisite for cotton genetics and breeding. A genetic map based on simple polymerase chain reaction markers will be efficient for marker-assisted breeding in cotton, and markers from transcribed sequences have more chance to target genes related to traits. To construct a genome-wide, functional marker-based genetic linkage map in cotton, we isolated and mapped expressed sequence tag-simple sequence repeats (EST-SSRs) from cotton ESTs derived from the A1, D5, (AD)1, and (AD)2 genome.
A total of 3177 new EST-SSRs developed in our laboratory and other newly released SSRs were used to enrich our interspecific BC1 genetic linkage map. A total of 547 loci and 911 loci were obtained from our EST-SSRs and the newly released SSRs, respectively. The 1458 loci together with our previously published data were used to construct an updated genetic linkage map. The final map included 2316 loci on the 26 cotton chromosomes, 4418.9 cM in total length and 1.91 cM in average distance between adjacent markers. To our knowledge, this map is one of the three most dense linkage maps in cotton. Twenty-one segregation distortion regions (SDRs) were found in this map; three segregation distorted chromosomes, Chr02, Chr16, and Chr18, were identified with 99.9% of distorted markers segregating toward the heterozygous allele. Functional analysis of SSR sequences showed that 1633 loci of this map (70.6%) were transcribed loci and 1332 loci (57.5%) were translated loci.
This map lays groundwork for further genetic analyses of important quantitative traits, marker-assisted selection, and genome organization architecture in cotton as well as for comparative genomics between cotton and other species. The segregation distorted chromosomes can be a guide to identify segregation distortion loci in cotton. The annotation of SSR sequences identified frequent and rare gene ontology items on each chromosome, which is helpful to discover functions of cotton chromosomes.
Brassica oleracea encompass a family of vegetables and cabbage that are among the most widely cultivated crops. In 2009, the B. oleracea Genome Sequencing Project was launched using next generation sequencing technology. None of the available maps were detailed enough to anchor the sequence scaffolds for the Genome Sequencing Project. This report describes the development of a large number of SSR and SNP markers from the whole genome shotgun sequence data of B. oleracea, and the construction of a high-density genetic linkage map using a double haploid mapping population.
The B. oleracea high-density genetic linkage map that was constructed includes 1,227 markers in nine linkage groups spanning a total of 1197.9 cM with an average of 0.98 cM between adjacent loci. There were 602 SSR markers and 625 SNP markers on the map. The chromosome with the highest number of markers (186) was C03, and the chromosome with smallest number of markers (99) was C09.
This first high-density map allowed the assembled scaffolds to be anchored to pseudochromosomes. The map also provides useful information for positional cloning, molecular breeding, and integration of information of genes and traits in B. oleracea. All the markers on the map will be transferable and could be used for the construction of other genetic maps.
Cabbage; Brassica; Genetic linkage map; SSR; SNP; Genome
Genetic markers and linkage mapping are basic prerequisites for comparative genetic analyses, QTL detection and map-based cloning. A large number of mapping populations have been developed for oak, but few gene-based markers are available for constructing integrated genetic linkage maps and comparing gene order and QTL location across related species.
We developed a set of 573 expressed sequence tag-derived simple sequence repeats (EST-SSRs) and located 397 markers (EST-SSRs and genomic SSRs) on the 12 oak chromosomes (2n = 2x = 24) on the basis of Mendelian segregation patterns in 5 full-sib mapping pedigrees of two species: Quercus robur (pedunculate oak) and Quercus petraea (sessile oak). Consensus maps for the two species were constructed and aligned. They showed a high degree of macrosynteny between these two sympatric European oaks. We assessed the transferability of EST-SSRs to other Fagaceae genera and a subset of these markers was mapped in Castanea sativa, the European chestnut. Reasonably high levels of macrosynteny were observed between oak and chestnut. We also obtained diversity statistics for a subset of EST-SSRs, to support further population genetic analyses with gene-based markers. Finally, based on the orthologous relationships between the oak, Arabidopsis, grape, poplar, Medicago, and soybean genomes and the paralogous relationships between the 12 oak chromosomes, we propose an evolutionary scenario of the 12 oak chromosomes from the eudicot ancestral karyotype.
This study provides map locations for a large set of EST-SSRs in two oak species of recognized biological importance in natural ecosystems. This first step toward the construction of a gene-based linkage map will facilitate the assignment of future genome scaffolds to pseudo-chromosomes. This study also provides an indication of the potential utility of new gene-based markers for population genetics and comparative mapping within and beyond the Fagaceae.
A number of molecular marker linkage maps have been developed for melon (Cucumis melo L.) over the last two decades. However, these maps were constructed using different marker sets, thus, making comparative analysis among maps difficult. In order to solve this problem, a consensus genetic map in melon was constructed using primarily highly transferable anchor markers that have broad potential use for mapping, synteny, and comparative quantitative trait loci (QTL) analysis, increasing breeding effectiveness and efficiency via marker-assisted selection (MAS).
Under the framework of the International Cucurbit Genomics Initiative (ICuGI, http://www.icugi.org), an integrated genetic map has been constructed by merging data from eight independent mapping experiments using a genetically diverse array of parental lines. The consensus map spans 1150 cM across the 12 melon linkage groups and is composed of 1592 markers (640 SSRs, 330 SNPs, 252 AFLPs, 239 RFLPs, 89 RAPDs, 15 IMAs, 16 indels and 11 morphological traits) with a mean marker density of 0.72 cM/marker. One hundred and ninety-six of these markers (157 SSRs, 32 SNPs, 6 indels and 1 RAPD) were newly developed, mapped or provided by industry representatives as released markers, including 27 SNPs and 5 indels from genes involved in the organic acid metabolism and transport, and 58 EST-SSRs. Additionally, 85 of 822 SSR markers contributed by Syngenta Seeds were included in the integrated map. In addition, 370 QTL controlling 62 traits from 18 previously reported mapping experiments using genetically diverse parental genotypes were also integrated into the consensus map. Some QTL associated with economically important traits detected in separate studies mapped to similar genomic positions. For example, independently identified QTL controlling fruit shape were mapped on similar genomic positions, suggesting that such QTL are possibly responsible for the phenotypic variability observed for this trait in a broad array of melon germplasm.
Even though relatively unsaturated genetic maps in a diverse set of melon market types have been published, the integrated saturated map presented herein should be considered the initial reference map for melon. Most of the mapped markers contained in the reference map are polymorphic in diverse collection of germplasm, and thus are potentially transferrable to a broad array of genetic experimentation (e.g., integration of physical and genetic maps, colinearity analysis, map-based gene cloning, epistasis dissection, and marker-assisted selection).
The genetic differences between mungbean and its presumed wild ancestor were analyzed for domestication related traits by QTL mapping. A genetic linkage map of mungbean was constructed using 430 SSR and EST-SSR markers from mungbean and its related species, and all these markers were mapped onto 11 linkage groups spanning a total of 727.6 cM. The present mungbean map is the first map where the number of linkage groups coincided with the haploid chromosome number of mungbean. In total 105 QTLs and genes for 38 domestication related traits were identified. Compared with the situation in other Vigna crops, many linkage groups have played an important role in the domestication of mungbean. In particular the QTLs with high contribution were distributed on seven out of 11 linkage groups. In addition, a large number of QTLs with small contribution were found. The accumulation of many mutations with large and/or small contribution has contributed to the differentiation between wild and cultivated mungbean. The useful QTLs for seed size, pod dehiscence and pod maturity that have not been found in other Asian Vigna species were identified in mungbean, and these QTLs may play the important role as new gene resources for other Asian Vigna species. The results provide the foundation that will be useful for improvement of mungbean and related legumes.
The objective of these studies was to assess the degree to which bean leaf beetle, Cerotoma trifurcata (Forster), will feed on cucurbits. In 2003, we documented an infestation of C. trifurcata in a commercial pumpkin field near Rosemount, MN, USA. To evaluate C. trifurcata feeding on cucurbits, we conducted laboratory no-choice and choice test feeding studies. In the laboratory, C. trifurcata fed most heavily on cotyledon-stage cucumber plants, followed by pumpkin and squash. With soybean plants present, C. trifurcata still fed on cucumber plants. However, C. trifurcata appeared to prefer soybeans until the quality of the soybean plants was diminished through feeding damage. This is the first known report of C. trifurcata feeding on cucurbits. The pest potential of C. trifurcata in cucurbit cropping systems should be further evaluated.
Cerotoma trifurcata; host plant; squash; pumpkin; cucumber
Soybean is a major crop that is an important source of oil and proteins. A number of genetic linkage maps have been developed in soybean. Specifically, hundreds of simple sequence repeat (SSR) markers have been developed and mapped. Recent sequencing of the soybean genome resulted in the generation of vast amounts of genetic information. The objectives of this investigation were to use SSR markers in developing a connection between genetic and physical maps and to determine the physical distribution of recombination on soybean chromosomes. A total of 2,188 SSRs were used for sequence-based physical localization on soybean chromosomes. Linkage information was used from different maps to create an integrated genetic map. Comparison of the integrated genetic linkage maps and sequence based physical maps revealed that the distal 25% of each chromosome was the most marker-dense, containing an average of 47.4% of the SSR markers and 50.2% of the genes. The proximal 25% of each chromosome contained only 7.4% of the markers and 6.7% of the genes. At the whole genome level, the marker density and gene density showed a high correlation (R2) of 0.64 and 0.83, respectively with the physical distance from the centromere. Recombination followed a similar pattern with comparisons indicating that recombination is high in telomeric regions, though the correlation between crossover frequency and distance from the centromeres is low (R2 = 0.21). Most of the centromeric regions were low in recombination. The crossover frequency for the entire soybean genome was 7.2%, with extremes much higher and lower than average. The number of recombination hotspots varied from 1 to 12 per chromosome. A high correlation of 0.83 between the distribution of SSR markers and genes suggested close association of SSRs with genes. The knowledge of distribution of recombination on chromosomes may be applied in characterizing and targeting genes.
The cultivated strawberry (Fragaria× ananassa) is an octoploid (2n = 8x = 56) of the Rosaceae family whose genomic architecture is still controversial. Several recent studies support the AAA′A′BBB′B′ model, but its complexity has hindered genetic and genomic analysis of this important crop. To overcome this difficulty and to assist genome-wide analysis of F. × ananassa, we constructed an integrated linkage map by organizing a total of 4474 of simple sequence repeat (SSR) markers collected from published Fragaria sequences, including 3746 SSR markers [Fragaria vesca expressed sequence tag (EST)-derived SSR markers] derived from F. vesca ESTs, 603 markers (F. × ananassa EST-derived SSR markers) from F. × ananassa ESTs, and 125 markers (F. × ananassa transcriptome-derived SSR markers) from F. × ananassa transcripts. Along with the previously published SSR markers, these markers were mapped onto five parent-specific linkage maps derived from three mapping populations, which were then assembled into an integrated linkage map. The constructed map consists of 1856 loci in 28 linkage groups (LGs) that total 2364.1 cM in length. Macrosynteny at the chromosome level was observed between the LGs of F. × ananassa and the genome of F. vesca. Variety distinction on 129 F. × ananassa lines was demonstrated using 45 selected SSR markers.
Fragaria × ananassa; SSR marker; integrated linkage map; comparative mapping
The genus Nelumbo Adans. comprises two living species, N. nucifera Gaertan. (Asian lotus) and N. lutea Pers. (American lotus). A genetic linkage map is an essential resource for plant genetic studies and crop improvement but has not been generated for Nelumbo. We aimed to develop genomic simple sequence repeat (SSR) markers from the genome sequence and construct two genetic maps for Nelumbo to assist genome assembly and integration of a genetic map with the genome sequence.
A total of 86,089 SSR motifs were identified from the genome sequences. Di- and tri-nucleotide repeat motifs were the most abundant, and accounted for 60.73% and 31.66% of all SSRs, respectively. AG/GA repeats constituted 51.17% of dinucleotide repeat motifs, followed by AT/TA (44.29%). Of 500 SSR primers tested, 386 (77.20%) produced scorable alleles with an average of 2.59 per primer, and 185 (37.00%) showed polymorphism among two parental genotypes, N. nucifera ‘Chinese Antique’ and N. lutea ‘AL1’, and six progenies of their F1 population. The normally segregating markers, which comprised 268 newly developed SSRs, 37 previously published SSRs and 53 sequence-related amplified polymorphism markers, were used for genetic map construction. The map for Asian lotus was 365.67 cM with 47 markers distributed in seven linkage groups. The map for American lotus was 524.51 cM, and contained 177 markers distributed in 11 genetic linkage groups. The number of markers per linkage group ranged from three to 34 with an average genetic distance of 3.97 cM between adjacent markers. Moreover, 171 SSR markers contained in linkage groups were anchored to 97 genomic DNA sequence contigs of ‘Chinese Antique’. The 97 contigs were merged into 60 scaffolds.
Genetic mapping of SSR markers derived from sequenced contigs in Nelumbo enabled the associated contigs to be anchored in the linkage map and facilitated assembly of the genome sequences of ‘Chinese Antique’. The present study reports the first construction of genetic linkage maps for Nelumbo, which can serve as reference linkage maps to accelerate characterization germplasm, genetic mapping for traits of economic interest, and molecular breeding with marker-assisted selection.
Genetic linkage map; Genome sequence; Nelumbo; SRAP; SSR
A total of 355 simple sequence repeat (SSR) markers were developed, based on expressed sequence tag (EST) and bacterial artificial chromosome (BAC)-end sequence databases, and successfully used to construct an SSR-based genetic linkage map of the apple. The consensus linkage map spanned 1143 cM, with an average density of 2.5 cM per marker. Newly developed SSR markers along with 279 SSR markers previously published by the HiDRAS project were further used to integrate physical and genetic maps of the apple using a PCR-based BAC library screening approach. A total of 470 contigs were unambiguously anchored onto all 17 linkage groups of the apple genome, and 158 contigs contained two or more molecular markers. The genetically mapped contigs spanned ∼421 Mb in cumulative physical length, representing 60.0% of the genome. The sizes of anchored contigs ranged from 97 kb to 4.0 Mb, with an average of 995 kb. The average physical length of anchored contigs on each linkage group was ∼24.8 Mb, ranging from 17.0 Mb to 37.73 Mb. Using BAC DNA as templates, PCR screening of the BAC library amplified fragments of highly homologous sequences from homoeologous chromosomes. Upon integrating physical and genetic maps of the apple, the presence of not only homoeologous chromosome pairs, but also of multiple locus markers mapped to adjacent sites on the same chromosome was detected. These findings demonstrated the presence of both genome-wide and segmental duplications in the apple genome and provided further insights into the complex polyploid ancestral origin of the apple.
Genetic map; genome duplication; Malus×domestica; physical map; segmental duplication; simple sequence repeat
Cucumber, Cucumis sativus L. is an important vegetable crop worldwide. Until very recently, cucumber genetic and genomic resources, especially molecular markers, have been very limited, impeding progress of cucumber breeding efforts. Microsatellites are short tandemly repeated DNA sequences, which are frequently favored as genetic markers due to their high level of polymorphism and codominant inheritance. Data from previously characterized genomes has shown that these repeats vary in frequency, motif sequence, and genomic location across taxa. During the last year, the genomes of two cucumber genotypes were sequenced including the Chinese fresh market type inbred line '9930' and the North American pickling type inbred line 'Gy14'. These sequences provide a powerful tool for developing markers in a large scale. In this study, we surveyed and characterized the distribution and frequency of perfect microsatellites in 203 Mbp assembled Gy14 DNA sequences, representing 55% of its nuclear genome, and in cucumber EST sequences. Similar analyses were performed in genomic and EST data from seven other plant species, and the results were compared with those of cucumber.
A total of 112,073 perfect repeats were detected in the Gy14 cucumber genome sequence, accounting for 0.9% of the assembled Gy14 genome, with an overall density of 551.9 SSRs/Mbp. While tetranucleotides were the most frequent microsatellites in genomic DNA sequence, dinucleotide repeats, which had more repeat units than any other SSR type, had the highest cumulative sequence length. Coding regions (ESTs) of the cucumber genome had fewer microsatellites compared to its genomic sequence, with trinucleotides predominating in EST sequences. AAG was the most frequent repeat in cucumber ESTs. Overall, AT-rich motifs prevailed in both genomic and EST data. Compared to the other species examined, cucumber genomic sequence had the highest density of SSRs (although comparable to the density of poplar, grapevine and rice), and was richest in AT dinucleotides. Using an electronic PCR strategy, we investigated the polymorphism between 9930 and Gy14 at 1,006 SSR loci, and found unexpectedly high degree of polymorphism (48.3%) between the two genotypes. The level of polymorphism seems to be positively associated with the number of repeat units in the microsatellite. The in silico PCR results were validated empirically in 660 of the 1,006 SSR loci. In addition, primer sequences for more than 83,000 newly-discovered cucumber microsatellites, and their exact positions in the Gy14 genome assembly were made publicly available.
The cucumber genome is rich in microsatellites; AT and AAG are the most abundant repeat motifs in genomic and EST sequences of cucumber, respectively. Considering all the species investigated, some commonalities were noted, especially within the monocot and dicot groups, although the distribution of motifs and the frequency of certain repeats were characteristic of the species examined. The large number of SSR markers developed from this study should be a significant contribution to the cucurbit research community.
• Background The gorgeous frescoes organized by the master Renaissance painter Raphael Sanzio (1483–1520) and illustrating the heavenly adventures of Cupid and Psyche were painted between 1515 and 1518 to decorate the Roman villa (now known as the Villa Farnesina) of the wealthy Sienese banker Agostino Chigi (1466–1520). Surrounding these paintings are festoons of fruits, vegetables and flowers painted by Giovanni Martini da Udine (1487–1564), which include over 170 species of plants. A deconstruction and collation of the cucurbit images in the festoons makes it possible to evaluate the genetic diversity of cucurbits in Renaissance Italy 500 years ago.
• Findings The festoons contain six species of Old World cucurbits, Citrullus lanatus (watermelon), Cucumis melo (melon), Cucumis sativus (cucumber), Ecballium elaterium (squirting cucumber), Lagenaria siceraria (bottle gourd) and Momordica balsamina (balsam apple), and two or three species of New World cucurbits, Cucurbita maxima, C. pepo and, perhaps, C. moschata (pumpkin, squash, gourd). The images of C. maxima are the first illustrations of this species in Europe.
Citrullus lanatus; Cucumis melo; Cucumis sativus; Cucurbita maxima; Cucurbita pepo; Ecballium elaterium; Lagenaria siceraria; Momordica balsamina; Cucurbitaceae; Agostino Chigi; Giovanni Martini da Udine
The construction of genetic linkage maps for cultivated peanut (Arachis hypogaea L.) has and continues to be an important research goal to facilitate quantitative trait locus (QTL) analysis and gene tagging for use in a marker-assisted selection in breeding. Even though a few maps have been developed, they were constructed using diploid or interspecific tetraploid populations. The most recently published intra-specific map was constructed from the cross of cultivated peanuts, in which only 135 simple sequence repeat (SSR) markers were sparsely populated in 22 linkage groups. The more detailed linkage map with sufficient markers is necessary to be feasible for QTL identification and marker-assisted selection. The objective of this study was to construct a genetic linkage map of cultivated peanut using simple sequence repeat (SSR) markers derived primarily from peanut genomic sequences, expressed sequence tags (ESTs), and by "data mining" sequences released in GenBank.
Three recombinant inbred lines (RILs) populations were constructed from three crosses with one common female parental line Yueyou 13, a high yielding Spanish market type. The four parents were screened with 1044 primer pairs designed to amplify SSRs and 901 primer pairs produced clear PCR products. Of the 901 primer pairs, 146, 124 and 64 primer pairs (markers) were polymorphic in these populations, respectively, and used in genotyping these RIL populations. Individual linkage maps were constructed from each of the three populations and a composite map based on 93 common loci were created using JoinMap. The composite linkage maps consist of 22 composite linkage groups (LG) with 175 SSR markers (including 47 SSRs on the published AA genome maps), representing the 20 chromosomes of A. hypogaea. The total composite map length is 885.4 cM, with an average marker density of 5.8 cM. Segregation distortion in the 3 populations was 23.0%, 13.5% and 7.8% of the markers, respectively. These distorted loci tended to cluster on LG1, LG3, LG4 and LG5. There were only 15 EST-SSR markers mapped due to low polymorphism. By comparison, there were potential synteny, collinear order of some markers and conservation of collinear linkage groups among the maps and with the AA genome but not fully conservative.
A composite linkage map was constructed from three individual mapping populations with 175 SSR markers in 22 composite linkage groups. This composite genetic linkage map is among the first "true" tetraploid peanut maps produced. This map also consists of 47 SSRs that have been used in the published AA genome maps, and could be used in comparative mapping studies. The primers described in this study are PCR-based markers, which are easy to share for genetic mapping in peanuts. All 1044 primer pairs are provided as additional files and the three RIL populations will be made available to public upon request for quantitative trait loci (QTL) analysis and linkage map improvement.
Previous loblolly pine (Pinus taeda L.) genetic linkage maps have been based on a variety of DNA polymorphisms, such as AFLPs, RAPDs, RFLPs, and ESTPs, but only a few SSRs (simple sequence repeats), also known as simple tandem repeats or microsatellites, have been mapped in P. taeda. The objective of this study was to integrate a large set of SSR markers from a variety of sources and published cDNA markers into a composite P. taeda genetic map constructed from two reference mapping pedigrees. A dense genetic map that incorporates SSR loci will benefit complete pine genome sequencing, pine population genetics studies, and pine breeding programs. Careful marker annotation using a variety of references further enhances the utility of the integrated SSR map.
The updated P. taeda genetic map, with an estimated genome coverage of 1,515 cM(Kosambi) across 12 linkage groups, incorporated 170 new SSR markers and 290 previously reported SSR, RFLP, and ESTP markers. The average marker interval was 3.1 cM. Of 233 mapped SSR loci, 84 were from cDNA-derived sequences (EST-SSRs) and 149 were from non-transcribed genomic sequences (genomic-SSRs). Of all 311 mapped cDNA-derived markers, 77% were associated with NCBI Pta UniGene clusters, 67% with RefSeq proteins, and 62% with functional Gene Ontology (GO) terms. Duplicate (i.e., redundant accessory) and paralogous markers were tentatively identified by evaluating marker sequences by their UniGene cluster IDs, clone IDs, and relative map positions. The average gene diversity, He, among polymorphic SSR loci, including those that were not mapped, was 0.43 for 94 EST-SSRs and 0.72 for 83 genomic-SSRs. The genetic map can be viewed and queried at http://www.conifergdb.org/pinemap.
Many polymorphic and genetically mapped SSR markers are now available for use in P. taeda population genetics, studies of adaptive traits, and various germplasm management applications. Annotating mapped genes with UniGene clusters and GO terms allowed assessment of redundant and paralogous EST markers and further improved the quality and utility of the genetic map for P. taeda.
MicroRNAs (miRNAs) are a class of non-coding small RNAs involved in post-transcriptional regulation of gene expression critical for plant growth and development, stress responses and other diverse biological processes in plants. The Cucurbitaceae or cucurbit family represents some of economically important species, particularly those with edible and medicinal fruits. Genomic tools for the molecular analysis of members of this family are just emerging. Partial draft genome sequence became available recently for cucumber and watermelon facilitating investigation of the small RNA component of the transcriptomes in cucurbits.
We generated four small RNA libraries from bottle gourd (Lagenaria siceraria), Cucurbita moschata, Cucurbita pepo, and, watermelon (Citrullus lanatus var. lanatus) in order to identify conserved and novel lineage specific miRNAs in these cucurbits. Deep sequencing of small RNA libraries from these species resulted in 1,597,263, 532,948, 601,388, and 493,384 unique sRNA reads from bottle gourd, moschata, pepo and watermelon, respectively. Sequence analysis of these four libraries resulted in identification of 21 miRNA families that are highly conserved and 8 miRNA families that are moderately conserved in diverse dicots. We also identified 4 putative novel miRNAs in these plant species. Furthermore, the tasiRNAs were identified and their biogenesis was determined in these cucurbits. Small RNA blot analysis or q-PCR analyses of leaf and fruit tissues of these cucurbits showed differential expression of several conserved miRNAs. Interestingly, the abundance of several miRNAs in leaves and fruits of closely related C. moschata and C. pepo was also distinctly different. Target genes for the most conserved miRNAs are also predicted.
High-throughput sequencing of small RNA libraries from four cucurbit species has provided a glimpse of small RNA component in their transcriptomes. The analysis also showed considerable variation within four cucurbit species with regards to expression of individual miRNAs.
Sorghum genome mapping based on DNA markers began in the early 1990s and numerous genetic linkage maps of sorghum have been published in the last decade, based initially on RFLP markers with more recent maps including AFLPs and SSRs and very recently, Diversity Array Technology (DArT) markers. It is essential to integrate the rapidly growing body of genetic linkage data produced through DArT with the multiple genetic linkage maps for sorghum generated through other marker technologies. Here, we report on the colinearity of six independent sorghum component maps and on the integration of these component maps into a single reference resource that contains commonly utilized SSRs, AFLPs, and high-throughput DArT markers.
The six component maps were constructed using the MultiPoint software. The lengths of the resulting maps varied between 910 and 1528 cM. The order of the 498 markers that segregated in more than one population was highly consistent between the six individual mapping data sets. The framework consensus map was constructed using a "Neighbours" approach and contained 251 integrated bridge markers on the 10 sorghum chromosomes spanning 1355.4 cM with an average density of one marker every 5.4 cM, and were used for the projection of the remaining markers. In total, the sorghum consensus map consisted of a total of 1997 markers mapped to 2029 unique loci (1190 DArT loci and 839 other loci) spanning 1603.5 cM and with an average marker density of 1 marker/0.79 cM. In addition, 35 multicopy markers were identified. On average, each chromosome on the consensus map contained 203 markers of which 58.6% were DArT markers. Non-random patterns of DNA marker distribution were observed, with some clear marker-dense regions and some marker-rare regions.
The final consensus map has allowed us to map a larger number of markers than possible in any individual map, to obtain a more complete coverage of the sorghum genome and to fill a number of gaps on individual maps. In addition to overall general consistency of marker order across individual component maps, good agreement in overall distances between common marker pairs across the component maps used in this study was determined, using a difference ratio calculation. The obtained consensus map can be used as a reference resource for genetic studies in different genetic backgrounds, in addition to providing a framework for transferring genetic information between different marker technologies and for integrating DArT markers with other genomic resources. DArT markers represent an affordable, high throughput marker system with great utility in molecular breeding programs, especially in crops such as sorghum where SNP arrays are not publicly available.
Melon, Cucumis melo L. is an important vegetable crop worldwide. At present, there are phenomena of homonyms and synonyms present in the melon seed markets of China, which could cause variety authenticity issues influencing the process of melon breeding, production, marketing and other aspects. Molecular markers, especially microsatellites or simple sequence repeats (SSRs) are playing increasingly important roles for cultivar identification. The aim of this study was to construct a DNA fingerprinting database of major melon cultivars, which could provide a possibility for the establishment of a technical standard system for purity and authenticity identification of melon seeds. In this study, to develop the core set SSR markers, 470 polymorphic SSRs were selected as the candidate markers from 1219 SSRs using 20 representative melon varieties (lines). Eighteen SSR markers, evenly distributed across the genome and with the highest contents of polymorphism information (PIC) were identified as the core marker set for melon DNA fingerprinting analysis. Fingerprint codes for 471 melon varieties (lines) were established. There were 51 materials which were classified into17 groups based on sharing the same fingerprint code, while field traits survey results showed that these plants in the same group were synonyms because of the same or similar field characters. Furthermore, DNA fingerprinting quick response (QR) codes of 471 melon varieties (lines) were constructed. Due to its fast readability and large storage capacity, QR coding melon DNA fingerprinting is in favor of read convenience and commercial applications.