SSR markers have become common place for plant genetics and breeding applications. Despite the fact that hundreds of SSR markers have been identified and tested in chickpea (Hüttel et al.
1999; Sethy et al.
2006a,
2006b; Winter et al.
1999; Lichtenzveig et al.
2005), the narrow genetic background of cultivated chickpea germplasm has limited their application, and thus there exists a need to develop a larger set of novel genetic markers. With the objective of enriching the marker repertoire of chickpea, we have contributed novel SSR markers derived from a genomic library enriched for GA and TAA repeat motifs and a set of gene-based SNP markers. The basis of our marker discovery work was
C. arietinum genotype ICC 4958, which is being used as a reference genotype for genomic and genetic resource by the chickpea community.
In the present study, 65.4% of hybridizing genomic clones in our SSR-enriched library yielded 643 SSRs. This rate of SSR recovery is comparable with previous studies, for example in peanut where 68% of hybridizing clones yielded SSRs (Cuc et al.
2008). Moreover, the relatively high abundance of tri- and di-nucleotide repeats that we observed is consistent with previous studies in chickpea (Hüttel et al.
1999; Lichtenzveig et al.
2005; Winter et al.
1999). Among the SSRs identified here, the most common SSR motifs were TAA/ATT repeats and GA/CT repeats. This result reflects the fact that our enrichment targeted TAA and GA motifs, and it is consistent with previous studies in chickpea (Hüttel et al.
1999; Lichtenzveig et al.
2005; Winter et al.
1999), other legume species (Akkaya et al.
1992; Cregan et al.
1994; Mun et al.
2006), and even in cereal species (Varshney et al.
2002; Jayashree et al.
2006).
Temnykh et al. (
2001) developed a scheme to classify SSRs according to length, in which Class I and Class II SSRs are greater than or less than 20 bp, respectively. This division based on sequence length has practical utility, because Class I SSRs are generally more polymorphic and thus more desirable as genetic markers. The majority of SSRs isolated from our SSR-enriched library belong to Class II, though as expected the Class I SSRs had higher rates of polymorphism. A useful measure of polymorphic potential for any genetic marker is its polymorphism information content value, or PIC value. PIC values provide information on the probability that a given marker will be polymorphic between any two individuals in a population, and thus are a function both of allele frequencies and allele number. Screening of the ICCM-series markers on 48 genotypes revealed that average PIC value of SSR markers having Class I repeats (0.38) was higher than that of Class II repeats (0.22). The majority of the Class I repeats were tri-nucleotide repeats, consistent with the known utility of tri-nucleotide repeats as genetic markers in plants (Varshney et al.
2005).
Polymorphic information content value was also analyzed in relation to repeat unit type and length. Among di-, tri-, and tetra-nucleotide repeats, tri-nucleotide repeats showed higher polymorphism (average PIC = 0.33) with average allele number of 5.7 per marker. Markers with mono-nucleotide repeats showed the least polymorphism (average PIC = 0.197). Relatively longer repeats appear to have contributed to the higher level of polymorphism as compared to di-nucleotide repeats (Gupta and Varshney
2000). It was also observed that among tri-nucleotide SSRs, the SSR markers based on (TAA/TTA) repeat motifs displayed higher polymorphism (average PIC = 0.35) with an average allele number of 6.12 per marker. Similarly, among di-nucleotide repeats SSR markers based on TA/AT repeat motifs had a higher average PIC value (0.27) compared to others with an average of 6.1 alleles. In fact, the earlier studies in chickpea also revealed the abundance of TAA/TTA (tri-nucleotide) and TA/GA (di-nucleotide) SSR motifs and the extensive polymorphism found with markers containing these repeat motifs (Hüttel et al.
1999; Lichtenzveig et al.
2005). PIC values of compound SSRs (average PIC = 0.29) were comparable with tri-nucleotide repeats with 5.68 alleles per marker. This can be attributed to the fact that the markers with compound SSRs have more than one SSR motif, which increases their chances to be polymorphic markers.
We assessed the potential identity of SSR-related sequences by performing BLAST analyses versus plant EST data sets, and based on GeneOntology analysis through UniProt. Less than one-third of the SSR-associated GSS sequences had significant hits in these databases, though were hits were recorded the derived annotations add a potentially useful data type to the marker metadata. Not surprisingly, chickpea GSS sequences (from which the SSRs were derived) had higher similarity to ESTs from other legume species, and overall higher similarity to dicot outgroups (i.e., poplar and Arabidopsis) than to monocot (i.e., rice) data sets.
Comprehensive genetic map of chickpea
An inter-specific mapping population derived from ICC 4958 (
C. arietinum) and PI 489777 (
C. reticulatum) was used to incorporate novel microsatellite and gene based markers. This mapping population has been widely used in past by chickpea community in order to incorporate several hundred microsatellite markers (Winter et al.
2000) and gene-based markers (Pfaff and Kahl
2003). The diverse genetic background of the parents provides for higher rates of polymorphism not only at the genetic level but also at phenotypic levels such as resistance to
Fusarium wilt (Winter et al.
2000) and
Ascochyta blight (Rakshit et al.
2003), facilitating trait mapping. Therefore, this population is generally considered as the international reference mapping population.
The present genetic map of chickpea represents 521 marker loci, spanning 2,602 cM with an average inter-marker distance of 4.99 cM. The order of common marker loci defined in present map agrees with earlier reports from Winter et al. (
2000). However, the current map differs considerably from that of Winter et al. (
2000) in having eight linkage groups, in agreement with eight chromosomes, whereas the Winter et al. (
2000) map was composed of 16 linkage groups. There are probably at least two factors that contribute to this condensation of linkage groups: first, the new markers identified in the present study act as bridge points between the Winter et al. linkage groups, and second, essentially all of the markers mapped in the current study behave a co-dominant genetic features, which adds considerable power to the genetic evaluation compared to a high fraction of dominant markers in earlier studies. Importantly, the comparative analyses to
Medicago support a simple assignment of eight chickpea linkage groups to eight chromosomes.
Comparative mapping of chickpea and Medicago
Mappig of the gene-based markers from
Medicago in the genetic map of chickpea showed not only a high level of macrosynteny but also revealed features of structural divergence between the two genomes. Six of the eight linkage groups display a one-to-one correspondence between the
Medicago and chickpea, suggesting that these linkage groups reflect the genome of the common Galegoid clade legume ancestor.
Medicago LG5 and LG6, and chickpea LG2 and LG4, appear to have a more complicated ancestry, consisting of a minimum of several chromosomal translocation events. Thus, Mt-LG 5 is essentially a composite of portions of LG2 and LG8 of chickpea. Several research groups have compared genome structure between
Medicago and various crop legumes (see Zhu et al.
2005). Our current results extend the comparative network to include chickpea, by demonstrating broad conservation of genome macrostructure between chickpea and
Medicago.
One goal of comparative genetic analyses is to transfer information from well-characterized reference species to less well-characterized crops with an eye toward crop improvement. Among the agronomic targets in chickpea is resistance to several economically important pathogens; candidate genes for disease resistance are the conserved family of NBS-LRR resistance gene homologs (RGH). Several phylogenetically distinct RGH classes have been placed on the genetic map of chickpea (Huettel et al.
2002), thus facilitating the comparative genome analysis between chickpea and
Medicago. In particular, we have documented two cases of syntenic NBS-LRR clusters that contain co-phyletic genes in each species. Interestingly, Ca-LG2 is known to harbor active resistance genes against
Fusarium wilt and
Ascochyta blight. At present, no RGHs have been reported mapped close to these resistance phenotypes. Nevertheless, the facts that a single conserved gene (TC87369) maps to the top terminal region of both Mt-LG6 and Ca-LG2, and that both linkage groups are rich in NBS-LRR genes and/or active disease resistance genes (Sharma et al.
2004; Zhu et al.
2002), may suggest shared ancestry of Mt-LG6 and Ca-LG2, though such speculation needs to be verified by more detailed study of the respective genome regions.
Similar observations of NBS-LRR synteny have been made for resistance gene homologs within the Solanaceae (Grube et al
.2000) and between
Medicago and pea (
Pisum sativum) (Zhu et al.
2002). However, the limited numbers of comparative molecular markers (gene-based SNPs) around these
R gene regions in both
Medicago and chickpea precludes precise statements regarding the relationship of these genome regions. Although the current analysis is based on a relatively small number of comparative markers, the potential of more detailed analyses to predict gene content and chromosomal structure in chickpea by reference to
Medicago seems clear.