We hypothesized that variants in genes expressed as a consequence of interactions between ovarian cancer cells and the host micro-environment could contribute to cancer susceptibility. We therefore used a two-stage approach to evaluate common single nucleotide polymorphisms (SNPs) in 173 genes involved in stromal epithelial interactions in the Ovarian Cancer Association Consortium (OCAC). In the discovery stage, cases with epithelial ovarian cancer (n = 675) and controls (n = 1,162) were genotyped at 1,536 SNPs using an Illumina GoldenGate assay. Based on Positive Predictive Value estimates, three SNPs—PODXL rs1013368, ITGA6 rs13027811, and MMP3 rs522616—were selected for replication using TaqMan genotyping in up to 3,059 serous invasive cases and 8,905 controls from 16 OCAC case-control studies. An additional 18 SNPs with Pper-allele<0.05 in the discovery stage were selected for replication in a subset of five OCAC studies (n = 1,233 serous invasive cases; n = 3,364 controls). The discovery stage associations in PODXL, ITGA6, and MMP3 were attenuated in the larger replication set (adj. Pper-allele≥0.5). However genotypes at TERT rs7726159 were associated with ovarian cancer risk in the smaller, five-study replication study (Pper-allele = 0.03). Combined analysis of the discovery and replication sets for this TERT SNP showed an increased risk of serous ovarian cancer among non-Hispanic whites [adj. ORper-allele 1.14 (1.04–1.24) p = 0.003]. Our study adds to the growing evidence that, like the 8q24 locus, the telomerase reverse transcriptase locus at 5p15.33, is a general cancer susceptibility locus.
In this article, we report the findings from a large-scale analysis of common variation in genes that are expressed as a consequence of interactions between ovarian cancer cells and their host micro-environment that could influence serous ovarian cancer risk. We evaluated 1,302 common variants within or near 173 genes in two large case-control studies from the Ovarian Cancer Association Consortium (OCAC) and selected three variants for further evaluation in sixteen OCAC studies and an additional 18 for evaluation in five OCAC studies. We observed a significantly increased risk of serous ovarian cancer associated with a variant in the telomerase reverse transcriptase (TERT) gene. Although TERT variants have not been previously shown to contribute to ovarian cancer risk, several studies have recently reported associations between TERT variants and other forms of cancer, including gliomas, lung cancer, adenocarcinoma, basal cell carcinoma, prostate cancer, and multiple other cancers. TERT encodes a protein that is essential for the replication and maintenance of chromosomal integrity during cell division. In cancer cells, TERT has been linked to genomic instability and tumour cell proliferation. Further studies are necessary to confirm our findings and to investigate the mechanisms for the observed association.
Genetic variants regulating the host immune system may contribute to the susceptibility for the development of gastric cancer. Little is known about the role of the innate immunity- and non-Hodgkin’s lymphoma (NHL)-related genes for gastric cancer risk. This nested case-control study was conducted to identify candidate genes for gastric cancer risk for future studies.
In the Discovery phase, 3,072 SNPs in 203 innate immunity- and 264 NHL-related genes using the Illumine GoldenGateTM OPA Panel were analyzed in 42 matched case-control sets selected from the Korean Multi-center Cancer Cohort (KMCC). Six significant SNPs in four innate immunity (DEFA6, DEFB1, JAK3, and ACAA1) and 11 SNPs in nine NHL-related genes (INSL3, CHMP7, BCL2L11, TNFRSF8, RAD50, CASP7, CHUK, CD79B, and CLDN9) with a permutated p-value <0.01 were re-genotyped in the Replication phase among 386 cases and 348 controls. Odds ratios (ORs) for gastric cancer risk were estimated adjusting for age, smoking status, and H. pylori and CagA sero-positivity. Summarized ORs in the total study population (428 cases and 390 controls) are presented using pooled- and meta-analyses.
Four SNPs had no heterogeneity across the phases: in the meta-analysis, DEFA6 rs13275170 and DEFB1 rs2738169 had both a 1.3-fold increased odds ratio (OR) for gastric cancer (95% CIs = 1.1–1.6; and 1.1–1.5, respectively). INSL3 rs10421916 and rs11088680 had both a 0.8-fold decreased OR for gastric cancer (95% CIs = 0.7–0.97; and 0.7–0.9, respectively).
Our findings suggest that certain variants in the innate immunity and NHL-related genes affect the gastric cancer risk, perhaps by modulating infection-inflammation-immunity mechanisms that remain to be defined.
The growing accessibility to genomic resources using next-generation sequencing (NGS) technologies has revolutionized the application of molecular genetic tools to ecology and evolutionary studies in non-model organisms. Here we present the case study of the European hake (Merluccius merluccius), one of the most important demersal resources of European fisheries. Two sequencing platforms, the Roche 454 FLX (454) and the Illumina Genome Analyzer (GAII), were used for Single Nucleotide Polymorphisms (SNPs) discovery in the hake muscle transcriptome. De novo transcriptome assembly into unique contigs, annotation, and in silico SNP detection were carried out in parallel for 454 and GAII sequence data. High-throughput genotyping using the Illumina GoldenGate assay was performed for validating 1,536 putative SNPs. Validation results were analysed to compare the performances of 454 and GAII methods and to evaluate the role of several variables (e.g. sequencing depth, intron-exon structure, sequence quality and annotation). Despite well-known differences in sequence length and throughput, the two approaches showed similar assay conversion rates (approximately 43%) and percentages of polymorphic loci (67.5% and 63.3% for GAII and 454, respectively). Both NGS platforms therefore demonstrated to be suitable for large scale identification of SNPs in transcribed regions of non-model species, although the lack of a reference genome profoundly affects the genotyping success rate. The overall efficiency, however, can be improved using strict quality and filtering criteria for SNP selection (sequence quality, intron-exon structure, target region score).
An effective immune response to vaccination is, in part, a complex interaction of alleles of multiple genes regulating cytokine networks. We conducted a genotyping study of Th1/Th2/inflammatory cytokines/cytokine receptors in healthy children (n=738, 11–19 years) to determine associations between individual single-nucleotide polymorphisms (SNPs)/haplotypes and immune outcomes after two doses of rubella vaccine. SNPs (n=501) were selected using the ldSelect-approach and genotyped using Illumina GoldenGate™ and TaqMan assays. Rubella-IgG levels were measured by immunoassay and secreted cytokines by ELISA. Linear regression and post hoc haplotype analyses were used to determine associations between single SNPs/haplotypes and immune outcomes. Increased carriage of minor alleles for the promoter SNPs (rs2844482 and rs2857708) of the TNFA gene were associated with dose-related increases in rubella antibodies. IL-6 secretion was co-directionally associated (p≤0.01) with five intronic SNPs in the TNFRSF1B gene in an allele dose-related manner, while five promoter/intronic SNPs in the IL12B gene were associated with variations in IL-6 secretion. TNFA haplotype AAACGGGGC (t-statistic=3.32) and IL12B promoter haplotype TAG (t-statistic=2.66) were associated with higher levels of (p≤0.01) rubella-IgG and IL-6 secretion, respectively. We identified individual SNPs/haplotypes in TNFA/TNFRSF1B and IL12B genes that appear to modulate immunity to rubella vaccination. Identification of such “genetic fingerprints” may predict the outcome of vaccine response and inform new vaccine strategies.
Rubella; Vaccine; Antibody; Cytokine; SNP; Haplotype
Aims: To develop a panel of markers able to extract full haplotype information for candidate genes in alcoholism, other addictions and disorders of mood and anxiety. Methods: A total of 130 genes were haplotype tagged and genotyped in 7 case/control populations and 51 reference populations using Illumina GoldenGate SNP genotyping technology, determining haplotype coverage. We also constructed and determined the efficacy of a panel of 186 ancestry informative markers. Results: An average of 1465 loci were genotyped at an average completion rate of 91.3%, with an average call rate of 98.3% and replication rate of 99.7%. Completion and call rates were lowered by the performance of two datasets, highlighting the importance of the DNA quality in high throughput assays. A comparison of haplotypes captured by the Addictions Array tagging SNPs and commercially available whole-genome arrays from Illumina and Affymetrix shows comparable performance of the tag SNPs to the best whole-genome array in all populations for which data are available. Conclusions: Arrays of haplotype-tagged candidate genes, such as this addictions-focused array, represent a cost-effective approach to generate high-quality SNP genotyping data useful for the haplotype-based analysis of panels of genes such as these 130 genes of interest to alcohol and addictions researchers. The inclusion of the 186 ancestry informative markers allows for the detection and correction for admixture and further enhances the utility of the array.
Whole genome amplification (WGA) offers a means to enrich DNA quantities for epidemiologic studies. We used an ovarian cancer study of 1,536 single nucleotide polymorphisms (SNPs) and 2,368 samples to assess performance of multiple displacement amplification (MDA) WGA using an Illumina GoldenGate BeadArray. Initial screening revealed successful genotyping for 93.4% of WGA samples and 99.3% of genomic samples, and 93.2% of SNPs for WGA samples and 96.3% of SNPs for genomic samples. SNP failure was predicted by Illumina-provided designability rank, %GC (P ≤ 0.002), and for WGA only, distance to telomere and Illumina-provided SNP score (P ≤ 0.002). Distance to telomere and %GC were highly correlated; adjustment for %GC removed the association between distance to telomere and SNP failure. Although universally high, per-SNP call rates were related to designability rank, SNP score, %GC, minor allele frequency, distance to telomere (P ≤ 0.01), and, for WGA only, Illumina-provided validation class (P < 0.001). We found excellent concordance generally (>99.0%) among 124 WGA:genomic replicates, 15 WGA replicates, 88 replicate aliquots of the same WGA preparation, and 25 genomic replicates. Where there was discordance, it was across WGA:genomic replicates but limited to only a few samples among other replicates suggesting the introduction of error. Designability rank and SNP score correlated with WGA:genomic concordance (P < 0.001). In summary, use of MDA WGA DNA is feasible; however, caution is warranted regarding SNP selection and analysis. We recommend that biological SNP characteristics, notably distance to telomere and GC content (<50% GC recommended), as well as Illumina-provided metrics be considered in the creation of GoldenGate assays using MDA WGA DNA.
Abdominal aortic aneurysm (AAA) is a complex disorder with multiple genetic risk factors. Using affected relative pair linkage analysis, we previously identified an AAA susceptibility locus on chromosome 19q13. This locus has been designated as the AAA1 susceptibility locus in the Online Mendelian Inheritance in Man (OMIM) database.
Nine candidate genes were selected from the AAA1 locus based on their function, as well as mRNA expression levels in the aorta. A sample of 394 cases and 419 controls was genotyped for 41 SNPs located in or around the selected nine candidate genes using the Illumina GoldenGate platform. Single marker and haplotype analyses were performed. Three genes (CEBPG, PEPD and CD22) were selected for DNA sequencing based on the association study results, and exonic regions were analyzed. Immunohistochemical staining of aortic tissue sections from AAA and control individuals was carried out for the CD22 and PEPD proteins with specific antibodies.
Several SNPs were nominally associated with AAA (p < 0.05). The SNPs with most significant p-values were located near the CCAAT enhancer binding protein (CEBPG), peptidase D (PEPD), and CD22. Haplotype analysis found a nominally associated 5-SNP haplotype in the CEBPG/PEPD locus, as well as a nominally associated 2-SNP haplotype in the CD22 locus. DNA sequencing of the coding regions revealed no variation in CEBPG. Seven sequence variants were identified in PEPD, including three not present in the NCBI SNP (dbSNP) database. Sequencing of all 14 exons of CD22 identified 20 sequence variants, five of which were in the coding region and six were in the 3'-untranslated region. Five variants were not present in dbSNP. Immunohistochemical staining for CD22 revealed protein expression in lymphocytes present in the aneurysmal aortic wall only and no detectable expression in control aorta. PEPD protein was expressed in fibroblasts and myofibroblasts in the media-adventitia border in both aneurysmal and non-aneurysmal tissue samples.
Association testing of the functional positional candidate genes on the AAA1 locus on chromosome 19q13 demonstrated nominal association in three genes. PEPD and CD22 were considered the most promising candidate genes for altering AAA risk, based on gene function, association evidence, gene expression, and protein expression.
A newly developed maize Illumina GoldenGate Assay with 1536 SNPs from 582 loci was used to genotype a highly diverse global maize collection of 632 inbred lines from temperate, tropical, and subtropical public breeding programs. A total of 1229 informative SNPs and 1749 haplotypes within 327 loci was used to estimate the genetic diversity, population structure, and familial relatedness. Population structure identified tropical and temperate subgroups, and complex familial relationships were identified within the global collection. Linkage disequilibrium (LD) was measured overall and within chromosomes, allelic frequency groups, subgroups related by geographic origin, and subgroups of different sample sizes. The LD decay distance differed among chromosomes and ranged between 1 to 10 kb. The LD distance increased with the increase of minor allelic frequency (MAF), and with smaller sample sizes, encouraging caution when using too few lines in a study. The LD decay distance was much higher in temperate than in tropical and subtropical lines, because tropical and subtropical lines are more diverse and contain more rare alleles than temperate lines. A core set of inbreds was defined based on haplotypes, and 60 lines capture 90% of the haplotype diversity of the entire panel. The defined core sets and the entire collection can be used widely for different research targets.
Single nucleotide polymorphisms (SNPs) are the most abundant source of genetic variation among individuals of a species. New genotyping technologies allow examining hundreds to thousands of SNPs in a single reaction for a wide range of applications such as genetic diversity analysis, linkage mapping, fine QTL mapping, association studies, marker-assisted or genome-wide selection. In this paper, we evaluated the potential of highly-multiplexed SNP genotyping for genetic mapping in maritime pine (Pinus pinaster Ait.), the main conifer used for commercial plantation in southwestern Europe.
We designed a custom GoldenGate assay for 1,536 SNPs detected through the resequencing of gene fragments (707 in vitro SNPs/Indels) and from Sanger-derived Expressed Sequenced Tags assembled into a unigene set (829 in silico SNPs/Indels). Offspring from three-generation outbred (G2) and inbred (F2) pedigrees were genotyped. The success rate of the assay was 63.6% and 74.8% for in silico and in vitro SNPs, respectively. A genotyping error rate of 0.4% was further estimated from segregating data of SNPs belonging to the same gene. Overall, 394 SNPs were available for mapping. A total of 287 SNPs were integrated with previously mapped markers in the G2 parental maps, while 179 SNPs were localized on the map generated from the analysis of the F2 progeny. Based on 98 markers segregating in both pedigrees, we were able to generate a consensus map comprising 357 SNPs from 292 different loci. Finally, the analysis of sequence homology between mapped markers and their orthologs in a Pinus taeda linkage map, made it possible to align the 12 linkage groups of both species.
Our results show that the GoldenGate assay can be used successfully for high-throughput SNP genotyping in maritime pine, a conifer species that has a genome seven times the size of the human genome. This SNP-array will be extended thanks to recent sequencing effort using new generation sequencing technologies and will include SNPs from comparative orthologous sequences that were identified in the present study, providing a wider collection of anchor points for comparative genomics among the conifers.
We genotyped a Somali population (n=85; age ≤ 30 years) for 617 cytokine and cytokine receptor SNPs using Illumina GoldenGate genotyping to determine associations with measles, mumps and rubella immunity. Overall, sixty-one significant associations (p≤0.01) were found between SNPs belonging to cytokine receptor genes regulating Th1 (IL12RB2, IL2RA and B) and Th2 (IL4R, IL10RB) immunity, and cytokine (IL1B, TNFA, IL6 and IFNB1) and cytokine receptor (IL1RA, IFNAR2, IL18R1, TNFRSF1A and B) genes regulating innate immunity, and variations in antibody levels to measles, mumps or rubella. SNPs within two major inflammatory cytokine genes, TNFA and IL6, demonstrated associations with measles-specific antibodies. Specifically, the minor allele variant of rs1799964 (TNFA -1211 C>T) was associated with primarily seronegative values (median EIA index values ≤0.87; p=0.002; q=0.23) in response to measles disease and/or vaccination. A heterozygous variant CT for rs2069849 (IL6 +4272C>T; Phe201Phe) was also associated with seronegative values and a lower median level of antibody response to measles disease and/or vaccination (p=0.004; q=0.36) or measles vaccination alone (p=0.008). Several SNPs within the coding and regulatory regions of cytokine and cytokine receptor genes demonstrated associations with mumps and rubella antibody levels, but were less informative as strong LD patterns and lower frequencies for minor alleles were observed among these SNPs. Our study identifies specific SNPs in innate immune response genes that may play a role in modulating antibody responses to measles vaccination and/or infection in Somali subjects.
Polymorphisms; Cytokine; Cytokine receptors; Antibodies; MMR; Immunity
The incidence of esophageal adenocarcinoma (EA) has been increasing rapidly, particularly among white males, over the past few decades in the USA. However, the etiology of EA and the striking male predominance is not fully explained by known risk factors. To identify susceptible genes for EA risk, we conducted a pathway-based candidate gene association study on 335 Caucasian EA cases and 319 Caucasian controls. A total of 1330 single-nucleotide polymorphisms (SNPs) selected from 354 genes were analyzed using an Illumina GoldenGate assay. The genotyped common SNPs include missense and exonic SNPs, SNPs within untranslated regions and 2 kb 5′ of the gene, and tagSNPs for genes with little functional information available. Logistic regression adjusted for potential confounders was used to assess the genetic effect of each SNP on EA risk. We also tested gene–gender interactions using the likelihood ratio tests. We found that the genetic variants in the apoptosis pathway were significantly associated with EA risk after correcting for multiple comparisons. SNPs of rs3127075 in Caspase-7 (CASP7) and rs4661636 in Caspase-9 (CASP9) genes that play a critical role in apoptosis were found to be associated with an increased risk of EA. A protective effect of SNP rs572483 in the progesterone receptor (PGR) gene was observed among women carrying the variant G allele [adjusted odds ratio (OR) = 0.19; 95% confidence interval (CI) = 0.08–0.46] but was not observed among men (adjusted OR = 1.38; 95% CI = 0.95–2.00). In conclusion, this study suggests that the genetic variants of CASP7 and CASP9 in the apoptosis pathway may be important predictive markers for EA susceptibility and that PGR in the sex hormone signaling pathway may be associated with the gender differences in EA risk.
The genotype data of 7054 single nucleotide polymorphism (SNP) loci in 40 tomato lines, including inbred lines, F1 hybrids, and wild relatives, were collected using Illumina's Infinium and GoldenGate assay platforms, the latter of which was utilized in our previous study. The dendrogram based on the genotype data corresponded well to the breeding types of tomato and wild relatives. The SNPs were classified into six categories according to their positions in the genes predicted on the tomato genome sequence. The genes with SNPs were annotated by homology searches against the nucleotide and protein databases, as well as by domain searches, and they were classified into the functional categories defined by the NCBI's eukaryotic orthologous groups (KOG). To infer the SNPs' effects on the gene functions, the three-dimensional structures of the 843 proteins that were encoded by the genes with SNPs causing missense mutations were constructed by homology modelling, and 200 of these proteins were considered to carry non-synonymous amino acid substitutions in the predicted functional sites. The SNP information obtained in this study is available at the Kazusa Tomato Genomics Database (http://plant1.kazusa.or.jp/tomato/).
single nucleotide polymorphism (SNP); Solanum lycopersicum; infinium assay; goldengate assay; homology modelling
Benzene is an established human hematotoxicant and leukemogen but its mechanism of action is unclear. To investigate the role of single-nucleotide polymorphisms (SNPs) on benzene-induced hematotoxicity, we analyzed 1395 SNPs in 411 genes using an Illumina GoldenGate assay in 250 benzene-exposed workers and 140 unexposed controls. Highly significant findings clustered in five genes (BLM, TP53, RAD51, WDR79 and WRN) that play a critical role in DNA repair and genomic maintenance, and these regions were then further investigated with tagSNPs. One or more SNPs in each gene were associated with highly significant 10–20% reductions (P values ranged from 0.0011 to 0.0002) in the white blood cell (WBC) count among benzene-exposed workers but not controls, with evidence for gene–environment interactions for SNPs in BLM, WRN and RAD51. Further, among workers exposed to benzene, the genotype-associated risk of having a WBC count <4000 cells/μl increased when using individuals with progressively higher WBC counts as the comparison group, with some odds ratios >8-fold. In vitro functional studies revealed that deletion of SGS1 in yeast, equivalent to lacking BLM and WRN function in humans, caused reduced cellular growth in the presence of the toxic benzene metabolite hydroquinone, and knockdown of WRN using specific short hairpin RNA increased susceptibility of human TK6 cells to hydroquinone toxicity. Our findings suggest that SNPs involved in DNA repair and genomic maintenance, with particular clustering in the homologous DNA recombination pathway, play an important role in benzene-induced hematotoxicity.
To explore the potential value of high-throughput genotyping assays in the analysis of large and complex genomes, we designed two highly multiplexed Illumina bead arrays using the GoldenGate SNP assay for gene mapping in white spruce (Picea glauca [Moench] Voss) and black spruce (Picea mariana [Mill.] B.S.P.).
Each array included 768 SNPs, identified by resequencing genomic DNA from parents of each mapping population. For white spruce and black spruce, respectively, 69.2% and 77.1% of genotyped SNPs had valid GoldenGate assay scores and segregated in the mapping populations. For each of these successful SNPs, on average, valid genotyping scores were obtained for over 99% of progeny. SNP data were integrated to pre-existing ALFP, ESTP, and SSR markers to construct two individual linkage maps and a composite map for white spruce and black spruce genomes. The white spruce composite map contained 821 markers including 348 gene loci. Also, 835 markers including 328 gene loci were positioned on the black spruce composite map. In total, 215 anchor markers (mostly gene markers) were shared between the two species. Considering lineage divergence at least 10 Myr ago between the two spruces, interspecific comparison of homoeologous linkage groups revealed remarkable synteny and marker colinearity.
The design of customized highly multiplexed Illumina SNP arrays appears as an efficient procedure to enhance the mapping of expressed genes and make linkage maps more informative and powerful in such species with poorly known genomes. This genotyping approach will open new avenues for co-localizing candidate genes and QTLs, partial genome sequencing, and comparative mapping across conifers.
The present study reports the large-scale discovery of genome-wide single-nucleotide polymorphisms (SNPs) in chickpea, identified mainly through the next generation sequencing of two genotypes, i.e. Cicer arietinum ICC4958 and its wild progenitor C. reticulatum PI489777, parents of an inter-specific reference mapping population of chickpea. Development and validation of a high-throughput SNP genotyping assay based on Illumina's GoldenGate Genotyping Technology and its application in building a high-resolution genetic linkage map of chickpea is described for the first time. In this study, 1022 SNPs were identified, of which 768 high-confidence SNPs were selected for designing the custom Oligo Pool All (CpOPA-I) for genotyping. Of these, 697 SNPs could be successfully used for genotyping, demonstrating a high success rate of 90.75%. Genotyping data of the 697 SNPs were compiled along with those of 368 co-dominant markers mapped in an earlier study, and a saturated genetic linkage map of chickpea was constructed. One thousand and sixty-three markers were mapped onto eight linkage groups spanning 1808.7 cM (centiMorgans) with an average inter-marker distance of 1.70 cM, thereby representing one of the most advanced maps of chickpea. The map was used for the synteny analysis of chickpea, which revealed a higher degree of synteny with the phylogenetically close Medicago than with soybean. The first set of validated SNPs and map resources developed in this study will not only facilitate QTL mapping, genome-wide association analysis and comparative mapping in legumes but also help anchor scaffolds arising out of the whole-genome sequencing of chickpea.
chickpea; SNP; linkage map; genotyping; NGS
Hypertension is a known risk factor for renal cell carcinoma (RCC), although the underlying biological mechanisms of its action are unknown. To clarify the role of hypertension in RCC, we examined the risk of RCC in relation to 142 single-nucleotide polymorphisms (SNPs) in eight genes having a role in blood pressure control. We analyzed 777 incident and histologically confirmed RCC cases and 1035 controls who completed an in-person interview as part of a multi-center, hospital-based case–control study in Central Europe. Genotyping was conducted with an Illumina® GoldenGate® Oligo Pool All assay using germ line DNA. Of the eight genes examined, AGT (angiotensinogen) was most strongly associated with RCC (minimum P-value permutation test = 0.02). Of the 17 AGT tagging SNPs considered, associations were strongest for rs1326889 [odds ratio (OR) = 1.35, 95% confidence interval (CI) = 1.15–1.58] and rs2493137 (OR = 1.31, 95% CI = 1.12–1.54), which are located in the promoter. Stratified analysis revealed that the effects of the AGT SNPs were statistically significant in participants with hypertension or high body mass index (BMI) (≥25 kg/m2), but not in subjects without hypertension and with a normal BMI (<25 kg/m2). Also, haplotypes with risk-conferring alleles of markers located in the promoter and intron 1 regions of AGT were significantly associated with RCC compared with the common haplotype in subjects with hypertension or high BMI (global P = 0.003). Our findings suggest that common genetic variants of AGT, particularly those in the promoter, increase RCC risk among subjects who are hypertensive or overweight.
Tocopherols, which are vitamin E compounds, play an important role in maintaining human health. Compared with other staple foods, maize grains contain high level of tocopherols.
Two F2 populations (K22/CI7 and K22/Dan340, referred to as POP-1 and POP-2, respectively), which share a common parent (K22), were developed and genotyped using a GoldenGate assay containing 1,536 single nucleotide polymorphism (SNP) markers. An integrated genetic linkage map was constructed using 619 SNP markers, spanning a total of 1649.03 cM of the maize genome with an average interval of 2.67 cM. Seventeen quantitative trait loci (QTLs) for all the traits were detected in the first map and 13 in the second. In these two maps, QTLs for different traits were localized to the same genomic regions and some were co-located with candidate genes in the tocopherol biosynthesis pathway. Single QTL was responsible for 3.03% to 52.75% of the phenotypic variation and the QTLs in sum explained23.4% to 66.52% of the total phenotypic variation. A major QTL (qc5-1/qd5-1) affecting α-tocopherol (αT) was identified on chromosome 5 between the PZA03161.1 and PZA02068.1 in the POP-2. The QTL region was narrowed down from 18.7 Mb to 5.4 Mb by estimating the recombination using high-density markers of the QTL region. This allowed the identification of the candidate gene VTE4 which encodes γ-tocopherol methyltransferase, an enzyme that transforms γ-tocopherol (γT)to αT.
These results demonstrate that a few QTLs with major effects and several QTLs with medium to minor effects might contribute to the natural variation of tocopherols in maize grain. The high-density markers will help to fine map and identify the QTLs with major effects even in the preliminary segregating populations. Furthermore, this study provides a simple guide line for the breeders to improve traits that minimize the risk of malnutrition, especially in developing countries.
Maize; Tocopherols; QTL mapping
Single nucleotide polymorphism (SNP) validation and large-scale genotyping are required to maximize the use of DNA sequence variation and determine the functional relevance of candidate genes for complex stress tolerance traits through genetic association in rice. We used the bead array platform-based Illumina GoldenGate assay to validate and genotype SNPs in a select set of stress-responsive genes to understand their functional relevance and study the population structure in rice.
Of the 384 putative SNPs assayed, we successfully validated and genotyped 362 (94.3%). Of these 325 (84.6%) showed polymorphism among the 91 rice genotypes examined. Physical distribution, degree of allele sharing, admixtures and introgression, and amino acid replacement of SNPs in 263 abiotic and 62 biotic stress-responsive genes provided clues for identification and targeted mapping of trait-associated genomic regions. We assessed the functional and adaptive significance of validated SNPs in a set of contrasting drought tolerant upland and sensitive lowland rice genotypes by correlating their allelic variation with amino acid sequence alterations in catalytic domains and three-dimensional secondary protein structure encoded by stress-responsive genes. We found a strong genetic association among SNPs in the nine stress-responsive genes with upland and lowland ecological adaptation. Higher nucleotide diversity was observed in indica accessions compared with other rice sub-populations based on different population genetic parameters. The inferred ancestry of 16% among rice genotypes was derived from admixed populations with the maximum between upland aus and wild Oryza species.
SNPs validated in biotic and abiotic stress-responsive rice genes can be used in association analyses to identify candidate genes and develop functional markers for stress tolerance in rice.
Illumina GoldenGate assay; Population structure; Rice; SNPs; Single nucleotide polymorphisms; Stress-responsive genes
The stay-green phenomenon is a key plant trait with wide usage in managing crop production under limited water conditions. This trait enhances delayed senescence, biomass, and grain yield under drought stress. In this study we sought to identify QTLs in cowpea (Vigna unguiculata) consistent across experiments conducted in Burkina Faso, Nigeria, Senegal, and the United States of America under limited water conditions. A panel of 383 diverse cowpea accessions and a recombinant inbred line population (RIL) were SNP genotyped using an Illumina 1536 GoldenGate assay. Phenotypic data from thirteen experiments conducted across the four countries were used to identify SNP-trait associations based on linkage disequilibrium association mapping, with bi-parental QTL mapping as a complementary strategy. We identified seven loci, five of which exhibited evidence suggesting pleiotropic effects (stay-green) between delayed senescence, biomass, and grain yield. Further, we provide evidence suggesting the existence of positive pleiotropy in cowpea based on positively correlated mean phenotypic values (0.34< r <0.87) and allele effects (0.07< r <0.86) for delayed senescence and grain yield across three African environments. Three of the five putative stay-green QTLs, Dro-1, 3, and 7 were identified in both RILs and diverse germplasm with resolutions of 3.2 cM or less for each of the three loci, suggesting that these may be valuable targets for marker-assisted breeding in cowpea. Also, the co-location of early vegetative delayed senescence with biomass and grain yield QTLs suggests the possibility of using delayed senescence at the seedling stage as a rapid screening tool for post-flowering drought tolerance in cowpea breeding. BLAST analysis using EST sequences harboring SNPs with the highest associations provided a genomic context for loci identified in this study in closely related common bean (Phaseolus vulgaris) and soybean (Glycine max) reference genomes.
Single Nucleotide Polymorphisms (SNPs) can be used as genetic markers for applications such as genetic diversity studies or genetic mapping. New technologies now allow genotyping hundreds to thousands of SNPs in a single reaction.
In order to evaluate the potential of these technologies in pea, we selected a custom 384-SNP set using SNPs discovered in Pisum through the resequencing of gene fragments in different genotypes and by compiling genomic sequence data present in databases. We then designed an Illumina GoldenGate assay to genotype both a Pisum germplasm collection and a genetic mapping population with the SNP set.
We obtained clear allelic data for more than 92% of the SNPs (356 out of 384). Interestingly, the technique was successful for all the genotypes present in the germplasm collection, including those from species or subspecies different from the P. sativum ssp sativum used to generate sequences. By genotyping the mapping population with the SNP set, we obtained a genetic map and map positions for 37 new gene markers.
Our results show that the Illumina GoldenGate assay can be used successfully for high-throughput SNP genotyping of diverse germplasm in pea. This genotyping approach will simplify genotyping procedures for association mapping or diversity studies purposes and open new perspectives in legume genomics.
Human genetic variation is a determinant of recovery from an acute hepatitis C virus (HCV) infection, but, to date, single nucleotide polymorphisms (SNPs) in a limited number of genes have been studied with respect to HCV clearance. We determined whether SNPs in 112 selected immune-response genes are important for HCV clearance by genotyping 1536 SNPs in a cohort of 343 persons with natural HCV clearance and 547 persons with HCV persistence. PLINK and Haploview software packages were used to perform association, permutation, and haplotype analyses stratified by African-American (AA) and European-American (EA) race. Of the 1536 SNPs tested, 1426 were successfully genotyped (92.8%). In AAs, we identified 18 SNPs located in 11 gene regions that were associated with HCV outcome (empirical p-value < 0.01). In EAs, there were 20 SNPs located in eight gene regions associated with HCV outcome. Four of the gene regions studied (TNFSF18, TANK, HAVCR1 and IL18BP) contained SNPs with empirical p-values < 0.01 in both of the race groups.
In this large-scale analysis of 1426 genotyped SNPs in 112 candidate genes, we identified four gene regions that are likely candidates for a role in HCV clearance or persistence in both AAs and EAs.
TANK; TNFSF18; HAVCR1; IL18BP; genetics
Common genetic variation could alter the risk for developing bladder cancer. We conducted a large-scale evaluation of single nucleotide polymorphisms (SNPs) in candidate genes for cancer to identify common variants that influence bladder cancer risk. An Illumina GoldenGate assay was used to genotype 1,433 SNPs within or near 386 genes in 1,086 cases and 1,033 controls in Spain. The most significant finding was in the 5′ UTR of VEGF (rs25648, p for likelihood ratio test, 2 degrees of freedom = 1 × 10−5). To further investigate the region, we analyzed 29 additional SNPs in VEGF, selected to saturate the promoter and 5′ UTR and to tag common genetic variation in this gene. Three additional SNPs in the promoter region (rs833052, rs1109324, and rs1547651) were associated with increased risk for bladder cancer: odds ratio (95% confidence interval): 2.52 (1.06–5.97), 2.74 (1.26–5.98), and 3.02 (1.36–6.63), respectively; and a polymorphism in intron 2 (rs3024994) was associated with reduced risk: 0.65 (0.46–0.91). Two of the promoter SNPs and the intron 2 SNP showed linkage disequilibrium with rs25648. Haplotype analyses revealed three blocks of linkage disequilibrium with significant associations for two blocks including the promoter and 5′ UTR (global p = 0.02 and 0.009, respectively). These findings are biologically plausible since VEGF is critical in angiogenesis, which is important for tumor growth, its elevated expression in bladder tumors correlates with tumor progression, and specific 5′ UTR haplotypes have been shown to influence promoter activity. Associations between bladder cancer risk and other genes in this report were not robust based on false discovery rate calculations. In conclusion, this large-scale evaluation of candidate cancer genes has identified common genetic variants in the regulatory regions of VEGF that could be associated with bladder cancer risk.
This article reports findings from a large-scale evaluation of common variation in candidate genes for cancer to identify variants that influence bladder cancer risk. We first evaluated 1,433 common variants within or near 386 genes in a large case-control study in Spain. The most significant finding was the gene coding for the vascular endothelial growth factor (VEGF). To further investigate this finding, we identified markers that captured most common variation in the whole gene. Analyses indicated that variants in regulatory regions of VEGF could modify the risk for developing bladder cancer. This association is biologically plausible since VEGF is critical for the growth of new blood vessels, which is important for tumor development, and its elevated expression in bladder tumors correlates with tumor progression. Future studies are required to confirm these findings, as well as to investigate the mechanisms for the observed associations.
Non-Hodgkin lymphoma (NHL) is the fifth most common cancer in the U.S. and few causes have been identified. Genetic association studies may help identify environmental risk factors and enhance our understanding of disease mechanisms.
768 coding and haplotype tagging SNPs in 146 genes were examined using Illumina GoldenGate technology in a large population-based case-control study of NHL in the San Francisco Bay Area (1,292 cases 1,375 controls are included here). Statistical analyses were restricted to HIV- participants of white non-Hispanic origin. Genes involved in steroidogenesis, immune function, cell signaling, sunlight exposure, xenobiotic metabolism/oxidative stress, energy balance, and uptake and metabolism of cholesterol, folate and vitamin C were investigated. Sixteen SNPs in eight pathways and nine haplotypes were associated with NHL after correction for multiple testing at the adjusted q<0.10 level. Eight SNPs were tested in an independent case-control study of lymphoma in Germany (494 NHL cases and 494 matched controls). Novel associations with common variants in estrogen receptor 1 (ESR1) and in the vitamin C receptor and matrix metalloproteinase gene families were observed. Four ESR1 SNPs were associated with follicular lymphoma (FL) in the U.S. study, with rs3020314 remaining associated with reduced risk of FL after multiple testing adjustments [odds ratio (OR) = 0.42, 95% confidence interval (CI) = 0.23–0.77) and replication in the German study (OR = 0.24, 95% CI = 0.06–0.94). Several SNPs and haplotypes in the matrix metalloproteinase-3 (MMP3) and MMP9 genes and in the vitamin C receptor genes, solute carrier family 23 member 1 (SLC23A1) and SLC23A2, showed associations with NHL risk.
Our findings suggest a role for estrogen, vitamin C and matrix metalloproteinases in the pathogenesis of NHL that will require further validation.
As high-throughput genetic marker screening systems are essential for a range of genetics studies and plant breeding applications, the International RosBREED SNP Consortium (IRSC) has utilized the Illumina Infinium® II system to develop a medium- to high-throughput SNP screening tool for genome-wide evaluation of allelic variation in apple (Malus×domestica) breeding germplasm. For genome-wide SNP discovery, 27 apple cultivars were chosen to represent worldwide breeding germplasm and re-sequenced at low coverage with the Illumina Genome Analyzer II. Following alignment of these sequences to the whole genome sequence of ‘Golden Delicious’, SNPs were identified using SoapSNP. A total of 2,113,120 SNPs were detected, corresponding to one SNP to every 288 bp of the genome. The Illumina GoldenGate® assay was then used to validate a subset of 144 SNPs with a range of characteristics, using a set of 160 apple accessions. This validation assay enabled fine-tuning of the final subset of SNPs for the Illumina Infinium® II system. The set of stringent filtering criteria developed allowed choice of a set of SNPs that not only exhibited an even distribution across the apple genome and a range of minor allele frequencies to ensure utility across germplasm, but also were located in putative exonic regions to maximize genotyping success rate. A total of 7867 apple SNPs was established for the IRSC apple 8K SNP array v1, of which 5554 were polymorphic after evaluation in segregating families and a germplasm collection. This publicly available genomics resource will provide an unprecedented resolution of SNP haplotypes, which will enable marker-locus-trait association discovery, description of the genetic architecture of quantitative traits, investigation of genetic variation (neutral and functional), and genomic selection in apple.
Ethanol is metabolized by two rate limiting reactions: alcohol dehydrogenases (ADH) convert ethanol to acetaldehyde, subsequently metabolized to acetate by aldehyde dehydrogenases (ALDH). Approximately 50% of East Asians have genetic variants that significantly impair this pathway and influence alcohol dependence (AD) vulnerability. We investigated whether variation in alcohol metabolism genes might alter the AD risk in four non-East Asian populations by performing systematic haplotype association analyses in order to maximize the chances of capturing functional variation.
Haplotype-tagging SNPs were genotyped using the Illumina GoldenGate platform. Genotypes were available for 40 SNPs across the ADH genes cluster and 24 SNPs across the two ALDH genes in four diverse samples that included cases (lifetime AD) and controls (no Axis 1 disorders). The case, control sample sizes were: Finnish Caucasians: 232, 194; African Americans: 267, 422; Plains American Indians: 226, 110; Southwestern American (SW) Indians: 317, 72.
In all four populations, as well as HapMap populations, five haplotype blocks were identified across the ADH gene cluster: (1) ADH5-ADH4; (2) ADH6-ADH1A-ADH1B; (3) ADH1C; (4) intergenic; (5) ADH7. The ALDH1A1 gene was defined by four blocks and ALDH2 by one block. No haplotype or SNP association results were significant after correction for multiple comparisons; however several results, particularly for ALDH1A1 and ADH4, replicated earlier findings. There was an ALDH1A1 block 1 and 2 (extending from intron 5 to the 3′ UTR) yin yang haplotype (haplotypes that have opposite allelic configuration) association with AD in the Finns driven by SNPs rs3764435 and rs2303317 respectively, and an ALDH1A1 block 3 (including the promoter region) yin yang haplotype association in SW Indians driven by 5 SNPs, all in allelic identity. The ADH4 SNP rs3762894 was associated with AD in Plains Indians.
The systematic evaluation of alcohol metabolizing genes in four non-East Asian populations has shown only modest associations with AD, largely for ALDH1A1 and ADH4. A concentration of signals for AD with ALDH1A1 yin yang haplotypes in several populations warrants further study.
Alcohol dependence; alcohol dehydrogenases (ADH); aldehyde dehydrogenases (ALDH); haplotype association; ALDH1A1