Marginal zone lymphoma (MZL) is the third most common subtype of B-cell non-Hodgkin lymphoma. Here we perform a two-stage GWAS of 1,281 MZL cases and 7,127 controls of European ancestry and identify two independent loci near BTNL2 (rs9461741, P=3.95 × 10−15) and HLA-B (rs2922994, P=2.43 × 10−9) in the HLA region significantly associated with MZL risk. This is the first evidence that genetic variation in the major histocompatibility complex influences MZL susceptibility.
Marginal zone lymphoma (MZL) is a common subtype of B-cell non-Hodgkin lymphoma. Here the authors carry out a two-stage genome-wide association study in over 8,000 Europeans and identify two new MZL risk loci at chromosome 6p, implicating the major histocompatibility complex in the disease for the first time.
The recent advances in high-throughput sequencing technologies bring the potential of a better characterization of the genetic variation in humans and other organisms. In many occasions, either by design or by necessity, the sequencing procedure is performed on a pool of DNA samples with different abundances, where the abundance of each sample is unknown. Such a scenario is naturally occurring in the case of metagenomics analysis where a pool of bacteria is sequenced, or in the case of population studies involving DNA pools by design. Particularly, various pooling designs were recently suggested that can identify carriers of rare alleles in large cohorts, dramatically reducing the cost of such large-scale sequencing projects. A fundamental problem with such approaches for population studies is that the uncertainty of DNA proportions from different individuals in the pools might lead to spurious associations. Fortunately, it is often the case that the genotype data of at least some of the individuals in the pool is known. Here, we propose a method (eALPS) that uses the genotype data in conjunction with the pooled sequence data in order to accurately estimate the proportions of the samples in the pool, even in cases where not all individuals in the pool were genotyped (eALPS-LD). Using real data from a sequencing pooling study of non-Hodgkin's lymphoma, we demonstrate that the estimation of the proportions is crucial, since otherwise there is a risk for false discoveries. Additionally, we demonstrate that our approach is also applicable to the problem of quantification of species in metagenomics samples (eALPS-BCR) and is particularly suitable for metagenomic quantification of closely related species.
algorithms; alignment; cancer genomics; NP-completeness
Recent GWAS have identified several susceptibility loci for NHL. Despite these successes, much of the heritable variation in NHL risk remains to be explained. Common copy-number variants are important genomic sources of variability, and hence a potential source to explain part of this missing heritability. In this study, we carried out a CNV analysis using GWAS data from 681 NHL cases and 749 controls to explore the relationship between common structural variation and lymphoma susceptibility. Here we found a novel association with diffuse large B-cell lymphoma (DLBCL) risk involving a partial duplication of the C-terminus region of the LOC283177 long non-coding RNA that was further confirmed by quantitative PCR. For chronic lymphocytic leukemia/small lymphocytic lymphoma (CLL/SLL), known somatic deletions were identified on chromosomes 13q14, 11q22-23, 14q32 and 22q11.22. Our study shows that GWAS data can be used to identify germline CNVs associated with disease risk for DLBCL and somatic CNVs for CLL/SLL.
Studies of smoking and risk of non-Hodgkin lymphoma (NHL) have yielded inconsistent results, possibly due to subtype heterogeneity and/or genetic variation impacting the metabolism of tobacco-derived carcinogens, including substrates of the N-acetyltransferase enzymes NAT1 and NAT2.
We conducted a pooled analysis of 5,026 NHL cases and 4,630 controls from seven case–control studies in the international lymphoma epidemiology consortium to examine associations between smoking, variation in the N-acetyltransferase genes NAT1 and NAT2, and risk of NHL subtypes. Smoking data were harmonized across studies, and genetic variants in NAT1 and NAT2 were used to infer acetylation phenotype of the NAT1 and NAT2 enzymes, respectively. Pooled odds ratios (ORs) and 95 % confidence intervals (95 % CIs) for risk of NHL and subtypes were calculated using joint fixed effects unconditional logistic regression models.
Current smoking was associated with a significant 30 % increased risk of follicular lymphoma (n = 1,176) but not NHL overall or other NHL subtypes. The association was similar among NAT2 slow (OR 1.36; 95 % CI 1.07–1.75) and intermediate/rapid (OR 1.27; 95 % CI 0.95–1.69) acetylators (pinteraction = 0.82) and also did not differ by NAT1*10 allelotype. Neither NAT2 phenotype nor NAT1*10 allelotype was associated with risk of NHL overall or NHL subtypes.
The current findings provide further evidence for a modest association between current smoking and follicular lymphoma risk and suggest that this association may not be influenced by variation in the N-acetyltransferase enzymes.
Non-Hodgkin lymphoma; Gene environment interaction; Cigarette smoking; N-acetyltransferase; Follicular lymphoma
A recent meta-analysis of three genome-wide association studies of chronic lymphocytic leukaemia (CLL) identified two common variants at the 6p21.31 locus that are associated with CLL risk. To verify and further explore the association of these variants with other non-Hodgkin lymphoma (NHL) subtypes, we genotyped 1196 CLL cases, 1699 NHL cases, and 2410 controls. We found significant associations between the 6p21.31 variants and CLL risk (rs210134: P=0.01; rs210142: P=6.8×10−3). These variants also showed a trend towards association with some of the other NHL subtypes. Our results validate the prior work and support specific genetic pathways for risk among NHL subtypes.
CLL; NHL; SNPs; BAK1; risk locus
Non-Hodgkin lymphomas (NHL) are a heterogeneous group of solid tumours of lymphoid cell origin. Three important aspects of lymphocyte development include immunity and inflammation, DNA repair, and programmed cell death. We have used a previously established case-control study of NHL to ask whether genetic variation in genes involved in these three important processes influences risk of this cancer. 118 genes in these three categories were tagged with single nucleotide polymorphisms (SNPs), which were tested for association with NHL and its subtypes. The main analysis used logistic regression (additive model) to estimate odds ratios in European-ancestry cases and controls. 599 SNPs and 1116 samples (569 cases and 547 controls) passed quality control measures and were included in analyses. Following multiple-testing correction, one SNP in MSH3, a mismatch repair gene, showed an association with diffuse large B-cell lymphoma (OR: 1.91; 95% CI: 1.41–2.59; uncorrected p = 0.00003; corrected p = 0.010). This association was not replicated in an independent European-ancestry sample set of 251 diffuse large B-cell lymphoma cases and 737 controls, indicating this result was likely a false positive. It is likely that moderate sample size, inter-subtype and other genetic heterogeneity, and small true effect sizes account for the lack of replicable findings.
H2AFX encodes a histone variant involved in signaling sites of DNA damage and recruiting repair factors. Genetic variants in H2AFX may influence risk of non-Hodgkin lymphoma (NHL), a heterogeneous group of lymphoid tumors that are characterized by chromosomal translocations. We previously reported that rs2509049, a common variant in the promoter of H2AFX, was associated with risk for NHL in the British Columbia population. Here we report results for 13 single nucleotide polymorphisms (SNPs) in 100 Kb surrounding H2AFX in an expanded collection of 568 NHL cases and 547 controls. After correction for multiple testing, significant associations were present for mantle cell lymphoma (p=0.007 for rs604714) and all B-cell lymphomas (p=0.046 for rs2509049). Strong linkage disequilibrium in the 5 Kb upstream of H2AFX limited the ability to determine which specific SNP (rs2509049, rs7759, rs8551, rs643788, rs604714, or rs603826), if any, was responsible. There was a significant interaction between sex and rs2509049 in the all B-cell lymphomas group (p=0.002); a sex-stratified analysis revealed that the association was confined to females (p=0.001). Neither the overall nor the female-specific association with rs2509049 was replicated in any of four independent NHL sample sets. Meta-analysis of all five study populations (3,882 B-cell NHL cases and 3,718 controls) supported a weak association with B-cell lymphoma (OR=0.92, 95% CI=0.86-0.99, p=0.034), although this association was not significant after exclusion of the British Columbia data. Further research into the potential sex-specificity of the H2AFX-NHL association may identify a subset of NHL cases that are influenced by genotype at this locus.
Owing to their role in controlling the efflux of toxic compounds, transporters are central players in the process of detoxification and elimination of xenobiotics, which in turn is related to cancer risk. Among these transporters, ATP-binding cassette B1/multidrug resistance 1 (ABCB1/MDR1), ABCC2/multidrug resistance protein 2 (MRP2), and ABCG2/breast cancer resistance protein (BCRP) affect susceptibility to many hematopoietic malignancies. The maintenance of regulated expression of these transporters is governed through the activation of intracellular “xenosensors” like the nuclear receptor 1I2/pregnane X receptor (NR1I2/PXR). SNPs in genes encoding these regulators have also been implicated in the risk of several cancers. Using a tagging approach, we tested the hypothesis that common polymorphisms in the transporter genes ABCB1, ABCC2, ABCG2, and the regulator gene NR1I2 could be implicated in lymphoma risk. We selected 68 SNPs in the 4 genes, and we genotyped them in 1,481 lymphoma cases and 1,491 controls of the European cases-control study (EpiLymph) using the Illumina™ GoldenGate assay technology.Carriers of the SNP rs6857600 minor allele in ABCG2, was associated with a decrease in risk of B-cell lymphoma (B-NHL) overall (p<0.001). Furthermore, a decreased risk of chronic lymphocytic leukemia (CLL) was associated with the ABCG2 rs2231142 variant (p=0.0004), which could be replicated in an independent population. These results suggest a role for this gene in B-NHL susceptibility, especially for CLL.
Lymphoma; multidrug resistance 1 (MDR1); multidrug resistance protein 2 (MRP2); breast cancer resistance protein (BCRP); pregnane X receptor (PXR)
Follicular lymphoma (FL) is an indolent, sometimes fatal disease characterized by recurrence at progressively shorter intervals and is frequently refractive to therapy. Genome-wide association studies have identified SNPs in the human leukocyte antigen (HLA) region on chromosome 6p21.32–33 that are statistically significantly associated with FL risk. Low to medium resolution typing of single or multiple HLA genes has provided an incomplete picture of the total genetic risk imparted by this highly variable region. To gain further insight into the role of HLA alleles in lymphomagenesis and to investigate the independence of validated SNPs and HLA alleles with FL risk, high-resolution HLA typing was conducted using next-generation sequencing in 222 non-Hispanic white FL cases and 220 matched controls from a larger San Francisco Bay Area population-based case-control study of lymphoma. A novel protective association was found between the DPB1*03:01 allele and FL risk (OR=0.39, 95% CI 0.21–0.68). Extended haplotypes DRB1*01:01-DQA1*01:01-DQB1*05:01 (OR=2.01, 95% CI 1.22–3.38) and DRB1*15-DQA1*01-DQB1*06 (OR=0.55, 95% CI 0.36–0.82) also influenced FL risk. Moreover, DRB1*15-DQA1*01-DQB1*06 was highly correlated with an established FL risk locus, rs2647012. These results provide further insight into the critical roles of HLA alleles and SNPs in FL pathogenesis that involve multi-locus effects across the HLA region.
follicular lymphoma; HLA; genetic risk factors; next-generation sequencing
B-cell non-Hodgkin lymphoma represents a diverse group of hematological malignancies, of which follicular lymphoma (FL) is one of the most common subtypes. Family and epidemiological studies suggest an important genetic role in the etiology of FL. In recent genome-wide association studies (GWAS) of FL, several genetic susceptibility loci have been identified on chromosome 6p21.33 (rs6457327) and 6p21.32 (rs10484561, rs2647012) in the human leukocyte antigen class I and class II regions. To identify new genetic variants and further elucidate the genetic basis of FL, a meta-analysis was performed of the top 1000 SNPs associated with FL risk from two GWAS in the US, Denmark and Sweden (592 cases, 1541 controls), with independent validation in 107 cases and 681 controls.
rs9275517 and rs3117222 in the HLA class II region were validated and inversely associated with FL risk (rs9275517: OR = 0.63, 95% CI = 0.55-0.73, p = 4.03 × 10-11; rs3117222: OR = 0.66, 95% CI = 0.57-0.77, p = 1.45 × 10-7). rs9275517, which is in high linkage disequilibrium with rs2647012 (r2 = 0.9), was no longer associated with FL after conditioning on rs2647012. The rs3117222 association was independent of established FL SNPs, but not of the HLA-DPB1*0301 allele. Using publicly available gene expression profiles with matching genotype information, we found that rs3117222 also was significantly correlated with increased HLA-DPB1 expression.
By performing a meta-analysis of two GWAS of FL, we further validated the relevance of HLA-DPB1*0301 as a protective allele in the pathogenesis of FL. Moreover, the protective rs3117222 A allele correlated with increased levels of HLA-DPB1, suggesting a possible disease mechanism involving HLA-DPB1 expression regulation. Our results add further support to the major role of HLA genetic variation in the pathogenesis of FL.
Follicular lymphoma (FL); Genome-wide association studies (GWAS); Human leukocyte antigen (HLA); Meta-analysis
Polymorphisms in chemokine genes have been associated with human immunodeficiency virus (HIV)-related non-Hodgkin lymphoma (NHL) but are understudied in non-HIV-related NHL. Associations of NHL and NHL subtypes with polymorphisms and haplotypes in CCR5, CCR2, CCL5, CXCL12 and CX3CR1 were explored in a pooled analysis of three case-control studies (San Francisco Bay Area, California; United Kingdom; total: cases N=1610, controls N=1992). Adjusted unconditional logistic regression was used to estimate relative risks among HIV-negative non-Hispanic Caucasians. The CCR5M Δ32 deletion reduced the risk of NHL (odds ratio=0.56, 95% confidence interval=0.38-0.83) in men but not women with similar effects observed for diffuse large-cell and follicular lymphoma (FL). NHL risk also was reduced in men with the CCR2/CCR5 haplotype characterized by the CCR5 Δ32 deletion. The CCL5 −403A allele conferred reduced risks of FL and chronic lymphocytic leukemia/small lymphocytic lymphoma. Results should be interpreted conservatively. Continued investigation is warranted to confirm these findings.
Lymphoma non-Hodgkin; Chemokines; Polymorphism, genetic; Case-Control
Non-Hodgkin lymphoma (NHL) is a hematological malignancy of the immune system, and, as with autoimmune and inflammatory diseases (ADs), is influenced by genetic variation in the major histocompatibility complex (MHC). Persons with a history of specific ADs also have increased risk of NHL. As the coexistence of ADs and NHL could be caused by factors common to both diseases, here we examined whether some of the associated genetic signals are shared. Overlapping risk loci for NHL subytpes and several ADs were explored using data from genome-wide association studies. Several common genomic regions and susceptibility loci were identified suggesting a potential shared genetic background. Two independent MHC regions showed the main overlap, with several alleles in the human leukocyte antigen (HLA) Class II region exhibiting an opposite risk effect for follicular lymphoma and type I diabetes. These results support continued investigation to further elucidate the relationship between lymphoma and autoimmune diseases.
Non-Hodgkin lymphoma; Autoimmune diseases; Genome-wide Association Studies; Human Leukocyte Antigen
In a recent genome-wide association study of follicular lymphoma (FL), we identified novel risk alleles on chromosome 6p21.33 that appeared to be part of an extended haplotype including HLA-DRB1*0101, DQA1*0101, and DQB1*0501. To follow up on these findings, we obtained 2–4 digit HLA-DQB1 allelotypes on a subset of 265 FL cases and 757 controls using a novel assay that applies multiplexed ligation-dependent probe amplification (MLPA). We confirmed a positive association between FL and the HLA-DQB1*05 allele group (OR=1.70, 95% CI 1.28–2.27; adjusted p-value=0.013) and also identified an allele group inversely associated with FL risk, HLA-DQB1*06 (OR=0.51, 95% CI 0.38–0.69; adjusted p-value=4.46×10−5). Although these findings require verification, the role of HLA class II proteins in B-cell survival and proliferation make this a biologically plausible association.
follicular lymphoma; MHC; HLA; DQ; NHL
Recent genome-wide association studies (GWAS) have identified several gene variants associated with sporadic chronic lymphocytic leukemia/small lymphocytic lymphoma (CLL/SLL). Many of these CLL/SLL susceptibility loci are located in non-coding or intergenic regions, posing a significant challenge to determine their potential functional relevance. Here, we review the literature of all CLL/SLL GWAS and validation studies, and apply eQTL analysis to identify putatively functional SNPs that affect gene expression that may be causal in the pathogenesis of CLL/SLL. We tested 12 independent risk loci for their potential to alter gene expression through cis-acting mechanisms, using publicly available gene expression profiles with matching genotype information. Sixteen SNPs were identified that are linked to differential expression of SP140, a putative tumor suppressor gene previously associated with CLL/SLL. Three additional SNPs were associated with differential expression of DACT3 and GNG8, which are involved in the WNT/β-catenin- and G protein-coupled receptor signaling pathways, respectively, that have been previously implicated in CLL/SLL pathogenesis. Using in silico functional prediction tools, we found that 14 of the 19 significant eQTL SNPs lie in multiple putative regulatory elements, several of which have prior implications in CLL/SLL or other hematological malignancies. Although experimental validation is needed, our study shows that the use of existing GWAS data in combination with eQTL analysis and in silico methods represents a useful starting point to screen for putatively causal SNPs that may be involved in the etiology of CLL/SLL.
Non-Hodgkin lymphoma (NHL) represents a diverse group of hematological
malignancies, of which follicular lymphoma (FL) is a prevalent subtype. A
previous genome-wide association study has established a marker, rs10484561 in
the human leukocyte antigen (HLA) class II region on 6p21.32 associated with
increased FL risk. Here, in a three-stage genome-wide association study,
starting with a genome-wide scan of 379 FL cases and 791 controls followed by
validation in 1,049 cases and 5,790 controls, we identified a second independent
FL–associated locus on 6p21.32, rs2647012
(ORcombined = 0.64,
Pcombined = 2×10−21)
located 962 bp away from rs10484561 (r2<0.1 in controls). After
mutual adjustment, the associations at the two SNPs remained genome-wide
significant (rs2647012:ORadjusted = 0.70,
Padjusted = 4×10−12;
rs10484561:ORadjusted = 1.64,
Padjusted = 5×10−15).
Haplotype and coalescence analyses indicated that rs2647012 arose on an
evolutionarily distinct haplotype from that of rs10484561 and tags a novel
allele with an opposite (protective) effect on FL risk. Moreover, in a follow-up
analysis of the top 6 FL–associated SNPs in 4,449 cases of other NHL
subtypes, rs10484561 was associated with risk of diffuse large B-cell lymphoma
(ORcombined = 1.36,
Pcombined = 1.4×10−7).
Our results reveal the presence of allelic heterogeneity within the HLA class II
region influencing FL susceptibility and indicate a possible shared genetic
etiology with diffuse large B-cell lymphoma. These findings suggest that the HLA
class II region plays a complex yet important role in NHL.
Earlier studies have established a marker rs10484561, in the HLA class II region
on 6p21.32, associated with increased follicular lymphoma (FL) risk. Here, in a
three-stage genome-wide association study of 1,428 FL cases and 6,581 controls,
we identified a second independent FL–associated marker on 6p21.32,
rs2647012, located 962 bp away from rs10484561. The associations at two SNPs
remained genome-wide significant after mutual adjustment. Haplotype and
coalescence analyses indicated that rs2647012 arose on an evolutionarily
distinct lineage from that of rs10484561 and tags a novel allele with an
opposite, protective effect on FL risk. Moreover, in an analysis of the top 6
FL–associated SNPs in 4,449 cases of other NHL subtypes, rs10484561 was
associated with risk of diffuse large B-cell lymphoma. Our results reveal the
presence of allelic heterogeneity at 6p21.32 in FL risk and suggest a shared
genetic etiology with the common diffuse large B-cell lymphoma subtype.
To identify susceptibility loci for non-Hodgkin lymphoma (NHL) subtypes, we conducted a three-stage genome-wide association study. We identified two variants associated with follicular lymphoma (FL) in 1,465 FL cases/6,958 controls at 6p21.32 (rs10484561, rs7755224, r2=1.0; combined p-values=1.12×10-29, 2.00×10-19), providing further support that MHC genetic variation influences FL susceptibility. Confirmatory evidence of a previously reported association was also found between chronic lymphocytic leukemia/small lymphocytic lymphoma and rs735665 (combined p-value=4.24×10-9).
We conducted genome-wide association studies of non-Hodgkin lymphoma using Illumina HumanHap550 BeadChips to identify subtype-specific associations in follicular, diffuse large B-cell and chronic lymphocytic leukemia/small lymphocytic lymphomas. We found that rs6457327 on 6p21.33 was associated with susceptibility to follicular lymphoma (FL, N=189 cases/592 controls) with validation in an additional 456 FL cases and 2,785 controls (combined allelic p-value=4.7×10−11). The region of strongest association overlaps C6orf15(STG), located near psoriasis susceptibility region 1(PSORS1).
Non-Hodgkin lymphoma (NHL) is the fifth most common cancer in the U.S. and few causes have been identified. Genetic association studies may help identify environmental risk factors and enhance our understanding of disease mechanisms.
768 coding and haplotype tagging SNPs in 146 genes were examined using Illumina GoldenGate technology in a large population-based case-control study of NHL in the San Francisco Bay Area (1,292 cases 1,375 controls are included here). Statistical analyses were restricted to HIV- participants of white non-Hispanic origin. Genes involved in steroidogenesis, immune function, cell signaling, sunlight exposure, xenobiotic metabolism/oxidative stress, energy balance, and uptake and metabolism of cholesterol, folate and vitamin C were investigated. Sixteen SNPs in eight pathways and nine haplotypes were associated with NHL after correction for multiple testing at the adjusted q<0.10 level. Eight SNPs were tested in an independent case-control study of lymphoma in Germany (494 NHL cases and 494 matched controls). Novel associations with common variants in estrogen receptor 1 (ESR1) and in the vitamin C receptor and matrix metalloproteinase gene families were observed. Four ESR1 SNPs were associated with follicular lymphoma (FL) in the U.S. study, with rs3020314 remaining associated with reduced risk of FL after multiple testing adjustments [odds ratio (OR) = 0.42, 95% confidence interval (CI) = 0.23–0.77) and replication in the German study (OR = 0.24, 95% CI = 0.06–0.94). Several SNPs and haplotypes in the matrix metalloproteinase-3 (MMP3) and MMP9 genes and in the vitamin C receptor genes, solute carrier family 23 member 1 (SLC23A1) and SLC23A2, showed associations with NHL risk.
Our findings suggest a role for estrogen, vitamin C and matrix metalloproteinases in the pathogenesis of NHL that will require further validation.
Single nucleotide polymorphisms (SNPs) are, together with copy number variation, the primary source of variation in the human genome. SNPs are associated with altered response to drug treatment, susceptibility to disease and other phenotypic variation. Furthermore, during genetic screens for disease-associated mutations in groups of patients and control individuals, the distinction between disease causing mutation and polymorphism is often unclear. Annotation of the functional and structural implications of single nucleotide changes thus provides valuable information to interpret and guide experiments. The SNPeffect and PupaSuite databases are now synchronized to deliver annotations for both non-coding and coding SNP, as well as annotations for the SwissProt set of human disease mutations. In addition, SNPeffect now contains predictions of Tango2: an improved aggregation detector, and Waltz: a novel predictor of amyloid-forming sequences, as well as improved predictors for regions that are recognized by the Hsp70 family of chaperones. The new PupaSuite version incorporates predictions for SNPs in silencers and miRNAs including their targets, as well as additional methods for predicting SNPs in TFBSs and splice sites. Also predictions for mouse and rat genomes have been added. In addition, a PupaSuite web service has been developed to enable data access, programmatically. The combined database holds annotations for 4 965 073 regulatory as well as 133 505 coding human SNPs and 14 935 disease mutations, and phenotypic descriptions of 43 797 human proteins and is accessible via http://snpeffect.vib.be and http://pupasuite.bioinfo.cipf.es/.