The highest associated SNP identified in this study was rs2244012 in intron 2 of
RAD50 (P = 3.04E-07). In addition, evidence was observed for association with multiple SNPs in the RAD50-IL13 region for asthma susceptibility and asthma related quantitative traits. The protein encoded by
RAD50 is involved in DNA double-strand break repair and its expression level is constitutively low in most tissues, thus it has no known function directly related to asthma, although MER11-RAD50-NBS1 complex has been shown to be involved in somatic hypermutation and gene conversion of immunoglobulin regions [
30]. On the contrary, other genes (
IL4,
IL5, and
IL13) in the T
H2 cytokines locus are better candidates based on their biologic functions. Three SNPs in
IL13 in this study were associated with asthma.
IL13 is critical to the pathogenesis of allergen-induced asthma and thus one of the most highly studied and replicated genes in both genome-wide linkage and candidate-gene association studies. rs20541 (Arg130Gln or IL13+4257GA), in the coding region of
IL13, was also analyzed in this study and has been shown to be associated with asthma [
31] and total serum IgE levels [
32]. rs1800925 (IL13-1111CT), in the promoter region of
IL13, has been shown to be associated with asthma [
33] and total serum IgE levels [
34]. In a GWAS with total serum IgE levels, four SNPs in
RAD50 (rs2706347, rs3798135, rs2040704, and rs7737470), have been identified (P < 10E-04) [
15]. These four SNPs in
RAD50 were in strong LD with rs1800925 (0.7 < r
2 < 0.8) and in weak LD with
rs20541 (0.2 < r
2 < 0.3) in
IL13 [
15]. These results are consistent with the results of this study; since many of the TENOR asthmatics were recruited from allergist’s offices and the population has increased IgE levels [
18]. Since the actual functional SNPs can-not be determined purely by their P values, it is difficult to dissect the association data of
RAD50 from
IL13 in this study or other genetic studies due to the degree of LD present in this chromosomal region.
In a transgenic mouse study, a T
H2 locus control region (LCR) was identified as the 25 kb fragment at the 3’ end of
Rad50 [
35]. An LCR is defined experimentally as regulating the expression of linked genes in a copy number dependent and tissue-specific manner. The T
H2 LCR is involved in the chromatin configuration to re-organize promoters of
IL4,
IL5,
IL13 in proximity and co-regulation of T
H2 cytokine expression [
36]. Seven
Rad50 DNase I hypersensitive sites (RHS1-7) were identified, where RHS4-7 formed the core of the LCR [
37]. LCR-C (RHS7) and LCR-B (RHS6) were possible T
H2 cytokine expression enhancers; LCR-A (RHS6) and LCR-O (RHS5) were likely insulators [
38]. RHS7 is essential for T
H2 cytokine expression by showing T
H2 specific demethylation after allergen stimulation and intrachromosomal interactions between LCR and the promoters of T
H2 cytokines [
39]. Furthermore, RHS6,
Rad50 promoter (RHS2), and
IL5 promoter interacted with interferon gamma (
Ifng) on a different chromosome, which suggests an interchromosomal regulation of the expression of T
H1/T
H2 cytokines [
40]. Although all the above experiments were done in mouse, the
RAD50 sequence is highly conserved in the LCR between human and mouse. With imputation, multiple significant SNPs were found in the LCR (see
Table E2 in the Online Repository): rs3798135 (P = 1.49E-06, in RHS5/LCR-O), rs12653750 (P = 1.49E-06, in RHS6/LCR-A), rs2040704 (P = 1.33E-06, in RHS6/LCR-B), and rs2240032 (P = 6.68E-06, in RHS7/LCR-C). The association of rs2244012 with the expression levels of
IL13 in lymphocytes from white adults based on GENEVAR dataset was not significant (P = 0.176), but may be due to small sample size.
Since both a previous GWAS for total serum IgE levels and our GWAS of asthma identified RAD50, it appears to be a new candidate gene for asthma. Although it is still possible the signal from RAD50 is purely due to its LD with the promoter of IL13, RAD50 deserves to be carefully studied when considering TH2 cytokine locus.
HLA-DR/DQ also showed consistent association with asthma, for example, rs1063355 in the 3’ UTR of
HLA-DQB1 (P = 9.55E-06), rs2239804 in intron of
HLA-DRA (P = 2.80E-05), and rs2516049 5’ upstream of
HLA-DRB1 (P = 2.62E-05).
HLA-DR/DQ is part of the HLA class II region, which is one of the most gene/variant dense regions in the human genome and is associated with many diseases [
41].
HLA-DQB1 and
HLA-DRB1 have been shown to be associated with asthma in multiple independent studies [
42–
44]. Genetic variants in the HLA-DR/DQ region have also been shown to be highly associated with
HLA-DR/DQ gene expression, indicating that the association of
HLA-DR/DQ with disease might be due to gene expression levels in addition to antigen recognition [
45,
46]. The association of rs2516049 with asthma in our study and with the expression levels of
HLA-DRB1 (P = 1.25E-04) in lymphocytes from white adults based on GENEVAR dataset indicated that the variant might function through expression level changes (see
Figure E4 in the Online Repository) [
28,
29]. Imputation identified a SNP with a more significant P value, rs3998159 (P = 1.45E-06), between
HLA-DQB1 and
HLA-DQA2. It is difficult to determine the functional genes/SNPs in the HLA-DR/DQ region in our study due to the complicated LD pattern in this region. The long-range LD and haplotype analysis based on the MHC Haplotype Project may solve the issue [
47].
Using a GWAS approach, this study is the first to confirm the association of RAD50-IL13 and HLA-DR/DQ regions with asthma susceptibility, regions which have been identified by multiple candidate-gene association studies and one genome-wide association study on total serum IgE levels. Our results weakly replicated the findings of the other GWAS:
ORMDL3 and gasdermin B (
GSDML) (rs7216389) with asthma (P = 0.057); FCER1A (rs2251746) with total serum IgE (P = 0.040);
CHI3L1 (rs880633) with FEV
1 (P = 0.003), FVC (P = 0.031), and FEV
1/FVC (P = 0.040). rs1588265 (P = 0.507) and rs1544791 (P = 0.678) in
PDE4D with asthma were not replicated. GWAS of total serum IgE by Weidinger [
15] identified several SNPs in
RAD50 (P < 10E-04). In our study, the most significant SNP in
RAD50 for total serum IgE is rs6871536 (P = 2.61E-03). The geometric mean of total serum IgE in Weidinger’s study is 42.41 (95% CI: 39.56 – 45.47). In our study, the geometric mean of total serum IgE is higher, 48.94 (95% CI: 43.04 – 55.65). The difference in the total serum IgE distribution and relatively small sample size in our study may lead to the difference of significant levels between these two studies.
The potential for false negative results could not be avoided in this study due to the relatively small sample size (473 cases) which may also be the reason that although significance levels of 10-7 were observed, no SNP reached Bonferroni adjusted multiple test criterion (p=0.05/292,443 = 1.71E-07. However, evidence for multiple SNPs was observed in our results in this comprehensively phenotyped relatively homogeneous cohort of difficult-to-treat asthmatics from the larger TENOR study. Our control datasets (general population and phenotyped controls) both have some limitations. They were both significantly younger (see ) than TENOR cases, making our results a little conservative because some controls might become asthma cases in the future. Genotyping confirmation and fine-mapping of candidate regions were impossible since the Illumina controls were from a public database, but our approach compensated for this by using imputation. Population stratification was relatively strong between TENOR cases and Illumina 550k controls.
This GWAS confirmed the important role of TH2 cytokine and antigen presentation genes in asthma at a genome-wide level. Furthermore, these findings will stimulate more comprehensive research (e.g., re-sequencing, long-range LD, epistasis, epigenetics, copy number variant, and function) on these two regions due to their functional importance and structural complexity.
Key messages
GWAS of asthma identifies RAD50-IL13 and HLA-DR/DQ. These findings will stimulate more comprehensive research on these genes because of their structural complexity and functional importance in the pathogenesis of asthma.