Rationale: Chronic obstructive pulmonary disease (COPD) is defined by the presence of airflow limitation on spirometry, yet subjects with COPD can have marked differences in computed tomography imaging. These differences may be driven by genetic factors. We hypothesized that a genome-wide association study (GWAS) of quantitative imaging would identify loci not previously identified in analyses of COPD or spirometry. In addition, we sought to determine whether previously described genome-wide significant COPD and spirometric loci were associated with emphysema or airway phenotypes.
Objectives: To identify genetic determinants of quantitative imaging phenotypes.
Methods: We performed a GWAS on two quantitative emphysema and two quantitative airway imaging phenotypes in the COPDGene (non-Hispanic white and African American), ECLIPSE (Evaluation of COPD Longitudinally to Identify Predictive Surrogate Endpoints), NETT (National Emphysema Treatment Trial), and GenKOLS (Genetics of COPD, Norway) studies and on percentage gas trapping in COPDGene. We also examined specific loci reported as genome-wide significant for spirometric phenotypes related to airflow limitation or COPD.
Measurements and Main Results: The total sample size across all cohorts was 12,031, of whom 9,338 were from COPDGene. We identified five loci associated with emphysema-related phenotypes, one with airway-related phenotypes, and two with gas trapping. These loci included previously reported associations, including the HHIP, 15q25, and AGER loci, as well as novel associations near SERPINA10 and DLC1. All previously reported COPD and a significant number of spirometric GWAS loci were at least nominally (P < 0.05) associated with either emphysema or airway phenotypes.
Conclusions: Genome-wide analysis may identify novel risk factors for quantitative imaging characteristics in COPD and also identify imaging features associated with previously identified lung function loci.
emphysema; airway; genetics; chronic obstructive pulmonary disease
Genetic risk loci have been identified for a wide range of diseases through genome-wide association studies (GWAS), but the relevant functional mechanisms have been identified for only a small proportion of these GWAS-identified loci. By integrating results from the largest current GWAS of chronic obstructive disease (COPD) with expression quantitative trait locus (eQTL) analysis in whole blood and sputum from 121 subjects with COPD from the ECLIPSE Study, this analysis identifies loci that are simultaneously associated with COPD and the expression of nearby genes (COPD eQTLs). After integrative analysis, 19 COPD eQTLs were identified, including all four previously identified genome-wide significant loci near HHIP, FAM13A, and the 15q25 and 19q13 loci. For each COPD eQTL, fine mapping and colocalization analysis to identify causal shared eQTL and GWAS variants identified a subset of sites with moderate-to-strong evidence of harboring at least one shared variant responsible for both the eQTL and GWAS signals. Transcription factor binding site (TFBS) analysis confirms that multiple COPD eQTL lead SNPs disrupt TFBS, and enhancer enrichment analysis for loci with the strongest colocalization signals showed enrichment for blood-related cell types (CD3 and CD4+ T cells, lymphoblastoid cell lines). In summary, integrative eQTL and GWAS analysis confirms that genetic control of gene expression plays a key role in the genetic architecture of COPD and identifies specific blood-related cell types as likely participants in the functional pathway from GWAS-associated variant to disease phenotype.
Smoking is a notorious risk factor for chronic mucus hypersecretion (CMH). CMH frequently occurs in Chronic Obstructive Pulmonary Disease (COPD). The question arises whether the same single nucleotide polymorphisms (SNPs) are related to CMH in smokers with and without COPD.
We performed two genome wide association studies on CMH under an additive genetic model in male heavy smokers (≥20 pack-years) with COPD (n=849, 39.9% CMH) and without COPD (n=1,348, 25.4% CMH), followed by replication and meta-analysis in comparable populations, and assessment of the functional relevance of significantly associated SNPs.
GWA analysis on CMH in COPD and non-COPD yielded no genome wide significance after replication. In COPD, our top SNP (rs10461985, p=5.43×10−5) was located in the GDNF-antisense gene that is functionally associated with the GDNF gene. Expression of GDNF in bronchial biopsies of COPD patients was significantly associated with CMH (p=0.007). In non-COPD, 4 SNPs had a p-value <10−5 in the meta-analysis, including a SNP (rs4863687) in the MAML3 gene, the T-allele showing modest association with CMH (p=7.57×10−6, OR=1.48) and with significantly increased MAML3 expression in lung tissue (p=2.59×10−12).
Our data suggest the potential for differential genetic backgrounds of CMH in individuals with and without COPD.
Pulmonary function decline is a major contributor to morbidity and mortality among smokers. Post bronchodilator FEV1 and FEV1/FVC ratio are considered the standard assessment of airflow obstruction. We performed a genome-wide association study (GWAS) in 9919 current and former smokers in the COPDGene study (6659 non-Hispanic Whites [NHW] and 3260 African Americans [AA]) to identify associations with spirometric measures (post-bronchodilator FEV1 and FEV1/FVC). We also conducted meta-analysis of FEV1 and FEV1/FVC GWAS in the COPDGene, ECLIPSE, and GenKOLS cohorts (total n = 13,532).
Among NHW in the COPDGene cohort, both measures of pulmonary function were significantly associated with SNPs at the 15q25 locus [containing CHRNA3/5, AGPHD1, IREB2, CHRNB4] (lowest p-value = 2.17 × 10−11), and FEV1/FVC was associated with a genomic region on chromosome 4 [upstream of HHIP] (lowest p-value = 5.94 × 10−10); both regions have been previously associated with COPD. For the meta-analysis, in addition to confirming associations to the regions near CHRNA3/5 and HHIP, genome-wide significant associations were identified for FEV1 on chromosome 1 [TGFB2] (p-value = 8.99 × 10−9), 9 [DBH] (p-value = 9.69 × 10−9) and 19 [CYP2A6/7] (p-value = 3.49 × 10−8) and for FEV1/FVC on chromosome 1 [TGFB2] (p-value = 8.99 × 10−9), 4 [FAM13A] (p-value = 3.88 × 10−12), 11 [MMP3/12] (p-value = 3.29 × 10−10) and 14 [RIN3] (p-value = 5.64 × 10−9).
In a large genome-wide association study of lung function in smokers, we found genome-wide significant associations at several previously described loci with lung function or COPD. We additionally identified a novel genome-wide significant locus with FEV1 on chromosome 9 [DBH] in a meta-analysis of three study populations.
Electronic supplementary material
The online version of this article (doi:10.1186/s12863-015-0299-4) contains supplementary material, which is available to authorized users.
Chronic obstructive pulmonary disease; DBH; FEV1; FEV1/FVC; Genome-wide association study; Spirometry
Hypoxemia is a major complication of chronic obstructive pulmonary disease (COPD) that correlates with disease prognosis. Identifying genetic variants associated with oxygenation may provide clues for deciphering the heterogeneity in prognosis among patients with COPD. However, previous genetic studies have been restricted to investigating COPD candidate genes for association with hypoxemia. To report results from the first genome-wide association study (GWAS) of resting oxygen saturation (as measured by pulse oximetry [Spo2]) in subjects with COPD, we performed a GWAS of Spo2 in two large, well characterized COPD populations: COPDGene, including both the non-Hispanic white (NHW) and African American (AA) groups, and Evaluation of COPD Longitudinally to Identify Predictive Surrogate Endpoints (ECLIPSE). We identified several suggestive loci (P < 1 × 10−5) associated with Spo2 in COPDGene in the NHW (n = 2810) and ECLIPSE (n = 1758) groups, and two loci on chromosomes 14 and 15 in the AA group (n = 820) from COPDGene achieving a level of genome-wide significance (P < 5 × 10−8). The chromosome 14 single-nucleotide polymorphism, rs6576132, located in an intergenic region, was nominally replicated (P < 0.05) in the NHW group from COPDGene. The chromosome 15 single-nucleotide polymorphisms were rare in subjects of European ancestry, so the results could not be replicated. The chromosome 15 region contains several genes, including TICRR and KIF7, and is proximal to RHCG (Rh family C glyocoprotein gene). We have identified two loci associated with resting oxygen saturation in AA subjects with COPD, and several suggestive regions in subjects of European descent with COPD. Our study highlights the importance of investigating the genetics of complex traits in different racial groups.
chronic obstructive pulmonary disease; hypoxemia; pulse oximetry; genome-wide association study; oxygen saturation
To use a systems biology approach to integrate genotype and protein-protein interaction (PPI) data to identify disease network modules associated with chronic obstructive pulmonary disease (COPD) and to perform traditional pathway analysis.
We used a standard gene-set association approach (FORGE) using gene-based association analysis and gene-set definitions from the molecular signatures database (MSigDB). As a discovery step we analyzed GWAS results from two well-characterized COPD cohorts, COPDGene and GenKOLS. We used a third well-characterized COPD case-control cohort for replication, ECLIPSE. Next, we used dmGWAS, a method that integrates GWAS results with PPI, to identify COPD disease modules.
No gene-sets reached experiment-wide significance in either discovery population. We identified a consensus network of 10 genes identified in modules by integrating GWAS results with PPI that replicated in COPDGene, GenKOLS, and ECLIPSE. Members of four gene-sets were enriched among these 10 genes: (i) lung adenocarcinoma tumor sequencing genes, (ii) IL7 pathway genes, (iii) kidney cell response to arsenic, and (iv) CD4 T cell responses. Further, several genes have also been associated with pathophysiology relevant to COPD including KCNK3, NEDD4L and RIN3. In particular, KCNK3 has been associated with pulmonary arterial hypertension, a common complication in advanced COPD.
We report a set of new genes that may influence the etiology of COPD that would not have been identified using traditional GWAS and pathway analyses alone.
COPD; genome-wide scan; protein-protein interaction network; gene-set association; pathway association; disease module
The genetic risk factors for susceptibility to chronic obstructive
pulmonary disease (COPD) are still largely unknown. Additional genetic
variants are likely to be identified by genome-wide association studies in
larger cohorts or specific subgroups.
Genome-wide association analysis in COPDGene (non-Hispanic whites and
African-Americans) was combined with existing data from the ECLIPSE,
NETT/NAS, and GenKOLS (Norway) studies. Analyses were performed both using
all moderate-to-severe cases and the subset of severe cases. Top loci not
previously described as genome-wide significant were genotyped in the ICGN
study, and results combined in a joint meta-analysis.
Analysis of a total of 6,633 moderate-to-severe cases and 5,704
controls confirmed association at three known loci:
CHRNA3/CHRNA5/IREB2, FAM13A, and HHIP
(10−12 < P < 10−14),
and also showed significant evidence of association at a novel locus near
RIN3 (overall P, including ICGN =
5•4×10−9). In the severe COPD analysis
(n=3,497), the effects at two of three previously described loci were
significantly stronger; we also identified two additional loci previously
reported to affect gene expression of MMP12 and
TGFB2 (overall P = 2•6x10−9
and 8•3×10−9). RIN3 and
TGFB2 expression levels were reduced in a set of Lung
Tissue Research Consortium COPD lung tissue samples compared with
In a genome-wide study of COPD, we confirmed associations at three
known loci and found additional genome-wide significant associations with
moderate-to-severe COPD near RIN3 and with severe COPD near
MMP12 and TGFB2. Genetic variants,
apart from alpha-1 antitrypsin deficiency, increase the risk of COPD. Our
analysis of severe COPD suggests additional genetic variants may be
identified by focusing on this subgroup.
National Heart, Lung, and Blood Institute; the COPD Foundation
through contributions from AstraZeneca, Boehringer Ingelheim, Novartis, and
Sepracor; GlaxoSmithKline; Centers for Medicare and Medicaid Services;
Agency for Healthcare Research and Quality; US Department of Veterans
Motivation: For samples of unrelated individuals, we propose a general analysis framework in which hundred thousands of genetic loci can be tested simultaneously for association with complex phenotypes. The approach is built on spatial-clustering methodology, assuming that genetic loci that are associated with the target phenotype cluster in certain genomic regions. In contrast to standard methodology for multilocus analysis, which has focused on the dimension reduction of the data, our multilocus association-clustering test profits from the availability of large numbers of genetic loci by detecting clusters of loci that are associated with the phenotype.
Results: The approach is computationally fast and powerful, enabling the simultaneous association testing of large genomic regions. Even the entire genome or certain chromosomes can be tested simultaneously. Using simulation studies, the properties of the approach are evaluated. In an application to a genome-wide association study for chronic obstructive pulmonary disease, we illustrate the practical relevance of the proposed method by simultaneously testing all genotyped loci of the genome-wide association study and by testing each chromosome individually. Our findings suggest that statistical methodology that incorporates spatial-clustering information will be especially useful in whole-genome sequencing studies in which millions or billions of base pairs are recorded and grouped by genomic regions or genes, and are tested jointly for association.
Availability and implementation: Implementation of the approach is available upon request.
Supplementary data are available at Bioinformatics online.
Hedgehog Interacting Protein (HHIP) was implicated in chronic obstructive pulmonary disease (COPD) by genome-wide association studies (GWAS). However, it remains unclear how HHIP contributes to COPD pathogenesis. To identify genes regulated by HHIP, we performed gene expression microarray analysis in a human bronchial epithelial cell line (Beas-2B) stably infected with HHIP shRNAs. HHIP silencing led to differential expression of 296 genes; enrichment for variants nominally associated with COPD was found. Eighteen of the differentially expressed genes were validated by real-time PCR in Beas-2B cells. Seven of 11 validated genes tested in human COPD and control lung tissues demonstrated significant gene expression differences. Functional annotation indicated enrichment for extracellular matrix and cell growth genes. Network modeling demonstrated that the extracellular matrix and cell proliferation genes influenced by HHIP tended to be interconnected. Thus, we identified potential HHIP targets in human bronchial epithelial cells that may contribute to COPD pathogenesis.
Hedgehog interacting protein (HHIP); Gene expression profiling; COPD (Chronic obstructive pulmonary disease); extracellular matrix (ECM); network modeling
Chronic mucus hypersecretion (CMH) is associated with an increased frequency of respiratory infections, excess lung function decline, and increased hospitalisation and mortality rates in the general population. It is associated with smoking, but it is unknown why only a minority of smokers develops CMH. A plausible explanation for this phenomenon is a predisposing genetic constitution. Therefore, we performed a genome wide association (GWA) study of CMH in Caucasian populations.
GWA analysis was performed in the NELSON-study using the Illumina 610 array, followed by replication and meta-analysis in 11 additional cohorts. In total 2,704 subjects with, and 7,624 subjects without CMH were included, all current or former heavy smokers (≥20 pack-years). Additional studies were performed to test the functional relevance of the most significant single nucleotide polymorphism (SNP).
A strong association with CMH, consistent across all cohorts, was observed with rs6577641 (p = 4.25×10−6, OR = 1.17), located in intron 9 of the special AT-rich sequence-binding protein 1 locus (SATB1) on chromosome 3. The risk allele (G) was associated with higher mRNA expression of SATB1 (4.3×10−9) in lung tissue. Presence of CMH was associated with increased SATB1 mRNA expression in bronchial biopsies from COPD patients. SATB1 expression was induced during differentiation of primary human bronchial epithelial cells in culture.
Our findings, that SNP rs6577641 is associated with CMH in multiple cohorts and is a cis-eQTL for SATB1, together with our additional observation that SATB1 expression increases during epithelial differentiation provide suggestive evidence that SATB1 is a gene that affects CMH.
Cigarette smoking is the major environmental risk factor for chronic obstructive pulmonary disease (COPD). Genome-wide association studies have provided compelling associations for three loci with COPD. In this study, we aimed to estimate direct, i.e., independent from smoking, and indirect effects of those loci on COPD development using mediation analysis. We included a total of 3,424 COPD cases and 1,872 unaffected controls with data on two smoking-related phenotypes: lifetime average smoking intensity and cumulative exposure to tobacco smoke (pack years). Our analysis revealed that effects of two linked variants (rs1051730 and rs8034191) in the AGPHD1/CHRNA3 cluster on COPD development are significantly, yet not entirely, mediated by the smoking-related phenotypes. Approximately 30 % of the total effect of variants in the AGPHD1/CHRNA3 cluster on COPD development was mediated by pack years. Simultaneous analysis of modestly (r2 = 0.21) linked markers in CHRNA3 and IREB2 revealed that an even larger (~42 %) proportion of the total effect of the CHRNA3 locus on COPD was mediated by pack years after adjustment for an IREB2 single nucleotide polymorphism. This study confirms the existence of direct effects of the AGPHD1/CHRNA3, IREB2, FAM13A and HHIP loci on COPD development. While the association of the AGPHD1/CHRNA3 locus with COPD is significantly mediated by smoking-related phenotypes, IREB2 appears to affect COPD independently of smoking.
Rationale: A genome-wide association study (GWAS) for circulating chronic obstructive pulmonary disease (COPD) biomarkers could identify genetic determinants of biomarker levels and COPD susceptibility.
Objectives: To identify genetic variants of circulating protein biomarkers and novel genetic determinants of COPD.
Methods: GWAS was performed for two pneumoproteins, Clara cell secretory protein (CC16) and surfactant protein D (SP-D), and five systemic inflammatory markers (C-reactive protein, fibrinogen, IL-6, IL-8, and tumor necrosis factor-α) in 1,951 subjects with COPD. For genome-wide significant single nucleotide polymorphisms (SNPs) (P < 1 × 10−8), association with COPD susceptibility was tested in 2,939 cases with COPD and 1,380 smoking control subjects. The association of candidate SNPs with mRNA expression in induced sputum was also elucidated.
Measurements and Main Results: Genome-wide significant susceptibility loci affecting biomarker levels were found only for the two pneumoproteins. Two discrete loci affecting CC16, one region near the CC16 coding gene (SCGB1A1) on chromosome 11 and another locus approximately 25 Mb away from SCGB1A1, were identified, whereas multiple SNPs on chromosomes 6 and 16, in addition to SNPs near SFTPD, had genome-wide significant associations with SP-D levels. Several SNPs affecting circulating CC16 levels were significantly associated with sputum mRNA expression of SCGB1A1 (P = 0.009–0.03). Several SNPs highly associated with CC16 or SP-D levels were nominally associated with COPD in a collaborative GWAS (P = 0.001–0.049), although these COPD associations were not replicated in two additional cohorts.
Conclusions: Distant genetic loci and biomarker-coding genes affect circulating levels of COPD-related pneumoproteins. A subset of these protein quantitative trait loci may influence their gene expression in the lung and/or COPD susceptibility.
Clinical trial registered with www.clinicaltrials.gov (NCT 00292552).
biomarker; chronic obstructive pulmonary disease; genome-wide association study
Rationale: Genome-wide association studies (GWAS) have identified loci influencing lung function, but fewer genes influencing chronic obstructive pulmonary disease (COPD) are known.
Objectives: Perform meta-analyses of GWAS for airflow obstruction, a key pathophysiologic characteristic of COPD assessed by spirometry, in population-based cohorts examining all participants, ever smokers, never smokers, asthma-free participants, and more severe cases.
Methods: Fifteen cohorts were studied for discovery (3,368 affected; 29,507 unaffected), and a population-based family study and a meta-analysis of case-control studies were used for replication and regional follow-up (3,837 cases; 4,479 control subjects). Airflow obstruction was defined as FEV1 and its ratio to FVC (FEV1/FVC) both less than their respective lower limits of normal as determined by published reference equations.
Measurements and Main Results: The discovery meta-analyses identified one region on chromosome 15q25.1 meeting genome-wide significance in ever smokers that includes AGPHD1, IREB2, and CHRNA5/CHRNA3 genes. The region was also modestly associated among never smokers. Gene expression studies confirmed the presence of CHRNA5/3 in lung, airway smooth muscle, and bronchial epithelial cells. A single-nucleotide polymorphism in HTR4, a gene previously related to FEV1/FVC, achieved genome-wide statistical significance in combined meta-analysis. Top single-nucleotide polymorphisms in ADAM19, RARB, PPAP2B, and ADAMTS19 were nominally replicated in the COPD meta-analysis.
Conclusions: These results suggest an important role for the CHRNA5/3 region as a genetic risk factor for airflow obstruction that may be independent of smoking and implicate the HTR4 gene in the etiology of airflow obstruction.
chronic obstructive pulmonary disease; single-nucleotide polymorphism; genes
The genetic risk factors for chronic obstructive pulmonary disease (COPD) are still largely unknown. To date, genome-wide association studies (GWASs) of limited size have identified several novel risk loci for COPD at CHRNA3/CHRNA5/IREB2, HHIP and FAM13A; additional loci may be identified through larger studies. We performed a GWAS using a total of 3499 cases and 1922 control subjects from four cohorts: the Evaluation of COPD Longitudinally to Identify Predictive Surrogate Endpoints (ECLIPSE); the Normative Aging Study (NAS) and National Emphysema Treatment Trial (NETT); Bergen, Norway (GenKOLS); and the COPDGene study. Genotyping was performed on Illumina platforms with additional markers imputed using 1000 Genomes data; results were summarized using fixed-effect meta-analysis. We identified a new genome-wide significant locus on chromosome 19q13 (rs7937, OR = 0.74, P = 2.9 × 10−9). Genotyping this single nucleotide polymorphism (SNP) and another nearby SNP in linkage disequilibrium (rs2604894) in 2859 subjects from the family-based International COPD Genetics Network study (ICGN) demonstrated supportive evidence for association for COPD (P = 0.28 and 0.11 for rs7937 and rs2604894), pre-bronchodilator FEV1 (P = 0.08 and 0.04) and severe (GOLD 3&4) COPD (P = 0.09 and 0.017). This region includes RAB4B, EGLN2, MIA and CYP2A6, and has previously been identified in association with cigarette smoking behavior.
Two recent metaanalyses of genome-wide association studies conducted by the CHARGE and SpiroMeta consortia identified novel loci yielding evidence of association at or near genome-wide significance (GWS) with FEV1 and FEV1/FVC. We hypothesized that a subset of these markers would also be associated with chronic obstructive pulmonary disease (COPD) susceptibility. Thirty-two single-nucleotide polymorphisms (SNPs) in or near 17 genes in 11 previously identified GWS spirometric genomic regions were tested for association with COPD status in four COPD case-control study samples (NETT/NAS, the Norway case-control study, ECLIPSE, and the first 1,000 subjects in COPDGene; total sample size, 3,456 cases and 1,906 controls). In addition to testing the 32 spirometric GWS SNPs, we tested a dense panel of imputed HapMap2 SNP markers from the 17 genes located near the 32 GWS SNPs and in a set of 21 well studied COPD candidate genes. Of the previously identified GWS spirometric genomic regions, three loci harbored SNPs associated with COPD susceptibility at a 5% false discovery rate: the 4q24 locus including FLJ20184/INTS12/GSTCD/NPNT, the 6p21 locus including AGER and PPT2, and the 5q33 locus including ADAM19. In conclusion, markers previously associated at or near GWS with spirometric measures were tested for association with COPD status in data from four COPD case-control studies, and three loci showed evidence of association with COPD susceptibility at a 5% false discovery rate.
Traditional genome-wide association studies (GWAS) of large cohort of subjects with chronic obstructive pulmonary disease (COPD) have successfully identified novel candidate genes, but several other plausible loci do not meet strict criteria for genome-wide significance after correction for multiple testing.
We hypothesize that by applying unbiased weights derived from unique populations we can identify additional COPD susceptibility loci.
We performed a homozygosity haplotype analysis on a group of subjects with and without COPD to identify regions of conserved homozygosity (RCHH). Weights were constructed based on the frequency of these RCHH in case vs. controls, and used to adjust the P values from a large collaborative GWAS of COPD.
We identified 2,318 regions of conserved homozygosity, of which 576 were significantly (P < .05) overrepresented in cases. After applying the weights constructed from these regions to a collaborative GWAS of COPD, we identified two single nucleotide polymorphisms in a novel gene (FGF7) that gained genome-wide significance by the false discovery rate method. In a follow-up analysis, both SNPs (rs12591300 and rs4480740) were significantly associated with COPD in an independent population (combined P values of 7.9E-07 and 2.8E-06 respectively). In another independent population, increased lung tissue FGF7 expression was associated with worse measures of lung function.
Weights constructed from a homozygosity haplotype analysis of an isolated population successfully identify novel genetic associations from a GWAS on a separate population. This method can be used to identify promising candidate genes that fail to meet strict correction for multiple testing.
Cigarette smoking is a major risk factor for COPD and COPD severity. Previous genome-wide association studies (GWAS) have identified numerous single nucleotide polymorphisms (SNPs) associated with the number of cigarettes smoked per day (CPD) and a Dopamine Beta-Hydroxylase (DBH) locus associated with smoking cessation in multiple populations.
To identify SNPs associated with lifetime average and current CPD, age at smoking initiation, and smoking cessation in COPD subjects.
GWAS were conducted in 4 independent cohorts encompassing 3,441 ever-smoking COPD subjects (GOLD stage II or higher). Untyped SNPs were imputed using HapMap (phase II) panel. Results from all cohorts were meta-analyzed.
Several SNPs near the HLA region on chromosome 6p21 and in an intergenic region on chromosome 2q21 showed associations with age at smoking initiation, both with the lowest p=2×10−7. No SNPs were associated with lifetime average CPD, current CPD or smoking cessation with p<10−6. Nominally significant associations with candidate SNPs within alpha-nicotinic acetylcholine receptors 3/5 (CHRNA3/CHRNA5; e.g. p=0.00011 for SNP rs1051730) and Cytochrome P450 2A6 (CYP2A6; e.g. p=2.78×10−5 for a nonsynonymous SNP rs1801272) regions were observed for lifetime average CPD, however only CYP2A6 showed evidence of significant association with current CPD. A candidate SNP (rs3025343) in the DBH was significantly (p=0.015) associated with smoking cessation.
We identified two candidate regions associated with age at smoking initiation in COPD subjects. Associations of CHRNA3/CHRNA5 and CYP2A6 loci with CPD and DBH with smoking cessation are also likely of importance in the smoking behaviors of COPD patients.
Chronic Obstructive Pulmonary Disease (COPD); Genome Wide Association study (GWAS); smoking behaviors; Single Nucleotide Polymorphism (SNP)
Cachexia, whether assessed by body mass index (BMI) or fat-free mass index (FFMI), affects a significant proportion of patients with chronic obstructive pulmonary disease (COPD), and is an independent risk factor for increased mortality, increased emphysema, and more severe airflow obstruction. The variable development of cachexia among patients with COPD suggests a role for genetic susceptibility. The objective of the present study was to determine genetic susceptibility loci involved in the development of low BMI and FFMI in subjects with COPD. A genome-wide association study (GWAS) of BMI was conducted in three independent cohorts of European descent with Global Initiative for Chronic Obstructive Lung Disease stage II or higher COPD: Evaluation of COPD Longitudinally to Identify Predictive Surrogate End-Points (ECLIPSE; n = 1,734); Norway-Bergen cohort (n = 851); and a subset of subjects from the National Emphysema Treatment Trial (NETT; n = 365). A genome-wide association of FFMI was conducted in two of the cohorts (ECLIPSE and Norway). In the combined analyses, a significant association was found between rs8050136, located in the first intron of the fat mass and obesity–associated (FTO) gene, and BMI (P = 4.97 × 10−7) and FFMI (P = 1.19 × 10−7). We replicated the association in a fourth, independent cohort consisting of 502 subjects with COPD from COPDGene (P = 6 × 10−3). Within the largest contributing cohort of our analysis, lung function, as assessed by forced expiratory volume at 1 second, varied significantly by FTO genotype. Our analysis suggests a potential role for the FTO locus in the determination of anthropomorphic measures associated with COPD.
chronic obstructive pulmonary disease genetics; chronic obstructive pulmonary disease epidemiology; chronic obstructive pulmonary disease metabolism; genome-wide association study
Chronic obstructive pulmonary disease (COPD) is characterized by alveolar destruction and abnormal inflammatory responses to noxious stimuli. Surfactant protein–D (SFTPD) is immunomodulatory and essential to host defense. We hypothesized that polymorphisms in SFTPD could influence the susceptibility to COPD. We genotyped six single-nucleotide polymorphisms (SNPs) in surfactant protein D in 389 patients with COPD in the National Emphysema Treatment Trial (NETT) and 472 smoking control subjects from the Normative Aging Study (NAS). Case-control association analysis was performed using Cochran–Armitage trend tests and multivariate logistic regression. The replication of significant associations was attempted in the Boston Early-Onset COPD Study, the Evaluation of COPD Longitudinally to Identify Predictive Surrogate Endpoints (ECLIPSE) Study, and the Bergen Cohort. We also correlated SFTPD genotypes with serum concentrations of surfactant protein–D (SP-D) in the ECLIPSE Study. In the NETT–NAS case-control analysis, four SFTPD SNPs were associated with susceptibility to COPD: rs2245121 (P = 0.01), rs911887 (P = 0.006), rs6413520 (P = 0.004), and rs721917 (P = 0.006). In the family-based analysis of the Boston Early-Onset COPD Study, rs911887 was associated with prebronchodilator and postbronchodilator FEV1 (P = 0.003 and P = 0.02, respectively). An intronic SNP in SFTPD, rs7078012, was associated with COPD in the ECLIPSE Study and the Bergen Cohort. Multiple SFTPD SNPs were associated with serum SP-D concentrations in the ECLIPSE Study. We demonstrated an association of polymorphisms in SFTPD with COPD in multiple populations. We demonstrated a correlation between SFTPD SNPs and SP-D protein concentrations. The SNPs associated with COPD and SP-D concentrations differed, suggesting distinct genetic influences on susceptibility to COPD and SP-D concentrations.
COPD; surfactant protein–D; single-nucleotide polymorphisms; genetics
Rationale: Chronic obstructive pulmonary disease (COPD), characterized by airflow limitation, is a disorder with high phenotypic and genetic heterogeneity. Pulmonary emphysema is a major but variable component of COPD; familial data suggest that different components of COPD, such as emphysema, may be influenced by specific genetic factors.
Objectives: To identify genetic determinants of emphysema assessed through high-resolution chest computed tomography in individuals with COPD.
Methods: We performed a genome-wide association study (GWAS) of emphysema determined from chest computed tomography scans with a total of 2,380 individuals with COPD in three independent cohorts of white individuals from (1) a cohort from Bergen, Norway, (2) the Evaluation of COPD Longitudinally to Identify Predictive Surrogate Endpoints (ECLIPSE) Study, and (3) the National Emphysema Treatment Trial (NETT). We tested single-nucleotide polymorphism associations with the presence or absence of emphysema determined by radiologist assessment in two of the three cohorts and a quantitative emphysema trait (percentage of lung voxels less than –950 Hounsfield units) in all three cohorts.
Measurements and Main Results: We identified association of a single-nucleotide polymorphism in BICD1 with the presence or absence of emphysema (P = 5.2 × 10−7 with at least mild emphysema vs. control subjects; P = 4.8 × 10−8 with moderate and more severe emphysema vs. control subjects).
Conclusions: Our study suggests that genetic variants in BICD1 are associated with qualitative emphysema in COPD. Variants in BICD1 are associated with length of telomeres, which suggests that a mechanism linked to accelerated aging may be involved in the pathogenesis of emphysema.
Clinical trial registered with www.clinicaltrials.gov (NCT00292552).
emphysema; chronic obstructive pulmonary disease; BICD1; single-nucleotide polymorphism
Previous expression quantitative trait loci (eQTL) studies have performed genetic association studies for gene expression, but most of these studies examined lymphoblastoid cell lines from non-diseased individuals. We examined the genetics of gene expression in a relevant disease tissue from chronic obstructive pulmonary disease (COPD) patients to identify functional effects of known susceptibility genes and to find novel disease genes. By combining gene expression profiling on induced sputum samples from 131 COPD cases from the ECLIPSE Study with genomewide single nucleotide polymorphism (SNP) data, we found 4315 significant cis-eQTL SNP-probe set associations (3309 unique SNPs). The 3309 SNPs were tested for association with COPD in a genomewide association study (GWAS) dataset, which included 2940 COPD cases and 1380 controls. Adjusting for 3309 tests (p<1.5e-5), the two SNPs which were significantly associated with COPD were located in two separate genes in a known COPD locus on chromosome 15: CHRNA5 and IREB2. Detailed analysis of chromosome 15 demonstrated additional eQTLs for IREB2 mapping to that gene. eQTL SNPs for CHRNA5 mapped to multiple linkage disequilibrium (LD) bins. The eQTLs for IREB2 and CHRNA5 were not in LD. Seventy-four additional eQTL SNPs were associated with COPD at p<0.01. These were genotyped in two COPD populations, finding replicated associations with a SNP in PSORS1C1, in the HLA-C region on chromosome 6. Integrative analysis of GWAS and gene expression data from relevant tissue from diseased subjects has located potential functional variants in two known COPD genes and has identified a novel COPD susceptibility locus.
Rationale: Several family-based studies have identified genetic linkage for lung function and airflow obstruction to chromosome 2q.
Objectives: We hypothesized that merging results of high-resolution single nucleotide polymorphism (SNP) mapping in four separate populations would lead to the identification of chronic obstructive pulmonary disease (COPD) susceptibility genes on chromosome 2q.
Methods: Within the chromosome 2q linkage region, 2,843 SNPs were genotyped in 806 COPD cases and 779 control subjects from Norway, and 2,484 SNPs were genotyped in 309 patients with severe COPD from the National Emphysema Treatment Trial and 330 community control subjects. Significant associations from the combined results across the two case-control studies were followed up in 1,839 individuals from 603 families from the International COPD Genetics Network (ICGN) and in 949 individuals from 127 families in the Boston Early-Onset COPD Study.
Measurements and Main Results: Merging the results of the two case-control analyses, 14 of the 790 overlapping SNPs had a combined P < 0.01. Two of these 14 SNPs were consistently associated with COPD in the ICGN families. The association with one SNP, located in the gene XRCC5, was replicated in the Boston Early-Onset COPD Study, with a combined P = 2.51 × 10−5 across the four studies, which remains significant when adjusted for multiple testing (P = 0.02). Genotype imputation confirmed the association with SNPs in XRCC5.
Conclusions: By combining data from COPD genetic association studies conducted in four independent patient samples, we have identified XRCC5, an ATP-dependent DNA helicase, as a potential COPD susceptibility gene.
emphysema; genetic linkage; metaanalysis; single nucleotide polymorphism
Substantial evidence suggests that there is genetic susceptibility to chronic obstructive pulmonary disease (COPD). To identify common genetic risk variants, we performed a genome-wide association study in 2940 cases and 1380 smoking controls with normal lung function. We demonstrate a novel susceptibility locus at 4q22.1 in FAM13A (rs7671167, OR=0.76, P=8.6×10−8) and provide evidence of replication in one case-control and two family-based cohorts (for all studies, combined P=1.2×10−11).
There is considerable variability in the susceptibility of smokers to develop chronic obstructive pulmonary disease (COPD). The only known genetic risk factor is severe deficiency of α1-antitrypsin, which is present in 1–2% of individuals with COPD. We conducted a genome-wide association study (GWAS) in a homogenous case-control cohort from Bergen, Norway (823 COPD cases and 810 smoking controls) and evaluated the top 100 single nucleotide polymorphisms (SNPs) in the family-based International COPD Genetics Network (ICGN; 1891 Caucasian individuals from 606 pedigrees) study. The polymorphisms that showed replication were further evaluated in 389 subjects from the US National Emphysema Treatment Trial (NETT) and 472 controls from the Normative Aging Study (NAS) and then in a fourth cohort of 949 individuals from 127 extended pedigrees from the Boston Early-Onset COPD population. Logistic regression models with adjustments of covariates were used to analyze the case-control populations. Family-based association analyses were conducted for a diagnosis of COPD and lung function in the family populations. Two SNPs at the α-nicotinic acetylcholine receptor (CHRNA 3/5) locus were identified in the genome-wide association study. They showed unambiguous replication in the ICGN family-based analysis and in the NETT case-control analysis with combined p-values of 1.48×10−10, (rs8034191) and 5.74×10−10 (rs1051730). Furthermore, these SNPs were significantly associated with lung function in both the ICGN and Boston Early-Onset COPD populations. The C allele of the rs8034191 SNP was estimated to have a population attributable risk for COPD of 12.2%. The association of hedgehog interacting protein (HHIP) locus on chromosome 4 was also consistently replicated, but did not reach genome-wide significance levels. Genome-wide significant association of the HHIP locus with lung function was identified in the Framingham Heart study (Wilk et al., companion article in this issue of PLoS Genetics; doi:10.1371/journal.pgen.1000429). The CHRNA 3/5 and the HHIP loci make a significant contribution to the risk of COPD. CHRNA3/5 is the same locus that has been implicated in the risk of lung cancer.
There is considerable variability in the susceptibility of smokers to develop chronic obstructive pulmonary disease (COPD), which is a heritable multi-factorial trait. Identifying the genetic determinants of COPD risk will have tremendous public health importance. This study describes the first genome-wide association study (GWAS) in COPD. We conducted a GWAS in a homogenous case-control cohort from Norway and evaluated the top 100 single nucleotide polymorphisms in the family-based International COPD Genetics Network. The polymorphisms that showed replication were further evaluated in subjects from the US National Emphysema Treatment Trial and controls from the Normative Aging Study and then in a fourth cohort of extended pedigrees from the Boston Early-Onset COPD population. Two polymorphisms in the α-nicotinic acetylcholine receptor 3/5 locus on chromosome 15 showed unambiguous evidence of association with COPD. This locus has previously been implicated in both smoking behavior and risk of lung cancer, suggesting the possibility of multiple functional polymorphisms in the region or a single polymorphism with wide phenotypic consequences. The hedgehog interacting protein (HHIP) locus on chromosome 4, which is associated with COPD, is also a significant risk locus for COPD.