Motivation: For samples of unrelated individuals, we propose a general analysis framework in which hundreds of thousands of genetic loci can be tested simultaneously for association with complex phenotypes. The approach is built on spatial-clustering methodology, assuming that genetic loci that are associated with the target phenotype cluster in certain genomic regions. In contrast to standard methodology for multilocus analysis, which has focused on the dimension reduction of the data, our multilocus association-clustering test profits from the availability of large numbers of genetic loci by detecting clusters of loci that are associated with the phenotype.
Results: The approach is computationally fast and powerful, enabling the simultaneous association testing of large genomic regions. Even the entire genome or certain chromosomes can be tested simultaneously. We evaluate the properties of the approach in simulation studies. In an application to a genome-wide association study for chronic obstructive pulmonary disease, we illustrate the practical relevance of the proposed method by simultaneously testing all genotyped loci of the genome-wide association study and by testing each chromosome individually. Our findings suggest that statistical methodology that incorporates spatial-clustering information will be especially useful in whole-genome sequencing studies in which millions or billions of base pairs are recorded and grouped by genomic regions or genes, and are tested jointly for association.
Availability and implementation: Implementation of the approach is available upon request.
Supplementary data are available at Bioinformatics online.
Dyspnea is a cardinal symptom for cardiorespiratory diseases. No study has assessed worldwide variation in dyspnea prevalence or predictors of dyspnea.
We used cross-sectional data from population-based samples in 15 countries of the BOLD study to estimate prevalence of dyspnea in the full sample as well as in an a priori defined low-risk group (few risk factors or dyspnea-associated diseases). Dyspnea was defined by the modified Medical Research Council questions. We used ordered logistic regression analysis to study the association of dyspnea with site, sex, age, education, smoking habits, low/high BMI, self-reported disease, and spirometry results.
Of the 9,484 participants, 27% reported any dyspnea. In the low-risk subsample (N=4,329), 16% reported some dyspnea. In multivariate analyses, all covariates were associated with dyspnea, but only 13% of dyspnea variation was explained. Women reported more dyspnea than men (odds ratio ≈ 2.1). When forced vital capacity (FVC) fell below 60% of predicted, dyspnea was much more likely.
There was considerable geographical variation in dyspnea, even when we adjusted for known risk factors and spirometry results. We were only able to explain 13% of dyspnea variation.
Dyspnea; Lung function; Epidemiology; Multi-center study
We aimed to estimate incremental productivity losses (sick leave and disability) of spirometry-defined chronic obstructive pulmonary disease (COPD) in a population-based sample and in hospital-recruited patients with COPD. Furthermore, we examined predictors of productivity losses by multivariate analyses.
We performed four quarterly telephone interviews of 53 and 107 population-based patients with COPD and controls, as well as 102 hospital-recruited patients with COPD below retirement age. Information was gathered regarding annual productivity loss, exacerbations of respiratory symptoms and comorbidities. Incremental productivity losses were estimated by multivariate quantile median regression according to the human capital approach, adjusting for sex, age, smoking habits, education and lung function. Main effect variables were COPD/control status, number of comorbidities and exacerbations of respiratory symptoms.
Altogether 55%, 87% and 31% of population-based COPD cases, controls and hospital patients, respectively, had a paid job at baseline. The annual incremental productivity losses were 5.8 (95% CI 1.4 to 10.1) and 330.6 (95% CI 327.8 to 333.3) days, comparing population-recruited and hospital-recruited patients with COPD to controls, respectively. There were significantly higher productivity losses associated with female sex and less education. Additional adjustments for comorbidities, exacerbations and FEV1% predicted explained all productivity losses in the population-based sample, as well as nearly 40% of the productivity losses in hospital-recruited patients.
Annual incremental productivity losses were more than 50 times higher in hospital-recruited patients with COPD than in population-recruited patients with COPD. To ensure a precise estimation of societal burden, studies on patients with COPD should be population-based.
COPD epidemiology; Health Economist
Chronic bronchitis (CB) is one of the classic phenotypes of COPD. The aims of our study were to investigate genetic variants associated with COPD subjects with CB relative to smokers with normal spirometry, and to assess for genetic differences between subjects with CB and without CB within the COPD population.
We analyzed data from current and former smokers from three cohorts: the COPDGene Study; GenKOLS (Bergen, Norway); and the Evaluation of COPD Longitudinally to Identify Predictive Surrogate Endpoints (ECLIPSE). CB was defined as having a cough productive of phlegm on most days for at least 3 consecutive months per year for at least 2 consecutive years. CB COPD cases were defined as having both CB and at least moderate COPD based on spirometry. Our primary analysis used smokers with normal spirometry as controls; secondary analysis was performed using COPD subjects without CB as controls. Genotyping was performed on Illumina platforms; results were summarized using fixed-effect meta-analysis.
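Fixed-effect meta-analysis of the kind named above is commonly carried out by inverse-variance weighting of per-cohort log odds ratios. The following is a minimal sketch under that assumption; the function name `fixed_effect_meta` and the example estimates are hypothetical and are not taken from the study.

```python
import math

def fixed_effect_meta(log_ors, ses):
    """Inverse-variance fixed-effect pooling of per-cohort log odds ratios.

    Assumption: each cohort supplies a log odds ratio and its standard error.
    Returns (pooled log OR, pooled SE, z statistic, two-sided p-value).
    """
    if len(log_ors) != len(ses) or not log_ors:
        raise ValueError("need one standard error per effect estimate")
    weights = [1.0 / se ** 2 for se in ses]           # w_i = 1 / SE_i^2
    total_w = sum(weights)
    pooled = sum(w * b for w, b in zip(weights, log_ors)) / total_w
    pooled_se = math.sqrt(1.0 / total_w)
    z = pooled / pooled_se
    p = math.erfc(abs(z) / math.sqrt(2.0))            # two-sided normal p-value
    return pooled, pooled_se, z, p

# Hypothetical per-cohort estimates (illustrative only, not study results).
pooled, se, z, p = fixed_effect_meta([0.60, 0.70, 0.65], [0.20, 0.25, 0.30])
print(f"pooled OR = {math.exp(pooled):.2f}, SE = {se:.3f}, P = {p:.2e}")
```

Cohorts with smaller standard errors receive larger weights, so the pooled estimate is dominated by the most precisely estimated effects.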
For CB COPD relative to smoking controls, we identified a new genome-wide significant locus on chromosome 11p15.5 (rs34391416, OR = 1.93, P = 4.99 × 10−8) as well as significant associations of known COPD SNPs within FAM13A. In addition, a GWAS of CB relative to those without CB within COPD subjects showed suggestive evidence for association on 1q23.3 (rs114931935, OR = 1.88, P = 4.99 × 10−7).
We found genome-wide significant associations with CB COPD on 4q22.1 (FAM13A) and 11p15.5 (EFCAB4A, CHID1 and AP2A2), and a locus associated with CB within COPD subjects on 1q23.3 (RPL31P11 and ATF6). This study provides further evidence that genetic variants may contribute to phenotypic heterogeneity of COPD.
ClinicalTrials.gov NCT00608764, NCT00292552
Electronic supplementary material
The online version of this article (doi:10.1186/s12931-014-0113-2) contains supplementary material, which is available to authorized users.
Pulmonary disease; Chronic obstructive; Chronic bronchitis; Genome-wide association study
Even in large-scale genome-wide association studies, only a fraction of the true associations are detected at the genome-wide significance level. When few or no associations reach the significance threshold, one strategy is to follow up on the most promising candidates, i.e. the single nucleotide polymorphisms (SNPs) with the smallest association-test p-values, by genotyping them in additional studies. In this communication, we propose an overall test for genome-wide association studies that analyzes the SNPs with the most promising p-values simultaneously and thereby allows an early assessment of whether the follow-up of the selected SNPs is likely to be promising. We theoretically derive the properties of the proposed overall test under the null hypothesis and assess its power based on simulation studies. An application to a GWAS for chronic obstructive pulmonary disease suggests that there are true association signals among the top SNPs and that an additional follow-up study is promising.
genome wide association studies; snps association tests; chronic obstructive pulmonary disease; statistical genetics; multiple testing
Hedgehog Interacting Protein (HHIP) was implicated in chronic obstructive pulmonary disease (COPD) by genome-wide association studies (GWAS). However, it remains unclear how HHIP contributes to COPD pathogenesis. To identify genes regulated by HHIP, we performed gene expression microarray analysis in a human bronchial epithelial cell line (Beas-2B) stably infected with HHIP shRNAs. HHIP silencing led to differential expression of 296 genes; enrichment for variants nominally associated with COPD was found. Eighteen of the differentially expressed genes were validated by real-time PCR in Beas-2B cells. Seven of 11 validated genes tested in human COPD and control lung tissues demonstrated significant gene expression differences. Functional annotation indicated enrichment for extracellular matrix and cell growth genes. Network modeling demonstrated that the extracellular matrix and cell proliferation genes influenced by HHIP tended to be interconnected. Thus, we identified potential HHIP targets in human bronchial epithelial cells that may contribute to COPD pathogenesis.
Hedgehog interacting protein (HHIP); Gene expression profiling; COPD (Chronic obstructive pulmonary disease); extracellular matrix (ECM); network modeling
Chronic mucus hypersecretion (CMH) is associated with an increased frequency of respiratory infections, excess lung function decline, and increased hospitalisation and mortality rates in the general population. It is associated with smoking, but it is unknown why only a minority of smokers develops CMH. A plausible explanation for this phenomenon is a predisposing genetic constitution. Therefore, we performed a genome wide association (GWA) study of CMH in Caucasian populations.
GWA analysis was performed in the NELSON-study using the Illumina 610 array, followed by replication and meta-analysis in 11 additional cohorts. In total 2,704 subjects with, and 7,624 subjects without CMH were included, all current or former heavy smokers (≥20 pack-years). Additional studies were performed to test the functional relevance of the most significant single nucleotide polymorphism (SNP).
A strong association with CMH, consistent across all cohorts, was observed with rs6577641 (p = 4.25×10−6, OR = 1.17), located in intron 9 of the special AT-rich sequence-binding protein 1 locus (SATB1) on chromosome 3. The risk allele (G) was associated with higher mRNA expression of SATB1 (p = 4.3×10−9) in lung tissue. Presence of CMH was associated with increased SATB1 mRNA expression in bronchial biopsies from COPD patients. SATB1 expression was induced during differentiation of primary human bronchial epithelial cells in culture.
Our findings that SNP rs6577641 is associated with CMH in multiple cohorts and is a cis-eQTL for SATB1, together with our additional observation that SATB1 expression increases during epithelial differentiation, provide suggestive evidence that SATB1 is a gene that affects CMH.
Cigarette smoking is the major environmental risk factor for chronic obstructive pulmonary disease (COPD). Genome-wide association studies have provided compelling associations for three loci with COPD. In this study, we aimed to estimate direct, i.e., independent from smoking, and indirect effects of those loci on COPD development using mediation analysis. We included a total of 3,424 COPD cases and 1,872 unaffected controls with data on two smoking-related phenotypes: lifetime average smoking intensity and cumulative exposure to tobacco smoke (pack years). Our analysis revealed that effects of two linked variants (rs1051730 and rs8034191) in the AGPHD1/CHRNA3 cluster on COPD development are significantly, yet not entirely, mediated by the smoking-related phenotypes. Approximately 30 % of the total effect of variants in the AGPHD1/CHRNA3 cluster on COPD development was mediated by pack years. Simultaneous analysis of modestly (r2 = 0.21) linked markers in CHRNA3 and IREB2 revealed that an even larger (~42 %) proportion of the total effect of the CHRNA3 locus on COPD was mediated by pack years after adjustment for an IREB2 single nucleotide polymorphism. This study confirms the existence of direct effects of the AGPHD1/CHRNA3, IREB2, FAM13A and HHIP loci on COPD development. While the association of the AGPHD1/CHRNA3 locus with COPD is significantly mediated by smoking-related phenotypes, IREB2 appears to affect COPD independently of smoking.
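The "proportion mediated" quoted above is conventionally the indirect effect divided by the total effect. The sketch below assumes effects expressed on a scale where the total effect is the sum of the direct and indirect effects; the function name and the input values are hypothetical and were chosen only to yield a proportion near 30%.

```python
def proportion_mediated(direct, indirect):
    """Share of the total effect that passes through the mediator.

    Assumption: total effect = direct effect + indirect effect.
    """
    total = direct + indirect
    if total == 0:
        raise ValueError("total effect is zero; proportion is undefined")
    return indirect / total

# Hypothetical effect sizes (illustrative only, not estimates from the study).
share = proportion_mediated(direct=0.14, indirect=0.06)
print(f"proportion mediated by pack years: {share:.0%}")
```

A value below 100% corresponds to partial mediation, i.e. a remaining direct effect of the variant that is independent of the smoking phenotype.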
Rationale: A genome-wide association study (GWAS) for circulating chronic obstructive pulmonary disease (COPD) biomarkers could identify genetic determinants of biomarker levels and COPD susceptibility.
Objectives: To identify genetic variants of circulating protein biomarkers and novel genetic determinants of COPD.
Methods: GWAS was performed for two pneumoproteins, Clara cell secretory protein (CC16) and surfactant protein D (SP-D), and five systemic inflammatory markers (C-reactive protein, fibrinogen, IL-6, IL-8, and tumor necrosis factor-α) in 1,951 subjects with COPD. For genome-wide significant single nucleotide polymorphisms (SNPs) (P < 1 × 10−8), association with COPD susceptibility was tested in 2,939 cases with COPD and 1,380 smoking control subjects. The association of candidate SNPs with mRNA expression in induced sputum was also elucidated.
Measurements and Main Results: Genome-wide significant susceptibility loci affecting biomarker levels were found only for the two pneumoproteins. Two discrete loci affecting CC16, one region near the CC16 coding gene (SCGB1A1) on chromosome 11 and another locus approximately 25 Mb away from SCGB1A1, were identified, whereas multiple SNPs on chromosomes 6 and 16, in addition to SNPs near SFTPD, had genome-wide significant associations with SP-D levels. Several SNPs affecting circulating CC16 levels were significantly associated with sputum mRNA expression of SCGB1A1 (P = 0.009–0.03). Several SNPs highly associated with CC16 or SP-D levels were nominally associated with COPD in a collaborative GWAS (P = 0.001–0.049), although these COPD associations were not replicated in two additional cohorts.
Conclusions: Distant genetic loci and biomarker-coding genes affect circulating levels of COPD-related pneumoproteins. A subset of these protein quantitative trait loci may influence their gene expression in the lung and/or COPD susceptibility.
Clinical trial registered with www.clinicaltrials.gov (NCT00292552).
biomarker; chronic obstructive pulmonary disease; genome-wide association study
Due to the pleiotropic effects of nitric oxide (NO) within the lungs, it is likely that NO is a significant factor in the pathogenesis of chronic obstructive pulmonary disease (COPD). The aim of this study was to test for association between single nucleotide polymorphisms (SNPs) in three NO synthase (NOS) genes and lung function, as well as to examine gene expression and protein levels in relation to the genetic variation.
One SNP in each NOS gene (neuronal NOS (NOS1), inducible NOS (NOS2), and endothelial NOS (NOS3)) was genotyped in the Lung Health Study (LHS) and correlated with lung function. One SNP (rs1800779) was also analyzed for association with COPD and lung function in four COPD case–control populations. Lung tissue expression of NOS3 mRNA and protein was tested in individuals of known genotype for rs1800779. Immunohistochemistry of lung tissue was used to localize NOS3 expression.
For the NOS3 rs1800779 SNP, the baseline forced expiratory volume in one second in the LHS was significantly higher in the combined AG + GG genotypic groups compared with the AA genotypic group. Gene expression and protein levels in lung tissue were significantly lower in subjects with the AG + GG genotypes than in AA subjects. NOS3 protein was expressed in the airway epithelium and subjects with the AA genotype demonstrated higher NOS3 expression compared with AG and GG individuals. However, we were not able to replicate the associations with COPD or lung function in the other COPD study groups.
Variants in the NOS genes were not associated with lung function or COPD status. However, the G allele of rs1800779 resulted in a decrease of NOS3 gene expression and protein levels and this has implications for the numerous disease states that have been associated with this polymorphism.
Chronic obstructive pulmonary disease; Nitric oxide synthase; Polymorphism; Gene expression
Rationale: Genome-wide association studies (GWAS) have identified loci influencing lung function, but fewer genes influencing chronic obstructive pulmonary disease (COPD) are known.
Objectives: Perform meta-analyses of GWAS for airflow obstruction, a key pathophysiologic characteristic of COPD assessed by spirometry, in population-based cohorts examining all participants, ever smokers, never smokers, asthma-free participants, and more severe cases.
Methods: Fifteen cohorts were studied for discovery (3,368 affected; 29,507 unaffected), and a population-based family study and a meta-analysis of case-control studies were used for replication and regional follow-up (3,837 cases; 4,479 control subjects). Airflow obstruction was defined as FEV1 and its ratio to FVC (FEV1/FVC) both less than their respective lower limits of normal as determined by published reference equations.
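The case definition above can be expressed as a simple rule. This is a minimal sketch that assumes the lower limits of normal (LLN) have already been computed from a reference equation and are supplied by the caller; the `Spirometry` record and the example values are hypothetical.

```python
from dataclasses import dataclass

@dataclass
class Spirometry:
    fev1: float        # measured FEV1 in litres
    fvc: float         # measured FVC in litres
    fev1_lln: float    # LLN for FEV1, from a published reference equation
    ratio_lln: float   # LLN for FEV1/FVC, from a published reference equation

def has_airflow_obstruction(s: Spirometry) -> bool:
    """True when FEV1 and FEV1/FVC are both below their lower limits of normal."""
    if s.fvc <= 0:
        raise ValueError("FVC must be positive")
    return s.fev1 < s.fev1_lln and (s.fev1 / s.fvc) < s.ratio_lln

# Hypothetical participants (illustrative values only).
affected = Spirometry(fev1=1.8, fvc=3.2, fev1_lln=2.4, ratio_lln=0.68)
unaffected = Spirometry(fev1=2.6, fvc=4.2, fev1_lln=2.4, ratio_lln=0.68)
print(has_airflow_obstruction(affected), has_airflow_obstruction(unaffected))
```

Requiring both criteria means that a participant with a low ratio but a preserved FEV1, such as the second example, is not counted as affected.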
Measurements and Main Results: The discovery meta-analyses identified one region on chromosome 15q25.1 meeting genome-wide significance in ever smokers that includes AGPHD1, IREB2, and CHRNA5/CHRNA3 genes. The region was also modestly associated among never smokers. Gene expression studies confirmed the presence of CHRNA5/3 in lung, airway smooth muscle, and bronchial epithelial cells. A single-nucleotide polymorphism in HTR4, a gene previously related to FEV1/FVC, achieved genome-wide statistical significance in combined meta-analysis. Top single-nucleotide polymorphisms in ADAM19, RARB, PPAP2B, and ADAMTS19 were nominally replicated in the COPD meta-analysis.
Conclusions: These results suggest an important role for the CHRNA5/3 region as a genetic risk factor for airflow obstruction that may be independent of smoking and implicate the HTR4 gene in the etiology of airflow obstruction.
chronic obstructive pulmonary disease; single-nucleotide polymorphism; genes
Smoking is a leading global cause of disease and mortality. We performed a genomewide meta-analytic association study of smoking-related behavioral traits in a total sample of 41,150 individuals drawn from 20 disease, population, and control cohorts. Our analysis confirmed an effect on smoking quantity (SQ) at a locus on 15q25 (P=9.45e-19) that includes three genes encoding neuronal nicotinic acetylcholine receptor subunits (CHRNA5, CHRNA3, CHRNB4). We used data from the 1000 Genomes project to investigate the region using imputation, which allowed analysis of virtually all common variants in the region and offered a five-fold increase in coverage over the HapMap. This increased the spectrum of potentially causal single nucleotide polymorphisms (SNPs), which included a novel SNP that showed the highest significance, rs55853698, located within the promoter region of CHRNA5. Conditional analysis also identified a secondary locus (rs6495308) in CHRNA3.
The genetic risk factors for chronic obstructive pulmonary disease (COPD) are still largely unknown. To date, genome-wide association studies (GWASs) of limited size have identified several novel risk loci for COPD at CHRNA3/CHRNA5/IREB2, HHIP and FAM13A; additional loci may be identified through larger studies. We performed a GWAS using a total of 3499 cases and 1922 control subjects from four cohorts: the Evaluation of COPD Longitudinally to Identify Predictive Surrogate Endpoints (ECLIPSE); the Normative Aging Study (NAS) and National Emphysema Treatment Trial (NETT); Bergen, Norway (GenKOLS); and the COPDGene study. Genotyping was performed on Illumina platforms with additional markers imputed using 1000 Genomes data; results were summarized using fixed-effect meta-analysis. We identified a new genome-wide significant locus on chromosome 19q13 (rs7937, OR = 0.74, P = 2.9 × 10−9). Genotyping this single nucleotide polymorphism (SNP) and another nearby SNP in linkage disequilibrium (rs2604894) in 2859 subjects from the family-based International COPD Genetics Network study (ICGN) demonstrated supportive evidence for association for COPD (P = 0.28 and 0.11 for rs7937 and rs2604894), pre-bronchodilator FEV1 (P = 0.08 and 0.04) and severe (GOLD 3&4) COPD (P = 0.09 and 0.017). This region includes RAB4B, EGLN2, MIA and CYP2A6, and has previously been identified in association with cigarette smoking behavior.
Two recent metaanalyses of genome-wide association studies conducted by the CHARGE and SpiroMeta consortia identified novel loci yielding evidence of association at or near genome-wide significance (GWS) with FEV1 and FEV1/FVC. We hypothesized that a subset of these markers would also be associated with chronic obstructive pulmonary disease (COPD) susceptibility. Thirty-two single-nucleotide polymorphisms (SNPs) in or near 17 genes in 11 previously identified GWS spirometric genomic regions were tested for association with COPD status in four COPD case-control study samples (NETT/NAS, the Norway case-control study, ECLIPSE, and the first 1,000 subjects in COPDGene; total sample size, 3,456 cases and 1,906 controls). In addition to testing the 32 spirometric GWS SNPs, we tested a dense panel of imputed HapMap2 SNP markers from the 17 genes located near the 32 GWS SNPs and in a set of 21 well studied COPD candidate genes. Of the previously identified GWS spirometric genomic regions, three loci harbored SNPs associated with COPD susceptibility at a 5% false discovery rate: the 4q24 locus including FLJ20184/INTS12/GSTCD/NPNT, the 6p21 locus including AGER and PPT2, and the 5q33 locus including ADAM19. In conclusion, markers previously associated at or near GWS with spirometric measures were tested for association with COPD status in data from four COPD case-control studies, and three loci showed evidence of association with COPD susceptibility at a 5% false discovery rate.
Traditional genome-wide association studies (GWAS) of large cohorts of subjects with chronic obstructive pulmonary disease (COPD) have successfully identified novel candidate genes, but several other plausible loci do not meet strict criteria for genome-wide significance after correction for multiple testing.
We hypothesize that by applying unbiased weights derived from unique populations we can identify additional COPD susceptibility loci.
We performed a homozygosity haplotype analysis on a group of subjects with and without COPD to identify regions of conserved homozygosity (RCHH). Weights were constructed based on the frequency of these RCHH in cases vs. controls, and used to adjust the P values from a large collaborative GWAS of COPD.
We identified 2,318 regions of conserved homozygosity, of which 576 were significantly (P < .05) overrepresented in cases. After applying the weights constructed from these regions to a collaborative GWAS of COPD, we identified two single nucleotide polymorphisms in a novel gene (FGF7) that gained genome-wide significance by the false discovery rate method. In a follow-up analysis, both SNPs (rs12591300 and rs4480740) were significantly associated with COPD in an independent population (combined P values of 7.9E-07 and 2.8E-06 respectively). In another independent population, increased lung tissue FGF7 expression was associated with worse measures of lung function.
Weights constructed from a homozygosity haplotype analysis of an isolated population successfully identify novel genetic associations from a GWAS on a separate population. This method can be used to identify promising candidate genes that fail to meet strict correction for multiple testing.
Chronic Obstructive Pulmonary Disease (COPD) is defined by post-bronchodilator spirometry. Data on “normal values” come predominantly from pre-bronchodilator spirometry. The effects of this on diagnosis are unknown.
Lower limits of normal (LLN) were estimated from “normal” participants in the Burden of Obstructive Lung Disease (BOLD) programme. Values separately derived using pre- and post-bronchodilator spirometry were compared. Sensitivity and specificity of criteria derived from pre-bronchodilator spirometry and pre-bronchodilator spirometry adjusted by a constant were assessed in the remaining population. The “gold standard” was the LLN for the post-bronchodilator spirometry in the “normal population”. For FEV1/FVC, sensitivity and specificity of criteria were also assessed when a fixed value of < 70% was used rather than LLN.
Of 6,600 participants with full data, 1,354 were defined as “normal”. Mean differences between pre- and post-bronchodilator measurements were small and the Bland-Altman plots showed no association between difference and mean value. Compared with using the gold standard, however, tests using pre-bronchodilator spirometry had a sensitivity and specificity for detecting a low FEV1 of 78.4% and 100%, a low FVC of 99.8% and 99.1% and a low FEV1/FVC ratio of 65% and 100%. Adjusting this by a constant improved the sensitivity without substantially altering the specificity for FEV1 (99%, 99.8%), FVC (97.4%, 99.9%) and FEV1/FVC (98.7%, 99.5%).
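Sensitivity and specificity as reported above compare a classification against a gold standard. The sketch below shows the calculation in general terms; the function name, the measured ratios and the two limits are hypothetical and are not BOLD estimates.

```python
def sensitivity_specificity(test_low, gold_low):
    """Return (sensitivity, specificity) of a test against a gold standard.

    Both arguments are equal-length sequences of booleans, where True means
    "value classified as below the lower limit of normal".
    """
    if len(test_low) != len(gold_low):
        raise ValueError("sequences must have equal length")
    pairs = list(zip(test_low, gold_low))
    tp = sum(t and g for t, g in pairs)                  # true positives
    fn = sum((not t) and g for t, g in pairs)            # false negatives
    tn = sum((not t) and (not g) for t, g in pairs)      # true negatives
    fp = sum(t and (not g) for t, g in pairs)            # false positives
    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("need at least one gold-standard positive and negative")
    return tp / (tp + fn), tn / (tn + fp)

# Hypothetical FEV1/FVC ratios classified against two illustrative limits.
ratios = [0.62, 0.66, 0.69, 0.71, 0.74, 0.80]
post_lln, pre_lln = 0.70, 0.68
gold = [r < post_lln for r in ratios]    # gold standard classification
test = [r < pre_lln for r in ratios]     # classification under evaluation
sens, spec = sensitivity_specificity(test, gold)
print(f"sensitivity = {sens:.2f}, specificity = {spec:.2f}")
```

In this toy example the stricter limit misses one gold-standard case while producing no false positives, which is the same qualitative pattern of reduced sensitivity with preserved specificity.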
Using pre-bronchodilator spirometry to derive norms for lung function reduces sensitivity compared to a post-bronchodilator gold standard. Adjustment of these values by a constant can improve validity of the test.
Normal values; BOLD study; European population
Cigarette smoking is a major risk factor for COPD and COPD severity. Previous genome-wide association studies (GWAS) have identified numerous single nucleotide polymorphisms (SNPs) associated with the number of cigarettes smoked per day (CPD) and a Dopamine Beta-Hydroxylase (DBH) locus associated with smoking cessation in multiple populations.
To identify SNPs associated with lifetime average and current CPD, age at smoking initiation, and smoking cessation in COPD subjects.
GWAS were conducted in 4 independent cohorts encompassing 3,441 ever-smoking COPD subjects (GOLD stage II or higher). Untyped SNPs were imputed using the HapMap (phase II) panel. Results from all cohorts were meta-analyzed.
Several SNPs near the HLA region on chromosome 6p21 and in an intergenic region on chromosome 2q21 showed associations with age at smoking initiation, both with the lowest p=2×10−7. No SNPs were associated with lifetime average CPD, current CPD or smoking cessation with p<10−6. Nominally significant associations with candidate SNPs within alpha-nicotinic acetylcholine receptors 3/5 (CHRNA3/CHRNA5; e.g. p=0.00011 for SNP rs1051730) and Cytochrome P450 2A6 (CYP2A6; e.g. p=2.78×10−5 for a nonsynonymous SNP rs1801272) regions were observed for lifetime average CPD; however, only CYP2A6 showed evidence of significant association with current CPD. A candidate SNP (rs3025343) in DBH was significantly (p=0.015) associated with smoking cessation.
We identified two candidate regions associated with age at smoking initiation in COPD subjects. Associations of CHRNA3/CHRNA5 and CYP2A6 loci with CPD and DBH with smoking cessation are also likely of importance in the smoking behaviors of COPD patients.
Chronic Obstructive Pulmonary Disease (COPD); Genome Wide Association study (GWAS); smoking behaviors; Single Nucleotide Polymorphism (SNP)
Cachexia, whether assessed by body mass index (BMI) or fat-free mass index (FFMI), affects a significant proportion of patients with chronic obstructive pulmonary disease (COPD), and is an independent risk factor for increased mortality, increased emphysema, and more severe airflow obstruction. The variable development of cachexia among patients with COPD suggests a role for genetic susceptibility. The objective of the present study was to determine genetic susceptibility loci involved in the development of low BMI and FFMI in subjects with COPD. A genome-wide association study (GWAS) of BMI was conducted in three independent cohorts of European descent with Global Initiative for Chronic Obstructive Lung Disease stage II or higher COPD: Evaluation of COPD Longitudinally to Identify Predictive Surrogate End-Points (ECLIPSE; n = 1,734); Norway-Bergen cohort (n = 851); and a subset of subjects from the National Emphysema Treatment Trial (NETT; n = 365). A genome-wide association of FFMI was conducted in two of the cohorts (ECLIPSE and Norway). In the combined analyses, a significant association was found between rs8050136, located in the first intron of the fat mass and obesity–associated (FTO) gene, and BMI (P = 4.97 × 10−7) and FFMI (P = 1.19 × 10−7). We replicated the association in a fourth, independent cohort consisting of 502 subjects with COPD from COPDGene (P = 6 × 10−3). Within the largest contributing cohort of our analysis, lung function, as assessed by forced expiratory volume at 1 second, varied significantly by FTO genotype. Our analysis suggests a potential role for the FTO locus in the determination of anthropometric measures associated with COPD.
chronic obstructive pulmonary disease genetics; chronic obstructive pulmonary disease epidemiology; chronic obstructive pulmonary disease metabolism; genome-wide association study
Chronic obstructive pulmonary disease (COPD) is characterized by alveolar destruction and abnormal inflammatory responses to noxious stimuli. Surfactant protein–D (SFTPD) is immunomodulatory and essential to host defense. We hypothesized that polymorphisms in SFTPD could influence the susceptibility to COPD. We genotyped six single-nucleotide polymorphisms (SNPs) in surfactant protein D in 389 patients with COPD in the National Emphysema Treatment Trial (NETT) and 472 smoking control subjects from the Normative Aging Study (NAS). Case-control association analysis was performed using Cochran–Armitage trend tests and multivariate logistic regression. The replication of significant associations was attempted in the Boston Early-Onset COPD Study, the Evaluation of COPD Longitudinally to Identify Predictive Surrogate Endpoints (ECLIPSE) Study, and the Bergen Cohort. We also correlated SFTPD genotypes with serum concentrations of surfactant protein–D (SP-D) in the ECLIPSE Study. In the NETT–NAS case-control analysis, four SFTPD SNPs were associated with susceptibility to COPD: rs2245121 (P = 0.01), rs911887 (P = 0.006), rs6413520 (P = 0.004), and rs721917 (P = 0.006). In the family-based analysis of the Boston Early-Onset COPD Study, rs911887 was associated with prebronchodilator and postbronchodilator FEV1 (P = 0.003 and P = 0.02, respectively). An intronic SNP in SFTPD, rs7078012, was associated with COPD in the ECLIPSE Study and the Bergen Cohort. Multiple SFTPD SNPs were associated with serum SP-D concentrations in the ECLIPSE Study. We demonstrated an association of polymorphisms in SFTPD with COPD in multiple populations. We demonstrated a correlation between SFTPD SNPs and SP-D protein concentrations. The SNPs associated with COPD and SP-D concentrations differed, suggesting distinct genetic influences on susceptibility to COPD and SP-D concentrations.
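The Cochran–Armitage trend test named above evaluates a dose-response relationship between allele count and case status in a 2 × 3 genotype table. The following is a minimal sketch of the standard form of the statistic, assuming additive scores (0, 1, 2); the function name and the genotype counts are hypothetical and are not data from the study.

```python
import math

def cochran_armitage_trend(cases, controls, scores=(0, 1, 2)):
    """Cochran-Armitage trend test for a 2 x k genotype table.

    cases, controls: counts per genotype (e.g. AA, AB, BB).
    scores: per-genotype weights; (0, 1, 2) encodes an additive model.
    Returns (chi-square statistic with 1 df, p-value).
    """
    if not (len(cases) == len(controls) == len(scores)):
        raise ValueError("cases, controls and scores must have equal length")
    col = [r + s for r, s in zip(cases, controls)]     # genotype totals n_i
    n = sum(col)                                       # total sample size N
    r_tot = sum(cases)                                 # number of cases R
    s_tot = n - r_tot                                  # number of controls S
    sum_tr = sum(t * r for t, r in zip(scores, cases))
    sum_tn = sum(t * c for t, c in zip(scores, col))
    sum_t2n = sum(t * t * c for t, c in zip(scores, col))
    numerator = n * (n * sum_tr - r_tot * sum_tn) ** 2
    denominator = r_tot * s_tot * (n * sum_t2n - sum_tn ** 2)
    if denominator == 0:
        raise ValueError("degenerate table: no variation in status or genotype")
    chi2 = numerator / denominator
    p = math.erfc(math.sqrt(chi2 / 2.0))               # chi-square tail, 1 df
    return chi2, p

# Hypothetical genotype counts (illustrative only).
chi2, p = cochran_armitage_trend(cases=[120, 180, 89], controls=[190, 210, 72])
print(f"chi-square = {chi2:.2f}, P = {p:.4f}")
```

Unlike a general 2-degree-of-freedom genotype test, the trend test concentrates its power on alternatives in which risk changes monotonically with the number of risk alleles.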
COPD; surfactant protein–D; single-nucleotide polymorphisms; genetics
Rationale: Chronic obstructive pulmonary disease (COPD), characterized by airflow limitation, is a disorder with high phenotypic and genetic heterogeneity. Pulmonary emphysema is a major but variable component of COPD; familial data suggest that different components of COPD, such as emphysema, may be influenced by specific genetic factors.
Objectives: To identify genetic determinants of emphysema assessed through high-resolution chest computed tomography in individuals with COPD.
Methods: We performed a genome-wide association study (GWAS) of emphysema determined from chest computed tomography scans with a total of 2,380 individuals with COPD in three independent cohorts of white individuals from (1) a cohort from Bergen, Norway, (2) the Evaluation of COPD Longitudinally to Identify Predictive Surrogate Endpoints (ECLIPSE) Study, and (3) the National Emphysema Treatment Trial (NETT). We tested single-nucleotide polymorphism associations with the presence or absence of emphysema determined by radiologist assessment in two of the three cohorts and a quantitative emphysema trait (percentage of lung voxels less than –950 Hounsfield units) in all three cohorts.
Measurements and Main Results: We identified an association of a single-nucleotide polymorphism in BICD1 with the presence or absence of emphysema (P = 5.2 × 10⁻⁷ with at least mild emphysema vs. control subjects; P = 4.8 × 10⁻⁸ with moderate and more severe emphysema vs. control subjects).
Conclusions: Our study suggests that genetic variants in BICD1 are associated with qualitative emphysema in COPD. Variants in BICD1 are associated with length of telomeres, which suggests that a mechanism linked to accelerated aging may be involved in the pathogenesis of emphysema.
Clinical trial registered with www.clinicaltrials.gov (NCT00292552).
emphysema; chronic obstructive pulmonary disease; BICD1; single-nucleotide polymorphism
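The quantitative emphysema trait used in the abstract above, the percentage of lung voxels below –950 Hounsfield units, can be sketched as below. This is an illustration only: the function name `percent_low_attenuation` and the flat list of voxel values are assumptions, and segmentation of the lung from the CT scan is taken as already done.

```python
def percent_low_attenuation(lung_voxels_hu, threshold_hu=-950):
    """Percentage of lung voxels with attenuation strictly below threshold_hu.

    lung_voxels_hu: iterable of Hounsfield-unit values for voxels already
    segmented as lung (segmentation is outside the scope of this sketch).
    """
    voxels = list(lung_voxels_hu)
    if not voxels:
        raise ValueError("no lung voxels supplied")
    below = sum(1 for hu in voxels if hu < threshold_hu)
    return 100.0 * below / len(voxels)
```

For the hypothetical values `[-1000, -960, -950, -900]`, two of the four voxels fall strictly below the threshold, so the function returns 50.0.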
Previous expression quantitative trait loci (eQTL) studies have performed genetic association studies for gene expression, but most of these studies examined lymphoblastoid cell lines from non-diseased individuals. We examined the genetics of gene expression in a relevant disease tissue from chronic obstructive pulmonary disease (COPD) patients to identify functional effects of known susceptibility genes and to find novel disease genes. By combining gene expression profiling on induced sputum samples from 131 COPD cases from the ECLIPSE Study with genomewide single nucleotide polymorphism (SNP) data, we found 4315 significant cis-eQTL SNP-probe set associations (3309 unique SNPs). The 3309 SNPs were tested for association with COPD in a genomewide association study (GWAS) dataset, which included 2940 COPD cases and 1380 controls. Adjusting for 3309 tests (p<1.5e-5), the two SNPs which were significantly associated with COPD were located in two separate genes in a known COPD locus on chromosome 15: CHRNA5 and IREB2. Detailed analysis of chromosome 15 demonstrated additional eQTLs for IREB2 mapping to that gene. eQTL SNPs for CHRNA5 mapped to multiple linkage disequilibrium (LD) bins. The eQTLs for IREB2 and CHRNA5 were not in LD. Seventy-four additional eQTL SNPs were associated with COPD at p<0.01. These were genotyped in two COPD populations, finding replicated associations with a SNP in PSORS1C1, in the HLA-C region on chromosome 6. Integrative analysis of GWAS and gene expression data from relevant tissue from diseased subjects has located potential functional variants in two known COPD genes and has identified a novel COPD susceptibility locus.
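The threshold quoted above (p<1.5e-5 for 3309 tests) is consistent with a Bonferroni correction at a family-wise level of 0.05. The abstract does not state that level, so it is assumed in this short sketch. The function names are illustrative.

```python
def bonferroni_threshold(n_tests, alpha=0.05):
    """Per-test significance threshold under a Bonferroni correction."""
    if n_tests < 1:
        raise ValueError("n_tests must be at least 1")
    return alpha / n_tests


def is_significant(p_value, n_tests, alpha=0.05):
    """True if p_value passes the Bonferroni-adjusted threshold."""
    return p_value < bonferroni_threshold(n_tests, alpha)


threshold = bonferroni_threshold(3309)  # roughly 1.5e-5
```

The same function gives the per-test threshold for any other number of tested SNPs.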
Rationale: Several family-based studies have identified genetic linkage for lung function and airflow obstruction to chromosome 2q.
Objectives: We hypothesized that merging results of high-resolution single nucleotide polymorphism (SNP) mapping in four separate populations would lead to the identification of chronic obstructive pulmonary disease (COPD) susceptibility genes on chromosome 2q.
Methods: Within the chromosome 2q linkage region, 2,843 SNPs were genotyped in 806 COPD cases and 779 control subjects from Norway, and 2,484 SNPs were genotyped in 309 patients with severe COPD from the National Emphysema Treatment Trial and 330 community control subjects. Significant associations from the combined results across the two case-control studies were followed up in 1,839 individuals from 603 families from the International COPD Genetics Network (ICGN) and in 949 individuals from 127 families in the Boston Early-Onset COPD Study.
Measurements and Main Results: Merging the results of the two case-control analyses, 14 of the 790 overlapping SNPs had a combined P < 0.01. Two of these 14 SNPs were consistently associated with COPD in the ICGN families. The association with one SNP, located in the gene XRCC5, was replicated in the Boston Early-Onset COPD Study, with a combined P = 2.51 × 10⁻⁵ across the four studies, which remains significant when adjusted for multiple testing (P = 0.02). Genotype imputation confirmed the association with SNPs in XRCC5.
Conclusions: By combining data from COPD genetic association studies conducted in four independent patient samples, we have identified XRCC5, an ATP-dependent DNA helicase, as a potential COPD susceptibility gene.
emphysema; genetic linkage; metaanalysis; single nucleotide polymorphism
The objective of the present study was to determine the association between CT phenotypes (emphysema by low attenuation area and bronchitis by airway wall thickness) and body composition parameters in a large cohort of subjects with and without COPD. In 452 COPD subjects and 459 subjects without COPD, CT scans were performed to determine emphysema (%LAA), airway wall thickness (AWT-Pi10), and lung mass. Muscle wasting, defined by the fat-free mass index (FFMI), was assessed by bioelectrical impedance. In both men and women with COPD, FFMI was negatively associated with %LAA. Fat mass index (FMI) was positively associated with AWT-Pi10 in subjects both with and without COPD. Among the subjects with muscle wasting, the percentage emphysema was high, but the predictive value was moderate. In conclusion, the present study strengthens the hypothesis that the subgroup of COPD cases with muscle wasting has emphysema. Airway wall thickness is positively associated with FMI in subjects both with and without COPD.
Mortality statistics represent important endpoints in epidemiological studies. The diagnostic validity of cerebral stroke and ischemic heart disease recorded as the underlying cause of death in Norwegian mortality statistics was assessed by using mortality data of participants in the Bergen Clinical Blood Pressure Study in Norway and autopsy records from the Gade Institute in Bergen. In the 41 years of the study (1965–2005) 4,387 subjects had died and 1,140 (26%) had undergone a post mortem examination; 548 (12%) died from cerebral stroke and 1,120 (24%) from ischemic heart disease according to the mortality statistics, compared to 113 (10%) strokes and 323 (28%) coronary events registered in the autopsy records. The sensitivity and positive predictive value of fatal cerebral strokes in the mortality statistics were 0.75, 95% confidence interval (CI) [0.66, 0.83] and 0.86 [0.77, 0.92], respectively, whereas those of coronary deaths were 0.87 [0.84, 0.91] and 0.85 [0.81, 0.89], respectively. Cohen’s kappa coefficients were 0.78 [0.72, 0.84] for stroke and 0.80 [0.76, 0.84] for coronary deaths. In addition to female gender and increasing age at death, cerebral stroke was a negative predictor of an autopsy being carried out (odds ratio (OR) 0.69, 95% CI [0.54, 0.87]), whereas death from coronary heart disease was not (OR 1.14, 95% CI [0.97, 1.33]), both adjusted for gender and age at death. There was substantial agreement between mortality statistics and autopsy findings for both fatal strokes and coronary deaths. Selection for post mortem examinations was associated with age, gender and cause of death.
Autopsy; Stroke; Ischemic heart disease; Death certification; Validity; Mortality statistics
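The agreement measures reported above (sensitivity, positive predictive value and Cohen's kappa) can all be derived from one 2×2 table that cross-classifies the death certificate against the autopsy finding. The sketch below shows the arithmetic. The function name and the counts in the usage note are hypothetical and are not taken from the study.

```python
def diagnostic_agreement(tp, fp, fn, tn):
    """Sensitivity, positive predictive value and Cohen's kappa from a 2x2 table.

    tp: certificate positive, autopsy positive
    fp: certificate positive, autopsy negative
    fn: certificate negative, autopsy positive
    tn: certificate negative, autopsy negative
    """
    total = tp + fp + fn + tn
    sensitivity = tp / (tp + fn)
    ppv = tp / (tp + fp)

    # Observed agreement and the agreement expected by chance alone.
    observed = (tp + tn) / total
    expected = ((tp + fp) * (tp + fn) + (fn + tn) * (fp + tn)) / total ** 2
    kappa = (observed - expected) / (1 - expected)
    return sensitivity, ppv, kappa
```

With hypothetical counts `diagnostic_agreement(tp=80, fp=20, fn=20, tn=880)`, sensitivity and positive predictive value are both 0.8 and kappa is about 0.78. The confidence intervals quoted in the abstract would need an additional interval method that is not shown here.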
International guidelines recommend that pulmonary reference populations consist of never‐smokers without respiratory diseases or symptoms, but the diseases and symptoms are not clearly specified. The present study aimed to identify simple exclusion criteria for defining pulmonary reference populations.
Based on a random sample from a general population (the parent population), 2358 subjects aged 26–82 years performed spirometric tests. From this sample, subjects were excluded stepwise according to self‐reported obstructive lung diseases, symptoms and smoking history. Four progressively healthier respiratory reference populations were formed. Prediction equations for the median and lower limit of normal lung function were derived using quantile regression analysis.
Subjects without self‐reported obstructive lung diseases or the cardinal respiratory symptoms of breathlessness, cough or wheeze (population B), never‐smokers without cardinal symptoms (population C) and never‐smokers without any respiratory symptoms (population D) constituted 50% (n = 1184), 23% (n = 539) and 14% (n = 331) of the parent population (population A), respectively. The largest discrepancy between prediction equations was found between the parent population and the population without cardinal respiratory symptoms (population B) (p<0.05). Minor changes in the reference equations were also seen when excluding ever‐smokers (population C). There was no additional change with exclusion of other respiratory symptoms (population D). Age‐related decline in lung function was steepest in the parent population.
Obstructive lung diseases, smoking history, breathlessness, cough and wheeze are optimal exclusion criteria for a pulmonary reference population. Further validation of the exclusion criteria identified in this study is recommended with identical wording in other and larger multinational populations.