|Home | About | Journals | Submit | Contact Us | Français|
A combined genome-wide association and linkage study was used to identify loci causing variation in CF lung disease severity. A significant association (P=3. 34 × 10-8) near EHF and APIP (chr11p13) was identified in F508del homozygotes (n=1,978). The association replicated in F508del homozygotes (P=0.006) from a separate family-based study (n=557), with P=1.49 × 10-9 for the three-study joint meta-analysis. Linkage analysis of 486 sibling pairs from the family-based study identified a significant QTL on chromosome 20q13.2 (LOD=5.03). Our findings provide insight into the causes of variation in lung disease severity in CF and suggest new therapeutic targets for this life-limiting disorder.
Lung disease is the major source of morbidity and mortality in cystic fibrosis (CF), a recessive disorder caused by mutations in the cystic fibrosis transmembrane conductance regulator (CFTR) gene. Allelic variation in CFTR does not explain the wide variation in severity of lung disease1 however studies of twins and siblings demonstrate substantial heritability underlying differences in lung function measures in CF patients (h2 > 0.5)2. Candidate gene studies have produced conflicting results, with only a few large scale replications accounting for a small proportion of heritable variation in CF lung function3,4. Identification of other genetic modifiers could identify potential mechanisms for variation in lung function in CF, as well as for common diseases such as chronic obstructive pulmonary disease (COPD), and suggest new targets for intervention.
Whole-genome methods provide an attractive approach to identify modifier loci of Mendelian disorders. However CF presents numerous challenges, such as: (1) collecting multiple years of lung function measures to accurately classify lung disease severity; (2) selecting the appropriate study design to identify common and rare variants; (3) accruing sufficient sample sizes, and (4) accounting for potential interaction between CFTR and modifier loci. To overcome these challenges, we formed a North American CF Gene Modifier Consortium to identify modifiers of lung disease severity and other phenotypes. For lung disease in CF, the forced expiratory volume in 1 second (FEV1) is the most clinically useful measure of lung disease severity and is a well-established predictor of survival5,6. However, comparison of FEV1 measures across a broad age range of CF patients is confounded by decline with age and mortality attrition. To account for these confounders, the Consortium developed a quantitative lung disease phenotype based on multiple measures of FEV1 over 3 years7 that displays robust genetic influence (h2 = 0.51)8.
The Consortium is composed of three samples of CF patients recruited using different study designs. The Genetic Modifier Study (GMS) consists of unrelated patients homozygous for the common CF allele F508del (HGVS nomenclature: p.Phe508del), recruited from extremes of lung function9. The Canadian Consortium for Genetic Studies (CGS) enrolled unrelated patients having pancreatic insufficiency from a population-based sample10. The CF Twin and Sibling Study (TSS) recruited families where two or more surviving children have CF2. The GMS and CGS were designed for association analysis, while the TSS was designed for both linkage and association, providing an opportunity to detect rarer variants or poorly tagged loci.
As many current Genome Wide Association Studies (GWASs) employ sample sizes that are several-fold larger than available for the CF population, we sought to maximize power by (1) testing association using combined data from GMS and CGS, followed by replication using the association evidence from TSS, and (2) testing linkage using the TSS, followed by SNP association testing in linked regions in the unrelated patients in GMS and CGS. We also restricted analysis to patients bearing two severe loss-of-function CFTR alleles and a subset of these patients that had identical CFTR genotypes (homozygosity for F508del).
A total of 3,467 CF patients are represented in three study designs (Table 1, Supplementary Note). Patients in the GMS and 60% of the patients in the CGS and TSS are F508del homozygotes (F508del/F508del), while the remainder has other severe exocrine pancreatic CFTR genotypes2,9,10. The three samples showed consistent distributions of the lung disease phenotype, with the mid-range under-represented in GMS due to the extremes-of-phenotype design (Figure 1). Patients were contemporaneously genotyped using the Illumina 610-Quad array® in a single facility with stringent quality control (Online Methods). Association scans for the GMS and CGS used an additive model adjusted for sex and principal components as described11. Results were combined using a directional meta-analysis approach for (1) GMS and CGS, n=2,494 and (2) GMS and CGS F508del/F508del, n=1,978 (power analysis shown in Supplementary Figure 1).
The combined GMS and CGS analysis identified seven regions with suggestive association (P ≤ 1/570,725 = 1.75 × 10-6) (Figure 2 and Table 2). Restricting analysis to F508del/F508del patients, the EHF-APIP region on 11p13 achieved genome-wide significance at rs12793173 (P=3.34 × 10-8, explaining 1.0% of the phenotype variation in GMS, 2.2% in CGS F508del/F508del). We verified the significance by permutation analysis and by developing an alternative conditional likelihood approach which acknowledged the GMS extremes of phenotype (Online Methods, Supplementary Figure 2). With the inclusion of CF-relevant covariates (sex, BMI and previously associated genes), association for rs12793173 was even stronger (P= 9.42 × 10-9 for GMS and CGS F508del/F508del; Supplementary Table 1). Two purported modifiers of CF lung disease, TGFB1 and IFRD1, did not achieve genome-wide significance. TGFB1 did, however, achieve P-values in the range of 10-3 to 10-4 in the GMS sample, depending on additional covariates (Supplementary Table 1).
The SNPs in the significant region and the six suggestive regions in GMS and CGS were evaluated for association in TSS using Merlin12, while accounting for family structure. To be consistent with the GMS and CGS allelic effect, each replication test was one-sided, with the TSS sample (all or F508del/F508del patients) for each suggestive SNP chosen to be consistent with the GMS and CGS sample set providing maximum significance. Covariates for sex and four principal components11 were included for TSS. The SNP attaining genome-wide significance in GMS and CGS (rs12793173, F508del/F508del) demonstrated significant association in the TSS F508del/F508del sample (P=0.006; Bonferroni corrected P = 0.041 for the seven replication tests; Table 2). Two of the suggestive SNPs provided modest evidence in TSS: rs9268905 near HLA-DRA (P=0.032) and rs1403543 near AGTR2 (P=0.053), with neither significant after correcting for the seven replication tests.
We next performed a joint analysis, shown to be more powerful than testing followed by replication13, using a weighted meta-analysis procedure (Online Methods). Using all patients, rs12793173 attained genome-wide significance (P=1.12 × 10-8). For this patient set, rs568529, a SNP in high LD (r2 > 0.9) with rs12793173, achieved slightly greater significance (P=9.75 × 10-9). As in the earlier analysis, restricting to F508del/F508del patients increased the significance of EHF-APIP (P=1.49 × 10-9 for rs12793173 (Table 2), P=8.28 × 10-10 for rs568529). In the HLA class II region, a SNP (rs2395185, ~1kb from the suggestive SNP rs9268905 identified from GMS and CGS) approached genome-wide significance using all patients (P=9.02 × 10-8; Supplementary Figure 3). SNPs in AGTR2 remained suggestive for all patients (rs5952206, P=1.25 × 10-7) and for F508del/F508del patients (rs7060450, P=3.67 × 10-7).
Figure 3 shows the GMS and CGS results for an 800kb interval including EHF-APIP. The minimum P-value appears in an intergenic region 3’ to both EHF and APIP. A second peak at rs286873 (P=5.62 × 10-7) near EHF exhibited low linkage disequilibrium (r2 < 0.2) with the primary SNP (Figure 3). After conditioning on the primary finding, rs286873 had regional statistical significance (rs12793173; corrected P=0.0029), suggesting additional regional genetic variants (Supplementary Figure 4). We repeated the testing after MACH imputation14. The imputed SNPs in the region identified the same EHF/APIP interval, with minimum P=1.45 × 10-8, at rs535719, at a position 19kb closer to APIP than rs12793173. None of the imputed SNPs produced substantially improved association evidence (Supplementary Figure 5). Neither total copy number nor allele-specific copy number (Online Methods) models met genome-wide significance (illustrative Manhattan plot in Supplementary Figure 6). Finally, after sequencing the exonic regions of EHF and APIP in 48 patients with mild pulmonary disease and 48 patients with severe pulmonary disease from the GMS, no additional genetic variation was found that offered insight into putative modifying roles (data not shown).
Linkage analysis revealed a genome-wide significant multipoint LOD score of 5.03 at rs4811626, located at 53.81 Mb (~85cM) on chromosome 20q13.2 (nominal P=7.9 × 10-7 ; genome-wide15 P=2.3 × 10-3; Figure 4). Another, but more modest linkage signal was on chromosome 1p22.21, with multipoint LOD score of 2.48 for rs941031at 91.07 Mb (119 cM). Inclusion of BMI-Z, an important covariate of CF lung function (Supplementary Table 1), increased the LOD score for the linkage peak on 20q13.2 to 5.72 (genome-wide P=5.05 × 10-4 at rs4811645 which is 0.07cM (0.13Mb) from rs4811626; Figure 5) while linkage on chromosome 1p22.21 decreased to LOD 1.67. Thus, anthropometric measures are not major contributors to the linkage on 20q13.2 but may be playing a role on 1p22.21. We estimated that the QTL at 20q13.2 is approaching 50% of the variation in lung function in the CF sibling pairs (Supplementary Figure 7); however, this estimate is highly likely to be biased upward due to winner’s curse16.
A region of 1.31 Mb on 20q13.2, demarcated by 1 LOD unit below the maximum (when BMI-Z was used as a covariate), was analyzed for association in the combined GMS and CGS samples. A 16kb cluster of SNPs in high LD (rs6092179, rs6024437, rs8125625, rs6024454 and rs6024460; r2 > 0.8) located ~200kb from CBLN4 generated the lowest P-values in the combined GMS and CGS F508del/F508del samples (Figure 5). The SNP with the lowest P-value (rs6024460; P=1.34 × 10-4) reached regional significance (corrected P = 0.041). Association in the TSS identified a SNP (rs6069437) with marginal association (uncorrected P = 0.014) that displays weak LD with the GMS and CGS cluster of SNPs. Imputation did not identify any SNPs exhibiting a lower P value for association than rs6024460 (Supplementary Figure 8).
To evaluate association and linkage in a single framework, linkage information was used to reprioritize genome-wide association using extensions of the false discovery rate (FDR)17 via the stratified FDR (SFDR)18 and weighted FDR (WFDR)19. We (1) obtained linkage-weighted q-values representing the combined evidence at each SNP, and (2) re-ranked GWAS results by linkage-weighted q-values (see Online Methods). Results are presented from the WFDR; results were confirmed using the SFDR (data not shown). SNPs with q-values less than 0.05 were declared to be genome-wide significant (Table 3). SNPs in the EHF-APIP region on chromosome 11 are highly significant (low q-values), because of the strong association (Table 3). After accounting for linkage, the q-values for SNPs under the linkage peak on chromosome 20 are considerably decreased. The results presented in Table 3 illustrate that the linked SNPs on chromosome 20 are now top ranked genome-wide, while they were ranked 154th or lower, prior to incorporating the linkage information. The top-ranked SNP by the WFDR analysis was rs6092179 at 53.81 Mb on chr 20 (WFDR q-value=0.015, Table 3). SNP rs6092179 is within an LD block containing 4 other SNPs (rs6024437, rs8125625, rs6024454 and rs6024460), all demonstrating association with CF lung function and q-value <0.05. A rank-based q-value Manhattan plot demonstrates that chromosome 11 and chromosome 20 both attain genome-wide significance (Supplementary Figure 9).
We identified two new loci containing genetic variants contributing to variation in lung function in CF patients. The success of this project reflected: 1) coordinated analysis of three independent samples of the CF population (representing ~15% of all patients in North America) where each study subject was characterized by the same quantitative measure of lung function; 2) simultaneous genotyping of samples using a single platform which allowed for data cleaning using relatedness assessments and removal of poor quality genotypes based on parent to child transmission predictions; 3) analyzing for loci with small effect sizes using association, and loci of major effect (even in the presence of substantial allelic heterogeneity) using linkage. Moreover, we garnered increased power from an extreme of phenotype sample, while a population-based sample allowed for the development of a phenotype with external validity.
The association at chr11p13 is in an intergenic region 3’ to APIP and EHF with regulatory features including: i) significant conservation across species, ii) open chromatin (DNAase hypersensitivity and FAIRE-Seq), and iii) DNAase hypersensitive patterns suggesting cell-type-specificity (http://genome.ucsc.edu). The UCLA Gene Expression Tool (UGET, http://genome.ucla.edu/~jdong/GeneCorr.html)20,21 indicates correlation of expression of nearby genes, including strong correlation of EHF to ELF5, both epithelial-specific transcription factors; APIP to PDHX, which have the same promoter region; and EHF to APIP. APIP (Apaf-1-interacting protein) is known to inhibit apoptosis by binding to APAF-1, an important activator of caspase-922, 23 and by APAF-1 independent activation of AKT and ERK1/224. EHF is a member of epithelial-specific-Ets transcription factors that share a conserved Ets domain25-27. EHF can be induced in bronchial epithelial cells, smooth muscle cells and fibroblasts28,29, leading to transcriptional repression of a subset of ETS/AP-1-responsive genes activated by MAP-kinase pathways26,28, and in airway it may serve as an important regulator of differentiation under conditions of stress and inflammation26,27. Both genes show evidence of robust expression in lung and trachea, with APIP showing ubiquitous expression across tissues and EHF showing highest expression in trachea (http://www.ncbi.nlm.nih.gov/UniGene and http://www.ncbi.nlm.nig.gov/geo)30. Interestingly, cis-eQTL signatures for APIP are reported for lymphocytes and monocytes (eqtl.uchicago.edu). Comparing the eQTLs to the direction of phenotype-genotype association suggests that increased expression of APIP may be associated with decreased lung function, implying that inhibition of apoptosis worsens CF lung disease. This hypothesis is consistent with the emerging concepts that delayed neutrophil clearance, due to reduced apoptosis in neutrophils in the airways of CF patients, could lead to a hyperinflammatory state and more severe lung disease 31,32 and that inhibition of apoptosis contributes to goblet cell metaplasia, a central feature in CF airway pathophysiology33.
All 5 genes within the 1 LOD support interval in the chromosome 20 linkage region (Figure 5) are expressed in either fetal or adult lung or in bronchial epithelial cells (http://genome.ucsc.edu/). The 16kb cluster of SNPs associated with lung function in the GMS and CGS samples is located ~200kb to 500 kb centromeric to the five genes. None of the SNPs lies within a segment of open chromatin identified in the 16kb region in Normal Human Bronchial Epithelia cells (http://genome.ucsc.edu). Neither eQTL in lymphocytes (eqtl.uchicago.edu), miRNA (http://www.mirbase.org) nor DNaseI hypersensitive sites in Small Airway Epithelial cells map to the 16kb region. However, this does not exclude the possibility that the associated region regulates expression of any of the five genes or more distant genes. Among the five genes, MC3R has been implicated in weight maintenance and regulation of energy balance in animals and humans34-36. Variation in resting energy expenditure has been correlated with lung function measurements, lung tissue damage and lung disease exacerbation in CF patients 37,38. MC3R has also been implicated as a modulator of neutrophil accumulation in a murine model of lung inflammation39, a key feature of CF lung disease, as noted above. Other genes of interest within the linkage peak encode Crk-associated substrate scaffolding (CASS) 4 (CASS4/HEPL), a relative of proteins implicated in cell attachment, migration establishing polarity, invasion and phagocytosis of bacterial pathogens40 and Aurora kinase A (AURKA) which been shown to interact with Hef1/NEDD9, a member of the CASS family that mediates cytokinesis in late mitosis and facilitates disassembly of primary cilia41.
Twin studies in adults demonstrate that FEV1 is under strong genetic influence42,43, and at least three loci (GSTCD, TNS1 and HTR4) have been reproducibly associated with this measure44-46. Multiple replicated loci have also been associated with variation in the FEV1/FVC ratio45,46 and at least two of these loci (HHIP and FAM13) show reproducible association with COPD44,47,48. While the lung phenotype used here was based on FEV1, none of the above loci coincides with the regions identified in this study and neither of the loci identified here occur within the top 2000 associations for FEV1 or FEV1/FVC45,46.
Common variation in the EHF/APIP region is estimated to alter the lung function measure in the GMS and CGS F508del/F508del patients by ~0.2 units of the quantitative lung disease phenotype per allele (Table 2). Translated into more familiar clinical terms, the 0.2 unit difference is approximately equivalent to a mean difference in FEV1 percent predicted of 5.1 ± 1.9, corresponding to a mean difference in FEV1 of 254 ± 86mL in patients over 18 years of age (Online Methods). The QTL on chromosome 20 may account for a sizeable fraction of lung function variation in CF. Using simulations described by Blangero and colleagues16, we estimate that this locus accounts for a maximum of 46% and a minimum of 4% of the variance in the CF siblings (Online Methods).
In summary, our association and linkage approach provided complementary findings with the identification of two significant loci harboring genes of biologic relevance for CF. Of particular note for modifier searches in other monogenic diseases is the potential importance of minimizing variation in the causative gene. When we confined association analysis to patients with identical CFTR genotypes (i.e. F508del/F508del), one of the 7 suggestive loci achieved genome-wide significance, despite the reduction in sample size due to the exclusion of 38% of subjects in the CGS sample with other CFTR genotypes. The remaining suggestive loci contain biologically intriguing candidate modifiers that will be evaluated in future studies. Finally, the identification of genetic loci that modify lung function in CF, should provide new insight leading to the development of novel therapies for this devastating condition.
DNA from whole blood or transformed lymphocytes was hybridized to the Illumina 610-Quad ® platform at Genome Quebec (McGill University and Genome Quebec Innovation Centre,) using the 96-well plates with CEPH and one replicate control per plate. Illumina BeadStudio® was used to call genotype, and identity confirmed by Sequenom® fingerprinting. SNPs were removed if they were monomorphic, missing > 10% calls or with >1% Mendelian error in TSS trios. Finally, 570,725 autosomal and X-chromosome SNPs were selected, as well as 158 chromosome Y SNPs and 138 mitochondrial SNPs. Duplicate discordance was 0.004% in GMS, and similar for the other studies.
Sample exclusions included: initial call rate below 98%, unexpected close relatives or duplicate enrollments, unresolved sex mismatches, aneuploidy or outlying heterozygosity (> 5 standard deviations from the mean of 31.6%). Overlapping from 542 Illumina GoldenGate ® SNPs in GMS revealed platform discordance of 0.07%. Families with >5% Mendelian errors were excluded. Twenty-eight patient samples were excluded (GMS6; CGS 17, TSS 5) due to genotyping failure or artifacts, two GMS samples excluded due to outlying ancestry (by PC analysis), and eight GMS samples excluded for > second degree relation with other samples. Reported findings were verified using Illumina GenomeStudio V1.0.2® module V1.0.10 and manually-assisted calling.
Regressions for the lung phenotype were performed separately for GMS, all CGS, and CGS F508del/F508del using an additive model in PLINK v. 1.07 49, adjusted for sex and genotype principal components (PCs)11. Using the PLINK z-statistics for GMS and CGS, the standard meta-analysis z-statistic50 was z = wGMSzGMS + wCGSzCGS, with weights inversely proportional to standard errors, and common reference alleles for directional consistency. “Suggestive” association used the approximate threshold 1/(number of SNPs)=1/570,725=1.75 × 10-6, and significant association the Bonferroni threshold P < 0.05/570,725 = 8.76 × 10-8. For males, X-chromosome genotypes followed PLINK defaults (0 or 1 minor alleles; alternative coding resulted in no qualitative changes).
Permutations of genotypes relative to phenotypes and covariates (1,000) were used to refine the thresholds. From this pool of permutations, 10,000 permuted meta-analyses were computed. The obtained significance thresholds for a genome-wide error 0.05 were P = 1.07 × 10-7 (GMS and CGS) and P = 1.05 × 10-7 (GMS and CGS F508del/F508del). Consequently, P< 5 × 10-8 achieves false positive error control at genome-wide α<0.05, even correcting for two separate GWAS analyses. Regional multiple-comparisons correction (after highlighting a region) used the Bonferroni correction for the regional SNPs.
TSS association analysis was performed in 973 CF siblings and for the 557-patient F508del/F508del subset using the Merlin variance-components additive model framework12, corrected for linkage, family structure, sex, and 4 PCs. Missing genotypes (0.125%) were inferred to increase power51. Joint analyses of GMS, CGS and TSS used the meta-analysis approach described above.
We devised a novel approach using the assumption that CGS represents a random population sample, whereas GMS was conditional on the observed phenotypes. Letting g be the number of SNP minor alleles, the phenotypes y were pre-adjusted for sex and the study-specific PCs. We assumed an additive model y = β0 + β1g + ε, ε ~ N (0, σ2). The full likelihood conditioned on GMS sampling was where . Finally, we computed the SNP-specific statistic 2 × (log-likelihood ratio), with β1 = 0 as the null and compared to 2 . The approach assumes the effect sizes are the same in GMS and CGS, which is true under the null.
Power analyses for the combination of GMS and CGS assumed an additive genetic model, with effect β1 on the average phenotype for each minor allele. The results for GMS and CGS F508del/F508del are in Supplementary Figure 1. For each simulation the weighted meta-analysis P-values were compared to 5 × 10-8.
MACH (autosomes, http://www.sph.umich.edu/csg/abecasis/mach/) and IMPUTE (chromosome X, http://mathgen.stats.ox.ac.uk/impute/impute.html) imputation was conducted for 1162 GMS patients, 1,254 self-reported CGS “Caucasian” patients and 60 CEU reference samples from HapMap I/II. Some of these individuals were later used for TSS, and association analyses considered only unique subsets in GMS and CGS, respectively (Table 1). Imputation yielded data for ~2,544,000 autosomal and ~65,000 chromosome X SNPs.
Copy number variants (CNVs) were detected using pennCNV (2008Nov19 version)52 and genoCNV (version 1.08)53 using default parameters in 1103 GMS and 1301 CGS samples. CNVs with fewer than 5 probes or showing <1% variation were used, resulting in 3,008/4,868 probes from genoCNV/pennCNV in GMS and 3015/4663 probes for genoCNV /pennCNV in CGS. Genotype PCs were used to control stratification.
19,566 SNPs were selected from the Illumina platform with minor allele frequency >0.4 and r2 <0.01 between adjacent SNPs, using Merlin54. HapMap II recombination data were used to integrate genetic and physical map positions. Average inter-marker distance was 0.18 cM, or 0.13 Mbp. Physical positions not appearing in HapMap were estimated assuming uniform recombination between known adjacent SNPs. The average marker information content was ~0.9 (multipoint) and ~0.31 (two-point).
Variance components were estimated in SOLAR (Sequential Oligogenic Linkage Analysis Routines)55, with similar results from Merlin54, using multipoint IBD probabilities obtained from Merlin. LOD scores were computed with and without covariates (sex and average BMI Z-score). Multipoint LODs>2.0 was considered suggestive and LOD>3.7 was considered genome-wide significant15.
Let Pi be the p-value of an association test for SNP i, i =1,…,m. Converting p-values to q-values56 controls the FDR. SNPs with q-values less than the FDR threshold value (e.g. γ = 0.05) are declared significant. The expected proportion of false positives among all the positives is then controlled at level γ. Note that ranking SNPs by P-value or q-value are equivalent.
Let Zi be the linkage score of SNP i obtained from a GWL study. For the SFDR method, m SNPs are divided into K disjoint strata based on the prior linkage information57. Cconsider K = 2 and assign each SNP i to stratum 1 (the high priority group) or stratum 2 (the low priority group) according to whether the linkage score Zi exceeds a threshold C (we used C=3.3 corresponding to significant linkage15). Q-values are then calculated separately for each stratum of SNPs, achieving FDR control in each stratum (Sun et al., 2006). Ranks of the GWAS SNPs are determined by the q-values with the original association p-values used to break any q-value ties.
WFDR calculates a weighting factor Wi for each SNP i with weights subject to two constraints: Wi ≥ 0_ and = Σi Wi /m = 1. The weight Wi is proportional to the linkage signal Zi for SNP i (e.g.Wi exp(B · Zi) / ν, ν = Σi exp(B · Zi)/m,, and B=1) (Roeder et al., 2006), and the FDR procedure is applied to the set of weight-adjusted p-values, Pi/Wi, i =1,…,m. We use B=2 in the present analysis. The WFDR and SFDR were implemented in a perl program called SFDR, available at http://www.utstat.toronto.edu/sun/Software/SFDR/index.html.
The proportion of variation due to each SNP was measured as the change in regression sums of squares vs. the smaller model with the SNP removed58. Using the genome-scan threshold of P=5 ×10-8 and minimum P=3.34 × 10-8 in the chromosome 11p13 region for GMS and CGS F508del/F508del patients, we estimate a 57.4% reduction in effect size compared to the nominal result. Using the joint analysis based on GMS, CGS F508del/F508del and TSS F508del/F508del patients, the observed minimum P=8.28 × 10-8 results in ~ 28.0% reduction of the effect size. Using the rough parallel to explained variation in the trait, the estimated explained variation for 11p13 remains 1%-2%. For a linkage study of comparable size (n=500 sibling pairs), with a phenotype heritability of 0.5, the bias attributed to the winner’s curse varies from approximately 0.46 down to zero as the true (unmeasured) heritability attributable to the QTL increases16. While not possible to quantify the magnitude of this bias in this single study, these calculations provide an upper bound on the bias of 0.38 to 0.46 and a lower bound of 0.04 to 0.12.
Using 973 TSS individuals, a hypothetical quantity of 0.2 was added to each individual’s lung phenotype, to correspond to the effect size observed for the significant association of SNPs near EHF/APIP. The average raw FEV1 (in liters) was then back-extrapolated8 and FEV1 percent predicted values were generated using the predictive equations59,60. Height and age adjustments used to calculate the original quantitative lung phenotype were preserved. The average increase (mean ± SD) in FEV1 percent predicted corresponding to a 0.2-unit increase of our lung phenotype was 5.09% ± 1.90% [n = 841; Range: 0.00 – 14.53%]. The corresponding average increase in raw FEV1 was 253.5 ± 85.9mL in adult subjects (>18 years) [n = 244; Range: 0.0 – 630.0mL].
This work was supported in part by the US National Institutes of Health grants: HL68927, K23DK083551, R01HL068890, R01DK066368, R01HL095396, P30DK027651, SOLAR-MH059490, HG-0004314; U.S. Cystic Fibrosis Foundation grants: CUTTIN06P0, COLLAC07A0, R025-CR07, KNOWLE00A0, RDP-R026-CR07, DRUMM0A00 and contract GENOMEQUEBEC07DDS0; Flight Attendant Medical Research Institute grant: FAMRI2006; Lawson Wilkins Pediatric Endocrine Society grant: LWPES Clinical Scholar Award; The Canadian Cystic Fibrosis Foundation; Genome Canada through the Ontario Genomics Institute as per research agreement 2004-OGI-3-05; Ontario Research Fund, Research Excellence Program; Lloyd Carr-Harris Foundation; Joint Fellowship of Canadian Institutes of Health Research and Ontario Women’s Health Council. Funds for genome-wide genotyping were generously provided by the U.S. CFF.
The authors would like to thank the CFF Patient Registry, the University of North Carolina DNA Laboratory, Genome Quebec and McGill University Innovation Centre and the following for their contributions: Manuscript preparation: Patricia Cornwall Study Design: Ada Hamosh, Rita McWilliams Recruitment: Sonya Adams, Marilyn Algire, Nicole Anderson, Amanda Bowers, Jennifer Breaton, Colette Bucur, Leia Charnin, Mary Christofi, Barbara Coleman, John Dunn, Barry Elkind, Despina Fragolias, Julie Hoover-Fong, Katherine Keenan, Patricia Miller, Sarah Norris, Roxanne Rousseau, Sally Wood, Rossitta Yung, Catherine Yurk Phenotyping: Nicole Anderson, Jennifer Breaton, Mary Christofi, Despina Frangolias, Wan Ip, Katherine Keenan, Roxanne Rousseau, Rossitta Yung Data Entry: John Dunn, Sarah Norris, Teresa Lai Genotyping: Kaeleen Boden, Rebecca Darrah, Qiuju Huang, Kyle Kanieki, Fan Lin, Sarah Ritter, Nulang Wang, Yongqian Wang, Colleen Weiler, Whitney Wolf, Xiaowei Yuan Analysis: Tony Dang, Evan Hawbaker, Lindsay Henderson, Rita McWilliams, Yunfei Wang, Christopher Watson, Aaron Webel, Bioinformatics: Edgar Crowdy, Hong Dang, Hemant Kelkar, Tom Randall, Annie Xu
Aaron,S., Ottawa General Hospital, Ottawa, Canada/Accurso,F., University of Colorado Health Sciences Center, CO/Acton,J., Cincinnati Children’s Hospital and Medical Center, OH/Ahrens,R., University of Iowa Hospitals & Clinics, IA/Aljadeff,G., Lutheran General Children’s Hospital, IL/Allard,C., Centre de Santé et de Services Sociaux de Chicoutimi, Chicoutimi, Canada/Amaro,R., University of Texas at Tyler Health Center, TX/Anbar,R., SUNY Upstate Medical University, NY/Anderson,P., University of Arkansas, AR/Atlas,A., Morristown Memorial Hospital, NJ/Bell,S., The Prince Charles Hospital, Australia/Berdella,M., St. Vincents Hospital & Medical Center, NY /Biller,J., Children’s Hospital of Wisconsin, WI/Black,H., Asthma & Allergy Specialists, Charlotte, NC/Black,P., Children’s Mercy Hospital, MO/Boas,S., Children’s Asthma Respiratory&Exercise Specialists, IL/Boland,M., Children’s Hospital of Eastern Ontario, Ottawa, Canada/Borowitz,D., Women & Children’s Hospital of Buffalo, NY/Boswell,R., University of Tennessee, TN/Boucher,J., Centre Hospitalier Régional de Rimouski, Rimouski, Canada/Bowman,C.M., Medical University of South Carolina, SC/Boyle,M., Johns Hopkins Hospital, MD/Brown,C., California Pacific Medical Center, CA/Brown,D., Pediatric Pulmonary Associates., SC/Brown,N., University of Alberta Hospitals, Edmonton, Canada/Caffey,L.F., University of New Mexico, NM/Chatfield,B., University of Utah Health Sciences Center, UT/Chesrown,S., University of Florida, FL/Chipps,B., Sutter Medical Center, CA/Clancy,J.P., University of Alabama at Birmingham, AL/Cohen,R., Kaiser Permanente, OR/Colombo,J., University of Nebraska Medical Center, NE/Cronin,J., Women & Children’s Hospital of Buffalo, NY/Cruz,M., St. Mary’s Medical Center, FL/Cunningham,J., Cook Children’s Medical Center, TX/Davidson,G., B.C. Children’s Hospital, Vancouver, Canada/Davies,J, University of New Mexico, NM/Davies,L., University of New Mexico, SOM, NM/DeCelie-Germana,J., Schneider Children’s Hospital, NY/Devenny,A., Royal Hospital for Sick Children, Scotland/DiMango,E., Columbia University Medical Center, NY/Doornbos,D., Via-Christi, St. Francis Campus, KS/Dozor,A., New York Medical College-Westchester Medical Center, NY/Dunitz,J., University of Minnesota, MN/Egan,M., Yale University SOM, CT/Eichner,J., Great Falls Clinic, MT/Ferkol,T., St. Louis Children’s Hospital, MO/Fiel,S., Morristown Memorial Hospital, NJ/Flume,P., Medical University of South Carolina, SC/ Freitag,A.,Hamilton Health Sciences Corporation, Hamilton, Canada/ Franco,M., Miami Children’s Hospital, FL/Froh,D., University of Virginia Health System, VA/Garey,N., Saint John Regional Hospital, Saint John, Canada/Geller,D., Nemours Children’s Clinic Orlando, FL/Gershan,W., Children’s Hospital of Wisconsin, WI/Gibson,R., Children’s Hospital & Regional Medical Center, WA/Giusti,R., Long Island College Hospital, NY/Gjevre,J., Royal University Hospital, Saskatoon, Canada/Gondor,M., University of South Florida, FL/Gong,G., Phoenix Children’s Hospital, AZ/Guill,M., Medical College of Georgia, GA/Gutierrez,H., University of Alabama at Birmingham, AL/Hadeh,A., Drexel University College of Medicine, PA/Hardy,K., Children’s Hospital - Oakland, CA/Hiatt,P., Texas Children’s Hospital, TX/Hicks,D., Children’s Hospital of Orange County, CA/Holmes,B., Regina General Hospital, Regina, Canada/Holsclaw,D., University of Pennsylvania, PA/Holzwarth,P., St. Vincent Hospital - Genetics, WI/Honicky,R., Michigan State University, MI/Howenstine,M., Riley Hospital for Children, IN/Hughes,D., IWK Health Centre, Halifax, Canada/Jackson,M., Grand River Hospital, Kitchener, Canada/James,P., Lutheran Hospital, IN/ Jenneret A., Hôtel Dieu de Montréal, Montréal, Canada/Joseph,P., University of Cincinnati, OH/Kanga,J., University of Kentucky, KY/Katz,M., Baylor College of Medicine, TX/Kent,S., Victoria General Hospital, Victoria, Canada/Kepron,W., Winnipeg Health Sciences Centre, Winnipeg, Canada/Knowles,M., University of North Carolina at Chapel Hill, NC/Konig,P., University of Missouri- Columbia, MO/Konstan,M., Case Western Reserve University, OH/ Kovesi,T., Children’s Hospital of Eastern Ontario, Ottawa, Canada/Kramer,J., Oklahoma Cystic Fibrosis Center, OK/Kraynack,N., Children’s Hospital Medical Center of Akron, OH/Kumar,V., Hôpital Régional de Sudbury Regional Hospital, Sudbury, Canada/Lahiri,T., Fletcher Allen Health Care, VT/Landon,C., Pediatric Diagnostic Center, CA/Lands,L., Montréal Children’s Hospital, Montréal, Canada/Lapin,C., Connecticut Children’s Medical Center, CT/Larj,M., Wake Forest University Baptist Med. Ctr., NC/Ledbetter,J., TC Thompson Children’s Hospital, TN/Lee,R., Naval Medical Center - Portsmouth, VA/Leigh,M., University of North Carolina at Chapel Hill, NC/Lester,L., University of Chicago Children’s Hospital, IL/Lever,T., Eastern Maine Medical Center, ME/Levy,H., Children’s Hospital Boston, MA/Lieberthal,A., Kaiser Permanente Southern California, CA/Liou,T., University of Utah, UT/Lipton,A., National Naval Medical Center, MD/Lyttle,B., Children’s Hospital of Western Ontario, London, Canada/Lothian,B., Royal University Hospital, Saskatoon, Canada/Lougheed,D., Hotel Dieu Hospital, Kingston, Canada/Malhotra,K., Grand River Hospital, Kitchener, Canada/Marcotte,J., Hôpital Sante-Justine, Montréal, Canada/Matouk,E., Montréal Chest Institute, Montréal, Canada/McCarthy,M., Providence Medical Center, WA/McColley,S., Children’s Memorial Hospital & Northwestern University, IL/McCoy,K., Columbus Children’s Hospital, OH/McNamara,J., Children’s Hospitals and Clinics of Minneapolis, MN/Michael,R., Queen Elizabeth II Health Sciences Centre, Halifax, Canada/Miller,S., University of Mississippi Medical Center, MS/Milot,M., Centre de Santé et de Services Sociaux de Chicoutimi, Chicoutimi, Canada/Moffett,K., West Virginia University, WV/Montgomery,M., Alberta Children’s Hospital, Calgary, Canada/Moore,P., Vanderbilt University Medical Center, TN/Morgan,W., Tucson Cystic Fibrosis Center, AZ/Morris,R., Janeway Children’s Health & Rehabilitation, St. John’s, Canada/Morse,M., Methodist Children’s Hospital, TX/Moskowitz,S., Children’s Hospital & Regional Medical Center, WA/Moss,R., Stanford University Medical Center, CA/Murphy,P., University of Nebraska Medical Center, NE/Nakielna,E., St. Paul’s Hospital, Vancouver, Canada/Nasr,S., University of Michigan Medical Center, MI/Nassri,L., Sparks Regional Medical Center, AR/Naureckas,E., University of Chicago Hospitals, IL/Nielson,D., University of California at San Francisco, CA/Noseworthy,M., Janeway Children’s Health & Rehabilitation, St. John’s, Canada/Noyes,B., St. Louis University, MO/Olivier,K., Wilford Hall USAF Med. Ctr. San Antonio, TX/Olson,E., University of Florida, FL/Omlor,G., Akron Children’s Hospital, OH/Orenstein,D., Children’s Hospital of Pittsburgh, PA/O’Sullivan,B., University of Massachusetts Memorial Health Care, MA/Parker,H.W., Dartmouth-Hitchcock Medical Center, NH/Passero,M., Brown University Medical School Rhode Island Hospital, RI/Pasterkamp,H., Children’s Hospital of Winnipeg, Winnipeg, Canada/Pedder,L., Hamilton Health Sciences Corporation, Hamilton, Canada/Perkett,E., Vanderbilt University Medical Center, TN/Perry,G., University of Kansas Medical Center, KS/Petit,N., Center Hospitalier Rouyn-Noranda, Rouyn-Noranda, Canada/Pian,M., University of California San Diego Children’s Hospital, CA/Platzker,A., Children’s Hospital of Los Angeles, CA/Prestidge,C., Children’s Medical Center of Dallas, TX/Price,A., Children’s Hospital of Western Ontario, London, Canada/Rabin,H., Foothills Medical Centre, Calgary, Canada/Radford,P., Phoenix Children’s Hospital, AZ/Ratjen,F., The Hospital for Sick Children, Toronto, Canada/Regelmann,W., University of Minnesota, MN/Ren,C., University of Rochester Medical Center, Strong Memorial Hospital, NY/Retsch-Bogart,G., University of North Carolina at Chapel Hill, NC/Richards,W., Memphis Lung Physicians, MS/Riva,M., Via-Christi, St. Francis Campus, KS/Rivard,L., Centre Unversitaire de Santé de L’estrie, Sherbrook, Canada/Roberts,D., Providence Medical Center, AK/Rock,M., University of Wisconsin Hospital, WI/Rosen,J., Albany Medical College, NY/Royall,J., Childrens Hospital of Oklahoma, OK/Rubenstein,R., Children’s Hospital of Philadelphia, PA/Ruiz,F., University of Mississippi Med. Ctr., MS/Scanlin,T., Children’s Hospital of Philadelphia, PA/Schechter,M., Emory University School of Medicine, GA/Schmidt,H.J., Virginia Commonwealth University, VA/Schwartzman,M., Joe DiMaggio Children’s Hospital, FL/Scott,P., Georgia Pediatric Pulmonology Assoc., PC, GA/Shay,G., Kaiser Permanente Medical Center, CA/Simon,R., University of Michigan Health System, MI/Smith,P., Long Island College Hospital, NY/Solomon,M., The Hospital for Sick Children, Toronto, Canada/Spencer,T., Children’s Hospital of Boston, MA/Stecenko,A., Emory University, GA/Stokes,D., University of Tennessee, TN/Sullivan,B., Marshfield Clinic, WI/Taylor-Cousar,J., University of New Mexico, NM/Thomas,N., Pennsylvania State University College of Medicine, PA/Thompson,H., St. Luke’s CF Clinic, ID/Toder,D., Children’s Hospital of Michigan and Harper University Hospital, MI/Tullis,E., St.Michael’s Hospital, Toronto, Canada/Turcios,N., University of Medicine & Dentistry of NJ, NJ/van Wylick,R., Hotel Dieu Hospital, Kingston, Canada/Varlotta,L., St. Christopher’s Hospital for Children, PA/Vauthy,P., Toledo Children’s Hospital, OH/Voynow,J., Duke University, NC/Wainwright,C., Royal Children’s Hospital, Australia/Walker,P., St. Vincent’s Hospital - Manhattan, NY/Warren,W.S., Hershey Medical Center, PA/Wilcox,P., Royal Jubilee Hospital, Victoria, Canada/Wilmott,R., St. Louis University, MO/ Wilcox,P., St. Paul’s Hospital, Vancouver, Canada/Wojtczak,H., Naval Medical Center - San Diego, CA/Yee,W., New England Medical Center, MA/Zacher,C., St. Alexius Heart & Lung CF Clinic, ND/Zanni,R., Monmouth Medical Center, NJ/Zeitlin,P., Johns Hopkins Hospital, MD/Zuberbuhler,P., University of Alberta Hospitals, Edmonton, Canada.
Author Contributions F.A.W., L.J.S., L.S., D.C, M.C., R.D., L.L.V., W.K.O., M.L.D., P.R.D., A.H., RM., M.R.K., and G.R.C., worked on study design. Y.B., A.C., J.M.C., M.C., R.D., D.G., W.L., K.M.N., R.G.P., P.P., J.M.R., A.S., J.R.S., C.T., L.L.V., J.Z., M.L.D., P.R.D., M.R.K., and G.R.C. performed sample collection and phenotyping. F.A.W., C.W.C., S.M.B., D.C., K.G., E.M.L., J.L., R.G.P., J.R.S., F.Z., W.K.O., M.L.D., M.R.K., G.R.C. performed genotyping and data cleaning. F.A.W., L.J.S., V.D., C.W.C., S.M.B., L.S., A.C., J.M.C., M.C., R.D., K.G., J.K., E.M.L., S.L., W.L., G.M.M., J.M.R., W.S., C.T., F.Z., J.B., performed statistical analysis. F.A.W., L.J.S., V.D., L.S., R.D., J.M.R., W.O., M.L.D., P.D., M.R.K. and G.R.C. wrote the manuscript.
The authors have no competing financial interests.