Genome-wide association studies of European and East Asian populations have identified lung cancer susceptibility loci on chromosomes 5p15.33, 6p22.1-p21.31 and 15q25.1. We investigated whether these regions contain lung cancer susceptibly loci in African-Americans refined previous association signals by utilizing the reduced linkage disequilibrium observed in African-Americans.
1308 African-American cases and 1241 African-American controls from three centers were genotyped for 760 single nucleotide polymorphisms spanning three regions, and additional SNP imputation was performed. Associations between polymorphisms and lung cancer risk were estimated using logistic regression, stratified by tumor histology where appropriate.
The strongest associations were observed on 15q25.1 in/near CHRNA5, including a missense substitution (rs16969968: OR = 1.57, 95% CI = 1.25–1.97, P = 1.1 × 10−4) and variants in the 5′-UTR. Associations on 6p22.1-p21.31 were histology-specific and included a missense variant in BAT2 associated with squamous-cell carcinoma (rs2736158: OR = 0.64, 95% CI = 0.48–0.85, P = 1.82 × 10−3). Associations on 5p15.33 were detected near TERT, the strongest of which was rs2735940 (OR = 0.82, 95% CI = 0.73–0.93, P = 1.1 × 10−3). This association was stronger among cases with adenocarcinoma (OR = 0.75, 95% CI = 0.65–0.86, P = 8.1 × 10−5).
Polymorphisms in 5p15.33, 6p22.1-p21.31 and 15q25.1 are associated with lung cancer in African-Americans. Variants on 5p15.33 are stronger risk factors for adenocarcinoma and variants on 6p21.33 associated only with squamous-cell carcinoma.
Results implicate the BAT2, TERT and CHRNA5 genes in the pathogenesis of specific lung cancer histologies.
Lung cancer; adenocarcinoma; squamous-cell carcinoma; fine-mapping; African-American; genetic association
Recent meta-analyses of European ancestry subjects show strong evidence for association between smoking quantity and multiple genetic variants on chromosome 15q25. This meta-analysis extends the examination of association between distinct genes in the CHRNA5-CHRNA3-CHRNB4 region and smoking quantity to Asian and African American populations to confirm and refine specific reported associations.
Association results for a dichotomized cigarettes smoked per day (CPD) phenotype in 27 datasets (European ancestry (N=14,786), Asian (N=6,889), and African American (N=10,912) for a total of 32,587 smokers) were meta-analyzed by population and results were compared across all three populations.
We demonstrate association between smoking quantity and markers in the chromosome 15q25 region across all three populations, and narrow the region of association. Of the variants tested, only rs16969968 is associated with smoking (p < 0.01) in each of these three populations (OR=1.33, 95%C.I.=1.25–1.42, p=1.1×10−17 in meta-analysis across all population samples). Additional variants displayed a consistent signal in both European ancestry and Asian datasets, but not in African Americans.
The observed consistent association of rs16969968 with heavy smoking across multiple populations, combined with its known biological significance, suggests rs16969968 is most likely a functional variant that alters risk for heavy smoking. We interpret additional association results that differ across populations as providing evidence for additional functional variants, but we are unable to further localize the source of this association. Using the cross-population study paradigm provides valuable insights to narrow regions of interest and inform future biological experiments.
smoking; genetics; meta-analysis; cross-population
Studies in European and East Asian populations have identified lung cancer susceptibility loci in nicotinic acetylcholine receptor (nAChR) genes on chromosome 15q25.1 which also appear to influence smoking behaviors. We sought to determine if genetic variation in nAChR genes influences lung cancer susceptibly in African-Americans, and evaluated the association of these cancer susceptibility loci with smoking behavior. A total of 1308 African-Americans with lung cancer and 1241 African-American controls from three centers were genotyped for 378 single nucleotide polymorphisms (SNPs) spanning the sixteen human nAChR genes. Associations between SNPs and the risk of lung cancer were estimated using logistic regression, adjusted for relevant covariates. Seven SNPs in three nAChR genes were significantly associated with lung cancer at a strict Bonferroni-corrected level, including a novel association on chromosome 2 near the promoter of CHRNA1 (rs3755486: OR = 1.40, 95% CI = 1.18-1.67, P = 1.0 × 10−4). Association analysis of an additional 305 imputed SNPs on 2q31.1 supported this association. Publicly available expression data demonstrated that the rs3755486 risk allele correlates with increased CHRNA1 gene expression. Additional SNP associations were observed on 15q25.1 in genes previously associated with lung cancer, including a missense variant in CHRNA5 (rs16969968: OR = 1.60, 95% CI = 1.27-2.01, P = 5.9 × 10−5). Risk alleles on 15q25.1 also correlated with an increased number of cigarettes smoked per day among the controls. These findings identify a novel lung cancer risk locus on 2q31.1 which correlates with CHRNA1 expression and replicate previous associations on 15q25.1 in African-Americans.
Lung cancer; nicotine dependence; African-Americans; genetic association; smoking
We conducted imputation to the 1000 Genomes Project of four genome-wide association studies of lung cancer in populations of European ancestry (11,348 cases and 15,861 controls) and genotyped an additional 10,246 cases and 38,295 controls for follow-up. We identified large-effect genome-wide associations for squamous lung cancer with the rare variants of BRCA2-K3326X (rs11571833; odds ratio [OR]=2.47, P=4.74×10−20) and of CHEK2-I157T (rs17879961; OR=0.38 P=1.27×10−13). We also showed an association between common variation at 3q28 (TP63; rs13314271; OR=1.13, P=7.22×10−10) and lung adenocarcinoma previously only reported in Asians. These findings provide further evidence for inherited genetic susceptibility to lung cancer and its biological basis. Additionally, our analysis demonstrates that imputation can identify rare disease-causing variants having substantive effects on cancer risk from pre-existing GWAS data.
Black/white disparities in lung cancer incidence and mortality mandate an evaluation of underlying biological differences. We have previously shown higher risks of lung cancer associated with prior emphysema in African American compared with white lung cancer patients.
We therefore evaluated a panel of 1440 inflammatory gene variants in a two phase analysis (discovery and replication), added top GWAS lung cancer hits from Caucasian populations, and 28 SNPs from a published gene panel. The discovery set (477 self-designated African Americans cases, 366 controls matched on age, ethnicity, and gender) were from Houston, Texas. The external replication set (330 cases, 342 controls) was from the EXHALE study at Wayne State University.
In discovery, 154 inflammation SNPs were significant (P<0.05) on univariate analysis, as was one of the gene panel SNPs (rs308738 in REV1, P=0.0013), and three GWAS hits, rs16969968 P=0.0014 and rs10519203 P=0.0003 in the 15q locus and rs2736100, the HTERT locus, P=0.0002. One inflammation SNP, rs950286, was successfully replicated with a concordant odds ratio of 1.46(1.14-1.87) in discovery, 1.37(1.05-1.77) in replication, and a combined OR of 1.40 (1.17-1.68). This SNP is intergenic between IRF4 and EXOC2 genes. We also constructed and validated epidemiologic and extended risk prediction models. The AUC for the epidemiologic discovery model was 0.77 and 0.80 for the extended model. For the combined datasets, the AUC values were 0.75 and 0.76, respectively.
As has been reported for other cancer sites and populations, incorporating top genetic hits into risk prediction models, provides little improvement in model performance and no clinical relevance.
Lung cancer in lifetime never smokers is distinct from that in smokers, but the role of separate or overlapping carcinogenic pathways has not been explored. We therefore evaluated a comprehensive panel of 11,737 SNPs in inflammatory-pathway genes in a discovery phase (451 lung cancer cases, 508 controls from Texas). SNPs that were significant were evaluated in a second external population (303 cases, 311 controls from the Mayo Clinic). An intronic SNP in the ACVR1B gene, rs12809597, was replicated with significance and restricted to those reporting adult exposure to environmental tobacco smoke Another promising candidate was a SNP in NR4A1, although the replication OR did not achieve statistical significance. ACVR1B belongs to the TGFR-β superfamily, contributing to resolution of inflammation and initiation of airway remodeling. An inflammatory microenvironment, (second hand smoking, asthma, or hay fever) is necessary for risk from these gene variants to be expressed. These findings require further replication, followed by targeted resequencing, and functional validation.
lung cancer; never smokers; inflammation genes; sidestream exposure
Tobacco-induced lung cancer is characterized by a deregulated inflammatory microenvironment. Variants in multiple genes in inflammation pathways may contribute to risk of lung cancer.
We therefore conducted a three-stage comprehensive pathway analysis (discovery, replication and meta-analysis) of inflammation gene variants in ever smoking lung cancer cases and controls. A discovery set (1096 cases; 727 controls) and an independent and non-overlapping internal replication set (1154 cases; 1137 controls) were derived from an ongoing case-control study. For discovery, we used an iSelect BeadChip to interrogate a comprehensive panel of 11737 inflammation pathway SNPs and selected nominally significant (p<0.05) SNPs for internal replication.
There were 6 SNPs that achieved statistical significance (p<0.05) in the internal replication dataset with concordant risk estimates for former smokers and 5 concordant and replicated SNPs in current smokers. Replicated hits were further tested in a subsequent meta-analysis using external data derived from two published GWAS and a case-control study. Two of these variants (a BCL2L14 SNP in former smokers and a SNP in IL2RB in current smokers) were further validated. In risk score analyses, there was a 26% increase in risk with each additional adverse allele when we combined the genotyped SNP and the most significant imputed SNP in IL2RB in current smokers and a 36% similar increase in risk for former smokers associated with genotyped and imputed BCL2L14 SNPs.
Before they can be applied for risk prediction efforts, these SNPs should be subject to further external replication and more extensive fine mapping studies.
Inflammation SNPS; lung cancer; smokers
Recent evidence suggests that inflammation plays a pivotal role in the development of lung cancer. In this study, we used a two-stage approach to investigate associations between genetic variants in inflammation pathways and lung cancer risk based on genome-wide association study (GWAS) data. A total of 7,650 sequence variants from 720 genes relevant to inflammation pathways were identified using keyword and pathway searches from Gene Cards and Gene Ontology databases. In Stage 1, six GWAS datasets from the International Lung Cancer Consortium were pooled (4,441 cases and 5,094 controls of European ancestry), and a hierarchical modeling (HM) approach was used to incorporate prior information for each of the variants into the analysis. The prior matrix was constructed using (1) role of genes in the inflammation and immune pathways; (2) physical properties of the variants including the location of the variants, their conservation scores and amino acid coding; (3) LD with other functional variants and (4) measures of heterogeneity across the studies. HM affected the priority ranking of variants particularly among those having low prior weights, imprecise estimates and/or heterogeneity across studies. In Stage 2, we used an independent NCI lung cancer GWAS study (5,699 cases and 5,818 controls) for in silico replication. We identified one novel variant at the level corrected for multiple comparisons (rs2741354 in EPHX2 at 8q21.1 with p value = 7.4 × 10−6), and confirmed the associations between TERT (rs2736100) and the HLA region and lung cancer risk. HM allows for prior knowledge such as from bioinformatic sources to be incorporated into the analysis systematically, and it represents a complementary analytical approach to the conventional GWAS analysis.
Suboptimal cellular DNA repair capacity (DRC) has been shown to be associated with enhanced cancer risk, but genetic variants affecting the DRC phenotype have not been comprehensively investigated. In this study, with the available DRC phenotype data, we analyzed correlations between the DRC phenotype and genotypes detected by the Illumina 317K platform in 1,774 individuals of European ancestry from a Texas lung cancer genome-wide association study. The discovery phase was followed by a replication in an independent set of 1,374 cases and controls of European ancestry. We applied a generalized linear model with SNPs as predictors and DRC (a continuous variable) as the outcome. Covariates of age, sex, pack-years of smoking, DRC assay-related variables and case-control status of the study participants were adjusted in the model. We validated that reduced DRC was associated with an increased risk of lung cancer in both independent datasets. Several suggestive loci that contributed to the DRC phenotype were defined in ERCC2/XPD, PHACTR2 and DUSP1. In summary, we determined that DRC is an independent risk factor for lung cancer and we defined several genetic loci contributing to DRC phenotype.
DNA repair capacity; genetic susceptibility; genome-wide association; molecular epidemiology
Genome-wide association studies (GWAS) have identified 36 loci associated with body mass index (BMI), predominantly in populations of European ancestry. We conducted a meta-analysis to examine the association of >3.2 million SNPs with BMI in 39,144 men and women of African ancestry, and followed up the most significant associations in an additional 32,268 individuals of African ancestry. We identified one novel locus at 5q33 (GALNT10, rs7708584, p=3.4×10−11) and another at 7p15 when combined with data from the Giant consortium (MIR148A/NFE2L3, rs10261878, p=1.2×10−10). We also found suggestive evidence of an association at a third locus at 6q16 in the African ancestry sample (KLHL32, rs974417, p=6.9×10−8). Thirty-two of the 36 previously established BMI variants displayed directionally consistent effect estimates in our GWAS (binomial p=9.7×10−7), of which five reached genome-wide significance. These findings provide strong support for shared BMI loci across populations as well as for the utility of studying ancestrally diverse populations.
Risk prediction models are useful in clinical decision making. We have published an internally validated prediction tool for lung cancer based on easily obtainable epidemiologic and clinical data. Because the precision of the model was modest, we now estimate the improvement obtained by adding two markers of DNA repair capacity.
Assay data (host-cell reactivation and mutagen sensitivity) were available for 725 White lung cancer cases and 615 controls, all former or current smokers, a subset of cases and controls from the previous analysis. Multivariable models were constructed from the original variables with addition of the biomarkers separately and together. Pairwise comparisons of the area under the receiver operating characteristic curves (AUC) and 3-fold cross-validations were done.
For former smokers, the AUC and 95% confidence intervals were 0.67 (0.63–0.71) for the baseline model and 0.70 (0.66–0.74) for the expanded model. For current smokers, the comparable AUC values were 0.68 (0.64–0.72) and 0.73 (0.69–0.77). For both groups, the expanded models were statistically significantly better than the baseline models (P = 0.006 and P = 0.0048, respectively), although the increases in the concordance statistics were modest. We also recomputed 1-year absolute risks of lung cancer as described previously for two different risk profiles and showed that individuals who exhibited poor repair capacity or heightened mutagen sensitivity had increased absolute risks of lung cancer.
Addition of biomarker assays improved the sensitivity of the expanded models.
Common variants in the nicotinic acetylcholine receptor gene cluster on chromosome 15q24-25.1 were associated with lung cancer risk in three recently published independently conducted genome-wide association studies, with no consensus as to the relative impact of the variants on the propensity to smoke vs a direct carcinogenic effect. To further explore our hypothesis that these variants are indeed associated with both cancer causation and nicotine dependence, we performed a more detailed analysis of the association of these putative risk genotypes with smoking phenotype, as well as in lifetime never smokers, and in other smoking-related cancers. We demonstrate a statistically significant association of the variants with both nicotine dependence, as well as lung cancer phenotypes, including earlier age at lung cancer onset. The variants were associated with higher risks of lung cancer in lower smoking-exposed strata, and in individuals with a strong family history of lung or smoking-related cancers. In contrast, we found no evidence that the variants were associated with elevated risks in 547 lifetime never-smoking lung cancer case subjects, nor in other smoking-related cancers (bladder and renal). Thus, we conclude that the variants are implicated both in smoking behavior and more directly in lung cancer risk.
Asbestos exposure is a known risk factor for lung cancer. Although recent genome-wide association studies (GWASs) have identified some novel loci for lung cancer risk, few addressed genome-wide gene–environment interactions. To determine gene–asbestos interactions in lung cancer risk, we conducted genome-wide gene–environment interaction analyses at levels of single nucleotide polymorphisms (SNPs), genes and pathways, using our published Texas lung cancer GWAS dataset. This dataset included 317 498 SNPs from 1154 lung cancer cases and 1137 cancer-free controls. The initial SNP-level P-values for interactions between genetic variants and self-reported asbestos exposure were estimated by unconditional logistic regression models with adjustment for age, sex, smoking status and pack-years. The P-value for the most significant SNP rs13383928 was 2.17×10–6, which did not reach the genome-wide statistical significance. Using a versatile gene-based test approach, we found that the top significant gene was C7orf54, located on 7q32.1 (P = 8.90×10–5). Interestingly, most of the other significant genes were located on 11q13. When we used an improved gene-set-enrichment analysis approach, we found that the Fas signaling pathway and the antigen processing and presentation pathway were most significant (nominal P < 0.001; false discovery rate < 0.05) among 250 pathways containing 17 572 genes. We believe that our analysis is a pilot study that first describes the gene–asbestos interaction in lung cancer risk at levels of SNPs, genes and pathways. Our findings suggest that immune function regulation-related pathways may be mechanistically involved in asbestos-associated lung cancer risk.
Abbreviations:CIconfidence intervalEenvironmentFDRfalse discovery rateGgeneGSEAgene-set-enrichment analysisGWASgenome-wide association studiesi-GSEAimproved gene-set-enrichment analysis approachORodds ratioSNPsingle nucleotide polymorphism
Genome-wide association studies have identified variants on chromosome 15q25.1 that increase the risks of both lung cancer and nicotine dependence and associated smoking behavior. However, there remains debate as to whether the association with lung cancer is direct or is mediated by pathways related to smoking behavior. Here, the authors apply a novel method for mediation analysis, allowing for gene-environment interaction, to a lung cancer case-control study (1992–2004) conducted at Massachusetts General Hospital using 2 single nucleotide polymorphisms, rs8034191 and rs1051730, on 15q25.1. The results are validated using data from 3 other lung cancer studies. Tests for additive interaction (P = 2 × 10−10 and P = 1 × 10−9) and multiplicative interaction (P = 0.01 and P = 0.01) were significant. Pooled analyses yielded a direct-effect odds ratio of 1.26 (95% confidence interval (CI): 1.19, 1.33; P = 2 × 10−15) for rs8034191 and an indirect-effect odds ratio of 1.01 (95% CI: 1.00, 1.01; P = 0.09); the proportion of increased risk mediated by smoking was 3.2%. For rs1051730, direct- and indirect-effect odds ratios were 1.26 (95% CI: 1.19, 1.33; P = 1 × 10−15) and 1.00 (95% CI: 0.99, 1.01; P = 0.22), respectively, with a proportion mediated of 2.3%. Adjustment for measurement error in smoking behavior allowing up to 75% measurement error increased the proportions mediated to 12.5% and 9.2%, respectively. These analyses indicate that the association of the variants with lung cancer operates primarily through other pathways.
gene-environment interaction; lung neoplasms; mediation; pathway analysis; smoking
The common disease/common variant hypothesis has been popular for describing the genetic architecture of common human diseases for several years. According to the originally stated hypothesis, one or a few common genetic variants with a relatively large effect size control the risk of common diseases. A growing body of evidence, however, suggests that rare single-nucleotide polymorphisms (SNPs), i.e., those with a minor allele frequency of less than 5%, are also an important component of the genetic architecture of common human diseases. In this study, we analyzed the relevance of rare SNPs to the risk of common disease from an evolutionary perspective and found that rare SNPs are more likely than common SNPs to be functional and tend to have a stronger effect size than do common SNPs. This observation, plus the fact that most of the SNPs in the human genome are rare, suggests that rare SNPs are a crucial element of the genetic architecture of common human diseases. We propose that the next generation of genomic studies should focus on analyzing rare SNPs. Further, targeting patients with a family history of the disease, an extreme phenotype, or early disease onset may facilitate the detection of risk-associated rare SNPs.
Single Nucleotide Polymorphisms (SNPs); Genome Wide Association Studies (GWAS); Minor Allele Frequency (MAF); negative selection
We explored the contribution of nitrosamine metabolism to lung cancer in a pilot investigation of genetic variation in CYP2B6, a high-affinity enzymatic activator of tobacco-specific nitrosamines with a negligible role in nicotine metabolism. Previously we found that variation in CYP2A6 and CHRNA5-CHRNA3-CHRNB4 combined to increase lung cancer risk in a case-control study in European American ever-smokers (n = 860). However, these genes are involved in the pharmacology of both nicotine, through which they alter smoking behaviours, and carcinogenic nitrosamines. Herein, we separated participants by CYP2B6 genotype into a high- vs. low-risk group (*1/*1 + *1/*6 vs. *6/*6). Odds ratios estimated through logistic regression modeling were 1.25 (95% CI 0.68–2.30), 1.27 (95% CI 0.89–1.79) and 1.56 (95% CI 1.04–2.31) for CYP2B6, CYP2A6 and CHRNA5-CHRNA3-CHRNB4, respectively, with negligible differences when all genes were evaluated concurrently. Modeling the combined impact of high-risk genotypes yielded odds ratios that rose from 2.05 (95% CI 0.39–10.9) to 2.43 (95% CI 0.47–12.7) to 3.94 (95% CI 0.72–21.5) for those with 1, 2 and 3 vs. 0 high-risk genotypes, respectively. Findings from this pilot point to genetic variation in CYP2B6 as a lung cancer risk factor supporting a role for nitrosamine metabolic activation in the molecular mechanism of lung carcinogenesis.
CYP2B6; CYP2A6; CHRNA5-CHRNA3-CHRNB4; tobacco specific nitrosamines; lung cancer risk; genetic variation
Tumor size at diagnosis (TSD) indirectly reflects tumor growth rate. The relationship between TSD and smoking is poorly understood. The aim of the study was to determine the relationship between smoking and TSD. We reviewed 1712 newly diagnosed and previously untreated non-small cell lung cancer (NSCLC) patients’ electronic medical records and collected tumor characteristics. Demographic and epidemiologic characteristics were derived from questionnaires administered during personal interviews. Univariate and multivariate linear regression models were used to evaluate the relationship between TSD and smoking controlling for demographic and clinical factors. We also investigated the relationship between the rs1051730 SNP in an intron of the CHRNA3 gene (the polymorphism most significantly associated with lung cancer risk and smoking behavior) and TSD. We found a strong dose dependent relationship between TSD and smoking. Current smokers had largest and never smokers smallest TSD with former smokers having intermediate TSD. In the multivariate linear regression model, smoking status (never, former, and current), histological type (adenocarcinoma vs SqCC), and gender were significant predictors of TSD. Smoking duration and intensity may explain the gender effect in predicting TSD. We found that the variant allele of rs1051730 in CHRNA3 gene was associated with larger TSD of squamous cell carcinoma. In the multivariate linear regression model, both rs1051730 and smoking were significant predictors for the size of squamous carcinomas. We conclude that smoking is positively associated with lung tumor size at the moment of diagnosis.
Lung cancer; tumor size; epidemiologic characteristics; risk factors; CHRNA3
Genetic variants located at 15q25, including those in the cholinergic receptor nicotinic cluster (CHRNA5) have been implicated in both lung cancer risk and nicotine dependence in recent genome-wide association studies. Among these variants, a 22 base pair insertion/deletion, rs3841324 showed the strongest association with CHRNA5 mRNA expression levels. However the influence of rs3841324 on lung cancer risk has not been studied in depth.
We have therefore evaluated the association of rs3841324 genotypes with lung cancer risk in a case-control study of 624 Caucasian subjects with lung cancer and 766 age- and sex-matched cancer-free Caucasian controls. We also evaluated the joint effects of rs3841324 with single-nucleotide polymorphisms (SNPs) rs16969968 and rs8034191 in the 15q25 region that have been consistently implicated in lung cancer risk.
We found that the homozygous genotype with both short alleles (SS) of rs3841324 was associated with a decreased lung cancer risk in female ever smokers relative to the homozygous wild-type (LL) and heterozygous (LS) genotypes combined in a recessive model (OR adjusted = 0.55, 95% CI = 0.31–0.89, P = 0.0168). There was no evidence for a sex difference in the association between this variant and cigarettes smoked per day (CPD). Diplotype analysis of rs3841324 with either rs16969968 or rs8034191 showed that these polymorphisms influenced the lung cancer risk independently.
Conclusions and impact
This study has shown a sex difference in the association between the 15q25 variant rs3841324 and lung cancers. Further research is warranted to elucidate the mechanisms underlying these observations.
lung cancer; CHRNA5; Chromosome 15q25; rs3841324; sex-specific association
Platinum-based regimens are the standard chemotherapy for patients with advanced non–small-cell lung cancer (NSCLC). DNA repair capacity (DRC) in tumor cells plays an important role in resistance to platinum-based drugs. We have previously reported that efficient DRC, as assessed by an in vitro lymphocyte-based assay, was a determinant of poor survival in patients with NSCLC in a relatively small data set. In this larger independent study of 591 patients with NSCLC, we further evaluated whether DRC in peripheral lymphocytes predicts survival of patients with NSCLC who receive platinum-based chemotherapy.
Patients and Methods
All patients were recruited at The University of Texas MD Anderson Cancer Center and donated blood samples before the start of any chemotherapy. We measured DRC in cultured T lymphocytes by using the host-cell reactivation assay, and we assessed associations between DRC in peripheral lymphocytes and survival of patients with NSCLC who were treated with first-line platinum-based chemotherapy.
We found an inverse association between DRC in peripheral lymphocytes and patient survival. Compared with patients in the low tertile of DRC, patients with NSCLC in the high tertile of DRC had significantly worse overall and 3-year survival (adjusted hazard ratio [HR], 1.33; 95% CI, 1.04 to 1.71; P = .023; and HR, 1.35; 95% CI, 1.04 to 1.76; P = .025, respectively). This trend was more pronounced in patients with early-stage tumors, adenocarcinoma, or squamous cell carcinoma.
We confirmed that DRC in peripheral lymphocytes is an independent predictor of survival for patients with NSCLC treated with platinum-based chemotherapy.
A mediation model explores the direct and indirect effects between an independent variable and a dependent variable by including other variables (or mediators). Mediation analysis has recently been used to dissect the direct and indirect effects of genetic variants on complex diseases using case-control studies. However, bias could arise in the estimations of the genetic variant-mediator association because the presence or absence of the mediator in the study samples is not sampled following the principles of case-control study design. In this case, the mediation analysis using data from case-control studies might lead to biased estimates of coefficients and indirect effects. In this article, we investigated a multiple-mediation model involving a three-path mediating effect through two mediators using case-control study data. We propose an approach to correct bias in coefficients and provide accurate estimates of the specific indirect effects. Our approach can also be used when the original case-control study is frequency matched on one of the mediators. We employed bootstrapping to assess the significance of indirect effects. We conducted simulation studies to investigate the performance of the proposed approach, and showed that it provides more accurate estimates of the indirect effects as well as the percent mediated than standard regressions. We then applied this approach to study the mediating effects of both smoking and chronic obstructive pulmonary disease (COPD) on the association between the CHRNA5-A3 gene locus and lung cancer risk using data from a lung cancer case-control study. The results showed that the genetic variant influences lung cancer risk indirectly through all three different pathways. The percent of genetic association mediated was 18.3% through smoking alone, 30.2% through COPD alone, and 20.6% through the path including both smoking and COPD, and the total genetic variant-lung cancer association explained by the two mediators was 69.1%.
Chromosome 5p15.33 has been identified by genome-wide association studies as one of the regions that associate with lung cancer risk. A few single-nucleotide polymorphisms (SNPs) in the telomerase reverse transcriptase (TERT) and cleft lip and palate transmembrane 1-like (CLPTM1L) genes located in this region have shown consistent associations. We performed dense genotyping of SNPs in this region to refine the previously reported association signals for lung cancer risk. Two hundred and fifteen SNPs were genotyped on an Illumina iSelect panel, in a hospital-based case–control study of 1681 lung cancer cases and 1235 unaffected controls. Association was tested using unconditional logistic regression, while adjusting for age, sex and pack-years smoked. Furthermore, since many of the SNPs were in linkage disequilibrium (LD), haplotype blocks were constructed, from which tagging SNPs at an r2 threshold of ≥0.95 were included in a stepwise forward selection logistic regression model. Of the 215 SNPs, 69 were significant at P < 0.05 in univariate analysis; of these, 35 SNPs meeting the r2 threshold were included in the multiple logistic regression model. Two SNPs, rs370348 (odds ratio = 0.76, P = 1.6 × 10−6) and rs4975538 (odds ratio = 1.18, P = 0.005), significantly associated with risk in the overall sample. Among ever smokers, rs4975615 (odds ratio = 0.75, P = 1.2 × 10−4) and rs4975538 (odds ratio = 1.26, P = 0.002) were significant, whereas among never-smokers, rs451360 (odds ratio = 0.62, P = 7.6 × 10−5) was significant. We refined the consistent association signal in this region, allowing for the considerable LD between SNPs and identified four novel SNPs that were independently and significantly associated with lung cancer risk. Results of these analyses strongly suggest effects on risk from several loci in the TERT/CLPTM1L region.
Genetic variations in the CYP2A6 nicotine metabolic gene and the CHRNA5-CHRNA3-CHRNB4 (CHRNA5-A3-B4) nicotinic gene cluster have been independently associated with lung cancer. With genotype data from ever-smokers of European ancestry (417 lung cancer patients and 443 control subjects), we investigated the relative and combined associations of polymorphisms in these two genes with smoking behavior and lung cancer risk. Kruskal–Wallis tests were used to compare smoking variables among the different genotype groups, and odds ratios (ORs) for cancer risk were estimated using logistic regression analysis. All statistical tests were two-sided. Cigarette consumption (P < .001) and nicotine dependence (P = .036) were the highest in the combined CYP2A6 normal metabolizers and CHRNA5-A3-B4 AA (tag single-nucleotide polymorphism rs1051730 G>A) risk group. The combined risk group also exhibited the greatest lung cancer risk (OR = 2.03; 95% confidence interval [CI] = 1.21 to 3.40), which was even higher among those who smoked 20 or fewer cigarettes per day (OR = 3.03; 95% CI = 1.38 to 6.66). Variation in CYP2A6 and CHRNA5-A3-B4 was independently and additively associated with increased cigarette consumption, nicotine dependence, and lung cancer risk. CYP2A6 and CHRNA5-A3-B4 appear to be more strongly associated with smoking behaviors and lung cancer risk, respectively.
While lung cancer is largely caused by tobacco smoking, inherited genetic factors play a role in its etiology. Genome-wide association studies (GWAS) in Europeans have robustly demonstrated only three polymorphic variations influencing lung cancer risk. Tumor heterogeneity may have hampered the detection of association signal when all lung cancer subtypes were analyzed together. In a GWAS of 5,355 European smoking lung cancer cases and 4,344 smoking controls, we conducted a pathway-based analysis in lung cancer histologic subtypes with 19,082 SNPs mapping to 917 genes in the HuGE-defined “inflammation” pathway. We identified a susceptibility locus for squamous cell lung carcinoma (SQ) at 12p13.33 (RAD52, rs6489769), and replicated the association in three independent samples totaling 3,359 SQ cases and 9,100 controls (odds ratio=1.20, Pcombined=2.3×10−8).
The combination of pathway-based approaches and information on disease specific subtypes can improve the identification of cancer susceptibility loci in heterogeneous diseases.
Lung cancer; histology; squamous cell carcinoma; pathway analysis; RAD52
Interindividual variation in genetic background may influence the response to chemotherapy and overall survival for patients with advanced-stage non–small cell lung cancer (NSCLC).
To identify genetic variants associated with poor overall survival in these patients, we conducted a genome-wide scan of 307 260 single-nucleotide polymorphisms (SNPs) in 327 advanced-stage NSCLC patients who received platinum-based chemotherapy with or without radiation at the University of Texas MD Anderson Cancer Center (the discovery population). A fast-track replication was performed for 315 patients from the Mayo Clinic followed by a second validation at the University of Pittsburgh in 420 patients enrolled in the Spanish Lung Cancer Group PLATAX clinical trial. A pooled analysis combining the Mayo Clinic and PLATAX populations or all three populations was also used to validate the results. We assessed the association of each SNP with overall survival by multivariable Cox proportional hazard regression analysis. All statistical tests were two-sided.
SNP rs1878022 in the chemokine-like receptor 1 (CMKLR1) was statistically significantly associated with poor overall survival in the MD Anderson discovery population (hazard ratio [HR] of death = 1.59, 95% confidence interval [CI] = 1.32 to 1.92, P = 1.42 × 10−6), in the PLATAX clinical trial (HR of death = 1.23, 95% CI = 1.00 to 1.51, P = .05), in the pooled Mayo Clinic and PLATAX validation (HR of death = 1.22, 95% CI = 1.06 to 1.40, P = .005), and in pooled analysis of all three populations (HR of death = 1.33, 95% CI = 1.19 to 1.48, P = 5.13 × 10−7). Carrying a variant genotype of rs10937823 was associated with decreased overall survival (HR of death = 1.82, 95% CI = 1.42 to 2.33, P = 1.73 × 10−6) in the pooled MD Anderson and Mayo Clinic populations but not in the PLATAX trial patient population (HR of death = 0.96, 95% CI = 0.69 to 1.35).
These results have the potential to contribute to the future development of personalized chemotherapy treatments for individual NSCLC patients.