Genome-wide association studies of European and East Asian populations have identified lung cancer susceptibility loci on chromosomes 5p15.33, 6p22.1-p21.31 and 15q25.1. We investigated whether these regions contain lung cancer susceptibly loci in African-Americans refined previous association signals by utilizing the reduced linkage disequilibrium observed in African-Americans.
1308 African-American cases and 1241 African-American controls from three centers were genotyped for 760 single nucleotide polymorphisms spanning three regions, and additional SNP imputation was performed. Associations between polymorphisms and lung cancer risk were estimated using logistic regression, stratified by tumor histology where appropriate.
The strongest associations were observed on 15q25.1 in/near CHRNA5, including a missense substitution (rs16969968: OR = 1.57, 95% CI = 1.25–1.97, P = 1.1 × 10−4) and variants in the 5′-UTR. Associations on 6p22.1-p21.31 were histology-specific and included a missense variant in BAT2 associated with squamous-cell carcinoma (rs2736158: OR = 0.64, 95% CI = 0.48–0.85, P = 1.82 × 10−3). Associations on 5p15.33 were detected near TERT, the strongest of which was rs2735940 (OR = 0.82, 95% CI = 0.73–0.93, P = 1.1 × 10−3). This association was stronger among cases with adenocarcinoma (OR = 0.75, 95% CI = 0.65–0.86, P = 8.1 × 10−5).
Polymorphisms in 5p15.33, 6p22.1-p21.31 and 15q25.1 are associated with lung cancer in African-Americans. Variants on 5p15.33 are stronger risk factors for adenocarcinoma and variants on 6p21.33 associated only with squamous-cell carcinoma.
Results implicate the BAT2, TERT and CHRNA5 genes in the pathogenesis of specific lung cancer histologies.
Lung cancer; adenocarcinoma; squamous-cell carcinoma; fine-mapping; African-American; genetic association
Recent meta-analyses of European ancestry subjects show strong evidence for association between smoking quantity and multiple genetic variants on chromosome 15q25. This meta-analysis extends the examination of association between distinct genes in the CHRNA5-CHRNA3-CHRNB4 region and smoking quantity to Asian and African American populations to confirm and refine specific reported associations.
Association results for a dichotomized cigarettes smoked per day (CPD) phenotype in 27 datasets (European ancestry (N=14,786), Asian (N=6,889), and African American (N=10,912) for a total of 32,587 smokers) were meta-analyzed by population and results were compared across all three populations.
We demonstrate association between smoking quantity and markers in the chromosome 15q25 region across all three populations, and narrow the region of association. Of the variants tested, only rs16969968 is associated with smoking (p < 0.01) in each of these three populations (OR=1.33, 95%C.I.=1.25–1.42, p=1.1×10−17 in meta-analysis across all population samples). Additional variants displayed a consistent signal in both European ancestry and Asian datasets, but not in African Americans.
The observed consistent association of rs16969968 with heavy smoking across multiple populations, combined with its known biological significance, suggests rs16969968 is most likely a functional variant that alters risk for heavy smoking. We interpret additional association results that differ across populations as providing evidence for additional functional variants, but we are unable to further localize the source of this association. Using the cross-population study paradigm provides valuable insights to narrow regions of interest and inform future biological experiments.
smoking; genetics; meta-analysis; cross-population
Studies in European and East Asian populations have identified lung cancer susceptibility loci in nicotinic acetylcholine receptor (nAChR) genes on chromosome 15q25.1 which also appear to influence smoking behaviors. We sought to determine if genetic variation in nAChR genes influences lung cancer susceptibly in African-Americans, and evaluated the association of these cancer susceptibility loci with smoking behavior. A total of 1308 African-Americans with lung cancer and 1241 African-American controls from three centers were genotyped for 378 single nucleotide polymorphisms (SNPs) spanning the sixteen human nAChR genes. Associations between SNPs and the risk of lung cancer were estimated using logistic regression, adjusted for relevant covariates. Seven SNPs in three nAChR genes were significantly associated with lung cancer at a strict Bonferroni-corrected level, including a novel association on chromosome 2 near the promoter of CHRNA1 (rs3755486: OR = 1.40, 95% CI = 1.18-1.67, P = 1.0 × 10−4). Association analysis of an additional 305 imputed SNPs on 2q31.1 supported this association. Publicly available expression data demonstrated that the rs3755486 risk allele correlates with increased CHRNA1 gene expression. Additional SNP associations were observed on 15q25.1 in genes previously associated with lung cancer, including a missense variant in CHRNA5 (rs16969968: OR = 1.60, 95% CI = 1.27-2.01, P = 5.9 × 10−5). Risk alleles on 15q25.1 also correlated with an increased number of cigarettes smoked per day among the controls. These findings identify a novel lung cancer risk locus on 2q31.1 which correlates with CHRNA1 expression and replicate previous associations on 15q25.1 in African-Americans.
Lung cancer; nicotine dependence; African-Americans; genetic association; smoking
We outline an integrative approach to extend the boundaries of molecular cancer epidemiology by integrating modern and rapidly evolving “omics” technologies into state-of-the-art molecular epidemiology. In this way, one can comprehensively explore the mechanistic underpinnings of epidemiologic observations into cancer risk and outcome. We highlight the exciting opportunities to collaborate across large observational studies and to forge new interdisciplinary collaborative ventures.
multidisciplinary epidemiologic research; integrating new technologies
Tobacco-induced lung cancer is characterized by a deregulated inflammatory microenvironment. Variants in multiple genes in inflammation pathways may contribute to risk of lung cancer.
We therefore conducted a three-stage comprehensive pathway analysis (discovery, replication and meta-analysis) of inflammation gene variants in ever smoking lung cancer cases and controls. A discovery set (1096 cases; 727 controls) and an independent and non-overlapping internal replication set (1154 cases; 1137 controls) were derived from an ongoing case-control study. For discovery, we used an iSelect BeadChip to interrogate a comprehensive panel of 11737 inflammation pathway SNPs and selected nominally significant (p<0.05) SNPs for internal replication.
There were 6 SNPs that achieved statistical significance (p<0.05) in the internal replication dataset with concordant risk estimates for former smokers and 5 concordant and replicated SNPs in current smokers. Replicated hits were further tested in a subsequent meta-analysis using external data derived from two published GWAS and a case-control study. Two of these variants (a BCL2L14 SNP in former smokers and a SNP in IL2RB in current smokers) were further validated. In risk score analyses, there was a 26% increase in risk with each additional adverse allele when we combined the genotyped SNP and the most significant imputed SNP in IL2RB in current smokers and a 36% similar increase in risk for former smokers associated with genotyped and imputed BCL2L14 SNPs.
Before they can be applied for risk prediction efforts, these SNPs should be subject to further external replication and more extensive fine mapping studies.
Inflammation SNPS; lung cancer; smokers
Many studies examining genetic influences on physical activity (PA) have evaluated the impact of single nucleotide polymorphisms (SNPs) related to the development of lifestyle-related chronic diseases, under the hypothesis that they would be associated with PA. However, PA is a multi-determined behavior and associated with a multitude of health consequences. Thus, examining a broader range of candidate genes associated with a boarder range of PA correlates may provide new insights into the genetic underpinnings of PA. In this study we focus on one such correlate – sensation seeking behavior. Participants (N=1,130 Mexican origin youth) provided a saliva sample and data on PA and sensation seeking tendencies in 2008–09. Participants were genotyped for 630 functional and tagging variants in the dopamine, serotonin, and cannabinoid pathways. Overall 30% of participants (males – 37.6%; females – 22.0%) reported ≥60 minutes of PA on five out of seven days. After adjusting for gender, age and population stratification, and applying the Bayesian False Discovery Probability approach for assessing noteworthiness, four gene variants were significantly associated with PA. In a multivariable model, being male, having higher sensation seeking tendencies and at least one copy of the minor allele for SNPs in ACE (rs8066276 OR=1.44; p=0.012) and TPH2 (rs11615016 OR=1.73; p=0.021) were associated with increased likelihood of meeting PA recommendations. Participants with at least one copy of the minor allele for SNPs in SNAP25 (rs363035 OR=0.53; p=0.005) and CNR1 (rs6454672 OR=0.62; p=0.022) have decreased likelihood of meeting PA recommendations. Our findings extend current knowledge of the complex relationship between PA and possible genetic underpinnings.
Physical Activity; Genes; Sensation Seeking; Mexican origin youth
Suboptimal cellular DNA repair capacity (DRC) has been shown to be associated with enhanced cancer risk, but genetic variants affecting the DRC phenotype have not been comprehensively investigated. In this study, with the available DRC phenotype data, we analyzed correlations between the DRC phenotype and genotypes detected by the Illumina 317K platform in 1,774 individuals of European ancestry from a Texas lung cancer genome-wide association study. The discovery phase was followed by a replication in an independent set of 1,374 cases and controls of European ancestry. We applied a generalized linear model with SNPs as predictors and DRC (a continuous variable) as the outcome. Covariates of age, sex, pack-years of smoking, DRC assay-related variables and case-control status of the study participants were adjusted in the model. We validated that reduced DRC was associated with an increased risk of lung cancer in both independent datasets. Several suggestive loci that contributed to the DRC phenotype were defined in ERCC2/XPD, PHACTR2 and DUSP1. In summary, we determined that DRC is an independent risk factor for lung cancer and we defined several genetic loci contributing to DRC phenotype.
DNA repair capacity; genetic susceptibility; genome-wide association; molecular epidemiology
Although tobacco exposure is the predominant risk factor for lung cancer, other environmental agents are established lung carcinogens. Measuring the genotoxic effect of environmental exposures remains equivocal as increases in morbidity and mortality may be attributed to co-exposures such as smoking.
We evaluated genetic instability and risk of lung cancer associated with exposure to environmental agents (e.g., exhaust) and smoking among 500 lung cancer cases and 500 controls using the Cytokinesis-Blocked Micronucleus (CBMN) assay. Linear regression was applied to estimate the adjusted means of the CBMN endpoints (micronuclei and nucleoplasmic bridges). Logistic regression analyses were used to estimate lung cancer risk and to control for potential confounding by age, gender, and smoking.
Cases showed significantly higher levels of micronuclei and nucleoplasmic bridges as compared to controls (mean ± SEM=3.54±0.04 vs.1.81 ±0.04 and mean ± SEM=4.26±0.03 vs. 0.99±0.03, respectively; p <0.001) with no differences among participants with or without reported environmental exposure. No differences were observed when stratified by smoking or environmental exposure among cases or controls. A difference in lung cancer risk was observed between non-exposed male and female heavy smokers, although it was not statistically significant (I2=64.9%; P-value for Q statistic=0.09).
Our study confirms that the CBMN assay is an accurate predictor of lung cancer and supports the premise that heavy smoking may have an effect on DNA repair capacity and in turn modulate the risk of lung cancer.
Identifying factors that increase lung cancer risk may lead to more effective prevention measures.
Lung cancer; CBMN assay; DNA damage; gender differences
Background and Methods
Familial aggregation of lung cancer exists after accounting for cigarette smoking. However, the extent to which family history affects risk by smoking status, histology, relative type and ethnicity is not well described. This pooled analysis included 24 case-control studies in the International Lung Cancer Consortium. Each study collected age of onset/interview, gender, race/ethnicity, cigarette smoking, histology and first-degree family history of lung cancer. Data from 24,380 lung cancer cases and 23,305 healthy controls were analyzed. Unconditional logistic regression models and generalized estimating equations were used to estimate odds ratios and 95% confidence intervals.
Individuals with a first-degree relative with lung cancer had a 1.51-fold increase in risk of lung cancer, after adjustment for smoking and other potential confounders(95% CI: 1.39, 1.63). The association was strongest for those with a family history in a sibling, after adjustment (OR=1.82, 95% CI: 1.62, 2.05). No modifying effect by histologic type was found. Never smokers showed a lower association with positive familial history of lung cancer (OR=1.25, 95% CI: 1.03, 1.52), slightly stronger for those with an affected sibling (OR=1.44, 95% CI: 1.07, 1.93), after adjustment.
The increased risk among never smokers and similar magnitudes of the effect of family history on lung cancer risk across histological types suggests familial aggregation of lung cancer is independent of those associated with cigarette smoking. While the role of genetic variation in the etiology of lung cancer remains to be fully characterized, family history assessment is immediately available and those with a positive history represent a higher risk group.
Asbestos exposure is a known risk factor for lung cancer. Although recent genome-wide association studies (GWASs) have identified some novel loci for lung cancer risk, few addressed genome-wide gene–environment interactions. To determine gene–asbestos interactions in lung cancer risk, we conducted genome-wide gene–environment interaction analyses at levels of single nucleotide polymorphisms (SNPs), genes and pathways, using our published Texas lung cancer GWAS dataset. This dataset included 317 498 SNPs from 1154 lung cancer cases and 1137 cancer-free controls. The initial SNP-level P-values for interactions between genetic variants and self-reported asbestos exposure were estimated by unconditional logistic regression models with adjustment for age, sex, smoking status and pack-years. The P-value for the most significant SNP rs13383928 was 2.17×10–6, which did not reach the genome-wide statistical significance. Using a versatile gene-based test approach, we found that the top significant gene was C7orf54, located on 7q32.1 (P = 8.90×10–5). Interestingly, most of the other significant genes were located on 11q13. When we used an improved gene-set-enrichment analysis approach, we found that the Fas signaling pathway and the antigen processing and presentation pathway were most significant (nominal P < 0.001; false discovery rate < 0.05) among 250 pathways containing 17 572 genes. We believe that our analysis is a pilot study that first describes the gene–asbestos interaction in lung cancer risk at levels of SNPs, genes and pathways. Our findings suggest that immune function regulation-related pathways may be mechanistically involved in asbestos-associated lung cancer risk.
Abbreviations:CIconfidence intervalEenvironmentFDRfalse discovery rateGgeneGSEAgene-set-enrichment analysisGWASgenome-wide association studiesi-GSEAimproved gene-set-enrichment analysis approachORodds ratioSNPsingle nucleotide polymorphism
The development of second primary tumors (SPT) or recurrence alters prognosis for curatively-treated head and neck squamous cell carcinoma (HNSCC) patients. 13-cis-retnoic acid (13-cRA) has been tested as a chemoprevention agent in clinical trials with mixed results. Therefore, we investigated if genetic variants in the PI3K/PTEN/AKT/MTOR pathway could serve as biomarkers to identify which patients are at high risk of an SPT/recurrence while also predicting response to 13-cRA chemoprevention.
A total of 137 pathway SNPs were genotyped in 440 patients from the Retinoid Head and Neck Second Primary Trial and assessed for SPT/recurrence risk and response to 13-cRA. Risk models were created based on epidemiology, clinical, and genetic data.
Twenty-two genetic loci were associated with increased SPT/recurrence risk with six also being associated with a significant benefit following chemoprevention. Combined analysis of these high-risk/high-benefit loci identified a significant (P = 1.54×10−4) dose-response relationship for SPT/recurrence risk, with patients carrying 4–5 high-risk genotypes having a 3.76-fold (95%CI:1.87–7.57) increase in risk in the placebo group (n=215). Patients carrying 4–5 high-risk loci showed the most benefit from 13-cRA chemoprevention with a 73% reduction in SPT/recurrence (95%CI:0.13–0.58) compared to those with the same number of high-risk genotypes who were randomized to receive placebo. Incorporation of these loci into a risk model significantly improved the discriminatory ability over models with epidemiology, clinical, and previously identified genetic variables.
These results demonstrate that loci within this important pathway could identify individuals with a high-risk/high-benefit profile and are a step towards personalized chemoprevention for HNSCC patients.
Although obesity has been directly linked to the development of many cancers, many epidemiological studies have found that body mass index (BMI)—a surrogate marker of obesity—is inversely associated with the risk of lung cancer. These studies are difficult to interpret because of potential confounding by cigarette smoking, a major risk factor for lung cancer that is associated with lower BMI.
We prospectively examined the association between BMI and the risk of lung cancer among 448 732 men and women aged 50–71 years who were recruited during 1995–1996 for the National Institutes of Health–AARP Diet and Health Study. BMI was calculated based on the participant’s self-reported height and weight on the baseline questionnaire. We identified 9437 incident lung carcinomas (including 415 in never smokers) during a mean follow-up of 9.7 years through 2006. Multivariable Cox proportional hazards regression models were used to estimate hazard ratios (HRs) and 95% confidence intervals (CIs) with adjustment for lung cancer risk factors, including smoking status. To address potential bias due to preexisting undiagnosed disease, we excluded potentially unhealthy participants in sensitivity analyses. All statistical tests were two-sided.
The crude incidence rate of lung cancer over the study follow-up period was 233 per 100 000 person-years among men and 192 per 100 000 person-years among women. BMI was inversely associated with the risk of lung cancer among both men and women (BMI ≥35 vs 22.5–24.99 kg/m2: HR = 0.81, 95% CI = 0.70 to 0.94 and HR = 0.73, 95% CI = 0.61 to 0.87, respectively). The inverse association was restricted to current and former smokers and was stronger after adjustment for smoking. Among smokers, the inverse association persisted even after finely stratifying on smoking status, time since quitting smoking, and number of cigarettes smoked per day. Sensitivity analyses did not support the possibility that the inverse association was due to prevalent undiagnosed disease.
Our results suggest that a higher BMI is associated with a reduced risk of lung cancer in current and former smokers. Our inability to attribute the inverse association between BMI and the risk of lung cancer to residual confounding by smoking or to bias suggests the need for considering other explanations.
Evolutionary aspects of the genetic architecture of common human diseases remain enigmatic. The results of more than 200 genome-wide association studies published to date were compiled in a catalog (http://www.genome.gov/26525384/). We used cataloged data to determine whether derived (mutant) alleles are associated with higher risk of human disease more frequently than ancestral alleles. We placed all allelic variants into ten categories of population frequency (0%–100%) in 10% increments. We then analyzed the relationship between allelic frequency, evolutionary status of the polymorphic site (ancestral versus derived), and disease risk status (risk versus protection). Given the same population frequency, derived alleles are more likely to be risk associated than ancestral alleles, as are rarer alleles. The common interpretation of this association is that negative selection prevents fixation of the risk variants. However, disease stratification as early or late onset suggests that weak selection against risk-associated alleles is unlikely a major factor shaping genetic architecture of common diseases. Our results clearly suggest that the duration of existence of an allele in a population is more important. Alleles existing longer tend to show weaker linkage disequilibrium with neighboring alleles, including the causal alleles, and are less likely to tag a SNP-disease association.
Genome-wide association studies; ancestral allele; derived allele; minor allele frequency
Genome-wide association studies have identified variants on chromosome 15q25.1 that increase the risks of both lung cancer and nicotine dependence and associated smoking behavior. However, there remains debate as to whether the association with lung cancer is direct or is mediated by pathways related to smoking behavior. Here, the authors apply a novel method for mediation analysis, allowing for gene-environment interaction, to a lung cancer case-control study (1992–2004) conducted at Massachusetts General Hospital using 2 single nucleotide polymorphisms, rs8034191 and rs1051730, on 15q25.1. The results are validated using data from 3 other lung cancer studies. Tests for additive interaction (P = 2 × 10−10 and P = 1 × 10−9) and multiplicative interaction (P = 0.01 and P = 0.01) were significant. Pooled analyses yielded a direct-effect odds ratio of 1.26 (95% confidence interval (CI): 1.19, 1.33; P = 2 × 10−15) for rs8034191 and an indirect-effect odds ratio of 1.01 (95% CI: 1.00, 1.01; P = 0.09); the proportion of increased risk mediated by smoking was 3.2%. For rs1051730, direct- and indirect-effect odds ratios were 1.26 (95% CI: 1.19, 1.33; P = 1 × 10−15) and 1.00 (95% CI: 0.99, 1.01; P = 0.22), respectively, with a proportion mediated of 2.3%. Adjustment for measurement error in smoking behavior allowing up to 75% measurement error increased the proportions mediated to 12.5% and 9.2%, respectively. These analyses indicate that the association of the variants with lung cancer operates primarily through other pathways.
gene-environment interaction; lung neoplasms; mediation; pathway analysis; smoking
Previous studies have reported that lung cancer risk may either be decreased, increased or unaffected by prior use of menopausal hormone therapy (MHT).
To examine this issue further, we examined relationships among 118,008 women, ages 50–71 years who were recruited during 1995–1996 for the NIH-AARP Diet and Health Study and in whom 2,097 incident lung carcinomas were identified during follow-up through 2006. Multivariable Cox proportional hazards models estimated relative risks (RR) and 95% confidence intervals (CIs) associated with various measures of self-reported MHT use.
We found no evidence that either estrogen therapy (ET)-only or estrogen plus progestin therapy (EPT) use was substantially related to subsequent lung cancer risk (respective RRs and 95% CIs for ever use = 0.97, 0.86–1.09 and 1.03, 0.90–1.17). There were no significant variations according to currency or duration of use of either formulation, nor was there evidence that risks varied within subgroups defined by cigarette smoking or body size. The absence of effect was seen for nearly all lung cancer subtypes, with the exception of an increased risk of undifferentiated/large cell cancers associated with long-term ET-only use (Ptrend=0.02), a relationship not observed among EPT users.
Our results failed to support any substantial alterations in lung cancer risk associated with use of either unopposed estrogen or estrogen plus progestin MHT, even when detailed exposure measures and other risk predictors were considered.
lung cancer; menopausal hormone therapy; risk; histology
In an analysis of 31,717 cancer cases and 26,136 cancer-free controls drawn from 13 genome-wide association studies (GWAS), we observed large chromosomal abnormalities in a subset of clones from DNA obtained from blood or buccal samples. Mosaic chromosomal abnormalities, either aneuploidy or copy-neutral loss of heterozygosity, of size >2 Mb were observed in autosomes of 517 individuals (0.89%) with abnormal cell proportions between 7% and 95%. In cancer-free individuals, the frequency increased with age; 0.23% under 50 and 1.91% between 75 and 79 (p=4.8×10−8). Mosaic abnormalities were more frequent in individuals with solid-tumors (0.97% versus 0.74% in cancer-free individuals, OR=1.25, p=0.016), with a stronger association for cases who had DNA collected prior to diagnosis or treatment (OR=1.45, p=0.0005). Detectable clonal mosaicism was common in individuals for whom DNA was collected at least one year prior to diagnosis of leukemia compared to cancer-free individuals (OR=35.4, p=3.8×10−11). These findings underscore the importance of the role and time-dependent nature of somatic events in the etiology of cancer and other late-onset diseases.
A mediation model explores the direct and indirect effects between an independent variable and a dependent variable by including other variables (or mediators). Mediation analysis has recently been used to dissect the direct and indirect effects of genetic variants on complex diseases using case-control studies. However, bias could arise in the estimations of the genetic variant-mediator association because the presence or absence of the mediator in the study samples is not sampled following the principles of case-control study design. In this case, the mediation analysis using data from case-control studies might lead to biased estimates of coefficients and indirect effects. In this article, we investigated a multiple-mediation model involving a three-path mediating effect through two mediators using case-control study data. We propose an approach to correct bias in coefficients and provide accurate estimates of the specific indirect effects. Our approach can also be used when the original case-control study is frequency matched on one of the mediators. We employed bootstrapping to assess the significance of indirect effects. We conducted simulation studies to investigate the performance of the proposed approach, and showed that it provides more accurate estimates of the indirect effects as well as the percent mediated than standard regressions. We then applied this approach to study the mediating effects of both smoking and chronic obstructive pulmonary disease (COPD) on the association between the CHRNA5-A3 gene locus and lung cancer risk using data from a lung cancer case-control study. The results showed that the genetic variant influences lung cancer risk indirectly through all three different pathways. The percent of genetic association mediated was 18.3% through smoking alone, 30.2% through COPD alone, and 20.6% through the path including both smoking and COPD, and the total genetic variant-lung cancer association explained by the two mediators was 69.1%.
Sensation seeking tendencies tend to manifest during adolescence and are associated with both health-compromising behaviors and health-enhancing behaviors. The purpose of this study is to evaluate the relationship between sensation seeking and physical activity, a health-enhancing behavior, and between sensation seeking and experimenting with cigarettes, a health compromising-behavior, among a cohort of Mexican origin adolescents residing in the United States with different levels of acculturation.
In 2009, 1,154 Mexican origin youth (50.5% girls, mean age 14.3 years (SD = 1.04)) provided data on smoking behavior, physical activity, linguistic acculturation, and sensation seeking. We conducted Pearson’s χ2 tests to examine the associations between categorical demographic characteristics (i.e. gender, age, country of birth and parental educational attainment) and both cigarette experimentation and physical activity and Student’s t-tests to examine mean differences on the continuous variables (i.e. sensation seeking subscale) by the behaviors. We examined mean differences in the demographic characteristics, acculturation, and both behaviors for each of the sensation seeking subscales using analysis of variance (ANOVA). To examine relationships between the sensation seeking subscales, gender, and both behaviors, at different levels of acculturation we completed unconditional logistic regression analyses stratified by level of acculturation.
Overall, 23.3% had experimented with cigarettes and 29.0% reported being physically active for at least 60 minutes/day on at least 5 days/week. Experimenting with cigarettes and being physically active were more prevalent among boys than girls. Among girls, higher levels of sensation seeking tendencies were associated with higher levels of acculturation and experimentation with cigarettes, but not with physical activity. Among boys, higher levels of sensation seeking tendencies were associated with higher levels of acculturation, experimenting with cigarettes and being physically active.
Our results suggest that interventions designed to prevent smoking among Mexican origin youth may need to address social aspects associated with acculturation, paying close attention to gendered manifestations of sensation seeking.
Smoking behavior; Physical activity; Acculturation; Sensation seeking; Gender; Mexican origin youth
Pathway analysis has been proposed as a complement to single SNP analyses in GWAS. This study compared pathway analysis methods using two lung cancer GWAS data sets based on four studies: one a combined data set from Central Europe and Toronto (CETO); the other a combined data set from Germany and MD Anderson (GRMD). We searched the literature for pathway analysis methods that were widely used, representative of other methods, and had available software for performing analysis. We selected the programs EASE, which uses a modified Fishers Exact calculation to test for pathway associations, GenGen (a version of Gene Set Enrichment Analysis (GSEA)), which uses a Kolmogorov-Smirnov-like running sum statistic as the test statistic, and SLAT, which uses a p-value combination approach. We also included a modified version of the SUMSTAT method (mSUMSTAT), which tests for association by averaging χ2 statistics from genotype association tests. There were nearly 18000 genes available for analysis, following mapping of more than 300,000 SNPs from each data set. These were mapped to 421 GO level 4 gene sets for pathway analysis. Among the methods designed to be robust to biases related to gene size and pathway SNP correlation (GenGen, mSUMSTAT and SLAT), the mSUMSTAT approach identified the most significant pathways (8 in CETO and 1 in GRMD). This included a highly plausible association for the acetylcholine receptor activity pathway in both CETO (FDR≤0.001) and GRMD (FDR = 0.009), although two strong association signals at a single gene cluster (CHRNA3-CHRNA5-CHRNB4) drive this result, complicating its interpretation. Few other replicated associations were found using any of these methods. Difficulty in replicating associations hindered our comparison, but results suggest mSUMSTAT has advantages over the other approaches, and may be a useful pathway analysis tool to use alongside other methods such as the commonly used GSEA (GenGen) approach.