PMCCPMCCPMCC

Search tips
Search criteria 

Advanced

 
Logo of hheKargerHomeAlertsResources
 
Hum Hered. 2011 February; 70(4): 292–300.
Published online 2011 February 3. doi:  10.1159/000323318
PMCID: PMC3085519

Genome-Wide Meta-Analysis of Joint Tests for Genetic and Gene-Environment Interaction Effects

Abstract

Background

There is growing interest in the study of gene-environment interactions in the context of genome-wide association studies (GWASs). These studies will likely require meta-analytic approaches to have sufficient power.

Methods

We describe an approach for meta-analysis of a joint test for genetic main effects and gene-environment interaction effects. Using simulation studies based on a meta-analysis of five studies (total n = 10,161), we compare the power of this test to the meta-analysis of marginal test of genetic association and the meta-analysis of standard 1 d.f. interaction tests across a broad range of genetic main effects and gene-environment interaction effects.

Results

We show that the joint meta-analysis is valid and can be more powerful than classical meta-analytic approaches, with a potential gain of power over 50% compared to the marginal test. The standard interaction test had less than 1% power in almost all the situations we considered. We also show that regardless of the test used, sample sizes far exceeding those of a typical individual GWAS will be needed to reliably detect genes with subtle gene-environment interaction patterns.

Conclusion

The joint meta-analysis is an attractive approach to discover markers which may have been missed by initial GWASs focusing on marginal marker-trait associations.

Key Words: Gene-environment interaction, Genome-wide scan, Meta-analysis, Case-control association analysis, Complex disease

Introduction

There is considerable enthusiasm for genome-wide gene-environment interaction studies, in part because such studies may uncover causal loci which marginal tests have low power to detect [1]. However, standard statistical tests for gene-environment interaction, based on departures from additivity on some scale, also require large sample sizes to have reasonable power. Some researchers have developed statistical methods to increase the power of these standard tests by leveraging additional assumptions, such as gene-environment independence in the sampled population [2, 3, 4]. We have proposed a joint test of both the genetic main effect and gene-environment interaction parameters that can be more powerful than either the marginal or standard interaction tests when the genetic effect is weak in one exposure stratum but strong in another [5].

Still, regardless of the test used, the sample sizes required to reliably detect what are likely to be subtle effects will be quite large – larger than typically available in any single study. Hence, detecting gene-environment interactions will likely require a meta-analytic approach, as has been necessary for the identification of smaller and smaller marginal genetic effects. Meta-analysis of a single gene-environment interaction parameter is straight-forward: just as with meta-analyses of marginal additive effects, one could use an inverse-variance weighted fixed-effect approach, for example [6, 7]. Meta-analysis of multiple parameters, required for the joint test, is less well known. Here we derive a general approach to simultaneously estimate global fixed effects for multiple parameters and test whether they are identically null. Specifically, we consider models with genetic main effect plus gene-environment interaction parameters, or, equivalently, models with separate genetic effect parameters for each exposure stratum. We also present a computationally simpler version of this approach in the special case where the exposure is categorical. Using simulation we show that this meta-analysis approach provides valid tests of the null hypothesis that a marker is not associated with a trait in any exposure stratum. We also explore the power of this test over a wide range of models consistent with genetic effects that have been observed in recent genome-wide association studies (GWASs).

Methods

Let equation m1 be the vector of estimated genetic main and gene-environment interaction effects from study i, obtained from fitting the generalized linear model:

equation m2
[A]

For example, for a continuous outcome, the link function g(x) = x is equivalent to standard linear regression, and for a binary outcome, the link g(x) = log(x/(1 – x)) is equivalent to logistic regression. For ease of exposition, the genotype is assumed to be coded additively (0, 1, or 2 copies of the minor allele) and the exposure is assumed to be binary, so that equation m3 is a scalar. The results are easily extended to categorical exposure with three or more levels, for example.

We note that for continuous exposures, both standard and joint tests can be invalid when the exposure main effect is mis-specified [8, 9]. Using the Huber-White robust ‘sandwich’ estimate of the variance-covariance matrix of equation m4 yields valid tests even when the continuous exposure is not modeled accurately. These robust covariance matrices are currently available from some software packages (e.g. ProbABEL [10]), but not others (e.g. PLINK [11]). Alternatively, breaking the continuous variable into categories (e.g. a binary indicator for high versus low exposure) also yields valid tests [8].

Following the exposition in van Houwelingen et al. [12] and assuming the sample size in every study is large enough so that equation m5 is multivariate normal with variance-covariance matrix Σi, we can write the log likelihood for the observed equation m6 as:

equation m7

One can solve for the maximum likelihood estimate equation m8 using the score estimating equation

equation m9

This leads to the weighted least square solution:

equation m10

Given equation m11, the Wald test of the joint null β = 0 is equation m12, where equation m13 is the Fisher information.

The score test also has a simple form:

equation m14
[B]

Both of these tests have a non-central χ2 distribution with 2 d.f. under the null hypothesis. (The assumption of normality is used here to motivate the estimation procedure; the test remains valid if the estimates equation m15 and Σi are unbiased but not normally distributed. Due to the consistency and asymptotic normality of maximum likelihood estimates, the study-specific estimates are likely unbiased and normally distributed when the individual sample sizes are ‘large enough’. Care should be taken when some studies have relatively small sample sizes, e.g. under 100 subjects.)

Although in principle this procedure yields valid and efficient tests of the joint null, in practice it may be difficult to implement, as it requires the covariance of equation m16 and equation m17. Standard packages for the analysis of GWAS data such as PLINK [11] or glu (http://code.google.com/p/glu-genetics/) do not report this covariance without custom modification. (The --robust option in ProbABEL [10] will produce estimates of the variance-covariance matrix.) To avoid this complication, one might conduct a stratified analysis in each study, i.e. estimate the genetic effect in the exposed subjects equation m18 and the effect in unexposed equation m19 separately. Since these estimates are uncorrelated (they are constructed using non-overlapping sets of subjects), the score test simplifies to:

equation m20
[C]

where

equation m21

and equation m22

Note that expression [C] is simply the sum of the usual fixed-effect (inverse-variance weighted) tests for equation m23.

To demonstrate this approach, we used genotype data from five GWASs conducted using European-ancestry subjects: case-control studies of breast cancer (BRCA), type 2 diabetes (T2D) and coronary heart disease (CHD) in the Nurses’ Health Study (NHS), and case-control studies of T2D and CHD in the Health Professionals Follow-up Study (HPFS) [13, 14]. For each subject in each GWAS, a continuous phenotype Y was simulated as a function of two single nuclear polymorphisms (SNP) and one binary environmental exposure:

equation m24

where G1 was the (observed) count of minor alleles for rs505922 (frequency from 34.1 to 35.8% in these studies), G2 was the (observed) count of minor alleles for rs1219648 (frequency from 39.6 to 42.0% in these studies), the exposure E was a Bernoulli 0-1 variable with expectation 0.3, and the residual variation [sm epsilon] was normally distributed with mean 0 and standard deviation σ. E and [sm epsilon]were independent of each other as well as G1 and G2; since rs505922 is on chromosome 9 and rs1219648 is on chromosome 10, G1 and G2 were also effectively independent. We chose this model to examine the performance of the joint meta-analysis (and the marginal and standard interaction meta-analyses) in two situations: one where the effect does not differ across the exposure strata, and one where it does. The two SNPs were chosen arbitrarily, but are representative of the markers present on genome-wide genotyping platforms (rs505922 is on the Affymetrix Axiom and Illumina HumanHap550 arrays, among others, and rs1219648 is on the Affymetrix 6.0, Illumina HumanHap550, and Illumina OmniExpress arrays, among others).

To illustrate the validity of the meta-analytic joint test, we conducted a GWAS of a single realization of Y. This GWAS included 2,543,290 genotyped or imputed SNPs distributed along the 22 autosomal chromosomes that were available for all of the five studies. Because most of these SNPs are not in linkage disequilibrium with G1 or G2 and hence are not associated with the simulated phenotype, results from tests of the phenotype can be used to estimate the type I error rate of the meta-analytic approaches. For this example, the parameters b = (bE, bG1, bG2, bG2E)’ and σ were chosen so that the marginal test of G1 would have high power at the 5 × 10−8 level in the total sample (n = 10,161), and the joint gene-environment test of G2 would have high power in the total sample, while the marginal test of G2 would have modest or low power. The values b = (0.05, 0.12, 0.05, 0.10)andσ = 1.00 satisfied these criteria. Under this model, the genetic variants explained 0.9% of the variation in Y (0.7% for G1 and 0.2% for G2). We estimated subsequently the power of the meta-analytic joint test (and the marginal and standard interaction meta-analyses test) to detect G1 and G2 in this specific situation, by generating 100,000 realizations of Y with these parameters.

To further exemplify the power of the meta-analytic joint test relative to meta-analytic marginal and standard 1 d.f. interaction tests across a broad range of situations, we simulated 1,000 realizations of Y under each of 320 different parameter settings. We kept the values of bG1, and σ fixed at their values in the previous simulation (0.12 and 1.00, respectively), while we varied bE, bG2 and bG2E across a grid defined by bE [set membership] {0.05, 0.1, 0.5, 1, 2}, bG2 [set membership] {0.01, 0.04, 0.07, 0.1}, and bG2E [set membership] {−0.15 to 0.15 by 0.2 step}.

As these studies were conducted using different genotyping platforms (the Illumina 550k for the breast cancer study and Affymetrix 6.0 for the other four studies), we imputed missing genotypes using MACH [15] and the HapMap (rel22) CEU data. We conducted GWAS for SNPs associated with Y using ProbABEL, which takes the dosage files from MACH as input. We compared three tests. In the first test, we estimated the marginal effect of each SNP in each study, and then combined these estimates using standard fixed-effect meta-analysis. In the second, we estimated the usual product interaction term in each study (adjusting for SNP and exposure main effects), and then combined the estimates of these parameters using fixed-effect meta-analysis. In the final test, we estimated SNP effects stratified by study and exposure, and calculated the overall joint test using expression [C] above. We also investigated the effect of conditioning on a main effect for E (but not including a G × E interaction term) on the performance of the marginal tests for G1 and G2.

Results

Estimated type I error derived from the genome-wide association of more than 2.5 million SNPs with a single realization of Y are presented in table table1.1. This table shows the observed and the expected count of false positive tests for seven different p value thresholds. The observed counts are very close to those expected under the null hypothesis of no association. For example, for a p value threshold of 0.001, we expect a total of 2,537 false positives for the joint test and we observed 2,550, which is equivalent to an empirical p value of 0.001005. Figure Figure11 summarizes these results using quantile-quantile plots for the marginal, joint, and standard interaction tests. SNPs not in strong linkage disequilibrium with G1 or G2show no systematic inflation in type I error rate (fig. (fig.1b;1b; the genomic control inflation factor λGC is smaller than 1.02 for all tests). Points corresponding to marginal or joint tests of SNPs in strong linkage disequilibrium with G1 or G2 lie above the expected y = x line (fig. (fig.1a),1a), in accordance with the fact that the null hypothesis of no association does not hold for these SNPs. Table Table22 summarizes the results from this realization for the three different analyses involving G2 by study and overall.

Fig. 1
Quantile-quantile plots for a single realization of Y. Trait values have been simulated using the model Y = bG1G1 + bG2 + bEE + ε, where bG1 = 0.12, bG2 = 0.05, bE = 0.05, and bG2E = 0.1. The genetic inflation factors λGC of the marginal ...
Table 1
Observed and expected count of false positives
Table 2
Example of estimates obtained for G2

As estimated by simulation, the power of the marginal test for G1 in this situation was greater than 98.91%, and the power of the joint test for G1 was 97.72%. On the other hand, the power to detect G2 using the joint test was 69.92%, while the observed power of the marginal test was 48.81%. The power of the standard test for G2-E interaction was only 0.86%. As expected, since we did not simulate a non-multiplicative joint effect for G1 and E, none of 100,000 realizations of the standard G1-E interaction test were significant at the 5 × 10−8 level.

Figure Figure22 shows how the power of these tests changes as more studies are included in a meta-analysis. Generally, power increased non-linearly in the number of studies (reflecting the increase in the total sample size), while the power for any individual study was below 1% for all tests of G1 and G2 (from 0.05% for the marginal test of G2 to 0.002% for the standard G2-E interaction test, and from 0.5% for the marginal test of G1 to 0% for the standard G1-E interaction test). In particular, using fewer than all five available studies led to a concerning decrease of the power for the marginal and joint tests of G2. The removal of one study nearly halved power to detect the Y-G2 association using either of these tests. (The standard G2 × E test had very low power even when using all five studies.) Tests of G2 using less than four studies had less than 10% power. The decrease in power to detect G1 when using fewer studies was smaller than for tests of G2: the power of the marginal test for G1 was 85% when using four studies and 45% when using three studies.

Fig. 2
Power of the marginal, joint and interaction tests as more studies are included in the analysis. HPFS: Health Professional Follow-up Study, NHS: Nurse Health Study, CHD: coronary heart disease, BRCA: breast cancer, T2D: type 2 diabetes. Phenotypic data ...

Figure Figure33 shows the power of the three tests for G2 observed when bE = 0.05 and varying the values of bG2 and bG2E. The power of the joint test to detect G2 is often greater than that of the marginal test. The maximum difference between the power of the joint test and marginal test across these simulated scenarios is 55.46%. The minimum difference, on the other hand, is never more than −7.93%, suggesting that when the joint test has less power than the marginal test its power is not that much smaller. Since the joint test and the standard test of interaction take the environmental effect into account, variation of bE had no effect on these two tests. However, the power of the marginal test of G1 and G2 decreased for values of bE over 0.1, and the difference of power of the joint test over the marginal test was increasing in the same proportion. Hence, the joint test of G2 was more powerful than the marginal test of G2 in all situation where bE was over or equal to 0.5 and the joint test of G1 also was more powerful than the marginal test of G1 when bE was over 1.0.

Fig. 3
Power comparison of the marginal tests of G1 and G2, the standard G2-E interaction test and the joint G2-E test. Phenotypic data (1,000 realizations of Y for each point) were simulated using the model Y = bG1G1 + bG2G2 + bEE + bG2EG2.E + ε, were ...

Some of the biggest power gains for the joint test come in situations when the main effect of G2 (bG2) was small and the G2 × E interaction parameter (bG2E) was large and in the same direction. For example, when bG2 = 0.04 and bG2E = 0.13, the power of the joint test was 83%, while the power of the marginal test was 46%. Large gains in power were also observed when the interaction parameter had the opposite sign as the main effect of G2, and consequently the marginal association between G2 and Y was weak. We note that for some of these situations the standard test of interaction has greater power than the marginal test, but it still can have considerably less power than the joint test.

The standard test of gene-environment interaction always showed very low power (under 20% in all simulations), being slightly more powerful than the two other approaches only in some extreme cases, when gene effect was quite low and the interaction effect quite high and opposite to the gene main effect (e.g. bG2 = 0.04 and bG2E ≥ −0.13).

Adjusting the marginal tests for G1 or G2 by including a main effect term for the exposure E did not substantially alter the power of these tests: the difference in power between the adjusted and unadjusted tests was quite unchanged when the exposure effect (bE) was equal or under 0.1. For exposure effect equal or over 1 the power of the unadjusted test for G2 decreases dramatically, while the power of the unadjusted test for G1 decreases in the same range when the exposure effect is higher (i.e. bE ≥ 2).

Discussion

We have proposed a simple and efficient method for meta-analyses of joint tests of genetic main effect and gene-environment interaction terms. We have shown that this method has an appropriate type I error rate and demonstrated its performance across a wide range of alternative hypotheses via simulation studies. This meta-analytic approach is appropriate for large-scale association studies aimed at discovering markers having low effects on a trait and differing according to an environmental exposure.

Similar to the single-study version of the joint test [5], the meta-analytic test has appropriate type I error rates under the null hypothesis of no association between the tested marker and a phenotype in any stratum defined by the environmental exposure, as long as the marker and the phenotype are uncorrelated in the sampled population [9, 16]. In many situations, including models where the genetic effect in each environmental strata was not null and in the same direction, this meta-analysis of joint test has greater power than either the meta-analysis of standard test for departures from a multiplicative odds ratio model, or the meta-analysis of marginal test for association, making it an attractive test to discover markers which may have been missed by meta-analyses focusing on marginal marker-trait associations (e.g. most meta-analysis of GWAS studies to date).

We have shown that sample sizes far exceeding those of a typical individual GWAS will be needed to reliably detect genes with subtle gene-environment interaction patterns. In the first scenario we simulated, no single study had greater than 1% power to detect a marker whose effect differed across environmental strata (G2). On the other hand, the power of a meta-analysis combining five studies (more than a three-fold increase in sample size relative to the largest individual study) was 58.27 times greater than that of the largest single study; the power of the meta-analysis was greater than 80% for many situations of interest. The large increase in relative power for a smaller increase in relative sample size is due to the non-linear nature of the power curve as a function of sample size. This suggests that meta-analyses incorporating gene-environment interactions may cross a sample-size ‘tipping point’ and identify markers that have gone undetected in individual studies. In the models we simulated, the combined genetic effects of G1 and G2 explained between 0.7 and 1.3% of the total variation in Y, which leads to a high power of the joint and marginal test in the total sample of 10,161 subjects. For a smaller genetic effect, a larger sample size will be required to achieve similar power.

Simulations across a range of genetic main effects and gene-environment interaction effects confirm that in a meta-analysis context, the joint test can be more powerful than either the marginal test of the gene or the standard test of interaction when the genetic effect is weak in one exposure stratum but strong in another, or when the main and interaction effects are in opposite directions (so that the marginal association between the tested marker and the phenotype is weak). However, the latter situation may be of less practical interest since (depending on the context) it may not be biologically plausible; there are few empirical examples of such ‘crossover’ interactions in observational studies [17]. In the range of models we tested, the joint test was often more powerful than the marginal test of the gene, with an absolute gain of power over 50% in the best cases. In situations where the marginal test had better power than the joint test, the difference was small (less than 10% absolute power). The standard test for interaction had low power in all situations we considered; it had slightly greater power than the other two tests only in a few rare, extreme situations. Although we simulated a continuous trait, because the meta-analysis procedure relies on summary parameter estimates (which could be slopes from a linear regression or log odds ratios from a logistic regression), qualitatively similar results should hold for binary disease traits.

Aside from increased power, another potential advantage for the joint test is that it is scale-free. The power and interpretation of tests for the usual gene-environment product term (βg e = 0 in [A]) depend on the link function g(x) [18, 19]. In the context of gene discovery, where the primary goal is establishing associations between markers and a phenotype, no particular scale may have intrinsic interest, and the scale that yields the highest power for the standard interaction test will not be known a priori. For saturated codings of the genetic marker and environmental exposure, the power of the joint test does not depend on the link function for the working model fit to the data [A], but only depends on the true relationship between the marker, exposure and phenotype. However, in situations where departures from additivity on a particular scale are the main interest – for example, when testing for ‘public health interactions’, where the impact of reducing (or increasing) the prevalence of an exposure is larger among genetically susceptible individuals [20] – then the joint test is inappropriate, as it is not a test of interaction per se.

The meta-analysis procedure we propose – [C] – will be useful in typical situations where it is logistically difficult or impossible to share and combine individual-level data. It is easily implemented using standard software for the analysis of GWASs. However, because this particular procedure requires analyses to be conducted within strata of a categorical exposure, in some situations it may lose power relative to analyses that assume a particular form for the interaction term (e.g. ordinal G times continuous E). Moreover, although it does not require that the genetic marker and environmental exposure are independently distributed in the sampled population for validity, it cannot leverage this assumption to increase the power of the individual-study joint tests, as others have proposed [2, 4].

We have sketched a more general approach [B] that uses the variance-covariance matrix for the genetic main effect and gene-environment interaction parameters. This approach could be used to combine evidence from analyses that do not assume a categorical form of the environmental and gene-environment interaction effects, or from analyses that leverage the gene-environment independence assumption. This approach requires straightforward modifications of existing software packages for the analysis and meta-analysis of GWASs. As we were completing this report, we became aware of work by another group that describes the general approach in more detail along with requisite software modifications (unpublished data, Manning et al. [21]). It remains an open question whether a test based on a saturated model for a categorized continuous exposure for both the main effect of the exposure and gene-environment interaction has more power than a test based on a model that simply includes the continuous exposure as a main effect and in a gene-environment product interaction term. Model choice will often be a matter of taste, depending on researchers’ prior beliefs regarding the likely form of any true gene-environment interaction pattern. But the meta-analytic approaches we have outlined here should allow researchers the flexibility to choose the analysis they feel is most appropriate.

Acknowledgements

We thank David Hunter, Frank Hu and Eric Rimm for use of the breast cancer, type 2 diabetes, and coronary heart disease data sets. These studies were supported by grants P01CA087969, 5U01HG004399-02 and HL35464, respectively. P.K. and H.A. were supported by R21DK084529, and H.A. received additional support from the Foundation Bettencourt-Schueller. Drs. S.J.L. and D.B.H. were supported by the Intramural Research Program of the National Institute of Environmental Health Sciences.

References

1. Khoury MJ, Wacholder S. Invited commentary: from genome-wide association studies to gene-environment-wide interaction studies – challenges and opportunities. Am J Epidemiol. 2009;169:227–230. discussion 234–235. [PMC free article] [PubMed]
2. Chatterjee N, Kalaylioglu Z, Carroll RJ. Exploiting gene-environment independence in family-based case-control studies: increased power for detecting associations, interactions and joint effects. Genet Epidemiol. 2005;28:138–156. [PubMed]
3. Murcray CE, Lewinger JP, Gauderman WJ. Gene-environment interaction in genome-wide association studies. Am J Epidemiol. 2009;169:219–226. [PMC free article] [PubMed]
4. Umbach DM, Weinberg CR. Designing and analysing case-control studies to exploit independence of genotype and exposure. Stat Med. 1997;16:1731–1743. [PubMed]
5. Kraft P, Yen YC, Stram DO, Morrison J, Gauderman WJ. Exploiting gene-environment interaction to detect genetic associations. Hum Hered. 2007;63:111–119. [PubMed]
6. de Bakker PI, Ferreira MA, Jia X, Neale BM, Raychaudhuri S, Voight BF. Practical aspects of imputation-driven meta-analysis of genome-wide association studies. Hum Mol Genet. 2008;17:R122–R128. [PMC free article] [PubMed]
7. Zeggini E, Ioannidis JP. Meta-analysis in genome-wide association studies. Pharmacogenomics. 2009;10:191–201. [PMC free article] [PubMed]
8. Cornelis MC, Tchetgen Tchetgen EJ, Liang L, Qi L, Chatterjee N, Hu FB, Kraft P: Gene-environment interactions in genome-wide association studies: a comparative study of tests applied to empirical studies of type 2 diabetes. Under review. [PMC free article] [PubMed]
9. Tchetgen Tchetgen EJ, Kraft P: On the robustness of tests of genetic associations incorporating gene-environment interaction when the environmental exposure is mis-specified. Epidemiology, in press. [PubMed]
10. Aulchenko YS, Struchalin MV, van Duijn CM. ProbABEL package for genome-wide association analysis of imputed data. BMC Bioinformatics. 2010;11:134. [PMC free article] [PubMed]
11. Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MA, Bender D, Maller J, Sklar P, de Bakker PI, Daly MJ, Sham PC. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet. 2007;81:559–575. [PubMed]
12. van Houwelingen HC, Arends LR, Stijnen T. Advanced methods in meta-analysis: multivariate approach and meta-regression. Stat Med. 2002;21:589–624. [PubMed]
13. Hunter DJ, Kraft P, Jacobs KB, Cox DG, Yeager M, Hankinson SE, Wacholder S, Wang Z, Welch R, Hutchinson A, Wang J, Yu K, Chatterjee N, Orr N, Willett WC, Colditz GA, Ziegler RG, Berg CD, Buys SS, McCarty CA, Feigelson HS, Calle EE, Thun MJ, Hayes RB, Tucker M, Gerhard DS, Fraumeni JF, Jr, Hoover RN, Thomas G, Chanock SJ. A genome-wide association study identifies alleles in FGFR2 associated with risk of sporadic postmenopausal breast cancer. Nat Genet. 2007;39:870–874. [PMC free article] [PubMed]
14. Qi L, Cornelis MC, Kraft P, Stanya KJ, Linda Kao WH, Pankow JS, Dupuis J, Florez JC, Fox CS, Pare G, Sun Q, Girman CJ, Laurie CC, Mirel DB, Manolio TA, Chasman DI, Boerwinkle E, Ridker PM, Hunter DJ, Meigs JB, Lee CH, van Dam RM, Hu FB. Genetic variants at 2q24 are associated with susceptibility to type 2 diabetes. Hum Mol Genet. 2010;19:2706–2715. [PMC free article] [PubMed]
15. Li Y, Abecasis GR. Mach 1.0: Rapid haplotype reconstruction and missing genotype inference. Am J Hum Genet. 2006;S79:2290.
16. Lindstrom S, Yen YC, Spiegelman D, Kraft P. The impact of gene-environment dependence and misclassification in genetic association studies incorporating gene-environment interactions. Hum Hered. 2009;68:171–181. [PMC free article] [PubMed]
17. Weiss NS. Subgroup-specific associations in the face of overall null results: should we rush in or fear to tread? Cancer Epidemiol Biomarkers Prev. 2008;17:1297–1299. [PubMed]
18. Greenland S. Interactions in epidemiology: relevance, identification, and estimation. Epidemiology. 2009;20:14–17. [PubMed]
19. Kraft P, Hunter DJ. The challenge of assessing gene-environment and gene-gene interactions. In: Khouy MJ, Bedrosian SR, Gwinn M, et al., editors. Human Genome Epidemiology. New York: Oxford University Press; 2010. pp. 165–187.
20. Siemiatycki J, Thomas DC. Biological models and statistical interactions: an example from multistage carcinogenesis. Int J Epidemiol. 1981;10:383–387. [PubMed]
21. Manning AK, Lavalley M, Liu CT, Rice K, An P, Liu Y, Miljkovic I, Rasmussen-Torvik L, Harris TB, Province MA, Borecki IB, Florez JC, Meigs JB, Cupples LA, Dupuis J. Meta-analysis of gene-environment interaction: joint estimation of SNP and SNP × environment regression coefficients. Genet Epidemiol. 2011;35:11–18. [PMC free article] [PubMed]

Articles from Human Heredity are provided here courtesy of Karger Publishers