Home | About | Journals | Submit | Contact Us | Français |

**|**Am J Epidemiol**|**PMC3224252

Formats

Article sections

Authors

Related links

Am J Epidemiol. 2011 December 1; 174(11): 1238–1245.

Published online 2011 November 1. doi: 10.1093/aje/kwr248

PMCID: PMC3224252

Received 2011 February 20; Accepted 2011 June 24.

Copyright American Journal of Epidemiology Published by Oxford University Press on behalf of the Johns Hopkins Bloomberg School of Public Health 2011.

This article has been cited by other articles in PMC.

The authors describe a statistical method of combining self-reports and biomarkers that, with adequate control for confounding, will provide nearly unbiased estimates of diet-disease associations and a valid test of the null hypothesis of no association. The method is based on regression calibration. In cases in which the diet-disease association is mediated by the biomarker, the association needs to be estimated as the total dietary effect in a mediation model. However, the hypothesis of no association is best tested through a marginal model that includes as the exposure the regression calibration-estimated intake but not the biomarker. The authors illustrate the method with data from the Carotenoids and Age-Related Eye Disease Study (2001--2004) and show that inclusion of the biomarker in the regression calibration-estimated intake increases the statistical power. This development sheds light on previous analyses of diet-disease associations reported in the literature.

Dietary measurement error causes serious challenges to the detection of associations between diet and disease in epidemiologic studies. Estimated relative risks are attenuated and statistical power is reduced (1). Moreover, increasing the sample size provides only a partial remedy, as the attenuated estimates of relative risk are often as low as 1.10–1.25 and, even when statistically significant, might be indistinguishable from the effects of unknown confounders (2). Methods to reduce error in dietary measurements are therefore of primary importance.

To that aim, in previous work (3, 4), we proposed using Howe’s method or principal components for combining dietary reports with dietary biomarkers. In computer simulations and in a real example from the Carotenoids and Age-Related Eye Disease Study (CAREDS), we demonstrated substantial increases in statistical power with this method. However, the method is limited in 3 ways. First, the relative risks derived cannot be translated directly into relative risks per added unit of intake. Second, the method does not always increase power but rather sometimes decreases it. Third, the method does not derive from the usual statistical framework for modeling measurement error.

In the present article, we propose an alternative method for combining dietary self-reports and biomarkers derived from regression calibration (5), a well-known statistical approach to solving measurement error problems. Through theory and computer simulations, we show that, given adequate control for confounding, this approach gives nearly unbiased relative risk estimates and provides a significance test with power equal to or greater than that of tests based on dietary self-report alone. We apply the method to study the negative association of lutein plus zeaxanthin intake with nuclear cataracts that we reported in a previous article (4).

We also compare our method with a closely related proposal of Prentice et al. (6) and clarify methodological issues related to including a biomarker in the regression calibration equation when that same biomarker is involved in the disease model.

CAREDS is an ancillary study of the Women’s Health Initiative (WHI) Observational Study (7, 8). The CAREDS population included women enrolled at 3 sites: University of Wisconsin (Madison, Wisconsin), University of Iowa (Iowa City, Iowa), and Kaiser Center for Health Research (Portland, Oregon). Of the 3,143 eligible women, 2,005 agreed to participate. Full details of the study design have been published previously (9). All procedures conformed to the Declaration of Helsinki and were approved by institutional review boards at each institution.

Dietary intake was assessed at the WHI Observational Study baseline (1994–1998) by using the WHI semiquantitative food frequency questionnaire (FFQ), which had been pretested (10). Serum samples were collected after subjects had fasted for 10 or more hours at the WHI baseline examinations (1994–1998) and were analyzed for lutein and zeaxanthin (sum of their *trans* isomers) (9). Serum lutein and zeaxanthin measurements were available for 1,787 women. These women comprised the data set for this study. Participants underwent lens photography and eye examinations during the CAREDS baseline study visits between 2001 and 2004 (9) and completed a questionnaire that included questions about time of cataract surgery in each eye, physician-diagnosed history of cataracts, and personal characteristics. The primary outcome was nuclear cataract, defined as a nuclear sclerosis severity score of 4 or greater in the worst eye or a history of cataract extraction in either eye.

Potential confounders used in the CAREDS analyses relating nuclear cataracts to lutein/zeaxanthin were age, smoking, iris color, body mass index (BMI, measured as weight in kilograms divided by height in meters squared), multivitamin use, physical activity level, hormone replacement therapy, and pulse pressure (9). In the present analysis, we adjust for the 2 strongest confounders, age and smoking. We considered adding BMI to the analysis, but this did not materially change the results.

We assume there are both a single usual dietary intake of interest and a single biomarker measure related to the intake. The true values of these are *X _{I}* and

To deal with the effects of the measurement error, we have to know the measurement error model. We assume it takes the following form:

(1)

This model is quite general, allowing the intake measurement error to depend on true intake (α_{11} ≠ 1) and even on true biomarker level (α_{12} ≠ 0); similarly, the biomarker measurement error may depend on true biomarker level (α_{22} ≠ 1) and even on true intake level (α_{21} ≠ 0). The random error (*e*_{1} and *e*_{2}) terms (with zero means) are assumed to be independent of each other, true intake, true biomarker level, and disease outcome, so that the measurement error is nondifferential. The random variables (*X _{I}*,

In our CAREDS example, the measurement error model was simpler, with α_{12} = α_{21} = 0, α_{20} = 0, and α_{22} = 1. Each assumption seemed appropriate in this context. Setting α_{12} = 0 corresponded to assuming that error in dietary reporting of lutein/zeaxanthin intake was unrelated to the serum level, and setting α_{21} = 0 corresponded to assuming that error in measured serum level was unrelated to the intake level. Setting α_{20} = 0 and α_{22} = 1 corresponded to assuming that the measured serum level was an unbiased measure of true serum level.

Denote the disease outcome variable by *Y*. We assume that disease is related to the exposure of interest *X _{I}* through a generalized linear regression model:

(2)

where *E* denotes expectation, *h* denotes the link function (e.g., logistic for binary variables or the identity for a continuous variable), and *Z* represents one or more confounders, measured exactly.

We specifically include the biomarker variable *X _{M}* in disease model 2 because we assume that the biomarker mediates, at least partially, the effect of dietary intake on disease. See Figure 1 for the causal path diagram. This assumption often seems biologically plausible; for example, if the biomarker is a serum level of the nutrient of interest, then the effect of the nutrient intake on disease will likely be at least partially mediated through the biomarker. We assume that, if the biomarker level is influenced by other factors associated with disease, those factors are included in the covariates

Causal pathway diagram describing the relations among dietary intake, biomarker level, confounders, and disease.

Under this causal model (Figure 1), the quantity of most interest is the coefficient for the total association of dietary intake with disease outcome, given by ${\text{\beta}}_{1}^{*}$ in the following model:

(3)

It can be shown that both model 3 and the following equation for ${\text{\beta}}_{1}^{*}$ hold exactly when the disease model is a linear regression and approximately when the disease model is nonlinear, such as for logistic and Cox regression:

(4)

where γ is the coefficient of *X _{I}* in the linear regression of

In the absence of measurement error, we could estimate ${\text{\beta}}_{1}^{*}$ simply by omitting *X _{M}* from model 2 and estimating the coefficient for

Regression calibration (RC) (5) is now widely used to adjust estimates of regression coefficients for measurement error in explanatory variables. Suppose we are interested in estimating ${\text{\beta}}_{1}^{*}$ in model 3. The central idea is to use as the explanatory variable in the regression not *W _{I}* but rather the expectation of the true value

Kipnis et al. (11) demonstrated that RC can be extended by including other variables in the “prediction” of *X _{I}*. Such extra variables increase the precision with which

In the present article, we explore the use of the biomarker *W _{M}* as an extra predictor of

- 1. Unadjusted for measurement error, using the estimated coefficient for
*W*in the model $h\left[E\right(Y\left)\right]={\text{\beta}}_{\text{0}U}{\text{+\beta}}_{\text{1}U}{W}_{I}{\text{+}\text{\beta}}_{ZU}\text{Z}$;_{I} - 2. Usual RC, using the estimated coefficient for
*E*(*X*|_{I}*W*,_{I}*Z*) in the model $h\left[E\right(Y\left)\right]={\text{\beta}}_{\text{0}R}+{\text{\beta}}_{\text{1}R}E\left({X}_{I}\right|{W}_{I},Z)+{\text{\beta}}_{ZR}Z$; - 3. Enhanced RC, with the biomarker used for prediction, using the estimated coefficient for
*E*(*X*|_{I}*W*,_{I}*W*,_{M}*Z*) in the model $h\left[E\right(Y\left)\right]={\text{\beta}}_{0E}+{\text{\beta}}_{1E}E\left({X}_{I}\right|{W}_{I},{W}_{M},Z)+{\text{\beta}}_{ZE}Z$ (the method used by Prentice et al. (6) in an example where the dietary intake was total energy and the “biomarker” was BMI); and - 4. A newly proposed method, in which enhanced RC is used to estimate β
_{1}and β_{2}of model 2 through the model $h\left[E\right(Y\left)\right]={\text{\beta}}_{0N}+{\text{\beta}}_{1N}E\left({X}_{I}\right|{W}_{I},{W}_{M},Z)+{\text{\beta}}_{2N}E({X}_{M}|{W}_{I},{W}_{M},Z)+{\text{\beta}}_{ZN}Z$. ${\text{\beta}}_{1}^{*}$ is then estimated by ${\stackrel{\u02c6}{\text{\beta}}}_{1N}+{\gamma \stackrel{\u02c6}{\text{\beta}}}_{2N}$, where the hat denotes the estimated value. Note that in the regression model, we use $E\left({X}_{M}\right|{W}_{I},{W}_{M},Z)$ in place of*W*to account for any measurement error in the biomarker._{M}

In the Web Appendix, Part A (available at http://aje.oxfordjournals.org/), we present the approximate expected values of these 4 estimators. This enables us to predict the following regarding the bias in estimating ${\text{\beta}}_{1}^{*}$.

- 1. The unadjusted estimate, ${\stackrel{\u02c6}{\text{\beta}}}_{1U}$, is nearly unbiased only if there is no measurement error in dietary intake.
- 2. The usual RC estimate,${\stackrel{\u02c6}{\text{\beta}}}_{1R}$, is nearly unbiased if either β
_{2}is zero (i.e., if there is no mediation by the biomarker) or α_{12}is zero (i.e., if the biomarker provides no information about reported dietary intake over and above that provided by the true intake). The latter scenario is sometimes plausible but not in the example of energy intake and BMI (6). - 3. The enhanced RC estimate, ${\stackrel{\u02c6}{\text{\beta}}}_{1E}$, is nearly unbiased if β
_{2}is zero (i.e., if there is no mediation by the biomarker) but not in any other plausible scenarios. - 4. The newly proposed estimate is nearly unbiased under the more general conditions of the measurement error model 1 and the disease model 2.

In the Results, we present the results of computer simulations that verify these predictions.

To apply these methods to the CAREDS example, we must develop the prediction models, that is, the quantities *E*(*X _{I}*|

Each of the 4 estimation methods described above can also be used to test the hypothesis of no diet-disease association. In each case, the test is obtained by comparing the ratio of the estimate to its standard error with the standard normal distribution. The estimate’s standard error may be computed by using bootstrap methods or, when the measurement error model parameters are assumed known, from the usual model-based estimates.

Two questions arise: 1) Which of the 4 tests are valid? and 2) Which among the valid tests is the most statistically powerful? Answering these questions requires careful definition of the null hypothesis. Specifically, we mean that not only is ${\text{\beta}}_{1}^{*}$ zero, but also that its 2 components β_{1} and β_{2} in model 2 are both zero. It is theoretically possible that ${\text{\beta}}_{1}^{*}$ equals zero even when these 2 components are not zero, namely, when β_{1} = −γ_{1}β_{2}. However, it would be highly unusual for the direct effect of diet on disease (the part not mediated through the biomarker) to be in the opposite direction of the indirect effect (the part mediated) with the 2 effects in precisely the appropriate ratio to cancel each other. Thus, we concentrate on the more plausible hypothesis that β_{1} = β_{2} = 0.

The tests derived from all 4 estimators are nearly valid tests (in the same sense that the RC estimator is nearly unbiased) of the above-mentioned null hypothesis. This happens because, under this hypothesis, the expected value of all 4 estimators is nearly zero. Therefore, although 3 of the estimators can be biased, they are all nearly unbiased when the dietary association is zero.

Furthermore, theory predicts that the power of enhanced RC will be larger than that of usual RC, which is expected to have power similar to that of the unadjusted method (see Web material to Kipnis et al. (11)). The ratio of the required sample size using enhanced RC to that using usual RC is given as

We show in our example and in simulations that, among the 4 tests, enhanced RC has the highest statistical power.

The simulation was designed to mimic data from CAREDS. Parameters for the measurement error and disease models, derived from the literature (3), are shown in Table 1. The different combinations of (β_{1}, β_{2}) values were designed to represent 4 scenarios for the effect of intake on disease: 1) not mediated by the biomarker, 2) fully mediated by the biomarker, 3) partially mediated by the biomarker, and 4) zero. The RC models were assumed known and were calculated from the measurement error parameters.

Parameters for Simulation Study of Both Dietary and Serum Lutein/Zeaxanthin Based on the Models 1 and 2^{a}, Carotenoids in Age-Related Eye Disease Study, 2001–2004

For each simulation, a study with 500 individuals and measurements (*Y*, *W _{I}*,

Equations for predicting true intake (and true biomarker level) are required to implement usual RC, enhanced RC, and the newly proposed method. For usual RC, the quantity *E*(*X _{I}*|

For enhanced RC, the biomarker level was included in the prediction, and we obtained the equation:

Note that the coefficient for *W _{M}*, the log measured serum level, is relatively large, showing its major role in the prediction of true intake.

For the newly proposed method, we also needed a prediction equation for true serum lutein/zeaxanthin, obtained as follows:

Note that the coefficient of *W _{I}*, the log FFQ-reported intake, is small, showing its minor role in predicting serum level.

Estimates of the odds ratio of nuclear cataract associated with a doubling of lutein/zeaxanthin intake derived from the 4 methods are shown in Table 2. The unadjusted estimate of 0.89 was closer to the null value of 1.0 than were the estimates of the other methods, and it just achieved statistical significance (*P* = 0.038). Usual RC yielded a stronger (negative) association (odds ratio = 0.72) but the same level of significance (*P* = 0.038). Enhanced RC estimated an even stronger association (odds ratio = 0.70) that was highly significant (*P* = 0.002). The newly proposed method estimated a weaker association than did the enhanced RC method (odds ratio = 0.74), with a level of significance similar to that of usual RC (*P* = 0.046).

Logistic Regression Analyses Relating Nuclear Cataracts to Dietary Lutein/Zeaxanthin Intake in the Carotenoids in Age-Related Eye Disease Study, 2001–2004

Which of the 4 methods should one choose? For estimation, the newly proposed method is the only one that is nearly unbiased under general models 1 and 2, but in our particular case in which α_{21} = 0, the usual RC method is also nearly unbiased. Therefore, one may choose between them, and, because its standard error is smaller, the newly proposed method would be preferable. With regard to significance testing, all the methods are nearly unbiased, and the natural choice is the enhanced RC method because it is the most powerful.

We calculated the predicted ratio of sample size required using enhanced RC to that required using usual RC,

$$\frac{0.0683}{0.1250}\text{\hspace{0.5em}}=\text{\hspace{0.5em}}\mathrm{0.55.}$$

This value was calculated without reference to the outcome variable. However, CAREDS data allowed us to calculate the sample size savings in relation to testing the association with nuclear cataracts. The sample size ratio for the enhanced RC method versus the usual RC method was estimated as 0.43 (Table 2), which was not dissimilar to the predicted value of 0.55.

Results regarding bias in the estimated risk parameter resembled those predicted by theory (Table 3). The unadjusted estimate was attenuated, except under zero dietary effect. The usual RC method gave nearly unbiased estimates in all scenarios because we set the measurement error parameter α_{21} at zero. The enhanced RC method gave nearly unbiased estimates under no mediation of the dietary effect through the biomarker and also under zero dietary effect, but when mediation occurred, the estimate was biased and inflated away from the null. The newly proposed method gave nearly unbiased estimates in all scenarios.

Simulated Means and Standard Deviations of Estimates of the Marginal Effect ${\text{\beta}}_{1}^{*}$ of Dietary and Serum Lutein/Zeaxanthin on Eye Disease for 4 Different Methods of Estimation^{a}

The precisions of the estimates differed markedly. The standard deviations of the usual RC and newly proposed estimates were similar and larger than those of enhanced RC estimates. Each of the 4 significance tests yielded approximately 5% of significant results under the null hypothesis (Table 4). The tests based on the unadjusted method and RC method were identical and had lower statistical power than did the test based on the enhanced RC method (Table 4). The test based on the newly proposed method had slightly higher power than did the RC method.

We have described a method of combining self-reports and biomarkers, based on RC that, under reasonable assumptions, provides 1) a nearly valid significance test of the diet-disease association with increased power and 2) nearly unbiased estimates of relative risks or odds ratios for the association.

The method relies on prior knowledge or estimation of the statistical model describing the measurement error in self-reports of the dietary intake and in biomarker levels related to dietary intake. For the few existing recovery biomarkers (the doubly labeled water technique (12) for measuring energy intake and 24-hour urinary nitrogen (13) and potassium (14) for measuring protein and potassium intake), this method could be easily applied given the biomarkers’ known quantitative relation to intake (13), although the cost or effort to perform these tests in very large numbers may be prohibitive. For the newly developed predictive biomarker for sugars (15), estimation of the measurement error parameters has been recently described (16). In our example, we extracted such prior knowledge from the literature on carotenoid feeding studies, validation studies of dietary reporting of carotenoids, and cohort studies that investigated carotenoid-disease associations (3). For other concentration biomarkers, such as other serum carotenoids or vitamin C or adipose tissue fatty acids, a similar exercise using previous feeding studies could be attempted; otherwise, new feeding studies will be needed to develop the RC equations lying at the heart of the method. One such feeding study is now being conducted (17). The method is not applicable to foods or food patterns that have no known specific biomarkers.

When the parameters of the measurement error model are estimated from a feeding study, the limited size of the study often limits the precision of the estimates. This uncertainty transfers to the risk estimates obtained from the RC adjustment. One should then use a method to adjust the standard error of the risk estimate to include this extra uncertainty, such as the bootstrap or stacking equations (see Appendix B.3 of Carroll et al. (5)).

The method we propose is linked to a previous proposal to use principal components or Howe’s method to combine different error-prone measures of dietary intake. Our results, which showed that enhanced RC can yield reductions in sample size of approximately 50%, are similar to the savings found using Howe’s method applied to the same data set (4). However, this similarity is serendipitous. It happened that in this data set, Howe’s method yielded a dietary index close to the RC prediction of dietary intake based on self-reported intake and serum level, and consequently provided a nearly optimal analysis. This will not always happen, and neither principal components nor Howe’s method is guaranteed to increase statistical power. In many cases, both methods can actually decrease it. In contrast, enhanced RC is expected always to increase power provided the biomarker improves prediction of dietary intake.

Although the enhanced RC method yields a valid and more powerful significance test of the diet-disease association, it does not in general provide an unbiased estimate of the risk parameter. Whenever there is mediation of the dietary effect through the biomarker, which is often expected, the estimate becomes inflated. We have provided a new method that yields an unbiased estimate, albeit with lower precision than that provided by the enhanced RC method. When the biomarker is uncorrelated with dietary reporting error, the usual RC estimate (based on self-report alone) will also be unbiased. We recommend reporting one of these unbiased estimates together with the result of the significance test based on enhanced RC.

Prentice et al. (6) presented an analysis of the association between energy intake and several invasive cancers based on a calibration equation for energy intake that includes BMI. The authors discussed whether BMI was a confounder or a mediator of the diet-disease association. Assuming it was a mediator, they estimated hazard ratios by using a method that corresponded to enhanced RC. According to the results of the present study, the significance tests of the diet-disease association were valid, but the estimated hazard ratios were biased. If BMI were a confounder, then the risk quantity of interest would no longer be ${\text{\beta}}_{1}^{*}$ but β_{1} in model 2, the confounder-adjusted dietary risk parameter.

Possible confounding of the diet-disease association through the biomarker is the most serious obstacle to using our approach. If there are risk factors for disease that also affect the biomarker, then introducing the biomarker into the prediction of dietary intake while not controlling for these risk factors in the disease model will lead to biased estimation with unknown direction of the bias. Biomarkers known to bear a strong relation to dietary intake, such as recovery (18) and predictive (15) biomarkers, will be largely immune from such concern, but concentration biomarkers are affected by complex metabolic pathways in their regulation and will always be subject to concerns about confounding. If a strong risk factor for the disease is known to affect the biomarker, that risk factor must be included in the disease risk model so as to avoid ascribing its effect to nutritional components. In the CAREDS example, smoking was included in the model because it is associated with nuclear cataract and might also lead to depletion of lutein and zeaxanthin in blood, as it is a source of free radicals and oxidative stress (19).

Another challenge is the need to specify a measurement error model, such as model 1. Naturally, such a model could be incomplete and might omit influential explanatory variables. For a discussion of this challenge and related cost issues, see the Web Appendix, Part C.

In summary, a major obstacle facing the field of nutritional epidemiology is the loss of statistical power for detecting diet-disease associations that result from measurement error. With careful use, the methods described in the present article could yield more powerful tests of such associations together with reliable risk estimates. Use of these methods in other branches of epidemiology is discussed briefly in the Web Appendix, Part C.

Author affiliations: Biostatistics Unit, Gertner Institute for Epidemiology and Health Policy Research, Tel Hashomer, Israel (Laurence S. Freedman); Division of Cancer Prevention, National Cancer Institute, Bethesda, Maryland (Douglas Midthune, Victor Kipnis); Department of Statistics, Texas A&M University, College Station, Texas (Raymond J. Carroll); Division of Cancer Control and Population Sciences, National Cancer Institute, Bethesda, Maryland (Nataša Tasevska, Nancy Potischman); Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, Maryland (Arthur Schatzkin); Department of Ophthalmology and Visual Sciences, School of Medicine and Public Health, University of Wisconsin-Madison, Madison, Wisconsin (Julie Mares); and Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, Washington (Lesley Tinker).

This work was supported by the National Institutes of Health (under contract HHSN261200633000 to L. S. F.), the National Cancer Institute (grant R27-CA057030 to R. J. C.), the National Eye Institute (grants EY013018 and EY016886), the National Heart, Lung, and Blood Institute (for support of the Women’s Health Initiative), and by Research to Prevent Blindness.

*Women’s Health Initiative Investigators—Program Office (National Heart, Lung, and Blood Institute, Bethesda, Maryland)*: Elizabeth Nabel, Jacques Rossouw, Shari Ludlam, Joan McGowan, Leslie Ford, and Nancy Geller. *Clinical Coordinating Centers—Fred Hutchinson Cancer Research Center, Seattle, Washington*: Ross Prentice, Garnet Anderson, Andrea LaCroix, Charles L. Kooperberg, Ruth E. Patterson, and Anne McTiernan; *Medical Research Labs, Highland Heights, Kentucky*: Evan Stein; and *University of California at San Francisco, San Francisco, California*: Steven Cummings. *Clinical Centers—Albert Einstein College of Medicine, Bronx, New York*: Sylvia Wassertheil-Smoller; *Baylor College of Medicine, Houston, Texas*: Aleksandar Rajkovic; *Brigham and Women’s Hospital, Harvard Medical School, Boston, Massachusetts*: JoAnn E. Manson; *Brown University, Providence, Rhode Island*: Charles B. Eaton; *Emory University, Atlanta, Georgia*: Lawrence Phillips; *Fred Hutchinson Cancer Research Center, Seattle, Washington*: Shirley Beresford; *George Washington University Medical Center, Washington, DC*: Lisa Martin; *Los Angeles Biomedical Research Institute at Harbor-UCLA Medical Center, Torrance, California*: Rowan Chlebowski; *Kaiser Permanente Center for Health Research, Portland, Oregon*: Yvonne Michael; *Kaiser Permanente Division of Research, Oakland, California*: Bette Caan; *Medical College of Wisconsin, Milwaukee, Wisconsin*: Jane Morley Kotchen; *MedStar Research Institute/Howard University, Washington, DC*: Barbara V. Howard; *Northwestern University, Evanston, Illinois*: Linda Van Horn; *Rush Medical Center, Chicago, Illinois*: Henry Black; *Stanford Prevention Research Center, Stanford, California*: Marcia L. Stefanick; *State University of New York at Stony Brook, Stony Brook, New York*: Dorothy Lane; *Ohio State University, Columbus, Ohio*: Rebecca Jackson; *University of Alabama at Birmingham, Birmingham, Alabama*: Cora E. Lewis; *University of Arizona, Phoenix, Arizona*: Cynthia A Thomson; *University at Buffalo, Buffalo, New York*: Jean Wactawski-Wende; *University of California at Davis, Sacramento, California*: John Robbins; *University of California at Irvine, California*: F. Allan Hubbell; *University of California at Los Angeles, Los Angeles, California*: Lauren Nathan; *University of California at San Diego, La Jolla/Chula Vista, California*: Robert D. Langer; *University of Cincinnati, Cincinnati, Ohio*: Margery Gass; *University of Florida, Gainesville/Jacksonville, Florida*: Marian Limacher; *University of Hawaii, Honolulu, Hawaii*: J. David Curb; *University of Iowa, Iowa City, Iowa*: Robert Wallace; *University of Massachusetts/Fallon Clinic, Worcester, Massachusetts*: Judith Ockene; *University of Medicine and Dentistry of New Jersey, Newark, New Jersey*: Norman Lasser; *University of Miami, Miami, Florida*: Mary Jo O’Sullivan; *University of Minnesota, Minneapolis, Minnesota*: Karen Margolis; *University of Nevada, Reno, Nevada*: Robert Brunner; *University of North Carolina, Chapel Hill, North Carolina*: Gerardo Heiss; *University of Pittsburgh, Pittsburgh, Pennsylvania*: Lewis Kuller; *University of Tennessee Health Science Center, Memphis, Tennessee*: Karen C. Johnson; *University of Texas Health Science Center, San Antonio, Texas*: Robert Brzyski; *University of Wisconsin, Madison, Wisconsin*: Gloria E. Sarto; *Wake Forest University School of Medicine, Winston-Salem, North Carolina*: Mara Vitolins; and *Wayne State University School of Medicine/Hutzel Hospital, Detroit, Michigan*: Michael Simon. *Women’s Health Initiative Memory Study (Wake Forest University School of Medicine, Winston-Salem, North Carolina)*: Sally Shumaker.

Conflict of interest: none declared.

- BMI
- body mass index
- CAREDS
- Carotenoids and Age-Related Eye Disease Study
- FFQ
- food frequency questionnaire
- RC
- regression calibration
- WHI
- Women’s Health Initiative

1. Freudenheim JL, Marshall JR. The problem of profound mismeasurement and the power of epidemiological studies of diet and cancer. Nutr Cancer. 1988;11(4):243–250. [PubMed]

2. Thiébaut AC, Kipnis V, Chang SC, et al. Dietary fat and postmenopausal invasive breast cancer in the National Institutes of Health-AARP Diet and Health Study cohort. J Natl Cancer Inst. 2007;99(6):451–462. [PubMed]

3. Freedman LS, Kipnis V, Schatzkin A, et al. Can we use biomarkers in combination with self-reports to strengthen the analysis of nutritional epidemiologic studies? Epidemiol Perspect Innov. 2010;7(1):2. (doi:10.1186/1742-5573-7-2) [PMC free article] [PubMed]

4. Freedman LS, Tasevska N, Kipnis V, et al. Gains in statistical power from using a dietary biomarker in combination with self-reported intake to strengthen the analysis of a diet-disease association: an example from CAREDS. Am J Epidemiol. 2010;172(7):836–842. [PMC free article] [PubMed]

5. Carroll RJ, Ruppert D, Stefanski LA, et al. Measurement Error in Nonlinear Models: A Modern Perspective. 2nd ed. Boca Raton, FL: Chapman and Hall/CRC Publishers; 2006.

6. Prentice RL, Shaw PA, Bingham SA, et al. Biomarker-calibrated energy and protein consumption and increased cancer risk among postmenopausal women. Am J Epidemiol. 2009;169(8):977–989. [PMC free article] [PubMed]

7. Langer RD, White E, Lewis CE, et al. The Women’s Health Initiative Observational Study: baseline characteristics of participants and reliability of baseline measures. Ann Epidemiol. 2003;13(suppl 9):S107–S121. [PubMed]

8. Design of the Women’s Health Initiative clinical trial and observational study. The Women’s Health Initiative Study Group. Control Clin Trials. 1998;19(1):61–109. [PubMed]

9. Moeller SM, Voland R, Tinker L, et al. Associations between age-related nuclear cataract and lutein and zeaxanthin in the diet and serum in the Carotenoids in the Age-Related Eye Disease Study (CAREDS), an ancillary study of the Women’s Health Initiative. CAREDS Study Group; Women’s Health Initiative. Arch Ophthalmol. 2008;126(3):354–364. [PMC free article] [PubMed]

10. Patterson RE, Kristal AR, Tinker LF, et al. Measurement characteristics of the Women’s Health Initiative food frequency questionnaire. Ann Epidemiol. 1999;9(3):178–187. [PubMed]

11. Kipnis V, Midthune D, Buckman DW, et al. Modeling data with excess zeros and measurement error: application to evaluating relationships between episodically consumed foods and health outcomes. Biometrics. 2009;65(4):1003–1010. [PMC free article] [PubMed]

12. Schoeller DA. Measurement error energy expenditure in free-living humans by using doubly labeled water. J Nutr. 1988;118(11):1278–1289. [PubMed]

13. Bingham SA, Cummings JH. Urine nitrogen as an independent validatory measure of dietary intake: a study of nitrogen balance in individuals consuming their normal diet. Am J Clin Nutr. 1985;42(6):1276–1289. [PubMed]

14. Tasevska N, Runswick SA, Bingham SA. Urinary potassium is as reliable as urinary nitrogen for use as a recovery biomarker in dietary studies of free living individuals. J Nutr. 2006;136(5):1334–1340. [PubMed]

15. Tasevska N, Runswick SA, McTaggart A, et al. Urinary sucrose and fructose as biomarkers for sugar consumption. Cancer Epidemiol Biomarkers Prev. 2005;14(5):1287–1294. [PubMed]

16. Tasevska N, Midthune D, Potischman N, et al. Use of the predictive sugars biomarker to evaluate self-reported total sugars intake in the Observing Protein and Energy Nutrition (OPEN) Study. Cancer Epidemiol Biomarkers Prev. 2011;20(3):490–500. [PubMed]

17. Prentice RL, Huang Y, Tinker LF, et al. Statistical aspects of the use of biomarkers in nutritional epidemiology research. Stat Biosci. 2009;1(1):112–123. [PMC free article] [PubMed]

18. Kaaks R, Ferrari P, Ciampi A, et al. Uses and limitations of statistical accounting for random error correlations, in the validation of dietary questionnaire assessments. Public Health Nutr. 2002;5(6A):969–976. [PubMed]

19. Handelman GJ, Packer L, Cross CE. Destruction of tocopherols, carotenoids, and retinol in human plasma by cigarette smoke. Am J Clin Nutr. 1996;63(4):559–565. [PubMed]

Articles from American Journal of Epidemiology are provided here courtesy of **Oxford University Press**

PubMed Central Canada is a service of the Canadian Institutes of Health Research (CIHR) working in partnership with the National Research Council's national science library in cooperation with the National Center for Biotechnology Information at the U.S. National Library of Medicine(NCBI/NLM). It includes content provided to the PubMed Central International archive by participating publishers. |