Search tips
Search criteria 


Logo of nihpaAbout Author manuscriptsSubmit a manuscriptHHS Public Access; Author Manuscript; Accepted for publication in peer reviewed journal;
Epidemiology. Author manuscript; available in PMC 2013 January 1.
Published in final edited form as:
PMCID: PMC3237815

Accounting for bias due to selective attrition: The example of smoking and cognitive decline



Selective attrition may introduce bias into analyses of the determinants of cognitive decline. This is a concern especially for risk factors, such as smoking, that strongly influence mortality and drop-out. Using inverse-probability-of-attrition weights (IPAWs), we examined the influence of selective attrition on the estimated association of current smoking (versus never smoking) with cognitive decline.


Chicago Health and Aging Project participants (n=3,713), aged 65–109, who were current smokers or never-smokers underwent cognitive assessments up to 5 times at 3-year intervals. We used pooled logistic regression to fit predictive models of attrition due to death or study drop-out across the follow-up waves. With these models, we computed inverse-probability-of-attrition weights for each observation. We fit unweighted and weighted, multivariable-adjusted generalized-estimating-equation models, contrasting rates of change in cognitive scores in current versus never-smokers. Estimates are expressed as rates of change in z-score per decade.


Over the 12 years of follow-up, smokers had higher mortality than never-smokers (hazard ratio= 1.93 [95% confidence interval= 1.67 to 2.23]). Higher previous cognitive score was associated with increased likelihood of survival and continued participation. In unweighted analyses, current smokers’ cognitive scores declined 0.11 standard units per decade more rapidly than never-smokers’ (95% CI= −0.20 to −0.02). Weighting to account for attrition yielded estimates that were 56%–86% larger, with smokers’ estimated 10-year rate of decline up to 0.20 units faster than never smokers’ (95% CI= −0.36 to −0.04).


Estimates of smoking’s effects on cognitive decline may be underestimated due to differential attrition. Analyses that weight for the inverse probability of attrition help - for this attrition.

Decline in cognitive function is a common occurrence with aging and the hallmark manifestation of dementia.12 Few modifiable risk factors for cognitive decline and dementia have been identified.3 This may be due in part to the particular methodological challenges that affect longitudinal studies of cognitive aging and other late-life health outcomes. One important challenge is selection bias from selective mortality or other forms of attrition that occur after study enrollment. These selection processes will bias estimates of a risk factor’s association with an outcome if selection is influenced by both the risk factor and the outcome or, alternatively, by determinants of the risk factor and the outcome (Figure A).4 Although selection bias is a concern in all longitudinal studies of aging-related outcomes, it is especially relevant in studies of cognitive decline, because impaired cognition strongly predicts morbidity,58 mortality,910 and attrition after study enrollment.1113 Studies of risk factors that are themselves associated with substantial morbidity and mortality, such as smoking, are especially vulnerable to bias due to selective attrition (Figure B).4, 14

Figure 1Figure 1
a. A. Directed acyclic graph (DAG) depicting general causal structure underlying attrition-related selection bias. In this DAG, the risk factor of interest directly influences post-enrollment survival or continuation in the study. The outcome is associated ...

Smoking is thought to increase risk of cognitive decline and dementia in older age, mainly through its well-established vascular effects, although some data suggest potential benefits of nicotine.1517 Findings from previous longitudinal studies of smoking and cognitive decline have been mixed.15, 1826 As is typical in longitudinal studies of older adults, many of these studies over the course of follow-up lost a substantial proportion of their baseline populations—often more than 20%—to attrition. Among five studies reporting on attrition in relation to smoking and cognition,18, 20, 2426 three reported that attrition was associated with smoking, cognition or both.18, 24, 26 Given the strong impact of smoking on morbidity and mortality—smokers have 2–3 times the mortality rate of never-smokers27—and its overall association with study attrition,13 previous studies may have underestimated the adverse relation of smoking to cognitive decline. To our knowledge, no prior studies have quantified and corrected for the potential influence of selective attrition on the estimated relation of smoking to cognitive decline.

Until recently, epidemiologists have had few accessible tools for addressing differential attrition.28 A common approach in regression models of the association between an exposure and outcome of interest is to include terms for factors that predict attrition. This approach is unsatisfactory, because some predictors may be influenced by the exposure, and adjustment for such post-exposure variables is generally known to bias effect estimates.4 For example, although prior cognitive function is a strong predictor of both attrition and future cognitive decline, adjusting for baseline or intermediate measurements of cognitive function could produce estimates that are substantially inflated if the exposure is associated with baseline cognitive score (eFigure 1, Robins, Hernán, Cole and others3031 have developed an inverse-probability-weighting approach to “correct” analyses for differential attrition, based on observed covariate history. This approach allows for use of information on potential intermediates and previous cognitive function while avoiding the pitfalls of conventional adjustment for these variables.

Inverse-probability-weighting methods should be particularly relevant to longitudinal studies of aging-related health outcomes, given the high rates of attrition that are common in this research. We used inverse-probability-of-attrition weighting (IPAW) to examine the influence of attrition-related selection bias on the estimated association between smoking and cognitive decline. We first developed models of the probability of continuing in the study—i.e., remaining alive and not lost to follow-up—and from these models, we computed predicted probabilities of continuation for each observation. For greater specificity, we distinguished between attrition due to mortality and attrition due to other causes (study drop-out), which is often related to frail health.1213 We then used these probabilities to compute analytical weights that are in inverse proportion to the probability of remaining alive and in the study. Observations with characteristics associated with a lower probability of continuation, e.g., physical frailty, were assigned larger weights, thereby “compensating” for the underrepresentation of these types of observations in the observed follow-up data. We then applied the weights to our analyses of the association between smoking and cognitive decline.

We hypothesized that, compared with never-smokers, persons who were current smokers at baseline would experience faster cognitive decline during 12 years of follow-up. Further, we anticipated that the association between current smoking and cognitive change would be larger after accounting for selective mortality and non-death-related drop-out. Because the evaluation of former smoking and cognitive decline entails additional complexities (e.g., accounting for determinants of cessation), we assessed only the contrast between current and never smoking in this analysis.


Study population

We conducted our analyses using data from participants in the ongoing Chicago Health and Aging Project.32 The first wave of recruitment began in 1993 with a door-to-door census of residents living in three geographically defined neighborhoods on the south side of Chicago. Of 8501 adults aged 65 years and older who were identified in this recruitment wave, 1655 declined to participate, 439 died and 249 moved before an in-person assessment could be conducted. This left 6158 participants in the cohort, 18 of whom did not report their smoking status. We focused our analyses on the 3768 participants who, upon enrollment, reported that they were current smokers or that they had never smoked. Of these participants, 55 (1.5%) were missing data on key covariates used for the computation of weights or for the analytical regression models, leaving 3713 persons (891 smokers and 2822 never smokers) for our primary analyses. Participants undergo in-home assessments every three years, and those in our analyses contributed 10,096 observations over five assessment cycles. Information on time-varying covariates was missing for 11 observations, leaving 10,085 observations for our analyses. Some participants returned to the study after skipping a cycle (279, 9.5% of those censored); we classified these participants as permanently censored upon their first missed cycle, effectively treating drop-out as permanent. This permitted fewer assumptions in estimating inverse-probability-of-attrition (IPA) weights (see: Attrition weight estimation, below).

Smoking assessment

At their baseline interviews, participants were asked: “Do you smoke cigarettes now?,” and “Did you ever smoke cigarettes regularly?”. We defined never-smokers as participants who responded “no” to both questions. We defined current smokers as participants who responded affirmatively to the first question. We used baseline smoking status as the exposure of interest, although over the course of follow-up, a small fraction of never smokers (0.7%) reported smoking, and about 35% of current smokers reported quitting.

Cognitive assessment

Participants underwent cognitive assessments at each in-home visit. This assessment consisted of four tests of cognitive function: immediate and delayed recall of 12 ideas in the East Boston story, measures of episodic memory33; the oral version of the Symbol Digit Modalities Test, which measures perceptual speed by giving the participants 90 seconds to identify as many digit-symbol matches as possible 34; and the Mini-Mental State Examination a measure of global cognition.35 Because the four individual measures are highly correlated, we computed a composite measure of global cognition by first converting the raw scores from each test to z scores, using the baseline mean and standard deviation (SD) in the population, and then averaging the z scores.36

Covariate assessment

We used information on both baseline and time-updated covariates. Baseline covariates included self-reported race and ethnicity (assessed with the US census question on race and ethnicity and categorized as African-American or non-Hispanic white) and number of years of completed formal schooling. Time-updated covariates (assessed at baseline and re-assessed at each interview) were all self-reported via structured interview and included usual alcohol intake; self-rated health; diagnosis of heart attack, stroke, diabetes, and high blood pressure; physical disability via the Nagi score37 (ability to perform basic upper and lower extremity functions, where a lower score indicates greater disability); and a composite measure of social networks, where a higher score indicates a more highly populated network.38

Analytic approach

Attrition weight estimation

To account for potentially informative attrition in our analyses, we estimated weights to apply to each observation in models of smoking and cognitive decline. For each wave of visits contributing to our analysis, the weights were based on the inverse of the wave-specific probability of being observed at that wave, and thus of being alive and uncensored at that wave. The intuition behind these weights is that respondents with characteristics similar to the observations missing due to attrition are up-weighted in the analyses of smoking and cognitive decline, so as to represent their original contribution as well as their missing contributions. Because determinants of death may differ from determinants of study drop-out for other reasons, we separately modeled attrition due to death and attrition by other causes.

For each of the two sources of attrition, we first developed separate models of not being censored over the course of follow-up.31 For each planned assessment, let Cikr indicate whether person i is no longer in the study by wave k for reason r, where r is either death (r=1) or loss to follow-up (r=2). Each weight represents the reciprocal of individual i’s probability of remaining both alive and in the study at wave k. We classified a death as occurring at wave k if the participant died between waves k−1 and k, so that for such an individual, Cik1Ci(k−1)1=1.

For each wave of follow-up, we modeled and estimated via pooled logistic regression30 the probability of being alive in that wave, conditional on remaining alive and uncensored in the previous wave. We separately modeled the probability that such a living and previously uncensored participant remained uncensored. To specify the models, we defined a set of variables L, some of which varied over time, we thought likely to influence death or censoring and also affect cognitive function: age, race (African American versus white), sex (male versus female), education (0–8 years, 9–12 years [referent], 13–16 years, 17–30 years), alcohol consumption at the previous visit (none [referent], up to 1 drink/day, >1 drink/day), social network score at the previous visit, cognitive activity at the previous visit, disability score at the previous visit, self-rated health at the previous visit (per unit worsening in rating), chronic cardiovascular conditions, diabetes, global cognitive score at the previous visit, and smoking status (current versus never). We estimated models that included as predictors: the baseline time-constant covariates in L, smoking status (Xi), and the most recent prior values of the time-varying covariates (Li(k−1)), including past measurements of cognitive function. We explored weighting models including additional variables representing the history of the time-varying covariates (e.g., Li(k−1) = (Li0, …, Li(k−1))), but these covariates did not predict censorship or death independently of Li(k−1), and so were dropped from the model. Together, these models were used to calculate the cumulative probability of surviving up to a given follow-up wave and of participating in the assessment at that wave. Weights were applied at the level of observations within individuals, such that for each person-wave contribution to our analysis at wave j, the weight was the inverse of the probability of the conjunction of these two events. These weights can be obtained by the simple product formula:


Implicit to the models we estimated is the Markov assumption that an individual’s probability of contributing to the analysis at wave k, and thus of being alive and uncensored at wave k, depends on his or her history of the collection of time-varying covariates Li(k−1) only through its most recent value Li(k−1). Such an assumption may be relaxed by incorporating additional lagged covariate values, or a user-specified function of such values (e.g., cum(Li(k−1)) =Li0 + … + Li(k−1)) as potential predictors in the weight models. To optimize the fit of our attrition models, we explored several functional forms of time, including as a continuous variable and as a set of cycle indicators. We also evaluated several potentially important cross-products, including cognitive score with smoking and time with cognitive score, smoking and age. We used the same set of covariates in the death and drop-out models, selecting the final covariate set (shown in Table 1), as the set that contained variables with modest-to-strong associations with attrition and for which there were minimal missing data.

Table 1
Baseline characteristics of the study population, and adjusted hazard ratio (HR) and (95% confidence interval) of attrition over five study cycles, estimated from models of continuation.

We present model-based 95% confidence intervals (CIs) for the hazard ratios (HRs) relating each covariate to censoring, under the assumption that the pooled logistic regressions correctly model the hazard of continuation in the study given the entire history of covariates.39 We used the Bayesian information criterion as an indicator of global goodness of fit. To describe each models’ ability to discriminate those who were from those who were not censored, we computed the discordance percentage and the c-statistic. We used the Hosmer-Lemeshow test to describe each model’s calibration across a range of observed risks.4041

From the combination of the two cause-specific models, we computed IPA weights according to Equation 1. These are also called non-stabilized weights because, as the reciprocal of a probability, they are guaranteed to be greater than 1 for contributing observations, and may potentially be very large for a person with a small probability of staying alive and uncensored. As a potential remedy, we also computed wave-specific stabilized IPA weights by multiplying the individual’s non-stabilized weight at that wave by the conditional probability of remaining alive and uncensored up to that wave given a subset of baseline covariates Vi (a subset of Li0) and smoking status. Thus, as the ratio of two probabilities, we generally expect this stabilization to reduce the undue influence of a highly variable non-stabilized weight, and therefore to result in confidence intervals that are narrower than those in analyses using non-stabilized, potentially highly variable weights. Under our assumptions, both non-stabilized and stabilized weights give unbiased effect estimates, provided Vi is entered into the regression model relating smoking to cognitive function over time, and thus effect estimates conditional on Vi are reported in both analyses.42 Applying stabilized weights does not adjust for the covariates Vi that were used in the estimation of the numerator of the model. It is instead necessary to include the Vi as regression covariates in the primary analytic model. The stabilized weight for an individual’s contribution to wave j is thus given by:


Similar to the denominators, we obtained estimates of the numerators via pooled logistic regression analysis in which V consisted of baseline age, sex, race, education, baseline alcohol consumption, and baseline smoking status.

Several assumptions underlie the IPA weight estimation. First, we assume that the attrition process follows an ignorability assumption that states that the conditional probability of remaining alive and in the study in the next wave, given that one has survived and remained uncensored up to the current wave, does not further depend on one’s future cognitive function, given past observed covariates and cognitive measurements.43 In addition, throughout we make the standard positivity assumption43 that for any given wave of the study, and any possible realization of the covariates, smoking status and past cognitive function up to the current wave, there is a positive probability that an individual with that observed history remains alive and in the study in the next wave, given that he or she is alive and uncensored in the current wave.

It is important to note that had attrition been jointly independent of time-varying correlates of cognitive function, then a standard unweighted GEE analysis would have produced valid statistical inferences about the effects of smoking on cognitive function. Remarkably, under the above assumptions of the attrition process’s ignorability and positivity, given the observed time-varying correlates of cognitive function, our analytic approach corrects for selection bias due to attrition, to the extent that it recovers the effect of smoking on cognitive function (possibly conditional on a subset of baseline variables) one would have obtained using a standard GEE analysis, had attrition been jointly independent of all time-varying predictors of cognitive function (possibly conditional on a subset of variables).

Analyses of smoking and cognitive decline

We evaluated the association between current smoking at baseline and cognitive decline using unweighted and IPA-weighted generalized estimating equations (GEE) regression models,39 with working exchangeable correlation matrix, in which we estimated the difference between current and never smokers in rates of decline in global cognitive score. In all models, we regressed the global score on the set of predictors Vi, by including main effect terms for age, sex, race, education (4 categories, described previously), baseline alcohol intake (3 categories, described previously), smoking status, time (years, continuous), and the cross-products of each covariate with time. These analyses included data from all eligible person-wave contributions from participants who had a baseline cognitive score.

For comparison, we fitted unweighted models as well as models that weighted observations using the two sets of IPA-weighted estimates (non-stabilized weights and stabilized weights). Our primary hypothesis on the relation of smoking to cognitive decline was assessed with the cross-product between smoking and time, that is, the estimated difference between current and never smokers in their rates of cognitive decline. To make the estimates easier to interpret, we multiplied all estimated annual changes and differences in annual change by 10, obtaining estimates of change and differences in change over 10 years. To place these effect estimates in context, we compared them with the average rate of cognitive decline among never smokers, represented in the main effect term for time, and leaving all other covariates at their referent levels. Supposing that the rate of cognitive decline among never smokers represents “smoking-free cognitive aging,” we then estimated “excess years of cognitive aging” (over a 10-year interval) among current smokers by dividing the difference in 10-year change by the annual rate of change among never smokers.


Because standard errors from conventional IPA-weighted GEE models can be conservative,39 we generated bootstrap parameter estimates and standard errors.44 Using this approach, we repeated the entire set of analyses—from weight estimation to the estimation of the association of smoking with cognitive decline—on each of 1000 bootstrap samples. A given bootstrap estimate of the difference in rate of cognitive change among current smokers versus never smokers was the mean of the 1000 data sets in which individuals’ observed histories were sampled with replacement from the original data set. We used the bootstrap standard errors to compute 95% confidence intervals (CIs) for the difference in rate of cognitive change.

Using the bootstrap samples, we formally compared each weighted estimate of the association between smoking and cognitive decline with its unweighted counterpart using a Hausman-type specification test, which tests the null hypothesis that the unweighted estimate is consistent with the weighted estimate (the “consistent” estimator).4546 We also compared the estimates as rates of cognitive decline among current smokers as a percentage increase over the rate of decline among never smokers, where the latter rate is the estimate for time.


Of the 3713 participants who had baseline cognitive assessments and nonmissing data on key covariates, 2634 (71%) remained in the sample at the first follow-up, 1722 (46%) remained at the second follow-up, 1274 (34%) remained at the third follow-up, and 756 (20%) remained at the fourth follow-up. Mortality accounted for most (68%) of the attrition.

The variables included in our final censoring models are listed in Table 1. In these multivariable-adjusted analyses, the current smoker group, relative to never smokers, experienced substantially increased mortality risk (HR, 1.93; 95% CI, 1.67–2.23), but no difference in other-cause attrition (Table 1). By contrast, higher cognitive score was associated with markedly reduced risk of both mortality and other-cause attrition. Other strong predictors of mortality included older age, being male, white race, greater degree of disability at the previous visit, worse self-rated health at the previous visit, and diabetes at baseline. Those with the lowest level of education had reduced mortality, while those with the highest level of education were least likely to drop out. The estimates in the predictive model for mortality were markedly different in magnitude from those in the predictive model for other-cause attrition.

Inverse probability of attrition weights

Fit statistics for the weighting models and the distribution of the IPA weights are shown in Table 2. The models of non-censoring generally fit the data well, with good to excellent discrimination between those who died or dropped out and those who continued in the study. The models were also well-calibrated in that they generated predicted risks of attrition that generally matched observed risks, although they tended to perform somewhat more poorly at the highest decile of risk (eAppendix: eFigure 2). Weights generated by the model for censoring due to drop-out were fairly narrowly distributed, reflecting, in part, that we were unable to identify strong predictors of this type of attrition to the extent that we did for death-related censoring.

Table 2
Characteristics of attrition models and the weights they generated.a

Smoking and cognitive decline

In unweighted analyses, the estimated rate of decline on the cognitive tests among never smokers was 0.53 points (in standard units) over 10 years (Table 3). Current smokers’ estimated rate of decline was 0.11 points worse (95% confidence interval [CI], −0.20 to −0.02), on average, resulting in an average rate of decline of 0.64 points over 10 years. With the application of the nonstabilized IPCWs, this difference in rates of cognitive decline increased in magnitude by 56% to −0.17 points (95% CI, −0.31 to −0.02). When we applied the stabilized weights, the difference in rates increased further to −0.20 points over 10 years (95% CI, −0.36 to −0.04), 86% larger than the unweighted estimate. Estimates derived from the application of stabilized weights were slightly more efficient—indicated by the standard error as a fraction of the effect estimate—than those derived from the application of nonstabilized weights. Although the weighted difference estimates were substantially larger than the unweighted estimates, the CIs overlapped substantially, and the Hausman tests did not indicate that the unweighted estimate was significantly different from either the estimates from the analyses using non-stabilized (P=0.3) or stabilized weights (P=0.2).

Table 3
Unweighted and weighted multivariable-adjusteda difference between current and never smokers in cognitive change over 10 years.

Notably, weighted analyses also yielded larger estimates of the average rate of change in cognitive score among never smokers, likely because time in the study also predicted attrition. Nonetheless, the correction for attrition had a slightly greater impact on the estimates for current smokers. For example, considering the reference group (75-year-old females with 9–12 years of education and no alcohol use), the unweighted results suggested that smokers’ decline over 10 years would be as severe as the decline experienced by a non-smoker over 12.1 years. The weighted results suggested that, over 10 years, the smoker would decline as much as a non-smoker would decline over 12.5 years.


In this well-characterized longitudinal study of aging, both smoking and cognitive function were strong predictors of attrition after enrollment. Current smoking was associated with significantly faster rates of cognitive decline in all analyses. The application of IPAWs to these analyses to account for differential attrition patterns yielded estimates that were 56%–86% larger than unweighted estimates. In weighted analyses, estimates of cognitive decline among never smokers increased as well, yet weighting had an even larger impact on estimates of decline among smokers. The Hausman test did not reject at conventional thresholds of statistical significance, indicating we cannot rule out the possibility that the difference in estimates is due to chance variation. Yet the difference in point estimates of the magnitude observed may be considered substantively important. Imprecision in either the weighted or unweighted estimator reduce the statistical power of the Hausman test, so such tests may have insufficient power to reject the null even when point estimates differ substantially.

Differential selection processes distort the joint distribution of smoking and cognitive decline in a study population if cognitive change also influences selection, or if there are other uncontrolled factors that influence both selection and cognitive change. For example, given the lethality of cigarette smoking, smokers who survive may have other beneficial characteristics (e.g., genetic background) that protect them from cognitive decline (eAppendix, eFigure 3). This selection induces an association between the risk factor and cognitive decline, even if there is no true effect. Analyses of the risk factor and cognitive decline will be biased, often toward—but possibly beyond—the null. Our findings may offer some insight into the previously mentioned mixed findings in previous longitudinal studies of smoking and cognitive decline.15, 1925 In light of the findings in our own study, it seems plausible that many of these studies may have underestimated the adverse relation of smoking to cognitive decline. It bears mentioning that the use of more sophisticated measures of cognitive aging—such as more elaborate cognitive testing batteries, imaging, and clinical diagnoses–will not resolve biases stemming from differential attrition.

Our study has some limitations. While we took a detailed approach toward addressing death and drop-out after study enrollment, we did not address attrition prior to enrollment (also known as left truncation). In a study that begins when participants have already reached advanced age, mortality and debilitating morbidity related to smoking and cognitive function have occurred prior to study enrollment, leaving a study population that is already differentially selected. This possibility was highlighted by work complementary to ours, a meta-analysis of cohort studies of smoking and dementia which found that relative risks tended to be smaller—sometimes less than 1.0—for studies whose participants were enrolled at older ages.47 Left truncation processes that generated our study’s population of age-eligible participants may have resulted in conservative estimates of smoking’s association with cognitive decline, even from the IPA-weighted analyses. Indeed, in unweighted analyses stratified by age, we observed associations between smoking and cognitive decline that diminished in the oldest age group (eAppendix, eTable 1).

Many individuals who smoked at baseline quit during follow-up, so some people we classified as “smokers” were in fact “former smokers” at the time of later cognitive assessments. If cessation is more likely with poor cognition, then, in general, using baseline smoking status remediates this source of bias. The same tools of inverse probability weighting that we have used here to handle attrition could in principal also be applied to account for time-varying smoking status.3031 This extension requires a separate model for the determinants of quitting (or initiating) smoking. Our study also did not evaluate the effect of former smoking at baseline on cognitive decline, because such an evaluation entails far more complex methodological considerations around factors influencing cessation, including those that may be causal intermediates. Our IPA-weighted results were premised on the assumption that, conditional on the covariates, individuals who remained in the study and those who did not were “exchangeable” with respect to cognitive outcomes, and, further, that the censoring models were correctly specified. While the first assumption is not empirically testable, goodness-of-fit tests indicated that our models fit the data adequately. In fact, when we used IPAWs based on attrition models composed of other variables (e.g., time instead of cycle, history of hypertension, cognitive activity, previous cognitive change), we obtained IPA-weighted estimates that were consistent with those we reported here (examples in eAppendix, eTable 2) .

Our study also has several important strengths. The CHAP study has complete data on many variables for most participants, which permitted us to explore a wide range of predictors for our censoring models, and ultimately, to develop models that included many strong censoring predictors. In particular, the censoring models allowed us to consider important information on variables that we would otherwise avoid including in models of smoking and cognitive decline, specifically, previous cognitive function and potential intermediate factors such as self-rated health and disability. Finally, this study represents one of the first applications of inverse probability weighting to analyses of risk factors for cognitive decline and has demonstrated that accounting for differential attrition may unveil associations that are larger than those obtained from unweighted analyses, particularly when the risk factor of interest is strongly related to mortality.

It is likely that differential selection may influence the findings on other risk factors for cognitive aging, and, more generally, other aging-related outcomes. Risk factors for mortality such as smoking, elevated blood pressure, hypercholesterolemia, diabetes, and socioeconomic position have often been observed to have diminished impact in “older older” adults as compared with “younger older” adults.4850 This pattern appears in findings on risk factors of dementia, too, whereby a factor that predicts dementia risk among “younger older” cohorts more weakly predicts dementia risk—or fails to predict dementia risk at all—among “oldest old” adults.47, 5155 While some age-dependent patterns have a hypothesized biologic basis,5557 determining which patterns are related to selection and to what degree has critical implications for extrapolating study findings to clinical practice and health policy. This study makes an advance in that direction by addressing the influence of differential attrition on the estimated relation between smoking and cognitive decline.

Supplementary Material


Financial support: Dr. Weuve was supported in part by funds from NIH-NIEHS grant R21 ES016829. Dr. Glymour was supported in part by funds from NIH-NIA grant R21 AG34385.

No acknowledgments to report


Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final citable form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.


1. Linn RT, Wolf PA, Bachman DL, et al. The 'preclinical phase' of probable Alzheimer's disease. A 13-year prospective study of the Framingham cohort. Arch Neurol. 1995;52(5):485–490. [PubMed]
2. Bennett DA, Wilson RS, Schneider JA, et al. Natural history of mild cognitive impairment in older persons. Neurology. 2002;59(2):198–205. [PubMed]
3. National Institutes of Health State-of-the-Science Conference Statement: Preventing Alzheimer's Disease and Cognitive Decline. National Institute on Aging and Office of Medical Applications Reseach, of the National Institutes of Health; April 26–28, 2010; Bethesda, MD. 2010.
4. Hernán MA, Hernández-Diaz S, Robins JM. A structural approach to selection bias. Epidemiology. 2004;15(5):615–625. [PubMed]
5. Chodosh J, Seeman TE, Keeler E, et al. Cognitive decline in high-functioning older persons is associated with an increased risk of hospitalization. J Am Geriatr Soc. 2004;52(9):1456–1462. [PubMed]
6. Welmerink DB, Longstreth WT, Jr, Lyles MF, Fitzpatrick AL. Cognition and the Risk of Hospitalization for Serious Falls in the Elderly: Results From the Cardiovascular Health Study. J Gerontol A Biol Sci Med Sci. 2010 [PMC free article] [PubMed]
7. Greiner PA, Snowdon DA, Schmitt FA. The loss of independence in activities of daily living: the role of low normal cognitive function in elderly nuns. Am J Public Health. 1996;86(1):62–66. [PubMed]
8. Raji MA, Al Snih S, Ray LA, Patel KV, Markides KS. Cognitive status and incident disability in older Mexican Americans: findings from the Hispanic established population for the epidemiological study of the elderly. Ethn Dis. 2004;14(1):26–31. [PubMed]
9. Yaffe K, Lindquist K, Vittinghoff E, et al. The Effect of Maintaining Cognition on Risk of Disability and Death. J Am Geriatr Soc. 2010 [PMC free article] [PubMed]
10. Bassuk SS, Wypij D, Berkman LF. Cognitive impairment and mortality in the community-dwelling elderly. Am J Epidemiol. 2000;151(7):676–688. [PubMed]
11. Euser SM, Schram MT, Hofman A, Westendorp RG, Breteler MM. Measuring cognitive function with age: the influence of selection by health and survival. Epidemiology. 2008;19(3):440–447. [PubMed]
12. Chatfield MD, Brayne CE, Matthews FE. A systematic literature review of attrition between waves in longitudinal studies in the elderly shows a consistent pattern of dropout between differing studies. J Clin Epidemiol. 2005;58(1):13–19. [PubMed]
13. Matthews FE, Chatfield M, Brayne C. An investigation of whether factors associated with short-term attrition change or persist over ten years: data from the Medical Research Council Cognitive Function and Ageing Study (MRC CFAS) BMC Public Health. 2006;6:185. [PMC free article] [PubMed]
14. Glymour MM, Weuve J, Chen JT. Methodological challenges in causal research on racial and ethnic patterns of cognitive trajectories: measurement, selection, and bias. Neuropsychology Review. 2008;18:194–213. [PMC free article] [PubMed]
15. Anstey KJ, von Sanden C, Salim A, O'Kearney R. Smoking as a risk factor for dementia and cognitive decline: a meta-analysis of prospective studies. Am J Epidemiol. 2007;166(4):367–378. [PubMed]
16. Brayne C. Smoking and the brain. BMJ. 2000;320(7242):1087–1088. [PMC free article] [PubMed]
17. Sacco KA, Bannon KL, George TP. Nicotinic receptor mechanisms and cognition in normal states and neuropsychiatric disorders. J Psychopharmacol. 2004;18(4):457–474. [PMC free article] [PubMed]
18. Sabia S, Marmot M, Dufouil C, Singh-Manoux A. Smoking history and cognitive function in middle age from the Whitehall II study. Arch Intern Med. 2008;168(11):1165–1173. [PMC free article] [PubMed]
19. Peters R, Poulter R, Warner J, Beckett N, Burch L, Bulpitt C. Smoking, dementia and cognitive decline in the elderly, a systematic review. BMC Geriatr. 2008;8:36. [PMC free article] [PubMed]
20. Herbert LE, Scherr PA, Beckett LA, et al. Relation of smoking and low-to-moderate alcohol consumption to change in cognitive function: a longitudinal study in a defined community of older persons. Am J Epidemiol. 1993;137(8):881–891. [PubMed]
21. Knopman D, Boland LL, Mosley T, et al. Cardiovascular risk factors and cognitive decline in middle-aged adults. Neurology. 2001;56(1):42–48. [PubMed]
22. Nooyens AC, van Gelder BM, Verschuren WM. Smoking and cognitive decline among middle-aged men and women: the Doetinchem Cohort Study. Am J Public Health. 2008;98(12):2244–2250. [PubMed]
23. Peters R, Beckett N, Geneva M, et al. Sociodemographic and lifestyle risk factors for incident dementia and cognitive decline in the HYVET. Age Ageing. 2009;38(5):521–527. [PubMed]
24. Richards M, Jarvis MJ, Thompson N, Wadsworth ME. Cigarette smoking and cognitive decline in midlife: evidence from a prospective birth cohort study. Am J Public Health. 2003;93(6):994–998. [PubMed]
25. Yaffe K, Fiocco AJ, Lindquist K, et al. Predictors of maintaining cognitive function in older adults: the Health ABC study. Neurology. 2009;72(23):2029–2035. [PMC free article] [PubMed]
26. Launer LJ, Feskens EJ, Kalmijn S, Kromhout D. Smoking, drinking, and thinking. The Zutphen Elderly Study. Am J Epidemiol. 1996;143(3):219–227. [PubMed]
27. Doll R, Peto R, Boreham J, Sutherland I. Mortality in relation to smoking: 50 years' observations on male British doctors. BMJ. 2004;328(7455):1519. [PMC free article] [PubMed]
28. Greenland S. Chapter 19: Basic methods for sensitivity analysis and external adjustment. In: Rothman KJ, Greenland S, editors. Modern Epidemiology, Second Edition. Philadelphia, PA: Lippincott-Raven; 1998. pp. 343–357.
29. Glymour MM, Weuve J, Berkman LF, Kawachi I, Robins JM. When is baseline adjustment useful in analyses of change? An example with education and cognitive change. Am J Epidemiol. 2005;162(3):267–278. [PubMed]
30. Hernán MA, Brumback B, Robins JM. Marginal structural models to estimate the causal effect of zidovudine on the survival of HIV-positive men. Epidemiology. 2000;11(5):561–570. [PubMed]
31. Cole SR, Hernán MA, Margolick JB, Cohen MH, Robins JM. Marginal structural models for estimating the effect of highly active antiretroviral therapy initiation on CD4 cell count. Am J Epidemiol. 2005;162(5):471–478. [PubMed]
32. Bienias JL, Beckett LA, Bennett DA, Wilson RS, Evans DA. Design of the Chicago Health and Aging Project (CHAP) J Alzheimers Dis. 2003;5(5):349–355. [PubMed]
33. Albert M, Smith LA, Scherr PA, Taylor JO, Evans DA, Funkenstein HH. Use of brief cognitive tests to identify individuals in the community with clinically diagnosed Alzheimer's disease. Int J Neurosci. 1991;57(3–4):167–178. [PubMed]
34. Smith A. Symbol Digit Modalities Test Manual - Revised. Los Angeles: Western Psychological Services; 1982.
35. Folstein MF, Folstein SE, McHugh PR. Mini-Mental State: A practical method for grading the state of patients for the clinician. Journal of Psychiatric Research. 1975;12:189–198. [PubMed]
36. Wilson RS, Bennett DA, Bienias JL, Mendes de Leon CF, Morris MC, Evans DA. Cognitive activity and cognitive decline in a biracial community population. Neurology. 2003;61(6):812–816. [PubMed]
37. Nagi SZ. An epidemiology of disability among adults in the United States. Milbank Mem Fund Q Health Soc. 1976;54(4):439–467. [PubMed]
38. Barnes LL, Mendes de Leon CF, Bienias JL, Evans DA. A longitudinal study of black-white differences in social resources. J Gerontol B Psychol Sci Soc Sci. 2004;59(3):S146–S153. [PubMed]
39. Robins JM, Rotnitzky A. Semiparametric efficiency in multivariate regression models with missing data. Journal of the American Statistical Association. 1995;90(429):122–129.
40. Hosmer DW, Lemeshow S. Applied Logistic Regression. New York: John Wiley & Sons; 1989. Chapter 5: Assessing the fit of the model; pp. 135–175.
41. Cook NR. Use and misuse of the receiver operating characteristic curve in risk prediction. Circulation. 2007;115(7):928–935. [PubMed]
42. Cole SR, Hernán MA. Constructing inverse probability weights for marginal structural models. Am J Epidemiol. 2008;168(6):656–664. [PMC free article] [PubMed]
43. Rotnitzky A, Robins JM. Semiparametric regression estimation in the presence of dependent censoring. Biometrika. 1995;82(4):805–820.
44. Efron B, Tibshirani RJ. An Introduction to the Bootstrap. Boca Raton, FL: Chapman & Hall/CRC; 1993.
45. Hausman JA. Specification tests in econometrics. Econometrica. 1978;46(6):1251–1271.
46. Newey WK. Generalized method of moments specification testing. Journal of Econometrics. 1985;29(3):229–256.
47. Hernán MA, Alonso A, Logroscino G. Cigarette smoking and dementia: potential selection bias in the elderly. Epidemiology. 2008;19(3):448–450. [PubMed]
48. Marang-van de Mheen PJ, Shipley MJ, Witteman JC, Marmot MG, Gunning-Schepers LJ. Decline of the relative risk of death associated with low employment grade at older age: the impact of age related differences in smoking, blood pressure and plasma cholesterol. J Epidemiol Community Health. 2001;55(1):24–28. [PMC free article] [PubMed]
49. Nybo H, Petersen HC, Gaist D, et al. Predictors of mortality in 2,249 nonagenarians--the Danish 1905-Cohort Survey. J Am Geriatr Soc. 2003;51(10):1365–1373. [PubMed]
50. Lewington S, Clarke R, Qizilbash N, Peto R, Collins R. Age-specific relevance of usual blood pressure to vascular mortality: a meta-analysis of individual data for one million adults in 61 prospective studies. Lancet. 2002;360(9349):1903–1913. [PubMed]
51. Kennelly SP, Lawlor BA, Kenny RA. Blood pressure and the risk for dementia: a double edged sword. Ageing Res Rev. 2009;8(2):61–70. [PubMed]
52. Letenneur L, Gilleron V, Commenges D, Helmer C, Orgogozo JM, Dartigues JF. Are sex and educational level independent predictors of dementia and Alzheimer's disease? Incidence data from the PAQUID project. J Neurol Neurosurg Psychiatry. 1999;66(2):177–183. [PMC free article] [PubMed]
53. Luchsinger JA, Patel B, Tang MX, Schupf N, Mayeux R. Measures of adiposity and dementia risk in elderly persons. Arch Neurol. 2007;64(3):392–398. [PMC free article] [PubMed]
54. Rocca WA, Bower JH, Maraganore DM, et al. Increased risk of cognitive impairment or dementia in women who underwent oophorectomy before menopause. Neurology. 2007;69(11):1074–1083. [PubMed]
55. Li G, Rhew IC, Shofer JB, et al. Age-varying association between blood pressure and risk of dementia in those aged 65 and older: a community-based prospective cohort study. J Am Geriatr Soc. 2007;55(8):1161–1167. [PubMed]
56. Rocca WA, Grossardt BR, Shuster LT. Oophorectomy, menopause, estrogen, and cognitive aging: the timing hypothesis. Neurodegener Dis. 2010;7(1–3):163–166. [PMC free article] [PubMed]
57. Fitzpatrick AL, Kuller LH, Lopez OL, et al. Midlife and late-life obesity and the risk of dementia: cardiovascular health study. Arch Neurol. 2009;66(3):336–342. [PMC free article] [PubMed]