These findings from a large representative sample add a new dimension to our understanding of the patterns and consequences of physical activity in the US population. The estimates of both vigorous and moderate activity were extremely low, and contrast dramatically with those obtained by self-report [
8,
25-
27]. Vigorous activity lasting even 1 minute was only observed in 2% of any of the gender-race/ethnic groups and a 10-minute episode of moderate activity - the intensity obtained by walking up stairs - was recorded in only one third of the participants on any day of monitoring. The overall pattern observed among population sub-groups was, however, consistent with expectations. Mexican-American men were somewhat more active than blacks or whites, which might be attributable to physically demanding occupations, while among women whites appeared to be slightly more active than either blacks or Mexican Americans, possibly reflecting leisure time activity. Activity declined sharply with age; after 60 only ~ 15 minutes of moderate activity was recorded among men and 10 minutes among women per day. Despite the low mean levels a highly significant association was observed between activity level and all the major metabolic risk factors for CVD confirming that the measurements were valid and the effects sufficiently large to confer physiologic consequences.
The major finding from these analyses is the demonstration that population estimates of activity levels from surveys by questionnaire are markedly at variance with those obtained by objective measurements. As the only source available from past surveys, questionnaires have been used in analytic research and have informed public policy for the last 50 years. If the data presented here are correct, a re-evaluation of the conclusions from much of this literature would be required. For example, based on national survey data it was assumed in
Healthy People 2010 that 23 percent of adults engaged in vigorous activity of more than 20 minutes per episode at least 3 times per week at the beginning of this decade [
3]. However, in the NHANES data presented here, < 1 percent of the population achieved this level of expenditure. Likewise, current guidelines recommend 150 minutes of moderate or 75 minutes of vigorous activity per week for adults [
6]. Only 0.3%, or 10 of the 3,370 individuals in this sample, achieved that level. This result is in stark contradiction to a recent report using self-reported "usual occupational/domestic activity" in a subset of the same 2003-2006 NHANES participants where 42% of persons with a mean age of 45 met current guidelines [
28]; precision of this self-reported activity measure was apparently low, however, since it was unassociated with CVD risk factors. The findings of this recent report are not atypical as current trends based on questionnaires suggest that a large proportion of the population engages in recreational activity; these trends, however, could well be biased by a social desirability effect [
6,
29,
30].
Is the large scale downward shift of the magnitude described here a plausible assessment of activity patterns in the US population? On the surface the discrepancy between questionnaire and measured activity exceeds reasonable expectation. The only measure of validity available from the NHANES survey itself was replication of the risk factor associations. An extensive literature from observational studies and trials supports the association between exercise and CVD risk factors [
31-
33], therefore replication of these relationships makes it is reasonable to assert that the accelerometer data from NHANES are capturing the physiologic benefit associated with increasing levels of physical activity. Admittedly this validation is indirect, and additional evidence must be sought in external studies which used similar methods. Objective measurement of energy expenditure has only become feasible in the last two decades, and few of the available studies include representative population samples [
17,
18], therefore we know of no other studies bearing directly on this question. Methodological studies suggest that activity estimates from questionnaires are only correlated at approximately 0.2 with DLW, generally viewed as the most accurate approach [
34]. Activity monitors, on the other hand, have been shown to correlate on average at 0.5-0.6 with energy expenditure in activity [
14,
35,
36], representing a substantial increase in precision.
The critical question for these NHANES data, however, is not the degree to which accelerometry places individuals in the correct rank order of increasing activity, but whether the absolute amount of activity is being measured more accurately than by questionnaire. A recent review examined mean differences between estimates from DLW vs. questionnaire in 20 studies [
34]. These studies were extremely heterogeneous in terms of sample size, type of participants and the questionnaire and, not surprisingly, the results were highly inconsistent; questionnaires overestimated energy expenditure by 1,000 kcal/day in some instances and under-estimated by 400 kcal/day in others [
34]. A similarly heterogeneous literature exists on the concordance between accelerometry and DLW [
14]. Contrary to questionnaires, accelerometry slightly under-estimated total expenditure in all but one report, and mean differences tended to be much smaller - in the range of 100 - 200 kcal/day, or about 15 - 25% of physical activity expenditure. A second comprehensive review summarized the concordance between accelerometry and questionnaires in 47 validation studies [
37]. On average, questionnaires recorded 44% more daily energy expenditure than did activity monitors. This second review also found that the degree of heterogeneity in the comparison of questionnaires with DLW was so great that no conclusions were possible, although there was an indication that the discrepancy was larger for women than men [
37]. It must be recognized that the individual studies reviewed used a variety of instruments, each applying a different algorithm to generate caloric expenditure from activity counts, and they may not be directly comparable to the instrument used in NHANES. In general, however, it seems reasonable to conclude that questionnaires are subject to widely varying bias, most often leading to large over-estimates, while accelerometry has a far smaller, contrary tendency to under-estimation. This evidence would suggest that true expenditure among NHANES participants is closer to the accelerometry estimates, although somewhat higher. Nonetheless, even when applying a threshold of counts per minute that was only 30% of the standard set by direct calorimetry relatively few individuals met the guidelines.
Accelerometry is of course subject to potential biases. For example, cycling or activities that require weight bearing will not be adequately captured, although these are infrequent in the general population. Likewise the device is not worn while swimming. Artifactual increases in counts can also occur as a result of external sources of motion, such as riding in a vehicle. Artifact is a particularly important bias for these data since the vast majority of activity was recorded in episodes lasting only 1 minute. These short bursts are unlikely to represent intentional efforts to accumulate fitness-inducing exercise or physically demanding tasks at work of the type that would be captured by questionnaires. In fact, perhaps the most robust conclusion from this survey is that very few Americans engage in sustained activity, such as jogging or long walks, on a frequent basis. Likewise, in an on-going multi-national survey we have also observed a very low frequency of 10-minute bouts of activity in all 5 study populations (A. Luke, unpublished data). Clearly a more detailed analysis of questionnaire data in concert with accelerometry, and preferably DLW, will be required to resolve these questions.
In addition to the associations with metabolic risk factors, significant but quantitatively weak negative associations exist between the activity measures and BMI and obesity. Likewise, after accounting for BMI, the association with CVD risk factors was substantially weakened, highlighting the confounding that would be expected among these variables. The activity-BMI relationship cannot be considered causal, however, since these data are cross-sectional. Causality could be operating in the opposite direction or in both directions simultaneously. In fact, whether increased activity prevents weight gain is a contentious question. Despite the widely held perception that low levels of energy expenditure in activity is an important risk factor for obesity, prospective data do not support this view [
18,
38]. Randomized trials, where activity levels are rigorously measured and no attempt is made to restrict calories, likewise show that even substantial increases in energy expenditure in exercise do not result in weight loss because of compensatory increases in intake [
39,
40]. We conclude, therefore, that the associations observed in the NHANES data presented here between activity and relative weight are spurious - i.e., the direction of the causality is most likely from obesity to lower activity.
Previous analyses of the NHANES activity monitoring data have noted a similar outcome as reported here with regard to levels of activity for the US population and the association with obesity [
13,
15,
41-
44]. Troiano et al. [
15] and Metzger et al. [
13] reported a slightly higher proportion of the US population meeting the current physical activity guidelines than our estimate (i.e., 5%). Metzger et al. used the data to define 5 classes of physical activity, including two classes of very low activity. The combined physical activity level of these 2 classes was less than 25 min of moderate/vigorous physical activity per day and represented almost 79% of the US population [
13].
One of the challenges for activity assessment by accelerometry has been the choice of appropriate summary measures. Multiple alternative measures have been used and the results are often difficult to interpret or compare across studies [
45-
47]. After detailed preliminary analyses we chose three basic metrics - activity counts per minute and time spent in either vigorous or moderate activity. As noted, the 1-minute bouts are subject to over-estimation of activity across the day due to instability of the monitor The 10-minute bouts are potentially meaningful with regard to health benefits, but may not be capturing the true activity patterns of Americans and 2- or 3-minute bouts could yield different results. However, as shown in Figure , much less data will be available with these cut-points. Using data from an on-going multi-country study, we estimated an intraclass correlation coefficient of 0.88 with six days of activity monitoring, indicating the NHANES data presented here with an average of 6 days, characterize the individual's activity patterns over the measurement period quite well (A. Luke, unpublished data).