|Home | About | Journals | Submit | Contact Us | Français|
Aims. The purpose of the study was to develop new self-report instruments to measure the ability to walk, run, and lift objects and describe the distribution of these abilities among older Canadians. Methods. Questions were developed following a focus group. We carried out an online survey among members of the Canadian Association of Retired Persons. The distribution of each ability was described and presented graphically according to age, sex, and number of health conditions. We calculated summary scores for each ability and assessed their reliability and relationships with health status and use of health services. Results. 22% of the subjects reported difficulty walking 100m, 15% were unable to run 10m, and 50% had difficulty lifting 10kg. Men reported higher abilities than women but differences according to age were small. Test-retest reliability ranged from 0.89 for walking to 0.88 for running and 0.81 for lifting. Scores for the three measures correlated with other measures of health status as expected. Conclusions. The study provided new data on self-reported walking, running, and lifting abilities among older Canadians. The new measures are valid, reliable, and easy to interpret. We expect these measures to be useful in clinical and research settings.
Physical activity provides many health benefits, such as lower risk of heart disease and several types of cancer, improved muscle strength and mobility, longer life expectancy, and better self-reported quality of life [1–3]. However, physical activity is often limited among older persons and those with chronic conditions due to physical disabilities. Although there exist a large number of standardized questionnaires to measure physical function [4–7], population data on the distribution of specific abilities, such as walking, running, or lifting, among older adults are somewhat limited. This is partly due to methodological issues in measuring self-reported physical function.
In population surveys, physical function is usually measured using generic or diseases-specific health or quality of life questionnaires. For example, Statistics Canada provides data on the proportion of persons with mobility disability in Canada from the Population and Activity Limitation Survey in which mobility is measured by questions about difficulty in activities of daily living and limitations in the kind or amount of activity a person can do . A 10-item physical function scale is part of the Short Form-36 Health Survey (SF-36), the most commonly used generic health measure . An adaptive, item response theory (IRT) based measure of physical function with a 124-item bank has recently been validated . These multi-item scales combine questions on different aspects or dimensions of physical function, for example, upper and lower body function and activities of daily living, into a single score. Overall scores derived from responses to multiple items covering a wide range of abilities may not be easy to interpret, especially for a clinician unfamiliar with the content of the instrument. In conventional measures, a detailed analysis of responses to each item is possible, but such an analysis is time-consuming and, depending on the items included in the measure, may not provide a comprehensive assessment of, or sufficient detail for, any specific ability. In adaptive testing, the user may not even know which items have been administered.
In this study, we developed self-reported measures of walking, running, and lifting abilities based on the Activity Space Model (ASM) of disability . Ability was conceptualized as a relationship between an objectively defined activity level, expressed in physical units (distance, weight), and subjective perception of difficulty or effort. In this approach, items measuring a given ability are not considered independent. All items pertain to ordered levels of the same activity, defined along a single physical dimension, and ability is represented graphically by a (monotonic) curve. In the ASM, the area under the curve is a reasonable overall measure of a given ability . The main justification for this novel approach is that it may offer some advantages compared with conventional psychometric measures of physical abilities. These advantages include conceptual unidimensionality, improved interpretation, and greater detail and comprehensiveness in the assessment of specific abilities. Our purpose in this article was to demonstrate the usefulness of this approach in measuring physical abilities and describe the ability to walk, run, and lift objects in a large sample of older Canadians.
Questions pertaining to walking, running, and lifting were developed following a 3-hour focus group with 8 individuals. Participants with arthritis or heart disease residing in Vancouver, Canada, were recruited through newspaper advertisements. In the focus group, the participants completed several previously developed questionnaires, and the pros and cons of different questionnaire layouts were discussed. The final format and wording of the questions are shown in Appendix. We used a matrix format with five response options. For walking, the question stem asked “In your present health, how difficult is it for you to walk the following distances?” Five distances were presented in order from the shortest to the longest, spaced equally on a logarithmic scale (except the last one): 10m (30 feet), 100m (1 block), 1km (10–15min walk), 10km (2-hour walk), and 50km (10–12-hour walk). The response options were “not difficult at all,” “a little difficult,” “somewhat difficult,” “very difficult,” and “unable.” For running, the distances and response options were similar. For lifting, the response options were the same and the questions asked about lifting 6 weights from waist to shoulder level, ranging from 250g to 100kg. The weights were also given in pounds and examples of objects were provided for each weight (e.g., a 4-litre milk container or a one-year old child).
We carried out an online survey among members of the Canadian Association of Retired Persons who had previously agreed to receive email requests for participation in research. In addition to the new instruments, the survey questionnaire included an IRT-based computerized adaptive measure of 5 domains of health-related quality of life (CAT-5D-QOL) [12, 13], and questions about overall health, chronic conditions, and use of health services. A randomly chosen subsample of subjects completed the SF-36v2 . The questionnaire was administered on an online survey system developed and hosted at Arthritis Research Canada . All subjects received an introductory email followed by two email reminders. A test-retest was performed on a subsample of the respondents. All subjects consented to participate in the study. The study was approved by the University of British Columbia Behavioral Research Ethics Board.
For each item in the new measures, the difficulty levels were assigned scores from 0 (unable) to 4 (not difficult at all). These scores were converted to percentage scores (0–100%). We plotted individual ability curves, showing the relationship between difficulty and activity levels. Mean ability curves for groups according to age, sex, and number of chronic conditions were derived by calculating the mean difficulty for each level of activity. Summary ability scores for each measure were obtained by summing up the scores for all items, expressed as a percentage of the maximum possible score (0–100%). Higher scores represented higher ability. Scores from the CAT-5D-QOL and SF-36 were norm-based (mean = 50 and SD = 10). They were obtained following established and previously published methods [12, 14] with higher scores denoting better health. Ceiling effect was defined as the proportion of subjects obtaining maximum possible score and floor effect was defined as the proportion with the lowest possible score.
Descriptive data included the frequency of responses to all options on all items. We obtained the distribution plots for the summary scores and calculated means and standard deviations of the scores for each measure. Test-retest reliability of the summary scores was evaluated by calculating the Intraclass Correlation Coefficient (ICC). To determine construct validity, we obtained Pearson's correlations with CAT-5D-QOL and SF-36 domains and assessed the relationships between ability scores and demographic variables, chronic conditions, use of medication, visits to doctors, and hospitalization in the past year in multivariable regression analysis.
The online questionnaire was completed by 1,089 subjects (response rate 35%). Baseline descriptive data are shown in Table 1. The average age of the respondents was 66.3 years (SD 7.1) and 56% were women. About 54% had college or university education and 29% had high school education or less. 24% reported no health conditions, while 29%, 22%, and 26% reported one, two, and three or more conditions, respectively. About 14% percent did not take any medications in the past 4 weeks while 52% reported taking 3 or more medications. The SF-36 was completed by 549 individuals. Baseline mean physical and mental composite scores (norm-based) in this subsample were 44.3 (range 10.9–65.6) and 53.3 (12.4–71.0), respectively.
Frequencies of responses to each question are given in Table 2. Missing data were rare (<5%), except for walking 50km (9.6%). There was very little ceiling effect for any of the measures. Difficulty in walking 10m was reported by 14.3%, 100m by 22.2%, and 1km by 37.8% of the respondents (excluding missing), while 21.6% had no difficulty walking 10km. Only 1.3% were unable to walk 100m. However, 50.0% found it difficult and 15.4% were unable to run 10m (floor effect). Furthermore, 76.4% had difficulty running 100m, 24.5% were unable to run this distance, and 50.3% were unable to run 1km. On the other hand, 24.0% stated they could run 10km. Very few respondents reported problems lifting weights up to 1kg, but 24.7% had difficulty lifting 4kg and 51.1% had difficulty lifting 10kg (3.1% were unable). On the other hand, 75.0% said they could lift 50kg (7.3% without difficulty).
Examples of walking ability curves for persons with varying levels of ability are shown in Figure 1. Mean walking, running, and lifting ability curves for men and women are shown in Figures 2(a)–2(c). The shapes of the curves for walking and lifting are similar, with relatively little difficulty for lower levels of activity, whereas the shape for running is different and characterized by relatively high levels of difficulty. Men reported higher levels of ability than women throughout the full range on all 3 measures, with the largest differences seen for running up to 1km and lifting 10kg or more.
Mean ability curves according to age are shown in Figures 3(a)–3(c). The data indicate very little effect of age on walking all distances up to age 70 in this population and somewhat reduced walking ability after age 70. A similar pattern was seen for running, although the ability curves for all age groups were much lower. In terms of lifting, our data showed virtually no difference between the age groups.
The ability to walk, run, and lift objects differed substantially according to the number of chronic conditions reported (0, 1, 2, 3, and 4+), as shown in Figures 4(a)–4(c). The differences were particularly large for walking 1km and 10km. For running, the number of chronic conditions was also a strong discriminating factor, mainly for the shorter distances (10m to 1km). With respect to lifting, the differences were somewhat smaller but a similar pattern was observed, with greatest differences for lifting 4kg, 10kg, and 50kg.
Mean summary scores were 67.4% (range 0–100%, SD 21.2) for walking, 32.1% (0–100%, 23.0) for running, and 69.2% (17–100%, 13.6) for lifting. Retests were performed between 1.5 and 4.5 weeks after the baseline (mean 18.3 days, SD 3.4 days). Test-retest reliability (ICC) was high: 0.89 for walking (n = 287), 0.88 for running (n = 280), and 0.81 for lifting (n = 289). Correlations among the new measures were 0.71 (walking and running), 0.49 (walking and lifting), and 0.51 (running and lifting). Summary scores correlated as expected with the CAT-5D-QOL and SF-36 domains (Table 3). The highest correlation was between walking ability and the WALK domain of the CAT-5D-QOL (r = 0.87). For lifting ability, the highest correlation with the CAT-5D-QOL domains was with HAND, followed by DAILY, WALK, and PAIN. Among the SF-36 domains (subsample), the strongest correlation for all 3 measures was with physical function and the weakest with mental health (for walking and running abilities) and role emotional (for lifting).
In the regression analysis adjusted for age and sex, lower walking, running, and lifting ability scores were strongly and significantly associated with greater number of conditions and greater use of health services (Table 4). The standardized regression coefficients were highest for walking, followed by running and lifting. For example, the standardized coefficients for walking ranged from −0.45 for the number of chronic conditions to −0.17 for hospitalization.
In this study, we used newly developed and validated questions about physical abilities, based on the Activity Space Model of disability , to describe the ability to walk, run, and lift objects in a large community sample of older Canadians. The study showed how the level of difficulty in basic functions, such as walking, running, or lifting, depends on the level of activity (distance or weight). For a broad range of activity levels, the study provides data on the proportion of individuals with various levels of difficulty, including those unable to perform the activity. We also showed how the ability curves can be presented graphically and how they vary according to sex, age, and number of reported chronic conditions. As expected, we found that men reported a higher level of function for all 3 abilities studied, although the differences varied somewhat according to the type and level of activity. However, differences according to age were relatively small, especially for persons <70 years of age. It is the presence and, notably, the number of chronic conditions that showed the strongest discrimination with respect to walking, running, and lifting abilities. We also found a strong relationship between these abilities and the use of health services, such as hospitalization, visits to physicians, and use of medication.
Limitations of the survey include the possibility of coverage error and a relatively low response rate which may lead to selection bias and limit generalizability of the results. However, response rates in the 30–40% range are not uncommon in online surveys of the general population . Our study provided new data that are not easily obtainable or comparable with published data from other population surveys. For example, in the SF-36 questionnaire there are questions about walking 100 yards, several hundred yards, and more than a mile (with 3 levels of limitation each), but they are scored as part of a 10-item physical function scale. Separate questions about running or lifting are not included. The 2006 Participation and Activity Limitation Survey (PALS) in Canada  asked about ability to walk (yes/no) and difficulty (some, a lot, and unable) walking 1/2 kilometres, walking up/down a flight of stairs, carrying 5kg for 10 meters, standing in line for >20min, standing in one spot for 20min, and moving from one room to another. While such data may be used to determine the proportion of persons with difficulties performing certain activities at a specified level, they provide a very selective and limited picture of any particular ability.
It should be clear that the proportion reporting a specified level of difficulty depends on the level of activity (e.g., distance, weight). Our definition of ability as a relationships between the level of activity and perceived level of difficulty offers greater clarity with respect to the dimensions being measured. The questions are straightforward to answer while the resultant (monotonic) ability curves should be easy to interpret for a clinician. Compared with a response profile from a typical multi-item instrument, a series of ability curves provides an instant and precise assessment of the key abilities. Another potential advantage of this approach is that translation and cross-cultural equivalence of the instrument may be easier to achieve. For these reasons, we would argue that an ability curve is a more comprehensive, detailed, and easier interpretable measure of ability in a specific functional domain than arbitrarily defined proportion disabled responses to individual questions from a standard health status measure, or even a psychometric physical function score. We expect the approach to measuring physical abilities proposed here to be useful in a clinical setting, where precise, rapid, and highly interpretable assessment of specific abilities is important. Such an assessment may also be important in a research context.
The summary score (interpretable as area under the ability curve) is a convenient overall measure of ability, based on responses to multiple, conceptually unidimensional items. While in our view the summary score should not replace graphical representation of abilities, it is useful for statistical analyses and comparisons. Test-retest reliability of the summary scores was high. The scores correlated as expected with specific domains of established health instruments and discriminated very well according to health services use, which demonstrates their convergent and discriminant validity.
We acknowledge that in some research applications the user may only be interested in an overall physical function score and not need to know which abilities are actually evaluated. Also, some users may prefer to rely on instrument developers to decide which activities to include. For some respondents, answering questions pertaining to objectively defined activity levels may be more difficult than referring to familiar daily activities. Therefore, we included additional clues (e.g., walking time) or examples (e.g., objects of different weight) when describing the levels of activity. Some researchers might perhaps argue that estimating the entire ability curve is inefficient and the amount of detail it provides may not be necessary. However, in the context of measuring specific abilities, it is useful to know the levels of activity (e.g., distance to walk) associated with extreme response options (not difficult at all and unable) because of their impact on participation in more complex activities and social roles, such as work, housework, school, or recreation. The assessment can be made more efficient by applying simple skip logic available in most online survey systems. For example, if the respondent is unable to walk 100m, there is no need to ask about 1000m.
The approach to measuring physical abilities presented here could be used to develop a new “gold standard” for measuring physical function. To this end, the ability curve for each ability could be derived by asking participants to perform well-defined activities and measuring the subjective level of difficulty for systematically varied activity levels. Another potential research objective might be a better understanding of the mathematical relationship between activity level and perception of difficulty or effort for various types of activities and various scales of measurement, similar to studies in psychophysics. Finally, self-report measures of other abilities, both elementary (e.g., to climb stairs, stand, bend, and reach) and more complex (e.g., ability to work), could be developed using the methodology presented here and we would encourage this line of research.
The authors wish to thank Courtney Kang for help in conducting the focus group.
Item wording for measures of walking, running, and lifting abilities.
In your present health, how difficult is it for you to walk the following distances:
In your present health, how difficult is it for you to run the following distances:
In your present health, how difficult is it for you to lift the following weights from the waist level to the shoulder level:
The authors report no conflict of interests. The study was funded by a grant from the Canadian Arthritis Network.