The use of global health items permits an efficient way of gathering general perceptions of health. These items provide useful summary information about health and are predictive of health care utilization and subsequent mortality.
Analyses of 10 self-reported global health items obtained from an internet survey as part of the Patient-Reported Outcome Measurement Information System (PROMIS) project. We derived summary scores from the global health items. We estimated the associations of the summary scores with the EQ-5D index score and the PROMIS physical function, pain, fatigue, emotional distress, and social health domain scores.
Exploratory and confirmatory factor analyses supported a two-factor model. Global physical health (GPH; 4 items on overall physical health, physical function, pain, and fatigue) and global mental health (GMH; 4 items on quality of life, mental health, satisfaction with social activities, and emotional problems) scales were created. The scales had internal consistency reliability coefficients of 0.81 and 0.86, respectively. GPH correlated more strongly with the EQ-5D than did GMH (r = 0.76 vs. 0.59). GPH correlated most strongly with pain impact (r = −0.75) whereas GMH correlated most strongly with depressive symptoms (r = −0.71).
Two dimensions representing physical and mental health underlie the global health items in PROMIS. These global health scales can be used to efficiently summarize physical and mental health in patient-reported outcome studies.
Global health; PROMIS; Item response theory; EQ-5D
The Patient-Reported Outcomes Measurement Information System (PROMIS) aims to develop self-reported item banks for clinical research. The PROMIS pediatrics (aged 8–17) project focuses on the development of item banks across several health domains (physical function, pain, fatigue, emotional distress, social role relationships, and asthma symptoms). The psychometric properties of the anxiety and depressive symptom item banks are described.
Participants (n = 1,529) were recruited in public school settings, hospital-based outpatient and subspecialty pediatrics clinics. The anxiety (k = 18) and depressive symptoms (k = 21) items were split between two test administration forms. Hierarchical confirmatory factor-analytic models (CFA) were conducted to evaluate scale dimensionality and local dependence. IRT analyses were then used to finalize item banks and short forms.
CFA results confirmed that anxiety and depressive symptoms are separate constructs and indicative of negative affect. Items with local dependence and DIF were removed resulting in 15 anxiety and 14 depressive symptoms items. The psychometric differences between short forms and simulated computer adaptive tests are presented.
PROMIS pediatric item banks were developed to provide efficient assessment of health-related quality of life domains. This sample provides initial calibrations of anxiety and depressive symptoms item banks and creates PROMIS pediatric instruments, version 1.0.
PROMIS; Anxiety; Depressive symptoms; HRQOL; PRO; Scale development; Surveys; Pediatrics
An aim of the National Institutes of Health (NIH) Patient Reported Outcomes Measurement Information System (PROMIS) initiative is to develop item banks and computerized adaptive tests (CAT) that are applicable across a wide variety of chronic disorders. The PROMIS Pediatric Cooperative Group has concentrated on the development of pediatric self-report item banks for ages 8-17 years. The objective of the present study is to describe the Item Response Theory (IRT) analysis of the NIH PROMIS pediatric pain item bank and the measurement properties of the new unidimensional PROMIS Pediatric Pain Interference Scale. Test forms containing pediatric pain items were completed by a total of 3,048 respondents. IRT analyses regarding scale dimensionality, item local dependence, and differential item functioning were conducted. A pain item pool was developed to yield scores on a T-score scale with a mean of 50 and standard deviation of 10. The recommended 8-item unidimensional short form for the PROMIS Pediatric Pain Interference Scale contains the item set which provides the maximum test information at the mean (50) on the T-score metric. A simulated CAT was computed that provides the most information at five possible score locations (30, 40, 50, 60, and 70 on the T-score metric).
Pain; pediatrics; PROMIS; pain interference; Item Response Theory
One of the PROMIS (Patient-Reported Outcome Measurement Information System) network's primary goals is the development of a comprehensive item bank for patient-reported outcomes of chronic diseases. For its first set of item banks, PROMIS chose to focus on pain, fatigue, emotional distress, physical function, and social function. An essential step for the development of an item pool is the identification, evaluation, and revision of extant questionnaire items for the core item pool. In this work, we also describe the systematic process wherein items are classified for subsequent statistical processing by the PROMIS investigators. Six phases of item development are documented: identification of extant items, item classification and selection, item review and revision, focus group input on domain coverage, cognitive interviews with individual items, and final revision before field testing. Identification of items refers to the systematic search for existing items in currently available scales. Expert item review and revision was conducted by trained professionals who reviewed the wording of each item and revised as appropriate for conventions adopted by the PROMIS network. Focus groups were used to confirm domain definitions and to identify new areas of item development for future PROMIS item banks. Cognitive interviews were used to examine individual items. Items successfully screened through this process were sent to field testing and will be subjected to innovative scale construction procedures.
patient-reported outcomes; cognitive interviews; qualitative methods; questionnaire development
The measurement of pain behavior is a key component of the assessment of persons with chronic pain; however few self-reported pain behavior instruments have been developed. We developed a pain behavior item bank as part of the Patient Reported Outcome Measurement Information System (PROMIS). For the Wave I testing, because of the large number of PROMIS items, a complex sampling approach was used where participants were randomly assigned to either respond to two full item banks or to multiple 7-item blocks of items. A web-based survey was designed and completed by 15,528 members of the general population and 967 individuals with different types of chronic pain. Item response theory (IRT) analysis models were used to evaluate item characteristics and to scale both items and individuals on the pain behavior domain. The pain behavior item bank demonstrated good fit to a unidimensional model (Comparative Fit Index = 0.94). Several iterations of IRT analyses resulted in a final 39 item pain behavior bank, and different IRT models were fit to the total sample and to those participants who experienced some pain. The results indicated that these items demonstrated good coverage of the pain behavior construct. Pain behavior scores were strongly related to pain intensity and moderately related to self-reported general health status. Mean pain behavior scores varied significantly by groups based on pain severity and general health status. The PROMIS pain behavior item bank can be used to develop static short-form and dynamic measures of pain behavior for clinical studies.
Pain behavior; item response theory analysis; patient reported outcomes; psychometric analysis; chronic pain; item banks
Recently, the National Institutes of Health Roadmap for Medical Research initiative led a large-scale effort to develop the Patient-Reported Outcomes Measurement Information System (PROMIS). PROMIS’s main goal was to develop a set of item banks and computerized adaptive tests for the clinical research community. Asthma, as the most common chronic childhood disease, was chosen for a disease-specific pediatric item bank.
The primary objective of this research is to present the details of the psychometric analyses of the asthma domain items.
Item response theory (IRT) analyses were conducted on a 34–asthma item bank. Test forms containing PROMIS Pediatric Asthma domain items were completed by 622 children ages 8 to 12. Items were subsequently evaluated for local dependence, scale dimensionality, and differential item functioning.
A 17-item pool and an 8-item short form for the new PROMIS Pediatric Asthma Impact Scale (PAIS) were generated using IRT. The recommended 8-item short form contains the item set that provides the maximum test information at the mean (50) on the T-score metric. If more score precision is required, the complete 17-item pool is recommended and may be used in toto or as the basis of a computerized adaptive test (CAT). A shorter test form can also be created and scored on the same scale.
The present study presents the PROMIS Pediatric Asthma Impact Scale (PAIS) developed with IRT, and provides the initial calibration data for the items.
asthma; pediatric; quality-of-life; item response theory
In 2002, the NIH launched the ‘Roadmap for Medical Research’. The Patient-Reported Outcomes Measurement Information System (PROMIS®) is one of the Roadmap’s key aspects. To create the next generation of patient-reported outcome measures, PROMIS utilizes item response theory (IRT) and computerized adaptive testing. In 2009, the NIH funded the second wave of PROMIS studies (PROMIS II). PROMIS II studies continue PROMIS’s agenda, but also include new features, including longitudinal analyses and more sociodemographically diverse samples. PROMIS II also includes increased emphasis on pediatric populations and evaluation of PROMIS item banks for clinical research and population science. These aspects bring new psychometric challenges. To address this, investigators associated with PROMIS gathered at the Third Psychometric Summit in September 2010 to identify, describe and discuss pressing psychometric issues and new developments in the field, as well as make analytic recommendations for PROMIS. The summit addressed five general themes: linking, differential item functioning, dimensionality, IRT models for longitudinal applications and new IRT software. In this article, we review the discussions and presentations that occurred at the Third PROMIS Psychometric Summit.
computerized adaptive testing; dimensionality; factor analysis; item response theory; patient-reported outcomes; PROMIS; psychometrics; structural equation modeling
The NIH Patient-Reported Outcomes Measurement Information System (PROMIS) Roadmap initiative is a cooperative group program of research designed to develop, evaluate, and standardize item banks to measure patient-reported outcomes relevant across medical conditions. For adults, 11 domains have been developed in physical, mental, and social health.
The objective of the current study was to assess feasibility and construct validity of PROMIS item banks versus legacy measures in a observational study in systemic sclerosis (SSc).
Patients with SSc in a single academic center completed computerized adaptive technology (CAT) administered PROMIS item banks during the clinic visit and legacy domains (using paper-and-pencil). The construct validity of PROMIS items was evaluated by examining correlations with corresponding legacy measures using multitrait-multimethod analysis.
Participants consisted of 143 SSc patients with an average age of 51.5 years; 71% were female and 68% were Caucasian. The average number of items completed for each CAT-administered item bank ranged from 5 to 8 (69 CAT items per patient), and the average time to complete each CAT-administered item bank ranged from 48 seconds to 1.9 minutes per patient (average time= 11.9 minutes/per patient for 11 banks). All correlations between PROMIS domains and respective legacy measures were large and in the hypothesized direction (ranged from .61 to .82).
Our study supports the construct validity of the CAT-administered PROMIS item banks and shows that they can be administered successfully in a clinic with support staff. Future studies should assess the feasibility of PROMIS item banks in a busy clinical practice
Systemic sclerosis; PROMIS; health-related quality of life; construct validity
Childhood obesity is a growing health concern known to adversely affect quality of life in children and adolescents. The Patient Reported Outcomes Measurement Information System (PROMIS) pediatric measures were developed to capture child self-reports across a variety of health conditions experienced by children and adolescents. The purpose of this study is to begin the process of validation of the PROMIS pediatric measures in children and adolescents affected by obesity.
The pediatric PROMIS instruments were administered to 138 children and adolescents in a cross-sectional study of patient reported outcomes in children aged 8–17 years with age-adjusted body mass index (BMI) greater than the 85th percentile in a design to establish known-group validity. The children completed the depressive symptoms, anxiety, anger, peer relationships, pain interference, fatigue, upper extremity, and mobility PROMIS domains utilizing a computer interface. PROMIS domains and individual items were administered in random order and included a total of 95 items. Patient responses were compared between patients with BMI 85 to < 99th percentile versus ≥ 99th percentile.
136 participants were recruited and had all necessary clinical data for analysis. Of the 136 participants, 5% ended the survey early resulting in missing domain scores at the end of survey administration. In multivariate analysis, patients with BMI ≥ 99th percentile had worse scores for depressive symptoms, anger, fatigue, and mobility (p < 0.05). Parent-reported exercise was associated with better scores for depressive symptoms, anxiety, and fatigue (p < 0.05).
Children and adolescents ranging from overweight to severely obese can complete multiple PROMIS pediatric measures using a computer interface in the outpatient setting. In the 5% with missing domain scores, the missing scores were consistently found in the domains administered last, suggesting the length of the assessment is important. The differences in domain scores found in this study are consistent with previous reports investigating the quality of life in children and adolescents with obesity. We show that the PROMIS instrument represents a feasible and potentially valuable instrument for the future study of the effect of pediatric obesity on quality of life.
Quality of life; Obesity; Patient Reported Outcomes Measurement Information System (PROMIS); Child; Depression
The Health Assessment Questionnaire Disability Index (HAQ) and the SF-36 PF-10, among other instruments, yield sensitive and valid Disability (Physical Function) endpoints. Modern techniques, such as Item Response Theory (IRT), now enable development of more precise instruments using improved items. The NIH Patient Reported Outcomes Measurement Information System (PROMIS) is charged with developing improved IRT-based tools. We compared the ability to detect change in physical function using original (Legacy) instruments with Item-Improved and PROMIS IRT-based instruments.
We studied two Legacy (original) Physical Function/Disability instruments (HAQ, PF-10), their item-improved derivatives (Item-Improved HAQ and PF-10), and the IRT-based PROMIS Physical Function 10- (PROMIS PF 10) and 20-item (PROMIS PF 20) instruments. We compared sensitivity to detect 12-month changes in physical function in 451 rheumatoid arthritis (RA) patients and assessed relative responsiveness using P-values, effect sizes (ES), and sample size requirements.
The study sample was 81% female, 87% Caucasian, 65 years of age, had 14 years of education, and had moderate baseline disability. All instruments were sensitive to detecting change (< 0.05) in physical function over one year. The most responsive instruments in these patients were the Item-Improved HAQ and the PROMIS PF 20. IRT-improved instruments could detect a 1.2% difference with 80% power, while reference instruments could detect only a 2.3% difference (P < 0.01). The best IRT-based instruments required only one-quarter of the sample sizes of the Legacy (PF-10) comparator (95 versus 427). The HAQ outperformed the PF-10 in more impaired populations; the reverse was true in more normal populations. Considering especially the range of severity measured, the PROMIS PF 20 appears the most responsive instrument.
Physical Function scales using item improved or IRT-based items can result in greater responsiveness and precision across a broader range of physical function. This can reduce sample size requirements and thus study costs.
Short-form patient-reported outcome measures are popular because they minimize patient burden. We assessed the efficiency of static short forms and computer adaptive testing (CAT) using data from the Patient-Reported Outcomes Measurement Information System (PROMIS) project.
We evaluated the 28-item PROMIS depressive symptoms bank. We used post hoc simulations based on the PROMIS calibration sample to compare several short-form selection strategies and the PROMIS CAT to the total item bank score.
Compared with full-bank scores, all short forms and CAT produced highly correlated scores, but CAT outperformed each static short form in almost all criteria. However, short-form selection strategies performed only marginally worse than CAT. The performance gap observed in static forms was reduced by using a two-stage branching test format.
Using several polytomous items in a calibrated unidimensional bank to measure depressive symptoms yielded a CAT that provided marginally superior efficiency compared to static short forms. The efficiency of a two-stage semi-adaptive testing strategy was so close to CAT that it warrants further consideration and study.
Computer adaptive testing; PROMIS; Item response theory; Short form; Two-stage testing
Patient-reported physical function is an established outcome domain in clinical studies in rheumatology. To overcome the limitations of the current generation of questionnaires, the Patient-Reported Outcomes Measurement Information System (PROMIS®) project in the USA has developed calibrated item banks for measuring several domains of health status in people with a wide range of chronic diseases. The aim of this study was to translate and cross-culturally adapt the PROMIS physical function item bank to the Dutch language and to pretest it in a sample of patients with arthritis.
The items of the PROMIS physical function item bank were translated using rigorous forward-backward protocols and the translated version was subsequently cognitively pretested in a sample of Dutch patients with rheumatoid arthritis.
Few issues were encountered in the forward-backward translation. Only 5 of the 124 items to be translated had to be rewritten because of culturally inappropriate content. Subsequent pretesting showed that overall, questions of the Dutch version were understood as they were intended, while only one item required rewriting.
Results suggest that the translated version of the PROMIS physical function item bank is semantically and conceptually equivalent to the original. Future work will be directed at creating a Dutch-Flemish final version of the item bank to be used in research with Dutch speaking populations.
Patient-reported outcomes (PROs) are essential when evaluating many new treatments in health care, yet current measures have been limited by a lack of precision, standardization and comparability of scores across studies and diseases. The Patient-Reported Outcomes Measurement Information System (PROMIS™) provides item banks that offer the potential for PRO measurement that is efficient (minimizes item number without compromising reliability) flexible (enables optional use of interchangeable items), and precise (has minimal error in estimate) measurement of commonly-studied PROs. We report results from the first large-scale testing of PROMIS items.
Study Design and Setting
Fourteen item pools were tested in the U.S. general population and clinical groups using an online panel and clinic recruitment. A scale-setting sub-sample was created reflecting demographics proportional to the 2000 U.S. census.
Using item response theory (graded response model), 11 item banks were calibrated on a sample of 21,133, measuring components of self-reported physical, mental and social health, along with a 10-item global health scale. Short forms from each bank were developed and compared to the overall bank as well as with other well-validated and widely accepted (“legacy”) measures. All item banks demonstrated good reliability across the majority of the score distributions. Construct validity was supported by moderate to strong correlations with legacy measures.
PROMIS item banks and their short forms provide evidence they are reliable and precise measures of generic symptoms and functional reports comparable to legacy instruments. Further testing will continue to validate and test PROMIS items and banks in diverse clinical populations.
Outcome Measures; Quality of life; Chronic disease
Pediatric self-report should be considered the standard for measuring patient reported outcomes (PRO) among children. However, circumstances exist when the child is too young, cognitively impaired, or too ill to complete a PRO instrument and a proxy-report is needed. This paper describes the development process including the proxy cognitive interviews and large-field-test survey methods and sample characteristics employed to produce item parameters for the Patient Reported Outcomes Measurement Information System (PROMIS) pediatric proxy-report item banks.
The PROMIS pediatric self-report items were converted into proxy-report items before undergoing cognitive interviews. These items covered six domains (physical function, emotional distress, social peer relationships, fatigue, pain interference, and asthma impact). Caregivers (n = 25) of children ages of 5 and 17 years provided qualitative feedback on proxy-report items to assess any major issues with these items. From May 2008 to March 2009, the large-scale survey enrolled children ages 8-17 years to complete the self-report version and caregivers to complete the proxy-report version of the survey (n = 1548 dyads). Caregivers of children ages 5 to 7 years completed the proxy report survey (n = 432). In addition, caregivers completed other proxy instruments, PedsQL™ 4.0 Generic Core Scales Parent Proxy-Report version, PedsQL™ Asthma Module Parent Proxy-Report version, and KIDSCREEN Parent-Proxy-52.
Item content was well understood by proxies and did not require item revisions but some proxies clearly noted that determining an answer on behalf of their child was difficult for some items. Dyads and caregivers of children ages 5-17 years old were enrolled in the large-scale testing. The majority were female (85%), married (70%), Caucasian (64%) and had at least a high school education (94%). Approximately 50% had children with a chronic health condition, primarily asthma, which was diagnosed or treated within 6 months prior to the
interview. The PROMIS proxy sample scored similar or better on the other proxy instruments compared to normative samples.
The initial calibration data was provided by a diverse set of caregivers of children with a variety of common chronic illnesses and racial/ethnic backgrounds. The PROMIS pediatric proxy-report item banks include physical function (mobility n = 23; upper extremity n = 29), emotional distress (anxiety n = 15; depressive symptoms n = 14; anger n = 5), social peer relationships (n = 15), fatigue (n = 34), pain interference (n = 13), and asthma impact (n = 17).
PROMIS; HRQOL; PRO; Scale development; Parent Proxy; Pediatrics
The authors report on the development and calibration of item banks for depression, anxiety, and anger as part of the Patient-Reported Outcomes Measurement Information System (PROMIS®). Comprehensive literature searches yielded an initial bank of 1,404 items from 305 instruments. After qualitative item analysis (including focus groups and cognitive interviewing), 168 items (56 for each construct) were written in a first person, past tense format with a 7-day time frame and five response options reflecting frequency. The calibration sample included nearly 15,000 respondents. Final banks of 28, 29, and 29 items were calibrated for depression, anxiety, and anger, respectively, using item response theory. Test information curves showed that the PROMIS item banks provided more information than conventional measures in a range of severity from approximately −1 to +3 standard deviations (with higher scores indicating greater distress). Short forms consisting of seven to eight items provided information comparable to legacy measures containing more items.
depression; anxiety; anger; item response theory; measurement
We provide detailed instructions for analyzing patient-reported outcome (PRO) data collected with an existing (legacy) instrument so that scores can be calibrated to the PRO Measurement Information System (PROMIS) metric. This calibration facilitates migration to computerized adaptive test (CAT) PROMIS data collection, while facilitating research using historical legacy data alongside new PROMIS data.
A cross-sectional convenience sample (n = 2,178) from the Universities of Washington and Alabama at Birmingham HIV clinics completed the PROMIS short form and Patient Health Questionnaire (PHQ-9) depression symptom measures between August 2008 and December 2009. We calibrated the tests using item response theory. We compared measurement precision of the PHQ-9, the PROMIS short form, and simulated PROMIS CAT.
Dimensionality analyses confirmed the PHQ-9 could be calibrated to the PROMIS metric. We provide code used to score the PHQ-9 on the PROMIS metric. The mean standard errors of measurement were 0.49 for the PHQ-9, 0.35 for the PROMIS short form, and 0.37, 0.28, and 0.27 for 3-, 8-, and 9-item-simulated CATs.
The strategy described here facilitated migration from a fixed-format legacy scale to PROMIS CAT administration and may be useful in other settings.
Calibration; Computerized adaptive testing; Depression; Item banks; Item response theory; PROMIS
The Patient-Reported Outcomes Measurement Information System (PROMIS) allows assessment of the impact of chronic conditions on health-related quality of life (HRQL) across diseases. We report on the HRQL impact of individual and comorbid conditions as well as conditions that are described as limiting activity.
Study Design and Setting
Data were collected through online and clinic recruitment as part of the PROMIS item calibration sample (n=21,133). Participants reported the presence or absence of 24 chronic health conditions and whether or not their activity was limited by each condition.
Across health status domains, the presence of a chronic condition was associated with poorer scores than those without a diagnosis, particularly for those individuals who reported their condition was disabling. The magnitude of detriment in HRQL was more pronounced for individuals with two or more chronic conditions and could not be explained by sociodemographic factors. Patterns of HRQL deficits varied across disease and comorbidity status.
The impact of chronic conditions, particularly when experienced with comorbid disease, is associated with detriments in HRQL. The negative impact on HRQL varies across symptoms and functional areas within a given condition.
Chronic disease; Comorbidity; Quality of life; Outcome measures
The National Institutes of Health's Patient-Reported Outcomes Measurement Information System (PROMIS) has developed several scales measuring symptoms and function for use by the clinical research community. One advantage of PROMIS is the ability to link other scales to the PROMIS metric.
The objectives of this research are to provide evidence of validity for one of the PROMIS measures, the Pediatric Asthma Impact Scale (PAIS), and to link the PedsQL™ Asthma Symptoms Scale with the metric of the PAIS.
Descriptive statistics were computed describing the relationships among scores on the PAIS, the PedsQL™ Asthma Symptoms, Treatment, Worry, and Communication Scales, and the DISABKIDS Asthma Impact and Worry Scales for approximately 300 children ages 8–17. A novel linkage method based on item response theory (IRT), calibrated projection, was used to link scores on the PedsQL™ Asthma Symptoms Scale with the metric of the PAIS.
The PAIS exhibited strong convergent validity with the PedsQL™ Asthma Symptoms Scale, and less strong relations with the other five scales. The linkage system uses scores on the PedsQL™ Asthma Symptoms Scale to produce relatively precise score estimates on the metric of the PAIS.
Results of this study provide evidence for the validity of the PAIS, and a method to use scores on the PedsQL™ Asthma Symptoms Scale to estimate scores on the metric of the PAIS, in partial fulfillment of the PROMIS goal to provide a lingua franca for health-related quality of life.
PROMIS; HRQOL; PRO; Scale development; Pediatrics; Asthma
This paper describes a large-scale administration of the Patient-Reported Outcomes Measurement Information System (PROMIS) pediatric items to evaluate measurement characteristics.
Each child completed one of seven test forms containing items from a pool of 293 PROMIS items and four legacy scales. PROMIS items covered six domains (physical function, emotional distress, social role relationship, fatigue, pain, and asthma).
From January 2007 to May 2008, 4,129 children aged 8–17 were enrolled. The sample was 51% female, 55% aged 8–12, 42% minority race and 17% were Hispanic ethnicity. Approximately, 35% of the children participating in the survey consulted a clinician for a chronic illness diagnosis or treatment within 6 months prior to study enrollment.
The final PROMIS pediatric item banks include physical function (n = 52 items), emotional distress (n = 35 items), social role relationships (n = 15 items), fatigue (n = 34 items), pain (n = 13 items), and asthma (n = 17 items). The initial calibration data were provided by a diverse set of children with varying health states (e.g., children with a variety of common chronic illnesses) and racial/ethnic backgrounds.
PROMIS; HRQOL; PRO; Scale development; Surveys; Pediatrics
Many patient-reported outcome (PRO) instruments used in systemic sclerosis (SSc) trials are limited by lack of validation, licensing fees, and complicated scoring systems. We assessed the construct validity for discriminative purposes of two new PRO instruments-the Patient Reported Outcomes Measurement Information System 29-item Health Profile (PROMIS-29) and the Functional Assessment of Chronic Illness Therapy-Dyspnea short form (FACIT-Dyspnea), measuring health status and dyspnea in SSc patients.
Seventy-three patients participated in a cross-sectional study at a tertiary SSc program. PROMIS-29, FACIT-Dyspnea, and legacy PRO instruments used in clinical trials (Medical Research Council [MRC] Dyspnea Score, St. George’s Respiratory Questionnaire [SGRQ], Health Assessment Questionnaire-Disability Index [HAQ-DI] and Short-Form 36 [SF-36]) were administered. Composite severity scores using an adaptation of the Medsger Disease Severity Index were generated using clinical, diagnostic and laboratory information. PROMIS-29 and FACIT-Dyspnea scores were compared with legacy PRO measures and composite severity scores.
The mean (range) patient age (84% women) was 51 years (22–72). The mean (range) SSc disease duration from the onset of the first non-Raynaud symptom was 7 years (0–45). Spearman correlation coefficients across FACIT-Dyspnea and PROMIS Physical Functioning scores with legacy PRO instruments were generally high (range=0.64–0.86); those between PROMIS and FACIT-Dyspnea with composite disease severity scores were more modest, but statistically significant (range=0.33–0.48, p<0.01).
PROMIS-29 and FACIT-Dyspnea are valid instruments to measure the health status of SSc patients. PROMIS-29 and FACIT-Dyspnea may be preferable to legacy instruments because they are freely available in multiple languages, and simple to administer, score and interpret.
The National Institutes of Health (NIH) Patient-Reported Outcomes Measurement Information System (PROMIS) Roadmap initiative (www.nihpromis.org) is a 5-year cooperative group program of research designed to develop, validate, and standardize item banks to measure patient-reported outcomes (PROs) relevant across common medical conditions. In this article, we will summarize the organization and scientific activity of the PROMIS network during its first 2 years.
The network consists of 6 primary research sites (PRSs), a statistical coordinating center (SCC), and NIH research scientists. Governed by a steering committee, the network is organized into functional subcommittees and working groups. In the first year, we created an item library and activated 3 interacting protocols: Domain Mapping, Archival Data Analysis, and Qualitative Item Review (QIR). In the second year, we developed and initiated testing of item banks covering 5 broad domains of self-reported health.
The domain mapping process is built on the World Health Organization (WHO) framework of physical, mental, and social health. From this framework, pain, fatigue, emotional distress, physical functioning, social role participation, and global health perceptions were selected for the first wave of testing. Item response theory (IRT)-based analysis of 11 large datasets supplemented and informed item-level qualitative review of nearly 7000 items from available PRO measures in the item library. Items were selected for rewriting or creation with further detailed review before the first round of testing in the general population and target patient populations.
The NIH PROMIS network derived a consensus-based framework for self-reported health, systematically reviewed available instruments and datasets that address the initial PROMIS domains. Qualitative item research led to the first wave of network testing which began in the second year.
outcomes; patient reported outcomes; quality of life; health-related quality of life
We report on the development of short forms from the Patient-Reported Outcomes Measurement Information System (PROMIS™) Sleep Disturbance (SD) and Sleep-Related Impairment (SRI) item banks. Results from post-hoc computerized adaptive testing (CAT) simulations, item discrimination parameters, item means, and clinical judgment were used to select the best-performing 8 items for SD and SRI. The final 8-item short forms provided less test information than the corresponding full banks, but correlated strongly with the longer forms. The short forms had greater measurement precision than the Pittsburgh Sleep Quality Index (PSQI) and the Epworth Sleepiness Scale (ESS) as indicated by larger test information values across the continuum of severity, despite having fewer total items, a major advantage for both research and clinical settings.
The evaluation of patient-reported outcomes (PROs) in health care has seen greater use in recent years, and methods to improve the reliability and validity of PRO instruments are advancing. This paper discusses the cognitive interviewing procedures employed by the Patient Reported Outcomes Measurement Information System (PROMIS) pediatrics group for the purpose of developing a dynamic, electronic item bank for field testing with children and adolescents using novel computer technology. The primary objective of this study was to conduct cognitive interviews with children and adolescents to gain feedback on items measuring physical functioning, emotional health, social health, fatigue, pain, and asthma-specific symptoms.
A total of 88 cognitive interviews were conducted with 77 children and adolescents across two sites on 318 items. From this initial item bank, 25 items were deleted and 35 were revised and underwent a second round of cognitive interviews. A total of 293 items were retained for field testing.
Children as young as 8 years of age were able to comprehend the majority of items, response options, directions, recall period, and identify problems with language that was difficult for them to understand. Cognitive interviews indicated issues with item comprehension on several items which led to alternative wording for these items.
Children ages 8–17 years were able to comprehend most item stems and response options in the present study. Field testing with the resulting items and response options is presently being conducted as part of the PROMIS Pediatric Item Bank development process.
There is growing interest in the use of item response theory (IRT) for creation of measures of health-related quality of life (HRQOL). A first step in IRT modeling is development of item banks. Our aim is to describe the value of including librarians, and to describe processes used by librarians, in the creation of such banks.
Working collaboratively with PROMIS researchers at the University of Pittsburgh, a team of librarians designed and implemented comprehensive literature searches in a selected set of information resources, for the purpose of identifying existing measures of patient-reported emotional distress.
A step-by-step search protocol developed by librarians produced a set of 525 key words and controlled vocabulary terms for use in search statements in 3 bibliographic databases. These searches produced 6169 literature citations, allowing investigators to add 444 measurement scales to their item banks.
Inclusion of librarians on the Pittsburgh PROMIS research team allowed investigators to create large initial item banks, increasing the likelihood that the banks would attain high measurement precision during subsequent psychometric analyses. In addition, a comprehensive literature search protocol was developed that can now serve as a guide for other investigators in the creation of IRT item banks.
databases as topic; outcome assessment(health care); librarians; interdisciplinary communication
Our aim was to assess the content validity of the Patient-Reported Outcomes Measurement Information System (PROMIS) social health item banks by comparing a prespecified conceptual model with concepts that focus-group participants identified as important social-health-related outcomes. These data will inform the process of improving health-related quality-of-life measures.
Twenty-five patients with a range of social limitations due to chronic health conditions were recruited at two sites; four focus groups were conducted. Raters independently classified participants' statements using a hierarchical, nested schema that included health-related outcomes, role performance, role satisfaction, family/friends, work, and leisure.
Key themes that emerged were fulfilling both family and work responsibilities and the distinction between activities done out of responsibility versus enjoyment. Although focus-group participants identified volunteerism and pet ownership as important social-health-related concepts, these were not in our original conceptual model. The concept of satisfaction was often found to overlap with the concept of performance.
Our conceptual model appeared mostly comprehensive but is being further refined to more appropriately (a) distinguish between responsibilities versus discretionary activities, and (b) situate the outcome of satisfaction as it relates to impairment in social and other domains of health.