|Home | About | Journals | Submit | Contact Us | Français|
Many studies have relied on administrative data to identify patients with heart failure (HF).
To systematically review studies that assessed the validity of administrative data for recording HF.
English peer-reviewed articles (1990 to 2008) validating International Classification of Diseases (ICD)-8, -9 and -10 codes from administrative data were included. An expert panel determined which ICD codes should be included to define HF. Frequencies of ICD codes for HF were calculated using up to the 16 diagnostic coding fields available in the Canadian hospital discharge abstract during fiscal years 2000/2001 and 2005/2006.
Between 1992 and 2008, more than 70 different ICD codes for defining HF were used in 25 published studies. Twenty-one studies validated hospital discharge abstract data; three studies validated physician claims and two studies validated ambulatory care data. Eighteen studies reported sensitivity (range 29% to 89%). Specificity and negative predictive value were greater than 70% across 17 studies. Nineteen studies reported positive predictive values (range 12% to 100%). Ten studies reported kappa values (range 0.39 to 0.84).
For Canadian hospital discharge data, ICD-9 and -10 codes 428 and I50 identified HF in 5.50% and 4.80% of discharge records, respectively. Additional HF-related ICD-9 and -10 codes did not impact HF prevalence.
The ICD-9 and -10 codes 428 and I50 were the most commonly used to define HF in hospital discharge data. Validity of administrative data in recording HF varied across the studies and data sources that were assessed.
De nombreuses études se fient à des données administratives pour repérer les patients souffrant d’insuffisance cardiaque (IC).
Procéder à une analyse systématique des études qui ont évalué la validité des données administratives pour documenter l’IC.
Des articles révisés par des pairs rédigés en anglais (1990 à 2008) validant les codes de données administratives de la Classification internationale des maladies (CIM)-8, 9 et 10 étaient inclus dans l’étude. Un comité d’experts a déterminé les codes du CIM qui devraient être inclus pour définir l’IC. La fréquence des codes du CIM pour l’IC était calculée au moyen de jusqu’à 16 champs de codage diagnostique inscrits dans les données canadiennes sur les congés des patients pendant les exercices financiers 2000–2001 et 2005–2006.
Entre 1992 et 2008, plus de 70 codes de CIM différents pour définir l’IC ont été utilisés dans 25 études publiées. Vingt et une études ont validé les données canadiennes sur les congés des patients, trois études, les réclamations des médecins et deux études, les données sur les soins ambulatoires. Dix-huit études ont fait état de la sensibilité (plage de 29 % à 89 %). La spécificité et la valeur prédictive négative étaient supérieures à 70 % dans 17 études. Dix-neuf études ont indiqué une valeur prédictive positive (plage de 12 % à 100 %). Dix études ont précisé les valeurs kappa (plage de 0,39 à 0,84).
Pour ce qui est des données canadiennes sur les congés des patients, les codes 428 et I50 du CIM-9 et du CIM-10 ont permis de recenser une IC dans 5,50 % et 4,80 % des dossiers de congé, respectivement. Les autres codes liés à l’IC du CIM-9 et du CIM-10 n’avaient pas de répercussions sur la prévalence d’IC.
Les codes 428 et I50 du CIM-9 et du CIM-10 étaient les plus utilisés pour définir l’IC dans les données de congé hospitalier. La validité des données administratives pour documenter l’IC variait selon les études et les sources de données évaluées.
Heart failure (HF) has been widely studied because of its high mortality and morbidity rates (1–4). Studies on HF have relied on multiple data sources such as surveys, disease registries, hospital charts and administrative data. Of these sources, administrative data have been increasingly used for health services utilization and outcome evaluation.
Administrative data result from implementing health care delivery, enrolling members into health insurance plans and reimbursing health care providers for services (5). The types of administrative data depend on the health care system and the funding structure. For example, Canada has a nationally funded health care system that collects hospital discharge data at a national level, while insurance registry, physician services and emergency room visit information is collected at a provincial level. Although these data are not intended for research, they are widely used in surveillance and outcome studies. Therefore, the validity of the research results rely on the quality of the data. Data quality is critical for interpreting variation across studies from different geographical areas.
The purpose of the present study was to systematically review the literature pertaining to the validity of administrative data for recording HF and apply the International Classification of Diseases (ICD) HF definitions to Canadian hospital discharge abstract data to explore the prevalence of HF across ICD code definitions.
Only English language articles that compared the validity of ICD codes in administrative databases (eg, physician claims data, hospital discharge) with a reference standard (eg, chart review, registry, survey, case notes) were included. The literature search focused on articles published from 1990 to 2008. To identify peer-reviewed journal articles, both the Ovid MEDLINE database and EMBASE were searched. Because validation studies were not indexed consistently with standard MeSH terms, the following key search terms were used: ‘admin data’, ‘administrative data’, ‘population surveillance’, ‘heart failure’, ‘congestive heart failure’, ‘hospital data’, ‘physician claims’, ‘claims’, ‘hospital discharge’, ‘validation’ and ‘registries’. Next, Boolean operators were used to join search terms within the database search engines to locate articles. The reference lists from these articles were scanned to identify any additional articles that met the study criteria. Finally, using the articles identified, the authors were contacted by e-mail to identify any additional related articles. The articles selected from the citations and abstracts were reviewed by one investigator (SQ) to ensure they met the study criteria. When a decision could not be made regarding applicability, two other researchers (HQ and CB) were consulted to review the article for inclusion or exclusion. In cases for which it was unclear whether the publication used administrative data for validation, the article was excluded.
Studies were abstracted to review measures of validity including sensitivity, specificity, positive predictive value (PPV) and negative predictive value (NPV). When not reported, these measures were calculated using data within the article. For cases in which ICD codes were missing, an attempt was made to contact the corresponding authors for further details. Validity is an expression of the degree to which a parameter reflects what it is intended to measure (6). In validation studies, it is the ability of administrative databases to distinguish between diseased versus non-diseased. Sensitivity measures how well administrative data detect the presence of a disease for individuals who actually have the disease according to a reference standard, while specificity refers to how well the administrative data avoid the problem of falsely classifying individuals as diseased when they do not have the disease. Agreement was assessed by the kappa statistic, which was classified as poor (kappa less than 0.2), fair (kappa 0.2 to 0.39), moderate (kappa 0.4 to 0.59), good (kappa 0.60 to 0.79) and very good agreement (0.8 to 1.00) (7). In addition, the following descriptive data were abstracted from the studies: author, year of publication and data collection, study location, sample size, ICD version and codes, reference/‘gold’ standard and administrative data.
The review showed great variation in ICD codes for HF definition. To ensure that all HF-related ICD codes were captured, four physicians independently reviewed a list of ICD-9-Clinical Modification and ICD-10 codes that were extracted from HF validation studies. These physicians have extensive experience with using administrative data to conduct cardiology research. For each diagnostic code, the physicians decided whether the code should be used to identify patients with HF in administrative data, responding with “yes”, “no” or “unsure”. Based on these responses, a list of ICD codes was analyzed in the Canadian national hospital Discharge Abstract Database for fiscal years 2000/2001 and 2005/2006 to determine whether inclusion of these codes would change the frequency of documented HF. The Discharge Abstract Database contains up to 16 diagnosis fields for ICD-9 data in 2000/2001 and 25 diagnosis fields for ICD-10 data in 2005/2006. The province of Quebec was excluded from the 2005/2006 data because the province was using the ICD-9 catalogue at that time. The frequency of discharge records for each diagnosis code was calculated. Descriptive statistical analyses were performed using SAS version 9.1 (SAS Institute Inc, USA).
The literature search identified 2596 citations for review (Figure 1), with 47 citations related to the study objectives. Of 25 studies published between 1992 and 2008, nine were completed in Canada, 11 in the United States, and one each in Sweden, Portugal, the United Kingdom, Australia and Denmark. The sample size across the studies ranged from 126 to 58,816 patients (Table 1).
More than 70 ICD codes were used to define HF in the studies. Of these studies, 17 validated ICD-9 codes only, three validated ICD-10 codes only, one study examined ICD-8 codes, and four validated both ICD-9 and ICD-10 codes. Two studies (8,9) did not report the ICD codes used. ICD-8 and -9 code 428, and ICD-10 code I50 were commonly used across studies.
From 25 validation studies, 19 used the medical chart review as the reference standard (Table 2). The remaining six studies (10–15) relied on surveys, registries or recoded records. Methods for determining HF presence varied across studies, especially when chart review was used as the reference standard. Seven studies accepted HF diagnosis documented in the chart (12,16–21) and four studies used documented HF diagnosis or a combination of clinical judgement based on information in the chart (eg, test results, diagnostics) (22–25). Eight studies (15,25–31) ascertained HF presence based on diagnostic criteria used by scoring systems from Framingham, Carlson, Boston, the National Health and Nutrition Examination Survey (NHANES), the New York Heart Association (NYHA) or the European Society of Cardiology (ESC). Four studies (25–27,32) further classified HF according to levels of evidence such as definite or possible HF using the ESC or Boston criteria. Two studies compared the PPV against different clinical definitions of HF, and found that the PPV varied between criteria used (Lee et al , 94% Framingham versus 89% Carlson; McCullough et al , 56% NHANES versus 64% Framingham versus 83% NYHA).
Twenty-one studies validated hospital discharge data, three studies (9,14,21) examined physician claims data and two studies (8,21) examined ambulatory care data. One study (14) linked multiple administrative data sources (facility, physician claims and pharmacy).
Six studies (10,12,21,25,30,31) defined HF using the principal diagnosis or main diagnosis, or most responsible diagnosis only. Four studies (13,16,27,11) examined the difference between using the principal or major diagnosis versus secondary diagnoses. Information about the number of diagnosis fields or position of diagnosis in the coding field was not reported in the remaining studies.
Eighteen studies reported sensitivity values ranging from 29% to 89%, and PPV ranging from 12% to 100%. Specificity was higher than 70% across 14 studies and NPV was higher than 74%. Kappa was reported in 10 studies, ranging from 0.39 to 0.84, suggesting fair to very good agreement.
The results of the physician’s review on HF code are shown in Table 3. In 2000/2001, 180,055 (5.5%) of all discharges were coded with HF (ICD-9-Clinical Modification code 428) in any one of the 16 diagnosis fields, with 34% (n=60,723/180,055) of discharges in the first diagnosis field. In 2005/2006, there were 118,029 (4.8%) discharges with ICD-10 code I50 in any of the coding fields, with 36% (n=42,490/118,029) of discharges in the primary diagnosis field. Adding other HF-related ICD codes to 428 or I50 did not change the proportion significantly (from 5.50% to 5.53% for ICD-9 data, and from 4.80% to 4.90% for ICD-10 data).
Our review demonstrated high variation in the validity of administrative data for recording HF. The sensitivity and PPV ranged widely, but specificity and NPV were substantially high across studies. This suggests that the validity of administrative data should be considered before analyzing and interpreting results. Differences in validity can be explained by factors related to ICD coding systems, organization, coder’s experience, reference standards and type of administrative database.
The validity of administrative data is extremely important given the increased use of these data for surveillance and outcome research. For population surveillance, those admitted to the hospital with HF will most likely represent severe cases of the disease in the late stages of the diagnosis; therefore, relying solely on the HF-defining ICD codes (428 or I50) in hospital discharge data will underestimate the prevalence of disease (32,33). For the majority of studies reviewed, hospital discharge data were the primary administrative data source used for validation. More research is needed to examine the validity of outpatient or community data for population surveillance.
More than 70 ICD-8, -9 and -10 codes were used to define HF across the validation studies. The European Cardiovascular Indicators Surveillance Set (EUROCISS) project (34) suggests that validation studies on HF should consider ICD codes for HF, hypertensive heart, other primary cardiomyopathies, alcoholic cardiomyopathy, secondary cardiomyopathy and chronic cor pulmonale. However, ICD-9 code 428 and ICD-10 code I50 accounted for the majority of HF cases in the Canadian hospital discharge data, and other codes had a smaller impact on the prevalence. However, the impact of each code on HF outcomes should be evaluated in the future.
We noticed that the prevalence of HF differed between ICD-9 and ICD-10 data. There are several potential explanations for this difference, possibly related to the learning curve from ICD-9 to the new coding system ICD-10, the declining incidence of HF in that period or improved HF outpatient management. Quan et al (19) evaluated the validity of HF and found little impact of the coding system change from ICD-9 to ICD-10 on HF validity in Canadian data (sensitivity 71.6% in ICD-9 and 68.6% in ICD-10; PPV 90.5% in ICD-9 and 90.2% in ICD-10).
We found that of nine Canadian studies, six demonstrated high PPV estimates (greater than 85%) and two studies reported PPV values of 51% and 65%. The variation can be partially explained by the variation in reference standards for validating HF across studies. Some studies used HF recorded in the patient’s chart as a reference standard, while other studies defined HF based on clinical or diagnostic evidence in the chart. Studies that relied on chart documentation of HF are likely to produce higher sensitivity values but lower PPVs than those that defined HF using a consistent clinical definition (Framingham, NYHA, Boston and ESC criteria). However, previous studies (35–37) have shown that clinical information is often missing in the patient chart, which can leave the diagnosis questionable. Furthermore, many countries have shown that diagnostic tests are not consistently conducted in all patients with suspected HF (38). Diagnosis of HF is challenging, and cases can be misclassified under other diseases because symptoms can be nonspecific to HF, particularly in the elderly (39).
Various administrative data definitions were used to define HF across studies, and this can explain much of the variability across the results. For example, many studies used a variety of ICD codes to define HF with ICD-9 and -10 codes 428 and I50 as the only codes being consistently applied across all studies. Some studies (10–12,16,25,29,30) defined HF using the principal/most responsible/major diagnosis alone. Using this method meant including patients with HF who were severe enough to be hospitalized or whose HF contributed to the length of stay most likely to generate high PPV estimates. Other studies (16–20,23,28) defined HF using all available diagnosis coding fields, resulting in inclusion of patients with HF as a comorbidity or complication. HF is likely to be under coded when HF is present as a comorbidity (ie, it does not contribute to length of stay significantly), which likely generates low PPV estimates.
The validity of administrative data in recording HF varies across studies; therefore, researchers or users of the data should consider data validity before addressing research questions.
The authors thank Dr Andy Wielgosz, Dr Jack Tu, Dr Douglas Lee and Dr Finlay McAlister for their expertise in reviewing the ICD codes. Special thanks to Dr Jennifer Pereira, Amanda Shane, Dr Sulan Dai, Dr Christina Bancej and Peter Walsh for their assistance in reviewing this manuscript. Dr Quan is supported by Alberta Innovates – Health Solutions.