Search tips
Search criteria 


Logo of injprevInjury PreventionVisit this articleVisit this journalSubmit a manuscriptReceive email alertsContact usBMJ
Inj Prev. 2007 December; 13(6): 422–425.
PMCID: PMC2598290

Completeness and accuracy of International Classification of Disease (ICD) external cause of injury codes in emergency department electronic data


The accuracy of external cause of injury codes (E codes) for work‐related and non‐work‐related injuries in Massachusetts emergency department data were evaluated. Medical records were reviewed and coded by a nosologist with expertise in E coding for a stratified random sample of 1000 probable work‐related (PWR) and 250 probable non‐work‐related (PNWR) cases. Cause of injury E codes were present for 98% of reviewed cases and accurate for 65% of PWR cases and 57% of PNWR cases. Place of occurrence E codes were present in less than 30% of cases. Broad cause of injury categories were accurate for about 85% of cases. Non‐specific categories (not elsewhere classified, not specified) accounted for 34% of broad category misclassifications. Among specified causes, machinery injuries were misclassified most often (39/60, 65%), predominantly as cut/pierce or struck by/against. E codes reliably identify the broad mechanism of injury, but inaccuracies and incompleteness suggest areas for training of hospital admissions staff, providers, and coders.

A knowledge of the external causes of injuries is critical to be able to adequately direct injury prevention efforts. In large administrative databases, this information can be obtained through the International Classification of Disease (ICD) external cause of injury codes (E codes). Many injury prevention experts and organizations have advocated submission of these codes for inclusion in acute care hospital databases.1 However, few studies have looked at the overall accuracy of E codes for injury surveillance.2,3,4,5

As a part of a feasibility study of using electronic emergency department (ED) data for occupational injury surveillance, we evaluated the completeness and accuracy of E codes for work‐related and non‐work‐related injuries reported to a statewide Emergency Department Injury Surveillance System (EDISS). In addition, we conducted a qualitative analysis of the most common E code errors in order to identify potential areas for training hospital personnel.


EDISS collected ED data from a sample, stratified by geography and patient volume, of 12 Massachusetts acute care hospitals. Electronic administrative data were collected on all non‐fatal, non‐admitted visits that were assigned an ICD Ninth Revision Clinical Modification (ICD‐9‐CM) diagnostic code (in any of six fields) in the range 800–999, and/or any ICD‐9‐CM E code in the range of E800–E999. The ICD‐9‐CM codes in EDISS were assigned by trained coders at the hospitals. The information reported included patient demographics, diagnostic codes, E codes, payer source, and the presence and content of an injury‐at‐work field. Probable work‐related (PWR) cases were defined as those visits with either workers' compensation designated as payer or an injury‐at‐work value of “yes”, or both. If neither of these criteria were met, the cases were considered probable non‐work‐related (PNWR).

A study sample of PWR and PNWR cases was drawn from patients aged 14 through 75 years reported to EDISS during the period 1 March 1999 through 29 February 2000 using stratified proportional sampling (PROC SURVEYSELECT; SAS V.8; SAS, Inc, Cary, North Carolina, USA). PWR cases were over‐sampled because the primary research interest was work‐related injuries. Statistical significance of differences between the study sample and EDISS were determined using a Wald statistic.

Chart reviews of sampled cases were performed by data abstractors, who collected diagnostic codes, E codes, and narrative information about the cause and place of occurrence of the injury from the medical records.

An expert nosologist validated the injury diagnosis and assigned validation E codes for the injury cases based on the narrative information from the medical record. If the cause (place) of injury was not documented in the medical record, the case was categorized as “Cause (place) not documented” and the appropriate code was assigned (E988.9, E849.9). Completeness of E coding was measured as the proportion of abstracted cases that had cause or place of occurrence E codes in the electronic EDISS database. Accuracy of E codes was measured as the proportion of E codes in the electronic EDISS database that exactly matched the validation E code (specific digits and number of digits). Broad cause of injury accuracy was assessed using 16 cause of injury categories in the ICD‐9‐CM Framework for Presenting Injury Mortality Data developed by the International Collaborative Effort on Injury Statistics.6


There were 108 328 visits reported to EDISS for the study period. Of these, 17 820 were identified as PWR injuries. Initially 1048 PWR cases and 262 PNWR cases were selected for the study sample; of these, 1250 (95.4%) had medical records available for review. The proportion of male PNWR cases in the study sample was significantly less than the proportion in EDISS (p = 0.046); otherwise, the age and gender distributions were similar.

A total of 1232 primary cause of injury E codes were available for the 1250 cases in the study sample (table 11).). Fifteen cases were judged not to be injuries by the expert coder. Approximately 60% of the cause of injury E codes assigned by hospital coders were four digit codes; the remainder were three digit codes. Transcription errors from medical records to the electronic records were less than 1%.

Table thumbnail
Table 1 Completeness and accuracy of cause of injury E codes, Emergency Department Injury Surveillance System (EDISS), Massachusetts, March 1999–February 2000

Coding for cause of injury (table 11)) was very complete (97–99%), as was documentation of cause in the medical record (944/1000 and 232/250, PWR and PNWR, respectively). Cause of injury E codes were correct for 64.6% of PWR cases and 57.2% of PNWR cases.

Coding for place of occurrence was much less complete (43% and 26%, PWR and PNWR cases, respectively), and documentation of place of occurrence in the medical record was also low (239/1000 and 125/250, PWR and PNWR cases, respectively; table 22).). Among records for which place of occurrence was documented and E codes were available, accuracy was high (row 2 of table 22),), but overall, less than 21% of cases had available and correct E codes for place of occurrence. The submission of place of occurrence E codes to EDISS varied considerably by hospital (0–91% complete).

Table thumbnail
Table 2 Completeness and accuracy of place of occurrence E codes, Emergency Department Injury Surveillance System (EDISS), Massachusetts, March 1999–February 2000

Agreement on 16 cause of injury categories6 was found for 84.3% of PWR cases and 84.8% of PNWR cases. Table 33 shows the distributions of the incorrect cause of injury categories by validation cause of injury categories. Machinery injuries had the highest proportion of misclassified codes (65%, 39 of 60). Injuries with a validated cause of “not specified” accounted for the greatest number of misclassifications (54%, 51 of 94). “Cut/pierce” was the cause of injury category that was most often assigned incorrectly (bottom row, table 33),), followed by “struck by/against” and “overexertion”.

Table thumbnail
Table 3 Distribution of incorrect hospital assigned cause of injury categories by validation cause of injury category, Emergency Department Injury Surveillance System (EDISS), Massachusetts, March 1999–February 2000

A review of the abstracted medical record narratives indicated that the frequent misclassification of cause of injury as “cut/pierce” resulted from coding based on the nature of injury rather than on the cause of injury. For example, “jagged lac(eration) L(eft) thumb on table saw …” and “… metal bar hit right ear, sustaining a lac(eration)…” were coded as “cut/pierce” rather than “machinery” and “struck/by”, respectively. Laceration injuries for which the cause was unspecified or ambiguous in the medical record were also often coded as “cut/pierce”. Similarly, “overexertion” was often coded as the cause of sprains, strains, twisting, and avulsion injuries caused by falls or being struck by/against an object or person. “Struck by/against” was erroneously coded for injuries caused by crushing, which correctly belong under “other specified causes”, and for injuries where the patient was struck by a machine, which are properly coded as machinery injuries.


The estimated proportion of correct cause of injury E codes found in this study (57.6%) was similar to that reported in other studies of E codes of ED data5,7 and somewhat less than that reported for hospital discharge data.2 These relatively high rates of errors in E coding at the detailed level indicate a need for additional training for hospital staff in the documentation and coding of the cause of injury.

Key points

  • Cause of injury E codes were very complete (98%) for injury admissions in the emergency department.
  • Only 30% of reviewed injury admissions had location of injury E codes.
  • Cause of injury E codes were accurate for 65% of probable work‐related cases and 57% of probable non‐work‐related cases.
  • Broad cause of injury categories from the International Collaborative Effort Matrix were correct for about 85% of cases.
  • Common misclassifications suggest accuracy of E codes could be substantially improved through training of hospital staff, including admissions staff, providers, and hospital coders.

The validation E codes in this study are based on the medical record and assume the record is a correct description of the actual cause. If the medical record is not correct, accurate coding may still result in misclassification. This study did not try to validate the medical record; however, the finding that documentation of the cause of injury was very complete (95%) indicates that a cause, if not the cause, of an injury is being addressed by clinical personnel and is available for analysis.

The findings for the accuracy of coding and completeness of documentation for injury place of occurrence are much less encouraging. Place of occurrence E codes were available and correct in less than 21% of records, and documentation of the place of occurrence was largely missing.

The finding that there was about 85% accuracy for broad external cause of injury categories indicates a level of accuracy that is adequate for general category surveillance, although there is room for improvement. However, the levels of accuracy achieved for the wide range of injuries and causes in this broad, population‐based data may not extend to individual types of injury or cause. The detailed analysis indicated that nearly two‐thirds of machinery injuries were misclassified, suggesting that many machinery injuries go unrecognized. Systematic misclassification has also been reported for E coding of drowning cases.8

Our analysis indicates that information on the cause of injury is nearly always available in the medical record, but location of the injury is poorly documented. Healthcare providers should be encouraged to include these elements in clinical notes to aid surveillance activities and prevention efforts. Our detailed analysis also points to some common errors that could serve as a focus for training of hospital‐based coders. We had no information on the training of the hospital coders or on quality control procedures used at the time of the study. Training opportunities and certification of medical coders through professional organizations have increased since that time. Consequently, the level of accuracy found in this study probably represents a lower limit to the accuracy that we could expect under current conditions.


ED - emergency department

EDISS - Emergency Department Injury Surveillance System

ICD - International Classification of Disease

ICD‐9‐CM - ICD Ninth Revision Clinical Modification

PNWR - probable non‐work‐related

PWR - probable work‐related


Competing interests: None.

This work was funded by grant number RO1 OH04262 from the United States Centers for Disease Control and Prevention, National Institute for Occupational Safety and Health.


1. STIPDA Resolution on improving external cause of injury coding. Iowa City: State and Territorial Injury Prevention Directors Association 2004
2. Lemier M, Cummings P, West T. Accuracy of external cause of injury codes reported in Washington State hospital discharge records. Inj Prev 2001. 7334–338.338 [PMC free article] [PubMed]
3. Williamson A, Feyer A ‐ M, Stout N, Usher H. Use of narrative analysis for comparison of the causes of fatal accidents in three countries: New Zealand, Austratlia, and the United States. Inj Prev. 2001;7(Suppl I)15–20.20 [PMC free article] [PubMed]
4. Langley J, Stephenson S, Thorpe C. et al Accuracy of injury coding under ICD‐9 for New Zealand public hospital discharges. Inj Prev 2006. 1258–61.61 [PMC free article] [PubMed]
5. Schwartz R, Nightingale B, Boisoneau D. et al Accuracy of e‐codes assigned to emergency department records. Acad Emerg Med 1995. 2615–620.620 [PubMed]
6. NCHS ICD‐9 Framework for Presenting Injury Mortality Data. Atlanta, GA: National Center for Health Statistics, Centers for Disease Control 2004
7. MacIntyre C, Ackland M, Chandraraj E. Accuracy of injury coding in Victorian hospital morbidity data. Aust N Z J Public Health 1997. 21779–783.783 [PubMed]
8. Smith G, Langley J. Drowning surveillance: how well do e‐codes identify submersion fatalities. Inj Prev 1998. 4135–139.139 [PMC free article] [PubMed]

Articles from Injury Prevention are provided here courtesy of BMJ Group