Search tips
Search criteria 


Logo of nihpaAbout Author manuscriptsSubmit a manuscriptHHS Public Access; Author Manuscript; Accepted for publication in peer reviewed journal;
Accid Anal Prev. Author manuscript; available in PMC 2012 February 27.
Published in final edited form as:
PMCID: PMC3286870

Female Involvement in U.S. Nonfatal Crashes Under a Three-Level Hierarchical Crash Model


Men have long held the lead in motor-vehicle crashes; however, research indicates that women are closing the gap. To further investigate this problem, we applied a hierarchical model to investigate female involvement in fatal crashes in the United States. The hierarchical model recognizes that decisions at higher levels affect the decisions at lower levels. At the top level, the model assumes that the driver's condition (e.g., inattention, fatigue, impairment) affects the next level (e.g., speeding or other failures to obey traffic laws), which subsequently affects the basic maneuvering skills (i.e., the lowest level) were either non-existent, or largely explained by gender differences in alcohol consumption. We found that although female involvement in skill-related crashes was not different from that of males, females were more likely than males to apply wrong maneuvers when speeding was involved. We also found that the most important contributing factor to gender differences in nonfatal crashes can be traced back to gender-based differences in alcohol consumption.

Keywords: female drivers, nonfatal crashes, hierarchical model


Despite significant progress in traffic safety during the past decades, motor-vehicle crashes (MVCs) remain a major source of injury. U.S. males account for most of the traffic fatalities—three times that of females—and thus, have received most of the resources and focus (e.g., Cerrelli, 1998). Current data show, however, that the prevalence of women in fatal MVCs is rising. The National Highway Traffic Safety Administration (NHTSA) reported that the number of male drivers killed in fatal crashes dropped from 45 084 in 1975 to 39 739 in 1994. Yet, during the same period, the number of female drivers in fatal crashes increased from 9356 to 13 430 (NHTSA, 1995). Interestingly, the estimated involvement rate in fatal crashes per 100 000 licensed male drivers has constantly declined over the last 30 years (from 62 in 1975 to 42 in 2003), whereas it has remained unchanged for female drivers for about 15 years (National Center for Statistics and Analysis, 2005). Focusing on fatal crashes, Romano and colleagues (2008) reported that relative female involvement in fatal crashes has indeed increased in the United States, albeit mostly due to an increase in traffic exposure. As mentioned, reasons for this increase in crash fatalities is often cited as related to the different and richer roles modern women are playing in the society. Some explanations focus on household roles and their associated stresses, travel patterns, and driving skills. Other explanations go beyond exposure and argue that the development of changes in normative behaviors among women that occur over time translate into new, riskier driving behaviors (e.g., Pisarski, 1992; Voas et al., 1998). Romano and colleagues (2008) found some evidence suggesting that the latter could be a factor in explaining the relative increase in female crashes, but only for young female drivers. Understanding these changes would require data simultaneously containing both behavioral and crash data, as well as models able to properly fit the data.

Keskinen (1996) suggested the use of hierarchical models to systematize the contribution of behavioral factors to crash risk. Keskinen hierarchical levels of driving behavior (HLDB) model (described in English by Keskinen et al., 2004) postulates that behavioral factors contributing to the likelihood of crashes respond to a hierarchy, with decisions drivers' made at higher levels affecting the decisions at lower levels. At the top level, the HLDB model assumes that the drivers' goals of life affect the reason they drive, which subsequently affect the drivers' ability to master traffic situation and vehicle maneuvering. Because of its focus on behavior, the HLDB model has been applied in Europe to study the risks that novice drivers face (Laapotti et al., 2001) and the development of driver training programs for young drivers (Hatakka et al., 1999, 2002; Berg, 2006).

Although building a hierarchical model containing both behavioral and driving contextual data could become a valid and useful tool for investigating gender-based differences in crash involvement, building such a model is not straightforward. The main reason for this limitation is that simultaneous information on behavioral and contextual driving is rarely available. To deal with this limitation, we adopted a simplified three-level hierarchical model. The three levels we used are (from top to bottom) driving context (e.g., inattention, fatigue, impairment), which we assume has an effect on the next level (e.g., speeding or other failures to obey traffic laws), which subsequently affects the basic maneuvering skills (i.e., the lowest level). The goal of this effort is twofold. First, we want to investigate if the application of hierarchical models is useful to the study of gender-based differences in nonfatal crashes; and second, if there are gender-based differences across the different contributing crash factors.


2.1.1 Conceptual Model

Drinking-and-driving contexts can be viewed as decision-making situations requiring individuals to choose between riskier and safer courses of action (Labouvie and Pinsky, 2001). Like the HLDB, our model recognizes that decisions at higher levels affect the decisions (and skills) at lower levels. At the top level, our model assumes that the person's driving context (e.g., drinking and driving) impacts the decisions at the second level (e.g., speeding or other failures to obey traffic laws), and subsequently, affect the basic maneuvering skills (e.g., controlling the vehicle direction).

Hierarchies in our model are broadly defined, with the sole requirement that factors within one level are dependent upon factors in superior levels, but without specifying how the factors are interconnected. Although such a lack of specification precludes a detailed study of the different mechanisms by which the different factors interact, it simplifies the analyses and makes them tractable.

2.1.2 Data

Crash data for this study were obtained from the 1997–2007 General Estimates System (GES). In 1988, the GES began to provide data about all types of crashes involving all types of vehicles. Data in GES come from a nationally representative probability sample of about 6.4 million police-reported crashes, from which about 50,000 are sampled each year. Crashes included are those that result in a fatality or injury that involves major property damage. GES is a valuable database that provides excellent information about the vehicle, the road characteristics, and the road conditions at the time of the crash; the basic sociodemographic characteristics of the driver; and the drivers' alcohol consumption, maneuvering skills, and sources of distraction. Variables in the GES database appear as listed in police crash reports. Thus, for instance, if the police officer at the crash scene believed that bad weather, alcohol, or an improper driving maneuver were factors contributing to the crash, the officer would have included that information in her/his report. Please notice that contributing factors, such as those just mentioned, are not mutually exclusive (i.e., they may all be present in the same crash).

There were 2,747,480 observations read from the 1997–2007 GES data set. About 13% of these records involved single-vehicle crashes (n=341,467). From these datasets, we eliminated drivers with missing gender information; drivers of non-passenger vehicles (buses, farm and construction vehicles, snowmobiles, etc.), ambulances, military or police vehicles, and nonmoving vehicles (e.g., parked). Also, to increase the likelihood of a correct identification of crash responsibility, only single-vehicle crashes were considered. After these manipulations, 277,672 drivers remained.

2.2 Measures

2.2.1 The Three Crash-Factor Levels

Table 1 shows the GES variables we used and the criteria we applied to build the condition variables at each of the three crash-factor levels. On top of our model is level-3 (driving context: alcohol consumption, fatigue/inattention), followed by level-2 (failing to obey a traffic signal, speeding, other aggressive driving) and level-1 (loss of vehicle control associated with improper vehicle maneuvering or bad weather or surface conditions).

Table 1
GES Variables Used to Create Three Driving Levels and Conditions

2.3 Analyses

2.3.1 Analytical Models

To investigate the gender contribution to each level and crash factor, we applied descriptive analyses (chi-square) as well as multinomial (polytomous) logistic regression. Two types of descriptive analyses were applied. First we evaluated the association between gender and crash-factor level within the two crash-severity levels under consideration (nonfatal injuries and property damage only [PDO]). Then we reversed these comparisons and evaluated the association between gender and crash severity within crash-factor level. To expand these bivariate analyses and adjust by other covariables of interest, we then applied multinomial logistic regressions. Two sets of regressions were applied: one modeling the identification of a level-1 crash-factor as a function of level-2 crash-factors and covariables, and another modeling the identification of a level-2 crash-factor as a function of level-3 crash-factors and covariables. Covariables considered were gender (male, female), age (<21 – the reference level, 21–34, 35–64, and 65 and older), and presence of passengers (none – the reference level, 1, and 2 or more). Although the analysis of main effect models was of interest to us, we were particularly interested in the interaction between gender and all relevant factors and covariables. For instance, in modeling level-1 crash-factors, we were especially interested in the interactions between gender and level-2 crash-factors. If such interaction were significant, it would mean that compared to males, the contribution of females to the likelihood of a level-1 crash-factor would partially depend on the occurrence of a level-2 crash-factor. To evaluate these interactions, we run separate models, including only main effect and models including the interactions of interest. Because of the multiple comparisons performed, we accepted statistical tests to be significant only at a 1% alpha value. STATA 11, which has the ability to accommodate weights and survey designs, such as those used to collect the GES database (“svy” estimation commands), was used for analyses.


3.1 Bivariate Comparisons of Gender and Crash-Factor Level Within Crash Severity

Table 2 shows the number of drivers in the file, by gender and severity (first row). It also shows the percentage of those drivers who appear in the different crash-factor levels. The female/male (F/M) ratios for cases in which no specific crash-factor level was identified are also included to serve as control for comparisons against crashes with an identified crash-factor. Also shown in Table 2 is the F/M ratio corresponding to each combination of crash-factor level and crash severity. Overall, for each male driver in the file, there are .57 females. Compared to males, female involvement was significantly smaller in level-3 crash-factors (F/M = .51) than in other types of crashes—when level 3 crash-factors were absent (F/M = .63)—with such a gender difference also taking place in both types of crash severity (F/M = .49 and F/M = .58 within PDO crashes, and F/M = .53 and F/M = .74 among nonfatal injury crashes). The relatively lower involvement of female drivers in level-3 crash-factors occurred in those in which alcohol was identified. No statistically different gender difference was observed associated to fatigue/inattention.

Table 2
Distribution of Crash Types Within Gender

The role of gender in level-2 crash-factors somewhat mimics that for level-3 crash-factors: compared to males, female involvement was significantly smaller in level-2 crash-factors (F/M = .50) than in other types of crash-factors—when level 2 was absent (F/M = .60)—with such a gender difference also occurring in both types of crash severity. Among the level-2 crash-factors, the relatively lower involvement of female drivers occurred associated to either speeding or other aggressive driving. No statistically different gender difference was observed regarding failure to obey.

The role of gender in shaping the occurrence of level-1 crash-factors seems less clear than in the other two levels. Overall, no statistically significant gender difference was observed among level-1 crash-factors. When individual conditions within level-1 were tested, however, some differences by crash severity surfaced, with female drivers being less involved than males in maneuver PDO crashes, but female drivers were more involved than their male counterparts in weather/surface nonfatal injury crashes.

3.2 Multinomial Logistic Regressions

Table 3 reproduces the outcome of the multinomial logistic regression models for the occurrence of the level-1 crash-factors (dependent variable) as a function of the level-2 crash-factors, gender, and other covariables (the three leftmost columns in Table 3), and level-2 crash-factors (dependent variable) as a function of the level-3 crash-factors, gender, and other covariables (the three rightmost columns in Table 3). Main effects and full models for both regressions are also shown. The Pseudo R2 measure shown in Table 3 indicates that although the models we applied were not bad, they left an important fraction of the variance unexplained. The inclusion of the interaction terms (our full model) increases the goodness of fit of the main effect models only marginally.

Table 3
Multinomial Regressions

3.2.1 Main effects model

Regression of the level-1 crash-factors on the level-2 crash-factors and covariables (main effects only) shows that the likelihood of a maneuver crash-factor increases significantly (p<.01) with speeding (OR=5.309), but not with other aggressive or failure to obey driving situations (). It also tends to decrease as the age of the driver increases, albeit at lower odds ratios after age 34. A maneuver crash-factor was also less likely to be associated with PDO crashes rather than with nonfatal injury crashes. Speeding also increased the likelihood of a weather/surface crash-factor, although the corresponding odds ratio (1.601) was much lower than the one that links speeding with maneuvering (5.309). Neither age nor gender was significant in the main effect model under the 1% alpha criterion. However, under a less strict criterion (e.g., 2% alpha), females would have been found to be less likely to appear in maneuver situations than males (OR=.835, p value=.019) Presence of passengers does not have a significant impact on maneuver situations under the 1% alpha criterion.

The regression of the level-2 crash-factors on the level-3 crash-factors and covariables (main effects only) shows the important role that alcohol plays in inducing speeding and other aggressive crash-factors. The odds ratio associated with alcohol (either alone or in combination with fatigue/inattention) are among the largest of any factor contributing to speeding (OR=5.392 and OR=5.469, for the main effects and full model, respectively). Alcohol in combination with fatigue/inattention was also among the factors that most contributed to crashes associated to other aggressive driving (OR=2.981 and OR=3.305, for the main effects and full model, respectively). The role of the driver's age differs for speeding, other aggressive, and fail to obey crash-factors. Although the likelihood of speeding decreases with age, the likelihood of fail to obey remains relatively constant over all the age groups except for drivers ages 65 and older, who have higher odds of involvement in fail to obey situations than younger drivers. Speeding was also less likely to be associated to PDO crashes rather than to nonfatal injury crashes. Under the main effects model, gender was significant only to explain other aggressive crashes, with females less likely to be involved in these types of crashes than males (OR=.275).

3.2.2 Full model

As mentioned, the inclusion of the interaction terms does not increase much the goodness of fit of the main effect models. Further, the Akaike's information criterion (AIC) suggests that there is no apparent benefit in considering the full model over the main effects model. Nevertheless, our main interest in running our full models was to investigate the interaction between gender and the different variables under study. For the level-1 on level-2 regression model and although it was not significant in the main effect model, gender tested significant in the full model when it interacted with age and crash severity. Older female drivers (aged 35 and up) were more likely to be involved in maneuver situations than younger female drivers. Females also were more likely to be involved in nonfatal injury crashes than in PDO crashes, a finding that replicates the one obtained from the descriptive bivariate analyses. The interactions that were of particular interest to this level-1 on level-2 model were those involving gender and level-2 crash-factors, for they explore if the role of factors such as speeding, fail to obey, or other aggressive factors on maneuver or weather/surface events differs for males and females. Most interactions in this model were not significant, with the exception of female drivers being more likely to be involved in maneuver and weather/surface crashes than their male counterparts when they were also speeding (OR=1.608). It is interesting to note that although statistically correct, the operational meaning of this finding is not apparent. The production of results statistically significant but difficult to consider for operational purposes may signal one of the limitations of the use of hierarchical models, as we did in this study. On the other hand, the use of the hierarchical model is better justified with findings showing that gender differences in nonfatal crashes at any level are highly associated to gender differences in alcohol consumption.

Like for the level-1 on level-2 model, the interactions in the level-2 on level-3 model were largely nonsignificant. The interactions of particular interest for the level-2 on level-3 model were those involving gender and level-3 crash-factors, as they evaluated whether the role of factors such as alcohol or fatigue/inattention varied across gender. Table 3 shows that females were less likely to be involved in other aggressive crashes than their male counterparts when they were under the influence of alcohol and fatigue. The latter result should be taken with extreme caution, however, because of the relatively small number of female drivers in that category.


By looking at different levels of crash-factors, this study allows for some separation of those factors and therefore achieves more detail in studying the role of gender on non-fatal motor vehicle crashes. One of the clearest results coming from this study is that females are less involved in alcohol-related crashes and speeding-related crashes than males. This result is not surprising, however, for it only confirms previous findings in the literature (Liu et al., 2005; Romano et al., 2008). Our lack of finding for gender differences associated to fatigue/inattention or maneuvering suggests that the occurrence of these factors is not circumscribed to a certain gender, but that their occurrence affects the general population. Further, the lack of significance for gender in their association with maneuvering suggests that the skill-levels of males and females may be equivalent.

This study also underscores the implicit complexity of the crash situations, as well as the potentiality of the hierarchical model when applied to crash data. For instance, this study found that female drivers are more prone to be involved in maneuver and weather/surface crash situations than male drivers when speeding, but female drivers were less likely to speed than males. Similar complexity was found by Romano and colleagues (2009) applied a hierarchical model to study the role of gender on fatal crashes. In that study, like in this one, female drivers were less likely to speed or to be involved in alcohol-related crashes than males. Also found in both studies was some evidence suggesting that female drivers were not more prone to skill-related crashes (level-1 crash-factors) than their male counterparts were. Thus, both studies present additional evidence against the popular myth portraying female drivers as more prone to be involved in skill-related crashes than males, particularly when driving occurs under normal traffic conditions. As discussed, however, the evidence coming from this study is not conclusive. We also found that females were more involved in some weather/surface-related crashes (Table 2), and that females might be more prone to maneuver-related crashes when speeding is involved. Unfortunately, reasons for these findings are unclear.

Despite its interesting results, this study is not free of shortcomings. As mentioned, the inclusion of higher-level crash factors in the analytical model seems appropriate because it helped explain some of the variations observed at lower levels. Application of our hierarchical model is not free of problems, however. One such difficulty emanates from the required partition of the database, which resulted in small sample sizes for some categories and forced us to collapse variables into broader categories. Another limitation involves the modeling of crashes as a function of the hierarchically structured factor levels and covariables. An intuitively appealing approach to such modeling would have been the use of path analysis, modeling the role of factors at superior levels (e.g., alcohol) on intermediate levels (e.g., speeding), and lower levels (e.g., maneuvering). Unfortunately, the use of path analysis to the hierarchical model presents several limitations, such as dealing with categorical variables and some paths that are not well defined. Research aiming to find a valid implementation of path analysis to the problem and data under study is already underway by our research team. Unfortunately, this line of research has not yet yielded fruitful results. Finally, due to the unaccounted complexities, some of the results obtained by this study, although statistically significant, are of dubious implications or irrelevant. That is, for instance, the case of our finding that female drivers were more likely to be involved in maneuver and weather/surface crashes than their male counterparts when they were also speeding. While findings like this are statistically correct, their operational and policy implications are not.

In this study, we proposed an alternative strategy. To understand the role of gender at different crash-factor levels, we separated the analyses into two parts: independently examining the contribution of level-3 to the level-2 crash-factors and the level-2 crash-factors to the level-1 crash-factors. Although this approach did not allow for a full integration of the three-level crash-factors, by focusing on the gender-related interaction terms and going from level-1 crash-factors up to level-3, the model provided some meaningful results.

As mentioned, paths showing the influence of crash-factors as they triggered down from level 3 to level 1 could not be investigated. Also, to maximize the likelihood that drivers included in this study were responsible for the crash, we only included single-vehicle crashes. Not all crashes are reported to law enforcement agencies; therefore, crashes involving very little property damage and no injuries are likely to be underreported and therefore absent from the GES. Finally, another limitation of this study (perhaps the most relevant) involves the possibility of bias by those officers who coded the crashes. Police officers may have been more prone to assign different codes to females than males (e.g., giving a maneuver code to females more frequently than to males). Nevertheless, our study only found an association between female drivers and maneuver in situations in which speeding was involved. If there were gender-related bias in the way these types of crashes were registered, then such an association between speeding and maneuver should be revisited.

In summary, by applying a hierarchical model, this study disentangles some of the factors explaining the role of gender in nonfatal crashes. As such, this effort provides support to the application of these models. As a result, we found that many of the gender-based differences associated to skill-related crashes were either nonexistent, or largely explained by gender differences in alcohol consumption. However, our model was not free of problems and some of the findings seem either irrelevant or difficult to interpret for policy considerations. Thus, further research on the use of hierarchical models to investigate gender-differences in motor-vehicle crashes would be advisable.

Table 4
Multinomial Regressions


Grant Support: National Institute of Child Health and Human Development (Grant No. R21HD053840-01A1) and National Institute on Alcohol Abuse and Alcoholism (Grant No. P20 AA017831).


Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final citable form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.


  • Berg H-Y. Reducing crashes and injuries among young drivers: What kind of prevention should we be focusing on? Injury Prevention. 2006;12(ISuppl 1):i15–18. [PMC free article] [PubMed]
  • Cerrelli EC. Research Note. National Highway Traffic Safety Administration; Washington, DC: 1998. Crash data and rates for age-sex groups of drivers, 1998.
  • Hatakka M, Keskinen E, Gregersen NP, Glad A. Theories and aims of educational and training measures. In: Siegrist S, editor. Driver training Testing and licensing -- towards theory-based management of young drivers' injuries risk in road traffic. Results of EU-Project GADGET (Guarding Automobile Drivers through Guidance Education and Technology) Bfu; Bern: 1999. pp. 13–48. Work Package 3.
  • Hatakka M, Keskinen E, Gregersen NP, Glad A, Hernetkoski K. From control of the vehicle to personal self-control; broadening the perspectives to driver education. Transportation Research Part F: Traffic Psychology and Behaviour. 2002;5(3):201–215.
  • Keskinen E. Why do young drivers have more accidents? In: Mensch BS, Sicherheit, editors. Berichte der Bundesanstalt fur Stra Benwesen. Bergisch Gladbaci; Germany: 1996.
  • Keskinen E, Hatakka M, Laapotti S, Katila A, Peräaho M. Driver Behaviour as a Hierarchical System. In: Rothengatter T, Huguenin RD, editors. Traffic and Transport Psychology. Elsevier, Kidlington; Oxford: 2004. pp. 9–24.
  • Laapotti S, Keskinen E, Hatakka M, Katila A. Novice drivers' accidents and violations - a failure on higher or lower hierarchical levels of driving behaviour. Accident Analysis and Prevention. 2001;33(6):759–769. [PubMed]
  • Labouvie EW, Pinsky I. Substance use and driving: The coexistence of risky and safe behaviors. Addiction. 2001;96(3):473–484. [PubMed]
  • Liu C, Chen C-L, Subramanian R, Utter D. Analysis of Speeding-Related Fatal Motor Vehicle Traffic Crashes. National Highway Traffic Safety Administiration; Washington, DC: 2005. DOT HS 809 839.
  • National Center for Statistics and Analysis 2005. Traffic Safety Facts 2005: A Compilation of Vehicle Crash Data from the.
  • Fatality Analysis Reporting System. the General Estimates System . National Highway Traffic Safety Administration. U.S. Department of Transportation; Washington, DC: DOT HS 810 631.
  • National Highway Traffic Safety Administration . Fatality analysis reporting system (FARS) 1995. National Highway Traffic Safety Administration; Washington, DC: 1995. DOT HS 809 726.
  • Pisarski AE. Travel behavior issues in the 90's. U.S. Department of Transportation; Washington, D.C.: 1992.
  • Romano E, Kelley-Baker T. Alcohol-Related Fatal Crashes in the United States Under a Three-Level Hierarchical Crash Model. The 32nd Annual Scientific Meeting of the Research Society on Alcoholism; San Diego, CA. 2009.
  • Romano E, Kelley-Baker T, Voas RB. Female involvement in fatal crashes: Increasingly riskier or increasingly exposed? Accident Analysis and Prevention. 2008;40(5):1781–1788. [PMC free article] [PubMed]
  • Romano E, Kelley-Baker T, Torres P. Female Involvement in U.S. Fatal Crashes Under A Three-level Hierarchical Crash Model: Mediating and Moderating Factors. Presented at the TRB's 4th International Conference on Women's Issues in Transportation; Irvine, CA. Oct, 2009. 2009.
  • Voas RB, Wells J, Lestina D, Williams A, Greene M. Drinking and driving in the United States: The 1996 National Roadside Survey. Accident Analysis and Prevention. 1998;30(2):267–275. [PubMed]