Search tips
Search criteria 


Logo of hsresearchLink to Publisher's site
Health Serv Res. 2010 August; 45(4): 1083–1104.
PMCID: PMC2910570

The Effect of the WIC Program on the Health of Newborns



To determine the effect of the Special Supplemental Nutrition Program for Women, Infants, and Children (WIC) on birth outcomes.

Data Source

The Child Development Supplement (CDS) of the Panel Study of Income Dynamics (PSID). The PSID provides extensive data on the income and well-being of a representative sample of U.S. families from 1968 to present. The CDS collects information on the children in PSID families ranging from cognitive, behavioral, and health status to their family and neighborhood environment. The first two waves of the CDS were conducted in 1997 and 2002, respectively. We use information on 3,181 children and their mothers.

Study Design

We use propensity score matching with multiple imputations to examine whether WIC program influences birth outcomes: birth weight, prematurity, maternal report of the infant's health, small for gestational age, and placement in the neonatal intensive care unit. Furthermore, we use a fixed-effects model to examine the above outcomes controlling for mother-specific unobservables.

Principal Findings

After using propensity scores to adjust for confounding factors, WIC shows no statistically significant effects for any of six outcomes. Fixed-effects models, however, reveal some effects that are statistically significant and fairly substantial in size. These involve preterm birth and birth weight.


Overall, the WIC program had moderate effects, but findings were sensitive to the estimation method used.

Keywords: WIC, birth outcomes, propensity score, multiple imputations, fixed-effects model

The Special Supplemental Nutrition Program for Women, Infants, and Children (WIC) is the third largest food program in the United States, reaching nearly 8 million women and children at a cost of U.S.$5 billion (USDA 2007a). The program forms a key part of the safety net for poor families: nearly 60 percent of poor children under the age of 4 receive WIC, nearly twice as many as those who receive food stamps (Zedlewski and Rader 2005).

Given the program's scope and prominence, the efficacy of WIC has been studied extensively. However, findings have been inconsistent, and the program remains an active focus of policy evaluation (Ludwig and Miller 2005). The methodological challenge facing researchers is that many characteristics are likely correlated with both WIC use and child health. This correlation introduces bias, leading one to either overstate or understate WIC's effects depending on the nature of the relationships between the measured and unmeasured confounding factors, children's outcomes and WIC participation.

In this paper, we use propensity scores to examine the association between WIC participation and birth outcomes. Under key assumptions, propensity scores approximate a randomized experiment by creating matched groups comprising those from both the treatment and comparison groups who are comparable except for treatment status. We further estimate fixed-effects models that allow for mother-specific unobservables. In addition, we examine how strong confounding with an unobserved variable would have to be to explain the apparent effect (or noneffect) of WIC participation using approaches proposed by Rosenbaum (2002) and Imbens (2003). These analyses involve data from the Panel Study of Income Dynamics—Child Development Supplement (PSID-CDS), a study that has tracked more than 5,000 families for nearly four decades. The data included an over-sample of poor families, making it well suited for examining issues of poverty and child development in the United States.

This study makes two contributions to the existing literature. First, by considering multiple outcomes—birth weight, born preterm, low birth weight, small for gestational age (SGA), neonatal hospitalizations, and maternal report of infant health—this study provides a more complete account of WIC's potential effects than most prior research. Second, because of the scope of the data, we account for the effect of heretofore unmeasured or omitted characteristics (such as maternal IQ and family income) that are likely to confound WIC estimates.


The WIC program promotes the development of fetuses, infants, and young children by offering supplemental food packages, nutritional education, and referrals to health care and other social service providers. Since its inception in the early 1970s, the program has grown tremendously in size and scope. In 1974, the program had 88,000 participants; by 2004, the program had nearly 8 million participants, served by 2,200 agencies throughout the country (USDA 2006). Half of all children <1 year and one-quarter of children between ages 1 and 4 years participate (Oliveira et al. 2002). One in every U.S.$10 spent on food assistance in the United States is devoted to WIC (USDA 2007a).

The WIC program offers three benefits to eligible participants. The first is a food package, containing foods that are high in certain nutrients, such as protein, iron, calcium, vitamin A, and vitamin C (USDA 2007b). Infants up to a year also can receive free and reduced-cost formula. In 2006, the value of the average monthly food package per person was U.S.$39.03, but it was higher for families receiving free and reduced cost formula (Food and Nutrition Service 2007). The second benefit is nutritional counseling, which consists of two mandatory education classes during each 6-month certification period. The final benefit is referrals to health care and other social services.

To qualify for WIC, recipients must meet three requirements. First, they must be a member of a categorically eligible group: pregnant women, postpartum women (eligible for up to 1 year if breastfeeding, 6 months if not breastfeeding), and children up to 5 years old. Second, they must be income eligible, either because their incomes are at or below 185 percent of the federal poverty line or because they participate in the Temporary Aid to Needy Families, Medicaid, or Food Stamp programs. Third, a health care professional must declare them at “nutritional risk,” either because they have a medical-based risk (e.g., anemia, underweight, and poor pregnancy outcomes) or because they have a diet-based risk (e.g., an inadequate diet). In practice, nearly all mothers and children meet the nutritional risk criterion (Currie 2003).

WIC promotes healthy birth outcomes through the provision of nutritious foods that aid in fetal development and/or the promotion of advantageous maternal behaviors through its counseling sessions and referral to other social agencies (Rossi 1998). However, various factors may block these pathways. First, while situations of extreme nutritional deprivation have strong negative effects on birth outcomes (Stein and Susser 1975), the effects of moderate nutritional deficits are much less pronounced (Goldenberg and Rouse 1998; Iams 1998;). Nutrition is linked to poor birth outcomes insofar as the mother has a low prepregnancy body mass index or has inadequate gestational weight gain (Kramer 2003). WIC recipients, though, are more likely to be obese before pregnancy and do not gain substantially more weight during pregnancy than do non-WIC recipients (Bitler and Currie 2005).

Second, the counseling and referral components of WIC are limited, and evidence as to whether they are effective in changing behaviors is inconsistent. WIC offers two voluntary half-hour counseling and educational sessions per 6- month enrollment period, during which WIC workers are required to provide information on nutrition, drug, and alcohol services, and other available social programs (GAO 2004). Some studies have found that these sessions increase prenatal care and decrease smoking and tobacco use, while other work has found either no association or a negative one (Rush et al. 1988; Kahler et al. 1992; Bitler and Currie 2005; Joyce, Gibson, and Colman 2005;). The one maternal behavior where the evidence appears unequivocal is breastfeeding, as research has repeatedly demonstrated that WIC reduces breastfeeding (Chatterji and Brooks-Gunn 2004; Jacknowitz, Novillo, and Tiehen 2007;), presumably because of its provision of free formula.

Despite the mixed evidence supporting the pathways between WIC and birth outcomes, numerous studies have found that prenatal WIC participation is associated with increases in mean birth weights (Lazariu-Bauer et al. 2004; Bitler and Currie 2005;), decreases in the number of low and very low birth weight babies, fewer preterm deliveries, and reductions in neonatal and fetal mortality rates (Kennedy and Kotelchuck 1984; Stockbauer 1987; Rush et al. 1988; Devaney 1992; Brown, Watkins, and Hiett 1996; Moss and Carver 1998; May et al. 2001;). The magnitude of the program's impact differs depending on the degree to which studies address the problem of selection bias, suggesting that endogeneity is a key problem in WIC evaluation (Ludwig and Miller 2005). For example, using data from Pregnancy Nutrition Surveillance System, Joyce et al. (2005) adjusting for the timing of WIC enrollment eliminated the association between WIC and preterm birth. The authors reasoned that women with longer pregnancies had more opportunity to enroll in WIC and hence estimates of WIC on preterm birth were confounded by gestational age bias. In another study, Joyce et al. (2005) use data on births in New York city and find that the advantage of WIC to twins, who are at particular risk of fetal growth restriction, to be minimal.

The literature also demonstrates the advantages and disadvantages of alternative data sources. Administrative data generally have more reliable information on program participation than self-report data, but they have limited information on the range of outcomes and the potential confounders. Lazariu Bauer and colleagues, for example, examine a large administrative database on WIC participation in New York state. The authors do consider 29 potential confounders—these variables are drawn from birth certificate data and program information. However, key characteristics are unavailable, such as mother's cognitive ability or family income. (The study does include rates of child poverty in the geographic region.) As discussed below, these are key predictors of both children's outcomes and WIC participation.

Perhaps most striking is that the data include only WIC participants. The authors suggest that early and late participants are more comparable to each other than to nonparticipants, but that assertion is untestable with these data. Researchers consider dosage an indicator of the program impact, but in an observational study, such assessments suffer from the same difficulties as comparisons of participants with nonparticipants. For the above reasons, we rely on survey data and then address how recall bias might influence our findings.



The study analyzes data from the CDS-PSID. The PSID includes extensive data on the income and well-being of a representative sample of U.S. families for nearly four decades. The study covers a range of topics, including family composition change, food and housing expenditures, employment, income, health, and welfare. Data were collected annually from 1968 to 1997 and biannually after 1997.

The CDS is one component of the PSID. The CDS provides comprehensive, nationally representative, and longitudinal data on children and their caregivers (McGonagle and Schoeni 2006). In 1997, PSID randomly selected 3,563 children aged 0–12 from 2,394 PSID families and collected information on their well-being, ranging from cognitive, behavioral, and health outcomes to family and neighborhood environments. While not shown in the tables, we compared the birth weight distributions of the CDS sample with those reported in the national vital statistics during the same period (1985–1997) and found they were comparable. For example, the percentage of low birth weight babies among non-Hispanic white mothers ranged from 5.6 to 6.5 from the vital statistics, and it was 5.36 for our sample; that for non-Hispanic black mothers ranged from 12.6 to 13.1 from the vital statistics, and it was 13.75 for our sample.

Our study use information on 3,181 children and their families. Three hundred and eighty-two children were excluded because (1) they did not live with their biological mother at the time of interview (n=271); (2) they were missing information on WIC participation (n=101), on race and ethnicity (n=5), and on date of birth (n=5).


The main variable of interest is a mother's prenatal WIC participation. The child's primary care givers (usually their biological mother) reported this information during the 1997 interview.

Our outcome measures include six child health outcomes: the child's birth weight (in grams), whether the child was born low birth weight (e.g., weighed <2,500 g), whether the child was born prematurely (e.g., gestational age is <37 weeks), whether the child was born SGA (e.g., weighing less than a specified percentile of birth weight for a given gestational age; we used a gender-specific SGA measure from Alexander et al. 1996), the mother's rating of the child's health at birth as compared with other babies (1=worse, 2=same, 3=better), and whether the child was placed in the neonatal intensive care unit (NICU).

Our covariates include a broad array of child, maternal, and household characteristics collected over the period of the PSID study. For child covariates, we included the child's age (in months), the child's race and ethnicity (white, non-Hispanic; black, non-Hispanic, other), the child's sex, whether the child was first born to the mother, and the number of siblings. Maternal characteristics included IQ,1 education (high school dropout [omitted category]; completed high school, attended college), her age at the time of the child's birth, and whether she was working, married, or was the head of the household (e.g., the only adult) during the year of the child's birth. Family-level characteristics included the household's income from the year of the child's birth. This list is more extensive than that in prior research.

Matching Procedure

For a brief review of propensity score methodology, see the methodological appendix (Appendix SA2). We used PSMATCH2 (Leuven and Sianesi 2003) for Stata 10 SE (StataCorp. 2007). PSMATCH2 implements a variety of propensity score matching methods to adjust for preobservable differences between a treatment and a control group. The program calculates approximate standard errors on the treatment effects assuming independence among observations, fixed weights, homoskedasticity of the outcome variable within the treated and within the control groups, and that the variance of the outcome does not depend on the propensity score (Leuven and Sianesi 2003).

First, we calculated a propensity score for each individual, the predicted probability of WIC receipt. This prediction was generated using the results of a probit model for which WIC participation was the outcome. The predictors included the variables discussed above. As we mentioned earlier, the propensity score represents a “balancing score”—when matching cases on the propensity score, the distribution of the covariates between those who participated WIC and who did not should be the same (Rosenbaum and Rubin 1983b). This property, however, holds only as sample sizes approach infinity and so is only approximate in any finite sample. As a result, good practice involves comparing the distribution of covariates between groups. If the covariates do not balance, then one should modify the propensity score equation (Morgan and Winship 2007) and reassess balance. While over-fitting the data in this manner would not be desirable for many purposes (e.g., such as testing hypotheses about the determinants of WIC participation), modifying the propensity score in this way ensures that the propensity score better captures between-group differences in the covariates. Preliminary analysis suggests that running separate probit model with additional nonlinear terms for four subgroups defined by child's sex and race improved the balance of covariates.

Having been calculated, propensity scores can be used in a variety of ways, including weighting schemes, matching, or as a covariate (Rosenbaum and Rubin 1983a). The key issue with estimating the effect of WIC is calculating a counterfactual outcome—the outcome expected had the individual not received WIC. We do this using a kernel estimator that essentially creates this estimate for a given case using a weighted average of comparison cases with a similar propensity score. The actual estimated counterfactual is a weighted average of those cases; the closer the propensity score, the greater the weight.2 The econometrics literature suggests that this method is especially effective in estimating the counterfactual (Heckman and Navarro-Lozano 2004).

Handling of Missing Data

Because data were collected at multiple interviews, the amount of missing data is relatively high. Table 1 provides information on item missing data. For key variables such as maternal IQ or family income, the amount of missing is 23 percent and 14 percent, respectively. Using the ICE and MICOMBINE commands (Royston 2005) in Stata 10 SE (StataCorp. 2007), we multiply imputed missing values under an assumption of missing at random, analyzed each dataset, and combined the resulting parameter estimates using Rubin's rules. Appendix SA2 also provides more detail on our multiple imputation strategy.

Table 1
Descriptive Statistics of Variables (before Imputation)

In addition to all the outcomes and covariates used in the previous propensity score analysis, the latest child level weights and two other IQ measures3 were included in the imputation.


Except for Table 1, all results reflect pooled estimates across the imputations using Rubin's rule. Table 1 provides descriptive statistics for the actual data analyzed (before the imputation), disaggregated by whether the mother received WIC.

Among our sample, 43 percent of them reported receiving WIC prenatally. Nine percent of children were born low birth weight, and 8 percent had been born prematurely. One in six mothers did not complete high school, and one-third was unmarried when the child was born.

WIC recipients compare unfavorably to other women in terms of their sociodemographic characteristics and some of their children's outcomes. At birth, children of WIC recipients weighed less (3,226 versus 3,386 g), were more likely to be born low birth weight (11 percent versus 7 percent), or small for gestational age (21 percent versus 11 percent). As for demographic characteristics, mothers who received WIC had lower IQs, less educational attainment, and lower household incomes the year the child was born. Those mothers also were younger, reported larger family size, and were disproportionately African American.

Table 2 presents the results of the model used to generate the propensity scores. The table presents the probit coefficients for each of the four subgroups defined by race and the child's gender, pooling across imputations using Rubin's rules. These analyses essentially identify those characteristics that distinguish WIC participants from other women. This multivariate assessment is appropriate to understanding the overall potential of the covariates to confound unadjusted comparisons of recipients and other women. Comparing across subgroups, the effect of some predictors did vary. In some instances, a characteristic predicted WIC participation only for one gender. For example, for reasons that are not entirely clear, the number of siblings predicted WIC participation only for female children. On the other hand, married mothers were less likely to use WIC than other women, and this relationship is consistent across the four subgroups.

Table 2
Propensity Score Models (Imputed)

As we mentioned earlier, a good practice involves checking the covariate balance after the matching. Propensity-score-based comparisons of WIC participants and nonparticipants revealed no substantial differences in the means of the covariates. For more details on covariate balance, see Appendix SA3.

Table 3 presents the estimated effects of WIC participation on the birth outcomes. Each panel represents a different comparison. For each, we present the estimated effect, the standard error, and the p-value. For continuous outcomes, the effect is the mean difference for the outcome for a WIC participant and the estimated counterfactual. For the dichotomous outcomes (like preterm), the effect is the difference in the predicted probability of the outcome.

Table 3
Effects of WIC Program on Birth Outcomes

First, consider the unmatched samples in the leftmost panel. Results indicated that children born to WIC-recipients had lower scores on the maternal health rating of the child, were more likely to be born low birth weight, born small for gestational age, and had lower birth weights. Given the obvious confounding, one could argue that these figures are of little or no use, but they do represent a baseline against which adjusted figures can be assessed. (One also can compare these figures to other datasets to assess across-study differences in sample composition.)

For the matched sample (second panel), the between-group differences were no longer significant. The differences between the matched and unmatched sample were quite striking: for example, in the unmatched sample, the difference in birth weight between the two groups was 159 g (or around 5.6 ounces). In the matched sample, the difference was only 18 g (or 0.7 ounces) and was no longer statistically significant at 0.05 level. Likewise, the difference in the number of babies born low birth weight changed from 4 percent higher in the unmatched sample to a 1 percent decrease in the matched sample. Most of the outcomes were pointing to the anticipated direction (improved outcomes) and, if not, matching reduced the magnitude of negative effects.

Sensitivity Analyses

We addressed several limitations of our analyses and methodology. First, one potential problem with our data is the length of recall required for the women with older children in 1997. We limited the analyses to the 1,627 children ages 7 and younger.

Second, the analyses reported above include nonrecipients who may not have been eligible for WIC and so are not comparable to those using WIC. We reduced this heterogeneity by limiting the analysis to those women enrolled in the Medicaid program at the time of the child's birth. Doing so makes our study more comparable to prior research such as Bitler and Currie (2005), who limit their analyses to women whose delivery was paid for by Medicaid. In both instances, while statistical significance varies somewhat, the fundamental results of the earlier analyses are unchanged.

As noted before, propensity score methodology does not allow for between-group differences in unobserved factors affecting both participation and child outcomes. We addressed this problem in two ways. First, we estimated fixed-effects models that allowed for mother-specific unobservables. In essence, these analyses involved sibling, within-family comparisons and have been used to examine the impact on child outcomes of a range of maternal and family characteristics such as breastfeeding, mother's age, and neighborhood conditions (Rosenzweig and Wolpin 1995; Aaronson 1997; Der, Batty, and Deary 2006; Holmlund, 2005;). In this case, fixed-effect estimation involves comparisons of children born to mothers when receiving WIC with their siblings born to the same mother when she was not using WIC. These results are reported in the last two panels of Table 3. These analyses shed two insights. First, comparisons of the fixed- and random-effects estimates indicate that the two sets of estimates differ statistically. (At the request of a reviewer, we do not present the latter here, but they are available from the authors.) In general, the fixed-effects estimates are larger.

Even within fixed-effect estimation, time-varying unobserved differences may confound comparisons of those children who did and did not receive WIC. Such factors would vary within child and may reflect changes in the mother's circumstances or broader developments in the community. For that reason, we conducted sensitivity analyses like those proposed by Rosenbaum (2002) and Imbens (2003). Such sensitivity analyses assess how strongly confounding with an unobserved variable would have to be to explain the apparent effect (or noneffect) of a treatment (WIC participation). Following Rosenbaum (2002), we began by grouping the data into pairs based on the propensity score. Each pair included a WIC participant and a matching nonparticipant drawn without replacement.4

First, we examined the pattern of matched pairs assuming no unobserved confounding. Among these pairs, one can calculate a nonparametric test (the sign test) of the relationship between WIC and the outcome using discordant pairs. Taking low birth weight as an example, our matching produced 1,373 pairs. Of these, 245 pairs were “discordant”—one birth in the pair was low birth weight and one was not, and the low birth weight baby was born to the WIC participant in 141 (58 percent) pairs. If WIC users and nonusers were perfectly matched, any deviation from 50 percent (the null hypothesis) would reflect the effect of WIC. One can use binomial distribution to calculate the corresponding p-value on the observed effect.

The sensitivity analyses then considered whether this pattern could be explained by hypothetical, unobserved dichotomous characteristic (“healthy” versus “unhealthy”). The link between “healthy” status and WIC participation would have to be relatively strong to explain the null findings. For birth weight, for example, 64 percent of women enrolling in WIC would have to be drawn from the “unhealthy” class. This relationship would have had to exist over and above the balancing of observed characteristics. Our sensitivity analyses for the other outcomes were similar and are available from the first author.


Using propensity scores, this study has examined the association between WIC and the health of newborns. In general, we find modest effects of the WIC program, and our findings seem robust to changes in model specification. We also considered the possibility of unobserved confounding, estimating fixed-effects estimates and Rosenbaum bounds. Neither of these supplemental analyses suggests that our findings for maternal rating of the child's health, for spending time in the NICU, or being small for gestational age are affected by WIC.

While prior studies are far from uniform on the benefits of WIC, these effects are generally smaller than those found in earlier research. We believe several explanations are possible. First, we are able to include more and better covariates in our analyses that adjust for the confounding caused by such characteristics. As a result, earlier studies may exaggerate the effect of WIC.

Second, the handling of Medicaid status actually may inflate the estimated effect of the program. Medicaid participation presumably will be positively associated with both children's health and with WIC participation. The latter may involve a direct effect—WIC may enroll women in Medicaid. According to the standard formula for omitted variable bias, failing to include Medicaid status as a covariate may inflate the estimated effect of WIC (Greene 2008). However, that framework assumes the Medicaid itself is not confounded by unobservables (Sobel 2008). If such confounding exists, then Medicaid is a “collider,” and conditioning on it establishes a spurious correlation between unobserved determinants of Medicaid enrollment and WIC (Pearl 2000, 2005; Greenland and Pearl 2008). If women who are of poorer health enroll in Medicaid (which seems likely if providers enroll women in the program), the effect of WIC is inflated. As shown above, when we limit our analyses to Medicaid enrollees, the estimated effects are larger (though still not statistically significant).

However, some results are encouraging from a program perspective. We do find fairly substantial effects of WIC for some outcomes in the fixed-effect analyses. These effects involve two outcomes related to birth weight and preterm status, and all three are fairly substantial. They appear, however, only in the fixed-effect analysis. As a result, the policy implications drawn by the reader likely depend on how much confidence he or she has in the ignorability assumption. The most straightforward argument is that WIC reflects time-varying unobserved determinants of health, and that the fixed-effect estimates are preferred. From that perspective, the bottom line for the paper is much more positive.

However, the difference between the propensity score-based and fixed-effect estimates may be explained by relatively subtle forces. One reason the fixed- and propensity-score estimates differ is that even the latter were larger in the sample of families with two or more children. One explanation is that some predictors of family size—either observed or unobserved—may moderate the effect of WIC. In that case, the fixed-effect estimates indicate not that the propensity score estimates are wrong but are estimating a different effect. If larger families are more disadvantaged, then perhaps the program's effect is larger among those families.

Another possibility is methodological in nature. It is also possible that controlling for the fixed effects actually increased other biases, such as that involving time-varying unobservables and WIC participation. This pattern would imply that time-varying and time-invariant unobservables bias the findings in different directions. Whether that is the case is unknown. Our sensitivity analyses, however, show that the net effect of these two biases would have to be fairly strong to explain the null findings.

Other limitations of our own analyses remain. First, data on WIC were based on maternal recall. The rate of WIC participation is somewhat lower than one would anticipate. However, for recall to influence the estimated impact of WIC, the relationship between actual and reported WIC status would have to be moderated by the outcomes of interest. For example, if healthy and unhealthy women underreport their WIC participation to the same degree, the effect of WIC would not be affected. We have no evidence that this effect exists, and resolving the issue would require administrative data on participation.

Second, we might have adjusted for additional covariates, and doing so might reveal benefits of the program otherwise obscured. Indeed, many more variables are included in the data that might be used to calculate the propensity score. However, as we performed these analyses, as more covariates were added, the estimate of the program effect moved closer and closer to zero.

Third, this study has examined prenatal WIC participation, but pregnant women constitute only 11 percent of the WIC caseload (USDA 2006). Therefore, our results do not address the effectiveness of the program for postpartum women and children.

Finally, our study suffers from the limitations inherent to observational studies. However, the putative gold standard of evaluation—a randomized trial—is not possible or even desirable for a nearly universal program like WIC. A body of research considers the circumstances under which observational analyses produces findings similar to a randomized experiment (Cook, Shadish, and Wong, 2008; Steiner et al. in press). That literature suggests that a well-done observational study can produce findings quite similar to those of a randomized trial (where both exist). The key issue seems to be an understanding of the process shaping program participation and the availability of appropriate covariates that capture the key, shared determinants of program participation and the outcomes of interest.


Joint Acknowledgment/Disclosure Statement: This study was not funded by any governmental or nongovernmental agency.

Disclosures: None.

Disclaimers: None.


1This measure is taken from the passage completion test of Woodcock Johnson test-revised administered to primary caregivers in a 1997 CDS interview.

2Even though we run race-, gender-specific propensity score model, every child only has one propensity score. In calculating the kernel density estimate, we did not limit eligible cases to those of the same race and gender.

3They are grandmother's and mother's IQ, both taken from a short form IQ test with 13 sentence completion questions (taken from the Lorge-Thorndike intelligence test) administered in the 1972 interview (Veroff, McClelland, and Marquis, 1971).

4We matched the pairs using the optimal matching procedure in MatchIt (Ho et al. 2007), a software package in R.


Additional supporting information may be found in the online version of this article:

Appendix SA1: Author Matrix.

Appendix SA2. Methodological Appendix.

Appendix SA3. Details on Balancing.

Please note: Wiley-Blackwell is not responsible for the content or functionality of any supporting materials supplied by the authors. Any queries (other than missing material) should be directed to the corresponding author for the article.


  • Aaronson D. Using Sibling Data to Estimate the Impact of Neighborhoods on Children's Educational Outcomes. The Journal of Human Resources. 1997;33(4):915–46.
  • Alexander GR, Himes JH, Kaufman RB, Mor J, Kogan M. A United States National Reference for Fetal Growth. Obstetrics and Gynecology. 1996;87(2):163–68. [PubMed]
  • Allison PD. Missing Data. Thousand Oaks, CA: Sage; 2002.
  • Angrist J, Pischke S. Mostly Harmless Econometrics: An Empiricist's Companion. Princeton, NJ: Princeton University Press; 2008.
  • Bitler M, Currie J. Does WIC Work? The Effects of EIC on Pregnancy and Birth Outcomes. Journal of Policy Analysis and Management. 2005;24(1):73–91. [PubMed]
  • Brown HL, Watkins K, Hiett KA. Fetus-Placenta-Newborn: The Impact of the Women, Infants, and Children Food Supplement Program on Birth Outcome. American Journal of Obstetrics and Gynecology. 1996;174(4):1279–83. [PubMed]
  • Chatterji P, Brooks-Gunn J. WIC Participation, Breastfeeding Practices, and Well-Child Care among Unmarried, Low-Income Mothers. American Journal of Public Health. 2004;94(8):1324–27. [PubMed]
  • Cook TD, Shadish WR, Wong VC. Three Conditions under Which Experiments and Observational Studies Produce Comparable Causal Estimates: New Findings from Within-Study Comparisons. Journal of Policy Analysis and Management. 2008;27(4):724–50.
  • Currie J. U.S. Food and Nutrition Programs. In: Moffitt R, editor. Means-Tested Transfer Programs in the United States. Chicago: The University of Chicago Press; 2003. pp. 199–289.
  • Dehejia RH, Wahba S. Propensity Score-Matching Methods for Nonexperimental Causal Studies. Review of Economics and Statistics. 2002;84(1):151–61.
  • Der G, Batty GD, Deary IJ. Effect of Breast Feeding on Intelligence in Children: Prospective Study, Sibling Pairs Analysis, and Meta-Analysis. British Medical Journal. 2006;333(7575):945–50. [PMC free article] [PubMed]
  • Devaney BL. Very Low Birthweight among Medicaid Newborns in Five States: The Effects of Prenatal WIC Participation. Alexandria, VA: U.S. Department of Agriculture; 1992.
  • Foster EM, Fang GY. and the Conduct Problems Prevention Research Group. Alternative Methods for Handling Attrition in Prevention Research: An Illustration Using Data from Fast Track. Evaluation Review. 2004;28:434–64. [PMC free article] [PubMed]
  • Foster EM, Krivelyova A. Non-Ignorable Non-Response in Longitudinal Studies. In: Menard S, editor. Handbook of Longitudinal Research. 2007. pp. 185–196. Burlington, MA: Elsevier.
  • GAO. 2004. Nutrition Education : USDA Provides Services through Multiple Programs, But Stronger Linkages among Efforts Are Needed. Washington, DC: Government Printing Office.
  • Gibson-Davis CM, Foster EM. A Cautionary Tale: Using Propensity Scores to Estimate the Impact of Food Stamps on Food Insecurity. Social Service Review. 2006;80:93–126.
  • Goldenberg RL, Rouse DJ. Prevention of Premature Birth. New England Journal of Medicine. 1998;339(5):313–20. [PubMed]
  • Greene WH. Econometric Analysis. Upper Saddle River, NJ: Prentice Hall; 2008.
  • Greenland S, Pearl J. Causal Diagrams. In: Boslaugh S, McNutt L-A, editors. Encyclopedia of Epidemiology. Los Angeles, CA: Sage; 2008. pp. 149–56.
  • Heckman JJ, Navarro-Lozano S. Using Matching, Instrumental Variables, and Control Functions to Estimate Economic Choice Models. Review of Economics and Statistics. 2004;86(1):30–57.
  • Heckman JJ, Vytlacil EJ. Econometric Evaluation of Social Programs, Part I: Causal Models, Structural Models and Econometric Policy Evaluation. Handbook of Econometrics. 2007;6:4779–874.
  • Ho DE, Imai K, King G, Stuart EA. Matching as Nonparametric Preprocessing for Reducing Model Dependence in Parametric Causal Inference. Political Analysis. 2007;15(3):199–236.
  • Holmlund H. Estimating Long-Term Consequences of Teenage Childbearing —An Examination of the Siblings Approach. Journal of Human Resources. 2005;40(3):716–43.
  • Iams J. Prevention of Preterm Birth. New England Journal of Medicine. 1998;339(5):313–230. [PubMed]
  • Imbens GW. Sensitivity to Exogeneity Assumptions in Program Evaluation. The American Economic Review. 2003;93:126–32.
  • Jacknowitz A, Novillo D, Tiehen L. Special Supplemental Nutrition Program for Women, Infants, and Children and Infant Feeding Practices. Pediatrics. 2007;119(2):281–89. [PubMed]
  • Joyce T, Gibson D, Colman S. The Changing Association between Prenatal Participation in WIC and Birth Outcomes in New York City. Journal of Policy Analysis and Management. 2005;24(4):661–85. [PubMed]
  • Kahler LR, O'Shea RM, Duffy LC, Buck GM. Factors Associated with Rates of Participation in WIC by Eligible Pregnant Women. Public Health Reports. 1992;107(1):60–5. [PMC free article] [PubMed]
  • Kennedy ET, Kotelchuck M. The Effect of WIC Supplemental Feeding on Birth Weight: A Case-Control Analysis. The American Journal of Clinical Nutrition. 1984;40:579–85. [PubMed]
  • Kramer MS. The Epidemiology of Adverse Pregnancy Outcomes: An Overview. The Journal of Nutrition. 2003;133:1592S–96S. [PubMed]
  • Lazariu-Bauer V, Stratton H, Pruzek R, Woelfel ML. A Comparative Analysis of Effects of Early versus Late Prenatal WIC Participation on Birth Weight: NYS, 1995. Maternal and Child Health Journal. 2004;8(2):77–86. [PubMed]
  • Lee M. Micro-Econometrics for Policy, Program, and Treatment Effects. Oxford, England: Oxford University Press; 2005.
  • Leuven E, Sianesi B. 2003. “PSMATCH2: Stata Module to Perform Full Mahalanobis and Propensity Score Matching, Common Support Graphing, and Covariate Imbalance Testing.” Available at
  • Little RJA. Pattern-Mixture Models for Multivariate Incomplete Data. Journal of the American Statistical Association. 1993;88(421):125–34.
  • Little RJA. A Class of Pattern-Mixture Models for Normal Incomplete Data. Econometrika. 1994;81(3):471–83.
  • Little RJA, Rubin DB. Statistical Analysis with Missing Data. Hoboken, NJ: John Wiley and Sons; 1987.
  • Little RJA. Statistical Analysis with Missing Data. Hoboken, NJ: John Wiley and Sons Inc; 2002.
  • Ludwig J, Miller M. Interpreting the WIC Debate. Journal of Policy Analysis and Management. 2005;24(4):691–701. [PubMed]
  • May R, Tramp J, Bass K, Schroeder S, Kerr MA. Early Postnatal Growth of Low Birthweight Infants in the WIC Program. American Journal of Human Biology. 2001;13:261–67. [PubMed]
  • McGonagle KA, Schoeni RF. 2006. “The Panel Study of Income Dynamics: Overview and Summary of Scientific Contributions after Nearly 40 Years” [accessed on May 21, 2007, 2006]. Available at
  • Morgan SL, Winship C. Counterfactuals and Causal Inference: Methods and Principles for Social Research. Cambridge, England: Cambridge University Press; 2007.
  • Moss NE, Carver K. The Effect of WIC and Medicaid on Infant Mortality in the United States. American Journal of Public Health. 1998;88(9):1354–61. [PubMed]
  • Oliveira V, Racine E, Olmstead J, Ghelfi LM. 2002. The WIC Program: Background, Trends, and Issues. Food Assistance and Nutrition Research Report No. 27, Food and Rural Economics Division, Economic Research Service, U.S. Department of Agriculture.
  • Pearl J. Causality: Models, Reasoning, and Inference. Cambridge, England: Cambridge University Press; 2000.
  • Pearl J. Direct and Indirect Effects. Proceedings of the American Statistical Association Joint Statistical Meetings. 2005;411(20):1572–81.
  • Rosenbaum P, Rubin D. The Central Role of the Propensity Score in Observational Studies for Causal Effects. Biometrika. 1983a;70(1):41–55.
  • Rosenbaum PR. Observational Studies. New York: Springer-Verlag; 2002.
  • Rosenbaum PR, Rubin DB. The Central Role of the Propensity Score in Observational Studies for Causal Effects. Biometrika. 1983b;70(1):41–55.
  • Rosenzweig M, Wolpin K. Sisters, Siblings, and Mothers: The Effect of Teen-Age Childbearing on Birth Outcomes in a Dynamic Family Context. Econometrica. 1995;63(2):303–26. [PubMed]
  • Rossi PH. Feeding the Poor: Assessing Federal Food Aid. Washington, DC: The American Enterprise Institute; 1998.
  • Royston P. Multiple Imputation of Missing Values: Update of Ice. Stata Journal. 2005;5(4):527–36.
  • Rubin DB. Multiple Imputation for Nonresponse in Surveys. New York: John Wiley and Sons; 1987.
  • Rush D, Leighton J, Sloan NL, Alvir JM, Horvitz DG, Seaver WB, Garbowski GC, Johnson SS, Kulka RA, Devore JW, Lynch JT, Woodside MB, Shanklin DB. The National WIC Evaluation: Evaluation of the Special Supplemental Food Program for Women, Infants, and Children: Study of Infants and Children. American Journal of Clinical Nutrition. 1988;48:484–511. [PubMed]
  • Sobel ME. Identification of Causal Parameters in Randomized Studies With Mediating Variables. Journal of Educational and Behavioral Statistics. 2008;33(2):230–51.
  • StataCorp. Stata Statistical Software (Release 10) College Station, TX: Stata-Corp; 2007.
  • Stein Z, Susser M. The Dutch Famine, 1944–1945, and the Reproductive Process. I. Effects on the Six Indices at Birth. Pediatric Research. 1975;9:70–6. [PubMed]
  • Steiner PM, Cook TD, Shadish WR, Clark MH. in press. The Importance of Covariate Selection in Controlling for Selection Bias in Observational Studies.”Psychological Methods. [PubMed]
  • Stockbauer JW. WIC Prenatal Participation and Its Relation to Pregnancy Outcomes in Missouri: A Second Look. American Journal of Public Health. 1987;77:813–18. [PubMed]
  • UCLA: Academic Technology Services and Statistical Consulting Group. “Multiple Imputation Using ICE” [accessed on August 11, 2009]. Available at
  • USDA. 2006. WIC Participant and Program Characteristics 2004. Alexandria, VA: USDA.
  • USDA. 2007a. “The Food Assistance Landscape Fiscal Year 2006.” Washington, DC: U.S. Department of Agriculture, Economic Research Service.
  • USDA. “WIC Program: Average Monthly Benefit per Person”. 2007b. [accessed on May 23, 2007]. Available at
  • Verbeke G, Molenberghs G. Linear Mixed Models for Longitudinal Data. New York: Springer-Verlag; 2000.
  • Veroff J, McClelland L, Marquis K. 1971. Measuring Intelligence and Achievement Motivation in Surveys: Final Report to HEW, OEO. Ann Arbor, MI: University of Michigan.
  • Winship C, Morgan SL. The Estimation of Causal Effects from Observational Data. Annual Review of Sociology. 1999;25(1):659–707.
  • Zedlewski SR, Rader K. New Federalism, National Survey of America's Families. Washington, DC: The Urban Institute; 2005. Feeding America's Low-Income Children. Series B, No. B-65.

Articles from Health Services Research are provided here courtesy of Health Research & Educational Trust