The Pittsburgh Influenza Prevention Project (PIPP) is a prospective randomized study of the effectiveness of non-pharmaceutical interventions (NPI) conducted in ten public elementary schools in the city of Pittsburgh. The five schools in the intervention arm were provided with NPI training program including hand hygiene, hand etiquette, and cover your cough behaviors, as well as supplies of alcohol-based hand sanitizer for all classrooms. In addition, parents received educational materials about NPIs. Prior to and during the influenza season, absences were monitored and absentees were screened by phone for the presence of ILI. During the season, home visits were made to collect specimens for laboratory tests (rapid test and PCR). ILI is defined as fever (≥ 100°F) plus cough or sore throat.
The 2007–2008 influenza season started near the end of December and ended around the end of April. The epidemic curves for overall and pathogen-specific ILI episodes are shown in . Most ILI episodes that occurred prior to the influenza epidemic were not lab-tested by design. We restrict all analyses to the period of 12/31/2007 – 4/18/2008, during which a total of 380 ILI episodes were observed in 352 out of 3959 students. Further breakdown of these numbers by pathogen type and laboratory test is given in . The influenza B epidemic occurred slightly later than influenza A, and only a single person was infected by both influenza types. We assume that ILI episodes with negative laboratory results for both influenza types were caused by a single non-influenza pathogen. The distribution of pathogen types among infected students does not differ significantly across schools (not shown).
| Table 2Summary of outcomes during 12/31/2007–4/18/2008 by intervention arm in the PIPP study of NPI effectiveness in the 2007–2008 winter season. Five schools were in each intervention arm. |
We denote the non-influenza pathogen, influenza A and influenza B as pathogens 1, 2 and 3. For a study of large contact groups, it is reasonable to assume that the community-to-person infection rate is time dependent and proportional to the number of cases in the community which may be represented by the observed number of ILI onsets in the schools. Let
T0=12/31/2007 and
T1=4/18/2008 be the starting and stopping dates. We assume each student in the study population was susceptible to the three pathogens at
T0, unless infection with either influenza A or influenza B was lab-confirmed before
T0 in the same season. If vaccination and infection history before the epidemic season were known at the individual level, prior immunity could be built into the model. Let
Cm(
t) be the average number of ILI onsets corresponding to pathogen
m reported over the three days
t − 1,
t and
t + 1. This smoothing step ensures a non-zero infection rate for days with zero ILI counts by chance. We assume that the community-to-person infection hazard is
ωmCm(
t)/
m, where
m = Σ
tI(
Cm(
t)
> 0)
Cm(
t)/Σ
tI(
Cm(
t)
> 0).
The observed numbers of influenza A and B cases in this study are relatively low, on average about 5 cases per school. To avoid non-identifiability of parameters, we assume the two types of influenza share the same natural history parameters, i.e.,
a2 =
a3 and
q2l =
q3l,
l = 1, 2, 3. Based on knowledge from influenza challenge studies (
Murphy et al., 1980), we set Δ
m = 7 days for all
m. The interference matrix used for the analysis is
which reflects that infection with influenza A (pathogen 2) or influenza B (pathogen 3) generates specific immunity to the same kind of infection. For infections with the non-influenza pathogen, the short-term nonspecific immunity against all pathogens is the only choice compatible with the data. The exact duration of non-specific immunity is arbitrarily chosen and subject to sensitivity analysis.
In an initial analysis assuming that all schools shared the same ωm for all m, the estimated transmission hazard of influenza B is more than 60% higher in the intervention arm than that in the control arm with marginal statistical significance. Although the influenza B epidemic occurred later than influenza A, this counter-intuitive association is unlikely due to the depletion of susceptibles by influenza A because (1) the cross-immunity is limited, if any, between the two different influenza types, and (2) the numbers of cases are small compared to the pool of uninfected subjects. A careful examination of the data revealed that the four schools with the most frequent influenza B cases, one in the control arm and three in the intervention arm, are all located to the south of the remaining six schools (). It is likely that schools in different geographical areas were subject to different levels of risk of influenza B from the communities. As a result, we assume that the four schools in the south had a different baseline community-to-person infection hazard of influenza B and denote it by ω3b.
As the intervention was implemented at the school level, only the product
ξm =
θm
m (assuming multiplicativity) is estimable, which corresponds to the total efficacy VE
T.m = 1 −
ξm. Some features of this study warrant further consideration and yield several candidate models. First of all, the contact level of students with their home community during the holidays in which the schools were closed may differ from that during regular school days. A possible solution is to assume the community-to-person infection hazard of pathogen
m is
ρωm during holidays and
ωm during school days, where
ρ ![[set membership]](/corehtml/pmc/pmcents/x2208.gif)
(0, ∞). Intuitively, one would expect
ρ > 1. For a susceptible individual absent during school days for personal reasons,
ρωm also applies. Secondly, within-school contacts tend to be more frequent within grades compared to between grades. To account for this extra clustering, we may let the transmission hazard be
γm within school but between grades and
ηγm within grade for pathogen
m, with
η > 0. We used the following three models to analyze the data:
- Model I : ρ = 1, and η = 1.
- Model II : ρ is unknown, but η = 1.
- Model III : both ρ and η are unknown.
A convenient criterion for model selection in the Bayesian paradigm is the deviance information criterion (DIC) (
Spiegelhalter et al., 2002). However, minimal exploration has been done with the DIC in missing data problems (
Celeux et al., 2006;
Daniels and Hogan, 2008). Two proposals for the DIC in models with missing data are the DIC based on the observed data likelihood and the DIC based on the full data likelihood. The former is very difficult to compute for the models proposed here as the observed data likelihood cannot be expressed in closed form; the latter is computationally more tractable but has not been well studied. Alternatively, a Reversible-Jump MCMC algorithm (
Green, 1995) could be used to account for uncertainty about the underlying model. Here we compare models based on posterior credible sets due to the nesting structure of the three models under consideration.
As there are 395 students for whom grade is unknown, an extra data augmentation step is necessary in Model III to incorporate these students into the analysis. Note that there are 10 schools, each with 6 grades, in the PIPP study. We assume the grade of individual
i in school
s follows a multinomial distribution with probability vector
ξs = (
ξs1, …,
ξs6)′, where

,
s = 1, …, 10. We specify a Dirichlet(
16×1) prior for
ξs,
s = 1, …, 10. Let
ysg is the number of individuals in grade
g of school
s. The following term for grade assignment,

, is appended to the full joint probability in (
8). Assuming that grades are MAR, in the MCMC algorithm we add the following: (1) sample
ξs from Dirichlet(
ys1 + 1, …,
ys6 + 1),
s = 1, …, 10; and (2) sample each missing grade
Gi of individual
i in school
si from the conditional probability
if individual
i had no ILI; otherwise, sample
Gi together with (
vi,
i) from
where
Li is the number of all possible values of the triple (
vi,
i,
Gi). Slightly different from (
11), we have

instead of

because here the exposure history of individual
i depends on his/her grade
Gi.
The estimates for
η, 1.91 (95% CS:0.18, 9.4), in model III do not provide strong evidence for
η > 1, whereas those for
ρ, 2.05 (95% CS:1.31, 3.01) in model II and 2.02 (95% CS:1.36, 2.91) in model III, do for
ρ > 1. Therefore, we chose model II for the subsequent analyses. presents results in the left and the right panels assuming that the RIC mode is located at the end of day
ik and of day
ik + 1, respectively, with the left panel as our primary results. We do not find statistical significance for the effectiveness of the NPI intervention in preventing influenza-related ILI. However, the NPI intervention tends to reduce the risk of symptomatic infection with influenza A by 32% (95% CS:–13%, 60%) and the risk of symptomatic infection with the non-influenza pathogen by 27% (95% CS:2%, 45%). The intervention showed no effect on the risk of symptomatic infection with influenza B.
| Table 3Analysis of the PIPP Study with influenza A and influenza B epidemics in the 2007–2008 winter season. Posterior medians and 95% credible sets are presented. |
Influenza A tended to be more transmissible person-to-person than influenza B in this epidemic, the posterior median of
γ2/
γ3 being 4.73 (95% CS: 0.77, 112.33). While the two pathogens generated about the same number of cases, the average length between successive ILI onsets within schools in the control arm is 4.4 days for influenza A and 9.7 days for influenza B. Consequently, influenza B cases are far less likely to be explained by secondary infections. The estimates for the SAR in do not take into account the possibility of absence due to the disease or holidays. To obtain an estimate for the effective SAR adjusted for absence, we fixed the duration of the incubation period for influenza A and influenza B at two days, the average duration estimated for seasonal influenza. As a result, the infectious period is now fixed relative to the symptom onset day and can be represented by days 1–7 with day 2 being the symptom onset day. Let
ek be the probability of absence for day
k of the infectious period,
k = 1, …, 7, which we assume depends only on
k but not the underlying pathogen. Next, we estimate (
e1, …,
e7)′ by the proportions of absence regardless of the reason among all ILI episodes, (0.488, 0.472, 0.312, 0.375, 0.436, 0.512, 0.474)′. The effective SAR can be formulated as

,
m = 2,3, where 2/7 is the mode of the RIC. We estimate the effective SAR as 0.00093 (95% CS:0.00027, 0.0018) for influenza A and 0.00020 (95% CS:0.0000077, 0.00079) for influenza B in the PIPP study.
Yang et al. (2009) reported an effective SAR of 0.00075 (95% CI: 0.00055, 0.0010) for the novel pandemic influenza A(H1N1) strain in an outbreak that occurred in a private high school in New York City. Our estimate for the PIPP study is likely higher because elementary-school students are more susceptible than high-school students.
The community-to-person infection hazard during the holidays is more than 2 times that during school days. The four schools in the south experienced twice the community-to-person hazard of influenza B compared to the six schools in the north, but the 95% CS of the difference covers zero, possibly due to the small number of cases per school.
The estimates were not sensitive to the location of the mode of the RIC, as indicated by the similarity between the two panels in . Sensitivity to the natural history parameter setting was assessed assuming (1) a longer incubation period with Hm = 4; (2) a longer maximum infectious period with Δm = 10; and (3) a fixed one-day latent period when the incubation period is at least two days. Only mild to moderate changes were observed in the estimates for the transmission hazards, and the estimates for the intervention effects are highly robust. The key estimates are also not sensitive to moderate changes in the interference matrix. Sensitivity to the assumed prior distributions was only examined for the person-to-person transmission hazards and the intervention efficacies for the two influenza types. For each of these parameters, we replace the uniform prior with a prior that imposes high prior mass on the 1%, 10%, 50%, 90% and 99% quantiles of the posterior distribution obtained from the model using the uniform prior. Each time, the prior of only one parameter is changed. The non-flat prior is proportional to exp {−(g(x) − g(μ))2/(2σ2), where μ is the chosen quantile, σ = 2, and g(x) = logit (x − min)/(max − min) with min and max being the prior bounds of the parameter. In , we plot the posterior medians under the non-flat priors relative to the primary estimates in . A steeper slope indicates higher sensitivity. Among the hazard and efficacy parameters for influenza A and B, only the person-to-person transmission hazard of influenza B is relatively sensitive to its prior distribution.