Visualization and quantitation of WRvFire virus. Initially, it was important to investigate the sensitivity of the system and to correlate the levels of bioluminescence recorded in live animals with viral loads measured in various organs by in vitro assays. Five-week-old BALB/c mice were infected with 105 PFU of WRvFire i.n. and were imaged on days 1 to 10 postinfection. Strong signals were detected at various time points in the nasal cavity, lungs, spleen, liver, and ovaries. In addition, pox development was also noted in WRvFire-infected mice (Fig. ).
To determine whether there was a direct correlation between the light detected by the camera and the amount of luciferase present in the organ, BALB/c mice were infected with WRvFire at 104, 105, or 106 PFU. Six days after infection, the mice were subjected to bioimaging. Bioluminescence in the lungs and livers was determined using the ROI tool as shown in Fig. , and the total fluxes emitted by infected organs were calculated. The animals were sacrificed, and the lungs and livers were harvested and tested individually in the luciferase reporter gene assay ex vivo. The levels of total fluxes emitted by the lungs and livers recorded in live animals were compared with the numbers of luciferase units per gram tissues obtained from the same 16 animals (a total of 32 organs) (Fig. ). Linear correlation (R2 = 0.82) was observed between the values of luciferase expression detected by in vivo and ex vivo assays, suggesting that bioimaging provided an accurate evaluation of luciferase expression in individual organs.
To confirm that WRvFire viral loads could be quantitatively assayed by bioimaging, the traditional PFU assay was employed. Twelve BALB/c mice were infected i.n. with WRvFire at 105 PFU/animal, and images of six infected mice per time point were taken 3 and 4 days postinfection before euthanasia (Fig. ). The lungs and spleens were collected, and the viral loads in these organs were determined by plaque assay. The numbers of PFU per gram of tissue detected in the organs of individual mice were plotted against the values of WRvFire expression in the same organs detected by bioimaging of live animals. The data were processed by linear regression analyses. In both organs, a very strong correlation was found between bioluminescence and the traditional PFU assay. The R2 values were 0.83 and 0.97 for lungs and spleens, respectively (Fig. ). Together, these data demonstrated that the dynamic range and the sensitivity of the bioimaging assay were similar to those of the ex vivo viral load measurements. Therefore, bioimaging could be evaluated further for its ability to predict survival after prophylactic vaccination or therapies.
Bioimaging of animals following vaccine-induced protection from lethal challenge. To determine whether protection from lethality was correlated with a reduction in bioluminescence in live animals, BALB/c mice were inoculated with the Dryvax vaccine either 2 or 3 weeks prior to challenge with WRvFire. All control mice succumbed to death within 8 days postinfection, and all mice that received Dryvax vaccine i.p. were protected (Fig. ). Images were collected daily using the IVIS instrument and were used to quantify the total flux in the nasal cavity and in the lungs of individual animals (Fig. ). In both the unimmunized and the Dryvax-immunized groups, the initial luciferase signals in the nasal cavity were on the order of 108 photons/s/cm2/sr. The signals in the unimmunized animals increased about 1.5 to 2 log units during the following days and reached a plateau between days 5 and 7 (Fig. ). In contrast, in Dryvax-immunized animals, a rapid reduction in bioluminescence was observed, with a complete loss of signals in the nasal cavity by day 4 (Fig. ). Bioluminescence in the lungs of unimmunized mice ranged between 105 and 106 photons/s/cm2/sr on day 1, increased approximately 1.5 to 2 log units, and plateaued on day 5 (Fig. ). Five mice in the group of mice immunized 3 weeks prior to challenge showed initial signals in the lungs on day 2 (5 × 106 photons/s/cm2/sr) that disappeared on day 3. No bioluminescence was detected in the lungs of the remaining Dryvax-immunized animals on day 2. The values of the daily bioluminescence measurements in the nasal cavities and in the lungs in unimmunized and in Dryvax-immunized mice were subjected to t tests (Table ). Following initial viral replication in the nasal cavity observed in both groups of mice on day 1, the differences in bioluminescence signals between the two groups of mice gradually increased, as reflected in dramatically increased t values between days 2 and 7 (Fig. and Table ). In agreement with the observed higher variability of bioluminescence in the lungs, the t statistic for the lungs were lower than for the nasal cavity. Nevertheless, the differences between unimmunized and Dryvax-immunized groups were significant starting from day 3 (t ≥ 2) (Table ). No bioluminescence was detected in the livers, spleens, and ovaries of protected animals (data not shown). These data showed that protection from lethality conferred by prophylactic Dryvax immunization was correlated with the absence of bioluminescence in the internal organs and with significantly lower bioluminescence in the nasal cavity and in the lungs of protected mice within 2 days postchallenge.
| TABLE 1.t statistic for the total fluxes in challenge-only mice compared with Dryvax-immunized micea |
Bioimaging of animals treated with VIGIV prior to WRvFire challenge. While active immunization with a potent vaccine such as Dryvax is likely to provide complete protection from morbidity and mortality, there are currently no global vaccination campaigns planned. In the event of smallpox outbreaks, active vaccination of all high-risk individuals, such as military personnel, designated emergency “first responders,” and health care providers, might not be feasible. Instead, preexposure treatments with anti-vaccinia virus immunoglobulin G, while not providing sterilizing immunity, could provide protection from morbidity and lethality. Therefore, it was important to determine whether bioimaging can detect differences between nonsurviving and surviving animals in WRvFire replication and dissemination to internal organs following prophylactic treatments with VIGIV. To this end, we performed six experiments in which protection from lethality was monitored in animals receiving placebo (PBS control) or VIGIV at doses ranging from 0.3 to 30 mg/animal 2 days before challenge. One representative experiment is shown in Fig. . BALB/c mice were inoculated with VIGIV at the indicated concentrations, followed by i.n. challenge with 1 × 105 PFU of WRvFire. The results of this experiment showed that 80% of untreated mice died between days 7 and 10 postinfection. Among the VIGIV-pretreated groups, 30 mg and 10 mg VIGIV (corresponding to 3,576 units and 1,192 units/animal, respectively) completely protected mice from lethality; VIGIV at 1 and 3 mg protected 50% of the animals, and 0.3 mg of VIGIV did not protect at all (Fig. ). Six experiments with partial protection from lethality by VIGIV were performed, and in all experiments, images of infected mice were collected (Fig. ). Total fluxes in the nasal cavities, lungs, livers, and spleens of 200 animals were recorded for 10 days (or prior to mortality) (Fig. ). In total, 147 animals survived and 53 died. Mean bioluminescence values reached maximum levels on day 5 in the lungs, nasal cavities, and livers and on day 3 in the spleens (Fig. ). The calculated t values confirmed that the differences in the total fluxes in internal organs between surviving and nonsurviving animals were significant ( Table ).
| TABLE 2.t statistic for the total fluxes in surviving (n = 147) versus nonsurviving (n = 53) animalsa |
Applying statistical methods to bioluminsecence measurements to generate models predicting lethality in VIGIV-pretreated and WRvFire-challenged animals. Statistical analysis showed that the mean levels of total fluxes recorded from the upper respiratory tract, lungs, and other internal organs were significantly different between surviving and nonsurviving animals. It was also apparent that VIGIV pretreatments did not provide complete control of virus replication in the upper respiratory tract and lungs and did not completely prevent dissemination of WRvFire to the internal organs, even in animals that survived lethal challenge. Some animals that exhibited strong bioluminescence survived, and some animals with low levels of signal died. In other words, there was a clear overlap in the bioluminescence levels between surviving and nonsurviving animals. Therefore, a direct comparison of the fluxes alone was not sufficient to predict survival or lethality with perfect accuracy.
In order to develop a reliable prediction model, we applied ROC curve analysis. To identify early markers for prediction, we used bioluminescences recorded on days 1 to 5. AUCs were determined for each organ and for each time point using total fluxes in surviving and nonsurviving animals from the experiments shown in Fig. (Table ; see Fig. S1 in the supplemental material). The AUC values for days 1 to 5 were between 0.78 and 0.85, 0.72 and 0.83, 0.66 and 0.91, and 0.80 and 0.91 for the nasal cavity, lungs, liver, and spleen, respectively (Table ). Maximum AUC values were obtained on day 5 for the nasal cavity, lungs, and liver and on day 3 for the spleen, suggesting that the greater divergence in bioluminescence between surviving and nonsurviving animals coincided with the day of maximum mean bioluminescence value for each organ (Table and Fig. ). Measurements of total fluxes in the spleens on day 3 predicted lethality with 94% sensitivity (94% of the animals that died were correctly identified as nonsurviving [true positives {TP}] based on bioluminescence above the selected threshold) and with 73% specificity (73% of the animals that survived were correctly predicted to survive [true negatives {TN}] based on observed bioluminescence that was below the selected threshold) (Table ). In the same animals, total fluxes in the liver on day 5 provided 100% sensitivity and 72% specificity, which were higher than the combined sensitivities and specificities obtained with total fluxes in the nasal cavity and lungs at any time point (Table ). This suggested that bioluminescence in the spleen on day 3 and the liver on day 5 may provide better predictive models than total fluxes recorded in the nasal cavity or lungs.
| TABLE 3.ROC analysis of bioluminescence in the internal organs of WRvFire-infected mice recorded on days 1 to 5 postinfection |
The goal of the study was to determine whether bioimaging can substitute for lethality as an endpoint for the evaluation of antismallpox vaccines and therapies. Therefore, we chose a targeted sensitivity of >90% as a primary parameter, i.e., over 90% of animals with lethal outcomes are correctly identified as TP. Assuming that the sensitivity is expected to be >90%, we sought the organ and time point that provided the highest specificity, i.e., correctly predicted surviving animals as TN if their bioluminescence was below the selected threshold and, accordingly, identified fewer surviving animals with bioluminescence above the threshold (false positives [FP]) (Table ). In this scenario, the highest specificities (the highest percentage of identified TN) were achieved with total flux measurements from day 5 in the nasal cavity, lungs, and liver and on day 3 using fluxes in the spleen, with computed specificities between 69 and 78% (Table ). Accordingly, the percentages of expected FP (calculated as 100 − specificity) were between 54 and 65% when total fluxes from day 1 were used for modeling and were reduced to 22 to 33% with total fluxes from days 3 (spleen) and 5 (all other organs) (Table ). The univariate model utilizing total fluxes from the liver on day 5 had a specificity of 78% (with 22% of surviving mice identified as FP), which was the highest rate of correct identification of surviving animals compared with other organs and time points (Table ). In summary, our data showed that if the sensitivity is set to correctly predict lethality in 90% of animals that will in fact die, measurements of total fluxes in the spleen on day 3 and the liver on day 5 postinfection will correctly predict survival in 76 to 78% of the animals that survive.
| TABLE 4.Predictability of lethality and survival outcomes using total fluxes recorded from WRvFire-infected mice |
To determine which organ at which associated time point could serve as an optimal predictor of survival, we used the same models described above and set the specificity at >90%, i.e., over 90% of animals that survive exhibit bioluminescence below the selected threshold (TN) (Table ). We then sought the organ and time point that provided the best sensitivity or most accurately identified the proportion of nonsurviving animals predicted to die (TP). Accordingly, it was expected that the same organ/time point should identify the lowest proportion of nonsurviving animals with bioluminescence below the threshold (false negatives [FN]) (Table ). When the specificity was set at >90%, total flux in the spleen on day 3 provided a sensitivity of 74%, which was higher than the sensitivities computed with total fluxes in the liver, lungs, and nasal cavity at any time point (Table ). At the same time, the univariate model utilizing total flux in the spleen on day 3 identified 26% of FN, which was the lowest rate of incorrect predictions (FN) compared with any other organ or time point. These data confirmed that the total flux in the spleen on day 3 may provide an optimal threshold for determination of survival in WRvFire-infected mice.
Multidimensional prediction of lethality. Using multiple logistic regression with stepwise variable selection, we identified a collection of six total-flux measurements that each contributed significantly to predicting lethality even after the others were taken into account: flux in the liver on days 2, 3, and 5; in the spleen on days 1 and 3; and in the nasal cavity on day 4 (Fig. ). The model with these six bioluminescence measurements had an AUC of 0.96 (Fig. ). At a sensitivity of 90%, this model had a specificity of 88%, meaning that classification based on a threshold applied to a linear combination of these six variables led to only 10% of nonsurviving animals being incorrectly predicted to survive and only 12% of surviving animals being incorrectly predicted to die.
We also performed a second multiple logistic regression with stepwise variable selection using synthetic variables derived from the bioluminescence measurements rather than the measurements themselves (Fig. ). The synthetic variables considered were the mean bioluminescence across the first 5 days postinfection for each organ, the linear rate of increase in bioluminescence for each organ over 5 days (estimated as a linear regression slope), and the maximum bioluminescence across days measured in each organ. This procedure led to a more parsimonious model, with only three variables included: mean bioluminescence in the spleen and nasal cavity and the slope of bioluminescence increase in the spleen. The model yielded an AUC of 0.93 (Fig. ). At a sensitivity of 90%, this model had a specificity of 83%, meaning that classification based on the model led to 10% of nonsurviving animals being incorrectly predicted to survive and 17% of surviving animals being incorrectly predicted to die. These data confirmed that combinations of bioluminescence recorded in several organs and at multiple time points provided the most accurate prediction model of lethality.
Weight loss is not an optimal predictor of lethality. Infection with vaccinia virus causes rapid reduction in weight, and many studies use 25% or 30% weight loss as a threshold for sacrificing vaccinia virus-infected mice (
20). We sought to develop a predictive model based on percentages of weight loss and compared it with our models that utilized bioluminescence. The weights of the same individual animals shown in Fig. were recorded on days 5, 7, and 9, and percentages of weight loss were calculated from control weights measured immediately prior to challenge (Fig. ; see Table S2 in the supplemental material). ROC analysis of percent weight loss on days 5, 7, and 9 generated AUC values of 0.73, 0.85, and 0.91 (see Fig. S2 in the supplemental material). These data suggested that on day 5, weight losses were less predictive than bioluminescence; however, reductions in weight at later time points generated accurate prediction models.
To verify whether 25% or 30% weight loss could accurately predict lethality or survival, we calculated the specificity and sensitivity for each of these thresholds. Setting the threshold on day 5 to ≥25% weight loss provided 98% specificity, meaning that 98% of surviving animals exhibited weight loss of less than 25% and thus were correctly identified as survivors at that time point (Table ). The same threshold provided a sensitivity of 11%: only 5 of 47 nonsurviving animals lost more than 25% of their body weight by day 5. Thus, a large number of nonsurviving animals (42 of 47) were incorrectly identified as survivors (FN), suggesting that although a ≥25% weight loss threshold may be an optimal predictor of survival, it does not accurately predict lethality at day 5 postinfection. The sensitivity of the model was greatly improved on days 7 (97%) and 9 (89%) utilizing the same ≥25% weight loss threshold (Table ), but at the expense of specificity, meaning a large proportion of animals were incorrectly predicted to die based on weight loss (42 and 34% FP on days 7 and 9, respectively) (see Table S1 in the supplemental material). Animals used for ROC analysis reached 30% weight reduction levels only after day 5. Setting the threshold to ≥30% weight loss generated a specificity of 84% for days 7 and 9, suggesting that, as expected, a large proportion of animals that survived until that time point did not lose more than 30% of their weight (Table ). The observed 59% sensitivity on day 7 suggested that many nonsurviving animals (41%) lost less than 30% of their weight and thus were incorrectly classified as survivors (FN) based on the selected threshold (see Table S1 in the supplemental material). These data showed that 25 or 30% weight loss thresholds provided accurate predictions of survival as early as day 5, but lethality predictions were difficult for both thresholds until day 9, when 89% sensitivity was obtained.
| TABLE 5.Prediction of lethality and survival using 25 and 30% weight loss as thresholdsa |
The limited ability of the weight loss parameter to predict lethality was further supported by the simple analysis of the numbers of mice that survived although they lost greater than 25% of their weight (see Table S2 in the supplemental material). Of the 135 animals that survived to day 14, there were a total of 57 and 45 animals that lost more than 25% of their weight by days 7 and 10, respectively (see Table S2 in the supplemental material). The numbers of animals that lost 25 to 29.9%, 30 to 34.9%, or >35% of their weight at each observation time point were not identical, as some of the animals in these groups regained weight and other animals that survived showed weight reduction at the same time point. Importantly, the eventual survival of some mice that lost more than 25%, or even more than 35%, of their weight further supports the results of ROC analysis that demonstrated the low predictive value of weight measurements.