PMCCPMCCPMCC

Search tips
Search criteria 

Advanced

 
Logo of ploscompComputational BiologyView this ArticleSubmit to PLoSGet E-mail AlertsContact UsPublic Library of Science (PLoS)
 
PLoS Comput Biol. 2010 September; 6(9): e1000947.
Published online 2010 September 30. doi:  10.1371/journal.pcbi.1000947
PMCID: PMC2947978

Insights into the Evolution and Emergence of a Novel Infectious Disease

Mark M. Tanaka, Editor

Abstract

Many zoonotic, novel infectious diseases in humans appear as sporadic infections with spatially and temporally restricted outbreaks, as seen with influenza A(H5N1). Adaptation is often a key factor for successfully establishing sustained human-to-human transmission. Here we use simple mathematical models to describe different adaptation scenarios with particular reference to spatial heterogeneity within the human population. We present analytical expressions for the probability of emergence per introduction, as well as the waiting time to a successful emergence event. Furthermore, we derive general analytical results for the statistical properties of emergence events, including the probability distribution of outbreak sizes. We compare our analytical results with a stochastic model, which has previously been studied computationally. Our results suggest that, for typical connection strengths between communities, spatial heterogeneity has only a weak effect on outbreak size distributions, and on the risk of emergence per introduction. For example, if An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e001.jpg or larger, any village connected to a large city by just ten commuters a day is, effectively, just a part of the city when considering the chances of emergence and the outbreak size distribution. We present empirical data on commuting patterns and show that the vast majority of communities for which such data are available are at least this well interconnected. For plausible parameter ranges, the effects of spatial heterogeneity are likely to be dominated by the evolutionary biology of host adaptation. We conclude by discussing implications for surveillance and control of emerging infections.

Author Summary

Emerging infections are a continuing global public health issue, the most recent example being last year's ‘Swine flu’ influenza pandemic. However, for many zoonotic pathogens, some adaptation is required to cross the species barrier from an animal reservoir into humans and cause sustained transmission. Previous work has explored the relationship between the evolutionary biology of an adapting pathogen, and the epidemiology of cases that may arise before such a pathogen becomes pandemic-capable. Here, we extend this work to incorporate what is often an important host ecological feature, the spatial distribution of the host population. Many zoonoses occur away from large population centres. For example, HIV is thought to have entered the human population through bushmeat hunters in the sparsely populated jungles of Central Africa. We ask: when a pathogen is evolving to adapt for human transmission, under what circumstances does the spatial structure underlying the human population become important? We approach this question using mathematical models to explore regimes of connectedness between communities. Our results suggest that most communities are sufficiently interconnected to show no effect on the emergence process. We finish by discussing the implications of these findings for public health.

Introduction

Zoonotic emergence of novel human infections poses a significant risk to global public health. For example, the ‘Spanish flu’ pandemic of 1918 probably originated in birds and caused millions of deaths worldwide [1]. While much less virulent, the subsequent influenza pandemics of 1957, 1968 and 2009 [2], [3] are potent reminders of the capacity of the influenza virus to cross the species barrier into humans. Many other pathogens share this capacity: the SARS outbreak of 2003 [4][6] has been linked to bats and palm civets [7], [8]. In 2008, a novel arenavirus which killed four out of five patients in South Africa was linked to rodents [9].

Previous work [10], [11] has studied models of within-host evolution and between-host transmission, in which an initially poorly transmitting pathogen acquires adaptations to human hosts, following repeated zoonotic introductions until it achieves pandemic potential. These make the natural, simplifying assumption that the host population is homogeneous, so that changes in infection parameters entirely reflect adaptations in the biology of host infection. In reality, however, factors such as human contact patterns [12] and other host heterogeneity [13], [14] may also shape the risk and speed of emergence events. We concentrate here on the heterogeneity in the spatial structure of the human host population, an area which has hitherto received little attention in the context of adapting pathogens. We model spatially separated communities with varying types and strengths of interconnections, for example between a village and a city. Our aim is to study under what regimes such ‘ecological’ structure could have a strong effect on emergence, in comparison with ‘evolutionary’ factors governing the biology of infection.

In the following section we give an overview of the modelling approach. We then present new analytical results for the simple models studied previously, which ignored spatial host population structure. We use these expressions to provide useful answers to important questions: if we knew how a zoonotic pathogen would adapt to human physiology, could we anticipate its emergence? How reliable would such predictions be? Furthermore, can we predict which zoonoses will cause outbreaks which do not turn into epidemics? Next, we ask: how large does a single, finite host population have to be, for population size to have a negligible effect? We then incorporate spatial heterogeneity by separating the human host population into communities. We present a model in which a small village is connected, by human travel, with a large city as an example of the general case of two interconnected communities. We use this model to ask: how strong do community interconnections have to be for us to safely ignore the separation of a population into spatially structured communities, such as cities and villages? We review available commuting data to ask how these thresholds compare with typical human mobility patterns? We close with a discussion of implications for public health.

Materials and Methods

Modelling Evolutionary Adaptation

We build on a model of evolution and emergence originally presented by [10] in which a zoonotic pathogen infects humans, and initially has very poor onward transmissability. Thus for people who are infected by animals the average reproductive number, An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e002.jpg, is well below one (An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e003.jpg). We call this the first reproductive number An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e004.jpg for the wildtype strain. Occasionally, during such zoonotic infections, the pathogen acquires genetic changes that increase its ability to pass to other humans. During any chain of transmission the pathogen might adapt sufficiently that it achieves such ease of human-to-human transmission that An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e005.jpg and an epidemic becomes possible. Such a process can be characterised by a vector of reproductive numbers (An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e006.jpg with An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e007.jpg), and a vector of mutation probabilities (An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e008.jpg with An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e009.jpg) where An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e010.jpg denotes the number of adaptive steps necessary to reach the fully adapted strain An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e011.jpg.

In what follows we restrict our attention to the case of An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e012.jpg, An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e013.jpg and An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e014.jpg, allowing us to model two routes of adaptation with opposite and distinct characteristics while minimising the overall complexity in number of required strains. For both routes of adaptation, the first, wildtype strain has very low transmissibility, the third has pandemic potential, and the second strain has intermediate transmissibility. This intermediate transmissibility is not enough to sustain the novel pathogen within the human host population, but secondary infections by humans are possible. Thus we have An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e015.jpg. Finally, the human adapted strain has a reproductive number An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e016.jpg. Further, we assume an identical mean infectious period for all strains.

Following [11], we first distinguish two routes to adaptation: the ‘punctuated’ scenario has an evolutionary course An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e017.jpg, while the ‘gradual’ scenario has An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e018.jpg, the only difference being An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e019.jpg, the fitness of the intermediate stage.

This leads to the following SIR-model, normalized with respect to the mean infectious period,

equation image
(1)

where An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e021.jpg is the number susceptible, An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e022.jpg is the number infected with strain An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e023.jpg, and An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e024.jpg the number of recovered or removed. We do not include births and deaths as we expect a zoonotic emergence, or extinction, to happen on a much shorter timescale than the human lifespan. We translate this model to a stochastic simulation of a multitype branching process, using the Gillespie algorithm [15]. The infection is seeded in a single random, susceptible host with the wildtype strain.

In general, every introduction has only two possible outcomes: emergence or extinction. Extinction happens if the novel pathogen dies out because it fails to adapt for human transmission or just by stochastic extinction. Hence, the introduction only leads to a limited number of infectious hosts, which we refer to as the ‘outbreak size’.

Conversely, a novel pathogen of zoonotic origin emerges if it is sufficiently adapted for human transmission and begins to spread in a self-sustaining way. Formally, in an unlimited host population the cumulative number of infectious hosts is unbounded as time goes to infinity. Computationally we use a threshold of An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e025.jpg infectious hosts with the fully adapted strain to distinguish between emergence and extinction. This threshold ensures a probability of extinction less than An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e026.jpg [16]. Therefore, the number of falsely identified emergences, which would truly be extinctions, is negligibly small. Moreover, these arbitrarily small probabilities ensure that our simulation results are insensitive to the precise choice of threshold used. In situations where the host population is very small we relax our emergence threshold to smaller numbers of infectious hosts as some population sizes are below An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e027.jpg.

Probability of Emergence

In the special case An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e028.jpg and An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e029.jpg it is possible to calculate the probability of emergence per introduction into the human host population, given the evolutionary course of the pathogen, and the mutation rate with which the pathogen adapts. Our derivations start with assuming one homogeneous human host population of infinite size. This assumption can be easily relaxed as we show later. To calculate the probability of emergence, we define next event probabilities of infection An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e030.jpg, mutation An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e031.jpg, and recovery An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e032.jpg for each individual infected host, therefore the probabilities for what type of event will come next for each infectious host are

equation image
(2)

Note that in general, we can extend this adaptation process to arbitrarily many adaptive steps. The mutations are uni-directional towards the adapted strain. Using a branching-process approach similar to [10], we derive the probability of emergence per introduction as follows (see Text S1, A.1.1, for more details)

equation image
(3)

The probability of emergence can be expressed by the next event probabilities and the probability of extinction An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e035.jpg given an index infection of strain An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e036.jpg. This expression can be solved analytically for all possible routes of adaptation.

Waiting Time to Emergence

Regardless of the underlying population structure and the pathogen's biology, we can make an estimate of the number of introductions needed before an emergence arises An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e037.jpg, given the probability of emergence per introduction An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e038.jpg with An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e039.jpg (see Text S1, A.1.3, for more details)

equation image
(4)

Note that this is the average number of introductions without an emergence. The average number of introductions needed for an emergence event is An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e041.jpg.

In addition, the variance can be obtained in a similar way (see Text S1, A.1.3)

equation image
(5)

This variance leads to a standard deviation of the same order as the average number of introductions, An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e043.jpg. This makes the number of introductions before an emergence inherently unpredictable if the probability of emergence per introduction is small (An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e044.jpg).

Outbreak Size Distribution

Again, in the special case An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e045.jpg, the branching-process approach can be extended to derive the probabilities of outbreak sizes before emergence (see Text S1, A.1.2, for more details). In general, the probability of an outbreak of size An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e046.jpg is defined as An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e047.jpg with An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e048.jpg denoting the strain. The number of infected hosts to start with is denoted by An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e049.jpg. Furthermore, the overall outbreak size probability can be derived using conditional probabilities

equation image
(6)

where An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e051.jpg is the total outbreak size, and An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e052.jpg is the probability of getting An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e053.jpg patient zeros to start with in strain An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e054.jpg. The summation in the derivation of the overall outbreak size probability is over all possible subsets of infectious hosts to start with.

Incorporating Spatial Heterogeneity

We use a metapopulation model to explore the effects of spatial host heterogeneity, effectively dividing the human population into interconnected communities. As an example of the general case of spatially structured communities, we focus on a simple village - city model to approximate the spatial host heterogeneity in rural areas connected, by human mobility, to bigger cities. There are many different types of human mobility between communities such as villages and cities, including short-term commuting and long-term labour migration. Particularly in developing countries, however, information on dominant patterns is sparse. Nonetheless, anecdotal evidence from Vietnam [17], for example, suggests that short term commuting plays an important role: here, a subset of the village residents collects agricultural produce for trading in local markets in the city, and travels to the city on a daily to weekly basis. Accordingly we present a model in which the residence time of villagers in the city is typically less than the infectious period. However, in the supplementary information we also present a model incorporating migration on longer timescales (see Text S1, A.2). These two different models illustrate that our results appear qualitatively robust to different types of human movement.

As before, we have a wildtype pathogen capable of acquiring adaptations for human transmission. Assume a finite number of hosts in the village, and an effectively infinite number in the city. To allow for daily commuting, we label individuals in the city according to whether they are commuters from the village or not (neglecting commuters originating from the city and present in the village). The superscripts An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e055.jpg represent village and city inhabitants respectively, while An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e056.jpg denotes villagers in the city. Village members commute to the city at a per capita rate An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e057.jpg, and return at per capita rate An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e058.jpg. At any one time, a proportion An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e059.jpg of villagers, the commuters, are in the city with An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e060.jpg being set by An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e061.jpg and An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e062.jpg as described below. Further, we neglect susceptible village commuters acquiring infection in the city (this arises formally from the infinite number of hosts in the city). The number of village residents is fixed at An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e063.jpg, and we define the average number of commuters as

equation image
(7)

Figure 1 shows a schematic representation of this commuting model. Normalising time with respect to the mean infectious period, the governing equations are

equation image
(8)

equation image
(9)

equation image
(10)

equation image
(11)

Note that equation (11) arises from the fact that An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e069.jpg. To represent daily commuting, with an infectious period of 5 days, we set An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e070.jpg, and choose An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e071.jpg to give the required average number of commuters An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e072.jpg. To seed a wildtype infection in the village we set An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e073.jpg. In the village-city model, an emergence event is defined as having An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e074.jpg infectious hosts with the fully adapted strain in the city.

Figure 1
Schematic representation of the short-term commuting model.

Results

Validation of Analytical Results

We use the three strain model described before to study the impact of the mutation rate, An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e080.jpg, and the average reproductive number of the intermediate strain, An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e081.jpg, on the probability of emergence per introduction in a single, infinite population. We assume An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e082.jpg as an illustrative spectrum of possible mutation rates. Figure 2 shows the probability of emergence for different mutation rates and average reproductive numbers of the intermediate strain. Not surprisingly, the probability of emergence grows non-linearly with An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e083.jpg and An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e084.jpg. The probability of having no mutation in the second strain is An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e085.jpg where An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e086.jpg is the number of infected hosts with strain An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e087.jpg. While the intermediate reproductive number affects the exponent, the mutation rate has a direct influence on the base. We validate our analytical results by comparing them with the average probability of emergence of An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e088.jpg simulated emergence processes, using one homogeneous population as described in (1). Figure 2 reveals an excellent agreement between the analytical results and simulations.

Figure 2
Comparison of the probability of emergence per introduction.

Effect of Limited Host Population Size

For small host communities, the depletion of susceptible hosts can play a significant role in limiting an ongoing outbreak. What is the effect of a finite population size on these analytical results which assume an infinite host population? Figure 3A compares the simulated outbreak size distribution of different sized populations with our analytical predictions. Note that, for populations greater than An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e097.jpg, there is close agreement between numerical and analytical results. When considering populations of size An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e098.jpg or more, we do not expect population size dependence to have a substantial effect.

Figure 3
Outbreak size distribution.

Effect of Spatial Heterogeneity

The village's number of residents in our commuting model is sufficient to avoid finite size effects on the outbreak size. Furthermore, it is independent of any spatial heterogeneity. The number of residents only has a limiting effect on the outbreak size distribution. Figure 3B shows the outbreak size distribution for our short-term commuting model. The average number of commuters ranges from An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e112.jpg to An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e113.jpg. As we expect, no significant effect on the outbreak size distribution can be seen, even for An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e114.jpg. It validates our assumption of the independence of infectious hosts, necessary for a branching-process formulation, as the simulations closely match the analytical predictions. It is noteworthy here that only the biology of the novel pathogen determines the emergence process, as outbreak sizes group according to the intermediate average reproductive number An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e115.jpg. The minimal deviation for An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e116.jpg commuters is based on the fact that the effective village size is only An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e117.jpg due to the absent commuters.

In Figure 4, we extract the probability of emergence per introduction given a certain number already infected. These data are easily calculated using the outbreak size distribution and the probability of emergence per introduction. Assume an introduction has caused An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e118.jpg infectious hosts already. The probability of extinction is the cumulative probability of getting an outbreak size equal to or larger than An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e119.jpg, renormalized by all possible outcomes (extinctions and emergences) once An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e120.jpg hosts are infectious. This yields the probability of emergence per number of infected. In Figure 4, the effect of spatial heterogeneity can be seen directly. For An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e121.jpg, the village-city simulations agree very well with the analytical results assuming a single, infinite population. But for An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e122.jpg, the probability of emergence converges to approximately An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e123.jpg.

Figure 4
Probability of emergence in the city with short-term commuting.

While Figure 4 shows the probability of emergence as a function of the number infected, the actual outcome is highly unpredictable even if the probability of an event is known as the average waiting time to an emergence shows (see equations 4 and 5). It can be generalized for the probability of emergence given An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e128.jpg infectious hosts. For example, the probability of emergence given five infectious hosts in the gradual route (II) is An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e129.jpg. It follows on average every An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e130.jpg times this happens an emergence will happen. The standard deviation is An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e131.jpg, which leads to the conclusion that even if the probability is known, it is inherently unpredictable when this will actually lead to an emergence.

Spatial Homogeneity Coefficient

This confirms that a pathogen needs a sufficient connection between communities to emerge, despite its ability to cause outbreaks, regardless of the spatial structure. Hence we expect the existence of a threshold where spatial heterogeneity effectively does not matter any more. Previous research has shown that the effect of heterogeneity in spatially structured population models depends on the interconnectivity with a threshold effectively allowing the pathogen to spread between communities [18][20]. Our approach allows new insights, as we do not need to specify the actual number of infectious hosts migrating to a new community. We measure connectivity between communities in terms of the average number of commuters An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e132.jpg for which rich empirical datasets can be found. Figure 5 presents illustrative examples of empirical data of commuting patterns in different parts of the world. Most data has been collected by Offices of Statistics of five countries on three continents [21][25]. A further, two independent studies have been used to estimate commuting patterns of towns in Indonesia [26] and China [27].

Figure 5
Data of commuting patterns in different parts of the world.

We next attempt to quantify the regimes in An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e138.jpg for which spatial heterogeneity may be neglected. We approach this question using a simple analytical derivation for the effect of spatial heterogeneity, which considers only the adapted strain. Assume a connected community such as the village in our village-city model, with a fully adapted pathogen introduced into the village. Given an emergence and epidemic in the village, the probability that this causes an emergence and epidemic in the city is

equation image
(12)

Therefore, An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e140.jpg is a spatial homogeneity coefficient, measuring the impact of spatial heterogeneity on an emergence process. It ranges from An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e141.jpg, leading to two isolated communities, to An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e142.jpg, effectively removing any spatial heterogeneity and forming one homogeneous population. An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e143.jpg is the fraction of the village residents becoming infectious. It can be derived using [28]

equation image
(13)

The spatial homogeneity coefficient depends only on the connection strength expressed in commuters An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e145.jpg and the average reproductive number An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e146.jpg of the fully adapted strain. Though it only considers the fully adapted strain, we expect this coefficient to be a good approximation for a multi-strain model as the vast majority of infectious hosts will transmit the fully adapted strain in the case of an emergence.

Figure 6 gives an overview of the influence of spatial heterogeneity as a function of An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e147.jpg. Effectively, spatial heterogeneity is negligible once a critical number of ten commuters connect the two communities. This is a very low threshold, and empirical data shows that the probability of having a community with less than the critical number of ten commuters is approximately An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e148.jpg for all our data combined.

Figure 6
Deviation between simulated and analytical predicted probability of emergence.

As illustrated by the close fit between analytical and numerical results in figure 6, there is only a small error in the analytical expression arising from neglecting infections with mal-adapted strains. This error is greatest for the gradually adapting pathogen, because an intermediate strain with An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e155.jpg tends to cause larger outbreaks than one with An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e156.jpg. Nevertheless, the deviation remains small.

In light of this agreement, how does the critical average number of commuters vary with the adapted reproductive number? While for An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e157.jpg ten average commuters are sufficient to dissolve spatial heterogeneity, this changes dramatically for smaller average reproductive numbers (see Figure 7). If a well-adapted strain is only just pandemic capable (i.e. An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e158.jpg just above An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e159.jpg) villages with only ten commuters are only An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e160.jpg likely to seed an epidemic in their local city, and spatial structure becomes important again. For example, the critical number of commuters is close to An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e161.jpg for An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e162.jpg. For An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e163.jpg the spatial homogeneity coefficient is approximately An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e164.jpg for one commuter. This agrees with what we find in Figure 4 using simulated results.

Figure 7
Impact of spatial heterogeneity on disease transmission between communities (I).

Discussion

In this article we first present analytical results to calculate epidemiological parameters of a novel disease, adapting to humans. We explore the influence of spatial host contact structure, and validate our result with stochastic simulations of simple village-city models as an example of interconnected communities within a spatially structured population. Our study reveals that for plausible parameter ranges, spatial heterogeneity only has very limited impact on the probability of emergence, as well as the outbreak size distribution. Neither a change in strength of spatial heterogeneity (e.g. number of commuters), nor in its quality (e.g. short term versus long term commuting) shows a significant influence. Our results suggest that only the most remote rural communities would be subject to epidemiological isolation. In particular, the available empirical data suggests that communities tend to be highly interconnected with relatively high connection strengths. Of course, it is the most remote communities of the world for whom we have the least relevant data. More empirical research on spatial heterogeneity is needed to form a better understanding of its effect, and this need is greatest in developing parts of the world.

In addition, population size becomes an important factor only when that population is relatively small An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e168.jpg fewer than An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e169.jpg individuals. Only a small number of infectious hosts are actually involved in the emergence process, which relates to the small reservoir of susceptibles needed for a successful emergence. Moreover, biological processes such as the speed of evolution and the adaptive route show a strong influence on the overall emergence process. We show that epidemiological parameters such as the outbreak size group according to the evolutionary route. Previous research has shown the effect of the pathogen's route of evolutionary adaptation and mutation rate on the probability of emergence per introduction [10], [11]. Our theoretical derivation of the probability of emergence extends this and offers the benefit of being analytically solvable for any possible route of adaptation and any mutation probabilities. We note here that previous work has highlighted the role of other, significant types of heterogeneity in emergence of a novel infection. For example, [29] describe the effect of the pathogens life history, such as the length of infection, on the emergence of a novel pathogen. Further, heterogeneity in human-to-human transmission within a population may have an influence on the course and probability of emergence and outbreaks [13], [14], usually lowering the probability of emergence. While we have concentrated here on simple types of spatial heterogeneity, a significant question for future research is the role of mixed heterogeneities, for example spatially structured populations with additional heterogeneity in the human-to-human transmission.

We also find that the waiting time for an outcome of a novel pathogen's introduction is highly unpredictable, even if the probability for such an event is known. Conversely, this means that an estimate of the underlying epidemiological parameters from observed data will be highly uncertain. Unfortunately, a large number of observations will be necessary to achieve confidence in the parameters, and even a large number of introductions gone extinct do not rule out the possibility of emergence for a pathogen. We came to a similar result [11] using a measurement on the upper bound of the probability of emergence. Moreover, the probability of emergence given a certain number of infectious hosts can be surprisingly low. Even a comparatively large number of infectious hosts can end in extinction, especially for low mutation rates and intermediate-stage average reproductive numbers just below one.

Our work has relevance for important public health issues: if a novel disease is detected in a rural setting, and it appears to be spreading, how feasible is it to contain infection by restricting movements to and from the village? Our results suggest that first, an infeasibly tight level of quarantine would be required for any chance of containment, corresponding to enforcing a low level of An external file that holds a picture, illustration, etc.
Object name is pcbi.1000947.e170.jpg in Figure 6. To all intents and purposes isolation would have to be absolute to be effective. In most circumstances such extreme intervention would not be acceptable. Second, given typical mobility patterns, it is likely that once there is a detectable number of cases in the village, there may already be a significant number of cases outside of it. Therefore quarantining interventions are likely to come too late.

Our work raises important questions for future research: where should surveillance be focused to detect an emergence as early as possible, especially if resources are limited? Given emergence of a novel infection in a rural setting, how much time can we buy through limiting travel to and from major urban centres? These and other questions will undoubtedly benefit from more systematic studies of emergence in the context of population distributions. Nonetheless, theoretical models such as those presented here can offer useful, fundamental insights to guide such studies.

Supporting Information

Text S1

Supplementary material with figures for ‘Insights into the Evolution and Emergence of a Novel Infectious Disease’.

(0.16 MB PDF)

Footnotes

The authors have declared that no competing interests exist.

Ruben J. Kubiak gratefully acknowledges funding for this work from the James Martin 21st Century School (http://www.21school.ox.ac.uk) and the EPSRC (http://www.epsrc.ac.uk); Nimalan Arinaminpathy gratefully acknowledges funding from the James Martin 21st Century School; and Angela R. McLean is a Senior Research fellow of All Souls College (http://www.all-souls.ox.ac.uk) whose support is gratefully acknowledged. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

1. Reid AH, Fanning TG, Hultin JV, Taubenberger JK. Origin and evolution of the 1918 Spanish influenza virus hemagglutinin gene. P Natl Acad Sci USA. 1999;96:1651–1656. [PubMed]
2. Horimoto T, Kawaoka Y. Influenza: lessons from past pandemics, warnings from current incidents. Nat Rev Micro. 2005;3:1740–1526. [PubMed]
3. Novel Swine-Origin Influenza A (H1N1) Virus Investigation Team. Emergence of a novel swine-origin influenza A (H1N1) virus in humans. New Engl J Med. 2009;360:2605–2615. [PubMed]
4. Brockmann D, Hufnagel L, Geisel T. SARS: a case study in emerging infections. Oxford: Oxford University Press, 3rd edition; 2005.
5. Meyers LA, Pourbohloul B, Newman MEJ, Skowronski DM, Brunham RC. Network theory and SARS: predicting outbreak diversity. J Theor Bio. 2005;232:71–81. [PubMed]
6. Skowronski DA, Astell C, Brunham RC, Low DE, Petric M, et al. Severe acute respiratory syndrome (SARS): A year in review. Annu Rev Med. 2005;56:357–381. [PubMed]
7. Guan Y, Zheng BJ, He YQ, Liu XL, Zhuang ZX, et al. Isolation and characterization of viruses related to the SARS coronavirus from animals in Southern China. Science. 2003;302:276–278. [PubMed]
8. Li WD, Shi ZL, Yu M, Ren WZ, Smith C, et al. Bats are natural reservoirs of SARS-like coronaviruses. Science. 2005;310:676–679. [PubMed]
9. Briese T, Paweska JT, McMullan LK, Hutchison SK, Street C, et al. Genetic detection and characterization of Lujo virus, a new hemorrhagic fever-associated arenavirus from Southern Africa. PLoS Pathog. 2009;5:8. [PMC free article] [PubMed]
10. Antia R, Regoes RR, Koella JC, Bergstrom CT. The role of evolution in the emergence of infectious diseases. Nature. 2003;426:658–661. [PubMed]
11. Arinaminpathy N, McLean AR. Evolution and emergence of novel human infections. Proc R Soc B. 2009;276:3937–3943. [PMC free article] [PubMed]
12. Woolhouse ME, Haydon DT, Antia R. Emerging pathogens: the epidemiology and evolution of species jumps. Trends Ecol Evol. 2005;20:238–244. [PubMed]
13. Yates A, Antia R, Regoes RR. How do pathogen evolution and host heterogeneity interact in disease emergence? Proc R Soc B. 2006;273:3075–3083. [PMC free article] [PubMed]
14. Lloyd-Smith JO, Schreiber SJ, Kopp PE, Getz WM. Superspreading and the effect of individual variation on disease emergence. Nature. 2005;438:355–359. [PubMed]
15. Gillespie DT. Exact stochastic simulation of coupled chemical reactions. J Phys Chem. 1977;81:2340–2361.
16. May RM, Gupta S, McLean AR. Infectious disease dynamics: what characterises a successful invader? Philos T R Soc Lon B. 2001;356:901–910. [PMC free article] [PubMed]
17. Thanh HX, Anh DN, Tacoli C. Livelihood diversification and rural-urban linkages in Vietnam's Red River Delta. 2005. IIED Working Paper Series on Rural-Urban Interactions and Livelihood Strategies.
18. Watts DJ, Muhamad R, Medina DC, Dodds PS. Multiscale, resurgent epidemics in a hierarchical metapopulation model. P Natl Acad Sci USA. 2005;102:11157–11162. [PubMed]
19. Vazquez A. Epidemic outbreaks on structured populations. J Theor Bio. 2007;245:125–129. [PubMed]
20. Colizza V, Vespignani A. Epidemic modeling in metapopulation systems with heterogeneous coupling patter: theory and simulations. J Theor Bio. 2008;251:450–467. [PubMed]
21. Instituto Brasileiro de Geografia e Estatística, Brazil. 2000. Censos demográficos 2000.
22. US Census Bureau, United States of America. 2000. Census 2000.
23. Office for National Statistics, United Kingdom. 2001. Census 2001.
24. Census and Statistics Department, Hong Kong. 2001. 2001 population census.
25. Statistics Bureau, Japan. 2005. 2005 population census.
26. Leinbach TR. Rural transport and population mobility in Indonesia. J Dev Areas. 1983;14:349–364. [PubMed]
27. Xu H. Commuting town workers: the case of Quinshan, China. Habitat Int. 2001;25:35–47.
28. Anderson RM, May RM. Infectious diseases of humans: dynamics and control. Oxford: Oxford University Press, 1st edition; 1991.
29. André JB, Day T. The effect of disease life history on the evolutionary emergence of novel pathogens. Proc R Soc B. 2005;272:1949–1956. [PMC free article] [PubMed]

Articles from PLoS Computational Biology are provided here courtesy of Public Library of Science