Human mobility patterns predict divergent epidemic dynamics among cities
Abstract
The epidemic dynamics of infectious diseases vary among cities, but it is unclear how this is caused by patterns of infectious contact among individuals. Here, we ask whether systematic differences in human mobility patterns are sufficient to cause inter-city variation in epidemic dynamics for infectious diseases spread by casual contact between hosts. We analyse census data on the mobility patterns of every full-time worker in 48 Canadian cities, finding a power-law relationship between population size and the level of organization in mobility patterns, where in larger cities, a greater fraction of workers travel to work in a few focal locations. Similarly sized cities also vary in the level of organization in their mobility patterns, equivalent on average to the variation expected from a 2.64-fold change in population size. Systematic variation in mobility patterns is sufficient to cause significant differences among cities in infectious disease dynamics—even among cities of the same size—according to an individual-based model of airborne pathogen transmission parametrized with the mobility data. This suggests that differences among cities in host contact patterns are sufficient to drive differences in infectious disease dynamics and provides a framework for testing the effects of host mobility patterns in city-level disease data.
1. Introduction
Infectious diseases cause morbidity in most humans each year [1] and account for a significant portion of all yearly human mortality [2]. As the global urbanization rate continues to rise past 50%, cities will more often act as focal points for epidemics; providing venues where strangers are more likely to interact, representing the most probable locations of first detection and incurring a greater share of the casualties. Given cities’ pivotal role in the spread of infectious disease, it is important to understand why they exhibit systematic variation in the timing and severity of epidemics [3–5].
Human mobility patterns generate the proximity between individuals prerequisite for the transmission of many infectious diseases. This suggests that cities with different mobility patterns may also differ in the rate at which their inhabitants have infectious contact, leading to variation among cities in the risk of an epidemic [5–7]. Human movement patterns are heterogeneous at a wide range of scales—from within a building [8] to among countries [9–11], as evidenced by diverse sources of data, including the movements of mobile phone users [12,13], air travel patterns [9–11] and census data on commuting patterns [10,14,15]. At each scale, there appear collective mobility patterns maintained far from those predicted by homogeneous random movement. These dissipative structures [16] have the potential to create localized areas where infectious contact rates are systematically amplified [17]. Empirically reconstructed contact networks suggest systematic variation in infectious contact rates across countries, age groups and other sociodemographic factors [18].
Individual variation in rates of infectious contact can significantly alter patterns of disease spread [6,7,15,19–21] and theoretical models of disease dynamics within and among cities (both individual-based simulations [5,7,10,15,22–24] and metapopulation models [10,11,14,15,25–27]) have shown that heterogeneous contact patterns are potentially important in determining urban epidemic dynamics. However, few studies have examined whether empirical variation in intra-city mobility patterns is sufficient to drive detectable differences in epidemic dynamics among cities.
Here, we use 2006 census data on the mobility patterns of 7 225 810 workers in 48 cities to test whether cities differ enough in their mobility patterns to generate differences in their risk of an epidemic. In the first part of the paper, we quantify differences in mobility patterns among cities, using heterogeneity statistics and transportation models to examine whether cities vary systematically in the level of organization in their mobility patterns. In the second part of the paper, we use the commuting data to parametrize a basic model for the spread of an airborne pathogen in each respective city, to test whether the observed differences among cities in mobility patterns are sufficient to generate significant differences among cities in the risk of an epidemic.
2. Material and methods
We analysed data from the 2006 Canadian census [28] on the commuting patterns of every worker in 48 Canadian cities—a total of 7 225 810 individuals (see the electronic supplementary material, table S1). The data for each city are organized by census tract (CT) and record the number of workers who live in CT i and work in CT j, denoted Tij. CTs are contiguous geographical areas where 2500–8000 people reside and are typically the same area as a few city blocks. While the geographical area of a CT is influenced by residential population density, the boundaries of CT are chosen without regard to the number of individuals who travel to work there. Let
represent the number of workers who reside in CT i. Let
represent the number of workers who travel to work in CT j. Let
and σm represent the mean and standard deviation of mj in a given city. Let
represent the total number of workers in a city and N the total population of the city across all CTs that have any workers. Lloyd's ‘mean crowding’ statistic [29]
measures worker density from the perspective of workers in their workplaces: in a given city, m* is the average number of other individuals who work in the same CT as a worker chosen at random, whereas
is the average worker density in a CT chosen at random.
To further characterize inter-city differences in human mobility patterns, we compared how well the patterns for each city could be explained by alternative transportation models [30]. These models respectively describe processes that lead to different degrees of organization in human mobility. The models took the form
, where pij is the probability that an individual who resides in CT i will travel to work in CT j, and
denotes the expected number of commuters between i and j under the transportation model specified by pij. We first used a configuration model [19] for the inter-CT commuting network in each city, which makes the neutral prediction that the probability that a worker who resides in i will work in j is proportional to the total number of individual workplaces in j. The configuration model is closely related to the gravity model from transportation analysis, whose suitability for analysing mobility data is under debate ([30]; see the electronic supplementary material). The second model we examined was the newly described radiation model [30] of transportation patterns, which modulates flow between two locations by accounting for the number of potential destinations in the area between them. Both the configuration and radiation model are parameter free.
We asked if the systematic differences in mobility patterns we discovered were sufficient to cause differences in epidemic dynamics among cities by using the commuting data to parametrize a stochastic, spatially explicit, individual-based model of airborne pathogen transmission for each city (see the electronic supplementary material). Epidemic dynamics that result from home–work movements can also be modelled using recently developed metapopulation frameworks [15,25–27,31]. These models aggregate individual behaviour to consider host mobility and disease spread patterns between subpopulations. Accordingly, in the electronic supplementary material, we explore the consequences of relaxing correlations arising from the preservation of individual identities in our model.
To implement the model, we first used the commuting data to estimate the frequency of contact between each possible pair of workers in a city. We then translated contact frequency into pairwise transmission hazard using a basic model of within-host pathogen dynamics for acute infections. We did this for a range of pathogen transmissibilities (here pathogen transmissibility, λ, expresses the strain-specific ratio between within-host pathogen load and transmission hazard; log10(λ)∈{0, 0.25, 0.5, 0.75, 1}). Our model makes two assumptions: first, we assume that the spatial trajectories of humans in cities can be predicted by their home and workplace locations, which is supported by recent analyses of high-resolution data on the relocation patterns of mobile phone users [13]; second, we assume that excursions from an individual's bed or work station are governed by a stochastic process that is identically distributed across cities. This leads to transmission patterns that conform to mass action within the radius of motion of an individual but are determined by the commuting data at the scale of a city. Although in reality, cities are connected by inter-city commuting, these connections are relatively weak compared with intra-city commuting, and we are testing the prediction that cities can have differences in epidemic dynamics generated endogenously by intra-city mobility patterns. Thus, we model each city separately. If inter-city variability in commuting patterns is sufficient to generate differences among cities in epidemic risk, our model of transmission should predict different disease dynamics in different cities for the same pathogen transmissibility.
3. Results
The structure of the commuting matrix, Tij, varies markedly both within and among cities (figure 1). A striking feature in these visualizations is the appearance of star shapes in some cites. Star shapes appear where commuting flows originating from many distinct CTs are directed towards a single centralized work location. The crowding statistic m* can be used as a measure of the prevalence of star-shaped commuting patterns in a city, because its value increases with the level of aggregation in individual workplace locations [29].
Figure 1. Mobility patterns of workers in cities. The thickness and colour of edges show the number of individuals commuting between CTs. Circles are actually short edges, representing individuals who live and work in the same CT. Larger cities tend to have more highly organized commuting patterns, as measured by the average number of workers who have their workstation in the same CT as a randomly chosen worker (m*). However, cities also show marked differences in organization that are independent of population size.
We find that the average number of workers per CT, Figure 2. Systematic differences in worker mobility patterns among 48 Canadian cities. (a)
, saturates rapidly as N increases. By contrast, m*, which then measures overdispersion in individual workplace locations, exhibits a strong positive correlation with N. This indicates that workers in larger cities tend to organize into a few extraordinarily populated work areas, while maintaining the same average number of workstations per CT as smaller cities (figure 2a). In other words, the prevalence of star-shaped commuting flows in a city is only weakly correlated with the average number of people who work in a CT, but nonetheless varies strongly with total population size, leading commuting patterns in larger cities to be more highly organized around a few focal work locations.

, the mean number of workers per CT (triangles) and m*, the average number of workers in the same CT as a randomly chosen worker (circles), as a function of population size (N). The solid line shows
, the fitted relationship between m* and N. The vertical distance between the dashed lines spans
, where σ is the standard deviation of
, showing the expected absolute difference in m* (on a log scale) between two cities of the same size. The width of the shaded polygon then shows what change in N would produce that difference according to
. (b) Variance explained in each city by the configuration (squares) and radiation (diamonds) models of commuting flows.
Cities also show marked size-independent variation in m*, evident in the ratio of m* to the value predicted by a regression of m* as a function N (see the electronic supplementary material, table S2). In the 48 cities we analysed,
(hereafter ‘excess heterogeneity in mobility patterns’) ranged from 0.43 to 3.07, a 7.14-fold difference. For two cities chosen at random, the average ratio between the larger and smaller values of
is 1.55. This size-independent difference in mobility patterns is equivalent to the predicted size-dependent difference (based on the regression line in figure 2a) that would result from a 2.64-fold change in population size. Thus, differences unrelated to population size are an important component of the variation in worker mobility patterns between cities.
The configuration model explains much of the variation in commuting flow in small cities, but its performance decreases systematically with N, indicating that larger cities have increasingly highly organized mobility patterns (figure 2b). The fit of the gravity model is poorer than that of the configuration model, but it exhibits the same trend of fitting smaller cities better than larger cities (see the electronic supplementary material). The radiation model also shows systematic variation in performance: as the fit of the configuration model declines, the performance of the radiation model increases, performing relatively poorly in small cities and better in larger ones (figure 2b).
Cities with more organized commuting patterns (meaning a larger value of m*) are predicted to have a higher probability (P) of an epidemic following the introduction of a single randomly chosen infected individual (figure 3a; electronic supplementary material, table S3). This relationship persists once the effects of population size on m* and P are removed: among cities of the same size, increased excess heterogeneity in mobility patterns is predicted to cause a significant increase in the risk of an epidemic, relative to the average risk for a city of that size (figure 3b). The change in relative risk produced by increasing excess heterogeneity in mobility patterns is greater for pathogens with lower transmissibility. Less contagious pathogens also show more variability in relative risk among cities.
Figure 3. Epidemic dynamics as a function of heterogeneity in human mobility patterns. (a) Probability that a single infection will spark an epidemic in 48 cities with different levels of organization in their commuting patterns, calculated from 100 simulations for each city for transmissibilities of λ = 1 (triangles) and λ = 10 (circles). Point size is proportional to log N. Lines show logistic regression controlling for transmissibility. (b) Relative risk of an epidemic as a function of excess heterogeneity in mobility patterns. The statistical model for 
is
. Lines show linear regression controlling for transmissibility. (c) Final number infected is positively correlated with the level of heterogeneity in mobility patterns. Lines show fits of linear regression on log-transformed variables; λ = 1 (triangles, dashed line), λ = 10 (circles, solid line). (d) This effect persists when the effects of population size on F and m* are removed.
The predicted number of individuals infected by the end of an epidemic, F, also scales with the level of organization in commuting patterns (figure 3c). As with the probability of an epidemic, the influence of organized host mobility patterns on F is still significant once the effect of population size is removed (by considering the effect of excess heterogeneity in mobility patterns on excess infected—
;
; figure 3d). In summary, the simulations show that extant differences among cities in the level of organization in human mobility patterns are sufficient to significantly alter the risk and severity of an epidemic among cities. The average magnitude and variability of this effect depends on pathogen transmissibility.
4. Discussion
Larger cities depend on higher levels of organization that increase economies of scale [32,33]. Here, we show that increasing organization in cities may also have important consequences for the spread of infectious disease. Whereas epidemic models have typically assumed that human populations are identically mixed for the purposes of infectious contact, our results add to an increasing body of empirical evidence that infectious contact rates in humans vary systematically among populations [7,18,34]. Correspondingly, heterogeneities in human mobility patterns can explain more of the variability in regional epidemic data than analyses which posit identically mixed host populations [14,35,36], and recent metapopulation models of disease spread have been developed to describe recurrent host movements and retain information on individual identity [15,37,38].
An important direction for future work lies in understanding what signals mobility patterns leave in city-level epidemic data when other important factors are integrated into the analysis, such as inter-city variation in age distribution, immune history or movements within and between cities not accounted for by the commuting data. For instance, we simulated the epidemic dynamics of each city independently, but movement of individuals among cities also affects epidemic dynamics [39]. Another area for improvement is that the commuting data we used represents only the movements of working adults, but the dynamics of many respiratory infections depend on transmission among children [40]. In addition to having different levels of susceptibility, younger age groups can have different mobility and contact patterns [18]. We hypothesize, however, that the transmission model described here approximates (albeit imprecisely) the presence of children by creating contacts among working adults who reside in the same CT. In addition, it is plausible that the mobility patterns of children are similar across cities, so while transmission among children is important, the mobility patterns of workers lead to differences among cities. Also, differences in influenza dynamics among US states have been partially explained using only the movement patterns of workers [14].
Our results provide empirical support for the potential importance of contact heterogeneity at the intra-city scale and show new evidence that successfully forecasting epidemics in cities may require us to identify differences in intra-city mobility patterns among cities of similar sizes. Conversely, increasing numbers of infected in larger populations are not necessarily caused exclusively by increases in the number of potential hosts. Instead, increases in the level of heterogeneity in human mobility patterns in larger cities are sufficient in themselves to significantly increase the risk and final size of epidemics. In the face of limited infrastructure for rapidly implementing quarantine and vaccination policies to control the spread of emerging pathogens, an empirical link between human mobility patterns and disease incidence at the scale of individual cities may allow more effective containment strategies, which exploit predictable inter-city differences in the rate of disease transmission. Novel analyses that connect detailed information on human contact patterns with city-level disease data are required in order to test the importance to real epidemics of the systematic differences in mobility patterns we have described here [41].
Acknowledgements
Anurag Agrawal, Monica Geber, Giles Hooker, Matt Holden, Colin Parrish and the Theoretical Ecology Reading Group at Cornell provided comments on previous versions of this manuscript.
Funding statement
This research was supported by Canadian Institutes of Health Research grant no. PTL-97126 to B.P. and by a National Sciences and Engineering Research Council of Canada PGS-D grant to B.D.D.
Footnotes
References
- 1World Health Organization. 2008 Causes of death 2008 summary tables. Geneva, Switzerland: World Health Organization. Google Scholar
- 2World Health Organization. 2004 The global burden of disease: 2004 update. Geneva, Switzerland: World Health Organization. Google Scholar
- 3
Bartlett M . 1956Measles periodicity and community size. J. R. Stat. Soc. A 3, 493–510. Google Scholar - 4
Grenfell B, Bjornstad O& Kappey J . 2001Travelling waves and spatial hierarchies in measles epidemics. Nature 414, 716–723. (doi:10.1038/414716a). Crossref, PubMed, ISI, Google Scholar - 5
Meyers LA, Pourbohloul B, Newman M, Skowronski D& Brunham R . 2005Network theory and SARS: predicting outbreak diversity. J. Theor. Biol. 232, 71–81. (doi:10.1016/j.jtbi.2004.07.026). Crossref, PubMed, ISI, Google Scholar - 6
Taylor C, Marathe A& Beckman R . 2010Same influenza vaccination strategies but different outcomes across US cities?Int. J. Infect. Dis. 14, e792–e795. (doi:10.1016/j.ijid.2010.02.2267). Crossref, PubMed, ISI, Google Scholar - 7
Merler S& Ajelli M . 2010The role of population heterogeneity and human mobility in the spread of pandemic influenza. Proc. R. Soc. B 277, 557–565. (doi:10.1098/rspb.2009.1605). Link, ISI, Google Scholar - 8
Salathé M, Kazandjieva M, Lee JW, Levis P, Feldman MW& Jones JH . 2010A high-resolution human contact network for infectious disease transmission. Proc. Natl Acad. Sci. USA 107, 22 020–22 025. (doi:10.1073/pnas.1009094108). Crossref, ISI, Google Scholar - 9
Colizza V, Barrat A, Barthelemy M, Valleron A-J& Vespignani A . 2007Modeling the worldwide spread of pandemic influenza: baseline case and containment interventions. PLoS Med. 4, e13. (doi:10.1371/journal.pmed.0040013). Crossref, PubMed, ISI, Google Scholar - 10
Balcan D, 2009Seasonal transmission potential and activity peaks of the new influenza A(H1N1): a Monte Carlo likelihood analysis based on human mobility. BMC Med. 7, 45. (doi:10.1186/1741-7015-7-45). Crossref, PubMed, ISI, Google Scholar - 11
Bajardi P, Poletto C, Ramasco JJ, Tizzoni M, Colizza V& Vespignani A . 2011Human mobility networks, travel restrictions, and the global spread of 2009 H1N1 pandemic. PLoS ONE 6, e16591. (doi.org/10.1371/journal.pone.0016591). Crossref, PubMed, ISI, Google Scholar - 12
González MC, Hidalgo CA& Barabasi A-L . 2008Understanding individual human mobility patterns. Nature 453, 779–782. (doi.org/10.1038/nature06958). Crossref, PubMed, ISI, Google Scholar - 13
Song C, Qu Z, Blumm N& Barabási A-L . 2010Limits of predictability in human mobility. Science 327, 1018–1021. (doi:10.1126/science.1177170). Crossref, PubMed, ISI, Google Scholar - 14
Viboud C, Bjørnstad ON, Smith DL, Simonsen L, Miller MA& Grenfell BT . 2006Synchrony, waves, and spatial hierarchies in the spread of influenza. Science 312, 447–451. (doi:10.1126/science.1125237). Crossref, PubMed, ISI, Google Scholar - 15
Balcan D& Vespignani A . 2011Phase transitions in contagion processes mediated by recurrent mobility patterns. Nat. Phys. 7, 581–586. (doi:10.1038/nphys1944). Crossref, PubMed, ISI, Google Scholar - 16
Prigogine I . 1978Time, structure, and fluctuations. Science 201, 777–785. (doi:10.1126/science.201.4358.777). Crossref, PubMed, ISI, Google Scholar - 17
Barabási A-L . 2005The origin of bursts and heavy tails in human dynamics. Nature 435, 207–211. (doi:10.1038/nature03459). Crossref, PubMed, ISI, Google Scholar - 18
Mossong J, 2008Social contacts and mixing patterns relevant to the spread of infectious diseases. PLoS Med. 5, e74. (doi:10.1371/journal.pmed.0050074). Crossref, PubMed, ISI, Google Scholar - 19
Newman MEJ . 2002Spread of epidemic disease on networks. Phys. Rev. E Stat. Nonlin. Soft Matter Phys. E66, 0161128. (doi:10.1103/PhysRevE.66.016128). Google Scholar - 20
Lloyd-Smith J, Schreiber SJ, Kopp PE& Getz W . 2005Superspreading and the effect of individual variation on disease emergence. Nature 438, 355–359. (doi:10.1038/nature04153). Crossref, PubMed, ISI, Google Scholar - 21
Bansal S, Grenfell BT& Meyers LA . 2007When individual behaviour matters: homogeneous and network models in epidemiology. J. R. Soc. Interface 4, 879–891. (doi:10.1098/rsif.2007.1100). Link, ISI, Google Scholar - 22
Eubank S, Guclu H, Kumar VSA, Marathe MV, Srinivasan A, Toroczkai Z& Wang N . 2004Modelling disease outbreaks in realistic urban social networks. Nature 429, 180–184. (doi:10.1038/nature02541). Crossref, PubMed, ISI, Google Scholar - 23
Guzzetta G, Ajelli M, Yang Z& Merler S . 2011Modeling sociodemography to capture tuberculosis transmission dynamics in a low burden setting. J. Theor. Biol. 289, 197–205. (doi:10.1016/j.jtbi.2011.08.032). Crossref, PubMed, ISI, Google Scholar - 24
Fumanelli L, Ajelli M, Manfredi P, Vespignani A& Merler S . 2012Inferring the structure of social contacts from demographic data in the analysis of infectious diseases spread. PLoS Comput. Biol. 8, e1002673. (doi:10.1371/journal.pcbi.1002673). Crossref, PubMed, ISI, Google Scholar - 25
Colizza V& Vespignani A . 2007Invasion threshold in heterogeneous metapopulation networks. Phys. Rev. Lett. 99, 148701. (doi:10.1103/PhysRevLett.99.148701). Crossref, PubMed, ISI, Google Scholar - 26
Colizza V, Pastor-Satorras R& Vespignani A . 2007Reaction–diffusion processes and metapopulation models in heterogeneous networks. Nat. Phys. 3, 276–282. (doi:10.1038/nphys560). Crossref, ISI, Google Scholar - 27
Colizza V& Vespignani A . 2008Epidemic modeling in metapopulation systems with heterogeneous coupling pattern: theory and simulations. J. Theor. Biol. 251, 450–467. (doi:10.1016/j.jtbi.2007.11.028). Crossref, PubMed, ISI, Google Scholar - 28Census of Canada. 2006Census place of work (POW) custom consortium tables. See http://hdl.handle.net/10573/41533. Google Scholar
- 29
Lloyd M . 1967Mean crowding. J. Anim. Ecol. 36, 1–30. (doi:10.2307/3012). Crossref, ISI, Google Scholar - 30
Simini F, González MC, Maritan A& Barabási A-L . 2012A universal model for mobility and migration patterns. Nature 484, 96–100. (doi:10.1038/nature10856). Crossref, PubMed, ISI, Google Scholar - 31
Jesse M, Ezanno P, Davis S& Heesterbeek J . 2008A fully coupled, mechanistic model for infectious disease dynamics in a metapopulation: movement and epidemic duration. J. Theor. Biol. 254, 331–338. (doi:10.1016/j.jtbi.2008.05.038). Crossref, PubMed, ISI, Google Scholar - 32
Bettencourt LMA, Lobo J, Helbing D, Kühnert C& West GB . 2007Growth, innovation, scaling, and the pace of life in cities. Proc. Natl Acad. Sci. USA 104, 7301–7306. (doi:10.1073/pnas.0610172104). Crossref, PubMed, ISI, Google Scholar - 33
Batty M . 2008The size, scale, and shape of cities. Science 319, 769–771. (doi:10.1126/science.1151419). Crossref, PubMed, ISI, Google Scholar - 34
Chowell G, Bettencourt LMA, Johnson N, Alonso WJ& Viboud C . 2008The 1918–1919 influenza pandemic in England and Wales: spatial patterns in transmissibility and mortality impact. Proc. R. Soc. B 275, 501–509. (doi:10.1098/rspb.2007.1477). Link, ISI, Google Scholar - 35
Bharti N, Tatem AJ, Ferrari MJ, Grais RF, Djibo A& Grenfell BT . 2011Explaining seasonal fluctuations of measles in Niger using nighttime lights imagery. Science 334, 1424–1427. (doi:10.1126/science.1210554). Crossref, PubMed, ISI, Google Scholar - 36
Wesolowski A, Eagle N, Tatem AJ, Smith D, Noor AM, Snow RW& Buckee CO . 2012Quantifying the impact of human mobility on malaria. Science 338, 267–270. (doi:10.1126/science.1223467). Crossref, PubMed, ISI, Google Scholar - 37
Keeling MJ, Danon L, Vernon MC& House TA . 2010Individual identity and movement networks for disease metapopulations. Proc. Natl Acad. Sci. USA 107, 8866–8870. (doi:10.1073/pnas.1000416107). Crossref, PubMed, ISI, Google Scholar - 38
Poletto C, Tizzoni M& Colizza V . 2012Heterogeneous length of stay of hosts’ movements and spatial epidemic spread. Sci. Rep. 2, 476. (doi:10.1038/srep00476). Crossref, PubMed, ISI, Google Scholar - 39
Grenfell BT& Bolker BM . 1998Cities and villages: infection hierarchies in a measles metapopulation. Ecol. Lett. 1, 63–70. (doi:10.1046/j.1461-0248.1998.00016.x). Crossref, ISI, Google Scholar - 40
Eames KTD, Tilston NL, Brooks-Pollock E& Edmunds WJ . 2012Measured dynamic social contact patterns explain the spread of H1N1V influenza. PLoS Comput. Biol. 8, e1002425. (doi:10.1371/journal.pcbi.1002425). Crossref, PubMed, ISI, Google Scholar - 41
Tizzoni M, Bajardi P, Poletto C, Ramasco JJ, Balcan D, Goncalves B, Perra N, Colizza V& Vespignani A . 2012Real-time numerical forecast of global epidemic spreading: case study of 2009 A/H1N1pdm. BMC Med. 10, 165. (doi:10.1186/1741-7015-10-165). Crossref, PubMed, ISI, Google Scholar



