Abstract
Understanding demographic and migrational patterns constitutes a great challenge. Millions of individual decisions, motivated by economic, political, demographic, rational and/or emotional reasons underlie the high complexity of demographic dynamics. Significant advances in quantitatively understanding such complexity have been registered in recent years, as those involving the growth of cities but many fundamental issues still defy comprehension. We present here compelling empirical evidence of a high level of regularity regarding time and spatial correlations in urban sprawl, unravelling patterns about the inertia in the growth of cities and their interaction with each other. By using one of the world's most exhaustive extant demographic data basis—that of the Spanish Government's Institute INE, with records covering 111 years and (in 2011) 45 million people, distributed among more than 8000 population nuclei—we show that the inertia of city growth has a characteristic time of 15 years, and its interaction with the growth of other cities has a characteristic distance of 80 km. Distance is shown to be the main factor that entangles two cities (60% of total correlations). The power of our current social theories is thereby enhanced.
1. Introduction
The quantitative description of social human patterns is one of the great challenges of this century. Significant advances have been achieved in understanding the complexity of city growth, urban sprawl, electoral processes and many other social systems [1–17]. One finds that the concomitant patterns can be successfully modelled, involving subjacent universal scaling properties [10,14,18,19], fundamental principles—such as the maximum entropy principle [20–24] or the minimum Fisher information [25,26]—or diffusive and aggregative mechanisms for urban sprawl [27–30]. Also, the interaction between cities (as measured by, for instance, the number of crossed phone calls [31] or human mobility [12]) displays predictable characteristics. Thus, it is plausible to conjecture that some kind of universality underlies collective human behaviour [17,23].
However, many fundamental issues still defy comprehension. Our aim in this work is to answer two question regarding city growth and human migrations: (i) is the growth of cities inertial, i.e. does the population growth in the present year depend on the growth of past years? and (ii) does the growth of a city depend on the growth of neighbouring cities, i.e. does the migration of people from one city to another exhibit spatial patterns? Millions of individual decisions, motivated by economic, political, demographic, rational and/or emotional reasons, underlie the growth rate of a city. Accordingly, one may expect some level of randomness and unpredictability. In this vein, one might think that
(i) if some inertia is present, the growth rate of the present year could be deduced from that in past years, and | |||||
(ii) if some correlation with other cities exists, the growth rate might be predicted from the rates of other cities. | |||||
Thus, the observation and detection of regular space–time patterns in urban population evolution could be viewed as constituting an important step towards understanding collective human dynamics at the macro-scale. Indeed, the parametrization of such regularities could lead to a potential improvement of the present population-projection tools and analysis [32–34].
1.1. Urban growth
The evolution of city population has been described with great success in the past by recourse to Gibrat's law, i.e. geometrical Brownian walkers obeying a dynamical equation that exhibits scale-invariance [6,7,13,19,21,23,24,35]

stands for its temporal change and vi(t) for the growth rate. One finds in the literature that this rate usually displays stochastic behaviour in the form of a Wiener process that complies with
, so that we deal with uncorrelated noise. In spite of its simplicity, this reductionist model is able to describe many of the observations reported for city-rank distributions. Indeed, this equation can be linearized by defining ui(t) = log[Xi(t)] thus obtaining
, which allows one to recover all well-known properties of regular Brownian motion [21]. Indeed, a ‘thermodynamics of urban population flows’—with the pertinent observables—can be derived following the analogy with physics presented in [23]. However, uncorrelated evolution is assumed in [23] for the sake of simplicity, which entails operating with the equivalent of a scale-free ideal gas. Such an assumption was sufficient for explaining the main properties of the macroscopic state of an ensemble of cities, but a higher level theory that would provide deeper understanding is desirable. Indeed, some sort of interaction between cities is of course to be expected, as well as some kind of inertia. The ensuing correlations are of great importance to understand the complex patters of migration and to improve our predictive power with regards to the subjacent dynamics.We present in this work empirical evidence of such correlations in the population dynamics of Spain. In §2.1, we first analyse the statistical properties of cities’ growth rates, reconfirming both proportional growth and other previous observations presented in the literature. We pass next to analyse, in §2.2, the time correlation of the growth rates, using demographic data from a time window of 111 years. We encounter a remarkably regular behaviour. We continue, in §2.3, with the analysis of inter-city correlations: instead of comparing each individual growth rate with the average in its surroundings—as found in the literature—we study here correlations of the growth rate for each particular pair of cities. This is akin to describing the raw two-body interaction between cities and is expected to be of a more fundamental nature than the just mentioned literature studies, that involve mean-field, or coarse-grained, descriptions of the interactions we are interested in. Inspired by physics, we compare the city–city correlation with the distance between them, which leads us to define a characteristic correlation distance. Finally, some discussion and conclusions are given in §3.
2. Results
An exhaustive census dataset is indeed needed, something not easy to come by. Fortunately, the Spanish Government's Institute INE [36] provides information about the population of 8100 municipalities—the smallest administrative unit—over a period of 111 years, from 1900 to 2011. They are distributed over a surface of approximately 500 000 km2 inhabited by more than 45 million people (2011). Figure 1a displays the spatial distribution of the Spanish municipalities, and figure 1b their time evolution. A typical diffusion pattern is visible. The population's arithmetic and geometric means are also plotted. The former grows with time but the later diminishes, indicating that the population has descended in a majority of towns, reflecting on the migration from countryside to large cities, a common pattern in most of the world. This diffusion process is readily discernible: one appreciates that the width of the distribution indeed grows.
Figure 1. Characteristics of the dataset. (a) Spatial distribution of Spanish municipalities. Circle's sizes are proportional to the population's logarithm. (b) Evolution of all municipalities. We also give the population per town (solid line, arithmetic mean; dashed line, geometric mean). (c) Variance of the population-change versus population for each municipality (dots). The median value (dotted line) clearly follows equation (2.2) (solid line). (Online version in colour.)
2.1. Statistical properties of growth rates
In order to analyse in more detail the underlying dynamics, we base our considerations on the developments of earlier studies [21,23,24]. It is shown there that the dynamical growth equation for city populations exhibits the general appearance

—a quantity defined only for convenience in representing the data—can be written as
Comparison with the data entails appealing to numerical time derivatives for each
. We use yearly data from 1996 till 2011 (whenever the appropriate datasets are available for each intermediate year) to generate the graph of figure 1c, which displays the
—pairs for all the Spanish municipalities. One computes


nicely fits equation (2.2), with σv = 0.0119 and σw = 0.47, respectively. Note that FSC fluctuations are larger than multiplicative ones, the later dominating, of course, for large sizes. The transition between both regimes occurs at
inhabitants.2.2. Empirical observation of inertial growth
To find whether there exists a systematic dependence between successive yearly growths (or inertia), we consider first the n-cities average and variance such that






Figure 2. Empirical space–time correlations of population growth. (a) Time correlations versus town sizes for yearly relative growths (dotted line). The shaded area represents the width determined by one standard deviation (darker shadow) and by two of them (lighter shadow). Horizontal dashed lines are just visual aids for the limits at low and high population. (b) Time correlation for the relative growth of towns populated by more that 10 000 inhabitants. Shaded areas represent widths determined by one (darker shadow) and two (lighter shadow) standard deviations, respectively. Inset: same representation, but in a log scale. The exponential fit of equation (2.9) acquires thus more visibility. (c) Comparison of widths: bivariate normal versus empirical correlations distribution. (d) Spatial correlation of Spain's municipalities’ relative growth for populations larger than 10 000 inhabitants. In black, the Lorentz shape of equation (2.12) (for α = 2) compared with the empirical mean (dots). Inset: same representation, with a log scale for the distance. (Online version in colour.)
2.3. Empirical observation of spatial correlations
We pass now to a study of the demographical entanglement between two given cities, as represented by spatial correlations. The correlation coefficient between the ith and jth city reads



Indeed, the correlation is finite at Δr = 0 and decays as ∼Δr–α for large distances. Fitting this function to the data, we obtain ar = 0.33 ± 0.02, r0 = 76 ± 10 km, and α = 1.8 ± 0.3, with a coefficient R2 of 0.9159. Instead, fixing for convenience α = 2 (that yields a Lorentz function), we get ar = 0.33 ± 0.01 and r0 = 79 ± 8 km, with R2 = 0.9156. As the concomitant two ways of fitting are indistinguishable according to the R2 coefficient, we adopt α = 2 for simplicity. As a consequence, the typical ‘demographic distance’ turns out to be (on average) of approximately 80 km, decaying with r–2at large distances. Thus, we face long-range correlations. The influences of other factors, though, make these correlations vanish at about 500 km. We use our data to compare (i) the width of c(Δr) with (ii) that expected for a bivariate normal distribution [37] (see appendix A). The empiric width is larger than the bivariate one: 0.327 versus 0.204 (figure 2c), indicative of the presence of additional, distance-independent, correlations. We deduce that the separation between towns, that is, their mutual distance, is the origin of about a 60% of the total correlation between them.
3. Discussion and conclusion
Summing up, we have demonstrated that the relative growth of a city's population exhibits both (i) inertia and (ii) correlation with the relative growth of neighbouring cities, with distance as the main variable that underlies the town–town interaction. Indeed, these patterns can be used to improve the predictive power of present techniques for demographic projection. However, further improvements are needed in order to identify the undefined correlations within the actual data whose existence we have discovered. We expect that these correlations will depend on local circumstances and also on the particular socio-economic status of each city. Indeed, economic factors such as the market area, market potential or basin of attraction, will contribute to that undefined 40% of the total correlation. One important contribution to these undefined correlations can be attributed to the fact that we use distances between cities regardless of the transportation network. It is expected that the correlation between two well-connected cities (in terms of roads, trains and/or air bridges) will be larger than that of two other cities, separated by the same distance, but without these facilities (or with natural barriers between them as seas, rivers or mountains). Even if all these effects are compensated at the macroscopic level and distance becomes a good observable as we have shown here, a more accurate microscopic determination of the fundamental behaviour of interactions should include these particular local variations, using, for example, the mean time required for travelling from one city to the other. In addition, we have implicitly treated the cities as point-like particles in the sense of idealized bodies of zero dimension—i.e. with no internal structure nor extension in space. This reductionistic ideal scenario helps us to isolate effects or mechanisms and works well at the scale of the distances studied here, but the internal structure of cities should become important at those distances where the correlation saturates (less than 10 km, as shown in the inset of figure 2d). We think that it will be interesting to understand the influence of the correlations here displayed on the morphology of the cities, and we hope that both the interplay between them and the associated mechanisms of road and diffusion dynamics studied in, e.g. [27–30], will be unravelled in the near future. In view of our results, a quantitative model for the evolution of city populations should be able to include these correlations, as well as all the other well-known features of city growth’ statistics, such as the power-law or lognormal distribution of city populations. Work on this subject is in progress.
Acknowledgements
We would like to thank the anonymous reviewers for their valuable comments and suggestions.
Funding statement
This work was partially supported by
Appendix A. Distribution of correlation coefficients
For a bivariate normal distribution, the distribution of correlation coefficients is given by

Footnotes
References
- 1
Zipf GK . 1949Human behavior and the principle of least effort. Cambridge, MA: Addison-Wesley. Google Scholar - 2
Kemeny J& Snell JL . 1978Mathematical models in the social sciences. Cambridge, MA: MIT Press. Google Scholar - 3
Marsil M& Zhang Y-C . 1998Interacting individuals leading to Zipf's law. Phys. Rev. Lett. 80, 2741–2744. (doi:10.1103/PhysRevLett.80.2741). Crossref, ISI, Google Scholar - 4
Costa Filho RN, Almeida MP, Andrade JS& Moreira JE . 1999Scaling behavior in a proportional voting process. Phys. Rev. E 60, 1067–1068. (doi:10.1103/PhysRevE.60.1067). Crossref, ISI, Google Scholar - 5
Axtell RL . 2001Zipf distribution of U.S. firm sizes. Science 293, 1818–1820. (doi:10.1126/science.1062081). Crossref, PubMed, ISI, Google Scholar - 6
Blank A& Solomon S . 2000Power laws in cities population, financial markets and internet sites (scaling in systems with a variable number of components). Physica A 287, 279–288. (doi:10.1016/S0378-4371(00)00464-7). Crossref, ISI, Google Scholar - 7
Gabaix X& Ioannides YM . 2004Handbook of regional and urban economics, vol. 4. Amsterdam, The Netherlands: North-Holland. Google Scholar - 8
Newman MEJ . 2005Power laws, Pareto distributions and Zipf's law. Contemp. Phys. 46, 323–351. (doi:10.1080/00107510500052444). Crossref, ISI, Google Scholar - 9
Newman MEJ, Barabasi AL& Watts DJ . 2006The structure and dynamics of complex networks. Princeton, NJ: Princeton University Press. Google Scholar - 10
Bettencourt LMA, Lobo J, Helbing D, Kuehnert C& West GB . 2007Growth, innovation, scaling, and the pace of life in cities. Proc. Natl Acad. Sci. USA 104, 7301–7306. (doi:10.1073/pnas.0610172104). Crossref, PubMed, ISI, Google Scholar - 11
Batty M . 2008The size, scale, and shape of cities. Science 319, 769–771. (doi:10.1126/science.1151419). Crossref, PubMed, ISI, Google Scholar - 12
González MC, Hidalgo CA& Barabási AL . 2008Understanding individual human mobility patterns. Nature 453, 779–782. (doi:10.1038/nature06958). Crossref, PubMed, ISI, Google Scholar - 13
Rozenfeld H, Rybski D, Andrade JS, Batty M, Stanley HE& Makse HA . 2008Laws of population growth. Proc. Natl Acad. Sci. USA 105, 18 702–18 707. (doi:10.1073/pnas.0807435105). Crossref, ISI, Google Scholar - 14
Um J, Son SW, Lee SI, Jeong H& Kim JB . 2009Scaling laws between population and facility densities. Proc. Natl Acad. Sci. USA 106, 14 236–14 240. (doi:10.1073/pnas.0901898106). Crossref, ISI, Google Scholar - 15
Castellano C, Fortunato S& Loreto V . 2009Statistical physics of social dynamics. Rev. Mod. Phys. 81, 591–646. (doi:10.1103/RevModPhys.81.591). Crossref, ISI, Google Scholar - 16
Adamic L . 2011Unzipping Zipf's law. Nature 474, 164–165. (doi:10.1038/474164a). Crossref, PubMed, ISI, Google Scholar - 17
Simini F, González MC, Maritan A& Barabási AL . 2012A universal model for mobility and migration patterns. Nature 484, 96–100. (doi:10.1038/nature10856). Crossref, PubMed, ISI, Google Scholar - 18
Hernando A, Villuendas D, Vesperinas C, Abad M& Plastino A . 2010Unravelling the size distribution of social groups with information theory in complex networks. Eur. Phys. J. B 76, 87–97. (doi:10.1140/epjb/e2010-00216-1). Crossref, ISI, Google Scholar - 19
Hernando A& Plastino A . 2013Scale-invariance underlying the logistic equation and its social applications. Phys. Lett. A 377, 176–180. (doi:10.1016/j.physleta.2012.10.054). Crossref, ISI, Google Scholar - 20
Baek SK, Bernhardsson S& Minnhagen P . 2011Zipf's law unzipped. New J. Phys. 13, 043004. (doi:10.1088/1367-2630/13/4/043004). Crossref, ISI, Google Scholar - 21
Hernando A, Plastino A& Plastino AR . 2012MaxEnt and dynamical information. Eur. Phys. J. B 85, 147. (doi:10.1140/epjb/e2012-30009-3). Crossref, ISI, Google Scholar - 22
Hernando A& Plastino A . 2012Variational principle underlying scale invariant social systems. Eur. Phys. J. B 85, 293. (doi:10.1140/epjb/e2012-30313-x). Crossref, ISI, Google Scholar - 23
Hernando A& Plastino A . 2012The thermodynamics of urban population flows. Phys. Rev. E 86, 066105. (doi:10.1103/PhysRevE.86.066105). Crossref, ISI, Google Scholar - 24
Hernando A, Hernando R, Plastino A& Plastino AR . 2013The workings of the maximum entropy principle in collective human behaviour. J. R. Soc. Interface 10, 20120758. (doi:10.1098/rsif.2012.0758). Link, ISI, Google Scholar - 25
Hernando A, Puigdomènech D, Villuendas D, Vesperinas C& Plastino A . 2009Zipf's law from a Fisher variational-principle. Phys. Lett. A 374, 18–21. (doi:10.1016/j.physleta.2009.10.027). Crossref, ISI, Google Scholar - 26
Hernando A, Vesperinas C& Plastino A . 2010Fisher information and the thermodynamics of scale-invariant systems. Physica A 389, 490–498. (doi:10.1016/j.physa.2009.09.054). Crossref, ISI, Google Scholar - 27
Strano E, Nicosia V, Latora V, Porta S& Barthelemy M . 2012Elementary processes governing the evolution of road networks. Sci. Rep. 2, 296. (doi:10.1038/srep00296). Crossref, PubMed, ISI, Google Scholar - 28
Masucci AP, Stalinov K& Batty M . 2013Limited urban growth: London's street network dynamics since the 18th century. PLoS ONE 8, e69469. (doi:10.1371/journal.pone.0069469). Crossref, PubMed, ISI, Google Scholar - 29
Barthelemy M, Bordin P, Berestycki H& Gribaldi M . 2013Self-organization versus top-down planning in the evolution of a city. http://arxiv.org/abs/1307.2203). Google Scholar - 30
Makse HA, Andrade JS, Batty M, Havlin S& Stanley HE . 1998Modelling urban growth patterns with correlated percolation. Phys. Rev. E 58, 7054–7062. (doi:10.1103/PhysRevE.58.7054). Crossref, ISI, Google Scholar - 31
Krings G, Calabrese F, Ratti C& Blondel VD . 2009Urban gravity: a model for inter-city telecommunication flows. J. Stat. Mech. 2009, L07003. (doi:10.1088/1742-5468/2009/07/L07003). Crossref, ISI, Google Scholar - 32
Plane DA, Henrie CJ& Perry MJ . 2005Migration up and down the urban hierarchy and across the life course. Proc. Natl Acad. Sci. USA 102, 15 313–15 318. (doi:10.1073/pnas.0507312102). Crossref, ISI, Google Scholar - 33
Willekens FJ& Drewe P . 1984A multiregional model for regional demographic projection. Demographic research and spatial policy (eds, Heide H& Willekens FJ ), pp. 309–334. London, UK: Academic Press. Google Scholar - 34ESPON. DEMIFER—Demographic and Migratory Flows Affecting European Regions and Cities. See http://www.espon.eu/main/Menu_Projects/Menu_AppliedResearch/demifer.html. Google Scholar
- 35
- 36National Statistics Institute of Spain website, Government of Spain. See www.ine.es. Google Scholar
- 37
Weisstein EW . Bivariate normal distribution. From MathWorld-A Wolfram Web Resource. See http://mathworld.wolfram.com/BivariateNormalDistribution.html. Google Scholar


