The growth equation of cities

Verbavatz, Vincent; Barthelemy, Marc

doi:10.1038/s41586-020-2900-x

The growth equation of cities

Article
Published: 18 November 2020

Volume 587, pages 397–401, (2020)
Cite this article

Download PDF

Access provided by Autonomous University of Puebla

From

View current issue Submit your manuscript

The growth equation of cities

Download PDF

11k Accesses
52 Citations
77 Altmetric
6 Mentions
Explore all metrics

Abstract

The science of cities seeks to understand and explain regularities observed in the world’s major urban systems. Modelling the population evolution of cities is at the core of this science and of all urban studies. Quantitatively, the most fundamental problem is to understand the hierarchical organization of city population and the statistical occurrence of megacities. This was first thought to be described by a universal principle known as Zipf’s law^1,2; however, the validity of this model has been challenged by recent empirical studies^3,4. A theoretical model must also be able to explain the relatively frequent rises and falls of cities and civilizations⁵, but despite many attempts^6,7,8,9,10 these fundamental questions have not yet been satisfactorily answered. Here we introduce a stochastic equation for modelling population growth in cities, constructed from an empirical analysis of recent datasets (for Canada, France, the UK and the USA). This model reveals how rare, but large, interurban migratory shocks dominate city growth. This equation predicts a complex shape for the distribution of city populations and shows that, owing to finite-time effects, Zipf’s law does not hold in general, implying a more complex organization of cities. It also predicts the existence of multiple temporal variations in the city hierarchy, in agreement with observations⁵. Our result underlines the importance of rare events in the evolution of complex systems¹¹ and, at a more practical level, in urban planning.

Revisiting Urban Economics for Understanding Urban Data

The Geometric Origins of Complex Cities

Modeling the dynamics and spatial heterogeneity of city growth

Article Open access 19 November 2022

Main

Constructing a science of cities has become a crucial task for our societies, which are growing ever more concentrated in urban systems. Better planning could be achieved with a better understanding of city growth and how it affects society and the environment¹². Various important aspects of cities such as urban sprawl, infrastructure development or transport planning depend on the population evolution over time, and multiple theoretical attempts have been made in order to understand this crucial phenomenon.

Growth of cities and Zipf’s law

So far, most research in city growth has been done with the idea that the stationary state for a set of cities is described by Zipf’s law. This law is considered to be a cornerstone of urban economics and geography³, and states that the population distribution of urban areas in a given territory (or country) displays a Pareto law with exponent equal to 2 or, equivalently, that the city populations sorted in decreasing order versus their ranks follow a power law with exponent 1. This alleged regularity through time and space is probably the most striking fact in the science of cities and for more than a century has triggered intense debate and many studies^{1,2,5,10,13,14,15,16,17,18,19,20,21,22,23,24,25,26,27,28}. This result characterizes the hierarchical organization of cities, and in particular it quantifies the statistical occurrence of large cities. Zipf’s law implies that in any country, the city with the largest population is generally twice as large as the next largest, and so on. It is a signature of the very large heterogeneity of city sizes and shows that cities are not governed by optimal considerations that would lead to one unique size but, on the contrary, that city sizes are broadly distributed and follow some sort of hierarchy¹⁶. The empirical value of the Pareto exponent informs us about the hierarchical degree of a system of cities: a large value of the exponent corresponds to a more equally distributed population among cities, and, vice versa, for small exponent values the corresponding system of cities is very heterogeneous with a few megacities.

Studies in economics have suggested that Zipf’s law is the result of economic shocks and random growth processes^6,7,8. Gabaix¹⁰ proved in a seminal paper that Gibrat’s law of random growth⁹—which assumes a population growth rate independent of the size of the city—can lead to a Zipf law with exponent 1, at the expense of the additional and untested assumption that cities cannot become too small. This model remains the most accepted paradigm to understand city growth. Since then, it has also been understood using simplified theoretical models (without any empirical arguments) that migrations from other cities or countries are determinant in explaining random growth²⁹. However, although most of these theoretical approaches focus on explaining Zipf’s law with exponent 1, recent empirical studies^3,4, supported by an increasing number of data sources, have questioned the existence of such a universal power law and have shown that Zipf’s exponent can vary around 1 depending on the country, the time period, the definition of cities used or the fitting method^13,21,30,31 (we illustrate this in Extended Data Fig. 1, showing that no universal result for the population distribution is observed), leading to the idea that there is no reason to think that Zipf’s law holds in all cases³².

Beyond understanding the stationary distribution of urban populations lies the problem of their temporal evolution. As already noted⁵, the huge number of studies regarding population distribution contrasts with the few analyses of the time evolution of cities. As discussed in that same work⁵, cities and civilizations rise and fall many times on a large range of time scales, and Gabaix’s model is both quantitatively and qualitatively unable to explain these specific chaotic dynamics.

Therefore, a model able to simultaneously explain observations about the stationary population distribution and the temporal dynamics of systems of cities is missing. In particular, we are not at this point able to identify the causes of the diversity of empirical observations about the hierarchical organization of cities, the occurrence of megacities, and the empirical instability in city dynamics seen in the births and deaths of large cities on short time scales. In this respect, we do not need just a quantitative improvement of models but a shift of paradigm.

In this paper, we show that city growth is dominated by rare events—namely large interurban migratory shocks—rather than by the average growth rate. Rare but large positive or negative migratory flows can destabilize the hierarchy and the dynamics of a city on very short time scales, leading to the disordered dynamics of cities observed throughout history. On the basis of an empirical analysis of migrations flows in four countries, in the following we derive a stochastic equation of city growth that is able to explain empirical observations of the statistics and temporal dynamics of cities.

Deriving the equation of city growth

To understand city growth, we require a robust, bottom-up approach, starting from elementary mechanisms governing the evolution of cities. Without loss of generality, the growth dynamics of a system (such as a country) of cities i of size S_i can be decomposed into the sum of an interurban migration term between metropolitan areas and an ‘out-of-system’ term that combines other sources of growth: natural growth (births and deaths) and migrations that do not occur within the system of cities (international migrations and exchanges with smaller towns). We denote by N(i) the set of neighbours of city i, that is, those that exchange a non-zero number of inhabitants. Using the four recent datasets of migrations that we use here (USA, 2012–2017; France, 2003–2008; England and Wales (for simplicity, UK), 2012–2016; Canada, 2012–2016) we find for France and the USA that |N(i)| ∝ ${S}_{i}^{\gamma }$, where γ ≈ 0.5 (Extended Data Fig. 2). The British and Canadian datasets are fully connected, leading to γ = 0. The time (t) evolution of the population size S_i can then be written as

$$\frac{\partial {S}_{i}}{\partial t}={\eta }_{i}{S}_{i}+\sum _{j\in N(i)}{J}_{j\to i}-{J}_{i\to j},$$

(1)

where the quantity η_i is a random variable accounting for the ‘out-of-system’ growth of city i; the data show that η_i is Gaussian-distributed (Extended Data Fig. 3). The flow J_i→j is the number of individuals moving from city i to city j during a period of time dt. If there is an exact balance of migration flows ( J_i→j = J_j→i), the equation becomes equivalent to Gibrat’s model⁹, which predicts a log-normal distribution of populations.

Starting from this general equation (1) is very natural as it amounts to writing the balance of births, deaths and migrations; however—as is often the case when using very general, basic equations—it is difficult to use for making predictions. Simplifications of this equation have been proposed²⁹, wherein various assumptions (such as the gravity model for migration, for example) lead to Gibrat’s model, but miss the very large fluctuations of migrations—as we will see below, this is a crucial ingredient. We also note that this general stochastic equation (1) was discussed in another context³³ and is a central object in the statistical physics of disordered systems. With regard to cities, the migration flow J_i→j depends a priori (and at least) on the populations S_i and S_j and the distance d_ij between cities i and j. Using a standard gravitational model^34,35, we show that for France and the USA, the dominant contribution to J_i→j comes from the populations and that the role of distance appears as a second-order effect (see Supplementary Information for details). This result suggests that the J_i→j term can be represented by a variable of the form ${I}_{0}{S}_{i}^{\mu }{S}_{j}^{\nu }{x}_{ij}$, where the random variables x_ij have an average equal to 1 and encode the noise as well as multiple other effects, including distance. We denote by I_ji = J_i→j/S_i the probability per unit time and per capita of moving from city i to city j. The left panel of Fig. 1 shows that the ratio I_ij/I_ji versus the ratio of populations S_i/S_j displays, on average, linear behaviour. This implies that μ = ν, and that we have, on average, a sort of detailed balance ⟨ J_i→j⟩ = ⟨ J_j→i⟩ (where the angled brackets here denote the average over cities), but that crucially, fluctuations are non-zero. More precisely, if we denote by ${X}_{ij}=(\,{J}_{j\to i}\,-{J}_{i\to j})/{I}_{0}{S}_{i}^{\nu }$ , we observe that these random variables X_ij are heavy-tailed—that is, they are distributed according to a broad law that decreases asymptotically as a power law with exponent α < 2 (see Supplementary Information for more details and empirical evidence). The sum in the second term of the right-hand side of equation (1) can then be rewritten as

$$\sum _{j\in N(i)}{J}_{j\to i}\,-{J}_{i\to j}\,={I}_{0}{S}_{i}^{\nu }\sum _{j\in N(i)}\,{X}_{ij},$$

(2)

and, according to the generalized version of the central limit theorem³⁶ (assuming that correlations between the variables X_ij are negligible), the random variable

$${\zeta }_{i}=\frac{1}{|N{(i)|}^{1/\alpha }}\sum _{j\in N(i)}{X}_{ij}$$

follows a Lévy stable law L_α with parameter α (for large enough N(i)). This is empirically confirmed in Fig. 1 (right panel): French, US, British and Canadian data are better fitted by a Lévy stable law than by any other distribution and the estimates of α (using different methods) are given in Table 1. We are led to the conclusion that the growth of systems of cities is governed by a stochastic differential equation with two independent noises, which reads as follows

$$\frac{\partial {S}_{i}}{\partial t}={\eta }_{i}{S}_{i}+D{S}_{i}^{\beta }{{\zeta }}_{i},$$

(3)

where D ∝ I₀, β = ν + γ/α and η_i is a Gaussian noise with mean the average growth rate r and a dispersion σ. This is the growth equation of cities that governs the dynamics of large urban populations; it is our main result here. In equation (3) both noises are uncorrelated and multiplicative, and Itô’s convention here seems to be more appropriate than Stratonovich’s³⁷ because population sizes at time t are computed independently from interurban migration terms at time t + dt. Estimates for the various parameters together with the prediction for the value of β are given in Table 2.

Table 1 Estimates of parameter α

Full size table

Table 2 Estimates of parameters for the four datasets

Full size table

The central limit theorem, together with the broadness of interurban migration flow, enables us to show that many details in equation (1) are unnecessary and that the dynamics can be described by the more universal equation (3). We conclude that starting from equation (1) is thus less useful than previously thought. The importance of migrations has been previously noted²⁹, but in that work the authors derived a stochastic differential equation with multiplicative Gaussian noise, which we show here to be incorrect: we indeed have a first term with multiplicative noise but also, crucially, we obtain another term that is a multiplicative Lévy noise with zero average. This is a major theoretical shift that is not included in previous studies on urban growth and which has many crucial implications in understanding both the stationary and dynamic properties of cities.

No stationary distribution for cities

Equation (3) governs the evolution of urban populations and analysing it at large times gives indications about the stationary distribution of cities. To discuss the analytical properties of equation (3), we assume that Gaussian fluctuations are negligible compared to the Lévy noise and write η_i ≈ r (see Extended Data Fig. 5). The corresponding Fokker–Planck equation (with Itô’s convention) can be solved using the formalism of fractional-order derivatives and Fox functions^38,39,40,41, leading to the general distribution at time t that can be expanded in powers of S as (see Supplementary Information for derivation and complete expressions of all terms):

$$P(S,t)=\mathop{\sum }\limits_{k=1}^{\infty }\,{C}_{k}\frac{a{(t)}^{-\alpha \beta -\alpha (1-\beta )k}}{{S}^{1+\alpha \beta +\alpha (1-\beta )k}}$$

(4)

where C_k is a prefactor that is a function of α, β and k and independent of t and S, and where $a(t)\propto {\left[{\textstyle \tfrac{r/{D}^{\alpha }}{({{\rm{e}}}^{r\alpha (1-\beta )t}-1)}}\right]}^{1/\alpha (1-\beta )}$ decreases exponentially at large times. This expansion shows that the probability distribution of city sizes is dominated at large S by the order k = 1 and converges towards a Pareto distribution with exponent α ≠ 1. The speed of convergence towards this power law can be estimated with the ratio λ(S, t) of the first and second terms of the expansion equation (4) and leads to:

$$\lambda (S,t)=\frac{{D}^{\alpha }}{r}{\left(\frac{\bar{S}(t)}{S}\right)}^{\alpha (1-\beta )}$$

(5)

where $\bar{S}(t)$ is the mean city size. If λ(S) ≳ 1, the α-exponent regime is not valid in the right tail with threshold S at time t. Estimates of α and β for the four datasets show that finite-time effects are very important in all cases and that a power-law regime is only reached for unrealistically large city sizes (see discussion in Supplementary Information). Hence, the range of city sizes for which we can observe a power-law distribution may not exist in practice and there is no reason in general to observe Zipf’s law or any other stationary distribution. We also note that from equation (4) there is a scaling of the form $P(S,t)={\textstyle \tfrac{1}{S}}F\left({\textstyle \tfrac{S}{\bar{S}(t)}}\right)$ with a scaling function F that depends on the country. We confirmed this scaling form for France (the only country for which we had sufficient data); details can be found in Supplementary Information (see also Extended Data Fig. 6).

In addition, if we perform a power-law fit of the expansion (equation (4)), the upper tail of the city-size distributions may be mistaken for a Pareto tail with a spurious exponent that changes with the definition of the upper tail (Extended Data Fig. 7). This might explain the discrepancies observed in the literature on Zipf’s law. As city sizes increase, the apparent exponent changes and can dramatically deviate from 1, as we initially observe in Extended Data Fig. 1. Following our analysis, the apparent exponent should converge towards the value given by α, as is indeed observed in, for example, France (α = 1.4) and the USA (α = 1.3).

Dynamics: splendour and decline of cities

The validity of our model (equation (3)) can be further tested on the dynamics of systems of cities over large periods of time. This can be done by following the populations and ranks of the system’s cities at different times with the help of ‘rank clocks’, as previously proposed⁵. In that work, it was proven that the micro-dynamics of cities is very turbulent, with many rises and falls of entire cities that cannot result from Gabaix’s model (which is, in essence, Gibrat’s model with a non-zero minimum for city sizes). We show in Fig. 2 the empirical rank clock for France (from 1876 to 2015) and for the results obtained with Gabaix’s model and ours (for the other countries, see Extended Data Fig. 8).

We see that in Gabaix’s model (middle), the city rank is stable on average, and not turbulent: the rank trajectories are concentric and the rank of a city oscillates around its average position. In the real dynamics (left), cities can emerge or die. Very fast changes in rank order can occur, leading to much more turbulent behaviour. In our model (right), the large fluctuations of Lévy noise are able to statistically reproduce such ebbs and flows of cities. More quantitatively, we first compare the average shift per time $d=(\,{\sum }_{t}{\sum }_{i=1}^{N}|{r}_{i}(t)-{r}_{i}(t-1)|)/NT$ over T years and for N cities in the three cases (Table 3) and look at the statistical fluctuations of the rank (see Extended Data Fig. 9): we note that Lévy fluctuations are much more able to reproduce the turbulent properties of the dynamics of cities through time. Indeed, the fast births and deaths of cities—due, for example, to wars, discoveries of new resources, incentive settlement policies, and so on—are statistically explained by broadly distributed migrations and are incompatible with a Gaussian noise. Second, we can compare with the empirical data the predictions of the different models for the time needed to make the largest rank jump (see Extended Data Fig. 10 for France, which typically predicts a duration of order 80 years to make a very large jump). We confirm that Gabaix’s model is unable to reproduce these very large fluctuations and that our equation agrees very well with the data.

Table 3 Average rank shift per unit time, d

Full size table

A new paradigm

In this Article, we build a stochastic equation of growth for cities on the basis of microlevel considerations that is empirically sound and that challenges the paradigm of Zipf’s law and current models of urban growth. We show that microscopic details are irrelevant and that the growth equation obtained is universal. A crucial point in this reasoning is that, although we have on average a sort of detailed balance that would lead to a Gaussian multiplicative-growth process, it is the existence of non-universal and broadly distributed fluctuations of the microscopic migration flows between cities that govern the statistics of city populations. We introduce here a stochastic equation that describes city growth that includes two sources of noise and that predicts an asymptotic power-law regime. However, this stationary regime is not generally reached and finite-time effects cannot be discarded. Our model is also able to statistically reproduce the turbulent micro-dynamics of cities that rapidly rise and fall, in contrast with previous Gaussian-based models of growth⁵.

In addition, our fundamental result exhibits an interesting connection between the behaviour of complex systems and non-equilibrium statistical physics for which microscopic currents and the violation of detailed balance seem to be the rule rather than the exception¹¹. At a practical level, this result also highlights the critical effect of not only interurban migration flows (an ingredient that is not generally considered in urban-planning theories), but also, more importantly, their large fluctuations—which are ultimately connected to the capacity of a city to attract a large number of new citizens. Our approach, which relies in essence on the population budget description and empirical results, provides a solid ground for future research on the temporal evolution of cities, a central problem in urban science.

Methods

For each of the four countries we build a graph of migration flows between metropolitan areas. We have (1) the populations of metropolitan areas and (2) the migration flows between metropolitan areas (described in more detail below).

US migrations

Data of migrations in the USA are taken from the 2013–2017 American Community Survey (ACS)⁴³. Aggregated metro-area-to-metro-area migration flows and counterflows are directly given between 389 metropolitan statistical areas in the USA. More precisely, the ACS asked respondents whether they lived in the same residence one year ago; for people who lived in a different residence, the location of their previous residence was collected.

French interurban migrations

Data of migrations in France are taken from the 2008 INSEE report for residential migrations at the town (commune) level for each individual household⁴⁴. The main residence in 2008 is compared to the main residence in 2003. In order to work at the urban area level, we used the 1999 INSEE list of urban areas and aggregate residential migrations at the metropolitan level, enabling us to analyse migration flows between the 500 largest urban areas in France.

UK interurban migrations

Data of migrations in the UK are taken from 2012–2016 ONS reports on internal migration between English and Welsh local authorities, giving the square matrix of moves each year⁴⁵. In order to work at the urban area level, we used the list of local authorities by OECD functional urban areas and aggregate residential migrations at the metropolitan level, enabling us to analyse migration flows between the 41 largest urban areas in England and Wales.

Canadian interurban migrations

Data of migrations in Canada are taken from 2012–2016 census reports on internal migration between Canadian metropolitan areas⁴⁶. Flows between these areas are given city-to-city for each year between 2012 and 2016 for the top-160 largest cities in Canada.

Data availability

The datasets used in this study are freely available from public repositories^43,44,45,46.

References

Zipf, G. K. Human Behavior and the Principle of Least Effort (Addison-Wesley, 1949).
Auerbach, F. Das Gesetz der Bevölkerungskonzentration. Petermanns Geogr. Mitt. 59, 74–76 (1913).
Google Scholar
Arshad, S., Hu, S. & Ashraf, B. N. Zipf’s law and city size distribution: a survey of the literature and future research agenda. Physica A 492, 75–92 (2018).
Article ADS Google Scholar
Gan, L., Li, D. & Song, S. Is the Zipf law spurious in explaining city-size distributions? Econ. Lett. 92, 256–262 (2006).
Article Google Scholar
Batty, M. Rank clocks. Nature 444, 592–596 (2006).
Article ADS CAS Google Scholar
Duranton, G. & Puga, D. in Handbook of Economic Growth Vol. 2 (eds Aghion, P. & Durlauf, S.) 781–853 (Elsevier, 2014).
Córdoba, J.-C. On the distribution of city sizes. J. Urban Econ. 63, 177–197 (2008).
Article Google Scholar
Rossi-Hansberg, E. & Wright, M. Urban structure and growth. Rev. Econ. Stud. 74, 597–624 (2007).
Article MathSciNet Google Scholar
Gibrat, R. Les inégalités économiques (Librairie du Recueil Sierey, 1931).
Gabaix, X. Zipf’s law for cities: an explanation. Q. J. Econ. 114, 739–767 (1999).
Article Google Scholar
Bouchaud, J.-P. Crises and collective socio-economic phenomena: simple models and challenges. J. Stat. Phys. 151, 567–606 (2013).
Article ADS MathSciNet Google Scholar
Bettencourt, L. & West, G. A unified theory of urban living. Nature 467, 912–913 (2010).
Article ADS CAS Google Scholar
Soo, K. T. Zipf’s law for cities: a cross-country investigation. Reg. Sci. Urban Econ. 35, 239–263 (2005).
Article Google Scholar
Singer, H. The “Courbe des populations”. A parallel to Pareto’s law. Econ. J. 46, 254–263 (1936).
Article Google Scholar
Gabaix, X., Lasry, J. M., Lions, P. L. & Moll, B. The dynamics of inequality. Econometrica 84, 2071–2111 (2016).
Article MathSciNet Google Scholar
Pumain, D. & Moriconi-Ebrard, F. City size distributions and metropolisation. GeoJournal 43, 307–314 (1997).
Article Google Scholar
Barthelemy, M. The Structure and Dynamics of Cities (Cambridge Univ. Press, 2016).
Corominas-Murtra, B. & Solé, R. Universality of Zipf’s law. Phys. Rev. E 82, 011102 (2010).
Article ADS MathSciNet Google Scholar
Pumain, D. Une théorie géographique pour la loi de Zipf. Reg. Dév. 36, 125–150 (2012).
Google Scholar
Marsili, M. & Zhang, Y.-C. Interacting individuals leading to Zipf’s law. Phys. Rev. Lett. 80, 2741–2744 (1998).
Article ADS CAS Google Scholar
Cottineau, C. MetaZipf. A dynamic meta-analysis of city size distributions. PLoS One 12, e0183919 (2017).
Article Google Scholar
Benguigui, L. & Blumenfeld-Lieberthal, E. A dynamic model for city size distribution beyond Zipf’s law. Physica A 384, 613–627 (2007).
Article ADS Google Scholar
Blank, A. & Solomon, S. Power laws in cities population, financial markets and internet sites (scaling in systems with a variable number of components). Physica A 287, 279–288 (2000).
Article ADS MathSciNet Google Scholar
Krugman, P. Confronting the mystery of urban hierarchy. J. Jpn. Int. Econ. 10, 399–418 (1996).
Article Google Scholar
Ioannides, Y. & Overman, H. Zipf’s law for cities: an empirical examination. Reg. Sci. Urban Econ. 33, 127–137 (2003).
Article Google Scholar
Favaro, J. M. & Pumain, D. Gibrat revisited: an urban growth model incorporating spatial interaction and innovation cycles. Geogr. Anal. 43, 261–286 (2011).
Article Google Scholar
Zanette, D. H. & Manrubia, S. C. Role of intermittency in urban development: a model of large-scale city formation. Phys. Rev. Lett. 79, 523–526 (1997).
Article ADS Google Scholar
Cottineau, C., Reuillon, R., Chapron, P., Rey-Coyrehourcq, S. & Pumain, D. A modular modelling framework for hypotheses testing in the simulation of urbanisation. Systems 3, 348–377 (2015).
Article Google Scholar
Bettencourt, L. & Zünd, D. Demography, symmetry and the emergence of universal patterns in urban systems. Nat. Commun. 11, 19 (2020).
Black, D. & Henderson, V. Urban evolution in the USA. J. Econ. Geogr. 3, 343–372 (2003).
Article Google Scholar
Eeckhout, J. Gibrat’s law for (all) cities. Am. Econ. Rev. 94, 1429–1451 (2004).
Article Google Scholar
Benguigui, L. & Blumenfeld-Lieberthal, E. The end of a paradigm: is Zipf’s law universal? J. Geogr. Syst. 13, 87–100 (2011).
Article Google Scholar
Bouchaud, J. P. & Mézard, M. Wealth condensation in a simple model of economy. Physica A 282, 536–545 (2000).
Article ADS Google Scholar
Erlander, S. & Stewart, N. F. The Gravity Model in Transportation Analysis: Theory and Extensions (VSP, 1990).
Simini, F., Gonzalez, M. C., Maritan A. & Barabasi, A.-L. A universal model for mobility and migration patterns. Nature 484, 96–100 (2012).
Article ADS CAS Google Scholar
Gnedenko, B. V. & Kolmogorov, A. N. Limit Distributions for Sums of Independent Random Variables (Addison-Wesley, 1954).
van Kampen, N. G. Itô versus Stratonovich. J. Stat. Phys. 24, 175–187 (1981).
Article ADS Google Scholar
Srokowski, T. Multiplicative Lévy processes: Itô versus Stratonovich interpretation. Phys. Rev. E 80, 051113 (2009).
Article ADS MathSciNet Google Scholar
Jespersen, S. M. R. & Fogedby, H. C. Lévy flights in external force fields: Langevin and fractional Fokker–Planck equations and their solutions. Phys. Rev. E 59, 2736–2745 (1999).
Article ADS CAS Google Scholar
Schertzer, D., Larchevêque, M., Duan, J., Yanovsky, V. & Lovejoy, S. Fractional Fokker–Planck equation for nonlinear stochastic differential equations driven by non-Gaussian Lévy stable noises. J. Math. Phys. 42, 200–212 (2001).
Article ADS MathSciNet Google Scholar
Fox, C. The G and H functions as symmetrical Fourier kernels. Trans. Am. Math. Soc. 98, 395–429 (1961).
MathSciNet MATH Google Scholar
Clauset, A., Shalizi, C. R. & Newman, M. E. Power-law distributions in empirical data. SIAM Rev. 51, 661–703 (2009).
Article ADS MathSciNet Google Scholar
United States Census Bureau. Metro area-to-metro area migration flows: 2013–2017 American Community Survey; https://www.census.gov/data/tables/2017/demo/geographic-mobility/metro-to-metro-migration.html (United States Census Bureau, 2019).
INSEE. Migrations résidentielles en 2008: lieu de résidence actuelle – lieu de résidence antérieure; https://www.insee.fr/fr/statistiques/2022291 (INSEE, 2011).
Park, N. Internal migration: matrices of moves by local authority and region (countries of the UK) 2012–2016; https://www.ons.gov.uk/peoplepopulationandcommunity/populationandmigration/migrationwithintheuk/datasets/matricesofinternalmigrationmovesbetweenlocalauthoritiesandregionsincludingthecountriesofwalesscotlandandnorthernireland (Office for National Statistics, 2019).
Statistics Canada. Interprovincial and intraprovincial migrants, by census metropolitan area of origin and destination for the period from July 1 to June 30; https://doi.org/10.25318/1710008701-eng (Statistics Canada, 2020).

Download references

Acknowledgements

V.V. is supported by the Ecole nationale des ponts et chaussées and by the Complex Systems Institute of Paris Île-de-France (ISC-PIF). V.V. thanks J. Morán for discussions and comments.

Author information

Authors and Affiliations

Institut de Physique Théorique, Université Paris-Saclay, CNRS, CEA, Gif-sur-Yvette, France
Vincent Verbavatz & Marc Barthelemy
École des Ponts ParisTech, Champs-sur-Marne, France
Vincent Verbavatz
Centre d’Etude et de Mathématique Sociales, CNRS/EHESS, Paris, France
Marc Barthelemy

Authors

Vincent Verbavatz
View author publications
You can also search for this author in PubMed Google Scholar
Marc Barthelemy
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

V.V. and M.B. designed the study, V.V. acquired the data, and V.V. and M.B. analysed and interpreted the data and wrote the manuscript.

Corresponding author

Correspondence to Marc Barthelemy.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Peer review information Nature thanks Amos Maritan and the other, anonymous, reviewer(s) for their contribution to the peer review of this work.

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Extended data figures and tables

Extended Data Fig. 1 No universal exponent.

We show here the measured Pareto exponent of the upper tail of city-size distributions as a function of the lower threshold defining the tail for the largest cities of eight different countries. The exponents are obtained with a maximum-likelihood estimate (data from https://simplemaps.com/data/world-cities).

Extended Data Fig. 2 In- and out-neighbours.

a, b, Number of in- and out-neighbours (in the sense of graph theory) for the USA (a) and France (b). The red lines correspond to the equality N_in = N_out. In the UK and Canada, we have a fully connected dataset and N_in = N_out = constant. c, d, Number of neighbours for each city as a function of population for the USA (c) and France (d). The dotted red lines indicate the power-law fit |N_i| ∝ ${S}_{i}^{\gamma }$. In the UK and Canada, we have a variance of the normalized quantity fully connected dataset and γ = 0.

Extended Data Fig. 3 Density function of the out-of-system growth rate.

Natural growth and out-of-system migrations include international migrations and exchanges with small towns. The data shown are for US cities in 2013–2017 (top) and French cities (bottom) in 2003–2008, compared to a normal distribution. We note that a power-law fit of the right or the left tail would lead to a Pareto exponent of β ≫ 1. For French cities, we extrapolated the 2003 population in each city from the 1999 and 2006 censuses to test our assumption on the period 2003–2008.

Extended Data Fig. 4 Migration-flow analysis.

Complementary figure to Fig. 2. Empirical left-cumulative distribution function of renormalized migration flows compared to Lévy (continuous red line) and normal (green dashed lines) distributions. Clockwise from top left, distributions are given for France, the USA, Canada and the UK.

Extended Data Fig. 5 Average distribution of city sizes.

Data obtained by 10 numerical runs of the stochastic differential equation (10) with a Gaussian noise with finite variance σ_η, compared with the numerical solution of equation (10) where η = ⟨η⟩ = r. Parameters here are α = 1.3, β = 0.8, r = 0.01, σ_η = 0.06, D = 0.06 and t = 500.

Extended Data Fig. 6 Scatterplot of the quantity P(S, t) × S versus the ratio for France’s top-500 largest cities between 1875 and 2016.

Each colour is a different year. We observe that the plots of all years collapse towards a unique universal function of the ratio in agreement with the result of equation (33) in Supplementary Information.

Extended Data Fig. 7 Power-law fit of the expansion with α = 1.3, as a function of the lower threshold of city sizes, S_min.

The expansion is described in equation (25) in Supplementary Information. The fit gives an apparent exponent α(S_min) with very good quality (R² ≈ 1), although the expansion itself is not a power law. The apparent exponent is smaller than α, but slowly converges towards α = 1.3 as the value of the threshold S_min increases. Parameters here are α = 1.3, β = 0.8, r = 0.01, D = 0.06 and t = 500.

Extended Data Fig. 8 Rank clocks of the USA and the UK.

Top, USA; bottom, UK. The left panels display real data, the middle panels show Gabaix’s model of growth and the right panels give our model of growth. Parallel lines for earlier years are spurious effects resulting from the absence of data for cities out of the top-100 largest in the USA or the top-40 largest in the UK (for these rank clocks, we assigned a random increasing radius to cities without data).

Extended Data Fig. 9 Microdynamics of city rank through time for the largest cities in France, the USA and the UK.

Data is given for the 500 largest cities in France between 1875 and 2016 (top), the 100 largest cities in the United States between 1790 and 1990 (middle) and the 40 largest cities in the UK between 1861 and 1991 (bottom). The left panels display the right-cumulative distribution of the maximal variation of the rank r_i(t) for city, that is, the difference between the highest and the lowest rank in population for each city. The right panels display the typical fluctuations of the rank r_i(t) through time. In the three cases, the Lévy model is able to predict rare but non-negligible large variations of rank such as the sudden birth or death of city, in contrast to Gabaix’s model or Gibrat’s model for growth, for which large fluctuations of rank order do not occur.

Extended Data Fig. 10 Average number of years (and standard dispersion) taken to observe the maximal rank variation ∆r as a function of ∆r.

Although the dispersion is large, Lévy’s model is compatible with real data, in contrast to Gabaix’s model of growth.

Supplementary information

Supplementary Information

This file contains a Supplementary Discussion.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Verbavatz, V., Barthelemy, M. The growth equation of cities. Nature 587, 397–401 (2020). https://doi.org/10.1038/s41586-020-2900-x

Download citation

Received: 13 February 2020
Accepted: 01 September 2020
Published: 18 November 2020
Issue Date: 19 November 2020
DOI: https://doi.org/10.1038/s41586-020-2900-x
Springer Nature Limited

This article is cited by

Defining a city — delineating urban areas using cell-phone data
- Lei Dong
- Fabio Duarte
- Carlo Ratti
Nature Cities (2024)
A typology of activities over a century of urban growth
- Julie Gravier
- Marc Barthelemy
Nature Cities (2024)
A generalized vector-field framework for mobility
- Erjian Liu
- Mattia Mazzoli
- José J. Ramasco
Communications Physics (2024)
Tiebout, Coase and urban scaling
- Chris Webster
The Annals of Regional Science (2024)
A maximum entropy approach for the modelling of car-sharing parking dynamics
- Simone Daniotti
- Bernardo Monechi
- Enrico Ubaldi
Scientific Reports (2023)

The growth equation of cities

Abstract

Similar content being viewed by others

Main

Growth of cities and Zipf’s law

Deriving the equation of city growth

No stationary distribution for cities

Dynamics: splendour and decline of cities

A new paradigm

Methods

US migrations

French interurban migrations

UK interurban migrations

Canadian interurban migrations

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Extended data figures and tables

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Search

Navigation