Measuring Urban Agglomeration: A Refoundation of the Mean City-Population Size Index

Lemelin, Andre; Rubiera-Morollón, Fernando; Gómez-Loscos, Ana

doi:10.1007/s11205-014-0846-9

Measuring Urban Agglomeration: A Refoundation of the Mean City-Population Size Index

Published: 12 December 2014

Volume 125, pages 589–612, (2016)
Cite this article

Social Indicators Research Aims and scope Submit manuscript

Andre Lemelin¹,
Fernando Rubiera-Morollón² &
Ana Gómez-Loscos³

1035 Accesses
8 Citations
Explore all metrics

Abstract

In this paper, we put forth the view that the potential for urbanization economies increases with interaction opportunities. From that premise follow three fundamental properties that an agglomeration index should possess: (1) to increase with the concentration of population and conform to the Pigou–Dalton transfer principle; (2) to increase with the absolute size of constituent population interaction zones; and (3) to be consistent in aggregation. Limiting our attention to pairwise interactions, and invoking the space-analytic foundations of local labor market area (LLMA) delineation, we develop an index of agglomeration based on the number of interaction opportunities per capita in a geographical area. This leads to Arriaga’s mean city-population size, which is the mathematical expectation of the size of the LLMA in which a randomly chosen individual lives. The index has other important properties. It does not require an arbitrary population threshold to separate urban from non-urban areas. It is easily adapted to situations where an LLMA lies partly outside the geographical area for which agglomeration is measured. Finally, it can be satisfactorily approximated when data is truncated or aggregated into size-classes. We apply the index to the Spanish NUTS III regions, and evaluate its performance by examining its correlation with the location quotients of several knowledge intensive business services known to be highly sensitive to urbanization economies. The Arriaga index’s correlations are clearly stronger than those of either the classical degree of urbanization or the Hirshman–Herfindahl concentration index.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A typology of U.S. metropolises by rent burden and its major drivers

Article 06 June 2023

Mikhail Samarin & Madhuri Sharma

The accuracy of crime statistics: assessing the impact of police data bias on geographic crime analysis

Article Open access 26 March 2021

David Buil-Gil, Angelo Moretti & Samuel H. Langton

Gentrification, Neighborhood Change, and Population Health: a Systematic Review

Article 14 January 2020

Alina S. Schnake-Mahl, Jaquelyn L. Jahn, … Mariana Arcaya

Notes

The number of distinct pairs in a group of n individuals is equal to n(n–1)/2. If n is sufficiently large, this can be approximated by n ²/2. More generally, combinatorics shows that the number of interactions involving two persons or more increases more than proportionately with the size of the group.
Among which (in alphabetical order): Canada (Munro et al. 2011); France (DATAR-DARES-INSEE 2011); Italy (ISTAT 1997, 2005, 2006; Sforzi 2012); New Zealand (Statistics New Zealand 2009; Goodyear 2008; Papps and Newell 2002); Portugal (Alfonso and Venâncio 2013); Spain (Rubiera and Viñuela 2012; Boix and Galleto 2006); United Kingdom (Bond and Coombes 2007; ONS 2007); USA (Tolbert and Sizer 1996; USDA ERS 2012).
However, our index does not take into account negative congestion externalities. Referring to Capello and Camagni (2000), it could be said that our index is one component of a “city effect indicator” of positive externalities, while leaving aside congestion externalities which could be accounted for in an “urban overload indicator”.
Dalton (1920, p. 351), Pigou (1912, p. 24). Also Cowell (2009).
Although frequently applied as a index of urban concentration within a country, it was originally proposed as a measure of market concentration, or market power.
Mentioned by both Henderson (2003b), Wheaton and Shishido (1981).
Arriaga investigates the implications of using a truncated index which ignores the bottom end of the size distribution of agglomerations, and concludes that a truncated index is a good approximation, under mildly restrictive hypotheses. But in our case, there is no truncation, because the LLMAs cover the whole territory. Lemelin et al. (2012, Appendix 1) present a version of the index that is based on information aggregated by size classes and therefore deals with truncation.
We assume that each individual has an equal probability of being chosen, in which case relative frequencies are correctly interpreted as probabilities.
Adelman (1969) has shown the “numbers-equivalent” property of the Hirchman–Herfindahl index (H): its inverse (1/H) can be interpreted as the number of equal-sized LLMAs which would exhibit a concentration level equal to H.
It does not, however, increase monotonically with the Pareto parameter interpreted as a measure of concentration. The reason for this is exposed in the Appendix 2.
Such is the case in Spain, for example, where several LLMAs include spatial units that are located in more than one province.
Lemelin et al. (2012, Appendix 1).
This is the most recent data as the 2011 Spanish Population and Housing Census (INE) is not yet available.
More details and applications of LLMAs are found in Fernández and Rubiera (2012). Rubiera and Viñuela (2012) perform an economic evaluation of this spatial unit.
In Appendix 2 of Lemelin et al. (2012), this is compared with an index for which each LLMA has been attributed in its entirety to the province where its centroid is located, which is tantamount to redrawing provincial boundaries. Interestingly, at least in the Spanish case, the two versions of the index are tightly correlated across the 52 provinces.
Of course, the HH index and the classical degree of urbanization do not measure the same thing as our index (see Sect. 2 above), and it is normal that they differ from Fig. 1. But that is not our point.
In addition, we related the three measures of agglomeration to GDP per capita, and found a higher correlation with the Arriaga index. But the correlations were not spectacular, reflecting the fact that other determinants also play a major role.
The reasons for the concentration of such services in large metropolitan areas are strongly connected with the presence of different types of effects directly derived from the existence of agglomeration economies. The diversity and rapidly changing nature of talents and know-how mean that only the largest cities will provide the necessary specialized labor pool. Such industries are, in other words, dependent on a constant stream of face–to–face meetings with a wide (and changing) range of individuals that only can occur in cities, but better in large cities. See Daniels (1985), Illeris (1996), Shearmur and Doloreux (2008), Polèse et al. (2007), Wernerheim and Sharpe (2003), among many others.
Note that the f _ip are independent of the scaling, since both the numerator and denominator are divided by the denominator in equation (1).
The argument that follows can be generalized, albeit laboriously, to the version of the index that deals with the boundary problem. See Appendix 3 of Lemelin et al. (2012).
It makes little difference whether city sizes are absolute or relative to some benchmark, such as Madrid above.
Here, we ignore the boundary problem, which the Pareto distribution approach does not handle anyway.
Note that the denominator of (40) is a CES aggregator function.
The interested reader can find the proof in Lemelin et al. (2012, Appendix 4).
Spreadsheet calculations were performed for values of K from 1 to 100, and a from 0.01 to 2 in increments of 0.01.

References

Adelman, M. A. (1969). Comment on the “H” concentration measure as a numbers-equivalent. The Review of Economics and Statistics, 51(1), 99–101.
Article Google Scholar
Alfonso, A., & Venâncio, A. (2013). The relevance of commuting zones for regional spending efficiency. Working Paper 17/2013/DE/UECE/ADVANCE, Department of Economics, School of Economics and Management, Technical University of Lisbon. http://pascal.iseg.utl.pt/~depeco/wp/wp172013.pdf 2014-01-10
Arriaga, E. E. (1970). A new approach to the measurements of urbanization. Economic Development and Cultural Change, 18(2), 206–218.
Article Google Scholar
Arriaga, E. E. (1975). “Selected measures of urbanization”, Chap. II. In S. Goldstein, D. F. Sly (Eds.), The measurement of urbanization and projection of urban population, Working Paper 2, International Union for the scientific study of population. Committee on urbanization and population redistribution. Ordina Editions, Dolhain, Belgium.
Boix, R., & Galleto, V. (2006). Identificación de Sistemas Locales de Trabajo y Distritos Industriales en España. Dirección General de Política de la Pequeña y Mediana Empresa, Ministerio de Industria, Comercio y Turismo.
Bond, S., & Coombes, M. (2007). 2001-based Travel-To-Work Areas Methodology. Office for National Statistics. Retrieved from 13 January 2013. http://www.ons.gov.uk
Brülhart, M., & Sbergami, F. (2009). Agglomeration and growth: Cross-country evidence. Journal of Urban Economics, 65, 48–63.
Article Google Scholar
Brülhart, M., & Traeger, R. (2005). An account of geographic concentration patterns in Europe. Regional Science and Urban Economics, 35, 597–624.
Article Google Scholar
Capello, R., & Camagni, R. (2000). Beyond optimal city size: An evaluation of alternative growth patterns. Urban Studies, 37(9), 1479–1496.
Article Google Scholar
Casado-Díaz, J. M. (2000). Local labour market areas in Spain: A case study. Regional Studies, 34, 843–856.
Article Google Scholar
Cowell, F. A. (2009). Measuring inequality, LSE perspectives on economic analysis. Oxford: Oxford University Press.
Google Scholar
Crédit Suisse Research Institute. (2012). Opportunities in an Urbanizing World, Zurich, Switzerland. Retrieved from 16 Janaury 2014. https://www.credit-suisse.com/ch/fr/news-and-expertise/research/credit-suisse-research-institute/publications.html
Dalton, H. (1920). The measurement of the inequality of incomes. The Economic Journal, 30(119), 348–361.
Article Google Scholar
Daniels, P. (1985). Service industries: A geographical perspective. New York: Methuen.
Google Scholar
DATAR-DARES-INSEE (2011). Atlas des zones d’emploi 2010. Délégation interministérielle à l’Aménagement du Territoire et à l’Attractivité régionale (DATAR), Direction de l’Animation de la Recherche, des Études et des Statistiques (DARES) and Institut National de la Statistique et des Études Économiques (INSEE). Retrieved from 13 January 2014. http://www.insee.fr/fr/themes/detail.asp?reg_id=0&ref_id=atlas-zone-emploi-2010
Excerpts from this document are downloadable from F. Sforzi’s homepage. Retrieved from 23 January 2014. http://economia.unipr.it/DOCENTI/SFORZI/docs/files/SISTEMI_LOCALI.PDF
Fernández, E., & Rubiera, F. (2012). Defining the spatial scale in modern economic analysis: New challenges from data at local level. Advances in spatial science series. Berlin: Springer.
Book Google Scholar
Glaeser, E. L., Kallal, H. D., Scheinkman, J. A., & Schleifer, A. (1992). Growth in cities. Journal of Political Economy, 100(6), 1126–1152.
Article Google Scholar
Goodyear, R. (2008). Workforces on the move: An examination of commuting patterns to the cities of Auckland, Wellington and Christchurch. Paper presented at NZAE conference, Wellington City, New Zealand, July 2008. Statistics New Zealand. Retrieved from 10 January 2014. http://www.stats.govt.nz/methods/research-papers/nzae/nzae-2008/workforces-on-the-move.aspx
Henderson, J. V. (1988). Urban development: Theory, fact and illusion. Oxford: Oxford University Press.
Google Scholar
Henderson, J. V. (2003a). Urbanization and economic development. Annals of Economics and Finance, 4, 275–341.
Google Scholar
Henderson, J. V. (2003b). The urbanization process and economic growth: The So-What question. Journal of Economic Growth, 8, 47–71.
Article Google Scholar
Hoover, E. M. (1937). Location theory and the shoe and leather industry. Cambridge, MA: Harvard University Press.
Book Google Scholar
Illeris, S. (1996). The service economy: A geographical approach. Chichester, U.K.: Wiley.
Google Scholar
INE. (2001). Censo de Población, 2001. Instituto Nacional de Estadística. http://www.ine.es
Isard, W. (1956). Location and space-economy. Cambridge, MA: The Technology Press of Massachusetts, Institute of Technology.
Google Scholar
ISTAT. (1997). I Sistemi Locali del Lavoro 1991. A cura di F. Sforzi, Collana Argomenti 10, Instituto Nazionale di Statistica, Roma.
ISTAT. (2005). I Sistemi Locali del Lavoro. Censimento 2001. Dati definitivi, Instituto Nazionale di Statistica. Retrieved from 10 January 2014. http://dawinci.istat.it/daWinci/jsp/MD/download/sll_comunicato.pdf
ISTAT. (2006). 8o Censimento generale dell’industria e dei servizi. Distretti industriali e sistemi locali del lavoro 2001. Collana Censimenti, Instituto Nazionale di Statistica. Retrieved from 10 January 2014. http://www.istat.it/it/files/2011/01/Volume_Distretti1.pdf
Lemelin, A., Rubiera-Morrollón F., & Gómez-Loscos, A. (2012). « A territorial index of potential agglomeration economies from urbanization » , Montréal, INRS-UCS, coll. Inédits, 2012–03. http://www.ucs.inrs.ca/sites/default/files/centre_ucs/pdf/Inedit03-12.pdf
Munro, A., Alasia, A., & Bollman, R. D. (2011). “Self-contained labour areas: A proposed delineation and classification by degree of rurality”, Rural and Small Town Canada Analysis Bulletin, 8(8), Catalogue no. 20-006-X, Statistics Canada. Retrieved from 16 January 2014. http://www.statcan.gc.ca/pub/21-006-x/21-006-x2008008-eng.htm
Ohlin, B. (1933). Interregional and internal trade. Cambridge, MA: Harvard University Press.
Google Scholar
ONS. (2007). Introduction to the 2001-based Travel-to-Work Areas. ONS. Retrieved from 13 January 2013. http://www.ons.gov.uk
Papps, K. L., & Newell, J. O. (2002). Identifying functional labour market areas in New Zealand: A reconnaissance study using travel-to-work data. Discussion Paper No. 443, IZA (Institute for the study of labor), Bonn, Germany. Retrieved from 10 January 2014. http://ftp.iza.org/dp443.pdf
Pigou, A.C. (1912). Wealth and Welfare. MacMillan, London. Retrieved from 16 January 2014. https://archive.org/details/cu31924032613386
Polèse, M., Shearmur, R., & Rubiera, F. (2007). Observing regularities in location patters. An analysis of the spatial distribution of economic activity in Spain. European Urban and Regional Studies, 14(2), 157–180.
Article Google Scholar
Rosen, K., & Resnick, M. (1980). The size distribution of cities: An examination of the pareto law and primacy. Journal of Urban Economics, 8, 165–186.
Article Google Scholar
Rubiera, F., & Viñuela, A. (2012). “From funtional areas to analytical regions, where the agglomeration economies make sense”, Chap. 2, p. 23–44. In: E. Fernández, F. Rubiera (Eds.) Defining the spatial scale in modern economic analysis: New challenges from data at local level. Springer.
Sforzi, F. (2012). “From administrative spatial units to local labour market areas”, Chap. 1, p. 3–21. In: E. Fernández, F. Rubiera (Eds.) Defining the spatial scale in modern economic analysis: New challenges from data at local level. Springer.
Shearmur, R., & Doloreux, D. (2008). Urban hierarchy or local milieu? High-order producer service and (or) knowledge-intensive business service location in Canada, 1991–2001. Professional Geographer, 60(3), 333–355.
Article Google Scholar
Spence, M., Annez, P. C., & Buckley, R. M. (2009). Urbanization and Growth. Washington, D.C: Commission on Growth and Development, The World Bank.
Google Scholar
Statistics New Zealand. (2009). Workforces on the move: Commuting patterns in New Zealand. Statistics New Zealand. Retrieved from 14 January 2014. http://www.stats.govt.nz/browse_for_stats/people_and_communities/Geographic-areas/commuting-patterns-in-nz-1996-2006.aspx
Tolbert, C. M.& Sizer, M. (1996). U.S. Commuting Zones and Local Market Areas. A 1990 Update. ERS Staff Paper, Rural Economy Division, Economic Research Service, U.S. Department of Agriculture. Retrieved from 10 January 2014. https://usa.ipums.org/usa/resources/volii/cmz90.pdf
Uchida, H., & Nelson, A. (2010). Agglomeration index: Towards a new measure of urban concentration. In J. Beall, B. Guha-Khasnobis, & R. Kanbur (Eds.), Urbanization and development. Oxford: Oxford University Press.
Google Scholar
USDA ERS. (2012). “Commuting Zones and Labor Market Areas: Documentation” (Web document), Economic Research Department, U.S. Department of Agriculture. Retrieved from 14 January 2014. http://www.ers.usda.gov/data-products/commuting-zones-and-labor-market-areas/documentation
Weber, A. (1909). Ûber den Standort der Industrien. Mohr, TÏbingen; translated by Friedrich, C. J. (1929) as Alfred Weber’s Theory of the Location of Industries, Chicago, IL: University of Chicago Press.
Wernerheim, M., & Sharpe, C. (2003). High-order producer services in metropolitan Canada: How footloose are they? Regional Studies, 37, 469–490.
Article Google Scholar
Wheaton, W., & Shishido, H. (1981). Urban concentration, agglomeration economies, and the level of economic development. Economic Development and Cultural Change, 30, 17–30.
Article Google Scholar
Zhu, Nong, Xubei Luo and Heng-fu Zou (2012): Regional differences in China’s urbanization and its determinants, CEMA Working Papers 535, China Economics and Management Academy, Central University of Finance and Economics. Retrieved from 21 January 2014. http://ideas.repec.org/p/cuf/wpaper/535.html

Download references

Author information

Authors and Affiliations

INRS - Institut National du la Recherche Scientifique, Quebec, Canada
Andre Lemelin
REGIOlab, University of Oviedo, Oviedo, Asturias, Spain
Fernando Rubiera-Morollón
Banco de España, Madrid, Spain
Ana Gómez-Loscos

Authors

Andre Lemelin
View author publications
You can also search for this author in PubMed Google Scholar
Fernando Rubiera-Morollón
View author publications
You can also search for this author in PubMed Google Scholar
Ana Gómez-Loscos
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding authors

Correspondence to Fernando Rubiera-Morollón or Ana Gómez-Loscos.

Appendices

Appendix 1: Geometric Interpretation of the Agglomeration Index

Define n ₀ = 0 and let K be the number of LLMAs in geographical area under consideration. Then

$$f_{i} = \frac{{n_{i} }}{{\sum\limits_{j = 1}^{K} {n_{j} } }} = \frac{{n_{i} }}{{\sum\limits_{j = 0}^{K} {n_{j} } }}$$

(15)

is the fraction of population residing in the ith LLMA (with f ₀ = 0), and $F_{i} = \sum\nolimits_{j = 1}^{i} {f_{j} } = \sum\nolimits_{j = 0}^{i} {f_{j} }$ is the cumulative distribution (with F ₀ = 0).^{Footnote 19} LLMAs are assumed to be ordered from smallest to largest. The area above the curve is computed as:

$$I = \sum\limits_{i = 1}^{K} {\left( {1 - F_{i - 1} } \right)\;\left( {n_{i} - n_{i - 1} } \right)}$$

(16)

In our example, this is equal to 0.0242. Note that the first term of formula (14) is the area above the curve to the left of the first LLMA in Fig. 1. This reflects the fact that LLMAs cover the whole territory, so that the threshold between urban and non-urban is irrelevant. The first term in (15) is equal to the size of the smallest LLMA:

$$\left( {1 - F_{0} } \right)\;\left( {n_{1} - n_{0} } \right) = \left( {1 - 0} \right)\;\left( {n_{1} - 0} \right) = n_{1}$$

(17)

Equation (15) can be written as:

$$I = \sum\limits_{i = 1}^{K} {\left( {1 - \sum\limits_{j = 0}^{i - 1} {f_{j} } } \right)\;\left( {n_{i} - n_{i - 1} } \right)}$$

(18)

$$I = \sum\limits_{i = 1}^{K} {\left( {\sum\limits_{j = i}^{K} {f_{j} } } \right)\;\left( {n_{i} - n_{i - 1} } \right)}$$

(19)

$$I = \sum\limits_{i = 1}^{K} {\left( {\sum\limits_{j = i}^{K} {f_{j} } } \right)\;n_{i} } - \sum\limits_{i = 1}^{K} {\left( {\sum\limits_{j = i}^{K} {f_{j} } } \right)\;n_{i - 1} }$$

(20)

Remembering that n ₀ = 0,

$$I = \sum\limits_{i = 1}^{K} {\left( {\sum\limits_{j = i}^{K} {f_{j} } } \right)\;n_{i} } - \sum\limits_{i = 2}^{K} {\left( {\sum\limits_{j = i}^{K} {f_{j} } } \right)\;n_{i - 1} }$$

(21)

$$I = \sum\limits_{i = 1}^{K} {\left( {\sum\limits_{j = i}^{K} {f_{j} } } \right)\;n_{i} } - \sum\limits_{i = 1}^{K - 1} {\left( {\sum\limits_{j = i + 1}^{K} {f_{j} } } \right)\;n_{i} }$$

(22)

$$I = f_{K} n_{K} + \sum\limits_{i = 1}^{K - 1} {\left( {\sum\limits_{j = i}^{K} {f_{j} } } \right)\;n_{i} } - \sum\limits_{i = 1}^{K - 1} {\left( {\sum\limits_{j = i + 1}^{K} {f_{j} } } \right)\;n_{i} }$$

(23)

$$I = f_{K} n_{K} + \sum\limits_{i = 1}^{K - 1} {\left( {\sum\limits_{j = i}^{K} {f_{j} } - \sum\limits_{j = i + 1}^{K} {f_{j} } } \right)\;n_{i} }$$

(24)

$$I = f_{K} n_{K} + \sum\limits_{i = 1}^{K - 1} {f_{i} n_{i} } = \sum\limits_{i = 1}^{K} {f_{i} n_{i} }$$

(25)

which is exactly Eq. (3).

Appendix 2: Transfer Principle

A key property of the index is that it correctly reflects the change in the potential for interactions and urbanization economies of any reallocation of population. This property is close to the Pigou–Dalton transfer principle for measures of inequality, which states that any change in the distribution that unambiguously reduces inequality must be reflected in a decrease in its measure.

Let Δn _i represent the change in the relative population size of the ith LLMA. A reallocation of population is restricted by the condition that $\sum\limits_{i = 1}^{k} {\Delta n_{i} = 0}$. Any reallocation can be represented as a series of reallocations between two LLMAs, and any reallocation between two LLMAs can be represented as a series of reallocations between an LLMA and the following or preceding one when LLMAs are ordered according to size. Therefore, we need only to consider a reallocation of population from the (s–1)th LLMA to the sth (from a LLMA to the next higher ranking one in terms of size):

$$\Delta ns = \, {-}\Delta ns{-} 1 { } > \, 0,{\text{ and}}\;\Delta ni = \, 0{\text{ for}}i \ne s,s{-} 1$$

(26)

According to our theoretical a priori, such a reallocation raises the potential for interactions. What effect does it have on the index?

Following Eq. (2), define:^{Footnote 20}

$$\Delta f_{i} = \frac{{\Delta n_{i} }}{{\sum\limits_{j = 1}^{K} {n_{j} } }} = \frac{{\Delta n_{i} }}{{\sum\limits_{j = 0}^{K} {n_{j} } }}$$

(27)

where, in view of (26),

$$\sum\limits_{j = 1}^{K} {n_{j} } - \Delta n_{s - 1} + \Delta n_{s} = \sum\limits_{j = 1}^{K} {n_{j} }$$

(28)

and

$$\Delta f_{s} = - \Delta f_{s - 1} > 0$$

(29)

The value of the index after the reallocation is:

$$I^{\prime} = \sum\limits_{i \ne s - 1,s}^{{}} {f_{i} n_{i} } + \left( {f_{s - 1} - \Delta f_{s} } \right)\left( {n_{s - 1} - \Delta n_{s} } \right) + \left( {f_{s} + \Delta f_{s} } \right)\left( {n_{s} + \Delta n_{s} } \right)$$

(30)

$$I^{\prime} = \sum\limits_{i \ne s - 1,s}^{{}} {f_{i} n_{i} } + f_{s - 1} n_{s - 1} + f_{s} n_{s} + \left( {f_{s} - f_{s - 1} } \right)\Delta n_{s} + \Delta f_{s} \left( {n_{s} - n_{s - 1} } \right) + 2\Delta f_{s} \Delta n_{s}$$

(31)

$$I^{\prime} = \sum\limits_{i = 1}^{K} {f_{i} n_{i} } + \left( {f_{s} - f_{s - 1} } \right)\Delta n_{s} + \Delta f_{s} \left( {n_{s} - n_{s - 1} } \right) + 2\Delta f_{s} \Delta n_{s}$$

(32)

$$I^{\prime} = I + \left( {f_{s} - f_{s - 1} } \right)\Delta n_{s} + \Delta f_{s} \left( {n_{s} - n_{s - 1} } \right) + 2\Delta f_{s} \Delta n_{s}$$

(33)

Given that the LLMAs are ordered from the smallest to the largest, ns > ns–1, and fs > fs–1, so that I’ > I.

Appendix 3: Relationship With the Pareto Distribution

The empirically estimated exponent of the Pareto city-size distribution (a generalization of Zipf’s rank-size rule) has been used as a measure of the concentration of an urban system (Rosen and Resnick 1980). Following the notation established above, the (discrete) Pareto distribution can be written as:

$$K + 1 - i = An_{i}^{ - a}$$

(34)

where K is the number of cities (ranked from the smallest to the largest), n _i is the size of city i,^{Footnote 21} and A and a are parameters. Parameter A can be calibrated from the size of the largest city:

$$K + 1 - K = 1 = An_{K}^{ - a}$$

(35)

$$A = n_{K}^{a}$$

(36)

Inverting (34), we obtain:

$$n_{i}^{a} = \frac{A}{K + 1 - i}$$

(37)

$$n_{i} = \left( {\frac{A}{K + 1 - i}} \right)^{{{1 \mathord{\left/ {\vphantom {1 a}} \right. \kern-0pt} a}}}$$

(38)

Total urban population is:

$$N = \sum\limits_{i = 1}^{K} {n_{i} } = \sum\limits_{i = 1}^{K} {\left( {\frac{A}{K + 1 - i}} \right)^{{{1 \mathord{\left/ {\vphantom {1 a}} \right. \kern-0pt} a}}} }$$

(39)

And so it is quite straightforward to construct a cumulative distribution similar to the one in Fig. 1 reflecting a theoretical Pareto distribution. It is then possible to apply our proposed index to a theoretical Pareto distribution using formula (3). There results

$$I = \frac{{\sum\limits_{i = 1}^{K} {\left( {\frac{A}{K + 1 - i}} \right)^{{{2 \mathord{\left/ {\vphantom {2 a}} \right. \kern-0pt} a}}} } }}{{\sum\limits_{i = 1}^{K} {\left( {\frac{A}{K + 1 - i}} \right)^{{{1 \mathord{\left/ {\vphantom {1 a}} \right. \kern-0pt} a}}} } }}$$

(40)

where we exploit the identity in Eq. (3).^{Footnote 22} If we assume that the number of cities K and the size of the largest city n _K are fixed, then, using (36), (40) can be written as:^{Footnote 23}

$$I = \frac{{\sum\limits_{i = 1}^{K} {\left( {\frac{{n_{K}^{a} }}{K + 1 - i}} \right)^{{{2 \mathord{\left/ {\vphantom {2 a}} \right. \kern-0pt} a}}} } }}{{\sum\limits_{i = 1}^{K} {\left( {\frac{{n_{K}^{a} }}{K + 1 - i}} \right)^{{{1 \mathord{\left/ {\vphantom {1 a}} \right. \kern-0pt} a}}} } }}$$

(41)

$$I = \frac{{\sum\limits_{i = 1}^{K} {n_{K}^{2} \left( {\frac{1}{K + 1 - i}} \right)^{{{2 \mathord{\left/ {\vphantom {2 a}} \right. \kern-0pt} a}}} } }}{{\sum\limits_{i = 1}^{K} {n_{K} \left( {\frac{1}{K + 1 - i}} \right)^{{{1 \mathord{\left/ {\vphantom {1 a}} \right. \kern-0pt} a}}} } }}$$

(42)

$$I = n_{K} \frac{{\sum\limits_{i = 1}^{K} {\left( {\frac{1}{K + 1 - i}} \right)^{{{2 \mathord{\left/ {\vphantom {2 a}} \right. \kern-0pt} a}}} } }}{{\sum\limits_{i = 1}^{K} {\left( {\frac{1}{K + 1 - i}} \right)^{{{1 \mathord{\left/ {\vphantom {1 a}} \right. \kern-0pt} a}}} } }}$$

(43)

The derivative of the index relative to the Pareto parameter is ^{Footnote 24}

$$\frac{\partial I}{\partial a} = n_{K} \left( \frac{1}{a} \right)^{2}\; \frac{{2\sum\limits_{j = 1}^{K} {\;\sum\limits_{i = 1}^{K} {\left[ {j^{{^{{ - {1 \mathord{\left/ {\vphantom {1 a}} \right. \kern-0pt} a}}} }} i^{{^{{ - {2 \mathord{\left/ {\vphantom {2 a}} \right. \kern-0pt} a}}} }} \ln i} \right]} } - \sum\limits_{j = 1}^{K} {\;\sum\limits_{i = 1}^{K} {\left[ {j^{{^{{ - {2 \mathord{\left/ {\vphantom {2 a}} \right. \kern-0pt} a}}} }} i^{{^{{ - {1 \mathord{\left/ {\vphantom {1 a}} \right. \kern-0pt} a}}} }} \ln i} \right]} } }}{{\left( {\sum\limits_{i = 1}^{K} {i^{{^{{ - {1 \mathord{\left/ {\vphantom {1 a}} \right. \kern-0pt} a}}} }} } } \right)^{2} }}$$

(44)

The sign of that derivative is the sign of its numerator, but we could not determine that sign analytically. Using numerical simulations,^{Footnote 25} we obtain that the derivative is negative for low values of a, and positive for high values. The sign reversal of the derivative is explained by the fact that, for a given number of cities, the size of the smallest city under the rank-size rule, $n_{1} = n_{K} K^{{^{{ - {1 \mathord{\left/ {\vphantom {1 a}} \right. \kern-0pt} a}}} }}$, increases with a, leaving a larger gap to the left of the first point on the cumulative distribution (see Fig. 1). Referring to index computation formula (7), it is easily verified that its first term is equal to n ₁. Indeed, our numerical simulations confirm that, if that first term is omitted, our index is a monotonically decreasing function of parameter a. This is illustrated in Fig. 7).

Rights and permissions

Reprints and permissions

About this article

Cite this article

Lemelin, A., Rubiera-Morollón, F. & Gómez-Loscos, A. Measuring Urban Agglomeration: A Refoundation of the Mean City-Population Size Index. Soc Indic Res 125, 589–612 (2016). https://doi.org/10.1007/s11205-014-0846-9

Download citation

Accepted: 03 December 2014
Published: 12 December 2014
Issue Date: January 2016
DOI: https://doi.org/10.1007/s11205-014-0846-9

Keywords

JEL Classification

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Measuring Urban Agglomeration: A Refoundation of the Mean City-Population Size Index

Abstract

Access this article

Similar content being viewed by others

A typology of U.S. metropolises by rent burden and its major drivers

The accuracy of crime statistics: assessing the impact of police data bias on geographic crime analysis

Gentrification, Neighborhood Change, and Population Health: a Systematic Review

Notes

References

Author information

Authors and Affiliations

Corresponding authors

Appendices

Appendix 1: Geometric Interpretation of the Agglomeration Index

Appendix 2: Transfer Principle

Appendix 3: Relationship With the Pareto Distribution

Rights and permissions

About this article

Cite this article

Keywords

JEL Classification

Navigation

Measuring Urban Agglomeration: A Refoundation of the Mean City-Population Size Index

Abstract

Access this article

Similar content being viewed by others

A typology of U.S. metropolises by rent burden and its major drivers

The accuracy of crime statistics: assessing the impact of police data bias on geographic crime analysis

Gentrification, Neighborhood Change, and Population Health: a Systematic Review

Notes

References

Author information

Authors and Affiliations

Corresponding authors

Appendices

Appendix 1: Geometric Interpretation of the Agglomeration Index

Appendix 2: Transfer Principle

Appendix 3: Relationship With the Pareto Distribution

Rights and permissions

About this article

Cite this article

Share this article

Keywords

JEL Classification

Search

Navigation