Skip to main content
Log in

Measuring Urban Agglomeration: A Refoundation of the Mean City-Population Size Index

  • Published:
Social Indicators Research Aims and scope Submit manuscript

Abstract

In this paper, we put forth the view that the potential for urbanization economies increases with interaction opportunities. From that premise follow three fundamental properties that an agglomeration index should possess: (1) to increase with the concentration of population and conform to the Pigou–Dalton transfer principle; (2) to increase with the absolute size of constituent population interaction zones; and (3) to be consistent in aggregation. Limiting our attention to pairwise interactions, and invoking the space-analytic foundations of local labor market area (LLMA) delineation, we develop an index of agglomeration based on the number of interaction opportunities per capita in a geographical area. This leads to Arriaga’s mean city-population size, which is the mathematical expectation of the size of the LLMA in which a randomly chosen individual lives. The index has other important properties. It does not require an arbitrary population threshold to separate urban from non-urban areas. It is easily adapted to situations where an LLMA lies partly outside the geographical area for which agglomeration is measured. Finally, it can be satisfactorily approximated when data is truncated or aggregated into size-classes. We apply the index to the Spanish NUTS III regions, and evaluate its performance by examining its correlation with the location quotients of several knowledge intensive business services known to be highly sensitive to urbanization economies. The Arriaga index’s correlations are clearly stronger than those of either the classical degree of urbanization or the Hirshman–Herfindahl concentration index.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6

Similar content being viewed by others

Notes

  1. The number of distinct pairs in a group of n individuals is equal to n(n–1)/2. If n is sufficiently large, this can be approximated by n 2/2. More generally, combinatorics shows that the number of interactions involving two persons or more increases more than proportionately with the size of the group.

  2. Among which (in alphabetical order): Canada (Munro et al. 2011); France (DATAR-DARES-INSEE 2011); Italy (ISTAT 1997, 2005, 2006; Sforzi 2012); New Zealand (Statistics New Zealand 2009; Goodyear 2008; Papps and Newell 2002); Portugal (Alfonso and Venâncio 2013); Spain (Rubiera and Viñuela 2012; Boix and Galleto 2006); United Kingdom (Bond and Coombes 2007; ONS 2007); USA (Tolbert and Sizer 1996; USDA ERS 2012).

  3. However, our index does not take into account negative congestion externalities. Referring to Capello and Camagni (2000), it could be said that our index is one component of a “city effect indicator” of positive externalities, while leaving aside congestion externalities which could be accounted for in an “urban overload indicator”.

  4. Dalton (1920, p. 351), Pigou (1912, p. 24). Also Cowell (2009).

  5. Although frequently applied as a index of urban concentration within a country, it was originally proposed as a measure of market concentration, or market power.

  6. Mentioned by both Henderson (2003b), Wheaton and Shishido (1981).

  7. Arriaga investigates the implications of using a truncated index which ignores the bottom end of the size distribution of agglomerations, and concludes that a truncated index is a good approximation, under mildly restrictive hypotheses. But in our case, there is no truncation, because the LLMAs cover the whole territory. Lemelin et al. (2012, Appendix 1) present a version of the index that is based on information aggregated by size classes and therefore deals with truncation.

  8. We assume that each individual has an equal probability of being chosen, in which case relative frequencies are correctly interpreted as probabilities.

  9. Adelman (1969) has shown the “numbers-equivalent” property of the Hirchman–Herfindahl index (H): its inverse (1/H) can be interpreted as the number of equal-sized LLMAs which would exhibit a concentration level equal to H.

  10. It does not, however, increase monotonically with the Pareto parameter interpreted as a measure of concentration. The reason for this is exposed in the Appendix 2.

  11. Such is the case in Spain, for example, where several LLMAs include spatial units that are located in more than one province.

  12. Lemelin et al. (2012, Appendix 1).

  13. This is the most recent data as the 2011 Spanish Population and Housing Census (INE) is not yet available.

  14. More details and applications of LLMAs are found in Fernández and Rubiera (2012). Rubiera and Viñuela (2012) perform an economic evaluation of this spatial unit.

  15. In Appendix 2 of Lemelin et al. (2012), this is compared with an index for which each LLMA has been attributed in its entirety to the province where its centroid is located, which is tantamount to redrawing provincial boundaries. Interestingly, at least in the Spanish case, the two versions of the index are tightly correlated across the 52 provinces.

  16. Of course, the HH index and the classical degree of urbanization do not measure the same thing as our index (see Sect. 2 above), and it is normal that they differ from Fig. 1. But that is not our point.

  17. In addition, we related the three measures of agglomeration to GDP per capita, and found a higher correlation with the Arriaga index. But the correlations were not spectacular, reflecting the fact that other determinants also play a major role.

  18. The reasons for the concentration of such services in large metropolitan areas are strongly connected with the presence of different types of effects directly derived from the existence of agglomeration economies. The diversity and rapidly changing nature of talents and know-how mean that only the largest cities will provide the necessary specialized labor pool. Such industries are, in other words, dependent on a constant stream of face–to–face meetings with a wide (and changing) range of individuals that only can occur in cities, but better in large cities. See Daniels (1985), Illeris (1996), Shearmur and Doloreux (2008), Polèse et al. (2007), Wernerheim and Sharpe (2003), among many others.

  19. Note that the f ip are independent of the scaling, since both the numerator and denominator are divided by the denominator in equation (1).

  20. The argument that follows can be generalized, albeit laboriously, to the version of the index that deals with the boundary problem. See Appendix 3 of Lemelin et al. (2012).

  21. It makes little difference whether city sizes are absolute or relative to some benchmark, such as Madrid above.

  22. Here, we ignore the boundary problem, which the Pareto distribution approach does not handle anyway.

  23. Note that the denominator of (40) is a CES aggregator function.

  24. The interested reader can find the proof in Lemelin et al. (2012, Appendix 4).

  25. Spreadsheet calculations were performed for values of K from 1 to 100, and a from 0.01 to 2 in increments of 0.01.

References

  • Adelman, M. A. (1969). Comment on the “H” concentration measure as a numbers-equivalent. The Review of Economics and Statistics, 51(1), 99–101.

    Article  Google Scholar 

  • Alfonso, A., & Venâncio, A. (2013). The relevance of commuting zones for regional spending efficiency. Working Paper 17/2013/DE/UECE/ADVANCE, Department of Economics, School of Economics and Management, Technical University of Lisbon. http://pascal.iseg.utl.pt/~depeco/wp/wp172013.pdf 2014-01-10

  • Arriaga, E. E. (1970). A new approach to the measurements of urbanization. Economic Development and Cultural Change, 18(2), 206–218.

    Article  Google Scholar 

  • Arriaga, E. E. (1975). “Selected measures of urbanization”, Chap. II. In S. Goldstein, D. F. Sly (Eds.), The measurement of urbanization and projection of urban population, Working Paper 2, International Union for the scientific study of population. Committee on urbanization and population redistribution. Ordina Editions, Dolhain, Belgium.

  • Boix, R., & Galleto, V. (2006). Identificación de Sistemas Locales de Trabajo y Distritos Industriales en España. Dirección General de Política de la Pequeña y Mediana Empresa, Ministerio de Industria, Comercio y Turismo.

  • Bond, S., & Coombes, M. (2007). 2001-based Travel-To-Work Areas Methodology. Office for National Statistics. Retrieved from 13 January 2013. http://www.ons.gov.uk

  • Brülhart, M., & Sbergami, F. (2009). Agglomeration and growth: Cross-country evidence. Journal of Urban Economics, 65, 48–63.

    Article  Google Scholar 

  • Brülhart, M., & Traeger, R. (2005). An account of geographic concentration patterns in Europe. Regional Science and Urban Economics, 35, 597–624.

    Article  Google Scholar 

  • Capello, R., & Camagni, R. (2000). Beyond optimal city size: An evaluation of alternative growth patterns. Urban Studies, 37(9), 1479–1496.

    Article  Google Scholar 

  • Casado-Díaz, J. M. (2000). Local labour market areas in Spain: A case study. Regional Studies, 34, 843–856.

    Article  Google Scholar 

  • Cowell, F. A. (2009). Measuring inequality, LSE perspectives on economic analysis. Oxford: Oxford University Press.

    Google Scholar 

  • Crédit Suisse Research Institute. (2012). Opportunities in an Urbanizing World, Zurich, Switzerland. Retrieved from 16 Janaury 2014. https://www.credit-suisse.com/ch/fr/news-and-expertise/research/credit-suisse-research-institute/publications.html

  • Dalton, H. (1920). The measurement of the inequality of incomes. The Economic Journal, 30(119), 348–361.

    Article  Google Scholar 

  • Daniels, P. (1985). Service industries: A geographical perspective. New York: Methuen.

    Google Scholar 

  • DATAR-DARES-INSEE (2011). Atlas des zones d’emploi 2010. Délégation interministérielle à l’Aménagement du Territoire et à l’Attractivité régionale (DATAR), Direction de l’Animation de la Recherche, des Études et des Statistiques (DARES) and Institut National de la Statistique et des Études Économiques (INSEE). Retrieved from 13 January 2014. http://www.insee.fr/fr/themes/detail.asp?reg_id=0&ref_id=atlas-zone-emploi-2010

  • Excerpts from this document are downloadable from F. Sforzi’s homepage. Retrieved from 23 January 2014. http://economia.unipr.it/DOCENTI/SFORZI/docs/files/SISTEMI_LOCALI.PDF

  • Fernández, E., & Rubiera, F. (2012). Defining the spatial scale in modern economic analysis: New challenges from data at local level. Advances in spatial science series. Berlin: Springer.

    Book  Google Scholar 

  • Glaeser, E. L., Kallal, H. D., Scheinkman, J. A., & Schleifer, A. (1992). Growth in cities. Journal of Political Economy, 100(6), 1126–1152.

    Article  Google Scholar 

  • Goodyear, R. (2008). Workforces on the move: An examination of commuting patterns to the cities of Auckland, Wellington and Christchurch. Paper presented at NZAE conference, Wellington City, New Zealand, July 2008. Statistics New Zealand. Retrieved from 10 January 2014. http://www.stats.govt.nz/methods/research-papers/nzae/nzae-2008/workforces-on-the-move.aspx

  • Henderson, J. V. (1988). Urban development: Theory, fact and illusion. Oxford: Oxford University Press.

    Google Scholar 

  • Henderson, J. V. (2003a). Urbanization and economic development. Annals of Economics and Finance, 4, 275–341.

    Google Scholar 

  • Henderson, J. V. (2003b). The urbanization process and economic growth: The So-What question. Journal of Economic Growth, 8, 47–71.

    Article  Google Scholar 

  • Hoover, E. M. (1937). Location theory and the shoe and leather industry. Cambridge, MA: Harvard University Press.

    Book  Google Scholar 

  • Illeris, S. (1996). The service economy: A geographical approach. Chichester, U.K.: Wiley.

    Google Scholar 

  • INE. (2001). Censo de Población, 2001. Instituto Nacional de Estadística. http://www.ine.es

  • Isard, W. (1956). Location and space-economy. Cambridge, MA: The Technology Press of Massachusetts, Institute of Technology.

    Google Scholar 

  • ISTAT. (1997). I Sistemi Locali del Lavoro 1991. A cura di F. Sforzi, Collana Argomenti 10, Instituto Nazionale di Statistica, Roma.

  • ISTAT. (2005). I Sistemi Locali del Lavoro. Censimento 2001. Dati definitivi, Instituto Nazionale di Statistica. Retrieved from 10 January 2014. http://dawinci.istat.it/daWinci/jsp/MD/download/sll_comunicato.pdf

  • ISTAT. (2006). 8o Censimento generale dell’industria e dei servizi. Distretti industriali e sistemi locali del lavoro 2001. Collana Censimenti, Instituto Nazionale di Statistica. Retrieved from 10 January 2014. http://www.istat.it/it/files/2011/01/Volume_Distretti1.pdf

  • Lemelin, A., Rubiera-Morrollón F., & Gómez-Loscos, A. (2012). « A territorial index of potential agglomeration economies from urbanization » , Montréal, INRS-UCS, coll. Inédits, 2012–03. http://www.ucs.inrs.ca/sites/default/files/centre_ucs/pdf/Inedit03-12.pdf

  • Munro, A., Alasia, A., & Bollman, R. D. (2011). “Self-contained labour areas: A proposed delineation and classification by degree of rurality”, Rural and Small Town Canada Analysis Bulletin, 8(8), Catalogue no. 20-006-X, Statistics Canada. Retrieved from 16 January 2014. http://www.statcan.gc.ca/pub/21-006-x/21-006-x2008008-eng.htm

  • Ohlin, B. (1933). Interregional and internal trade. Cambridge, MA: Harvard University Press.

    Google Scholar 

  • ONS. (2007). Introduction to the 2001-based Travel-to-Work Areas. ONS. Retrieved from 13 January 2013. http://www.ons.gov.uk

  • Papps, K. L., & Newell, J. O. (2002). Identifying functional labour market areas in New Zealand: A reconnaissance study using travel-to-work data. Discussion Paper No. 443, IZA (Institute for the study of labor), Bonn, Germany. Retrieved from 10 January 2014. http://ftp.iza.org/dp443.pdf

  • Pigou, A.C. (1912). Wealth and Welfare. MacMillan, London. Retrieved from 16 January 2014. https://archive.org/details/cu31924032613386

  • Polèse, M., Shearmur, R., & Rubiera, F. (2007). Observing regularities in location patters. An analysis of the spatial distribution of economic activity in Spain. European Urban and Regional Studies, 14(2), 157–180.

    Article  Google Scholar 

  • Rosen, K., & Resnick, M. (1980). The size distribution of cities: An examination of the pareto law and primacy. Journal of Urban Economics, 8, 165–186.

    Article  Google Scholar 

  • Rubiera, F., & Viñuela, A. (2012). “From funtional areas to analytical regions, where the agglomeration economies make sense”, Chap. 2, p. 23–44. In: E. Fernández, F. Rubiera (Eds.) Defining the spatial scale in modern economic analysis: New challenges from data at local level. Springer.

  • Sforzi, F. (2012). “From administrative spatial units to local labour market areas”, Chap. 1, p. 3–21. In: E. Fernández, F. Rubiera (Eds.) Defining the spatial scale in modern economic analysis: New challenges from data at local level. Springer.

  • Shearmur, R., & Doloreux, D. (2008). Urban hierarchy or local milieu? High-order producer service and (or) knowledge-intensive business service location in Canada, 1991–2001. Professional Geographer, 60(3), 333–355.

    Article  Google Scholar 

  • Spence, M., Annez, P. C., & Buckley, R. M. (2009). Urbanization and Growth. Washington, D.C: Commission on Growth and Development, The World Bank.

    Google Scholar 

  • Statistics New Zealand. (2009). Workforces on the move: Commuting patterns in New Zealand. Statistics New Zealand. Retrieved from 14 January 2014. http://www.stats.govt.nz/browse_for_stats/people_and_communities/Geographic-areas/commuting-patterns-in-nz-1996-2006.aspx

  • Tolbert, C. M.& Sizer, M. (1996). U.S. Commuting Zones and Local Market Areas. A 1990 Update. ERS Staff Paper, Rural Economy Division, Economic Research Service, U.S. Department of Agriculture. Retrieved from 10 January 2014. https://usa.ipums.org/usa/resources/volii/cmz90.pdf

  • Uchida, H., & Nelson, A. (2010). Agglomeration index: Towards a new measure of urban concentration. In J. Beall, B. Guha-Khasnobis, & R. Kanbur (Eds.), Urbanization and development. Oxford: Oxford University Press.

    Google Scholar 

  • USDA ERS. (2012). “Commuting Zones and Labor Market Areas: Documentation” (Web document), Economic Research Department, U.S. Department of Agriculture. Retrieved from 14 January 2014. http://www.ers.usda.gov/data-products/commuting-zones-and-labor-market-areas/documentation

  • Weber, A. (1909). Ûber den Standort der Industrien. Mohr, TÏbingen; translated by Friedrich, C. J. (1929) as Alfred Weber’s Theory of the Location of Industries, Chicago, IL: University of Chicago Press.

  • Wernerheim, M., & Sharpe, C. (2003). High-order producer services in metropolitan Canada: How footloose are they? Regional Studies, 37, 469–490.

    Article  Google Scholar 

  • Wheaton, W., & Shishido, H. (1981). Urban concentration, agglomeration economies, and the level of economic development. Economic Development and Cultural Change, 30, 17–30.

    Article  Google Scholar 

  • Zhu, Nong, Xubei Luo and Heng-fu Zou (2012): Regional differences in China’s urbanization and its determinants, CEMA Working Papers 535, China Economics and Management Academy, Central University of Finance and Economics. Retrieved from 21 January 2014. http://ideas.repec.org/p/cuf/wpaper/535.html

Download references

Author information

Authors and Affiliations

Authors

Corresponding authors

Correspondence to Fernando Rubiera-Morollón or Ana Gómez-Loscos.

Appendices

Appendix 1: Geometric Interpretation of the Agglomeration Index

Define n 0 = 0 and let K be the number of LLMAs in geographical area under consideration. Then

$$f_{i} = \frac{{n_{i} }}{{\sum\limits_{j = 1}^{K} {n_{j} } }} = \frac{{n_{i} }}{{\sum\limits_{j = 0}^{K} {n_{j} } }}$$
(15)

is the fraction of population residing in the ith LLMA (with f 0 = 0), and \(F_{i} = \sum\nolimits_{j = 1}^{i} {f_{j} } = \sum\nolimits_{j = 0}^{i} {f_{j} }\) is the cumulative distribution (with F 0 = 0).Footnote 19 LLMAs are assumed to be ordered from smallest to largest. The area above the curve is computed as:

$$I = \sum\limits_{i = 1}^{K} {\left( {1 - F_{i - 1} } \right)\;\left( {n_{i} - n_{i - 1} } \right)}$$
(16)

In our example, this is equal to 0.0242. Note that the first term of formula (14) is the area above the curve to the left of the first LLMA in Fig. 1. This reflects the fact that LLMAs cover the whole territory, so that the threshold between urban and non-urban is irrelevant. The first term in (15) is equal to the size of the smallest LLMA:

$$\left( {1 - F_{0} } \right)\;\left( {n_{1} - n_{0} } \right) = \left( {1 - 0} \right)\;\left( {n_{1} - 0} \right) = n_{1}$$
(17)

Equation (15) can be written as:

$$I = \sum\limits_{i = 1}^{K} {\left( {1 - \sum\limits_{j = 0}^{i - 1} {f_{j} } } \right)\;\left( {n_{i} - n_{i - 1} } \right)}$$
(18)
$$I = \sum\limits_{i = 1}^{K} {\left( {\sum\limits_{j = i}^{K} {f_{j} } } \right)\;\left( {n_{i} - n_{i - 1} } \right)}$$
(19)
$$I = \sum\limits_{i = 1}^{K} {\left( {\sum\limits_{j = i}^{K} {f_{j} } } \right)\;n_{i} } - \sum\limits_{i = 1}^{K} {\left( {\sum\limits_{j = i}^{K} {f_{j} } } \right)\;n_{i - 1} }$$
(20)

Remembering that n 0 = 0,

$$I = \sum\limits_{i = 1}^{K} {\left( {\sum\limits_{j = i}^{K} {f_{j} } } \right)\;n_{i} } - \sum\limits_{i = 2}^{K} {\left( {\sum\limits_{j = i}^{K} {f_{j} } } \right)\;n_{i - 1} }$$
(21)
$$I = \sum\limits_{i = 1}^{K} {\left( {\sum\limits_{j = i}^{K} {f_{j} } } \right)\;n_{i} } - \sum\limits_{i = 1}^{K - 1} {\left( {\sum\limits_{j = i + 1}^{K} {f_{j} } } \right)\;n_{i} }$$
(22)
$$I = f_{K} n_{K} + \sum\limits_{i = 1}^{K - 1} {\left( {\sum\limits_{j = i}^{K} {f_{j} } } \right)\;n_{i} } - \sum\limits_{i = 1}^{K - 1} {\left( {\sum\limits_{j = i + 1}^{K} {f_{j} } } \right)\;n_{i} }$$
(23)
$$I = f_{K} n_{K} + \sum\limits_{i = 1}^{K - 1} {\left( {\sum\limits_{j = i}^{K} {f_{j} } - \sum\limits_{j = i + 1}^{K} {f_{j} } } \right)\;n_{i} }$$
(24)
$$I = f_{K} n_{K} + \sum\limits_{i = 1}^{K - 1} {f_{i} n_{i} } = \sum\limits_{i = 1}^{K} {f_{i} n_{i} }$$
(25)

which is exactly Eq. (3).

Appendix 2: Transfer Principle

A key property of the index is that it correctly reflects the change in the potential for interactions and urbanization economies of any reallocation of population. This property is close to the Pigou–Dalton transfer principle for measures of inequality, which states that any change in the distribution that unambiguously reduces inequality must be reflected in a decrease in its measure.

Let Δn i represent the change in the relative population size of the ith LLMA. A reallocation of population is restricted by the condition that \(\sum\limits_{i = 1}^{k} {\Delta n_{i} = 0}\). Any reallocation can be represented as a series of reallocations between two LLMAs, and any reallocation between two LLMAs can be represented as a series of reallocations between an LLMA and the following or preceding one when LLMAs are ordered according to size. Therefore, we need only to consider a reallocation of population from the (s–1)th LLMA to the sth (from a LLMA to the next higher ranking one in terms of size):

$$\Delta ns = \, {-}\Delta ns{-} 1 { } > \, 0,{\text{ and}}\;\Delta ni = \, 0{\text{ for}}i \ne s,s{-} 1$$
(26)

According to our theoretical a priori, such a reallocation raises the potential for interactions. What effect does it have on the index?

Following Eq. (2), define:Footnote 20

$$\Delta f_{i} = \frac{{\Delta n_{i} }}{{\sum\limits_{j = 1}^{K} {n_{j} } }} = \frac{{\Delta n_{i} }}{{\sum\limits_{j = 0}^{K} {n_{j} } }}$$
(27)

where, in view of (26),

$$\sum\limits_{j = 1}^{K} {n_{j} } - \Delta n_{s - 1} + \Delta n_{s} = \sum\limits_{j = 1}^{K} {n_{j} }$$
(28)

and

$$\Delta f_{s} = - \Delta f_{s - 1} > 0$$
(29)

The value of the index after the reallocation is:

$$I^{\prime} = \sum\limits_{i \ne s - 1,s}^{{}} {f_{i} n_{i} } + \left( {f_{s - 1} - \Delta f_{s} } \right)\left( {n_{s - 1} - \Delta n_{s} } \right) + \left( {f_{s} + \Delta f_{s} } \right)\left( {n_{s} + \Delta n_{s} } \right)$$
(30)
$$I^{\prime} = \sum\limits_{i \ne s - 1,s}^{{}} {f_{i} n_{i} } + f_{s - 1} n_{s - 1} + f_{s} n_{s} + \left( {f_{s} - f_{s - 1} } \right)\Delta n_{s} + \Delta f_{s} \left( {n_{s} - n_{s - 1} } \right) + 2\Delta f_{s} \Delta n_{s}$$
(31)
$$I^{\prime} = \sum\limits_{i = 1}^{K} {f_{i} n_{i} } + \left( {f_{s} - f_{s - 1} } \right)\Delta n_{s} + \Delta f_{s} \left( {n_{s} - n_{s - 1} } \right) + 2\Delta f_{s} \Delta n_{s}$$
(32)
$$I^{\prime} = I + \left( {f_{s} - f_{s - 1} } \right)\Delta n_{s} + \Delta f_{s} \left( {n_{s} - n_{s - 1} } \right) + 2\Delta f_{s} \Delta n_{s}$$
(33)

Given that the LLMAs are ordered from the smallest to the largest, ns > ns–1, and fs > fs–1, so that I’ > I.

Appendix 3: Relationship With the Pareto Distribution

The empirically estimated exponent of the Pareto city-size distribution (a generalization of Zipf’s rank-size rule) has been used as a measure of the concentration of an urban system (Rosen and Resnick 1980). Following the notation established above, the (discrete) Pareto distribution can be written as:

$$K + 1 - i = An_{i}^{ - a}$$
(34)

where K is the number of cities (ranked from the smallest to the largest), n i is the size of city i,Footnote 21 and A and a are parameters. Parameter A can be calibrated from the size of the largest city:

$$K + 1 - K = 1 = An_{K}^{ - a}$$
(35)
$$A = n_{K}^{a}$$
(36)

Inverting (34), we obtain:

$$n_{i}^{a} = \frac{A}{K + 1 - i}$$
(37)
$$n_{i} = \left( {\frac{A}{K + 1 - i}} \right)^{{{1 \mathord{\left/ {\vphantom {1 a}} \right. \kern-0pt} a}}}$$
(38)

Total urban population is:

$$N = \sum\limits_{i = 1}^{K} {n_{i} } = \sum\limits_{i = 1}^{K} {\left( {\frac{A}{K + 1 - i}} \right)^{{{1 \mathord{\left/ {\vphantom {1 a}} \right. \kern-0pt} a}}} }$$
(39)

And so it is quite straightforward to construct a cumulative distribution similar to the one in Fig. 1 reflecting a theoretical Pareto distribution. It is then possible to apply our proposed index to a theoretical Pareto distribution using formula (3). There results

$$I = \frac{{\sum\limits_{i = 1}^{K} {\left( {\frac{A}{K + 1 - i}} \right)^{{{2 \mathord{\left/ {\vphantom {2 a}} \right. \kern-0pt} a}}} } }}{{\sum\limits_{i = 1}^{K} {\left( {\frac{A}{K + 1 - i}} \right)^{{{1 \mathord{\left/ {\vphantom {1 a}} \right. \kern-0pt} a}}} } }}$$
(40)

where we exploit the identity in Eq. (3).Footnote 22 If we assume that the number of cities K and the size of the largest city n K are fixed, then, using (36), (40) can be written as:Footnote 23

$$I = \frac{{\sum\limits_{i = 1}^{K} {\left( {\frac{{n_{K}^{a} }}{K + 1 - i}} \right)^{{{2 \mathord{\left/ {\vphantom {2 a}} \right. \kern-0pt} a}}} } }}{{\sum\limits_{i = 1}^{K} {\left( {\frac{{n_{K}^{a} }}{K + 1 - i}} \right)^{{{1 \mathord{\left/ {\vphantom {1 a}} \right. \kern-0pt} a}}} } }}$$
(41)
$$I = \frac{{\sum\limits_{i = 1}^{K} {n_{K}^{2} \left( {\frac{1}{K + 1 - i}} \right)^{{{2 \mathord{\left/ {\vphantom {2 a}} \right. \kern-0pt} a}}} } }}{{\sum\limits_{i = 1}^{K} {n_{K} \left( {\frac{1}{K + 1 - i}} \right)^{{{1 \mathord{\left/ {\vphantom {1 a}} \right. \kern-0pt} a}}} } }}$$
(42)
$$I = n_{K} \frac{{\sum\limits_{i = 1}^{K} {\left( {\frac{1}{K + 1 - i}} \right)^{{{2 \mathord{\left/ {\vphantom {2 a}} \right. \kern-0pt} a}}} } }}{{\sum\limits_{i = 1}^{K} {\left( {\frac{1}{K + 1 - i}} \right)^{{{1 \mathord{\left/ {\vphantom {1 a}} \right. \kern-0pt} a}}} } }}$$
(43)

The derivative of the index relative to the Pareto parameter is Footnote 24

$$\frac{\partial I}{\partial a} = n_{K} \left( \frac{1}{a} \right)^{2}\; \frac{{2\sum\limits_{j = 1}^{K} {\;\sum\limits_{i = 1}^{K} {\left[ {j^{{^{{ - {1 \mathord{\left/ {\vphantom {1 a}} \right. \kern-0pt} a}}} }} i^{{^{{ - {2 \mathord{\left/ {\vphantom {2 a}} \right. \kern-0pt} a}}} }} \ln i} \right]} } - \sum\limits_{j = 1}^{K} {\;\sum\limits_{i = 1}^{K} {\left[ {j^{{^{{ - {2 \mathord{\left/ {\vphantom {2 a}} \right. \kern-0pt} a}}} }} i^{{^{{ - {1 \mathord{\left/ {\vphantom {1 a}} \right. \kern-0pt} a}}} }} \ln i} \right]} } }}{{\left( {\sum\limits_{i = 1}^{K} {i^{{^{{ - {1 \mathord{\left/ {\vphantom {1 a}} \right. \kern-0pt} a}}} }} } } \right)^{2} }}$$
(44)

The sign of that derivative is the sign of its numerator, but we could not determine that sign analytically. Using numerical simulations,Footnote 25 we obtain that the derivative is negative for low values of a, and positive for high values. The sign reversal of the derivative is explained by the fact that, for a given number of cities, the size of the smallest city under the rank-size rule, \(n_{1} = n_{K} K^{{^{{ - {1 \mathord{\left/ {\vphantom {1 a}} \right. \kern-0pt} a}}} }}\), increases with a, leaving a larger gap to the left of the first point on the cumulative distribution (see Fig. 1). Referring to index computation formula (7), it is easily verified that its first term is equal to n 1. Indeed, our numerical simulations confirm that, if that first term is omitted, our index is a monotonically decreasing function of parameter a. This is illustrated in Fig. 7).

Fig. 7
figure 7

Relationship of the proposed index to the Pareto elasticity parameter

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Lemelin, A., Rubiera-Morollón, F. & Gómez-Loscos, A. Measuring Urban Agglomeration: A Refoundation of the Mean City-Population Size Index. Soc Indic Res 125, 589–612 (2016). https://doi.org/10.1007/s11205-014-0846-9

Download citation

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11205-014-0846-9

Keywords

JEL Classification

Navigation