Abstract
In this paper, we put forth the view that the potential for urbanization economies increases with interaction opportunities. From that premise follow three fundamental properties that an agglomeration index should possess: (1) to increase with the concentration of population and conform to the Pigou–Dalton transfer principle; (2) to increase with the absolute size of constituent population interaction zones; and (3) to be consistent in aggregation. Limiting our attention to pairwise interactions, and invoking the space-analytic foundations of local labor market area (LLMA) delineation, we develop an index of agglomeration based on the number of interaction opportunities per capita in a geographical area. This leads to Arriaga’s mean city-population size, which is the mathematical expectation of the size of the LLMA in which a randomly chosen individual lives. The index has other important properties. It does not require an arbitrary population threshold to separate urban from non-urban areas. It is easily adapted to situations where an LLMA lies partly outside the geographical area for which agglomeration is measured. Finally, it can be satisfactorily approximated when data is truncated or aggregated into size-classes. We apply the index to the Spanish NUTS III regions, and evaluate its performance by examining its correlation with the location quotients of several knowledge intensive business services known to be highly sensitive to urbanization economies. The Arriaga index’s correlations are clearly stronger than those of either the classical degree of urbanization or the Hirshman–Herfindahl concentration index.
Similar content being viewed by others
Notes
The number of distinct pairs in a group of n individuals is equal to n(n–1)/2. If n is sufficiently large, this can be approximated by n 2/2. More generally, combinatorics shows that the number of interactions involving two persons or more increases more than proportionately with the size of the group.
Among which (in alphabetical order): Canada (Munro et al. 2011); France (DATAR-DARES-INSEE 2011); Italy (ISTAT 1997, 2005, 2006; Sforzi 2012); New Zealand (Statistics New Zealand 2009; Goodyear 2008; Papps and Newell 2002); Portugal (Alfonso and Venâncio 2013); Spain (Rubiera and Viñuela 2012; Boix and Galleto 2006); United Kingdom (Bond and Coombes 2007; ONS 2007); USA (Tolbert and Sizer 1996; USDA ERS 2012).
However, our index does not take into account negative congestion externalities. Referring to Capello and Camagni (2000), it could be said that our index is one component of a “city effect indicator” of positive externalities, while leaving aside congestion externalities which could be accounted for in an “urban overload indicator”.
Although frequently applied as a index of urban concentration within a country, it was originally proposed as a measure of market concentration, or market power.
Arriaga investigates the implications of using a truncated index which ignores the bottom end of the size distribution of agglomerations, and concludes that a truncated index is a good approximation, under mildly restrictive hypotheses. But in our case, there is no truncation, because the LLMAs cover the whole territory. Lemelin et al. (2012, Appendix 1) present a version of the index that is based on information aggregated by size classes and therefore deals with truncation.
We assume that each individual has an equal probability of being chosen, in which case relative frequencies are correctly interpreted as probabilities.
Adelman (1969) has shown the “numbers-equivalent” property of the Hirchman–Herfindahl index (H): its inverse (1/H) can be interpreted as the number of equal-sized LLMAs which would exhibit a concentration level equal to H.
It does not, however, increase monotonically with the Pareto parameter interpreted as a measure of concentration. The reason for this is exposed in the Appendix 2.
Such is the case in Spain, for example, where several LLMAs include spatial units that are located in more than one province.
Lemelin et al. (2012, Appendix 1).
This is the most recent data as the 2011 Spanish Population and Housing Census (INE) is not yet available.
In Appendix 2 of Lemelin et al. (2012), this is compared with an index for which each LLMA has been attributed in its entirety to the province where its centroid is located, which is tantamount to redrawing provincial boundaries. Interestingly, at least in the Spanish case, the two versions of the index are tightly correlated across the 52 provinces.
In addition, we related the three measures of agglomeration to GDP per capita, and found a higher correlation with the Arriaga index. But the correlations were not spectacular, reflecting the fact that other determinants also play a major role.
The reasons for the concentration of such services in large metropolitan areas are strongly connected with the presence of different types of effects directly derived from the existence of agglomeration economies. The diversity and rapidly changing nature of talents and know-how mean that only the largest cities will provide the necessary specialized labor pool. Such industries are, in other words, dependent on a constant stream of face–to–face meetings with a wide (and changing) range of individuals that only can occur in cities, but better in large cities. See Daniels (1985), Illeris (1996), Shearmur and Doloreux (2008), Polèse et al. (2007), Wernerheim and Sharpe (2003), among many others.
Note that the f ip are independent of the scaling, since both the numerator and denominator are divided by the denominator in equation (1).
The argument that follows can be generalized, albeit laboriously, to the version of the index that deals with the boundary problem. See Appendix 3 of Lemelin et al. (2012).
It makes little difference whether city sizes are absolute or relative to some benchmark, such as Madrid above.
Here, we ignore the boundary problem, which the Pareto distribution approach does not handle anyway.
Note that the denominator of (40) is a CES aggregator function.
The interested reader can find the proof in Lemelin et al. (2012, Appendix 4).
Spreadsheet calculations were performed for values of K from 1 to 100, and a from 0.01 to 2 in increments of 0.01.
References
Adelman, M. A. (1969). Comment on the “H” concentration measure as a numbers-equivalent. The Review of Economics and Statistics, 51(1), 99–101.
Alfonso, A., & Venâncio, A. (2013). The relevance of commuting zones for regional spending efficiency. Working Paper 17/2013/DE/UECE/ADVANCE, Department of Economics, School of Economics and Management, Technical University of Lisbon. http://pascal.iseg.utl.pt/~depeco/wp/wp172013.pdf 2014-01-10
Arriaga, E. E. (1970). A new approach to the measurements of urbanization. Economic Development and Cultural Change, 18(2), 206–218.
Arriaga, E. E. (1975). “Selected measures of urbanization”, Chap. II. In S. Goldstein, D. F. Sly (Eds.), The measurement of urbanization and projection of urban population, Working Paper 2, International Union for the scientific study of population. Committee on urbanization and population redistribution. Ordina Editions, Dolhain, Belgium.
Boix, R., & Galleto, V. (2006). Identificación de Sistemas Locales de Trabajo y Distritos Industriales en España. Dirección General de Política de la Pequeña y Mediana Empresa, Ministerio de Industria, Comercio y Turismo.
Bond, S., & Coombes, M. (2007). 2001-based Travel-To-Work Areas Methodology. Office for National Statistics. Retrieved from 13 January 2013. http://www.ons.gov.uk
Brülhart, M., & Sbergami, F. (2009). Agglomeration and growth: Cross-country evidence. Journal of Urban Economics, 65, 48–63.
Brülhart, M., & Traeger, R. (2005). An account of geographic concentration patterns in Europe. Regional Science and Urban Economics, 35, 597–624.
Capello, R., & Camagni, R. (2000). Beyond optimal city size: An evaluation of alternative growth patterns. Urban Studies, 37(9), 1479–1496.
Casado-Díaz, J. M. (2000). Local labour market areas in Spain: A case study. Regional Studies, 34, 843–856.
Cowell, F. A. (2009). Measuring inequality, LSE perspectives on economic analysis. Oxford: Oxford University Press.
Crédit Suisse Research Institute. (2012). Opportunities in an Urbanizing World, Zurich, Switzerland. Retrieved from 16 Janaury 2014. https://www.credit-suisse.com/ch/fr/news-and-expertise/research/credit-suisse-research-institute/publications.html
Dalton, H. (1920). The measurement of the inequality of incomes. The Economic Journal, 30(119), 348–361.
Daniels, P. (1985). Service industries: A geographical perspective. New York: Methuen.
DATAR-DARES-INSEE (2011). Atlas des zones d’emploi 2010. Délégation interministérielle à l’Aménagement du Territoire et à l’Attractivité régionale (DATAR), Direction de l’Animation de la Recherche, des Études et des Statistiques (DARES) and Institut National de la Statistique et des Études Économiques (INSEE). Retrieved from 13 January 2014. http://www.insee.fr/fr/themes/detail.asp?reg_id=0&ref_id=atlas-zone-emploi-2010
Excerpts from this document are downloadable from F. Sforzi’s homepage. Retrieved from 23 January 2014. http://economia.unipr.it/DOCENTI/SFORZI/docs/files/SISTEMI_LOCALI.PDF
Fernández, E., & Rubiera, F. (2012). Defining the spatial scale in modern economic analysis: New challenges from data at local level. Advances in spatial science series. Berlin: Springer.
Glaeser, E. L., Kallal, H. D., Scheinkman, J. A., & Schleifer, A. (1992). Growth in cities. Journal of Political Economy, 100(6), 1126–1152.
Goodyear, R. (2008). Workforces on the move: An examination of commuting patterns to the cities of Auckland, Wellington and Christchurch. Paper presented at NZAE conference, Wellington City, New Zealand, July 2008. Statistics New Zealand. Retrieved from 10 January 2014. http://www.stats.govt.nz/methods/research-papers/nzae/nzae-2008/workforces-on-the-move.aspx
Henderson, J. V. (1988). Urban development: Theory, fact and illusion. Oxford: Oxford University Press.
Henderson, J. V. (2003a). Urbanization and economic development. Annals of Economics and Finance, 4, 275–341.
Henderson, J. V. (2003b). The urbanization process and economic growth: The So-What question. Journal of Economic Growth, 8, 47–71.
Hoover, E. M. (1937). Location theory and the shoe and leather industry. Cambridge, MA: Harvard University Press.
Illeris, S. (1996). The service economy: A geographical approach. Chichester, U.K.: Wiley.
INE. (2001). Censo de Población, 2001. Instituto Nacional de Estadística. http://www.ine.es
Isard, W. (1956). Location and space-economy. Cambridge, MA: The Technology Press of Massachusetts, Institute of Technology.
ISTAT. (1997). I Sistemi Locali del Lavoro 1991. A cura di F. Sforzi, Collana Argomenti 10, Instituto Nazionale di Statistica, Roma.
ISTAT. (2005). I Sistemi Locali del Lavoro. Censimento 2001. Dati definitivi, Instituto Nazionale di Statistica. Retrieved from 10 January 2014. http://dawinci.istat.it/daWinci/jsp/MD/download/sll_comunicato.pdf
ISTAT. (2006). 8o Censimento generale dell’industria e dei servizi. Distretti industriali e sistemi locali del lavoro 2001. Collana Censimenti, Instituto Nazionale di Statistica. Retrieved from 10 January 2014. http://www.istat.it/it/files/2011/01/Volume_Distretti1.pdf
Lemelin, A., Rubiera-Morrollón F., & Gómez-Loscos, A. (2012). « A territorial index of potential agglomeration economies from urbanization » , Montréal, INRS-UCS, coll. Inédits, 2012–03. http://www.ucs.inrs.ca/sites/default/files/centre_ucs/pdf/Inedit03-12.pdf
Munro, A., Alasia, A., & Bollman, R. D. (2011). “Self-contained labour areas: A proposed delineation and classification by degree of rurality”, Rural and Small Town Canada Analysis Bulletin, 8(8), Catalogue no. 20-006-X, Statistics Canada. Retrieved from 16 January 2014. http://www.statcan.gc.ca/pub/21-006-x/21-006-x2008008-eng.htm
Ohlin, B. (1933). Interregional and internal trade. Cambridge, MA: Harvard University Press.
ONS. (2007). Introduction to the 2001-based Travel-to-Work Areas. ONS. Retrieved from 13 January 2013. http://www.ons.gov.uk
Papps, K. L., & Newell, J. O. (2002). Identifying functional labour market areas in New Zealand: A reconnaissance study using travel-to-work data. Discussion Paper No. 443, IZA (Institute for the study of labor), Bonn, Germany. Retrieved from 10 January 2014. http://ftp.iza.org/dp443.pdf
Pigou, A.C. (1912). Wealth and Welfare. MacMillan, London. Retrieved from 16 January 2014. https://archive.org/details/cu31924032613386
Polèse, M., Shearmur, R., & Rubiera, F. (2007). Observing regularities in location patters. An analysis of the spatial distribution of economic activity in Spain. European Urban and Regional Studies, 14(2), 157–180.
Rosen, K., & Resnick, M. (1980). The size distribution of cities: An examination of the pareto law and primacy. Journal of Urban Economics, 8, 165–186.
Rubiera, F., & Viñuela, A. (2012). “From funtional areas to analytical regions, where the agglomeration economies make sense”, Chap. 2, p. 23–44. In: E. Fernández, F. Rubiera (Eds.) Defining the spatial scale in modern economic analysis: New challenges from data at local level. Springer.
Sforzi, F. (2012). “From administrative spatial units to local labour market areas”, Chap. 1, p. 3–21. In: E. Fernández, F. Rubiera (Eds.) Defining the spatial scale in modern economic analysis: New challenges from data at local level. Springer.
Shearmur, R., & Doloreux, D. (2008). Urban hierarchy or local milieu? High-order producer service and (or) knowledge-intensive business service location in Canada, 1991–2001. Professional Geographer, 60(3), 333–355.
Spence, M., Annez, P. C., & Buckley, R. M. (2009). Urbanization and Growth. Washington, D.C: Commission on Growth and Development, The World Bank.
Statistics New Zealand. (2009). Workforces on the move: Commuting patterns in New Zealand. Statistics New Zealand. Retrieved from 14 January 2014. http://www.stats.govt.nz/browse_for_stats/people_and_communities/Geographic-areas/commuting-patterns-in-nz-1996-2006.aspx
Tolbert, C. M.& Sizer, M. (1996). U.S. Commuting Zones and Local Market Areas. A 1990 Update. ERS Staff Paper, Rural Economy Division, Economic Research Service, U.S. Department of Agriculture. Retrieved from 10 January 2014. https://usa.ipums.org/usa/resources/volii/cmz90.pdf
Uchida, H., & Nelson, A. (2010). Agglomeration index: Towards a new measure of urban concentration. In J. Beall, B. Guha-Khasnobis, & R. Kanbur (Eds.), Urbanization and development. Oxford: Oxford University Press.
USDA ERS. (2012). “Commuting Zones and Labor Market Areas: Documentation” (Web document), Economic Research Department, U.S. Department of Agriculture. Retrieved from 14 January 2014. http://www.ers.usda.gov/data-products/commuting-zones-and-labor-market-areas/documentation
Weber, A. (1909). Ûber den Standort der Industrien. Mohr, TÏbingen; translated by Friedrich, C. J. (1929) as Alfred Weber’s Theory of the Location of Industries, Chicago, IL: University of Chicago Press.
Wernerheim, M., & Sharpe, C. (2003). High-order producer services in metropolitan Canada: How footloose are they? Regional Studies, 37, 469–490.
Wheaton, W., & Shishido, H. (1981). Urban concentration, agglomeration economies, and the level of economic development. Economic Development and Cultural Change, 30, 17–30.
Zhu, Nong, Xubei Luo and Heng-fu Zou (2012): Regional differences in China’s urbanization and its determinants, CEMA Working Papers 535, China Economics and Management Academy, Central University of Finance and Economics. Retrieved from 21 January 2014. http://ideas.repec.org/p/cuf/wpaper/535.html
Author information
Authors and Affiliations
Corresponding authors
Appendices
Appendix 1: Geometric Interpretation of the Agglomeration Index
Define n 0 = 0 and let K be the number of LLMAs in geographical area under consideration. Then
is the fraction of population residing in the ith LLMA (with f 0 = 0), and \(F_{i} = \sum\nolimits_{j = 1}^{i} {f_{j} } = \sum\nolimits_{j = 0}^{i} {f_{j} }\) is the cumulative distribution (with F 0 = 0).Footnote 19 LLMAs are assumed to be ordered from smallest to largest. The area above the curve is computed as:
In our example, this is equal to 0.0242. Note that the first term of formula (14) is the area above the curve to the left of the first LLMA in Fig. 1. This reflects the fact that LLMAs cover the whole territory, so that the threshold between urban and non-urban is irrelevant. The first term in (15) is equal to the size of the smallest LLMA:
Equation (15) can be written as:
Remembering that n 0 = 0,
which is exactly Eq. (3).
Appendix 2: Transfer Principle
A key property of the index is that it correctly reflects the change in the potential for interactions and urbanization economies of any reallocation of population. This property is close to the Pigou–Dalton transfer principle for measures of inequality, which states that any change in the distribution that unambiguously reduces inequality must be reflected in a decrease in its measure.
Let Δn i represent the change in the relative population size of the ith LLMA. A reallocation of population is restricted by the condition that \(\sum\limits_{i = 1}^{k} {\Delta n_{i} = 0}\). Any reallocation can be represented as a series of reallocations between two LLMAs, and any reallocation between two LLMAs can be represented as a series of reallocations between an LLMA and the following or preceding one when LLMAs are ordered according to size. Therefore, we need only to consider a reallocation of population from the (s–1)th LLMA to the sth (from a LLMA to the next higher ranking one in terms of size):
According to our theoretical a priori, such a reallocation raises the potential for interactions. What effect does it have on the index?
Following Eq. (2), define:Footnote 20
where, in view of (26),
and
The value of the index after the reallocation is:
Given that the LLMAs are ordered from the smallest to the largest, ns > ns–1, and fs > fs–1, so that I’ > I.
Appendix 3: Relationship With the Pareto Distribution
The empirically estimated exponent of the Pareto city-size distribution (a generalization of Zipf’s rank-size rule) has been used as a measure of the concentration of an urban system (Rosen and Resnick 1980). Following the notation established above, the (discrete) Pareto distribution can be written as:
where K is the number of cities (ranked from the smallest to the largest), n i is the size of city i,Footnote 21 and A and a are parameters. Parameter A can be calibrated from the size of the largest city:
Inverting (34), we obtain:
Total urban population is:
And so it is quite straightforward to construct a cumulative distribution similar to the one in Fig. 1 reflecting a theoretical Pareto distribution. It is then possible to apply our proposed index to a theoretical Pareto distribution using formula (3). There results
where we exploit the identity in Eq. (3).Footnote 22 If we assume that the number of cities K and the size of the largest city n K are fixed, then, using (36), (40) can be written as:Footnote 23
The derivative of the index relative to the Pareto parameter is Footnote 24
The sign of that derivative is the sign of its numerator, but we could not determine that sign analytically. Using numerical simulations,Footnote 25 we obtain that the derivative is negative for low values of a, and positive for high values. The sign reversal of the derivative is explained by the fact that, for a given number of cities, the size of the smallest city under the rank-size rule, \(n_{1} = n_{K} K^{{^{{ - {1 \mathord{\left/ {\vphantom {1 a}} \right. \kern-0pt} a}}} }}\), increases with a, leaving a larger gap to the left of the first point on the cumulative distribution (see Fig. 1). Referring to index computation formula (7), it is easily verified that its first term is equal to n 1. Indeed, our numerical simulations confirm that, if that first term is omitted, our index is a monotonically decreasing function of parameter a. This is illustrated in Fig. 7).
Rights and permissions
About this article
Cite this article
Lemelin, A., Rubiera-Morollón, F. & Gómez-Loscos, A. Measuring Urban Agglomeration: A Refoundation of the Mean City-Population Size Index. Soc Indic Res 125, 589–612 (2016). https://doi.org/10.1007/s11205-014-0846-9
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11205-014-0846-9