Skip to main content

Measuring Education Inequality

  • Chapter
  • First Online:
Measuring Education Inequality in Developing Countries
  • 704 Accesses

Abstract

This chapter describes the measurement of education inequality that Torpey-Saboe has developed: a Gini coefficient for years of schooling. This Gini coefficient is derived using data from Barro and Lee (Journal of Development Economics 2013, pp. 184–198) on educational attainment for various portions of the population and United Nations Educational, Scientific and Cultural Organization (UNESCO) data on the duration of primary and secondary schooling in countries around the world. Using these datasets, Torpey-Saboe calculates the shares of the population that have attained corresponding shares of the total years of education in the country and constructs a Gini coefficient for education inequality. This measure is unique in that it is the only measure of education inequality suitable for looking at changes in the distribution of education over time and for exploring potential causes, rather than consequences, of education inequality.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

eBook
USD 16.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 54.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    The data are also broken up according to age cohorts: the population age 20–24, age 25–29, age 30–34, and so on. In order to avoid using repeated information, I use the 25–29-year-old cohort from each year, since by this age educational attainment is typically complete. I then use information on the percentage of this cohort that has completed various levels of education in order to calculate what percentage of the population has what percentage of total educational attainment and to construct the Gini coefficient. More details about the Gini coefficient can be found in the next section on measuring inequality. Please see the appendix to this chapter for further notes on the data and calculation of this measure.

References

  • Araar, A. (2006). On the decomposition of the Gini coefficient: An exact approach, with an illustration using Cameroonian data. Cahier de recherche/Working Paper, 06-02.

    Google Scholar 

  • Atkinson, A. B. (1970). On the measurement of inequality. Journal of Economic Theory, 2(3), 244–263.

    Article  Google Scholar 

  • Barro, R. J., & Lee, J. W. (2013). A new data set of educational attainment in the world, 1950–2010. Journal of Development Economics, 104, 184–198.

    Article  Google Scholar 

  • Conceicao, P., & Ferreira, P. (2000, February 29). The young person’s guide to the theil index: Suggesting intuitive interpretations and exploring analytical applications. UTIP Working Paper No. 14. Retrieved from SSRN https://ssrn.com/abstract=228703 or https://doi.org/10.2139/ssrn.228703.

  • De Maio, F. G. (2007). Income inequality measures. Journal of Epidemiology & Community Health, 61(10), 849–852.

    Article  Google Scholar 

  • Litchfield, J. A. (1999). Inequality: Methods and tools. World Bank, 4.

    Google Scholar 

  • Newbery, D. (1970). A theorem on the measurement of inequality. Journal of Economic Theory, 2(3), 264–266.

    Article  Google Scholar 

  • Rothschild, M., & Stiglitz, J. E. (1973). Some further results on the measurement of inequality. Journal of Economic Theory, 6(2), 188–204.

    Article  Google Scholar 

  • Sen, A. (1976). Poverty: An ordinal approach to measurement. Econometrica: Journal of the Econometric Society, 44, 219–231.

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Appendix

Appendix

Notes on the Use of Barro and Lee Data for Calculating the Gini Coefficient of Education Inequality

Using the entire 1950–2010 dataset of all age cohorts would be problematic because the information would be duplicated across various years of surveys. For example, the 25–29-year-old cohort in 2000 would become the 30–34-year-old cohort for 2005. Another option would have been to use all of the age cohorts in a survey from a single year (e.g. 2010), but this has the disadvantage that in many countries the older cohorts have begun dying off, and the less educated may have shorter life expectancy than the more educated, thus presenting an inaccurate picture of education distribution by making the older cohorts look more educated than they actually were. Thus, using the surveys from 1950 to 2010 and the 25–29-year-old cohort means that the dependent variable captures the educational attainment of those born between 1921 and 1985.

The population percentages are mainly given by the Barro and Lee data, but a few categories had to be calculated since some of the Barro and Lee categories overlap. For example, Barro and Lee estimate the percentage of the population attaining some primary education and the percentage of the population that completes primary school, but this latter category also includes all of the former. Therefore, I calculated the following mutually exclusive categories: never attended school, attended some primary, completed primary, attended some secondary, completed secondary, attended some tertiary, and completed tertiary. These categories are all percentages of the population and sum to 100. I then calculated the share of total years of education that each category attained. In order to do this, I had to put a numerical value to the different categories. Data on duration of primary and secondary schooling for various countries is available through UNESCO. Since this data is not available for tertiary systems, I estimate partial tertiary education as primary + secondary +2 years and complete tertiary education as primary + secondary + 4 years.

Notes on Measures of Inequality

Litchfield (1999) provides a good summary of the axioms upon which most scholars have agreed that an inequality measure should meet:

  1. 1.

    The Pigou-Dalton Transfer Principle:

Inequality should rise with transfers from poor to rich and fall with transfers from rich to poor. This principle captures the “Robin Hood” idea that if one were to take from the rich and give to the poor, inequality would decrease. Any summary measure of inequality that we use should rank inequality to be lower after a transfer from rich to poor or higher after a transfer from poor to rich. Clearly, education is different from income in that it cannot be redistributed from one person to another, since once someone has attained a level of education it cannot be taken away. What transfers would refer to in this case is more hypothetical: if we are comparing two different distributions and one could arrive at distribution B by starting with distribution A and then making transfers from poor to rich, then distribution B is less equal than distribution A.

  1. 2.

    Scale Independence:

The measure of inequality should not change if the scale of the underlying quantity changes. For example, if income or education doubled for everyone in the population, inequality should remain the same.

  1. 3.

    Principle of Population:

The measure of inequality should not be sensitive to replications of the population. For example, if there were two “twin” countries, whose distribution of education was identical, and these countries merged to form one country, the measure of inequality should not change.

  1. 4.

    Anonymity/Symmetry:

The inequality measure should be independent of any other attribute other than the distribution being measured. This means that the measure of inequality should not depend on to whom the good is distributed. For example, if the whole population was standing in a circle and everyone could give their education (or income, wealth, etc.) to the person standing next to him or her, the overall measure of inequality should remain unchanged.

  1. 5.

    Decomposability:

If inequality rises among each sub-group of the population, overall inequality should also rise (and vice versa). This means that if the educational attainment gap between groups remains stable, but inequality rises within the groups, overall inequality should also rise.

The most commonly used measures of inequality are: variance, decile ratios, relative mean deviation, coefficient of variation, the Gini coefficient, and the Theil index. Other possibilities for consideration are the standard deviation of logarithms and the mean log deviation; however, I have not included them as these measures are only defined for strictly positive values, and cannot be calculated from my data due to cases in which some proportion of the population has 0 education. First, I will give the explanations and calculations for these indices. Next, I will examine how well they conform to the axioms laid out above.

Variance

$$ {\sigma}^2=\frac{1}{N}\sum \limits_{t=1}^N{\left({x}_i-\mu \right)}^2 $$

This is the average of the squared deviations from the mean. It captures the distribution’s spread.

Decile Ratios

$$ Ratio=\frac{\pi_a}{\pi_b} $$

These are calculated simply by taking the ratio of whatever proportion of the outcome variable (income, education, etc.) is controlled by a certain decile (e.g. the top 10 percent) over that which is controlled by another decile (e.g. the bottom 10 percent). In this way, a ratio of 50 would indicate that the education (or income, wealth, etc.) of the top 10 percent is 50 times higher than the wealth of the bottom 10 percent.

Relative Mean Deviation

$$ RMD=\frac{\frac{1}{N}\sum \limits_{i=1}^N\left|{x}_1-\overline{x}\right|}{\left|\overline{x}\right|} $$

This measure takes the average absolute deviation and divides it by the mean. Dividing by the mean allows for comparison across distributions with different means or different ranges. For example, primary and secondary school last only 10 years in Nepal, but 13 years in Sri Lanka. In order to compare inequality across the two systems, it is necessary to standardize the measure to account for the bigger range of educational attainment in Sri Lanka.

In evaluating these measures, a few immediately fail to achieve the desired qualities. Variance is dependent on scale (fails the test of scale independence), meaning that if educational attainment were to increase by 50 percent for everyone in society, the variance would automatically increase as well. This makes variance an inappropriate choice for comparing inequality among distributions with different ranges.

Decile ratios only capture what is going on at the specific cut-off levels chosen, but fail to take into account transfers in the rest of the distribution. For example, in a 90/10 decile ratio, even if the 20th percentile had to give some of their assets to those in the 80th percentile, the decile ratio could remain unchanged as long as the 90th and 10th percentile groups were not changed. This measure therefore fails the Pigou-Dalton Transfer Principle.

Similarly, relative mean deviation is not sensitive to transfers that are made on the same side of the mean (Atkinson 1970). Transfers could be made from the very worst off to those in the 55th percentile with no recorded change in inequality, thus again failing the Pigou-Dalton Transfer Principle.

The Gini coefficient satisfies Litchfield’s axioms 1–4 and axiom 5 as long as the population sub-groups do not overlap (Litchfield 1999). This means that as long as the worst-off people in one group are all still better than the best-off people in another group, the Gini coefficient will still pass the test of decomposability. Where this might be a problem would be if data were grouped according to criteria other than the distribution of the good itself, such as ethnicity or gender. The way my dataset is built, based on the attainment of different levels of education, this is not a problem.

The coefficient of variation and the Theil index also satisfy all of the principles named above. These two measures are both special cases in the family of Generalized Entropy (GE) index measures, based on information theory. The general formula for GE measures is:

$$ GE\left(\alpha \right)=\frac{1}{\alpha^2-\alpha}\left[\frac{1}{N}\sum \limits_{t=1}^N{\left(\frac{y_i}{\overline{y}}\right)}^{\alpha }-1\right] $$

GE(0) is the mean log deviation, GE(1) is the Theil index and GE(2) is 1/2 the squared coefficient of variation (Litchfield 1999). This class of indicators, unlike the Gini coefficient, has the advantage of being decomposable/additively separable—that is, total inequality can be broken down into the sum of the inequality of different sub-groups (De Maio 2007). Ways of decomposing the Gini coefficient have been found (Araar 2006), but it is a considerably more complex process.

Some critics have argued that the Gini coefficient is not an appropriate measure of inequality for a few reasons. Atkinson argues that all inequality measures include inherent value judgments, and that the best way to approach this is to explicitly specify a social welfare function that includes an inequality aversion parameter, ε. The higher the value of ε, the more sensitive the index becomes to inequalities at the bottom of the distribution. He argues that this approach is more transparent than using an index that purports to be value-free, but is in fact based on particular assumptions about what constitutes more or less inequality (1970). The GE measures, like Atkinson index measures, include a parameter to weight inequalities in different parts of the distribution. For the Gini coefficient, however, not only is such a parameter not part of the formula, but Newberry has shown that the Gini coefficient does not rank inequality in a manner consistent with any additive utility function (Newbery 1970, p. 264). In other words, Atkinson and Newberry argue that if we assume that equality is valued and contributes to social welfare, then we should rank distributions according to a utility function whereby greater equality gives greater social welfare. Additivity simply means that the whole is equal to the sum of the parts.

Nevertheless, Sen (1976) and others defend the use of the Gini coefficient as a good measure of inequality . Furthermore, although the Gini coefficient on its own cannot be written as a utility function as described by Rothschild and Stiglitz (1973) show that the Gini coefficient can still be “part of an equality preferring social welfare function” (p. 199). For this reason and due to its place as the most widely used and widely recognized measure of inequality, I will retain the use of the Gini coefficient, while supplementing it with other measures, such as the coefficient of variation and the Theil index. By employing all three, I can test the robustness of my findings and get a more nuanced picture than could be obtained by sticking with a single measure of inequality (De Maio 2007).

Table 4.2 illustrates the correlation between the Gini index, the Theil index, the coefficient of variation, and the educational attainment of various deciles. The Gini index and the Theil index are highly correlated, at 0.92, but the Gini and the coefficient of variation are only correlated at 0.56. The Theil index is very closely correlated with the coefficient of variation: 0.98.

Table 4.2 Correlation of measures of education inequality

As mentioned before, these measures have different sensitivities to inequality in different parts of the distribution. The Gini coefficient is more sensitive to transfers in the middle of the distribution, the Theil index is sensitive to transfers at either the lower end or the upper end, and the coefficient of variation is equally sensitive to transfers anywhere in the distribution (Atkinson 1970). When the Lorenz curves are far apart, this does not matter much, and all three measures will agree on inequality rankings. When the Lorenz curves intersect, however, different inequality measures may give different rankings. In practice, the three indicators agree on rankings in the vast majority of cases. Nevertheless, as a robustness check, I will run different models to include each one of these measures as the dependent variable. See Table 4.3 for Education Gini coefficients by country and year.

Table 4.3 Education Gini coefficient by country and year

Rights and permissions

Reprints and permissions

Copyright information

© 2019 The Author(s)

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Torpey-Saboe, N. (2019). Measuring Education Inequality. In: Measuring Education Inequality in Developing Countries. Springer, Cham. https://doi.org/10.1007/978-3-319-90629-4_4

Download citation

Publish with us

Policies and ethics