Skip to main content

Advertisement

Log in

Gender Wage Gap Accounting: The Role of Selection Bias

  • Published:
Demography

Abstract

Mulligan and Rubinstein (2008) (MR) argued that changing selection of working females on unobservable characteristics, from negative in the 1970s to positive in the 1990s, accounted for nearly the entire closing of the gender wage gap. We argue that their female wage equation estimates are inconsistent. Correcting this error substantially weakens the role of the rising selection bias (39 % versus 78 %) and strengthens the contribution of declining discrimination (42 % versus 7 %). Our findings resonate better with related literature. We also explain why our finding of positive selection in the 1970s provides additional support for MR’s main hypothesis that an exogenous rise in the market value of unobservable characteristics contributed to the closing of the gender gap.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

Notes

  1. MR fleshed out this mechanism via the Gronau-Heckman-Roy model, which provides a theory for the participation decision in the standard Heckman model. The dependence of the selection bias on the price of unobservable characteristics is derived.

  2. Bailey et al. (2012) summarized this literature. The GG closing can be explained by the introduction of anti-discrimination laws and the change in attitudes toward women’s labor market participation (Fernandez and Fogli 2009; Fortin 2015), women catching up to men in terms of their education and labor market experience (Goldin 2006; Goldin and Katz 2011; O’Neill and Polachek 1993), the technological change that depressed wages in typically male-dominated occupations (Black and Juhn 2000; Black and Spitz-Oener 2010; Blau and Kahn 1997), the change in family structure and the availability of birth control (Bailey et al. 2012; Stevenson and Wolfers 2007), and the change in the unobservable skill composition of working women (Mulligan and Rubinstein 2008).

  3. The strong negative association of female participation with spousal income is especially pronounced in the earlier decades (Bar and Leukhina 2011; Blau and Kahn 2007; Mullligan and Rubinstein 2008).

  4. Following MR, we are referring to the two-sector model presented by Roy (1951), as applied by Gronau (1974) and Heckman (1974) to the allocation of women between market and home sectors.

  5. ϕ(⋅) denotes the standard normal density function, and Φ(⋅) is the standard normal cumulative distribution function.

  6. Recall that λ(a) gives the mean of a standard normal random variable, truncated from below at −a. It is therefore a decreasing function.

  7. One exception that would not produce an error in predicted inverse Mills ratios is the case of I h given by a linear combination of components in Z.

  8. We omit 25 % of the MR sample because of the marital restriction and another 1 % because of the restriction that spousal income is positive.

  9. Angrist and Evans (1998) questioned the number of small children as a valid exclusion restriction. However, even after controlling for endogeneity, they showed that some causal effect on female participation remained. For this reason, we retain the number of small children as one of the exclusion restrictions. We repeated our analysis without this restriction and found that the role of changing selection was diminished further.

  10. We follow MR’s definition of outliers, excluding observations with measured hourly wages below $2.80 per hour in 2000 dollars and below the 1st percentile value of the FTFY male hourly wage distribution (which includes wages for nonwhite never-married men aged 18–65) or above the 98th percentile value of the same distribution.

  11. 11 See Oaxaca (1973), Blinder (1973), and Blau and Kahn (1997) for similar ways to decompose the GG.

  12. While we focus on the decomposition of the GG closing at the mean, Fortin and Lemieux (1998) investigated the closing at each point of the wage distribution.

  13. For example, restricted access to high-paying occupations or high-paying tasks can be viewed as a form of discrimination.

  14. As is typical in the literature, we use years of potential experience as a proxy for the actual experience, which is unavailable in the CPS. See Oaxaca (2009) for a discussion of the consequences of using potential experience as a proxy for the actual experience.

  15. Schwiebert (2013), using a different methodology, also found that selection bias was positive in the 1970s.

  16. In the 1990s, the average \(\frac {\partial B}{\partial {\upsigma }_{u}}\) in the benchmark and the MR models are given by 0.75 and 0.57, respectively. Because σ v cannot be identified, the effects reported here are for the case of σ v = 1. We varied σ v over a wide range and found that the relative magnitude of the effects was not affected significantly.

  17. In the original MR sample, which does not exclude unmarried women, the selection bias estimate was even more negative for the 1970s.

References

  • Angrist, J., & Evans, W. (1998). Children and their parents’ labor supply: Evidence from exogenous variation in family size. American Economic Review, 88, 450–477.

    Google Scholar 

  • Bailey, M., Hershbein, B., & Miller, A. (2012). The opt-in revolution? Contraception and the gender gap in wages. American Economic Journal: Applied Economics, 4, 225–254.

    Google Scholar 

  • Bar, M., & Leukhina, O. (2011). On the time allocation of married couples since 1960. Journal of Macroeconomics, 33, 491–510.

    Article  Google Scholar 

  • Black, S., & Juhn, C. (2000). The rise of female professionals: Are women responding to skill demand? American Economic Review, 90, 450–455.

    Article  Google Scholar 

  • Black, S., & Spitz-Oener, A. (2010). Explaining women’s success: Technological change and the skill content of women’s work. Review of Economics and Statistics, 92, 187–194.

    Article  Google Scholar 

  • Blau, F., & Kahn, L. (1997). Swimming upstream: Trends in the gender wage differential in the 1980s. Journal of Labor Economics, 15, 1–42.

    Article  Google Scholar 

  • Blau, F., & Kahn, L. (2007). Changes in the labor supply behavior of married women: 1980–2000. Journal of Labor Economics, 25, 393–438.

    Article  Google Scholar 

  • Blinder, A. (1973). Wage discrimination: reduced form and structural estimates. Journal of Human Resources, 8, 436–455.

    Article  Google Scholar 

  • Devereux, P. (2004). Changes in relative wages and family labor supply. Journal of Human Resources, 39, 696–722.

    Article  Google Scholar 

  • Fernandez, R., & Fogli, A. (2009). Culture: An empirical investigation of beliefs, work, and fertility. American Economic Journal: Macro Economics, 1, 146–177.

    Google Scholar 

  • Fortin, N. M. (2015). Gender role attitudes and women’s labor market participation: Opting-out, AIDS, and the persistent appeal of housewifery. Annals of Economics and Statistics, 117–118, 379–401.

    Article  Google Scholar 

  • Fortin, N. M., & Lemieux, T. (1998). Rank regressions, wage distributions, and the gender gap. Journal of Human Resources, 33, 610–643.

    Article  Google Scholar 

  • Goldin, C. (2006). The quiet revolution that transformed women’s employment, education, and family. American Economic Review, 96 (2), 1–21.

  • Goldin, C., & Katz, L. (2011). Putting the “co” in education: Timing, reasons, and consequences of college coeducation from 1835–present. Journal of Human Capital, 5, 377–417.

    Article  Google Scholar 

  • Greenwood, J., Guner, N., Kocharkov, G., & Santos, C. (2014). Marry your like: Assortative mating and income inequality. American Economic Review (Papers and Proceedings), 104, 348–353.

    Article  Google Scholar 

  • Gronau, R. (1974). Wage comparisons—A selectivity bias. Journal of Political Economy, 82, 1119–1143.

    Article  Google Scholar 

  • Heckman, J. (1974). Shadow prices, market wages, and labor supply. Econometrica, 42, 679–694.

    Article  Google Scholar 

  • Heckman, J. (1979). Sample selection bias as a specification error. Econometrica, 47, 153–161.

    Article  Google Scholar 

  • Jones, L. E., Manuelli, R. E., & McGrattan, E. R. (2015). Why are married women working so much? Journal of Demographic Economics, 81, 75–114.

    Article  Google Scholar 

  • Juhn, C., Murphy, K., & Pierce, B. (1993). Wage inequality and the rise in returns to skill. Journal of Political Economy, 101, 410–442.

    Article  Google Scholar 

  • Karoly, L. (1993). The trend in inequality among families, individuals, and workers in the United States: A Twenty-five Year Perspective. In S. Danziger & P. Gottschalk (Eds.) Uneven tides (pp. 19–97). New York, NY: Russell Sage.

    Google Scholar 

  • Katz, L., & Autor, D. (1999). Changes in the wage structure and earnings inequality. In O. Ashenfelter & D. Card (Eds.) Handbook of labor economics (Vol. 3A, pp. 1463–1555) . Elsevier.

  • Mroz, T. (1987). The sensitivity of an empirical model of married women’s hours of work to economic and statistical assumptions. Econometrica, 55, 765–799.

    Article  Google Scholar 

  • Mulligan, C., & Rubinstein, Y. (2008). Selection, investment, and women’s relative wages over time. Quarterly Journal of Economics, 123, 1061–1110.

    Article  Google Scholar 

  • Oaxaca, R. (1973). Male-female wage differentials in urban labor markets. International Economic Review, 14, 693–709.

    Article  Google Scholar 

  • Oaxaca, R., & Ransom, M. (1999). Identification in detailed wage decompositions. Review of Economics and Statistics, 81, 154–157.

    Article  Google Scholar 

  • Oaxaca, R., & Regan, T (2009). Work experience as a source of specification error in earnings models: Implications for gender wage decompositions. Journal of Population Economics, 22, 463–499.

    Article  Google Scholar 

  • O’Neill, J., & Polachek, S. (1993). Why the gender gap in wages narrowed in the 1980s. Journal of Labor Economics, 11, 205–228.

    Article  Google Scholar 

  • Roy, A. D. (1951). Some thoughts on the distribution of earnings. Oxford Economic Papers, 3, 135–146.

    Google Scholar 

  • Schwiebert, J. (2013). Revisiting the composition of the female workforce—A Heckman selection model with endogeneity (Working paper). Hannover, Germany: Institute of Labor Economics, Leibniz University Hannover.

    Google Scholar 

  • Stevenson, B., & Wolfers, J. (2007). Marriage and divorce: Changes and their driving forces. Journal of Economic Perspectives, 21(2), 27–52.

Download references

Acknowledgments

We are grateful to David Blau, Donna Gilleskie, Shelly Lundberg, Sophocles Mavroeidis, and Sang Soo Park for useful discussions of this paper. We are also grateful to Yona Rubinstein and Casey Mulligan for sharing their Stata codes at the early stages of this project.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Oksana Leukhina.

Appendix: Derivation of Partial Derivatives in Eqs. (10) and (11)

Appendix: Derivation of Partial Derivatives in Eqs. (10) and (11)

Recall that σ v and ρ u v can be expressed in terms of parameters underlying the GHR model, \({\upsigma }_{v}=\left [ {\upsigma }_{u}^{2}+{\upsigma }_{\upvarepsilon }^{2}-2{\uprho }_{u\upvarepsilon } {\upsigma }_{u}{\upsigma }_{\upvarepsilon } \right ]^{1/2}\)and \({\uprho }_{uv}=\frac {{\upsigma }_{u}-{\uprho }_{u\upvarepsilon } {\upsigma }_{\upvarepsilon } }{{\upsigma }_{v}}\).

First, consider the effect of σ u on σ v :

$$\begin{array}{@{}rcl@{}} \frac{\partial {\upsigma}_{v}}{\partial {\upsigma}_{u}} &=&\frac{\partial} { \partial {\upsigma}_{u}}\left[ {{\upsigma}_{u}^{2}}+{\upsigma}_{\upvarepsilon}^{2}-2{\uprho}_{u\upvarepsilon} {\upsigma}_{u}{\upsigma}_{\upvarepsilon} \right]^{1/2}=\frac{1}{2} \left[ {{\upsigma}_{u}^{2}}+{\upsigma}_{\upvarepsilon}^{2}-2{\uprho}_{u\upvarepsilon} {\upsigma}_{u}{\upsigma}_{\upvarepsilon} \right]^{-\frac{1}{2}}\left( 2{\upsigma}_{u}-2{\uprho}_{u\upvarepsilon} {\upsigma}_{\upvarepsilon} \right)\\ &=&\frac{{\upsigma}_{u}-{\uprho}_{u\upvarepsilon} {\upsigma}_{\upvarepsilon} }{\left[ {{\upsigma}_{u}^{2}}+{\upsigma}_{\upvarepsilon} ^{2}-2{\uprho}_{u\upvarepsilon} {\upsigma}_{u}{\upsigma}_{\upvarepsilon} \right]^{\frac{1}{2}}}=\frac{{\upsigma}_{u}-{\uprho}_{u\upvarepsilon} {\upsigma}_{\upvarepsilon} }{{\upsigma}_{v}}={\uprho}_{uv}. \end{array} $$
(14)

Using this in the derivation that follows obtains the first desired result:

$$\begin{array}{@{}rcl@{}} \frac{\partial} {\partial {\upsigma}_{u}}\Pr \left( L^{\ast} >0|Z,I_{h}\right) &=&\frac{\partial} {\partial {\upsigma}_{u}}\Pr \left( \frac{v}{{\upsigma}_{v}} \leq \frac{Z\upgamma -\upalpha I_{h}}{{\upsigma}_{v}}\right) \\ &=&\frac{\partial} {\partial {\upsigma}_{u}}{\Phi} \left( \frac{Z\upgamma -\upalpha I_{h}}{{\upsigma}_{v}}\right) \\ &=&\upphi \left( \frac{Z\upgamma -\upalpha I_{h}}{{\upsigma}_{v}}\right) \left[ -\left( \frac{Z\upgamma -\upalpha I_{h}}{{{\upsigma}_{v}^{2}}}\right) \right] {\uprho}_{uv}. \end{array} $$

Next, consider the effects of σ u on ρ u v , ρ u v σ u , and λ.

$$\begin{array}{@{}rcl@{}} \frac{\partial {\uprho}_{uv}}{\partial {\upsigma}_{u}} &=&\frac{\partial} { \partial {\upsigma}_{u}}\left( \frac{{\upsigma}_{u}-{\uprho}_{u\upvarepsilon} {\upsigma}_{\upvarepsilon} }{{\upsigma}_{v}}\right) =\frac{{\upsigma}_{v}-\left( {\upsigma}_{u}-{\uprho}_{u\upvarepsilon} {\upsigma}_{\upvarepsilon} \right) \frac{\partial {\upsigma}_{v}}{\partial {\upsigma}_{u}}}{{{\upsigma}_{v}^{2}}} \\ &=&\frac{1-\left( \frac{{\upsigma}_{u}-{\uprho}_{u\upvarepsilon} {\upsigma}_{\upvarepsilon} }{{\upsigma}_{v}}\right) {\uprho}_{uv}}{{\upsigma}_{v}}=\frac{1-{\uprho}_{uv}^{2}}{{\upsigma}_{v}}>0. \end{array} $$
$$\frac{\partial \left( {\uprho}_{uv}{\upsigma}_{u}\right)} {\partial {\upsigma}_{u}}= \frac{\partial {\uprho}_{uv}}{\partial {\upsigma}_{u}}{\upsigma}_{u}+{\uprho}_{uv}=\left( \frac{1-{\uprho}_{uv}^{2}}{{\upsigma}_{v}}\right) {\upsigma}_{u}+{\uprho}_{uv}. $$

Finally,

$$\begin{array}{@{}rcl@{}} \frac{\partial \uplambda} {\partial {\upsigma}_{u}} &=&{\uplambda}^{\prime} \left( \frac{Z\upgamma -\upalpha I_{h}}{{\upsigma}_{v}}\right) \left[ \frac{\partial} { \partial {\upsigma}_{u}}\left( \frac{Z\upgamma -\upalpha I_{h}}{{\upsigma}_{v}}\right) \right] \\ &=&{\uplambda}^{\prime} \left( \frac{Z\upgamma -\upalpha I_{h}}{{\upsigma}_{v}}\right) \left[ -\left( \frac{Z\upgamma -\upalpha I_{h}}{{{\upsigma}_{v}^{2}}}\right) \frac{ \partial {\upsigma}_{v}}{\partial {\upsigma}_{u}}\right] \\ &=&-{\uplambda}^{\prime} \left( \frac{Z\upgamma -\upalpha I_{h}}{{\upsigma}_{v}} \right) \left[ \left( \frac{Z\upgamma -\upalpha I_{h}}{{{\upsigma}_{v}^{2}}}\right) {\uprho}_{uv}\right], \end{array} $$

where we used the definition \(\uplambda \left (\frac {Z\upgamma -\upalpha I_{h}}{{\upsigma }_{v}}\right ) =\frac {\upphi \left (\frac {Z\upgamma -\upalpha I_{h}}{ {\upsigma }_{v}}\right )} {\Phi \left (\frac {Z\upgamma -\upalpha I_{h}}{{\upsigma }_{v}} \right )} \) and the above result shown in Eq. (14).

Differentiating the bias B = ρ u v σ u λ gives the second desired result:

$$\begin{array}{@{}rcl@{}} \frac{\partial B}{\partial {\upsigma}_{u}} &=&\frac{\partial \left( {\uprho}_{uv}{\upsigma}_{u}\right)} {\partial {\upsigma}_{u}}\uplambda +\frac{\partial \uplambda} {\partial {\upsigma}_{u}}{\uprho}_{uv}{\upsigma}_{u} \\ &=&\left[ \left( \frac{1-{\uprho}_{uv}^{2}}{{\upsigma}_{v}}\right) {\upsigma}_{u}+{\uprho}_{uv}\right] \uplambda +\left[-{\uplambda}^{\prime} \left( \frac{ Z\upgamma -\upalpha I_{h}}{{\upsigma}_{v}}\right) \right] \left[ \left( \frac{ Z\upgamma -\upalpha I_{h}}{{{\upsigma}_{v}^{2}}}\right) {\uprho}_{uv}\right] {\uprho}_{uv}{\upsigma}_{u}. \end{array} $$
Table 3 Summary statistics
Table 4 Coefficients and standard errors in the estimation of the benchmark and MR models
Table 5 Estimation of the simple benchmark and simple MR models for the 1970s

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Bar, M., Kim, S. & Leukhina, O. Gender Wage Gap Accounting: The Role of Selection Bias. Demography 52, 1729–1750 (2015). https://doi.org/10.1007/s13524-015-0418-x

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s13524-015-0418-x

Keywords

Navigation