Skip to main content

Advertisement

Log in

Inequality, participation, and polarization

  • Original Paper
  • Published:
Social Choice and Welfare Aims and scope Submit manuscript

Abstract

The upward co-movement of income inequality and partisan polarization in the U.S. is typically attributed to intensified class conflict or a political wealth bias. This paper formalizes a theory of polarization where changes in the income distribution do not affect citizens’ policy preferences, but instead change their patterns of political participation: aggregate voting decreases relative to aggregate giving, reducing the electoral penalty for partisan policies. By endogenizing party composition the model captures both the ideological and compositional dimensions of polarization, and addresses less-discussed polarization features, such as intra-party homogeneity and the increase in safe seats. According to the model, observed polarization patterns imply that parties have diverged more than candidates, and that the gap between party and candidate divergence has increased with income inequality.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1
Fig. 2
Fig. 3
Fig. 4

Similar content being viewed by others

Notes

  1. Ideological DW-NOMINATE scores are derived from scaling methods that use all roll call votes in the U.S. Congress and assume a spatial model of voting.

  2. However, see Van Weelden (2013) for a model where divergent platforms can be welfare enhancing to a representative voter, and Van Weelden (2015) for conditions under which platform divergence can be socially optimal for a heterogeneous electorate.

  3. The correlation is also present in cross-section: across senators (Garand 2010) and across state legislatures (Shor and McCarty 2011). Time series correlations using longer, pre-1947, series of income inequality, e.g., the share of income going to the top 1% earners (Piketty and Saez 2003), or the inverted Pareto-Lorenz coefficient (Duca and Saving 2016), yield similar magnitudes.

  4. In this paper I focus on individual contributions. These comprise the majority of all political donations in U.S. federal elections. An interesting extension would be to introduce interest-group contributions.

  5. Using a more reduced-form political process, Campante (2011) shows that income inequality may increase the influence of individual donors on the election outcome, reducing the political power of the median voter and resulting in less redistributive positions.

  6. Großer and Palfrey (2014) provide a different rationale for the “disconnect” between partisan polarization and citizen polarization, based on candidate risk-aversion, using a citizen-candidate model with incomplete information.

  7. To my knowledge, this is the first paper that endogenizes internal party composition in a Calvert-Wittman framework. In this respect the paper is related to a small literature that has explicitly modeled the link between parties and candidates, e.g., Austen-Smith (1984), Snyder and Ting (2002). In both papers parties are purely office-motivated, rather than policy-motivated as here. Platform divergence is driven by either electoral or signaling considerations.

  8. Formally, \(\lambda _{1}<\lambda _{2}\) if and only if \(Y_{\lambda _{1}}(y)\) second-order stochastically dominates \(Y_{\lambda _{2}}(y),\) i.e., \(\int _{y< {\hat{y}}}Y_{\lambda _{1}}(y)dy\le \int _{y<{\hat{y}}}Y_{\lambda _{2}}(y)dy,\) for all \({\hat{y}}\) in the income support.

  9. In the context of U.S. politics, the candidate labels LR could be interpreted as liberal and conservative, and the party labels \({\mathcal {L}}, {\mathcal {R}}\) as Democrat and Republican.

  10. Equivalently, one could assume that the candidates \(L_{s},R_{s}\) have ideal points located at \(s-\theta \) and \(s+\theta ,\) respectively, for some positive \(\theta \).

  11. As will be shown below, the logistic functional form helps sidestep thorny equilibrium existence issues common in the Calvert-Wittman framework; see also Feddersen and Gul (2015).

  12. Voter ideology is self-reported on a discrete 1–7 liberal-conservative scale. Donor ideology is estimated by Bonica (2014) on a continuous \(\left[ -2,2\right] \) interval. The statistics presented use both variables rescaled to the common interval \(\left[ -1,1\right] \).

  13. One might argue that some large donors’ preferences and outlays are easier to predict than many voters’ preferences. However, large donors give in multiple races, thus in a given district which candidate will attract more donations may still be difficult to predict since it depends on what happens in all other races.

  14. There is also a well-documented positive association between income and individual voting. Introducing this monotonicity here is not necessary as what matters in the model is aggregate, rather than individual, voting, and aggregate (average) income is fixed to focus on inequality. Individual voting may also increase in ideological extremism, as citizens with stronger preferences are more politically engaged (Jacobson 2012). Embedding this empirical feature in the model would further increase candidates’ incentives for divergence.

  15. Using data from the 2004 and 2008 elections, Spenkuch and Toniatti (2016) find that differences in advertising between the two parties across different media markets lead to differences in the parties’ vote shares. Advertising accounts for about half of campaign spending; other campaign activities such as canvassing and direct mail have also been shown to affect turnout (Gerber and Green 2000).

  16. The tie-breaking assumption rules out situations where an L candidate affiliated with party \({\mathcal {R}}\) runs against an R candidate affiliated with party \({\mathcal {L}}\), and can be relaxed without affecting the main results. The affiliation rule assumed here is agnostic about candidates’ motivation in adopting a party label. Its role is simply to create a correlation between a party and its candidates’ policy positions.

  17. Donations to parties are used for party-building activities that support a party’s legislative agenda, such as organizing party conventions and promoting party discipline and collective action. Some donations to parties may be transferred to candidates, but for simplicity the model abstracts from this possibility. For models that endogenize the majority party’s influence on policymaking in a legislative body see, e.g., Patty (2008) and Diermeier and Vlaicu (2011).

  18. Had one also assumed voting to be strictly increasing in income, then one could claim that aggregate voting at the bottom of the income distribution drops by more than it increases at the top. An immediate corollary of Proposition 1 is relevant to the “puzzle of political participation” first stated by Brody (1978): if income increases individual voting, why hasn’t aggregate voting in U.S. elections increased in the post-1960 period when real per-capita income had an upward trend? While higher individual income may boost individual voting, and higher average income aggregate voting, the attendant higher income inequality can act as a countervailing force to keep aggregate voting depressed.

  19. If candidates were just office-motivated, i.e., they didn’t care about policy outcomes, only about winning the seat, in this setting neither would find it profitable to move away from the center since the electoral penalty would not be offset by a policy benefit.

  20. Thus assuming strict concavity of individual voting in income is unnecessarily strong. All that is needed is that it is less convex than individual giving.

  21. See Callander and Wilson (2007) for an alternative model where candidate divergence is also negatively correlated with aggregate voting.

  22. There is broad agreement with the notion that the ideological divide between the two major U.S. parties has been widening for several decades. Consensus breaks down, however, over whether there has been a commensurate ideological divide in the American public. Abramowitz (2010): “Polarization in Washington reflects polarization within the public, especially the politically engaged segment of the public.” (p. x). Fiorina and Abrams (2008): “It seems reasonable to conclude that the distribution of ideology in the American public has not changed for more than three decades.” (p. 571).

  23. This has not always been driven by incumbency advantage, as illustrated by the high-profile defeat in June 2014 of House Majority Leader Eric Cantor (R-VA) by challenger David Brat, a more conservative Republican.

  24. See Nokken and Poole (2004) for a comprehensive analysis of party defections in the U.S. Congress.

  25. Diermeier et al. (2017) study how inter-party heterogeneity and intra-party homogeneity affect legislative policy through the adoption of partisan procedural rules.

  26. A caveat of this argument is that a partisan gerrymander may not always result in increased district heterogeneity, as the majority party creates many moderate districts to dilute the power of the minority party. The gerrymandering argument is limited, however, also because it cannot account for the polarization seen in the U.S. Senate, where constituencies (state borders) are fixed.

References

  • Abramowitz AI (2010) The disappearing center: engaged citizens, polarization, and American democracy. Yale University Press, New Haven

    Google Scholar 

  • Abramowitz AI, Alexander B, Gunning M (2006) Incumbency, redistricting, and the decline of competition in U.S. house elections. J Polit 68(1):75–88

    Article  Google Scholar 

  • Ansolabehere S, de Figueiredo JM, Snyder JM Jr (2003) Why is there so little money in U.S. politics? J Econ Perspect 17(1):105–130

    Article  Google Scholar 

  • Ansolabehere S, Hersh E (2012) Validation: what big data reveal about survey misreporting and the real electorate. Polit Anal 20(4):437–459

    Article  Google Scholar 

  • Austen-Smith D (1984) Two-party competition with many constituencies. Math Soc Sci 7(2):177–198

    Article  Google Scholar 

  • Azzimonti M (2015) Partisan conflict and private investment. NBER Working Paper 21273

  • Bartels LM (2008) Unequal democracy: the political economy of the new gilded age. Princeton University Press, Princeton

    Google Scholar 

  • Bishop B (2008) The big sort: why the clustering of like-minded America is tearing us apart. Houghton Mifflin Harcourt, USA

  • Bonica A (2014) Mapping the ideological marketplace. Am J Polit Sci 58:367–386

    Article  Google Scholar 

  • Bonica A, McCarty N, Poole KT, Rosenthal H (2013) Why hasn’t democracy slowed rising inequality? J Econ Perspect 27(3):103–124

    Article  Google Scholar 

  • Brody RA (1978) The puzzle of political participation in America. In: King A (ed) The new American political system. AEI, Washington, DC

    Google Scholar 

  • Callander S (2005) Electoral competition in heterogenous districts. J Polit Econ 113(5):1116–1145

    Article  Google Scholar 

  • Callander S, Wilson C (2007) Turnout, polarization, and Duverger’s law. J Polit 69(4):1047–1056

    Article  Google Scholar 

  • Calvert R (1985) Robustness of the multidimensional voting model: candidate motivations, uncertainty, and convergence. Am J Polit Sci 29(1):69–95

    Article  Google Scholar 

  • Campante F (2011) Redistribution in a model of voting and campaign contributions. J Pub Econ 95(7–8):646–656

    Article  Google Scholar 

  • Diermeier D, Vlaicu R (2011) Parties, coalitions, and the internal organization of legislatures. Am Polit Sci Rev 105(2):359–380

    Article  Google Scholar 

  • Diermeier D, Prato C, Vlaicu R (2017) Self-enforcing partisan procedures. Working paper

  • Duca JV, Saving J (2016) Income inequality and political polarization: time series evidence over nine decades. Rev Income Wealth 62(3):445–466

    Article  Google Scholar 

  • Feddersen T, Gul F (2015) Polarization and income inequality: a dynamic model of unequal democracy. Working paper

  • Fiorina MP, Abrams SJ (2008) Political polarization in the American public. Annu Rev Polit Sci 11:563–588

    Article  Google Scholar 

  • Garand J (2010) Income inequality, party polarization, and roll-call voting in the U.S. senate. J Polit 72(4):1109–1128

    Article  Google Scholar 

  • Gerber AS, Green DP (2000) The effects of canvassing, telephone calls, and direct mail on voter turnout: a field experiment. Am Polit Sci Rev 94(3):653–663

    Article  Google Scholar 

  • Großer J, Palfrey T (2014) Candidate entry and political polarization: an antimedian voter theorem. Am J Polit Sci 58(1):127–143

    Article  Google Scholar 

  • Gerring J (1998) Party ideologies in America, 1828–1996. Cambridge University Press, Cambridge

    Book  Google Scholar 

  • Hetherington MJ (2001) Resurgent mass partisanship: the role of elite polarization. Am Polit Sci Rev 95(3):619–631

    Article  Google Scholar 

  • Herrera H, Levine DK, Martinelli C (2008) Policy platforms, campaign spending, and voter participation. J Public Econ 92(3–4):501–513

    Article  Google Scholar 

  • Jacobson GC (2012) The electoral origins of polarized politics: evidence from the 2010 CCES. Am Behav Sci 56(12):1612–1630

    Article  Google Scholar 

  • Kamada Y, Kojima F (2014) Voter preferences, polarization, and electoral politics. Am Econ J Microecon 6(4):203–236

    Article  Google Scholar 

  • Krasa S, Polborn M (2010) Competition between specialized candidates. Am Polit Sci Rev 104(4):745–765

    Article  Google Scholar 

  • Krasa S, Polborn M (2014a) Policy divergence and voter polarization in a structural model of elections. J Law Econ 57(1):31–76

    Article  Google Scholar 

  • Krasa S, Polborn M (2014b) Political competition in legislative elections. Working paper

  • Levendusky M (2009) The partisan sort: how liberals became democrats and conservatives became republicans. University of Chicago Press, Chicago

    Book  Google Scholar 

  • McCarty N, Poole KT, Rosenthal H (2006) Polarized America: the dance of ideology and unequal riches. MIT Press, Cambridge

    Google Scholar 

  • Nokken TP, Poole KT (2004) Congressional party defection in American history. Legislative Stud Q 29:545–568

    Article  Google Scholar 

  • Palfrey T (1984) Spatial equilibrium with entry. Rev Econ Stud 51(1):139–56

    Article  Google Scholar 

  • Patty JW (2008) Equilibrium party government. Am J Polit Sci 52(3):636–655

    Article  Google Scholar 

  • Patty JW, Penn EM (2017) Uncertainty, polarization, and proposal incentives under quadratic voting. Pub Choice 172:109–124

    Article  Google Scholar 

  • Piketty T, Saez E (2003) Income inequality in the United States, 1913–1998. Q J Econ 118(1):1–39

    Article  Google Scholar 

  • Poole KT, Rosenthal H (1984) The polarization of American politics. J Polit 46(4):1061–1079

    Article  Google Scholar 

  • Poole KT, Rosenthal H (1997) Congress: a political-economic history of roll call voting. Oxford University Press, New York

    Google Scholar 

  • Shor B, McCarty N (2011) The ideological mapping of American legislatures. Am Polit Sci Rev 105(3):530–551

    Article  Google Scholar 

  • Snyder JM, Ting MM (2002) An informational rationale for political parties. Am J Polit Sci 46(1):90–110

    Article  Google Scholar 

  • Spenkuch JL, Toniatti D (2016) Political advertising and election outcomes. Working paper

  • Van Weelden R (2013) Candidates, credibility, and re-election incentives. Rev Econ Stud 80(4):1622–1651

    Article  Google Scholar 

  • Van Weelden R (2015) The welfare implications of electoral polarization. Soc Choice Welf 45(4):653–686

    Article  Google Scholar 

  • Wittman D (1983) Candidate motivation: a synthesis of alternative theories. Am Polit Sci Rev 77(1):142–157

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Razvan Vlaicu.

Additional information

I am grateful to Theodore Bergstrom, Dan Bernhardt, Georgy Egorov, Tim Feddersen, Laurel Harbridge, Ethan Kaplan, Phil Keefer, Nolan McCarty, Daven Petitte, Mattias Polborn, Richard Van Weelden, Emanuel Vespa, and seminar audiences at the IDB Research Department, Northwestern University, UC Santa Barbara, University of Chicago, and the University of Maryland. Gregory Haugan provided excellent research assistance. The model was developed while I was a visiting faculty at Northwestern University; I am grateful to the Kellogg School of Management for its hospitality. The findings and interpretations in this paper are those of the author and do not necessarily reflect the views of the Inter-American Development Bank or the governments it represents.

Appendix

Appendix

Proof of Proposition 1

Consider two income distributions \(Y_{\lambda _{1}}(y)\) and \(Y_{\lambda _{2}}(y)\) with inequality levels \(\lambda _{1},\lambda _{2}\) and equal means: \(\int _{0}^{\infty }ydY_{\lambda _{1}}(y)=\int _{0}^{\infty }ydY_{\lambda _{2}}(y)=1\). It is known that under these conditions, \(\lambda _{1}<\lambda _{2}\) if and only if \(Y_{\lambda _{1}}(y)\) second-order stochastically dominates \(Y_{\lambda _{2}}(y).\) Then, by SOSD, because \(\nu \left( y\right) \) is strictly concave in income y,  it follows that \( v(\lambda _{1})=\int _{0}^{\infty }\nu \left( y\right) dY_{\lambda _{1}}\left( y\right) >\int _{0}^{\infty }\nu \left( y\right) dY_{\lambda _{2}}\left( y\right) =v(\lambda _{2}),\) implying that \(\frac{dv(\lambda )}{ d\lambda }<0.\) Because \(\gamma \left( y\right) \) is strictly convex in income y,  we have \(g(\lambda _{1})=\int _{0}^{\infty }\gamma \left( y\right) dY_{\lambda _{1}}\left( y\right) <\int _{0}^{\infty }\gamma \left( y\right) dY_{\lambda _{2}}\left( y\right) =g(\lambda _{2}),\) implying that \( \frac{dg(\lambda )}{d\lambda }>0.\) \(\square \)

Proof of Proposition 2

Solve for an L candidate’s winning probability:

$$\begin{aligned} w_{L_{s}}\left( {\overline{x}}_{s},\lambda \right)= & {} {{\mathbb {P}}}\left\{ \frac{ \left[ Z_{s}\left( {\overline{x}}_{s}\right) \right] ^{\frac{v\left( \lambda \right) }{v\left( \lambda \right) +g\left( \lambda \right) }}\left[ D_{s}\left( {\overline{x}}_{s}|{\tilde{\psi }}_{s}\right) \right] ^{\frac{g\left( \lambda \right) }{v\left( \lambda \right) +g\left( \lambda \right) }}}{\left[ 1-Z_{s}\left( {\overline{x}}_{s}\right) \right] ^{\frac{v\left( \lambda \right) }{v\left( \lambda \right) +g\left( \lambda \right) }}\left[ 1-D_{s}\left( {\overline{x}}_{s}|{\tilde{\psi }}_{s}\right) \right] ^{\frac{ g\left( \lambda \right) }{v\left( \lambda \right) +g\left( \lambda \right) }} }\ge 1\right\} \nonumber \\= & {} {{\mathbb {P}}}\left\{ \frac{\left[ \frac{1}{1+e^{-\frac{{\overline{x}}_{s}-s}{ \sigma _{v}}}}\right] ^{\frac{v\left( \lambda \right) }{v\left( \lambda \right) +g\left( \lambda \right) }}\left[ \frac{1}{1+e^{-\frac{{\overline{x}} _{s}-s-{\tilde{\psi }}_{s}}{\sigma _{g}}}}\right] ^{\frac{g\left( \lambda \right) }{v\left( \lambda \right) +g\left( \lambda \right) }}}{\left[ 1- \frac{1}{1+e^{-\frac{{\overline{x}}_{s}-s}{\sigma _{v}}}}\right] ^{\frac{ v\left( \lambda \right) }{v\left( \lambda \right) +g\left( \lambda \right) }} \left[ 1-\frac{1}{1+e^{-\frac{{\overline{x}}_{s}-s-{\tilde{\psi }}_{s}}{\sigma _{g}}}}\right] ^{\frac{g\left( \lambda \right) }{v\left( \lambda \right) +g\left( \lambda \right) }}}\ge 1\right\} \nonumber \\= & {} {{\mathbb {P}}}\left\{ e^{\frac{{\overline{x}}_{s}-s}{\sigma _{v}}\frac{v\left( \lambda \right) }{v\left( \lambda \right) +g\left( \lambda \right) }}e^{ \frac{{\overline{x}}_{s}-s-{\tilde{\psi }}_{s}}{\sigma _{g}}\frac{g\left( \lambda \right) }{v\left( \lambda \right) +g\left( \lambda \right) }}\ge 1\right\} \end{aligned}$$
(18)
$$\begin{aligned}= & {} {{\mathbb {P}}}\left\{ \left[ \frac{v\left( \lambda \right) }{\sigma _{v}}+ \frac{g\left( \lambda \right) }{\sigma _{g}}\right] \left( {\overline{x}} _{s}-s\right) \ge {\tilde{\psi }}_{s}\frac{g\left( \lambda \right) }{\sigma _{g}}\right\} \nonumber \\= & {} {{\mathbb {P}}}\left\{ {\tilde{\psi }}_{s}\le \left[ 1+\frac{v\left( \lambda \right) /\sigma _{v}}{g\left( \lambda \right) /\sigma _{g}}\right] \left( {\overline{x}}_{s}-s\right) \right\} \nonumber \\= & {} \frac{1}{2}+\frac{1}{\psi }\left[ 1+\frac{v\left( \lambda \right) /\sigma _{v}}{g\left( \lambda \right) /\sigma _{g}}\right] \left( \frac{ x_{L_{s}}+x_{R_{s}}}{2}-s\right) \end{aligned}$$
(19)

and \(w_{R_{s}}\left( {\overline{x}}_{s},\lambda \right) =1-w_{L_{s}}\left( {\overline{x}}_{s},\lambda \right) .\)

The necessary conditions for a candidate equilibrium are, using \( w_{L_{s}}+w_{R_{s}}=1\):

$$\begin{aligned} \frac{\partial }{\partial x_{L_{s}}}u_{L_{s}}= & {} -w_{L_{s}}+\left( x_{R_{s}}-x_{L_{s}}\right) \frac{\partial }{\partial x_{L_{s}}}w_{L_{s}}=0 \end{aligned}$$
(20)
$$\begin{aligned} \frac{\partial }{\partial x_{R_{s}}}u_{R_{s}}= & {} \left( 1-w_{L_{s}}\right) -\left( x_{R_{s}}-x_{L_{s}}\right) \frac{\partial }{\partial x_{R_{s}}} w_{L_{s}}=0 \end{aligned}$$
(21)

and adding these two first-order conditions, and observing that \(\frac{ \partial }{\partial x_{L_{s}}}w_{L_{s}}=\frac{\partial }{\partial x_{R_{s}}} w_{L_{s}}=\frac{1}{2\psi }\left[ 1+\frac{v\left( \lambda \right) /\sigma _{v} }{g\left( \lambda \right) /\sigma _{g}}\right] \) from Eq. (19), yields \(w_{L_{s}}=w_{R_{s}}=\frac{1}{2}.\) Using this in the first-order conditions, one can solve for the candidate equilibrium strategies\(\left( x_{L_{s}},x_{R_{s}}\right) =\left( s-\frac{\Delta x}{2},s+ \frac{\Delta x}{2}\right) \) where candidate divergence is:

$$\begin{aligned} \Delta x\equiv x_{R_{s}}-x_{L_{s}}=\frac{w_{L_{s}}}{\frac{\partial }{ \partial x_{L_{s}}}w_{L_{s}}}=\frac{\psi }{1+\frac{v\left( \lambda \right) /\sigma _{v}}{g\left( \lambda \right) /\sigma _{g}}}. \end{aligned}$$
(22)

To establish that there are no global profitable deviations the following properties suffice: (i) \(u_{L_{s}}(x_{L_{s}},x_{R_{s}})\) is strictly concave in \(x_{L_{s}}\) on \((-\infty ,x_{R_{s}}],\) which implies that the \(x_{L_{s}}\) that solves Eqs. (20)–(21) is a global maximizer of \( u_{L_{s}}(x_{L_{s}},x_{R_{s}})\) on \((-\infty ,x_{R_{s}}]\), and (ii) a candidate \(L_{s}\) deviation from the equilibrium \(x_{L_{s}}\) to some\( x_{L_{s}}^{\prime }>x_{R_{s}}\) strictly lowers \(L_{s}\)’s payoff. The argument for candidate \(R_{s}\) is analogous. For part (i), note that \(\frac{ \partial ^{2}}{\partial x_{L_{s}}^{2}}u_{L_{s}}(x_{L_{s}},x_{R_{s}})=-2\frac{ \partial }{\partial x_{L_{s}}}w_{L_{s}}+\left( x_{R_{s}}-x_{L_{s}}\right) \frac{\partial ^{2}}{\partial x_{L_{s}}^{2}}w_{L_{s}}\) and since by Eq. (19) we have that \(w_{L_{s}}\) is linear and strictly increasing in \(x_{L_{s}}\), it follows that \(\frac{\partial ^{2}}{\partial x_{L_{s}}^{2}} u_{L_{s}}(x_{L_{s}},x_{R_{s}})=-2\frac{\partial }{\partial x_{L_{s}}} w_{L_{s}}<0.\) For part (ii), note that the policy lottery induced by the deviation is first-order stochastically dominated for \(L_{s}\) since \( u_{L_{s}}(x_{L_{s}}^{\prime },x_{R_{s}})<u_{L_{s}}(x_{R_{s}},x_{R_{s}})=-x_{R_{s}}\le -\frac{ x_{L_{s}}+x_{R_{s}}}{2}=u_{L_{s}}(x_{L_{s}},x_{R_{s}}).\) \(\square \)

Proof of Proposition 3

A candidate’s marginal electoral penalty for moving away from the district mean ideology s, namely \(\frac{1}{2\psi }\left[ 1+\frac{v\left( \lambda \right) /\sigma _{v}}{g\left( \lambda \right) /\sigma _{g}}\right] \) is lower the lower the ratio \(v\left( \lambda \right) /g\left( \lambda \right) , \) namely the more important is giving relative to voting. Thus, a candidate policy-motivated deviation from the district mean ideology is more profitable the lower the ratio \(v\left( \lambda \right) /g\left( \lambda \right) \). Taking the difference between the two first-order conditions (20)–(21) and solving for \(x_{R_{s}}-x_{L_{s}}\) yields \(\Delta x= \frac{1}{\frac{\partial }{\partial x_{L_{s}}}w_{L_{s}}+\frac{\partial }{ \partial x_{R_{s}}}w_{L_{s}}}=\frac{\psi }{1+\frac{v\left( \lambda \right) /\sigma _{v}}{g\left( \lambda \right) /\sigma _{g}}}.\) Then, using \( v^{\prime }(\lambda )<0\) and \(g^{\prime }(\lambda )>0\) from Proposition 1 gives that \(\frac{\partial }{\partial \lambda }\Delta x>0.\) The comparative statics with respect to \(\psi ,\sigma _{v},\sigma _{g}\) follow immediately from the expression for \(\Delta x\). \(\square \)

Proof of Proposition 4

Parties anticipate that candidates will play the strategies characterized in Proposition 2. Given candidate strategies, parties calculate their seat shares and expected donations, which together will determine their policy weights. To show equilibrium existence and uniqueness, the proof consists of the following steps. (i) Necessary conditions for a party equilibrium in the case \(X_{{\mathcal {L}}}\le -\frac{\Delta x}{2}\le \frac{\Delta x}{2}\le X_{{\mathcal {R}}}\). (ii) Ruling out global deviations. (iii) Ruling out a party equilibrium with\(-\frac{\Delta x}{2}<X_{{\mathcal {L}}}\le X_{\mathcal {R }}<\frac{\Delta x}{2}.\)

  1. (i)

    Consider an equilibrium with the property \(X_{{\mathcal {L}}}\le -\frac{ \Delta x}{2}\le \frac{\Delta x}{2}\le X_{{\mathcal {R}}}\). To solve for the parties’ policy weights, it is necessary to derive the parties’ seat shares. See Eq. (12) and Fig. 3.

    $$\begin{aligned} h_{{\mathcal {L}}}= & {} \frac{1}{\mu }\left\{ \frac{1}{2}\left[ \left( X_{ {\mathcal {L}}}\!+\!\frac{\mu }{2}\right) -\left( -\frac{\Delta x}{2}-\frac{\mu }{2} \right) \right] +\frac{1}{2}\left[ \left( X_{{\mathcal {R}}}\!-\!\frac{\mu }{2} \right) \!-\!\left( \frac{\Delta x}{2}-\frac{\mu }{2}\right) \right] \right\} \nonumber \\= & {} \frac{1}{\mu }\left( \frac{\mu }{2}+\frac{X_{{\mathcal {L}}}+X_{{\mathcal {R}}} }{2}\right) \end{aligned}$$
    (23)
    $$\begin{aligned} h_{{\mathcal {R}}}= & {} \frac{1}{\mu }\left\{ \frac{1}{2}\left[ \left( \frac{ \Delta x}{2}+\frac{\mu }{2}\right) -\left( X_{{\mathcal {R}}}\!-\!\frac{\mu }{2} \right) \right] \!+\!\frac{1}{2}\left[ \left( -\frac{\Delta x}{2}+\frac{\mu }{2} \right) -\left( X_{{\mathcal {L}}}+\frac{\mu }{2}\right) \right] \right\} \nonumber \\= & {} \frac{1}{\mu }\left( \frac{\mu }{2}-\frac{X_{{\mathcal {L}}}+X_{{\mathcal {R}}} }{2}\right) \end{aligned}$$
    (24)

    and thus \(h_{{\mathcal {L}}}-h_{{\mathcal {R}}}=\frac{1}{\mu }\left( X_{{\mathcal {L}} }+X_{{\mathcal {R}}}\right) =\frac{2{\bar{X}}}{\mu }.\) Policy weights can then be calculated as follows.

    $$\begin{aligned} W_{{\mathcal {L}}}\left( {\bar{X}},\lambda \right)= & {} {{\mathbb {P}}}\left\{ \frac{ \left[ Z\left( \frac{2{\bar{X}}}{\mu }\right) \right] ^{\frac{1}{1+g\left( \lambda \right) }}\left[ D\left( {\bar{X}}|{\tilde{\Psi }}\right) \right] ^{\frac{ g\left( \lambda \right) }{1+g\left( \lambda \right) }}}{\left[ 1-Z\left( \frac{2{\bar{X}}}{\mu }\right) \right] ^{\frac{1}{1+g\left( \lambda \right) }} \left[ 1-D\left( {\bar{X}}|{\tilde{\Psi }}\right) \right] ^{\frac{g\left( \lambda \right) }{1+g\left( \lambda \right) }}}\ge 1\right\} \end{aligned}$$
    (25)
    $$\begin{aligned}= & {} {{\mathbb {P}}}\left\{ \frac{\left[ \frac{1}{1+e^{-\frac{{\bar{X}}}{\mu }}} \right] ^{\frac{1}{1+g\left( \lambda \right) }}\left[ \frac{1}{1+e^{-\frac{ {\bar{X}}-{\tilde{\Psi }}}{\Sigma _{g}}}}\right] ^{\frac{g\left( \lambda \right) }{1+g\left( \lambda \right) }}}{\left[ 1-\frac{1}{1+e^{-\frac{{\bar{X}}}{\mu }} }\right] ^{\frac{1}{1+g\left( \lambda \right) }}\left[ 1-\frac{1}{1+e^{- \frac{{\bar{X}}-{\tilde{\Psi }}}{\Sigma _{g}}}}\right] ^{\frac{g\left( \lambda \right) }{1+g\left( \lambda \right) }}}\ge 1\right\} \nonumber \\= & {} {{\mathbb {P}}}\left\{ e^{\frac{{\bar{X}}}{\mu }\frac{1}{1+g\left( \lambda \right) }}e^{\frac{{\bar{X}}-{\tilde{\Psi }}}{\Sigma _{g}}\frac{g\left( \lambda \right) }{1+g\left( \lambda \right) }}\ge 1\right\} ={{\mathbb {P}}}\left\{ \frac{{\bar{X}}}{\mu }+\frac{{\bar{X}}-{\tilde{\Psi }}}{\Sigma _{g}}g\left( \lambda \right) \ge 0\right\} \nonumber \\= & {} {{\mathbb {P}}}\left\{ \frac{{\bar{X}}}{\mu }+\frac{{\bar{X}}}{\Sigma _{g}}g\left( \lambda \right) \ge g\left( \lambda \right) \frac{{\tilde{\Psi }}}{\Sigma _{g}} \right\} ={{\mathbb {P}}}\left\{ {\tilde{\Psi }}\le \left[ 1+\frac{1/\mu }{g\left( \lambda \right) /\Sigma _{g}}\right] {\bar{X}}\right\} \nonumber \\= & {} \frac{1}{2}+\frac{1}{\Psi }\left[ 1+\frac{1/\mu }{g\left( \lambda \right) /\Sigma _{g}}\right] \frac{X_{{\mathcal {L}}}+X_{{\mathcal {R}}}}{2} \end{aligned}$$
    (26)

    and \(W_{{\mathcal {R}}}\left( {\bar{X}},\lambda \right) =1-W_{{\mathcal {L}}}\left( {\bar{X}},\lambda \right) .\) The necessary conditions for a party equilibrium are, using \(W_{{\mathcal {L}}}+W_{{\mathcal {R}}}=1\):

    $$\begin{aligned} \frac{\partial }{\partial X_{{\mathcal {L}}}}U_{{\mathcal {L}}}= & {} -W_{{\mathcal {L}} }+\left( X_{{\mathcal {R}}}-X_{{\mathcal {L}}}\right) \frac{\partial }{\partial X_{ {\mathcal {L}}}}W_{{\mathcal {L}}}=0 \end{aligned}$$
    (27)
    $$\begin{aligned} \frac{\partial }{\partial X_{{\mathcal {R}}}}U_{{\mathcal {R}}}= & {} \left( 1-W_{ {\mathcal {L}}}\right) -\left( X_{{\mathcal {R}}}-X_{{\mathcal {L}}}\right) \frac{ \partial }{\partial X_{{\mathcal {R}}}}W_{{\mathcal {L}}}=0 \end{aligned}$$
    (28)

    and adding together these two first-order conditions, and observing that\( \frac{\partial }{\partial X_{{\mathcal {L}}}}W_{{\mathcal {L}}}=\frac{\partial }{ \partial X_{{\mathcal {R}}}}W_{{\mathcal {L}}}=\frac{1}{2\Psi }\left[ 1+\frac{ 1/\mu }{g\left( \lambda \right) /\Sigma _{g}}\right] \) from Eq. (26), yields \(W_{{\mathcal {L}}}=W_{{\mathcal {R}}}=\frac{1}{2}.\) Using this in the first-order conditions, one can solve for the party equilibrium strategies \(\left( X_{{\mathcal {L}}},X_{{\mathcal {R}}}\right) =\left( -\frac{ \Delta X}{2},\frac{\Delta X}{2}\right) \) where:

    $$\begin{aligned} \Delta X\equiv X_{{\mathcal {R}}}-X_{{\mathcal {L}}}=\frac{W_{{\mathcal {L}}}}{\frac{ \partial }{\partial X_{{\mathcal {L}}}}W_{{\mathcal {L}}}}=\frac{\Psi }{1+\frac{ 1/\mu }{g\left( \lambda \right) /\Sigma _{g}}} \end{aligned}$$
    (29)
  2. (ii)

    To establish that the party strategies above constitute an equilibrium, not only local but also global deviations have to be unprofitable. The argument is made for party \({\mathcal {L}}\) only, since by symmetry it also applies to party \({\mathcal {R}}\). First note that parties’ payoffs in the conjectured equilibrium are zero by symmetry. Party \({\mathcal {L}}\) deviations in the interval \(\left( -\infty ,-\frac{\Delta x}{2}\right] \) are not profitable because party \({\mathcal {L}}\)’s payoff function in this interval is strictly concave in \(X_{{\mathcal {L}}}.\) To see this, note that\(\frac{ \partial ^{2}}{\partial X_{{\mathcal {L}}}^{2}}U_{{\mathcal {L}}}=-2\frac{\partial }{\partial X_{{\mathcal {L}}}}W_{{\mathcal {L}}}+\left( X_{{\mathcal {R}}}-X_{ {\mathcal {L}}}\right) \frac{\partial ^{2}}{\partial X_{{\mathcal {L}}}^{2}}W_{ {\mathcal {L}}}=-2\frac{\partial }{\partial X_{{\mathcal {L}}}}W_{{\mathcal {L}}}<0\) because \(W_{{\mathcal {L}}}\) is strictly increasing and linear in \(X_{\mathcal {L }}.\) See Eq. (26). Party \({\mathcal {L}}\) deviations in the interval \(\left[ 0,\infty \right) \) are not profitable because they produce at best a non-negative policy, which gives party \({\mathcal {L}}\) at most a zero payoff. Party \({\mathcal {L}}\) deviations in the interval \(\left( -\frac{ \Delta x}{2},0\right) \) yield a negative payoff. To see this, note that as party \({\mathcal {L}}\) moves its position \(X_{{\mathcal {L}}}\) beyond \(-\frac{ \Delta x}{2},\) its policy weight \(W_{{\mathcal {L}}}\) drops faster than a commensurate rightward deviation between \(-\frac{\Delta X}{2}\) and \(-\frac{ \Delta x}{2},\) because in the former deviation its seat share shrinks as the other party’s stays constant, whereas in the latter deviation its seat share increases as the other party’s declines. See Fig. 3. Thus, party \(\mathcal { L}\)’s payoff drops faster than if it were to gain seats at the same rate.

  3. (iii)

    To establish uniqueness, an equilibrium with the property \(-\frac{ \Delta x}{2}<X_{{\mathcal {L}}}\le X_{{\mathcal {R}}}<\frac{\Delta x}{2}\) has to be ruled out. Suppose parties pick positions with these properties. To solve for the parties’ policy weights, it is necessary to first derive the parties’ seat shares:

    $$\begin{aligned} h_{{\mathcal {L}}}= & {} \frac{1}{\mu }\left\{ \frac{1}{2}\left[ \left( -\frac{ \Delta x}{2}+\frac{\mu }{2}\right) -\left( X_{{\mathcal {L}}}-\frac{\mu }{2} \right) \right] \right\} =\frac{1}{2\mu }\left( \mu -\frac{\Delta x}{2}-X_{ {\mathcal {L}}}\right) \nonumber \\ \end{aligned}$$
    (30)
    $$\begin{aligned} h_{{\mathcal {R}}}= & {} \frac{1}{\mu }\left\{ \frac{1}{2}\left[ \left( X_{ {\mathcal {R}}}+\frac{\mu }{2}\right) -\left( \frac{\Delta x}{2}-\frac{\mu }{2} \right) \right] \right\} =\frac{1}{2\mu }\left( \mu -\frac{\Delta x}{2}+X_{ {\mathcal {R}}}\right) \nonumber \\ \end{aligned}$$
    (31)

    and thus \(h_{{\mathcal {L}}}-h_{{\mathcal {R}}}=-\frac{1}{2\mu }\left( X_{\mathcal { L}}+X_{{\mathcal {R}}}\right) =-\frac{{\bar{X}}}{\mu }.\) Following a computation as in equation (26), policy weights are as follows:

    $$\begin{aligned} W_{{\mathcal {L}}}\left( {\bar{X}},\lambda \right) =\frac{1}{2}+\frac{1}{\Psi } \left[ 1-\frac{1/2\mu }{g\left( \lambda \right) /\Sigma _{g}}\right] \frac{ X_{{\mathcal {L}}}+X_{{\mathcal {R}}}}{2} \end{aligned}$$
    (32)

    and \(W_{{\mathcal {R}}}\left( {\bar{X}},\lambda \right) =1-W_{{\mathcal {L}}}\left( {\bar{X}},\lambda \right) .\) Solving the necessary equilibrium conditions in Eqs. (27)–(28), gives the party equilibrium strategies \(\left( X_{{\mathcal {L}}},X_{{\mathcal {R}}}\right) =\left( -\frac{\Delta X}{2}, \frac{\Delta X}{2}\right) \) where:

    $$\begin{aligned} \Delta X\equiv X_{{\mathcal {R}}}-X_{{\mathcal {L}}}=\frac{W_{{\mathcal {L}}}}{\frac{ \partial }{\partial X_{{\mathcal {L}}}}W_{{\mathcal {L}}}}=\frac{\Psi }{1-\frac{ 1/2\mu }{g\left( \lambda \right) /\Sigma _{g}}} \end{aligned}$$
    (33)

    However, since \(\frac{\Psi }{1-\frac{1/2\mu }{g\left( \lambda \right) /\Sigma _{g}}}>\frac{\Psi }{1+\frac{1/\mu }{g\left( \lambda \right) /\Sigma _{g}}},\) this larger party divergence is not consistent with the restriction \(- \frac{\Delta x}{2}<X_{{\mathcal {L}}}\le X_{{\mathcal {R}}}<\frac{\Delta x}{2},\) which implies a smaller party divergence, thus ruling out an equilibrium in this range. \(\square \)

Proof of Proposition 5

A party’s marginal electoral penalty for moving away from the district mean, namely \(\frac{1}{2\Psi }\left[ 1+\frac{1/\mu }{g\left( \lambda \right) /\Sigma _{g}}\right] \) is lower the higher is aggregate giving \(g\left( \lambda \right) \). Thus, a party’s policy-motivated deviation from the mean of district means is more profitable the higher is \(g\left( \lambda \right) \) . Taking the difference between the two first-order conditions (27)-(28) and solving for \(X_{{\mathcal {R}}}-X_{{\mathcal {L}}}\) yields \(\Delta X=\frac{1}{\frac{\partial }{\partial X_{{\mathcal {L}}}}W_{{\mathcal {L}}}+\frac{ \partial }{\partial X_{{\mathcal {R}}}}W_{{\mathcal {L}}}}=\frac{\Psi }{1+\frac{ 1/\mu }{g\left( \lambda \right) /\Sigma _{g}}}.\) Then, using \(g^{\prime }(\lambda )>0\) from Proposition 1 gives that \(\frac{\partial }{\partial \lambda }\Delta X>0.\) The comparative statics with respect to \(\Psi ,\mu ,\Sigma _{g}\) follow immediately from the expression for \(\Delta X.\) \(\square \)

Proof of Proposition 6

Equilibrium partisan polarization can be expressed as a function of candidate divergence and party divergence.

$$\begin{aligned} \Delta {{\mathbb {E}}}\left( x\right)= & {} \frac{1}{\mu }\left( \int _{k_{s}\in {\mathcal {R}}}x_{k_{s}}ds-\int _{k_{s}\in {\mathcal {L}}}x_{k_{s}}ds\right) =\frac{ 2}{\mu }\int _{k_{s}\in {\mathcal {R}}}x_{k_{s}}ds \nonumber \\= & {} \frac{2}{\mu }\left[ \left( \frac{\mu }{2}+\frac{\Delta x}{2}\right) -\left( X_{{\mathcal {R}}}-\frac{\mu }{2}\right) \right] \left[ \left( X_{ {\mathcal {R}}}-\frac{\mu }{2}\right) +\left( \frac{\mu }{2}+\frac{\Delta x}{2} \right) \right] /2 \nonumber \\&+\frac{2}{\mu }\left[ \left( \frac{\mu }{2}-\frac{\Delta x}{2}\right) -\left( X_{{\mathcal {L}}}+\frac{\mu }{2}\right) \right] \left[ \left( X_{ {\mathcal {L}}}+\frac{\mu }{2}\right) +\left( \frac{\mu }{2}-\frac{\Delta x}{2} \right) \right] /2 \nonumber \\= & {} \frac{1}{\mu }\left\{ \left[ \left( \frac{\mu }{2}+\frac{\Delta x}{2} \right) ^{2}-\left( X_{{\mathcal {R}}}-\frac{\mu }{2}\right) ^{2}\right] +\left[ \left( \frac{\mu }{2}-\frac{\Delta x}{2}\right) ^{2}-\left( X_{{\mathcal {L}}}+ \frac{\mu }{2}\right) ^{2}\right] \right\} \nonumber \\= & {} \frac{1}{\mu }\left[ \frac{\mu \Delta x}{2}+\frac{\left( \Delta x\right) ^{2}}{4}-X_{{\mathcal {R}}}^{2}+X_{{\mathcal {R}}}\mu -\frac{\mu \Delta x}{2}+ \frac{\left( \Delta x\right) ^{2}}{4}-X_{{\mathcal {L}}}^{2}-X_{{\mathcal {L}}}\mu \right] \nonumber \\= & {} \frac{1}{\mu }\left[ \frac{\left( \Delta x\right) ^{2}}{2}-X_{{\mathcal {R}} }^{2}+X_{{\mathcal {R}}}\mu -X_{{\mathcal {L}}}^{2}-X_{{\mathcal {L}}}\mu \right] \nonumber \\= & {} \frac{1}{\mu }\left[ \frac{\left( \Delta x\right) ^{2}}{2}-\frac{\left( \Delta X\right) ^{2}}{2}+\mu \Delta X\right] \nonumber \\= & {} \frac{1}{\mu }\left[ \frac{\left( \Delta x\right) ^{2}}{2}+\frac{\Delta X\left( 2\mu -\Delta X\right) }{2}\right] \nonumber \\= & {} \Delta x+\left( \Delta X-\Delta x\right) \left( 1-\frac{\Delta X+\Delta x}{2\mu }\right) \end{aligned}$$
(34)

Ideological polarization is \(\Delta x\) and compositional polarization is \( \left( \Delta X-\Delta x\right) \left( 1-\frac{\Delta X+\Delta x}{2\mu } \right) ,\) which is positive if and only if \(\Delta X>\Delta x.\) If \(\Delta X=\Delta x,\) then \(\Delta {{\mathbb {E}}}\left( x\right) =\Delta x\) is pure ideological polarization. See Fig. 4, middle panel. If \(\Delta x=0,\) then \( \Delta {{\mathbb {E}}}\left( x\right) =\frac{\Delta X\left( 2\mu -\Delta X\right) }{2\mu }\) is pure compositional polarization. See Fig. 4, bottom panel. \(\square \)

Proof of Proposition 7

The derivations of the different features of equilibrium polarization are as follows. See also Fig. 3. Inter-party heterogeneity is the policy difference between the rightmost \({\mathcal {R}}\) party member and the leftmost \({\mathcal {L}}\) party member:

$$\begin{aligned} \max _{k_{s}\in {\mathcal {R}}}x_{k_{s}}-\min _{k_{s}\in {\mathcal {L}} }x_{k_{s}}=\left( \frac{\Delta x}{2}+\frac{\mu }{2}\right) -\left( -\frac{ \Delta x}{2}-\frac{\mu }{2}\right) =\mu +\Delta x \end{aligned}$$
(35)

Intra-party heterogeneity is the policy difference between the rightmost and leftmost member within each party, and by equilibrium symmetry, is the same in both parties:

$$\begin{aligned} \max _{k_{s}\in {\mathcal {R}}}x_{k_{s}}-\min _{k_{s}\in {\mathcal {R}}}x_{k_{s}}= & {} \left( \frac{\Delta x}{2}+\frac{\mu }{2}\right) -\left( X_{{\mathcal {R}}}- \frac{\mu }{2}\right) =\mu -\frac{1}{2}\left( \Delta X-\Delta x\right) \nonumber \\ \end{aligned}$$
(36)
$$\begin{aligned} \max _{k_{s}\in {\mathcal {L}}}x_{k_{s}}-\min _{k_{s}\in {\mathcal {L}}}x_{k_{s}}= & {} \left( X_{{\mathcal {L}}}+\frac{\mu }{2}\right) -\left( -\frac{\Delta x}{2}- \frac{\mu }{2}\right) =\mu -\frac{1}{2}\left( \Delta X-\Delta x\right) \nonumber \\ \end{aligned}$$
(37)

as \(X_{{\mathcal {R}}}=-X_{{\mathcal {L}}}=\frac{\Delta X}{2}\) in equilibrium. Party overlap is the policy difference between the rightmost \({\mathcal {L}}\) party member and the leftmost \({\mathcal {R}}\) party member:

$$\begin{aligned} \max _{k_{s}\in {\mathcal {L}}}x_{k_{s}}-\min _{k_{s}\in {\mathcal {R}} }x_{k_{s}}=\left( X_{{\mathcal {L}}}+\frac{\mu }{2}\right) -\left( X_{\mathcal {R }}-\frac{\mu }{2}\right) =\mu -\Delta X \end{aligned}$$
(38)

The fraction of safe seats is the fraction of districts where both candidates are affiliated with the same party:

$$\begin{aligned} \frac{1}{\mu }\left( \int _{L_{s},R_{s}\in {\mathcal {L}}}ds+\int _{L_{s},R_{s} \in {\mathcal {R}}}ds\right)= & {} \frac{1}{\mu }\left\{ \left[ \left( X_{\mathcal { R}}-\frac{\mu }{2}\right) -\left( \frac{\Delta x}{2}-\frac{\mu }{2}\right) \right] \right. \nonumber \\&\quad \left. +\left[ \left( -\frac{\Delta x}{2}+\frac{\mu }{2}\right) -\left( X_{ {\mathcal {L}}}+\frac{\mu }{2}\right) \right] \right\} \nonumber \\= & {} \frac{1}{\mu }\left[ \left( X_{{\mathcal {R}}}-X_{{\mathcal {L}}}\right) -\Delta x\right] =\frac{1}{\mu }\left( \Delta X-\Delta x\right) \nonumber \\ \end{aligned}$$
(39)

According to Propositions 3 and 5, both \(\Delta x\) and \(\Delta X\) are strictly increasing in inequality \(\lambda .\) Condition (17) implies that \(\frac{\partial }{\partial \lambda }\Delta X>\frac{\partial }{\partial \lambda }\Delta x,\) which means that \(\Delta X-\Delta x\) is strictly increasing in inequality. \(\square \)

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Vlaicu, R. Inequality, participation, and polarization. Soc Choice Welf 50, 597–624 (2018). https://doi.org/10.1007/s00355-017-1098-9

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00355-017-1098-9

Navigation