1 Introduction

Prospect theory (PT; Tversky and Kahneman 1992) is regarded as one of the most successful descriptive theories for risk and ambiguity (Starmer 2000; Kahneman and Tversky 2000; Wakker 2010; Barberis 2013). Relative to classical expected utility (EU), PT incorporates nonlinear treatment of probabilities (Preston and Baratta 1948; Allais 1953; Quiggin 1982), or nonadditive event uncertainty resulting from ambiguity (Ellsberg 1961; Schmeidler 1989) and reference-dependence. The latter requires the existence of a reference point, relative to which outcomes are seen as gains or losses, and constitutes a distinctive feature and a key assumption of PT. What exactly determines the reference point has been left unspecified, and not offering a plausible explanation for how the reference point is derived from primitives, i.e., from preferences over prospects, is regarded as a major shortcoming of PT (Fudenberg 2006, p. 696 footnote 2; Pesendorfer 2006, pp. 713–716).

We develop a revealed preference technique based on probability midpoints to show that PT can be obtained from preferences without assuming the reference point as exogenously given. Starting from an indifference between two prospects, probability midpoints are obtained by shifting probability mass across outcomes, such that a new indifference results (Kuilen and Wakker 2011). By keeping the outcomes of these prospects ordered in terms of preference and commonly fixed, and repeatedly shifting probability mass across adjacent outcomes, one can elicit a sequence of probabilities that are perceived equally far apart in terms of preferences.Footnote 1 If the reference point is meaningful for preferences, the elicited probability midpoints will be affected when probability mass is shifted from losses to gains. This can be used to reveal the location of the reference point. Indeed, this feature of behavior is exploited to show how the reference point in PT is revealed from choices. It is in this sense that we obtain the reference point endogenous to the model.

To position our contribution, let us recall that the von Neumann and Morgenstern (1944) foundations for EU imply a linear treatment of probabilities. This means that elicited probability midpoints are arithmetic midpoints and independent of the type or magnitude of outcomes. The EU-axioms do not impose a specific interpretation for outcomes or restrictions on utility, except for the latter being a cardinal measure. For instance, one can interpret real-valued outcomes as final wealth positions (a frequent assumption made in theoretical applications) or as changes relative to a reference point (usually an implicit assumption made in experimental studies).Footnote 2 In particular, imposing a utility value of 0 at the reference point, and treating outcomes with negative utility values as losses and those with positive utility as gains, is compatible with the EU-axioms as long as probabilities are treated linearly. The requirement that utility is 0 at the reference point then appears as an arbitrary restriction of the class of admissible cardinal utilities to a smaller ratio-scale subclass. By contrast, in PT such restrictions on admissible utility functions follow from the asymmetric treatment of the probabilities attached to gains and losses and, thus, properties that capture deviations from linearity in probabilities. It is precisely this asymmetric nonlinear treatment of probabilities, revealed as an inconsistency in elicited probability midpoints, that we exploit in order to provide general foundations for PT.

A nonlinear treatment of probabilities has also been incorporated into the rank-dependent utility (RDU) model (Quiggin 1981, 1982; Segal 1987; Wakker 1994). RDU can be seen as a special case of PT where the presence of a reference point is immaterial for attitudes toward probabilities. General foundations for RDU were provided by Nakamura (1995), Abdellaoui (2002), Abdellaoui and Wakker (2005), and Zank (2010), and they can readily be used to derive PT if the reference point is known in advance. Without knowing that a reference point exists, deriving PT from RDU becomes a challenge. To achieve foundation for PT, we employ a consistency test for specifically elicited probability midpoints. Consistency means that the treatment of probabilities is insensitive to replacements of the stimuli used to elicit midpoints. As our consistency property does not impose restrictions on the admissible probability weighting functions under PT, probability midpoint consistency can accommodate a wide range of behaviors (e.g., risk behavior captured through the popular inverse-S-shaped probability weighting functions; Prelec 1998).

To give further intuition for our main preference tool, suppose we have identified the reference point. Then our condition requires that probability midpoints elicited from preferences are independent of the outcomes (i.e., the stimuli used in the elicitations), whenever the latter are of the same type (i.e., either they are all gains or they are all losses). This is a natural requirement for the treatment of probabilities under PT, where a distinct nonlinear treatment for probabilities of gains as compared to probabilities of losses is explicitly allowed for; it can be inferred, for example, from the reflection examples in Kahneman and Tversky (1979, p. 268). This feature of reference-dependent behavior, which we call sign-dependence, has been widely documented.Footnote 3 Sign-dependence can be inferred from the presence of distinct probability midpoints for gains than for losses, as implied by empirically elicited parametric forms (e.g., Tversky and Kahneman 1992; Abdellaoui 2000). Conversely, sign independence of probability midpoints suggests that the reference point is immaterial for the treatment of probabilities (as in EU or RDU). Hence, if we do not know the location of the reference point, we can employ probability midpoints elicited for different outcomes to test for sign independence. This leads to a revealed preference technique, where replacing a gain by a different gain does not affect elicited probability midpoints and neither should the replacement of a loss by a different loss affect such midpoints; inconsistent midpoints are revealed only if a gain is replaced by a loss or vice versa. In a nutshell, our main preference tool requires that sign independence of revealed probability midpoints is violated once at the most, in which case we identify the location of the reference point.

In what follows, we present preliminary notation and formal expressions for the models of EU, RDU and PT (Sect. 2), informally introduce probability midpoints, and look at the distinct predictions for midpoints resulting from these models. In Sect. 3 we proceed by recalling the preference conditions shared by all three models. We highlight potential difficulties in deriving PT-foundations by giving examples of reference-dependent preferences that are similar to PT-preferences, which do not, in general, allow for the identification of both probability weighting functions. In the literature such preferences have hitherto been circumvented. In Sect. 4 we formally generalize the notion of probability midpoints and present our main preference condition and theorem for the case where the set of outcomes is finite. Extensions are discussed in Sect. 5. In particular, we allude to a procedure that shows how our probability midpoint tool can be employed to identify the location of reference points. The literature on reference points is growing, and different models and approaches have emerged; for instance, there are choice situations in which the reference point may not be a unique degenerate outcome as in PT; thus, a brief summary of this literature is in order. This is done in Sect. 6, where we also discuss issues related to midpoints, the central tool for our PT-foundation. Concluding remarks are in Sect. 7. The Appendix contains further elaborations and proofs.

2 Preliminaries

This section recalls the standard framework for decision under risk and the decision models of expected utility, rank-dependent utility and prospect theory, explaining how the second model extends the first through deviating from the linear treatment of probabilities and how the latter model extends the second through the reference point impacting the nonlinear probability treatment.

2.1 Notation

Let X denote the nonempty set of outcomes. A prospect is a finite probability distribution over X. Prospects are labeled as \( P=(p_{1}:x_{1},\ldots ,p_{n}:x_{n})\) with the usual interpretation that outcome \(x_{j}\in X\) is obtained with probability \(p_{j}\), for \(j=1,\ldots ,n \). Naturally, \(p_{j}\ge 0\) for each \(j=1,\ldots ,n\) and \( \sum _{i=1}^{n}p_{i}=1\). Let \({\mathcal {L}}\) denote the set of all prospects.

A preference relation, denoted \(\succcurlyeq ,\) is assumed over \( {\mathcal {L}}\). Its restriction to subsets of \({\mathcal {L}}\) (e.g., all degenerate prospects where one of the outcomes is received for sure) is also denoted by \(\succcurlyeq \). The symbol \(\succcurlyeq \) means “weak preference” from which \(\succ \) (strict preference) and \(\sim \) (indifference) are defined as usual. The function Vrepresents (or is a representation of) the preference \(\succcurlyeq \) on \({\mathcal {L}}\), if V assigns a real value to each prospect, such that for all \(P,Q\in {\mathcal {L}}\) we have \(P\succcurlyeq Q\Leftrightarrow V(P)\ge V(Q)\). This general representation V will be required to satisfy several properties including those that reflect the behavior corresponding to the treatment of probabilities.

Next, we recall the functional expressions of expected utility, rank-dependent utility and prospect theory, which are specific representations of the preference \(\succcurlyeq \) on \({\mathcal {L}}\). In all these models a utility function, u, for outcomes exists that is strictly monotonic (that is, \(u:X\rightarrow {\mathbb {R}}\) satisfies \( u(x_{i})\ge u(x_{j})\Leftrightarrow x_{i}\succcurlyeq x_{j}\)). As a result, outcomes that are indifferent receive the same utility value. To simplify the exposition we henceforth assume, without loss of generality, that no two distinct outcomes in X are indifferent. This allows us to strictly rank outcomes from best to worst within a prospect; this particular ordering of outcomes is meaningful in rank-dependent models.

2.1.1 Expected utility

Under expected utility (EU) prospects \(P=(p_{1}:x_{1},\ldots ,p_{n}:x_{n})\) are evaluated and compared using the representation

$$\begin{aligned} \mathrm{EU}(P)=\overset{n}{\sum _{j=1}}p_{j}u(x_{j}), \end{aligned}$$
(1)

for a utility function, u, which assigns a real number to each outcome and is strictly monotone. Under EU the utility is cardinal, i.e., it is unique up to multiplication by a positive number A and addition of a constant B. In EU the probabilities are treated linearly. This can be inferred from the “weight” \(p_{j}\) corresponding to the utility for outcome \(x_{j}\) in the above equation. The next model deviates from the linear treatment of probabilities.

2.1.2 Rank-dependent utility

The nonlinear treatment of probabilities in rank-dependent models is incorporated by making the relative position of outcomes meaningful for behavior. We can assume, without loss of generality, that outcomes within a prospect are ordered, as traditionally done, from best to worst in preference or rank-ordered; i.e., writing \(P=(p_{1}:x_{1},\ldots ,p_{n}:x_{n})\) explicitly means that \(x_{1}\succ \cdots \succ x_{n}\). Rank-dependent utility (RDU) holds if the preference is represented by

$$\begin{aligned} \mathrm{RDU}(P)=\overset{n}{\sum _{j=1}}\left[ w\left( p_{1}+\cdots +p_{j}\right) -w\left( p_{1}+\cdots +p_{j-1}\right) \right] u(x_{j}), \end{aligned}$$
(2)

where we use the convention that the sum \(\sum _{j=i}^{l}p_{j}=0\) for \(l<i\). Utility in RDU is similar to EU. Additionally, RDU involves a weighting function, w,  for (decumulative) probabilities that is uniquely determined. Formally, the weighting function is a mapping from the probability interval [0, 1] into [0, 1] that is strictly increasing with \(w(0)=0\) and \( w(1)=1\). In this paper the axiomatically derived weighting functions are continuous on [0, 1].Footnote 4

The probability of obtaining a better-ranked outcome is referred to as good-news probability (formally called rank in Wakker 2010, Definition 5.4.1). In Eq. (2) the “decision weight” corresponding to the utility for outcome \( x_{j}\) is the difference between transformed good-news probabilities. Denoting by \(p_{j}^{d}:=\sum _{i=1}^{j}p_{i}\) the decumulative probability of \(x_{j},j=1,\ldots ,n\), one can rewrite Eq. (2) as

$$\begin{aligned} \mathrm{RDU}(P)=\overset{n-1}{\sum _{j=1}}w\left( p_{j}^{d}\right) \left[ u(x_{j})-u(x_{j+1})\right] +u(x_{n}). \end{aligned}$$

Clearly, having \(w(p)=p\) for all probabilities p in the latter equation implies EU. This confirms that w under RDU captures deviations from linear treatment of good-news probabilities.

To simplify the subsequent exposition, we also present the expression for RDU, which uses the dual probability weighting function, \({\hat{w}} (p):=1-w(1-p)\) for all \(p\in [0,1]\). The dual weighting function is defined over cumulative or bad-news probabilities. Then, we can rewrite RDU for P as follows

$$\begin{aligned} \mathrm{RDU}(P)=u(x_{1})+\overset{n}{\sum _{j=2}}{\hat{w}}\left( p_{j}^{c}\right) \left[ u(x_{j})-u(x_{j-1})\right] , \end{aligned}$$

with transformed cumulative probabilities \(p_{j}^{c}:= \sum _{i=j}^{n}p_{i},j=1,\ldots ,n\).

2.1.3 Prospect theory

The main model of interest in this paper extends RDU by incorporating reference-dependence. That is, it assumes an outcome, \(r\in X\), called the reference point, such that outcomes strictly preferred to it are gains and have a positive utility, and outcomes strictly dispreferred to r are losses with a negative utility value. The treatment of probabilities depends on whether the latter are associated with gains or with losses. Specifically, under prospect theory (PT), a prospect \(P=(p_{1}:x_{1},\ldots ,p_{n}:x_{n})\) is evaluated according to the rank-ordering of outcomes and also according to the position of outcomes relative to the reference point. That is, there are two weighting functions \( w^{+}\) and \(w^{-}\) and a utility function for outcomes with \(u(r)=0\), such that:

  • if all outcomes in P are (weakly) preferred to the reference point (i.e., we have no losses), then P is evaluated by

    $$\begin{aligned} \mathrm{PT}(P)=\mathrm{RDU}^{+}(P)\text { using the weighting function }w^{+}\text {;} \end{aligned}$$
  • if all outcomes in P are (weakly) dispreferred to the reference point (i.e., we have no gains), then P is evaluated by

    $$\begin{aligned} \mathrm{PT}(P)=\mathrm{RDU}^{-}(P)\text { using the weighting function }w^{-}\text {;} \end{aligned}$$
  • if P assigns positive probability to both gains and losses, then the PT-value of P is the sum of P’s gain and loss parts. That is, with \( P^{+} \), the gain part of P, being the prospect “P with all losses replaced by r” and the loss part, \(P^{-}\), being the prospect “P with all gains replaced by r ,” the PT-value of P is given by

    $$\begin{aligned} \mathrm{PT}(P)=\mathrm{PT}(P^{+})+\mathrm{PT}(P^{-}), \end{aligned}$$

    where the qualification \(u(r)=0\) applies. It is custom for PT to express the treatment of probabilities for losses using the dual weighting function, \({\hat{w}}^{-}\). For instance, if \(r=x_{k}\) for some \(2\le k\le n-1 \), the PT-value of P is

    $$\begin{aligned} \mathrm{PT}(P)= & {} \sum _{j=1}^{k-1}w^{+}\left( p_{j}^{d}\right) \left[ u(x_{j})-u(x_{j+1})\right] \nonumber \\&\quad +\sum _{j=k+1}^{n}{\hat{w}}^{-}\left( p_{j}^{c}\right) \left[ u(x_{j})-u(x_{j-1})\right] . \end{aligned}$$
    (3)

From the preceding three cases one can infer that specific uniqueness results apply for PT. Indeed, if X contains no losses (or if X contains no gains), then RDU holds. Similarly, if \(w^{+}=w^{-}\), PT reduces to an RDU-representation irrespective of the interpretation given to outcomes. In all of these cases we have a cardinal utility and a (single) uniquely determined weighting function capturing the treatment of probabilities. Thus, as the reference point is meaningless for that treatment so is the restriction \(u(r)=0\). However, if X contains gains and losses and \( w^{+}\ne w^{-}\), we have two weighting functions that are uniquely determined; (only) in this case \(u(r)=0\) must hold and, therefore, the utility is a ratio scale (i.e., it is unique up to multiplication by a positive constant). As a consequence, gains have positive utility and losses are assigned a negative utility value; hence, we formally refer to the condition \(w^{+}\ne w^{-}\) as sign-dependence.

Except for Schmidt and Zank (2012), all existing foundations for PT assume the reference point as given from the outset; hence, also the uniqueness of the reference point is assumed. Here we drop the assumption of knowing the reference point in advance and we also dispense of structural assumptions for outcomes (e.g., requiring outcomes to be real valued) that are usually imposed for obtaining PT-foundations. Therefore, in our uniqueness results we explicitly state that for PT it is the sign-dependence that implies the uniqueness of the reference point. This is indeed the feature that distinguishes PT-preferences from RDU-preferences and, hence, from EU-preferences. The corollary of this observation is that, in principle, sign-dependence can be used to reveal the reference point from behavior. To do this, we employ preference conditions that build on revealed or elicited probability midpoints. Next, we present this tool, and we look at the implication of midpoints for the just presented models.

2.2 Probability midpoints

Probability midpoints are measurements revealed from preferences; they are derived using shifts in probability mass across outcomes. To motivate the midpoint concept, suppose we have two prospects PQ over outcomes in the set \(\{x_{1},x_{m},x_{n}\}\subset X,\) with the usual ranking \(x_{1}\succ x_{m}\succ x_{n}\). Let

$$\begin{aligned} P=\left( \alpha :x_{1},1-p-\alpha :x_{m},p:x_{n}\right) \text { and }Q=\left( \beta :x_{1},1-q-\beta :x_{m},q:x_{n}\right) \end{aligned}$$

be such that \(\alpha <\beta \) and \(P\sim Q\). (Similar arguments are used if \( \beta <\alpha \).) The latter indifference indicates that the probability mass \(\beta -\alpha \) for \(x_{1}\) compensates for the difference between probabilities p and q for \(x_{n}\) in the prospects P and Q. This can be inferred from Fig. 1, which depicts the corresponding indifference sets in the probability triangle with outcomes \(x_{1},x_{m}\) and \(x_{n}\), where probabilities of \(x_{n}\) (\(x_{1}\)) are indicated on the horizontal (vertical) axis and the remaining probability is given to outcome \(x_{m}\).

Fig. 1
figure 1

Elicited probability midpoint \(\beta \)

We now shift probability mass \( \beta -\alpha \) from \(x_{m}\) to \(x_{1}\) in prospect P, which requires a joint shift of probability mass \(\gamma -\beta \) from \(x_{m}\) to \(x_{1}\) in prospect Q to obtain an indifference between the resulting prospects, i.e., we have

$$\begin{aligned} P^{\prime }:=\left( \beta :x_{1},1-p-\beta :x_{m},p:x_{n}\right) \sim \left( \gamma :x_{1},1-q-\gamma :x_{m},q:x_{n}\right) =:Q^{\prime }. \end{aligned}$$

Considering the vertical axis, Fig. 1 illustrates that the “preference-distance” \(\beta -\alpha \) between P and \(P^{\prime }\) is similar to the preference-distance \(\gamma -\beta \) between Q and \(Q^{\prime }\). That is, substituting \(\alpha \) for \( \beta \) in P leads to an improvement that is equivalent in terms of the preference to the improvement resulting from substituting \(\beta \) for \( \gamma \) in Q. Thus, in terms of the measured preference-distance, it means that \(\beta \) is perceived half-way between \(\alpha \) and \(\gamma \) based on the probability–outcome pairs \((p:x_{n})\) and \((q:x_{n})\) in P and Q, respectively, as gauges. We call \(\beta \) a (revealed) probability midpoint between \(\alpha \) and \(\gamma \).

The representations introduced in the preceding subsection make specific predictions for probability midpoints. In particular, they all imply that probability midpoints are, with model-specific qualifications, independent of the gauges used to reveal those midpoints. We explore these implications next.

2.2.1 Model-specific midpoints

To illustrate the implications of midpoints for our models, consider EU first. Substitution of EU for the indifferences \(P\sim Q\) and \(P^{\prime }\sim Q^{\prime }\) used to reveal the probability midpoint \(\beta \) in Fig. 1 implies, respectively,

$$\begin{aligned} \alpha u(x_{1})+(1-p-\alpha )u(x_{m})+pu(x_{n})= & {} \beta u(x_{1})+(1-q-\beta )u(x_{m})+qu(x_{n}) \\ \text {and }\beta u(x_{1})+(1-p-\beta )u(x_{m})+pu(x_{n})= & {} \gamma u(x_{1}) +(1-q-\gamma )u(x_{m})+qu(x_{n}). \end{aligned}$$

Taking the difference between the resulting two equations and cancelling common terms yields

$$\begin{aligned} \beta =\frac{\alpha +\gamma }{2}, \end{aligned}$$

proving that, beyond probability midpoints being independent of the outcomes used to elicit them, EU demands that \(\beta \) is the arithmetic midpoint between \(\alpha \) and \(\gamma \). By contrast, RDU dispenses of such linearity in the treatment of probabilities, but maintains the independence of the revealed midpoint from outcomes. Similar to the preceding derivation, for RDU we obtain

$$\begin{aligned} w(\beta )=\frac{w(\alpha )+w(\gamma )}{2}, \end{aligned}$$

showing that, on the probability weighting scale, \(\beta \) is perceived half-way between \(\alpha \) and \(\gamma \).

As with RDU, PT allows for nonlinear treatment of probabilities, but it marginally restricts the independence of probability midpoints from outcomes. Specifically, implementing the preceding elicitation of probability midpoints for the case that \(x_{1},x_{m}\) and \(x_{n}\) are gains we obtain, similar to RDU,

$$\begin{aligned} w^{+}(\beta )=\frac{w^{+}(\alpha )+w^{+}(\gamma )}{2}. \end{aligned}$$

The elicitation of midpoints for the case where the gains \( \{x_{1},x_{m},x_{n}\}\) are replaced by a set of losses \(\{y_{1},y_{m},y_{n} \} \) can be implemented using the same \(\alpha \) and \(\beta \) and initially identifying \({\tilde{p}}\) and \({\tilde{q}}\) such that

$$\begin{aligned} {\tilde{P}}:=(\alpha :y_{1},1-{\tilde{p}}-\alpha :y_{m},{\tilde{p}}:y_{n})\sim (\beta :y_{1},1-{\tilde{q}}-\beta :y_{m},{\tilde{q}}:y_{n})=:{\tilde{Q}} \end{aligned}$$

and, subsequently, finding probability \(\gamma ^{*}\) such that

$$\begin{aligned} {\tilde{P}}^{\prime }:=(\beta :y_{1},1-{\tilde{p}}-\beta :y_{m},{\tilde{p}} :y_{n})\sim (\gamma ^{*}:y_{1},1-{\tilde{q}}-\gamma ^{*}:y_{m},\tilde{q }:y_{n})=:{\tilde{Q}}^{\prime }. \end{aligned}$$

Now, substitution of PT in the preceding two indifferences, taking differences of the implied equations and elimination of common terms, leads to

$$\begin{aligned} w^{-}(\beta )=\frac{w^{-}(\alpha )+w^{-}(\gamma ^{*})}{2}. \end{aligned}$$

In general, we have \(\gamma \ne \gamma ^{*}\) (unless we have sign independence under PT) and this inconsistency means that \(\beta \), the probability midpoint of \(\alpha \) and \(\gamma \) for \(w^{+}\), is not necessarily a midpoint of \(\alpha \) and \(\gamma \) for \(w^{-}\). In the presence of sign-dependence it must be the case that (at least some) probability midpoints for gains are not identical to those for losses. It is indeed this observation that, in a reversed sense, is exploited to formulate preference conditions that identify the reference point. Before doing so, we recall the standard preference conditions that are shared by all the models considered above.

3 Additive representation

The decision models presented in the preceding section share several properties that, when combined, imply an additive representation over prospects. For our specific framework these properties of preferences have been presented before (e.g., in Zank 2010); corresponding properties for general rank-ordered sets were provided in Wakker (1993). For completeness we recall these properties here and summarize their implications for preferences in a lemma. This also allows us to highlight some potential difficulties that are particular to PT-preferences. Since it simplifies the exposition, we assume that the set of outcomes is finite, i.e., \( X=\{x_{1},\ldots ,x_{n}\}\) for some natural number n. In Sect. 5 we indicate how our results can be extended to infinite sets of outcomes where, in contrast to the finite outcome case, preferences may be sign-dependent, but the reference point need not be included in the outcome set. Corresponding examples are then provided.

3.1 Traditional preference conditions

It is well known that, for the existence of a representing function for the preference, a necessary requirement is that the preference relation \( \succcurlyeq \) is complete (\(P\succcurlyeq Q\) or \(P\preccurlyeq Q\) for all \(P,Q\in {\mathcal {L}}\)) and transitive; that is, the preference relation is a weak order. We summarize this property as our first axiom.

  • Weak Order: The preference relation satisfies completeness and transitivity.

Further common properties for EU-, RDU- and PT-preferences are those of (first-order stochastic) dominance and of continuity in probabilities. They are presented next. We say that P first-order stochastically dominates Q if \( \sum _{x\in X}p_{x}\ge \sum _{x\in X}q_{x}\) and \(P\ne Q\), where \(p_{x}\) and \( q_{x}\) refer to the probability that P, respectively, Q assign to outcome \(x\in X\).

  • Dominance: The preference relation satisfies dominance (or monotonicity in decumulative probabilities) if \( P\succ Q\) whenever P first-order stochastically dominates Q.

Since Herstein and Milnor (1953), it is custom to regard \({\mathcal {L}}\) as a mixture space endowed with the operation of probability mixing. That is, for each probability \(\mu \in [0,1]\) and all prospects \(P,Q\in {\mathcal {L}} \), the \(\mu \)-probability mixture of P and Q, denoted \(\mu P+(1-\mu )Q\), which assigns probability \(\mu p_{x}+(1-\mu )q_{x}\) to outcome \(x\in X\), is also a prospect in \({\mathcal {L}}\). We use this operation in the definition of continuity for the preference relation \(\succcurlyeq \).

  • Continuity: The preference relation \(\succcurlyeq \) satisfies Jensen-continuity on the set of prospects \({\mathcal {L}}\) if for all prospects \(P\succ Q\) and R there exist \(\rho ,\mu \in (0,1)\) such that \( \rho P+(1-\rho )R\succ Q\) and \(P\succ \mu R+(1-\mu )Q\).

A monotonic weak order that satisfies Jensen-continuity on \({\mathcal {L}}\) also satisfies the stronger Euclidean-continuity on \({\mathcal {L}}\) (see, for example, Abdellaoui 2002, Lemma 18). Consequently, the three conditions taken together imply the existence of a continuous function \(V:{\mathcal {L}} \rightarrow {\mathbb {R}},\) strictly monotonic in decumulative probabilities, that represents \(\succcurlyeq \) on \({\mathcal {L}}\) (Debreu 1954).Footnote 5 The next subsection provides a condition under which the representation V is additively separable. We present it separately, as the strong formulation of the property implies EU and, thus, imposes a linear treatment of probabilities, while the weaker version allows more flexibility for the treatment of probabilities as required in RDU and PT.

3.2 Additive separability properties

This subsection presents an independence property that is shared by EU, RDU and PT. It is formulated as a preference condition involving common shifts of probability mass between outcomes. We have informally used shifts in probabilities when defining elicited probability midpoints. Probability shifts can be regarded as substitutions of an outcome with probability \( \varepsilon \) by a different outcome with that very same probability. Given prospect \(P=(p_{1}:x_{1},\ldots ,p_{n}:x_{n})\) we denote the prospect resulting from a shift of probability \(\varepsilon \) from outcome \(x_{i}\) to outcome \(x_{j}\) in P as the prospect \(\varepsilon _{i,j}P:=(p_{1}^{\prime }:x_{1},\ldots ,p_{n}^{\prime }:x_{n})\), with \(p_{i}^{\prime }=p_{i}-\varepsilon ,p_{j}^{\prime }=p_{j}+\varepsilon ,\) and \(p_{m}^{\prime }=p_{m}\) for \(m\notin \{i,j\}\). Whenever we use this notation, it is implicitly assumed that \(p_{i}\ge \varepsilon >0\) to ensure that \( \varepsilon _{i,j}P\) is a well-defined prospect in \({\mathcal {L}}\).

In terms of the indifference sets of the preference, these substitutions mean that, following a probability shift, a prospect is moved from one indifference set to a different one. We are interested in the effect of such substitutions for prospects that are on the same indifference set. For example, EU satisfies independence of the preferences under common probability shifts, meaning that indifference sets of a preference are mapped into new indifference sets when common probability shifts are applied. This property was termed “replacement invariance” in Machina (1989) and constitutes a sure thing principle for risky prospects similar to Savage (1954) sure thing principle for uncertainty. We adopt the latter terminology for the next axiom.

  • Sure Thing Principle for Risk: The preference relation \( \succcurlyeq \) satisfies the sure thing principle (STP) for risk if

    $$\begin{aligned} P\succcurlyeq Q\Leftrightarrow \varepsilon _{i,i+1}P\succcurlyeq \varepsilon _{i,i+1}Q, \end{aligned}$$

    whenever \(P,Q,\varepsilon _{i,i+1}P,\varepsilon _{i,i+1}Q\in {\mathcal {L}}\).

STP is necessary for expected utility; it is also sufficient for EU in the presence of weak order, first-order stochastic dominance and Jensen-continuity (see, for example, Webb and Zank 2011, Theorem 5). The common consequence effect of Allais (1953) constitutes a violation of the sure thing principle for risk. Accordingly, RDU and PT, which both can accommodate the Allais paradox, satisfy a restricted version of the principle:

  • Comonotonic STP: The preference relation \(\succcurlyeq \) satisfies the comonotonic sure thing principle (CSTP) for risk if

    $$\begin{aligned} P\succcurlyeq Q\Leftrightarrow \varepsilon _{i,i+1}P\succcurlyeq \varepsilon _{i,i+1}Q, \end{aligned}$$

    whenever \(P,Q,\varepsilon _{i,i+1}P,\varepsilon _{i,i+1}Q\in {\mathcal {L}}\) are such that \(p_{i}^{d}=q_{i}^{d}\).

CSTP says that common probability shifts maintain the preference between two prospects if the two prospects offer identical good-news probabilities for \(x_{i-1}\). That is, the probability of obtaining \(x_{i}\), the outcome from which probability is shifted away, or a better outcome is the same in both prospects. This is equivalent to saying that the cumulative probability of obtaining \(x_{i+1}\) or a worse outcome is the same in both prospects, so they have identical bad-news probabilities too. Hence, a shift of a common probability in P and Q results in new prospects where the outcome \(x_{i}\) is also of common rank. If one writes prospects as (de)cumulative distributions over X, one immediately observes that CSTP translates into an independence requirement on a rank-ordered or comonotonic set of cumulated distributions, hence the name for this property.

When CSTP is combined with the preference conditions in the preceding subsection, it implies an additive separability property across outcomes for the representing function V. The result, formally stated here, generally applies to the preference restricted to all prospects except perhaps the best prospect (\(x_{1}\)) and the worst prospect (\(x_{n}\)); the proof follows from Wakker’s (1993, Theorem 3.2) result for additive representations on comonotonic sets.

Lemma 1

The following two statements are equivalent for a preference relation \(\succcurlyeq \) on \({\mathcal {L}}\):

  1. (i)

    The preference relation \(\succcurlyeq \) on \({\mathcal {L}}\backslash \{x_{1},x_{n}\}\) is represented by an additive function

    $$\begin{aligned} V(P)=\overset{n-1}{\sum _{j=1}}V_{j}(p_{j}^{d}), \end{aligned}$$
    (4)

    with strictly increasing functions \(V_{1},\ldots ,V_{n-1}:\) [0, 1] \( \rightarrow \)\({\mathbb {R}}\) which are continuous and bounded with the exception of \(V_{1}\) and \(V_{n-1}\), which could be unbounded at 1 and at 0, respectively.Footnote 6

  2. (ii)

    The preference relation \(\succcurlyeq \) is a Jensen-continuous weak order that satisfies dominance and the comonotonic sure thing principle for risk.

The functions \(V_{1},\ldots ,V_{n-1}\) are jointly cardinal; that is, they are unique up to multiplication by a common positive constant and addition of a real number. The representation can continuously be extended to hold on \({\mathcal {L}}\) if \(V_{1}\) is bounded from above and \(V_{n-1}\) is bounded from below.

As shown in Wakker (1993, Proposition 3.5), by adding further preference conditions that imply a separation of the treatment of probabilities from the utility value assigned to outcomes, one can generally obtain boundedness of all functions in Lemma 1. This has been exploited, for instance, in the RDU-derivations in Diecidue et al. (2009) and in Webb and Zank (2011). For PT this also applies, provided that there are at least two gains and two losses, whence proportionality arguments can be exploited (see our Theorem 2). As we do not make assumptions about which outcome is the reference point, we cannot, in general, invoke information about the number of gains and losses from the outset. This means that the potential unboundedness reported in Lemma cannot a priori be excluded. Consequently, this leads to difficulties for deriving a standard PT-representation, as we discuss next.

3.3 Extended prospect theory

Based on the additive representation of Lemma 1, in this subsection we further explore the consequences for obtaining PT-foundations when one of \(V_{1}\) or \(V_{n-1}\) is unbounded. In general, such unboundedness precludes the identification of probability weighting functions as is required for PT. Our main result shows that we are able to obtain representations for “extended PT-preferences” where a reference point can nonetheless be identified and, further, a utility for losses and a corresponding probability weighting function (if \(V_{1}\) is unbounded), or a utility for gains with the associated probability weighting function (if \(V_{n-1}\) is unbounded), can still be derived. In particular, at most one of \(V_{1}\) or \( V_{n-1}\) can be unbounded. We provide specific examples to illustrate these extreme cases as they highlight specific types of preferences that are plausible when general outcome sets and reference-dependence are jointly considered. In the foundational literature on PT such behavior has hitherto been excluded due to the structural requirements on the set outcomes and the properties reflected in the corresponding utility functions.

Example 1

Assume that the representation of Lemma 1 holds for \(X=\{x_{1},x_{2},x_{3}\}\) with \(x_{1}\succ x_{2}\succ x_{3}\) and that \(V_{1}\) and \(V_{2}\) are bounded. Then, the representation is a PT-functional with \(r=x_{2}\) as the reference point, unless \(\succcurlyeq \) is represented by RDU.

The derivation of the PT-representation in Example 1 proceeds as follows. As \(V_{1}\) and \(V_{2}\) are jointly cardinal, we can pick the representation of preferences with \(V_{1}(0)=0=V_{2}(1)\). By setting \( w^{+}(p):=V_{1}(p)/V_{1}(1)\) for all \(p\in [0,1]\) and \( u(x_{1}):=V_{1}(1)\), \({\hat{w}}^{-}(p):=1-V_{2}(p)/V_{2}(0)\) for all \(p\in [0,1]\) and \(u(x_{3}):=V_{2}(0)\), and further \(u(x_{2}):=0\), we obtain

$$\begin{aligned} V(p_{1}:x_{1},p_{2}:x_{2},p_{3}:x_{3})=w^{+}(p_{1})u(x_{1})+{\hat{w}} ^{-}(p_{3})u(x_{3}), \end{aligned}$$
(5)

which is a PT-representation with reference point \(r=x_{2}\).

If \(w^{+}(p)=w^{-}(p)\) for all \(p\in [0,1]\), one can show that Eq. (5) reduces to an RDU-representation with the corresponding uniqueness results. For the case that \(w^{+}\ne w^{-} \), one can show that the values \(V_{1}(0)\) and \(V_{2}(1)\) are constants that are immaterial for the ranking of prospects, such that the representation in Eq. (5) is indeed PT with the corresponding uniqueness results; this aspect is further elaborated on in the Appendix.

For the case of Example 1, we are not aware of a preference condition, that explicitly identifies \(x_{2}\) as a reference point in a manner that pins down PT with a ratio-scale utility, beyond the properties that imply additive separability as required in Lemma 1.Footnote 7 More generally, we observe that it is only in the case of \(n=3\) that a reference point may exist, but that both \(V_{1}\) and \( V_{n-1}(=V_{2})\) in Lemma 1 are unbounded. Such unboundedness precludes a separation of utility and sign-dependent probability weighting functions. Including such representations under a general notion of PT-preferences means that Lemma 1 already gives necessary and sufficient conditions for a corresponding representation. Given this observation, from here onward we require that X has at least four strictly rank-ordered outcomes. The next two examples show that the assumption of more than three outcomes restricts, but does not eliminate unboundedness for a representation in the presence of reference-dependence.

Example 2

Consider the case that \(n\ge 4\) and that the preference on \({\mathcal {L}}\backslash \{x_{1}\}\) is represented by

$$\begin{aligned} W(P)=V_{1}(p_{1})+\sum _{j=3}^{n}{\hat{w}}^{-}\left( p_{j}^{c}\right) \left[ u(x_{j})-u(x_{j-1})\right] , \end{aligned}$$
(6)

where u and \({\hat{w}}^{-}\) are as in PT with \(r=x_{2}\) and \(V_{1}(p_{1})\) converges to \(\infty \) when \(p_{1}\) approaches 1, e.g., \(V_{1}(p_{1})=[ \frac{p_{1}}{1-p_{1}}]u(x_{1})\) for some positive number \(u(x_{1})\).

One can regard the preference in Example 2 as that of a patient who has been diagnosed with a severe disease, such as cancer. Suppose some potential medical interventions can lead to a range of outcomes, the best being \(x_{1}=\)“fully cured from cancer,” while other interventions may prolong life duration but do not offer positive probability for \(x_{1}\). It is conceivable that the latter interventions are all perceived as leading to losses and, hence, they are perceived unattractive relative to an intervention with a positive probability for \(x_{1}\). A related example is documented in Thaler and Johnson (1990) and analyzed in Barberis et al. (2001). After having faced a series of losses, many investors attempt to break even by engaging in very risky trades, despite the chances of breaking even being relatively small. Such investors appear to perceive the event of breaking even as leading to an extremely attractive outcome, while the complementary event is viewed as leading to losses of a tolerable magnitude in utility terms. The counterpart of Example 2 is the following one.

Example 3

Consider the case that \(n\ge 4\) and that the preference on \({\mathcal {L}}\backslash \{x_{n}\}\) is represented by

$$\begin{aligned} {\bar{W}}(P)= \sum _{j=1}^{n-2}w^{+}\left( p_{j}^{d}\right) \left[ u(x_{j})-u(x_{j+1})\right] +V_{n-1}(1-p_{n}), \end{aligned}$$
(7)

where u and \(w^{+}\) are as in PT with \(r=x_{n-1}\) and \(V_{n-1}\) is as in Lemma 1 and converging to \(-\infty \) at 0. Specifically, consider \(V_{n-1}(1-p_{n})=[\frac{p_{n}}{1-p_{n}}]u(x_{n})\) for some negative number \(u(x_{n})\).

The functional in Example 3 can be thought of as a representation for a preference with an extreme form of aversion or pessimism, where the possible loss \(x_{n}\) is extremely unattractive (e.g., ruin) and any prospect with a positive probability for \(x_{n}\) will be regarded inferior to a prospect with zero probability for \(x_{n}\). Individuals exhibiting this form of pessimism are willing to buy insurance at prices far above the actuarially fair value to completely avoid the loss \( x_{n}\). Such behavior is reported in relation to substantially increased demand for flood and earthquake insurance after a corresponding event has occurred (Kunreuther et al. 1978; Palm 1995)Footnote 8 and in the willingness to pay for the complete elimination of risk associated with a hazardous product (Viscusi et al. 1987).

Having elaborated on potential issues for deriving PT when the set of outcomes includes a single gain or a single loss, we proceed by keeping in mind that in such special cases extreme sensitivity in probabilities of best or worst outcomes may preclude a derivation of PT in which both probability weighting functions are uniquely specified. Instead, we may obtain what we term extended prospect theory: Preferences are either represented by PT on \({\mathcal {L}}\), or they are represented on \({\mathcal {L}}\backslash \{x_{1}\}\) by the functional in Eq. (6) with \( V_{1}:[0,1)\rightarrow [0,\infty )\) unbounded at \(p=1\), or they are represented on \({\mathcal {L}}\backslash \{x_{n}\}\) by Eq. (7) with \(V_{n-1}:(0,1]\rightarrow (-\infty ,0]\) unbounded at \( p=0 \). As the preference conditions presented in the next section show, in such cases we can still identify the reference point from behavior if preferences are reference-dependent.

4 Consistent probability midpoints

We have already established that for EU-preferences probability midpoints are algebraic midpoints, that they are independent of outcomes and, hence, reference point independent. Similarly, RDU-preferences imply reference independence although, due to flexibility in the treatment of probabilities, RDU allows probability midpoints to differ from algebraic midpoints. PT-preferences are different. They allow for elicited probability midpoints to depend on the type (i.e., gain or loss), but not the magnitude of outcomes.

This section introduces consistency conditions for probability midpoints which, when added to statement (ii) of Lemma 1, allow for the identification of a reference point. To this aim, it is instructive to explore the implications of elicited probability midpoints for the preference representation in Lemma 1. Recall the indifferences \(P\sim Q\) and \(P^{\prime }\sim Q^{\prime }\) in Fig. 1. After substitution of the additive representation, subtracting the first equation from the second and cancelling common terms, we obtain \(V_{1}(\beta )-V_{1}(\alpha )=V_{1}(\gamma )-V_{1}(\beta )\) or, equivalently,

$$\begin{aligned} V_{1}(\beta )=\frac{V_{1}(\gamma )+V_{1}(\alpha )}{2}. \end{aligned}$$
(8)

In terms of \(V_{1}\), which captures the treatment of probabilities associated with outcome \(x_{1}\), Eq. (8) states that \(\beta \) is a \(V_{1}\)-midpoint between \(\alpha \) and \(\gamma \). The properties of the additive representation imply that locally one can always find midpoints and that the latter do not depend on other outcome stimuli. For instance, taking an outcome \(x_{m^{\prime }}\ne x_{m}\) such that \( x_{1}\succ x_{m^{\prime }}\succ x_{n}\) and considering preferences among prospects over the set \(\{x_{1},x_{m^{\prime }},x_{n}\}\subset X\), one can find probabilities \({\bar{p}},{\bar{q}}\) such that \({\bar{P}}=(\alpha :x_{1},1- {\bar{p}}-\alpha :x_{m^{\prime }},{\bar{p}}:x_{n})\sim (\beta :x_{1},1-{\bar{q}} -\beta :x_{m^{\prime }},{\bar{q}}:x_{n})={\bar{Q}}\). Now, shifting \(\beta -\alpha \) from \(x_{m^{\prime }}\) to \(x_{1}\) in \({\bar{P}}\) requires an equivalent shift of \({\bar{\gamma }}-\beta \) from \(x_{m^{\prime }}\) to \(x_{1}\) in \({\bar{Q}}\) to obtain a new indifference, say \({\bar{P}}^{\prime }\sim {\bar{Q}} ^{\prime }\). As before, substitution of the representation in Lemma for the indifferences \({\bar{P}}\sim {\bar{Q}}\) and \({\bar{P}}^{\prime }\sim {\bar{Q}}^{\prime },\) subtraction of the resulting equations and cancellation of common terms, gives

$$\begin{aligned} V_{1}(\beta )=\frac{V_{1}({\bar{\gamma }})+V_{1}(\alpha )}{2}, \end{aligned}$$

which can only hold if \({\bar{\gamma }}=\gamma \), for otherwise Eq. (8) is violated.

The preceding analysis shows that revealed probability midpoints elicited through shifts of probability mass to \(x_{1}\) are meaningful for \(V_{1}\), i.e., they are consistent and independent of stimuli other than \(x_{1}\). Similarly, midpoints for each function \(V_{j},j=2,\ldots ,n-1\) can be elicited and are meaningful concepts given the additive separable representation of Lemma 1. Next we present properties that demand further consistency for elicited probability midpoints.

4.1 Good-news and bad-news midpoint consistency

The properties presented next demand that probability midpoints derived from probability shifts to best (worst) outcomes are also midpoints when the same probability mass is shifted to outcomes of intermediate rank. These behavioral qualifications for preferences are referred to as consistency properties for probability midpoints.

  • Good-News Midpoint Consistency: For \(m\in \{2,\ldots ,n-1\}\), the preference relation \(\succcurlyeq \) satisfies good-news midpoint consistency (GMC) above \(x_{m}\), if for \(P=(\alpha :x_{1},p_{m}-\alpha :x_{m},p_{m+1}:x_{m+1},\ldots ,p_{n}:x_{n})\) and \(Q=(\beta :x_{1},q_{m}-\beta :x_{m},q_{m+1}:x_{m+1},\ldots ,q_{n}:x_{n})\) we have

    $$ \begin{aligned} P\sim Q \& (\beta -\alpha )_{m,1}P\sim (\gamma -\beta )_{m,1}Q\Rightarrow (\beta -\alpha )_{m,{\tilde{m}}}P\sim (\gamma -\beta )_{m, {\tilde{m}}}Q, \end{aligned}$$

    for all \({\tilde{m}}\in \{1,\ldots ,m\}\) whenever \(\alpha \le \beta \le \gamma \) are probabilities such that \(P,Q,(\beta -\alpha )_{m,1}P,\) and \( (\gamma -\beta )_{m,1}Q\) are from \({\mathcal {L}}\).Footnote 9

It is worth contrasting the normative content of GMC with that of the more restrictive von Neumann and Morgenstern independence axiom. In GMC, the probability mass \(\beta -\alpha \) in \((\beta -\alpha )_{m,1}P\) is additional good-news probability for \(x_{1}\) relative to P. Based on preferences, this improvement is equivalent to the additional good-news probability \(\gamma -\beta \) in \((\gamma -\beta )_{m,1}Q\) as compared to Q . GMC requires that such good-news probabilities are of equivalent value for preferences when shifted to a common outcome \(x_{{\tilde{m}}}\) (i.e., a degenerate prospect) that is ranked between \(x_{1}\) and \(x_{m}\). The latter aspect indicates that preferences may be sensitive to the rank-ordering of outcomes when shifting equivalent good-news probabilities. Such rank-ordering restrictions do not apply to the von Neumann and Morgenstern independence axiom where probability mass is taken away proportionally from each outcome and shifted to another common, and not necessarily degenerate, prospect S:

$$\begin{aligned} P\sim Q\Leftrightarrow \rho S+(1-\rho )P\sim \rho S+(1-\rho )Q \end{aligned}$$

for all \(\rho \in [0,1]\) and all \(P,Q,S\in {\mathcal {L}}\).Footnote 10 Clearly, as no further restrictions are placed on the “common risk” type S, the latter property applies more generally than GMC and is, thus, more powerful. In particular, the independence axiom implies that elicited probability midpoints are also algebraic midpoints, the characteristic property of EU-preferences.

It can be verified that RDU satisfies GMC above \(x_{m}\) for all \(m=2,\ldots ,n-1\). This has been shown in Zank (2010). Similarly, this holds for the preference with the representation of Example 3. The next lemma shows that PT satisfies GMC above \(x_{m}\) if all outcomes above \(x_{m}\) are gains, hence, in particular if \(x_{m}\) is the reference point.

Lemma 2

Assume that the preference \(\succcurlyeq \) on \( {\mathcal {L}}\) is represented by PT (or by extended PT on \({\mathcal {L}} \backslash \{x_{n}\})\). Then \(\succcurlyeq \) satisfies good-news midpoint consistency above x whenever all outcomes ranked above x are gains.

The next property is the dual analog of GMC as it focuses on shifting equivalent probability mass to outcomes of lower rank.

  • Bad-News Midpoint Consistency: For \(m\in \{2,\ldots ,n-1\}\), the preference relation \(\succcurlyeq \) satisfies bad-news midpoint consistency (BMC) below \(x_{m}\), if for \(P=(p_{1}:x_{1},\ldots ,p_{m-1}:x_{m-1},p_{m}-\alpha :x_{m},\alpha :x_{n})\) and \( Q=(q_{1}:x_{1},\ldots ,q_{m-1}:x_{m-1},q_{m}-\beta :x_{m},\beta :x_{n})\) we have

    $$ \begin{aligned} P\sim Q \& (\beta -\alpha )_{m,n}P\sim (\gamma -\beta )_{m,n}Q\Rightarrow (\beta -\alpha )_{m,{\tilde{m}}}P\sim (\gamma -\beta )_{m, {\tilde{m}}}Q, \end{aligned}$$

    for all \({\tilde{m}}\in \{m,\ldots ,n\}\) whenever \(\alpha \le \beta \le \gamma \) are probabilities such that \(P,Q,(\beta -\alpha )_{m,n}P,\) and \( (\gamma -\beta )_{m,n}Q\) are from \({\mathcal {L}}\).

The preference represented by the function in Example 2 satisfies BMC below \(x_{m}\) for all \(m\in \{2,\ldots ,n-1\}\). Likewise, RDU-preferences satisfy BMC below each outcome. The next lemma confirms that PT satisfies BMC below all outcomes which are not gains, that is, below all losses and below the reference point.

Lemma 3

Assume that the preference \(\succcurlyeq \) on \( {\mathcal {L}}\) is represented by PT (or by extended PT on \({\mathcal {L}} \backslash \{x_{1}\})\). Then \(\succcurlyeq \) satisfies bad-news midpoint consistency below x whenever all outcomes ranked below x are losses.

If we have reference-dependence and we know the location of the reference point, then it is easy to formulate a sign-dependent midpoint consistency condition that characterizes PT when combined with the properties in Lemma 1. If there are two or more gains and two or more losses, all we need is GMC above all outcomes that are not losses and BMC below all outcomes that are not gains. Although not stated formally, this result is new and provides PT-foundations for general sets of outcomes, thereby directly extending the RDU foundations of Nakamura (1995), Abdellaoui (2002), Abdellaoui and Wakker (2005), and Zank (2010). In the case that there is one gain or one loss, we further need to assume that the additive representation consists of bounded functions; alternatively, the domain of the preference is limited to all but the best and worst prospects, in which case we obtain foundations for extended PT. Next we proceed without assuming that we know the location of the reference point.

4.2 Reference point revealing midpoint consistency

In general, we do not know which of the outcomes in X is the reference point. To obtain PT-foundations we develop a technique to first reveal the reference point before invoking the appropriate midpoint consistency properties. For instance, we can take outcomes in turn and verify if probability midpoints are independent of the outcomes used to reveal those midpoints. For PT-preferences a violation of this test of midpoint consistency is allowed once at the most. By starting with probability midpoint elicitations at best (worst) outcomes and moving sequentially toward lower (better)-ranked outcomes, we can test for “probability midpoint consistency” for all outcomes. Having finitely many outcomes means that this process will terminate with either universal consistency (i.e., there is no reference point affecting preferences) or a single violation (i.e., there exists a reference point that affects preferences). Observing multiple violations of consistency means that PT cannot hold. While this will be excluded by our main preference condition, it is clear that our consistency test can also be used to falsify PT. Applying the consistency property repeatedly comes down to an algorithmic procedure (see also Sect. 5.3) that allows us to identify outcomes that are gains and outcomes that are losses if preferences are reference-dependent. This motivates the following property.Footnote 11

  • Reference Point Revealing Midpoint Consistency: The preference relation \(\succcurlyeq \) satisfies reference point revealing midpoint consistency (RMC) if for each \(m\in \{2,\ldots ,n-1\}\) the preference satisfies good-news midpoint consistency above \(x_{m}\) or bad-news midpoint consistency below \(x_{m-1}\) (or both).

Before turning to the main result, we elaborate further on some implications of RMC. Suppose that \(m_{\max }\in \{2,\ldots ,n-1\}\) is the largest index such that GMC above \(x_{m_{\max }}\) holds. Then, in the presence of an additive representation, by repeated application of GMC, one can show that GMC above \(x_{j}\) holds for all \(j\le m_{\max }\), that is, for all outcomes ranked above \(x_{m_{\max }}\). Similarly, if \(m_{\min }\in \{2,\ldots ,n-1\}\) is the smallest index such that BMC below \(x_{m_{\min }-1}\) holds, then BMC below \(x_{j}\) holds for all \(j\ge m_{\min }-1\) in the presence of an additive representation. Therefore, as RMC demands for each \(m\in \{2,\ldots ,n-1\}\) that GMC above \(x_{m}\) or BMC below \(x_{m-1}\) holds, either \(m_{\max }=m_{\min }=k\), i.e., a single index \(k\in \{2,\ldots ,n-1\}\) exists such that GMC above \(x_{k}\) or BMC below \(x_{k-1}\) holds, or (as shown in the Appendix) for all \(m\in \{2,\ldots ,n-1\}\) both GMC above \(x_{m}\) and BMC below \(x_{m-1}\) hold. In the latter case we have sign independence, while in the former case we have sign-dependence and \(r=x_{k}\) is the unique reference point. Under each of these conclusions one can derive (extended) PT. We conclude this section by summarizing this result.

Theorem 1

The following two statements are equivalent for a preference relation \(\succcurlyeq \) on \({\mathcal {L}}\):

  1. (i)

    The preference relation \(\succcurlyeq \) is represented by extended PT.

  2. (ii)

    The preference relation \(\succcurlyeq \) is a Jensen-continuous weak order that satisfies dominance, the comonotonic sure thing principle for risk and reference point revealing midpoint consistency.

Where they exist, the probability weighting functions are uniquely determined. Further, if \(w^{+}=w^{-}\), utility is cardinal as preferences agree with rank-dependent utility; otherwise, the utility function is a ratio scale and there is a unique index \(k\in \{2,\ldots ,n-1\}\) such that the reference point is \(r=x_{k}\).

The proof of the preceding theorem is in the Appendix. The next section looks at extensions of our main result.

5 Extensions

In the previous sections we have assumed that no outcomes are indifferent. This requirement can be relaxed if there are at least four strictly ordered outcomes in the finite set X. All results remain valid if we restrict the analysis to the set of representatives for each indifference set of outcomes; within an indifference set all outcomes have the same utility value. As pointed out in Sect. 3, our results also remain valid if we include the case that there are exactly three strictly ordered outcomes, however, then RMC trivially holds and, given Lemma 1 with boundedness conditions for the additive representation satisfied, PT follows as indicated in Example 1. If boundedness conditions do not hold one must allow for extended PT-representations as suggested in Examples 2 and 3 , where the utility of the single gain, respectively, the single loss and the corresponding probability weighting function cannot be identified separately as the representing functions exhibit asymptotic behavior when approaching extreme probabilities. Such behavior is excluded if preferences agree with RDU, the special case of PT with sign independence. Finally, we recall that, in the trivial case of having at most two strictly ordered outcomes, the dominance property ensures the existence of an ordinal representation; it is well known that there is insufficient structure on the set of prospects to obtain more meaningful results for the two-outcome case.

In our derivation of PT it has been essential that the weighting functions are continuous at 0 and at 1. Discontinuities at these extreme probabilities are, however, empirically meaningful. We could adopt a weaker version of continuity for probabilities that is restricted to prospects that have common best and worst outcomes each with a positive objective probability. Such conditions have been used in Cohen (1992) and more recently in Webb and Zank (2011) where probability weighting functions are derived that are linear and discontinuous at extreme probabilities. These weighting functions can then be described by two parameters. As Webb and Zank show, this relaxation of continuity in probabilities comes at a price. They require additional structural assumptions for the preference in order to obtain consistency of those parameters across sets of prospects with different worst and best outcomes. Also, specific consistency principles that imply the uniqueness of those parameters are required. We conjecture that in our framework such consistency principles can be formulated for nonlinear weighting functions that are discontinuous at 0 and at 1. A formal derivation of PT with such weighting functions is, however, beyond the scope of this paper.

5.1 Omitting asymptotic behavior

In this subsection we provide a variant of the probability midpoint condition that excludes unboundedness of the functions \(V_{1}\) and \(V_{n-1}\) in Theorem 1. If sign-dependence holds, the property can be seen as a condition that imposes the existence of at least two gains and at least two losses. With a further outcome that acts as the reference point, this property therefore assumes at least five strictly ordered outcomes. Under this mild richness assumption for outcomes, the following property demands consistent probability midpoints at the two best outcomes and, separately, consistent midpoints at the two worst outcomes and, additionally, RMC.

  • Bounded RMC: The preference relation \(\succcurlyeq \) satisfies bounded RMC (b-RMC) if RMC holds and further GMC holds above \(x_{3}\) and BMC holds below \(x_{n-2}\).

The augmented b-RMC property uses the information that the two best outcomes are of the same type (both are gains) and that the two worst outcomes are also of the same type (they are losses). Such information is not usually available to the analyst, although in some cases such assumptions can plausibly be made. For instance, when a decision maker wishes to sell a good at an auction, there might be several low prices at which selling would definitely be perceived as making a loss (being substantially below the reservation price), while there might be several high prices at which selling the good is perceived as a definite gain.

If the set of outcomes is too small, declaring some outcomes as gains and some as losses may in fact pin down the model. For example, it should be clear that if there are only four strictly rank-ordered outcomes in X, then b-RMC implies that GMC holds above \(x_{3}\) and BMC holds below \(x_{2}\). For the additive representation in Lemma 1 this has the implication that probability midpoints for \(V_{1}\) are also midpoints for \(V_{2}\) (inferred from GMC above \(x_{3}\)) and probability midpoints for \(V_{3}\) are also midpoints for \(V_{2}\) (inferred from BMC below \(x_{2}\)). Hence, midpoints are consistent throughout and this excludes sign-dependence; hence, RDU is implied.

The next result explicitly assumes that there are at least five strictly ordered outcomes, to allow for reference-dependence when b-RMC is invoked.

Theorem 2

Assume that the finite set of outcomes X has at least five strictly ranked outcomes. The following two statements are equivalent for a preference relation \(\succcurlyeq \) on \({\mathcal {L}}\):

  1. (i)

    The preference relation \(\succcurlyeq \) is represented by PT with either sign-dependence (in which case the reference point is \(r=x_{k}\) for some \(k\in \{3,\ldots ,n-2\})\) or sign independence (RDU holds).

  2. (ii)

    The preference relation \(\succcurlyeq \) is a Jensen-continuous weak order that satisfies dominance, the comonotonic sure thing principle for risk and bounded reference point revealing midpoint consistency.

The probability weighting functions are uniquely determined. Further, if \( w^{+}=w^{-}\), utility is cardinal as preferences agree with rank-dependent utility; otherwise, the utility function is a ratio scale and the reference point is unique.

The proof of the preceding result is in the Appendix. In Theorem we exploit the fact that the set of outcomes is finite. If X is infinite a property like b-RMC cannot simply be extended because, for example, for uncountable sets identifying the two best outcomes or the two worst outcomes may not be possible. But RMC can be formulated to hold on specific subsets of prospects as our next subsection shows. We now proceed to the discussion of how to obtain extended PT-foundations for infinite outcome sets.

5.2 Infinite outcome sets

As in the preceding sections, we restrict our analysis to the case that there are no indifferent outcomes. When X is infinite some of our properties need to be extended. Weak order, continuity and dominance are as before. CSTP is invoked for each restriction of the preference to sets of prospects over finitely many common outcomes. For a finite set of outcomes \( Y\subset X\) denote by \({\mathcal {L}}_{Y}\) the set of prospects that give zero probability to outcomes not contained in Y. We now formulate the extension of CSTP.

  • Comonotonic STP: The preference relation \(\succcurlyeq \) satisfies the comonotonic sure thing principle (CSTP) on \( {\mathcal {L}}\) if \(\succcurlyeq \) satisfies CSTP on \({\mathcal {L}}_{Y}\) for all finite sets \(Y\subset X\).

One implication of (extended) PT is that, if the preference is restricted to prospects over a finite set of outcomes that contains only gains or the reference point, call it \(X_{g}\) (g stands for good outcomes), then RDU represents that restriction of the preference. Similarly, RDU represents the preference restricted to prospects over a finite set of outcomes that contains only losses or the reference point, say \(X_{b}\) (b referring to bad outcomes). The two RDU functions need not have the same probability weighting function; in particular, sign-dependence requires distinct weighting functions for the probabilities of gains and losses. Then, for prospects over \({\mathcal {L}}_{X_{g}\cup X_{b}}\), RMC will be satisfied only when the reference point is included in the union of the sets \(X_{g}\) and \( X_{b}\). For infinite sets of outcomes we, therefore, propose the following extension of RMC.

  • Extended RMC: The preference relation \(\succcurlyeq \) satisfies extended reference point revealing midpoint consistency (e-RMC) on \({\mathcal {L}}\) if for each outcome \(x\in X\) one (or both) of the following statements apply:

    1. (a)

      For each finite set Y with all outcomes ranked above x the preference \(\succcurlyeq \) restricted to \({\mathcal {L}}_{Y\cup \{x\}}\) satisfies GMC above y and BMC below y for each \(y\in Y\).

    2. (b)

      For each finite set Z with all outcomes ranked below x the preference \(\succcurlyeq \) restricted to \({\mathcal {L}}_{\{x\}\cup Z}\) satisfies BMC below z and GMC above z for each \(z\in Z\).

Before presenting the next result, we point out that, for obtaining extended PT-foundations for \(\succcurlyeq \) on \({\mathcal {L}}\), we employ similar tools as in the derivation of Theorem 1 but restricted to each set \( {\mathcal {L}}_{Y}\) for finite \(Y\subset X\). In the presence of weak order, continuity and dominance, CSTP characterizes a general representation that is additive on each set \({\mathcal {L}}_{Y}\) for finite \(Y\subset X\). Similar results were provided in Chew and Wakker (1996) for the general setup with ambiguity. When combined with the other properties, e-RMC implies extended PT on each set \({\mathcal {L}}_{Y}\), but it need not imply the existence of a uniquely determined reference point because the subset Y may not contain the reference point. In general, the lack of structural assumptions on the set of outcomes X means that a reference point \(r\in X\) may not exist, for instance, as it is not part of the considered outcome set, even though preferences are sign-dependent. The following example illustrates.

Example 4

Assume that \(X=[-50,0)\cup (0,100]\) and that the preference \(\succcurlyeq \) on \({\mathcal {L}}_{X}\) is represented by

$$\begin{aligned} V(p_{1}:x_{1},\ldots ,p_{n}:x_{n})= & {} \sum _{j=1}^{k-1}w^{+}\left( p_{j}^{d}\right) \left[ u(x_{j})-u(x_{j+1})\right] +w^{+}\left( p_{k}^{d}\right) u(x_{k})\\&\quad +\,{\hat{w}}^{-}(p_{k+1}^{c})u(x_{k+1})\\&\quad +\sum _{j=k+2}^{n}{\hat{w}}^{-}\left( p_{j}^{c}\right) \left[ u(x_{j})-u(x_{j-1})\right] , \end{aligned}$$

with strictly increasing and continuous weighting functions \(w^{+}\) and \( w^{-}\), and a utility function defined as \(u(x)=x\) if \(x\in (0,100]\) and \( u(x)=2x\) if \(x\in [-50,0)\), and k denoting the number of positive outcomes in the prospect \((p_{1}:x_{1},\ldots ,p_{n}:x_{n})\).

The preference in Example 4 satisfies all properties required for the existence of a general additive representation (weak order, Jensen-continuity, dominance, CSTP) and also e-RMC. However, the value 0, which acts as a reference point, is not an outcome that is contained in the set X. Hence, the preference is sign-dependent, but no reference point within X exists; the reference point is “outside the model.”

Demanding convexity for the outcome set can circumvent the problems alluded to in Example 4. But even if the set of outcomes is a closed interval, we may have sign-dependent preferences but no reference point. This can be inferred from the following example that resembles features from Example 3.

Example 5

Assume that \(X=[0,100]\) and that the preference \( \succcurlyeq \) on \({\mathcal {L}}_{X}\backslash \{0\}\) is represented by

$$\begin{aligned} V(p_{1}:x_{1},\ldots ,p_{n-1}:x_{n-1},p_{n}:0)= & {} \sum _{j=1}^{n-2}w^{+}\left( p_{j}^{d}\right) \left[ u(x_{j})-u(x_{j+1})\right] \\&+\,w^{+}(p_{n-1}^{d})u\left( x_{n-1}\right) -\left[ \frac{p_{n}}{1-p_{n}}\right] , \end{aligned}$$

with strictly increasing and continuous weighting functions \(w^{+}\) and \( w^{-}\), and a utility function defined as \(u(x)=x\) if \(x\in (0,100]\) (that can discontinuously be extended to \(u(0)=-1)\), and \(n-1\) referring to the number of positive outcomes in the prospect evaluated by V.

To exclude preferences that are sign-dependent, but where a reference point cannot be identified, such as the preferences in Examples 4 and 5 one needs to add further structural assumptions for the set of outcomes or exclude extreme outcomes (e.g., requiring that X is an open interval in \({\mathbb {R}}\)). Alternatively, one can consider preferences where the reference point is outside the model as in the extended PT of Example 5 with an “imaginary” reference point \(r^{*}\). We provide foundations for both cases. The first result provides a preference foundation for PT for the case that the set of outcomes is an open interval of the real numbers. As the set of outcomes is a connected separable topological space that does not contain a best or worst outcome, the reference point can be identified within the model.

Theorem 3

Assume \(X=(a,b)\) for some real numbers \(a<b\) and that the preference over outcomes agrees with the natural ordering of real numbers. The following two statements are equivalent for a preference relation \(\succcurlyeq \) on \({\mathcal {L}}\):

  1. (i)

    The preference relation \(\succcurlyeq \) on \({\mathcal {L}}\) is represented by PT.

  2. (ii)

    The preference relation \(\succcurlyeq \) is a Jensen-continuous weak order that satisfies dominance, the comonotonic sure thing principle for risk and extended reference point revealing midpoint consistency.

The probability weighting functions are uniquely determined. Further, if \( w^{+}=w^{-}\), utility is cardinal as preferences agree with rank-dependent utility. Otherwise, the utility function is a ratio scale and the reference point \(r\in X\) is unique.

The proof of Theorem 3 is in the Appendix. From that proof one can infer that Theorem 3 also applies for the case that \(X={\mathbb {R}}\), which is the most frequent assumption considered in the literature. The next result applies to the most general case in which no best and worst outcomes are allowed.

Theorem 4

Suppose X is a set of outcomes that contains no extreme outcomes (i.e., X has no best and no worst outcome). The following two statements are equivalent for a preference relation \( \succcurlyeq \) on \({\mathcal {L}}\):

  1. (i)

    The preference relation \(\succcurlyeq \) on \({\mathcal {L}}\) is represented by PT with a possibly imaginary reference point.

  2. (ii)

    The preference relation \(\succcurlyeq \) is a Jensen-continuous weak order that satisfies dominance, the comonotonic sure thing principle for risk and extended reference point revealing midpoint consistency.

The probability weighting functions are uniquely determined. Further, if \( w^{+}=w^{-}\), utility is cardinal as preferences agree with rank-dependent utility. Otherwise, the utility function is a ratio scale and, if the reference point \(r\in X\), it is unique.

The proof of Theorem 4 is in the Appendix. Having explored how the probability midpoint tool can be used to obtain information about sign-dependence and reference points from a foundational perspective, we proceed to a practical application of midpoint consistency. The next subsection indicates how the midpoint tool can be applied to empirically test for sign-dependence.

5.3 Detecting reference points empirically

Our theoretical results, in particular the application of GMC and BMC as combined in RMC suggests that it is possible to test for sign-dependence using probability midpoints. Here we present a tool that can be used to experimentally implement such a test. Suppose, for simplicity of exposition, that we have the best outcome, labeled \(G\in X\) (which may objectively be seen as a gain), and the worst outcome, labeled \(L\in X\) (potentially regarded as a loss). Let there be finitely many outcomes ranked between G and L, say \(G\succ y_{1}\succ \cdots \succ y_{s}\succ L\) for some \(s\ge 2\). Assume that this list of intermediate outcomes is exhaustive and that we have PT-preferences, but do not know if one of the \(y_{j}\)’s is the reference point. An example of a procedure to identify the reference point involves repeated elicitations of probability midpoints and subsequent checks for consistency for those midpoints. This results in an algorithm searching for a reference point, as follows.

Step 1: Fix a small probability \(\alpha \) (e.g., \(\alpha =0.15\)) and probabilities pq (e.g., \(p=0.1,q=0.2\)). Next consider the prospect \( P_{y_{1}}^{\alpha }=(\alpha :G,1-p-\alpha :y_{1},p:L)\) and elicit the probability \(\beta \) that makes a subject indifferent between \( P_{y_{1}}^{\alpha }\) and \(Q_{y_{1}}^{\beta }=(\beta :G,1-q-\beta :y_{1},q:L)\) . Such elicitations can be facilitated by using, for instance, choice lists in which outcomes are fixed and probabilities vary, as suggested in Holt and Laury (2002).

Step 2: Replace \(\alpha \) by \(\beta \) in \(P_{y_{1}}^{\alpha }\), hence obtaining \(P_{y_{1}}^{\beta }\), and elicit \(\gamma \) such that \( P_{y_{1}}^{\beta }\sim Q_{y_{1}}^{\gamma }\). This way the experimenter obtains a probability midpoint \(\beta \) between \(\alpha \) and \(\gamma \).

Step 3: Next, replace \(y_{1}\) by \(y_{2}\) in \(P_{y_{1}}^{\alpha }\) and obtain the prospect \(P_{y_{2}}^{\alpha }=(\alpha :G,1-p-\alpha :y_{2},p:L)\). Subsequently, elicit the probability \({\tilde{q}}\) that makes a subject indifferent between \(P_{y_{2}}^{\alpha }\) and \(Q_{y_{2}}^{\beta }=(\beta :G,1-{\tilde{q}}-\beta :y_{2},{\tilde{q}}:L)\).

Step 4: As in Step 2, replace \(\alpha \) by \(\beta \) in \( P_{y_{2}}^{\alpha }\), giving \(P_{y_{2}}^{\beta }\), and elicit \(\gamma ^{*}\) such that \(P_{y_{2}}^{\beta }\sim Q_{y_{2}}^{\gamma ^{*}}\). The experimenter obtains a probability midpoint \(\beta \) between \(\alpha \) and \( \gamma ^{*}\). If \(\gamma =\gamma ^{*}\) we have observed a consistency, from which we conclude that \(y_{1}\) and \(y_{2}\) are outcomes with utilities of the same sign; we proceed to the next step. Otherwise, if \( \gamma \ne \gamma ^{*}\) we have an inconsistency, which can occur only if \(y_{1}\) and \(y_{2}\) have utilities of a different sign. Therefore, \(y_{1}\) is identified as the reference point and the “search algorithm” stops.

Step \((3+m)\) (\(m=2,\ldots ,s\)): If \(\gamma =\gamma ^{*}\) at Step \((2+m)\), repeat Steps \((1+m)\) and \( (2+m) \), with \(y_{m+1}\) and \(y_{m}\) replacing \(y_{m}\) and \(y_{m-1}\), respectively.

If this procedure terminates after Step \((3+m^{*})\) for some index \( m^{*}\le s\), we conclude that \(y_{m^{*}}\) is the reference point; otherwise, there is no reference point that affects the treatment of probabilities.

A few comments on the above procedure are in order. Obviously, there are alternative ways of implementing the above search procedure. For instance, one can start the procedure in Steps 1 and 2 at any outcome \(y_{m},m\in \{1,\ldots ,s\}\) and adjust the subsequent elicitation steps, or one could elicit midpoints using outcome \(y_{s}\) in Steps 3 and 4 in the above procedure and then continue the elicitation of midpoints by alternation between the remaining best and worst ranked outcomes that have not yet been identified as gains or losses by the procedure. If X is an open interval in \({\mathbb {R}}\), one can use this procedure to narrow down the interval of outcomes in which the reference point is located by repeating the procedure on specific finite subsets Y of X.

While the above search procedure appears compelling and, from a theoretically perspective, feasible, implementing the procedure in experiments would need to account for some practicalities. First, the probability interval is narrow such that small changes in stimuli in the form of actual probabilities may hardly be noticed by subjects. This can be circumvented by “scaling up” the stimuli, e.g., by framing choices as events resulting from draws using urns containing 100 or even 1000 equally likely balls. Second, the midpoint procedure is based on eliciting indifferences. Irrespective of whether indifferences are elicited by varying outcomes or by varying probabilities, such elicitation tasks are cognitively demanding for many subjects and appropriate experimental procedures are needed. The aforementioned choice lists design (Holt and Laury 2002), which invokes an final interpolation step, has proven to be quite an efficient mechanism in dealing with the issue of eliciting indifferences.

Third, the decision criterion in how far to tolerate differences between \( \gamma \) and \(\gamma ^{*}\) in Step 4 of the above elicitation procedure usually needs to be specified as a rule where small differences can be regarded as a measurement error and large ones as a genuine inconsistency. What determines such bounds is essentially an empirical question, and setting appropriate thresholds can be based on existing data regarding empirically observed probability weighting functions. Fourth, chained measurements, as employed in our search procedure, have been criticized on the grounds of incentive compatibility and error propagation. Both aspects of the elicitation procedure are theoretically important, but empirically, these issues are not a serious concern as subjects treat choice tasks in isolation (Kahneman and Tversky 1979; Cubitt et al. 1998; see also Abdellaoui et al. 2005, on negligible error propagation).

Finally, in contrast to EU- or RDU-preferences, the choice of outcome stimuli to detect sign-dependence is important. The experimenter needs to ensure that the range of outcomes chosen to implement the above procedure is not too narrow (such that reference points are excluded) and likewise that the number of stimuli is not too large as this raises the number of required elicitations. This calls for a trade-off between precision of the method and the cognitive demands put on subjects that, in turn, can influence the precision in the obtained data. Finding the right balance is, however, a common challenge for all experimental studies.

6 Discussion

Here, we review some of the literature on endogenous reference points before we comment on the relation of our midpoint principle to midpoint notions developed elsewhere.

6.1 Models with reference points

The majority of existing PT-derivations assume the reference point is exogenously given (e.g., Tversky and Kahneman 1992; Wakker and Tversky 1993; Chateauneuf and Wakker 1999; Köbberling and Wakker 2003; Neilson 2006). In alternative models, attention has been paid to endogenous reference points that are choice-dependent (e.g., Gul 1991; Sugden 2003; Delquié and Cillo 2006; Bleichrodt 2007; Schmidt et al. 2008). Such multiple reference points are explicitly allowed for in the reference-dependent theories of Munro and Sugden (2003) and Sagi (2006) where, motivated by the status quo effect, adjustments of preferences to new reference points are permitted. There the decision maker can be seen as having multiple preferences, each depending on a reference point. Those preferences are required to be consistent in the sense that they do not generate cyclical choices. Such consistency requirements for behavior are also appearing in theories that build on the classical revealed preference approach, however, by using choice functions that are reference-dependent, such as in Apesteguia and Ballester (2009) and Bossert and Sprumont (2009).

Multiple reference points can also be found in the choice model of Ok et al. (2015). Those reference points are feasible alternatives in a choice set, but they are always dominated by some other alternatives and, hence, are never revealed preferred. By contrast, the endogenous reference points in Shalev (2000, 2002), Kőszegi and Rabin (2006, 2007), and Kőszegi (2010), correspond to a person’s rational expectations held in the recent past, which in turn are determined in the so-called personal equilibrium. Since there may be multiple equilibria, the decision maker is required to choose the most preferred one, e.g., a preferred personal equilibrium in Kőszegi and Rabin (2006) and Kőszegi (2010). Beyond the lack of uniqueness, the choice aspect is markedly different to the reference point concept in PT where, as pointed out by Shleifer (2012, p.1086), the reference point cannot be chosen deliberately.

Schmidt and Zank (2012) provide an alternative way to identify reference points from primitives by exploiting PT features, such as diminishing sensitivity of the utility (convexity for losses and concavity for gains) and consistent utility measurement paired with sign-dependence. The present approach is complementary to Schmidt and Zank (2012), and, unlike theirs, it does not impose structural richness on the set of outcomes. As a result, the present foundations for PT can be extended to more general settings like health and insurance where outcomes might be discrete, thus allowing for wider applications of PT.

6.2 Outcome and probability midpoints

Consistency requirements for outcomes are familiar in economics and finance and are commonly used for utility measurements or for comparative analyses. Specifically, the shape of utility functions can be inferred from preference-based outcome midpoints (Baillon et al. 2012, Theorem 2.2), where a strictly concave utility requires that the utility midpoint of two outcomes is always below the corresponding algebraic midpoint and, further, independent of the probabilities of those outcomes. Similarly, consistently lower midpoints indicate more concavity of one utility relative to the other. The utility midpoint tool, which applies likewise to expected utility and nonexpected utility theories, has been advanced further in Baillon et al. (2012) to compare the utility for risk with that for ambiguity.

Similarly to the comparison of utility functions based on outcome midpoints, attitudes toward probabilities can be inferred from comparisons of the probability weighting functions for risk and ambiguity (Abdellaoui et al. 2011) by adopting an analogous midpoint technique for probabilities. In Kuilen and Wakker (2011) it was demonstrated that probability midpoints are suitable for experimental studies and their elicitation is relatively easy. In empirical research, however, utility measurements were required prior to the elicitation of probability weighting functions (Abdellaoui 2000; Bleichrodt and Pinto 2000, and Kuilen and Wakker 2011). Our elicitation method and the derived results suggest that one can completely dispense of utility-based measurements when employing the probability midpoint tool. Indeed, we advance the probability midpoint tool in three different ways: Our PT-foundation delivers in one stroke a new preference tool for experimental testing and empirical measurements, a tool for comparative analyses and, more fundamentally, a method to identify reference points endogenously from behavior.

7 Conclusion

A consequence of reference-dependence is that risk behavior in PT is manifested through a combination of attitudes toward gains and losses, captured by a utility function (e.g., concave for gains and convex for losses), and attitudes toward probabilities of gains and losses, captured by corresponding probability weighting functions (e.g., inverse-S-shaped), which together imply a fourfold pattern of risk attitudes (Tversky and Kahneman 1992; Wu and Gonzalez 1996; Prelec 1998).Footnote 12 Loss aversion, the component of risk attitude expressed as a stronger sensitivity toward losses as compared to equally sized gains (Wakker and Tversky 1993; Neilson 2002; Köbberling and Wakker 2005; Blavatskyy 2011) is a further cornerstone of PT that completes the picture on empirically observed choice behavior under risk. Yet, without knowledge of the location of the reference point, both loss aversion and the fourfold pattern become ambiguous concepts. While a model with an unspecified reference point gives additional flexibility for the analyst (e.g., it leads to an easier organization of experimental and field data, or it allows for framing effects to be incorporated), the absence of behavioral conditions that imply the existence of the reference point renders PT too general and it makes the model difficult to falsify. This is unwarranted both empirically and foundationally.

The importance of having sound preference foundations for decision models, in particular for PT, has recently been reiterated by Kothiyal et al. (2011, pp. 196–197). If a continuous utility is not available because outcomes are discrete (e.g., as in health or insurance), the relationship between the empirical primitive (i.e., the preference relation) and the assumptions of PT becomes unclear. In that case, one can no longer be sure that the PT-predictions are in line with the behavior underlying the preferences.

Our aim was to make PT falsifiable. We have addressed the foundational aspect of the reference point in PT under risk and showed that a specific consistency property for probability midpoints can be formulated in a way that allows for the identification of the reference point and, jointly, of PT. The conditions presented here are necessary and sufficient for PT; thus, they clarify which assumptions one makes by invoking the model. In particular, the new foundations highlight the difference between expected utility, rank-dependent utility and prospect theory in a transparent way. Before, this was not possible as foundations for PT in the von Neumann and Morgenstern setting were not available. In the presence of standard preference conditions, the two principles of good-news and bad-news midpoint consistency, when combined as in our RMC properties, are sufficient to obtain either RDU, a special case of PT, or PT with sign-dependent probability weighting and a reference point endogenously revealed from behavior. This way, we have obtained a complete foundation for prospect theory.