Keynes (1921) and Knight (1921) emphasized the need to develop theories for decision making when probabilities are unknown. This led Savage (1954) and others to provide a behavioral foundation of (subjective) expected utility: if no objective probabilities are available, then subjective probabilities should be used instead. However, Ellsberg (1961) provided two paradoxes showing that Savage’s theory fails descriptively, and according to some also normatively (Ellsberg 1961; Cerreia-Vioglio et al. 2011; Gilboa and Schmeidler 1989; Klibanoff et al. 2005). It led to the development of modern ambiguity theories; i.e., decision theories for unknown probabilities that deviate from expected utility.

Anscombe and Aumann (1963; AA henceforth) presented a two-stage model of uncertainty to obtain a simpler foundation of expected utility than Savage’s.Footnote 1Gilboa and Schmeidler (1989) and Schmeidler (1989) showed that the AA two-stage model is well suited for another purpose: to analyze ambiguity theoretically. Since then, the AA model has become the most-used model for this alternative purpose.

The AA model makes two ancillary assumptionsβ€”expected utility for risk and backward induction (see SectionΒ 1)β€”that do not concern ambiguity.Footnote 2 These assumptions have been justified on normative grounds but fail descriptively, as many studies have shown (references in SectionΒ 1). They are made only to facilitate the theoretical analysis of ambiguity by providing a convenient linear mixture operation. We show how these ancillary assumptions can be relaxed to become descriptively valid while maintaining the mixture operation. We thus make the AA model suited for descriptive purposes while maintaining its analytical power. AA-based theories of ambiguity can then be applied and tested descriptively while avoiding confounds due to violated ancillary assumptions. We call our modification of the AA model the reduced AA (rAA) model.

We demonstrate the applicability of the rAA method in an experiment (SectionΒ 3). This experiment is simple but, as we will see, suffices to falsify most current AA-based ambiguity theories, due to reference dependence. The second, theoretical, part of the paper (SectionΒ 4 and further) provides a reference dependent generalization of Schmeidler’s (1989) Choquet expected utility to accommodate the empirical violations found in the first part. This result amounts to extending the AA model to cover Tversky and Kahneman’s (1992) prospect theory. Unlike the second part of the paper, the first part avoids using advanced theory so as to provide ready tools to test AA theories for experimentalists. The two parts can be read independently, but are joined in this paper to combine a negative empirical finding on some theories with a positive result on a new theory that solves the problems found. We give a one-sentence description of the rAA method at the end of SectionΒ 2. A detailed outline of the paper is at the end of the next section.

1 Background (substantive and ancillary assumptions) and outline

This section presents a basic version of the AA model so as to motivate the method that we introduce in the next section. A formalized and general version of the AA model will be presented in the theoretical part of the paper, starting in SectionΒ 4. FigureΒ 1a depicts a standard β€œSavage” act for decision under uncertainty. E1,…,E n denote mutually exclusive and exhaustive events. That is, exactly one will obtain, but it is uncertain which one. Following AA, we assume that a horse race takes place with n horses participating, and exactly one will win. Event E i refers to horse i winning. The act yields consequence x i if event E i obtains. We mostly assume that consequences are monetary, although they can be anything. U(x i ) is the utility of consequence x i . V denotes a general functional that represents preferences. It is increasing in all its arguments. Savage (1954) considered the case where V gives subjective expected utility. Nowadays, there is much interest in ambiguity theories, where V can be any such theory, e.g., a multiple prior theory. Such theories are also the topic of this paper.

Fig. 1
figure 1

Traditional (one-stage) choice objects

In decision under risk, we assume probabilities to be known. Then choices are between lotteries (probability distributions). FigureΒ 1b denotes a lottery yielding x j with probability p j . Following AA, we assume that a roulette wheel is spun to generate the probabilities. Besides the expected utility evaluation depicted, many deviating models have been studied (Starmer 2000).

FigureΒ 2 depicts an act in AA’s model. Both uncertainty and risk are involved. The act is like a Savage act in Fig.Β 1a, but now consequences are lotteries, i.e., probability distributions over β€œoutcomes” x i j . Uncertainty is resolved in two stages. First nature chooses which event E i obtains, resulting in the corresponding lottery. Next the lottery is resolved, resulting in outcome x i j with probability p i j , j = 1,…,m.Footnote 3 In AA’s model, acts are evaluated as depicted. First, every lottery of the second stage is evaluated by its expected utility. Next, an ambiguity functional V is applied to those expected utilities as it was to utilities in Fig.Β 1a. The evaluation of the ambiguity by the functional V is of central interest in the modern ambiguity literature. The evaluation of the lotteries only serves to facilitate the analysis of ambiguity in the first stage. The evaluation of each lottery in the second stage is independent of what happens at the other branches in the figure. We can, for instance, replace each lottery by its certainty equivalent derived β€œin isolation” in Fig.Β 1b, and then evaluate the resulting ambiguous act as in Fig.Β 1a. That is, we are using backward induction here.

Fig. 2
figure 2

An Anscombe-Aumann act and its evaluation

We list the two assumptions made, and add two more: (1) lotteries, being unambiguous, are evaluated using expected utility (EU); (2) backward induction is used to evaluate the two stages; (3) there is no reference dependence, with gains and losses treated the same; (4) there is universal ambiguity aversion. The last two assumptions concern ambiguity and are, therefore, of central interest. They are called substantive. Assumptions 1 and 2 define the AA model, with its two-stage structure. They only serve to simplify the mathematical analysis and are, therefore, called ancillary.

The purpose of this paper is descriptive. We, therefore, wish to avoid descriptive problems of the ancillary assumptions. As regards the first assumption, Allais’ (1953) thought experiment provided the first evidence against EU for risk, later confirmed by many empirical studies. It led to the popular prospect theory (Kahneman and Tversky 1979; Tversky and Kahneman 1992). Surveys of violations of EU for risk include Birnbaum (2008), Edwards (1954), Fehr-Duda and Epper (2012), Fox et al. (2015), Schmidt (2004), Slovic et al. (1988), and Starmer (2000). In view of the many violations of EU found, Assumption (1) is currently considered to be descriptively unsatisfactory. Several authors argued that it is also normatively undesirable (Allais 1953; Machina 1989).

Assumption (2), backward induction, is a kind of monotonicity condition. If we only focus on consequences that are sure money amounts (degenerate lotteries; Fig.Β 1a), then the condition is uncontroversial. However, it becomes debatable if consequences are nondegenerate lotteries as in Fig.Β 2. Then the condition implies that the decision maker’s evaluation of the lottery faced there, i.e., of the act conditional on the event E i that obtained, is independent of what happens outside of E i . This is a form of separability rather than of monotonicity (Bommier 2017 p. 106; Machina 1989 p. 1624), which may be undesirable for ambiguous events E i . Although most papers using the AA model do not discuss this assumption explicitly, several recent papers have criticized it (Bommier 2017; Bommier et al. 2017 Footnote 7; Cheridito et al. 2015; Machina 2014 p. 385 3rd bulleted point; Saito 2015; Schneider and Schonger 2017; Skiadas 2013 p. 63; Wakker 2010Section 10.7.3).

Dynamic optimization principles such as backward induction that are self-evident under expected utility become problematic and cannot all be satisfied under nonexpected utility (Machina 1989). Several authors have therefore argued against backward induction for nonexpected utility on normative grounds.Footnote 4 Many studies have found empirical violations of backward induction.Footnote 5 We conclude that both ancillary assumptions are descriptively problematic and, according to several authors, also normatively problematic. Our rAA model therefore aims to avoid the problems just discussed.

We now turn to a detailed outline of the paper. SectionΒ 2 explains the rAA model informally, showing how to test AA theories without being affected by violations of the ancillary assumptions. In particular, no two-stage uncertainty as in Fig.Β 2 occurs in the rAA model, and we only use stimuli as in Fig.Β 1. An additional advantage of our stimuli is that they are less complex, reducing the burden for subjects and the noise in the data. Dominiak and Schnedler (2011) and Oechssler et al. (2016) tested Schmeidler’s (1989) uncertainty aversion for two-stage acts, and found no clear relations with Ellsberg-type ambiguity aversion. This can be taken as evidence against the descriptive usefulness of two-stage acts.

SectionΒ 3 illustrates our approach in a simple experiment. Unsurprisingly, we find that losses are treated differently, with more ambiguity seeking, than gains (reference dependence). We have thus tested and falsified the substantive Assumptions 3 and 4. Many studies have demonstrated reference dependence outside of ambiguity, and several have done so within ambiguity.Footnote 6 Our experiment shows it in a simpler way and is the first to have done so for the AA model. It may be conjectured that AA theories could indirectly model the reference dependence found. This conjecture holds true for the smooth model (Klibanoff et al. 2005) and other utility-driven theories of ambiguity.Footnote 7 However, we prove that it does not hold true for most commonly used AA theories, because weak certainty independence, a necessary condition for most theories,Footnote 8 is violated. Baillon and Placido (2017) also tested this condition and also found it violated. Generalizations of these theories are therefore desirable. We turn to those in the next, theoretical, part of the paper, with definitions and basic results in SectionΒ 4 and the reference dependent generalization of Schmeidler (1989) in SectionΒ 5. Faro (2005, Ch. 3) provided an alternative ambiguity model with reference dependence.

Our generalization of Schmeidler’s model can accommodate loss aversion, and ambiguity aversion for gains combined with ambiguity seeking for losses, as in prospect theory. In many applications of ambiguity (asset markets, insurance, health) the gain-loss distinction is important, and descriptive models that assume reference-independent universal ambiguity aversion cannot accommodate this. As regards our finding of violations of weak certainty independence, reference dependence is the only generalization needed to accommodate these violations. Weak certainty independence remains satisfied if we restrict our attention to gains or to losses. SectionΒ 6 analyzes loss aversion under ambiguity. A discussion, with implications for existing ambiguity theories, is in SectionΒ 7. SectionΒ 8 concludes.

A model-theoretic isomorphism of the rAA model with the full AA model is in AppendixΒ E. Its implications can be stated in simple terms for experimentalists, without requiring a study of its formal content: Although the rAA model is a submodel of the full AA model, every ambiguity property that can be defined in the full AA model can be tested in the rAA model using the method explained in the next section. No information on ambiguity is lost by restricting to the rAA model. A simple test such as the one in SectionΒ 3 can be devised for every ambiguity condition other than weak certainty independence.

The first, empirical part of this paper, preceding SectionΒ 4, makes empirical studies of the AA model possible, providing an easy recipe. It is accessible to readers with no mathematical background. We postpone formal definitions and results to the second, theoretical part, in SectionΒ 4 and further. Given the negative finding in the first part, with violations of most existing AA ambiguity theories, the second part presents a positive result: the first reference-dependent AA theory.

2 The reduced AA model and the AA twin of the decision maker

This section explains the reduced AA model informally, so that it can easily be used by experimenters. AppendixΒ E gives a formal presentation. FigureΒ 3a depicts a two-stage AA act as in Fig.Β 2.

Fig. 3
figure 3

Relating a general two-stage act of the AA model to a one-stage (β€œrAA”) act

We do not use two-stage acts when empirically measuring the preferences of the decision maker. We only consider one-stage acts as: (1) in Fig.Β 3b, where all second-stage lotteries are degenerate and only uncertainty about the horses matters, or: (2) in Fig.Β 4, where the first-stage uncertainty, not depicted, is degenerate and only the risks of the roulette wheel matter. In Fig.Β 4, we avoid degenerate lotteries by only considering lotteries that give the worst outcome, βˆ’β€‰20 in our case, with a probability of at least 0.2, and give the best outcome, 10, with a probability of at least 0.2.

Fig. 4
figure 4

Defining a conditional certainty equivalent

The preference relation of the decision maker over the domain of one-stage acts just described (Figs.Β 3b andΒ 4) is denoted \(\succcurlyeq \). This domain and \(\succcurlyeq \) are called the reduced AA (rAA) model. We assume that EU (expected utility) holds for risky choices \(\succcurlyeq \) in the rAA domain. Most violations of EU occur when tails of distributions are relevant, but on the RAA domain the tails are fixed and play no role. Hence, EU is empirically plausible here, and we assume it. Further explanation and references are in SectionΒ 7. As for the ancillary assumption of backward induction, it is vacuous on the rAA domain.

In theoretical analyses of the AA model, two-stage acts do play a role. To capture them in our rAA method, we do not consider the actual preferences of the decision maker over them, but instead we consider a preference relation \(\succcurlyeq ^{*}\) of what we call the AA twin of the decision maker. The asterisk indicates that these preferences do not need to agree with the actual empirical preferences of the decision maker, but belong to her idealized AA twin. This \(\succcurlyeq ^{*}\) agrees with \(\succcurlyeq \) on the rAA domain, but extends it to the whole AA model, and is required to satisfy the AA conditions (EU for risk and backward induction). As we explain next, \(\succcurlyeq ^{*}\) exists and is uniquely determined this way.

Consider Fig.Β 4. Because the stimuli come from the rAA domain, the indifference also holds for βˆΌβˆ— instead of ∼. Because \(\succcurlyeq ^{*}\) satisfies EU, the βˆΌβˆ— indifference is maintained if we remove the β€œcommon-consequence” upper and lower 0.2 branches, and then the β€œcommon-ratio” 0.6 probabilities. That is, for each i, CA i for sure is βˆΌβˆ— equivalent to the lottery at branch E i in Fig.Β 3a:

$$ CA_{i} \sim^{*} (p_{i1}: x_{i1},\ldots, p_{im}: x_{im}), $$
(1)

using the obvious notation for lotteries. By backward induction (CE substitution), the act in Fig.Β 3a is βˆΌβˆ— indifferent to the act in Fig.Β 3b, which is again in the rAA domain governed by \(\succcurlyeq \). This way, the βˆΌβˆ— indifference class of every two-stage AA act is uniquely determined and, hence, so is \(\succcurlyeq ^{*}\). We can infer the whole relation \(\succcurlyeq ^{*}\) this way. We summarize the procedure, for any preference relationship \(\succcurlyeq ^{*}\):

  1. (1)

    Every act from rAA is left unaltered because \(\succcurlyeq ^{*}\) agrees with \(\succcurlyeq \) on the rAA domain.

  2. (2)

    For every lottery, its CA certainty equivalent is defined through Eq.Β 1 and Fig.Β 4.

  3. (3)

    Every two-stage act is replaced by a one-stage act as in Fig.Β 3.

Point (2) means that CA s are \(\succcurlyeq ^{*}\) certainty equivalents. Stating the rAA method in one sentence:

We can find out any AA preference \(\succcurlyeq ^{*}\) from rAA preferences \(\succcurlyeq \) by using the substitution in Fig.Β 3.

We can thus apply all techniques from the AA model to analyze \(\succcurlyeq ^{*}\) and infer properties of the uncertainty attitude of \(\succcurlyeq ^{*}\) on horse acts using only \(\succcurlyeq \) preferences on the rAA domain as empirical inputs. The uncertainty attitudeβ€”which may deviate from subjective expected utilityβ€”of the AA twin \(\succcurlyeq ^{*}\) is identical to that of \(\succcurlyeq \). Thus, all results from the AA literature immediately apply to \(\succcurlyeq \).

In applications, if only few CAs are to be measured, then we can measure each one separately as in Fig.Β 4. If there are many, we can carry out a few measurements as in Fig.Β 4, derive the EU utility function from them, and use it to determine all CAs that we need. Two drawbacks of the rAA method must be acknowledged. First, the stimuli used for measuring risk attitudes in Fig.Β 4 are made more complex by the mixing in of the best and worst outcomes. Second, when testing mixture conditions from the full AA model, we have to modify every two-stage act into an rAA act as just described.

The following section gives an illustration of the rAA method, showing how it can be used to test AA theories experimentally. We test weak certainty independence there, a preference condition necessary for many AA theories.

3 Experimental illustration of the reduced AA model and reference dependence

This section demonstrates the rAA model in a small experiment. First, we present a common example. The unit of payment in the example can be taken to be money or utility. In the experiment that follows, the unit of payment will be utility and not money, so that the violations found there directly pertain to the general AA model. Because the rAA model is a submodel of the full AA model (but large enough to recover the latter entirely), any violation of a preference condition found from \(\succcurlyeq \) in the rAA model immediately gives a violation of that preference condition for \(\succcurlyeq ^{*}\) in the full AA model.

Example 1

(Reflection of ambiguity attitudes) A known urn K contains 50 red (R) and 50 black (B) balls. An unknown (ambiguous) urn A contains 100 black and red balls in unknown proportion. One ball will be drawn at random from each urn, and its color will be inspected. R k denotes the event of a red ball drawn from the known urn, and B k , R a , and B a are analogous. People usually prefer to receive € 10 under B k (and 0 otherwise) rather than under B a and they also prefer to receive € 10 under R k rather than under R a . These choices reveal ambiguity aversion for gains.

We next multiply all outcomes by βˆ’β€‰1, turning them into losses. This change of sign can affect decision attitudes. Many people now prefer to lose € 10 under B a rather than under B k and also to lose € 10 under R a rather than under R k . That is, many people exhibit ambiguity seeking for losses. β–‘

The above example illustrates that ambiguity attitudes are different for gains than for losses, making it desirable to separate these, similar to what has been found for risk (Tversky and Kahneman 1992). This separation is impossible in most current ambiguity theories. We tested the above choices in our experiment. Subjects were N = 45 undergraduate students from Tilburg University. We asked both for preferences with red as the winning color and for preferences with black as the winning color. This way we avoided suspicion about the experimenter rigging the composition of the unknown urn (Pulford 2009).

We scaled utility to be 0 at 0 and 10 at € 10. That is, the winning amount was always € 10. We wanted the loss outcome to be βˆ’β€‰10 in utility units for each subject, which required a different monetary outcome Ξ± for each subject. Thus, under EU as assumed in the AA model and as holding for the AA twins of the subjects, we must have, with the usual notation for lotteries (probability distributions over money),

figure a

One simplifying notation for lotteries: we often rewrite (p : Ξ±, 1 βˆ’ p : Ξ²) as Ξ± p Ξ². The indifference displayed involves a degenerate (nonrisky) prospect (€0), and those are known to cause many violations of the assumed EU.Footnote 9 We therefore use the modification in Fig.Β 4. We write R = (€ 100.5(βˆ’β‚¬20)), and rather elicit the following indifference from our subjects, as in Fig.Β 4, using the common probabilistic mixtures of lotteries, and mixing in R with weight 0.4:

figure b

Under EU as holding for the AA twin, the latter indifference also holds for βˆΌβˆ— and is equivalent to the former, but the latter indifference is less prone to violations of EU, so that our subjects agree with their AA twins here.

To elicit the indifference in Eq. 3 from each subject, we asked each subject to choose between lotteries (replacing Ξ± in Eq. 3 by βˆ’ j),

figure c

for each j = 0,2,4,…,18,20. If the subject switched from risky to safe between βˆ’ j and βˆ’ j βˆ’β€‰2, we defined Ξ± to be the midpoint between these two values, i.e., Ξ± = βˆ’j βˆ’β€‰1. We then assumed indifference between the safe and risky prospect with that outcome Ξ± instead of βˆ’ j in the risky prospect. We used the monetary outcome Ξ±, depending on the subject, as the loss outcome for this subject. This way the loss outcome was βˆ’β€‰10 in utility units for each subject (as for their AA twin).Footnote 10 Details of the experiment are in the Online Appendix.

We elicited the preferences of Example 1 from our subjects using utility units, with the gain outcome € 10 giving utility + 10, and the loss outcome Ξ± giving utility βˆ’β€‰10. Combining the bets on the two colors, the number of ambiguity averse choices was larger for gains than for losses (1.49 vs. 1.20, z = 2.01, p < .05, Wilcoxon test, two-sided), showing that ambiguity attitudes are different for gains than for losses. We replicate strong ambiguity aversion (z = 3.77, p < .01, Wilcoxon test, two-sided) for gains, but we cannot reject the null of ambiguity neutrality (z = 1.57, p > .10, Wilcoxon test, two-sided) for losses.Footnote 11 Our experiment confirms that attitudes towards ambiguity are different for gains than for losses, suggesting violations of most ambiguity models used today. The following sections will formalize this claim.

4 Definitions, notation, classical expected utility, and Choquet expected utility for mixture spaces

This section provides definitions and well-known results. Proofs are in Ryan (2009). We present our main theorems for general mixture spaces, which covers the traditional two-stage AA model, our rAA model, and also some other models. By Observation 5 in the Appendix, all results proved in the literature for the traditional two-stage AA model also hold for general mixture spaces. M denotes a set of consequences, with generic elements x, y. M is a mixture space: it is endowed with a mixture operationx p y : M Γ— [0,1] Γ— M β†’ M, also denoted px + (1 βˆ’ p)y, satisfying (i) x1y = x [identity]; (ii) x p y = y1βˆ’px [commutativity]; (iii) (x p y) q y = x p q y [associativity]. The first example below was popularized by Schmeidler (1989) and Gilboa and Schmeidler (1989).

Example 2 (Two-stage AA model)

D denotes a set of (deterministic) outcomes, and M consists of all (roulette) lotteries, which are probability distributions over D taking finitely many values. The mixture operation concerns probabilistic mixing. β–‘

Example 3

M = IR and mixing is the natural mixing of real numbers. β–‘

Our rAA model provides another example (AppendixΒ E). S denotes the state space. It is endowed with an algebra of subsets, called events. An algebra contains S and βˆ… and is closed under complementation and finite unions and intersections. An actf = (E1:f1,...,E n :f n ) takes values f i in M and the E i ’s are events partitioning the state space. The set of acts, denoted \(\mathcal {A}\), is endowed with pointwise mixing, which satisfies all conditions for mixture operations. Hence, \(\mathcal {A}\) itself is also a mixture space. A constant act f assigns the same consequence f(s) = x to all s. It is identified with this consequence.

Preferences are over the set of acts \(\mathcal {A}\) and are denoted \(\succcurlyeq \), inducing preferences \(\succcurlyeq \) over consequences through constant acts. Strict preference ≻ and indifference ∼ are defined as usual. A function V represents\(\succcurlyeq \) if \(V : \mathcal {A} \rightarrow I R\) and \(f \succcurlyeq g \Leftrightarrow V(f) \geq V(g)\). If a representing function exists then \(\succcurlyeq \) is a weak order, i.e., \(\succcurlyeq \) is complete (for all acts f and g, \(f \succcurlyeq g\) or \(g \succcurlyeq f\)) and transitive. \(\succcurlyeq \) is nontrivial if (not f ∼ g) for some f and g in \(\mathcal {A} \).

Continuity holds if, whenever f ≻ g and g ≻ h, there are p and q in (0,1) such that f p h ≻ g and f q h β‰Ί g. Hence, continuity relates to the mixing of consequences and does not refer to variations in states of nature. In the two-stage AA model, continuity relates to probability (as part of consequences). An affine function u on M satisfies u(x p y) = pu(x) + (1 βˆ’ p)u(y). In the two-stage AA model, a function is affine if and only if it is EU (defined in AppendixΒ E; it follows from substitution and induction).

Monotonicity holds if \(f\succcurlyeq g\) whenever \(f(s) \succcurlyeq g(s)\) for all s in S. It is nontrivial if the f(s)’s are nondegenerate lotteries as in Example 2. Monotonicity then implies that the decision maker’s evaluation of f(s), i.e., of f conditional on state s, is independent of what happens outside of s. It was discussed in SectionΒ 1.

The following condition is the most important one in the axiomatization of affine representations and, hence, of EU.

Definition 1

Independence holds on M if

$$x \succ y \Rightarrow x_{p} c \succ y_{p} c $$

for all 0 < p < 1 and consequences x, y, and c. β–‘

Theorem 1 (von Neumann-Morgenstern)

The following two statements are equivalent:

  1. (i)

    There exists an affine representationuon the consequence space M.

  2. (ii)

    The preference relation \(\succcurlyeq \) when restricted to M satisfies the following three conditions: (a) weak ordering; (b) continuity; (c) independence.

In (i), uis unique up to level and unit. β–‘

Uniqueness of uup to level and unit means that another function uβˆ— satisfies the same conditions as u if and only if uβˆ— = Ο„ + Οƒu for some real Ο„ and positive Οƒ. Affinity, independence, and Theorem 1 can be applied to any mixture set other than M, such as the set of acts \(\mathcal {A}\). Formally, our term AA model refers to Example 2 plus the preference conditions considered so far in this section, being weak ordering, continuity, monotonicity, and independence on M, implying an affine (i.e., EU) representation on M. It is a two-stage model. It does not further restrict ambiguity attitudes, i.e., the preference relation over acts, and is assumed in most papers on ambiguity nowadays. We now turn to two classic results.

Anscombe and Aumann’s subjective expected utility. A probability measureP on S maps the events to [0,1] such that P(βˆ…) = 0, P(S) = 1, and P is additive (P(E βˆͺ F) = P(E) + P(F) for all disjoint events E and F). Subjective expected utility (SEU) holds if there exists a probability measure P on S and a function u on M, such that \(\succcurlyeq \) is represented by

$$ SEU: f \mapsto \int_{S} u(f(s))dP . $$
(4)

Theorem 2 (Anscombe and Aumann)

The following two statements are equivalent:

  1. (i)

    Subjective expected utility holds with a nonconstant affine u on M.

  2. (ii)

    The preference relation \(\succcurlyeq \) satisfies the following conditions: (a) nontrivial weak ordering; (b) continuity; (c) monotonicity; (d) independence.

The probabilities Pon Sare uniquely determined and u on M is unique up to level and unit. β–‘

If we apply the above theorem to Example 3, we obtain subjective expected value as in de Finetti (1937; Wakker 2010 Theorem 1.6.1). Thus, two classical derivations of subjective probabilities, by Anscombe and Aumann (1963) and by de Finetti (1937), are based on the same underlying mathematics.Schmeidler’s Choquet Expected Utility. A capacityv on S maps events to [0,1], such that v(βˆ…) = 0, v(S) = 1, and E βŠƒ F β‡’ v(E) β‰₯ v(F) (set-monotonicity). Unless stated otherwise, we use a rank-ordered notation for acts f = (E1:x1,β‹― ,E n :x n ), i.e., \(x_{1} \succcurlyeq \cdots \succcurlyeq x_{n}\) is implicitly understood. Let v be a capacity on S. Then, for any function w\(: S \rightarrow \mathbb {R}\), the Choquet integral of w with respect to v, denoted \(\int w dv\), is

$$ \int_{0}^{\infty} v(\{s\in S: w(s)\geq\tau \})d\tau + \int_{-\infty}^{0} [v(\{s\in S: w(s)\geq\tau \})-1] d\tau. $$
(5)

Choquet expected utility holds if there exist a capacity v and a function u on M such that preferences are represented by

$$ CEU: f \mapsto \int_{S} u(f(s))dv. $$
(6)

Two acts f and g in \(\mathcal {A}\) are comonotonic if for no s and t in S, f(s) ≻ f(t) and g(s) β‰Ί g(t). Thus, any constant act is comonotonic with any other act. A set of acts is comonotonic if every pair of its elements is comonotonic.

Definition 2

Comonotonic independence holds if

$$f \succ g \Rightarrow f_{p} c \succ g_{p} c $$

for all 0 < p < 1 and comonotonic acts f, g, and c. β–‘

Under comonotonic independence, preference is not affected by mixing with constant acts (consequences) (with some technical details added in Lemma 3). Because constant acts are comonotonic with each other, comonotonic independence on \(\mathcal {A}\) still implies independence on M.

Theorem 3 (Schmeidler)

The following two statements are equivalent:

  1. (i)

    Choquet expected utility holds with nonconstant affine u on M ;

  2. (ii)

    The preference relation \(\succcurlyeq \) satisfies the following conditions: (a) nontrivial weak ordering; (b) continuity; (c) monotonicity; (d) comonotonic independence.

The capacity v on Sis uniquely determined and uon Mis unique up to level and unit. β–‘

If we apply the above theorem to Example 3, we obtain a derivation of Choquet expected utility with linear utility that is alternative to Chateauneuf (1991, Theorem 1). Cerreia-Vioglio et al. (2015) provide a recent survey of applications.

Comonotonic independence implies a condition assumed by most models for ambiguity proposed in the literature.

Definition 3

Weak certainty independence holds if

$$f_{q} x \succcurlyeq g_{q} x \Rightarrow f_{q} y \succcurlyeq g_{q} y $$

for all 0 < q < 1, acts f, g, and all consequences x, y. β–‘

That is, preference between two mixtures involving the same constant act x with the same weight 1 βˆ’ q is not affected if x is replaced by another constant act y. This condition follows from comonotonic independence because both preferences between the mixtures should agree with the unmixed preference between f and g (again, with some technical details added in Lemma 3). Grant and Polak (2013) demonstrated that the condition can be interpreted as constant absolute uncertainty aversion: adding a constant to all utility levels does not affect preference. For a detailed analysis see Skiadas (2013).

5 Reference dependence in the AA model

Example 1 violates CEU, as we explain next. In the gain preference \(10_{B_{k}}0 \succ 10_{B_{a}}0\), the best outcome (= consequence) 10 is preferred under B k , implying the strict inequality v(B k ) > v(B a ). In the loss preference \(0_{B_{a}}(-10) \succcurlyeq 0_{B_{k}}(-10)\), the best outcome 0 is preferred under B a , implying the opposite inequality v(B a ) β‰₯ v(B k ). A contradiction has resulted. This reasoning does not use any assumption about the utilities (10 and βˆ’β€‰10 in our case) of the outcomes other than that they are of different signs (with u(0) = 0). For later purposes, we show that even weak certainty independence is violated. In the proof of the following observation, we essentially use the linear (probabilistic) mixing of outcomes typical of the AA model.

Observation 1

Example 1 violates comonotonic independence and even weak certainty independence.β–‘

Example 1 has confirmed for the AA model what many empirical studies have found for other models: ambiguity attitudes are different for gains than for losses (reviewed by Trautmann and van de Kuilen 2015), violating CEU and most other ambiguity models. Hence, generalizations incorporating reference dependence are warranted. This section presents such a generalization. As in all main results, the analysis will be analogous to Schmeidler’s analysis of rank dependence in Choquet expected utility as much as possible. Given this restriction, we stay as close as possible to the analysis of Tversky and Kahneman (1992).

In prospect theory there is a special role for a reference point, denoted πœƒ. In our model it is a consequence that indicates a neutral level of preference. It is often the status quo of the decision maker. In Example 1, the deterministic outcome 0 was the reference point. Under the certainty equivalent condition in the AA model, we can always take a deterministic outcome as reference point. Sugden (2003) emphasized the interest of nondegenerate reference points. Many modern studies consider endogenous reference points that can vary (KΓΆszegi and Rabin 2006). Our axiomatization concerns one fixed reference point. Extensions to variable reference points can be obtained by techniques as in Schmidt (2003).

Other consequences are evaluated relative to the reference point. A consequence f(s) is a gain if f(s) ≻ πœƒ, a loss if f(s) β‰Ί πœƒ, and it is neutral if f(s) ∼ πœƒ. An act f is mixed if there exist s and t in S such that f(s) ≻ πœƒ and f(t) β‰Ί πœƒ. For an act f, the gain part f+ has f+(s) = f(s) if \(f(s) \succcurlyeq \theta \) and f+(s) = πœƒ if f(s) β‰Ί πœƒ. The loss part fβˆ’ is defined similarly, where all gains are now replaced by the reference point. Prospect theory allows different ambiguity attitudes towards gains than towards losses. We therefore use two capacities, v+ for gains and vβˆ’ for losses. It is more natural to use a dual way of integration for losses. We thus define the dual of vβˆ’, denoted \(\hat {v}^{-}\), by \(\hat {v}^{-}(A) = 1- v^{-}(A^{c})\) for events A.

Prospect theory (also called cumulative prospect theory in the literature) holds if there exist two capacities v+ and vβˆ’ and a function U on consequences with U(πœƒ) = 0 such that \(\succcurlyeq \) is represented by

$$ PT: f \mapsto \int_{S} U(f^{+}(s))dv^{+} + \int_{S} U(f^{-}(s))d\hat{v}^{-}. $$
(7)

We call U in Eq.Β 7 the (overall) utility function. There is a basic utility u and a loss aversion parameter Ξ» > 0, such that

$$\begin{array}{@{}rcl@{}} U(x) = u(x) \text{~if~} x \succ \theta \end{array} $$
(8)
$$\begin{array}{@{}rcl@{}} U(x) = u(x) = 0 \text{~if~} x \sim \theta \end{array} $$
(9)
$$\begin{array}{@{}rcl@{}} U(x) = \lambda u(x) \text{~if~} x \prec \theta. \end{array} $$
(10)

For reasons explained later, we call Ξ» the ambiguity-loss aversion parameter (see SectionΒ 6). Because U(πœƒ) = 0, we now add the scaling convention that also u(πœƒ) = 0. For identifying the separation of U into u and Ξ», further assumptions are needed. We consider a new kind of separation based on the AA model and the mixture space setup of this paper. Wakker (2010 Chs. 8 and 12) discusses other separations in other models. The parameter Ξ» is immaterial for preferences over consequences M, affecting neither preferences between gains or losses, nor within. Thus, loss aversion in our model does not affect preferences over M (consequences), that is, over lotteries (risk) in the AA model. It only concerns ambiguity.

For later purposes, we rewrite Eq.Β 7 as

$$ PT= \sum^{n}_{i = 1} \pi_{i} U(f(i)) $$
(11)

with decision weights Ο€ i defined as follows. Assume, for act (E1:x1,...,E n :x n ), the rank-ordering \(x_{1} \succcurlyeq \cdots \succcurlyeq x_{k} \succcurlyeq \theta \succcurlyeq x_{k + 1} \succcurlyeq \cdots \succcurlyeq x_{n}\). We define

$$\begin{array}{@{}rcl@{}} \text{for} \,i \leq k: \quad \pi_{i}=\pi^{+}_{i} &=& v^{+} \left( \cup_{j = 1}^{i} E_{j} \right) - v^{+} \left( \cup_{j = 1}^{i-1} E_{j} \right); \end{array} $$
(12)
$$\begin{array}{@{}rcl@{}}[0.5cm] \text{for}\, i > k: \quad \pi_{i}=\pi^{-}_{i} &=& v^{-} \left( \cup_{j=i}^{n} E_{j} \right)- v^{-} \left( \cup_{j=i + 1}^{n} E_{j} \right). \end{array} $$
(13)

For gain events, the decision weight depends on cumulative events that yield better consequences. For loss events, the decision weight similarly depends on decumulative events that yield worse consequences. CEU analyzed in the preceding section is the special case of PT where vβˆ’ is the dual of v+ and Ξ» in Eq.Β 10 is 1.

We next turn to preference conditions that characterize prospect theory. We generalize comonotonicity by adapting a concept of Tversky and Kahneman (1992) to the present context. Two acts f and g are cosigned if they are comonotonic and if there exists no s in S such that f(s) ≻ πœƒ and g(s) β‰Ί πœƒ. Note that, whereas for any act g and any constant act f, f is comonotonic with g, an analogous result need not hold for cosignedness. Only if the constant act is neutral, is it cosigned with every other act. This point complicates the proofs in the Appendix. A set of acts is cosigned if every pair is cosigned. We generalize comonotonic independence to allow reference dependence:

Definition 4

Cosigned independence holds if

$$f \succ g \Rightarrow f_{p} c \succ g_{p} c $$

for all 0 < p < 1 and cosigned acts f, g, and c. β–‘

\(\succcurlyeq \) is truly mixed if there exists an act f with f+ ≻ πœƒ and πœƒ ≻ fβˆ’. Double matching holds if, for all acts f and g, f+ ∼ g+ and fβˆ’βˆΌ gβˆ’ implies f ∼ g. In a different context, Wakker and Tversky (1993) showed that more general conditions can be used. Our aim here is not to adapt those to the AA model, but we stay as close as possible to Tversky and Kahneman (1992) and use their double matching and true mixedness to achieve maximal comparability and accessibility. We now present the main theorem of this paper.

Theorem 4

Assume true mixedness. The following two statements are equivalent:

  1. (i)

    Prospect theory holds with U as in Eqs.Β 8–10.

  2. (ii)

    The preference relation \(\succcurlyeq \) satisfies the following conditions: (a) nontrivial weak ordering; (b) continuity; (c) monotonicity; (d) cosigned independence; (e) double matching.

The capacities are uniquely determined and the global utility function U is unique up to its unit. β–‘

Tversky and Kahneman (1992 Theorem 2) provided a behavioral foundation of prospect theory in a Savagean-like framework, where outcomes are monetary with no probabilities or multiple stages involved. They thus avoided the ancillary assumptions of the AA model. As a price to pay, they did not have the convenient mixture structure typical of the AA model, making measurements and analyses of behavioral properties more difficult. They used conditions similar to (a)-(c) that are standard in most behavioral foundations, and also condition (e). Their main axiom, sign-comonotonic tradeoff consistency, had to be more complex than our main axiom (d). Several generalizations were provided for the Savagean framework, mainly weakening true mixedness and double matching, with extensions to multiattribute outcomes, connected topological outcome spaces, and nonsimple prospects, but always using a complex sign-comonotonic tradeoff consistency (Bleichrodt and Miyamoto 2003; Bleichrodt et al. 2009; KΓΆbberling and Wakker 2003; Kothiyal et al. 2011; Wakker 2010 Theorem 12.3.5; Wakker and Tversky 1993). Closest to our theorem is Schmidt and Zank’s (2009) result, who used linear utility with respect to monetary outcomes, as in Example 3. Our paper provides the first axiomatization of PT for the AA model. The difference between the aforementioned results and ours is similar to that between Savage (1954)/Wakker (2010 Theorem 4.6.4) versus Anscombe and Aumann (1963), or Gilboa (1987)/Wakker (1989) versus Schmeidler (1989).

We give the proof of the following observation in the main text because it is clarifying.

Observation 2

Example 1 can be accommodated by prospect theory.

Proof

To see that the observation holds, choose, in Example 1, v+(B k ) > v+(B a ), v+(R k ) > v+(R a ), vβˆ’(B k ) > vβˆ’(B a ), and vβˆ’(R k ) > vβˆ’(R a ). Remember here that large values of vβˆ’ correspond with low values of its dual capacity as used in the Choquet integral.

We can take vβˆ’ different than v+, letting vβˆ’ accommodate ambiguity seeking in agreement with empirical evidence.

Observation 3

For the preference relation \(\succcurlyeq \) restricted to consequences, there exists an affine representation u if and only if \(\succcurlyeq \) satisfies nontrivial weak ordering, continuity, and cosigned independence.β–‘

For consequences, cosigned independence means that independence in Definition 1 is restricted to cases where the consequences x, c, y are all better or all worse than the reference point.

6 Measurements and interpretations of ambiguity loss aversion

This section considers a number of interpretations of the ambiguity-loss aversion parameter Ξ» in Theorem 4 and Eqs.Β 8–10. We first show how Ξ» can be directly revealed from preference. This direct measurement is typical of the AA model with its mixture operation, and cannot be used in other models.

Observation 4

For all fin \(\mathcal {A}\),x+, xβˆ’ ∈ M, and \(\lambda \in \mathbb {R}\), if f ∼ πœƒ, f+ ∼ x+ ≻ πœƒ, and fβˆ’βˆΌ xβˆ’β‰Ί πœƒ, then \(x^{+}_{\hspace {0.1cm} \frac {1}{1+\lambda }} x^{-} \sim \theta \).β–‘

In other words, with f, x+, and xβˆ’ as in the observation, we find p such that \(x^{+}_{p} x^{-} \sim \theta \), and then solve Ξ» from \(\frac {1}{1+\lambda } = p\) (\(\lambda =\frac {1-p}{p}\)). The condition in the theorem is intuitive: The indifference \(x^{+}_{\hspace {0.1cm} \frac {1}{1+\lambda }} x^{-} \sim \theta \) shows that, when mixing consequences (lotteries in the AA model), the loss must be weighted Ξ» times more than the gain to obtain neutrality. Under ambiguity, however, f combines the preference values of x+ and xβˆ’ in an β€œunweighted” manner (see the unweighted sum of the gain- and loss-part in Eq.Β 7), leading to the same neutrality level. Apparently, under ambiguity, losses are weighted Ξ» times more than when mixing consequences (risk in the AA model). In the AA model, with consequences referring to lotteries and decision under risk, Ξ» indicates how much more losses are overweighted under ambiguity than they are under risk. Thus, Ξ» purely reflects ambiguity attitude.

In the smooth ambiguity model (Klibanoff et al. 2005), ambiguity attitudes depend entirely on the outcomes faced (in the domain of its second-order ambiguity-utility transformation function Ο†), and sign dependence is a special case of such a dependency. The smooth model can accommodate extra loss aversion due to ambiguity in the same way as our parameter Ξ» does: through a kink of its Ο† at 0. The smooth model differs from our model because we capture other aspects of ambiguity attitudes through functions operating on events, rather than on outcomes.

For a first prediction on values of Ξ», we consider an extreme view on loss aversion for the AA model. It entails that all loss aversion shows up under risk, and that no additional loss aversion is expected due to ambiguity. This interpretation is most natural if loss aversion only reflects extra suffering experienced under losses, rather than an overweighting of losses without them bringing disproportional suffering when experienced. That is, this extreme interpretation ascribes loss aversion entirely to the (utility of) consequences. Then it is natural to predict that Ξ» = 1, with no special role for ambiguity. We display the preference condition axiomatizating this prediction and showing how the prediction can be tested:

Neutral ambiguity-loss aversion holds if Ξ» = 1 in Observation 4.

A less extreme interpretation of ambiguity-loss aversion is as follows: There is loss aversion under risk, which can be measured in whatever is the best way provided in the literature.Footnote 12 For monetary outcomes with a fixed reference point as considered in this paper, loss aversion will generate a kink of risky utility at that reference point. As an aside, in our model loss aversion under risk does not imply violations of expected utility and is fully compatible with our AA model, simply giving a kinked function u. Ambiguity can give extra loss aversion and it can amplify (Ξ» > 1) or moderate (Ξ» < 1) it. The following preference condition characterizes Ξ»:

Nonneutral ambiguity-loss aversion. For all f in \(\mathcal {A}\), x+, xβˆ’ ∈ M, and \(\lambda \in \mathbb {R}\), if f ∼ πœƒ, f+ ∼ x+ ≻ πœƒ, and fβˆ’βˆΌ xβˆ’β‰Ί πœƒ, then \(x^{+}_{\hspace {0.1cm} 0.5} x^{-} \succ \theta \) if and only if Ξ» > 1, and \(x^{+}_{\hspace {0.1cm} 0.5} x^{-} \prec \theta \) if and only if Ξ» < 1.

Abdellaoui et al. (2016) measured loss aversion under risk and ambiguity separa- tely and found them to be the same. Baltussen et al. (2016) also found them to be the same in one treatment (outside the β€œlimelight”), but not in the other (in the limelight).

In the two-stage AA model, some consequences are outcomes and others are lotteries. Reference dependence in this paper takes lotteries as a whole, and their indifference class determines if they are gains or losses. This is analogous to the way in which Schmeidler (1989) modeled rank dependence, which also concerned lotteries as a whole. Another approach can be considered, both for reference dependence and rank dependence, where outcomes within a lottery are perceived as gains or losses and are weighted in a rank dependent manner. Here, as elsewhere, we followed Schmeidler’s approach. Tversky and Kahneman (1981, p. 456 penultimate paragraph) recommended this approach for reference dependence. In the rAA model, subjects are never required to perceive whole lotteries in a reference or rank dependent manner, but we implement it ourselves, and subjects only see the CAs that we inserted. Hence, the above issue is no problem for us.

7 Discussion

Kreps (1988p. 101) wrote about the non-descriptive nature of two-stage acts in the AA model:

imaginary objects. …makes perfectly good sense in normative applications …But this is avery dicey and perhaps completely useless procedure in descriptive applications. …what sense does it make …because the items concerned don’t exist? Ithink we have to view the theory to follow [the traditional two-stage AA model] as being as close to purely normative as anything that we do in this book.

A pragmatic objection can be raised against the rAA model. The mixture operation of outcomes is not as easy to implement as in the original AA model. Now a mixture is not done by just multiplying probabilities, but it requires observing an indifference. But such observations are easy to obtain, as our experiment demonstrated. They concern stimuli that are easier to understand for subjects than two-stage acts.

We next analyze to what extent we have succeeded in avoiding violations of EU in the rAA model. Because we always assign a non-negligible probability (0.2 in our experiment) to the best outcome and to the worst outcome, for the preferences that we consider, the nonlinear processing of probability typical of nonEU is only relevant in the middle of the domain, bounded away from p = 0 and p = 1. The common empirical finding is that deviations from linearity mostly occur at the boundaries (Baucells and Villasis 2015; Starmer 2000; Tversky and Kahneman 1992; Viscusi and Evans 2006; Wakker 2010 p. 208).Footnote 13 Hence, the deviations from EU are weak for the stimuli in the rAA model. We recall here that loss aversion is incorporated in u, as a kink at zero.

Some papers considered relaxations of the four assumptions of the AA model listed in SectionΒ 1. Dean and Ortoleva (2017 Footnote 7) suggested using the rAA domain, but did not elaborate on it and still used the second ancillary assumption of AA (backward induction). They did however relax the first ancillary assumption of EU. Their axioms used an endogenous utility midpoint operation, which serves a purpose similar to our substitution of CA i s in Fig.Β 4. They are, to our best knowledge, the first who succeeded in using the AA model without assuming EU in the second stage. Borah and Kops (2016) analyzed the AA model theoretically on a restricted domain similar to ours. In a theoretical study, Bommier (2017) did consider two-stage AA acts, but he neither assumed EU for risk nor backward induction, instead using a sort of dual forward-induction type optimization. He analyzed ambiguity aversion as defined in his setting, but did not consider reference dependence.

8 Conclusion

To date, the AA ambiguity model could only be used for normative purposes (Kreps 1988 p. 101). We have made it suitable for descriptive purposes. We demonstrated how the two major descriptive problems (violations of EU for risk and of backward induction) can be resolved through a reduced AA model (rAA). The rAA model introduces an imaginary AA twin \(\succcurlyeq ^{*}\) for a real decision maker \(\succcurlyeq \), where every \(\succcurlyeq ^{*}\) relationship can be derived from an rAA \(\succcurlyeq \) relationship through Fig.Β 3. Next, we can apply any AA theorem available in the literature to \(\succcurlyeq ^{*}\), and its conclusions regarding ambiguity attitudes are valid for the real decision maker \(\succcurlyeq \). In a simple experiment we showed how the rAA model can be implemented and how the AA model can be tested in general. A formal model-theoretic isomorphism showed that the rAA model maintains the full analytical power of the AA model.

We conducted the first empirical test of a preference condition in the AA model that is not confounded by violations of the ancillary assumptions. This test sufficed to falsify two assumptions of the majority of AA ambiguity theories today: weak certainty independence and reference independenceβ€”the latter often assumed implicitly. We benefited from an additional advantage of the reduced AA model: it only needs one-stage stimuli and those are easy to understand for subjects.

To accommodate the violations found, we introduced a reference dependent generalization of the first decision model of ambiguity that received a behavioral foundation: Schmeidler’s (1989) Choquet expected utility. Our generalization amounts to extending the AA model to prospect theory. We provided a behavioral foundation. Topics for future research include the development of reference dependent generalizations of the many other ambiguity theories in the literature, and empirical tests of such models. We hope that our paper will advance descriptive applications of ambiguity AA theories, having removed the major obstacles.