We will present the most important elements of set theory, because without appropriate knowledge one cannot acquire a sufficient understanding of Brownian motion. Set theory is also needed when it comes to the theory of random variables, probability theory, information economics, or game theory. Since set theory is not dealt with in sufficient detail in formal training of economists, we will discuss the required issues here.

2.1 Notation and Set Operations

Term of Set

A set is a collection of various objects. If you want to describe a set you have to specify its elements. This happens in curly brackets where the elements are shown following a colon or a vertical line. The set

$$\displaystyle \begin{aligned} M:=\{x\in\mathbb{R}\;|\;a x+b x^2\ge 0\} \end{aligned} $$
(2.1)

contains those real numbers x that satisfy the inequality ax + bx 2 ≥ 0. If a set contains only a single element, it is called a point set.

If the numbers are real, one likes to use abbreviations in notation. The set A of numbers greater than 0 and less than 1 should actually be written in the form

$$\displaystyle \begin{aligned} A:= \{ x\in\mathbb{R}\; | \; 0<x<1\} \end{aligned} $$
(2.2)

which is quite cumbersome. Instead, the more compact notation

$$\displaystyle \begin{aligned} A:= (0,1) \end{aligned} $$
(2.3)

is used. This applies also to half-open and closed intervals.

The set of real numbers is denoted by \(\mathbb {R}\), the set of natural numbers by \(\mathbb {N}\), the set of integers by \(\mathbb {Z}\), and the set of rational numbers by \(\mathbb {Q}\).Footnote 1 The empty set ∅ contains no elements.

The elements are grouped together in a set, regardless of their sequence. So if Ω consists of the two elements u and d, it does not matter whether Ω = {u, d} or Ω = {d, u} is written. In some economic applications, however, the order of the elements is important. One speaks then of a “pair” and writes (u, d) if the sequence of elements is relevant. There can also be more than two entries such as (u, d, d, u). In this case one speaks of a “tuple.” If these pairs or tuples are combined into a set, this set is no longer Ω but a new set. Depending on the length of the tuple this new set is called Ω2 for two entries in a pair and ΩT for T entries in the tuple.Footnote 2

Set Operations

You can unite sets and you can calculate their intersection and difference. Considering two sets this means the following.

The union contains all their elements. The symbol ∪ is used to identify the union. For example, the following applies

$$\displaystyle \begin{aligned} \{1,2\}=\{1\}\cup \{2\}. \end{aligned}$$

The intersection contains elements found in both sets. The symbol ∩ indicates intersection. For example, the following is true

$$\displaystyle \begin{aligned} \emptyset=\{1\}\cap \{2\}. \end{aligned}$$

Sets whose intersection is empty are called disjoint.

Let us focus on a set A. We denote this a subset of B if it only contains elements from B regardless whether these are all or only some elements. It is in short A ⊂ B. Then the following always applies

$$\displaystyle \begin{aligned} A\subset B\qquad \implies\qquad A\cup B=B\qquad \text{and}\qquad A\cap B=A. \end{aligned} $$
(2.4)

Each of the last two relationships characterizes subsets.

The difference AB of two sets is the set containing all elements from A that are not in B. For the difference rules of calculation apply which are similar to arithmetics.Footnote 3 When A and B are subsets of a set C, then the following is valid:

$$\displaystyle \begin{aligned} C{\setminus} (A\cap B)&=(C{\setminus} A)\cup (C{\setminus} B) \end{aligned} $$
(2.5)
$$\displaystyle \begin{aligned} C{\setminus} (A\cup B)&=(C{\setminus} A)\cap (C{\setminus} B) \end{aligned} $$
(2.6)
$$\displaystyle \begin{aligned} C{\setminus} (C{\setminus} A)&=A. \end{aligned} $$
(2.7)

One can convince oneself of the correctness of these rules with the help of the so-called Venn diagramsFootnote 4; we are using them here without further explanation. A Venn diagram is a simple symbolic representation in which the sets are always indicated by circles or ellipses. The drawing illustrates the statements on union, intersection, and difference, see Fig. 2.1.

Fig. 2.1
figure 1

Intersection of sets A and B

The graph shows, for example, that the set A ∩ B (the inner part of both ellipses) represents a subset of A ∪ B (the totality of both ellipses), thus A ∩ B ⊂ A ∪ B.

Computation Rules

The following rule applies to set operations and is based on the rules of arithmetic: line operation takes precedence over union and intersection. As a result, brackets that include a difference can be omitted. For example, one writes

$$\displaystyle \begin{aligned} (A{\setminus} B)\cap C \quad \text{shorter}\quad A{\setminus} B \cap C. \end{aligned} $$
(2.8)

We know that 1∕2 + 3 is something other than 1∕(2 + 3). A similar case can be found in Fig. 2.2. There are two terms that differ only in the brackets: AB ∩ C and the set A∖(B ∩ C). The second set differs from the first in a small but not negligible part: it contains those elements of A which are not in C.

Fig. 2.2
figure 2

The difference between AB ∩ C (left) and A∖(B ∩ C) (right)

If the difference BA is formed, where B =  Ω represents the initial set of all elements, the result is also called complement of A and one simply writes A c.

Often set operations are identified with corresponding calculation rules from arithmetic: the difference “looks like” subtraction, while the union reminds one of addition. Note, however, that there are some calculation rules that are in clear contrast to arithmetic. For example, the following applies to all A:

$$\displaystyle \begin{aligned} A\cap A=A \qquad \text{and}\qquad A\cup A=A. \end{aligned} $$
(2.9)

There is no equivalent in the arithmetic of real numbers.

An Exercise

We illustrate the calculation rules described using two equations. For this purpose, we represent a union A ∪ B of two sets by two new sets such that the two new sets are disjoint. This is relatively easy because

$$\displaystyle \begin{aligned} A\cup B= (A{\setminus} B) \;\cup\; B \end{aligned} $$
(2.10)

must be fulfilled. Let us first realize that the union on the right is indeed identical to A ∪ B; second, these two new sets are disjoint. The second condition is obviously met, because AB by definition only contains elements that are not included in B, i.e.,

$$\displaystyle \begin{aligned} A{\setminus} B \cap B =\emptyset. \end{aligned} $$
(2.11)

If the union of the two sets is to be determined precisely, the procedure would be as follows:

$$\displaystyle \begin{aligned} A{\setminus} B \cup B &=\left\{x | x\in A{\setminus} B \;\text{or}\; x\in B \right\} \\ &=\left\{x | \left(x\in A \;\text{and}\;x\not\in B\right) \;\text{or}\; x\in B \right\} \\ &=\left\{x | x\in A \;\text{or}\; x\in B \right\} \\ &=A\cup B. \end{aligned} $$
(2.12)

The Venn diagram in Fig. 2.3 illustrates our considerations. Similarly, it is clear that for any set A and B

$$\displaystyle \begin{aligned} A= (A{\setminus} B) \;\cup\; (A\cap B) \end{aligned} $$
(2.13)

applies.

Fig. 2.3
figure 3

The union of the disjoint sets AB (left, blue) and B results in A ∪ B (right)

At times we have to deal with infinite operations of unions and intersections. Let us assume that an infinite sequence of sets A 1, A 2, … exists.Footnote 5 The infinite union

$$\displaystyle \begin{aligned} \bigcup_{n=1}^\infty A_n \end{aligned} $$
(2.14)

is the set that contains all elements from each set A n. Figure 2.4 illustrates such an infinite union. Likewise, the intersection

$$\displaystyle \begin{aligned} \bigcap_{n=1}^\infty A_n \end{aligned} $$
(2.15)

is the set which contains only those elements existing in all sets A n.

Fig. 2.4
figure 4

An infinite union of ascending subsets in the Venn diagram

As an example

$$\displaystyle \begin{aligned} \mathbb{N}=\bigcup_{n=0}^\infty \{n\} \end{aligned} $$
(2.16)

applies for the set of natural numbers because the infinite union includes all natural numbers. Likewise we have

$$\displaystyle \begin{aligned} \emptyset=\bigcap_{n=0}^\infty [n,\infty) \end{aligned} $$
(2.17)

since an element that should be in all half-open intervals [n, ) must be greater than any natural number n—and such a number does not exist.Footnote 6

Another example of real numbers is a good illustration of the concept. As A n we choose the interval \(A_n:=\left [\frac {1}{n}, 1-\frac {1}{n}\right ]\) with n > 1. If n increases the interval also increases which is why A n is a subset of A n+1. (A n)n=1,… is indeed a sequence of subsets. The limit of this sequence is then the interval

$$\displaystyle \begin{aligned} \bigcup_{n=1}^\infty \left[\frac{1}{n}, 1-\frac{1}{n}\right]=(0,1), \end{aligned} $$
(2.18)

because each number in the open interval (0, 1) lies (for sufficiently large n) in a set A n. Further, the boundary values 0 and 1 lie neither in the open interval (0, 1) nor in one of the sets A n. We hope that both examples help to understand what is meant by an infinite sequence of sets.

Power Set

The set of all subsets of a set Ω is called the power set which is denoted by \({\mathcal {P}}(\Omega )\). One has to realize that the power set is much larger than the set itself.

Think about a situation with six elements Ω = {1, 2, 3, 4, 5, 6}. If we look at all subsets of this finite event space, we arrive at a total of 26 = 64 subsets of the event space, namelyFootnote 7

$$\displaystyle \begin{aligned}\displaystyle {\mathcal{P}}(\Omega)=\left\{\emptyset, \{1\}, \{2\}, \ldots, \{6\}, \{1, 2\}, \{1, 3\}, \{1, 4\}, \ldots, \right. \\\displaystyle \left. \{1, 2, 3\}, \{1, 2, 4\}, \ldots, \{1, 2, 3, 4, 5, 6\} \right\}. \end{aligned} $$

Also for infinite sets, the power set is much larger than the initial set. This is a bit surprising, because it is not clear, why one can distinguish different “levels” (more precisely cardinalities) of infinity. We have put these considerations in Sect. 7.1.Footnote 8

2.2 Events and Sets

In colloquial language it is said that “events occur.” But what is an event? Specific examples are a dice roll, the share price at the end of a trading day, or the move of a chess player. In economic contexts, an event often determines an economic result (such as a pay-out, a pay-in, a profit).

However, an economist is usually not satisfied with the statement that this or that event could occur. Rather, economists calculate the expected values or variances of payments triggered by those events. In order to do so, mathematicians operate with the term “set.” To understand this, let us take a closer look at the example of a dice.

Dice Roll

We can trust that everyone knows which characteristics an ideal dice has. If we take a closer look at several possible events related to a dice roll and identify them with certain symbols:

A 1 :

The one was rolled.

A 2 :

An odd score was rolled.

A 3 :

The dice can no longer be found.

A 4 :

The score cannot be read.

A 5 :

A dice has been rolled without the score being noted.

We neither care whether the dice is thrown with the left hand or with the right hand nor whether it is pushed of the table. What matters is the score on the top. Therefore, one could describe the event of the dice roll alone by the score that appears at the end. This has the inestimable advantage that all conceivable events are completely described by six numbers. The mathematician ignores everything else.

Thus, the events A 3 and A 4 no longer differ for the mathematician: A 3 = A 4. Since obviously no scores are given, the mathematician even writes A 3 = A 4 = ∅.

For the events A 2 and A 5, the actual score is not reported. However, there is something that distinguishes the two events from the events A 3 and A 4: while A 3 and A 4 hide the scores, A 2 and A 5 do not. Here a number was definitely determined, but we were not told which it was. Mathematically, this is expressed for the last event by noting all possible scores, i.e., A 5 = {1, 2, 3, 4, 5, 6}.

Let us look at event A 1. In this event we are sure that the score was one. But then a mathematician uses the score to identify the event, i.e., A 1 = {1}.Footnote 9 In the same way, you can describe the event A 2 by enumerating all odd numbers. This means A 2 = {1, 3, 5}.

We realize that A 1 represents a subset of A 2, that is A 1 ⊂ A 2. There is a very clear interpretation for this set-theoretical representation: whenever the event A 1 occurs, the event A 2 is also true. And indeed, it is also true that the number of points is odd when a player rolls the number one. Of course, the opposite does not hold.

Elementary Event

To prepare for the following chapter on measures, it is useful to introduce the terms “elementary event” and “event space.”Footnote 10 All these sets are specific sets.

Elementary events are those sets that have no “genuine subsets.” What do we mean by this term? Since the empty set and the set itself constitute subsets, those two must not be considered. All remaining sets are the genuine ones. Elementary events therefore contain only a single element and are the smallest events which are conceivable.

The event space contains all events that one wants to look at. In the following we describe the terms descriptively with the help of examples.

To get an idea of an elementary event, imagine rolling the dice once and ask yourself what scores can occur. These are 1, 2, 3, 4, 5, and 6. Since these results cannot be broken down further, we call a set an elementary event if it contains one of these numbers.

While the term is easy to understand in context of a dice roll, it is not as simple if we consider realizations of a share price: here, one must know which listings are admitted on a stock exchange and if, for example, only full Dollar quotes or quotes in jumps of 10 cents are permitted. The identification of elementary events becomes even more complicated when one thinks of the results of a parliamentary election.

Event Space

We use this term to denote the set of all elementary events, commonly denoted by Ω. It is either a finite or an infinite set.Footnote 11

Event

One does not always only want to discuss elementary events. Rather, one often wants to describe the effects that follow from a combination of several elementary events: “When rolling an odd number …” or “At a day temperature above freezing ….” In this case we speak of composite events. Sometimes we simply use the term event. Such an event usually represents as set of elementary events. An event is thus a (arbitrary) subset of the event space, or A ⊂ Ω. A then stands for a (possibly compound) event.

Example 2.1 (Multiple Dice Rolls)

Set theory also allows us to characterize somewhat more complex events. Think of games in which the dice are rolled not once but several times in a row.Footnote 12 This is also easy to handle mathematically. If you have three rolls, you only have to note the three numbers in a row. Now there are two possibilities when rolling the dice several times: either the order in which the scores appear is important or it is not. If the order is relevant, the event would be described by a triple, i.e., (2, 1, 2) with three rolls. If the order is meaningless, the mathematician would note that the obtained numbers belong to the set {1, 2}.

Example 2.2 (Coin Toss, Once and Several Times)

We will later look at a situation where the result will depend on the toss of a coin. Two outcomes are relevant: heads or tails. The chance of a coin standing on the edge is usually excluded as being improbable.Footnote 13 Then a coin toss can be described by an element of the set {u, d}, where u stands for heads and d for tails. Thus the coin toss is similar to the dice roll, but here we have only two instead of six elementary events.

The situation becomes a little more complex when we look at multiple coin tosses. We will discuss details on page 10 in Example 2.4.

Example 2.3 (A Share Price)

Assume that the prices of a share correspond to any real nonnegative number. The event space Ω of a share price at a future point in time thus corresponds to \(\Omega =\mathbb {R}_+\).Footnote 14 This event space contains an infinite number of possible elementary events. Hence, the so-called power set is infinite, too.Footnote 15 The power set contains all (open and closed) subintervals of real numbers as well as their unions and intersections. Such a set is extremely large.

We will use this event space again when we discuss the Lebesgue measure and the Stieltjes measure.

So far we have explained these terms using the simple examples of a dice roll, a coin toss, and a single share price. One could therefore think that the event space Ω will always have to be constructed very simply. That is by no means the case. To illustrate this point, we present more complicated examples. To do so it is necessary, however, to clarify the difference between discrete-time and continuous-time models.

Discrete-Time Models

If you proceed in a discrete manner, you assume that the share price is quoted at t = 0, 1, …. There are periods between these dates in which no trading takes place and, as a result, no price is determined. Whether the time periods between the dates are long (a year) or short (a minute or a second) is a technical question, but not fundamental. It is crucial that the trade is interrupted again and again. In such models it is often assumed that the price movements from a point in time to the next are also of a discrete nature with a price either rising or falling by a (fixed) percentage. In such a case, we are dealing with a discrete-time model of share price development.

While t = 0 denotes the present, we characterize all future times with the natural numbers t = 1, 2, … up to the terminal date T. The terminal date can be infinite, T →. In this case there is “no end of the world.”

Example 2.4 (Binomial Model)

To get a vivid idea of the discrete-time concept, we now consider a simple binomial model with a finite number T future points in time, T > 1.

Let us assume that the price of a share today is S 0. A decision-maker may use the idea that this price will change at any future time either by the factor u(t) (for up) or by the factor d(t) (for down) with u(t) > d(t) > 0. The symbol ω t ∈{u(t), d(t)} in this simple model means nothing else than a process that causes the previous stock price S t−1 to change by the factor u(t) or the factor d(t), so that either S t = S t−1 ⋅ u(t) or S t = S t−1 ⋅ d(t) applies. If one has such a ω t in mind, one could speak of an “elementary event of a time.” However, there is no such term in the literature. If one examines all consecutive processes ω t for t = 1, 2, …, T, then one is dealing with a vector, and exactly such a vector is meant when the literature talks about discrete-time models of elementary events. An elementary event is therefore a vector Footnote 16

$$\displaystyle \begin{aligned} \omega=\left(\omega_1, \omega_2, \ldots, \omega_T\right)\in\Omega^T. \end{aligned} $$
(2.19)

The event space is then the set ΩT. Unlike the one-period model elementary events are no longer elements of Ω but vectors of elements of Ω. Thus, events are sets of vectors. If one specifies this for the binomial model with T = 2 future points in time, four possible elementary events can be distinguished,

$$\displaystyle \begin{aligned} \omega = \begin{cases} u(1), u(2) &\\ u(1), d(2) &\\ d(1), u(2) &\\ d(1), d(2) \end{cases}\,. \end{aligned} $$
(2.20)

In some economic examples, this model is used with an infinite time horizon. For the sake of simplicity, however, it is assumed that the factors u and d are constant over time. Thus, events are determined by a sequence of u’s and d’s. Any elementary event can be written as an infinite tuple

$$\displaystyle \begin{aligned} \omega=(\omega_1,\ldots)\in\{u,d\}^\infty. \end{aligned} $$
(2.21)

Such an event space is often illustrated by a so-called binomial model. A graphical representation is used in which the entry in the event vector (i.e., a ω t ∈{u, d}) is expressed by an upward or downward movement. Figure 2.5 represents such a model for the first three points in time. The particular path uud, i.e., an elementary event is highlighted. All paths are cut off at t = 4 and the movements continuing into infinity are only indicated. This corresponds to a coin toss with infinite repetition.

Fig. 2.5
figure 5

Binomial model with events up to t = 3, the event uud is highlighted

Our example provides further insights. Look at Fig. 2.5 and concentrate on the elementary event uud. Note that in addition to this path there exist two further elementary events (udu and duu). While their states in t = 3 are identical, the three elementary events are not. One also speaks of a recombining binomial model.

Such models are often used in the theory of evaluating derivatives. However, even if the paths uud, udu, and duu for the underlying asset (typically a share) result in the same payment at the time t = 3, this is not necessarily the case when valuing an option on this asset. There exist also derivatives where this value of the option at time t = 3 depends on the path that the underlying asset has taken, while it is hard to distinguish between the elementary events uud, udu, and duu in Fig. 2.5.

Continuous-Time Models

What changes when looking at a continuous-time model? A continuous model is based on the assumption that equity trading is never interrupted between two points in time. Rather, the market is trading on an ongoing basis. This implies that a stock price is given at any instant. The price is moving permanently. Furthermore, assuming that the share price can attain any value within an interval (however defined) the model is continuous in time and state.

Whether one prefers discrete or continuous models has absolutely nothing to do with the nature of reality. Rather, it is a question of usefulness.

Possible developments of a share price within a time interval [0, T] can no longer be described with the help of tuples or vectors (ω 1, ω 2, …, ω T): the number of entries would have to be infinite regardless of the size of the time interval. Although this “vector” does have integer column entries at any point in time, we need a column for each real number. Such an object is mathematically no longer a vector but a function.

Just as in the discrete model, an elementary event should describe the possible development of a share price over the entire time interval. If this event can be represented as a real number, then we must characterize it as a function,

$$\displaystyle \begin{aligned} \omega : [0, T] \,\rightarrow\, \Omega \,. \end{aligned} $$
(2.22)

An elementary event could be either an increasing or a decreasing development of the share price within [0, T]. In the first case ω would be a growing function, and in the second a decreasing one. Most elementary events will not have the property of monotonicity. Instead, one will usually observe irregular ups and downs. From now on events constitute sets of continuous functions.

Example 2.5 (Share Price Evolution)

In Example 2.4, we have studied the share prices at several future dates. In this example all time indices from today (t = 0) to the final date (t →) are available. Future share prices will then no longer be numbers but functions of time.

Then the event space Ω will get more elaborate. After all a share price evolution is a function of real numbers. We reasonably assume that this function is continuous (i.e., shows no jumps). Ω must then contain all continuous functions \(f(t):[0,\infty )\rightarrow \mathbb {R}\).Footnote 17 This set is also referred to in the literature as C[0, ). The letter C indicates “continuous.”

Anyone who wants to study the Brownian motion carefully must know that there are continuous functions and differentiable functions, but they are not identical. First of all, one can prove with little effort that a differentiable function must also be continuous. But the inverse does not have to be true: continuous functions are not necessarily differentiable. Using an example of Weierstraß we show on page 107 how such functions can be constructed. The role of these functions in a Brownian motion will be discussed later.

It is not a problem to imagine single elementary events of C[0, ). In Fig. 2.6 we have shown three conceivable share price developments. One of the shown share prices always grows at the same rate, another one fluctuates almost like a sinus function, and, finally, there is a share price development that could perhaps actually be observed on a stock exchange. Each of these functions is an elementary event from the set C[0, ).

Fig. 2.6
figure 6

Three elementary events in the event space C[0, )

There is no doubt that sinusoidal or linear share price trends are highly unlikely. It is not at all clear how to define and measure probabilities of share price evolutions at this point. Although unlikely, both the linear and the sinusoidal movement cannot be excluded. By contrast, no events in the sense defined here would be curves that are not continuous and show jumps. Equally unthinkable are share price developments which do not move forward in time but show a “time reversal,” i.e., move back into the past.Footnote 18

We would like to emphasize that all considerations are deterministic. Although uncertainty exists, we do not have probabilities yet. All examples of share price evolutions can occur. The three events mentioned (including the “random” function) assume that the future values will be described by the function f(t). Probabilistic considerations will be introduced later.

The Brownian motion uses the event space Ω = C[0, ). Usually it is assumed that all elements of the event space start in one and the same point; for all functions W(t) ∈ C[0, ) then W(0) = a applies. In the figure we have chosen a = 0, this specification will later also apply to the Brownian motions.

Figure 2.6 shows three functions being continuous and starting in the same point. These two conditions are typical for every path of the Brownian motion. However, a third characteristic of paths in Brownian motion is not recognizable in Fig. 2.6 and will be discussed later.