Skip to content

Martingales

A single random variable models one uncertain quantity. To model a quantity that evolves, like a gambler’s fortune or the price of an asset, we need a whole family of random variables indexed by time, together with a bookkeeping of what is known at each instant. Those two ingredients are the stochastic process and the filtration.

A martingale is the special case where, given everything known so far, the expected next value equals the current one, i.e., the mathematical model of a fair game.

Most commonly the index set is T={0,1,2,}T = \{0, 1, 2, \ldots\}, giving a discrete-time process, or T=[0,)T = [0, \infty), giving a continuous-time process. Everything in this section is discrete-time, so T={0,1,2,}T = \{0, 1, 2, \ldots\} throughout.

The second ingredient records how information accumulates.

The canonical example is the filtration generated by the process itself,

Fn=σ(X0,,Xn),\cF_n = \sigma(X_0, \ldots, X_n),

the σ\sigma-field carrying exactly the information in the first n+1n+1 observations: everything that can be decided once X0,,XnX_0, \ldots, X_n have been seen, and nothing more.

For a process and a filtration to fit together, the process must not look into the future.

Measurability with respect to Ft\cF_t means XtX_t is determined by the information available at time tt. So adaptedness says that the value of the process at time tt is known at time tt. A process is always adapted to the filtration it generates, since XnX_n is σ(X0,,Xn)\sigma(X_0, \ldots, X_n)-measurable by construction.

To see what adaptedness rules out, here is a process that fails it.

Example (a process that is not adapted). Let {Xn}n0\{X_n\}_{n \ge 0} be any process with generated filtration Fn=σ(X0,,Xn)\cF_n = \sigma(X_0, \ldots, X_n), and define the running three-term average

Yn=13(Xn1+Xn+Xn+1),Y0=X0.Y_n = \tfrac{1}{3}\big(X_{n-1} + X_n + X_{n+1}\big), \qquad Y_0 = X_0.

Then YnY_n depends on Xn+1X_{n+1}, which Fn\cF_n knows nothing about, so YnY_n is not Fn\cF_n-measurable. Hence {Yn}n0\{Y_n\}_{n \ge 0} is not adapted: computing YnY_n requires peeking one step into the future.

Adaptedness is the compatibility condition that ties a stochastic process to a filtration, and every definition that follows assumes it.

Condition (1) keeps the conditional expectation in (2) well-defined. Condition (2) is the martingale property, and it carries the whole idea: the expected future value, given all the information up to the present, is exactly the present value. The best forecast of tomorrow is today.

Relaxing the equality in (2) to an inequality gives the two one-sided variants:

E(Xn+1Fn)Xn        {Xn} is a supermartingale,\E(X_{n+1} \mid \cF_n) \le X_n \;\implies\; \{X_n\} \text{ is a } \textbf{supermartingale}, E(Xn+1Fn)Xn        {Xn} is a submartingale.\E(X_{n+1} \mid \cF_n) \ge X_n \;\implies\; \{X_n\} \text{ is a } \textbf{submartingale}.

The gambling reading makes the names concrete. Taking expectations on both sides of the martingale property and using the tower property, E[Xn+1]=E[E(Xn+1Fn)]=E[Xn]\E[X_{n+1}] = \E[\E(X_{n+1} \mid \cF_n)] = \E[X_n], so the mean is constant in time:

E[Xn]=E[X0]for every n.\E[X_n] = \E[X_0] \qquad \text{for every } n.

A martingale is a fair game: on average you neither gain nor lose. The same computation on the one-sided versions makes the sequence of means monotone:

  • Supermartingale: E[X0]E[X1]E[X2]\E[X_0] \ge \E[X_1] \ge \E[X_2] \ge \cdots. You do not gain on average.
  • Submartingale: E[X0]E[X1]E[X2]\E[X_0] \le \E[X_1] \le \E[X_2] \le \cdots. You do not lose on average.

The terminology runs opposite to the everyday sense of the prefixes: a supermartingale is the one whose expectation drifts down. A useful way to remember it is that a supermartingale sits above where it is heading, while a submartingale sits below.

The cleanest first example of a martingale is the simple random walk.

Let {Xn}n1\{X_n\}_{n \ge 1} be independent and identically distributed with

P(Xn=1)=P(Xn=1)=12,\Pr(X_n = 1) = \Pr(X_n = -1) = \tfrac{1}{2},

and define the partial sums

S0=0,Sn=X1++Xn(n1).S_0 = 0, \qquad S_n = X_1 + \cdots + X_n \quad (n \ge 1).

At each step the walk moves one unit forward or backward with equal probability. The two adjectives name these two features: symmetric because +1+1 and 1-1 are equally likely, and simple because every step has size exactly one.

Take the generated filtration Fn=σ(X1,,Xn)\cF_n = \sigma(X_1, \ldots, X_n) for n1n \ge 1, with F0={,Ω}\cF_0 = \{\emptyset, \Omega\} the trivial σ\sigma-field. Then {Sn}n0\{S_n\}_{n \ge 0} is an {Fn}\{\cF_n\}-martingale.

  1. Integrability. Since each Xi{1,+1}X_i \in \{-1, +1\}, the walk after nn steps satisfies Snn|S_n| \le n, so ESnn<\E|S_n| \le n < \infty.

  2. Martingale property. Split off the latest increment and apply linearity:

    E[Sn+1Fn]=E[Sn+Xn+1Fn]=E[SnFn]+E[Xn+1Fn].\E[S_{n+1} \mid \cF_n] = \E[S_n + X_{n+1} \mid \cF_n] = \E[S_n \mid \cF_n] + \E[X_{n+1} \mid \cF_n].

    The first term simplifies because SnS_n is Fn\cF_n-measurable, so it passes through conditioning untouched: E[SnFn]=Sn\E[S_n \mid \cF_n] = S_n. The second term simplifies because Xn+1X_{n+1} is independent of Fn=σ(X1,,Xn)\cF_n = \sigma(X_1, \ldots, X_n), so conditioning has no effect: E[Xn+1Fn]=E[Xn+1]=0\E[X_{n+1} \mid \cF_n] = \E[X_{n+1}] = 0. Therefore

    E[Sn+1Fn]=Sn+0=Sn.\E[S_{n+1} \mid \cF_n] = S_n + 0 = S_n.

This makes the generalization to a biased walk immediate. Suppose the increments are i.i.d. and integrable with E[X1]=a0\E[X_1] = a \neq 0. The same split gives E[Sn+1Fn]=Sn+a\E[S_{n+1} \mid \cF_n] = S_n + a, so

{Sn} is a {supermartingaleif a<0,submartingaleif a>0.\{S_n\} \text{ is a } \begin{cases} \textbf{supermartingale} & \text{if } a < 0, \\ \textbf{submartingale} & \text{if } a > 0. \end{cases}

To turn a biased walk back into a martingale, subtract the accumulated mean. The centered walk

Mn=SnnE[X1]=X1++XnnE[X1]M_n = S_n - n\,\E[X_1] = X_1 + \cdots + X_n - n\,\E[X_1]

satisfies

E[Mn+1Fn]=(Sn+a)(n+1)a=Snna=Mn,\E[M_{n+1} \mid \cF_n] = (S_n + a) - (n+1)a = S_n - na = M_n,

so {Mn}n0\{M_n\}_{n \ge 0} is an {Fn}\{\cF_n\}-martingale. All of these processes, symmetric or biased, centered or not, are still called random walks.