A single random variable models one uncertain quantity. To model a quantity that evolves, like a gambler’s fortune or the price of an asset, we need a whole family of random variables indexed by time, together with a bookkeeping of what is known at each instant. Those two ingredients are the stochastic process and the filtration.
A martingale is the special case where, given everything known so far, the expected next value equals the current one, i.e., the mathematical model of a fair game.
Most commonly the index set is T={0,1,2,…}, giving a discrete-time process, or T=[0,∞), giving a continuous-time process. Everything in this section is discrete-time, so T={0,1,2,…} throughout.
The second ingredient records how information accumulates.
The canonical example is the filtration generated by the process itself,
Fn=σ(X0,…,Xn),
the σ-field carrying exactly the information in the first n+1 observations: everything that can be decided once X0,…,Xn have been seen, and nothing more.
For a process and a filtration to fit together, the process must not look into the future.
Measurability with respect to Ft means Xt is determined by the information available at time t. So adaptedness says that the value of the process at time t is known at time t. A process is always adapted to the filtration it generates, since Xn is σ(X0,…,Xn)-measurable by construction.
To see what adaptedness rules out, here is a process that fails it.
Example (a process that is not adapted). Let {Xn}n≥0 be any process with generated filtration Fn=σ(X0,…,Xn), and define the running three-term average
Yn=31(Xn−1+Xn+Xn+1),Y0=X0.
Then Yn depends on Xn+1, which Fn knows nothing about, so Yn is not Fn-measurable. Hence {Yn}n≥0 is not adapted: computing Yn requires peeking one step into the future.
Adaptedness is the compatibility condition that ties a stochastic process to a filtration, and every definition that follows assumes it.
Condition (1) keeps the conditional expectation in (2) well-defined. Condition (2) is the martingale property, and it carries the whole idea: the expected future value, given all the information up to the present, is exactly the present value. The best forecast of tomorrow is today.
Relaxing the equality in (2) to an inequality gives the two one-sided variants:
E(Xn+1∣Fn)≤Xn⟹{Xn} is a supermartingale,E(Xn+1∣Fn)≥Xn⟹{Xn} is a submartingale.
The gambling reading makes the names concrete. Taking expectations on both sides of the martingale property and using the tower property, E[Xn+1]=E[E(Xn+1∣Fn)]=E[Xn], so the mean is constant in time:
E[Xn]=E[X0]for every n.
A martingale is a fair game: on average you neither gain nor lose. The same computation on the one-sided versions makes the sequence of means monotone:
Supermartingale:E[X0]≥E[X1]≥E[X2]≥⋯. You do not gain on average.
Submartingale:E[X0]≤E[X1]≤E[X2]≤⋯. You do not lose on average.
The terminology runs opposite to the everyday sense of the prefixes: a supermartingale is the one whose expectation drifts down. A useful way to remember it is that a supermartingale sits above where it is heading, while a submartingale sits below.
The cleanest first example of a martingale is the simple random walk.
Let {Xn}n≥1 be independent and identically distributed with
P(Xn=1)=P(Xn=−1)=21,
and define the partial sums
S0=0,Sn=X1+⋯+Xn(n≥1).
At each step the walk moves one unit forward or backward with equal probability. The two adjectives name these two features: symmetric because +1 and −1 are equally likely, and simple because every step has size exactly one.
Take the generated filtration Fn=σ(X1,…,Xn) for n≥1, with F0={∅,Ω} the trivial σ-field. Then {Sn}n≥0 is an {Fn}-martingale.
Integrability. Since each Xi∈{−1,+1}, the walk after n steps satisfies ∣Sn∣≤n, so E∣Sn∣≤n<∞.
Martingale property. Split off the latest increment and apply linearity:
The first term simplifies because Sn is Fn-measurable, so it passes through conditioning untouched: E[Sn∣Fn]=Sn. The second term simplifies because Xn+1 is independent of Fn=σ(X1,…,Xn), so conditioning has no effect: E[Xn+1∣Fn]=E[Xn+1]=0. Therefore
E[Sn+1∣Fn]=Sn+0=Sn.
This makes the generalization to a biased walk immediate. Suppose the increments are i.i.d. and integrable with E[X1]=a=0. The same split gives E[Sn+1∣Fn]=Sn+a, so
{Sn} is a {supermartingalesubmartingaleif a<0,if a>0.
To turn a biased walk back into a martingale, subtract the accumulated mean. The centered walk
Mn=Sn−nE[X1]=X1+⋯+Xn−nE[X1]
satisfies
E[Mn+1∣Fn]=(Sn+a)−(n+1)a=Sn−na=Mn,
so {Mn}n≥0 is an {Fn}-martingale. All of these processes, symmetric or biased, centered or not, are still called random walks.