Skip to content

Uniqueness

The existence proof produced a conditional expectation E(XG)\E(X \mid \cG). A natural follow-up: how many such random variables are there? The answer is essentially one, up to behavior on a P\Pr-null set.

The proposition above says that any two conditional expectations of the same XX given the same G\cG agree on Ω\Omega except on a P\Pr-null set. A specific representative is called a version of E(XG)\E(X \mid \cG). Different versions differ only on a P\Pr-null set, so any statement about E(XG)\E(X \mid \cG) that is invariant under modification on a null set (such as E[Y]\E[Y], Var(Y)\Var(Y), or any almost-sure inequality) is unambiguous. When writing E(XG)\E(X \mid \cG) without qualification, we tacitly mean “a version” and the choice does not matter for any almost-sure statement.

The general definition conditions on a σ\sigma-field. The elementary notion conditions on a random variable. These are reconciled by reducing the second to the first.

This recovers the elementary formulas as special cases. When XX is discrete with P(X=x)>0\Pr(X = x) > 0 for each xx in its support, σ(X)\sigma(X) is generated by the partition {{X=x}}x\{\{X = x\}\}_x, and the block-average picture applied to this partition recovers

E(YX)(ω)  =  xE(YX=x)1{X=x}(ω).\E(Y \mid X)(\omega) \;=\; \sum_x \E(Y \mid X = x) \, \mathbb{1}_{\{X = x\}}(\omega).

More generally, since E(YX)\E(Y \mid X) is σ(X)\sigma(X)-measurable, the Doob-Dynkin lemma guarantees a Borel-measurable g:RRg : \R \to \R with E(YX)=g(X)\E(Y \mid X) = g(X) a.s. The function gg is the regression function of YY on XX, and E(YX=x):=g(x)\E(Y \mid X = x) := g(x) recovers the elementary “given X=xX = x” notation, this time as a function defined up to PX\Pr_X-null sets (where PX\Pr_X is the distribution of XX).