Regular Conditional Distribution

The introduction chose Idea 2 (define conditional expectation directly) over Idea 1 (define the conditional distribution first, then derive the expectation as an integral) because the second route is harder. With the theory of conditional expectation in hand, we can now circle back and construct conditional distributions rigorously. The right object is the regular conditional distribution (RCD).

Motivation

How should we define the conditional distribution of $X$ given $Y = y$ when $Y$ is continuous? Concretely, what is $\Pr(X \in A \mid Y = y)$ ?

The elementary formula $\Pr(A \mid B) = \frac{\Pr(A \cap B)}{\Pr(B)}$ breaks down since $\Pr(Y = y)=0$ .

A natural rephrasing uses conditional expectation:

\Pr(X \in A \mid Y = y) \;=\; \E\!\left[ \mathbb{1}_{\{X \in A\}} \,\big|\, Y \right](\omega) \quad \text{for some } \omega \in \{Y = y\}.

But conditional expectation is only defined up to a $\Pr$ -null set, and $\{Y = y\}$ is itself a $\Pr$ -null set. The value of $\E[\mathbb{1}_{\{X \in A\}} \mid Y]$ on $\{Y = y\}$ is unconstrained: it can be modified freely without changing the version. Worse, as $A$ ranges over Borel sets, the family of conditional probabilities so obtained need not assemble into a probability measure (countable additivity in $A$ holds only a.s., and the exceptional null set may depend on the chosen Borel cover).

The fix is to demand a single function $\omega \mapsto \mu(\omega, \cdot)$ that is simultaneously:

a version of $\Pr(X \in A \mid \cG)$ for each fixed $A$ ,
a probability measure in $A$ for each fixed $\omega$ .

The second requirement is the regularity condition.

Definition

Condition (1) is the “conditional probability” content: $\mu(\cdot, A)$ matches the conditional expectation of the indicator $\mathbb{1}_{\{X \in A\}}$ . Condition (2) is the regularity: for each typical $\omega$ , the slice $\mu(\omega, \cdot)$ is a genuine probability measure on $\R$ , not just a collection of numbers indexed by Borel sets.

Recovering conditional expectation

Once an RCD exists, conditional expectation of any function of $X$ is computed by integrating against it.

Statement
Proof

Let $\mu(\omega, \cdot)$ be an RCD for $X$ given $\cG$ . For any Borel-measurable $f : \R \to \R$ with $\E|f(X)| < \infty$ ,

\E[f(X) \mid \cG](\omega) \;=\; \int_\R f(x) \, \mu(\omega, dx) \quad \text{a.s.}

The reading: once an RCD exists, conditional expectation is just integration against the RCD. This recovers the elementary picture from the introduction: conditioning on $Y = y$ corresponds to integration against the conditional measure $\mu(\omega, \cdot)$ for any $\omega \in \{Y = y\}$ .

Existence

The construction proceeds via conditional cumulative distribution functions on the rationals, then extends to $\R$ , then converts to a measure.

Remarks

The role of $\R$ . The existence proof relied on $\R$ in exactly one place: building a measure from its values on the countable $\pi$ -system of half-lines $\{(-\infty, r] : r \in \Q\}$ . The same machinery works for any random variable taking values in a Borel space (a measurable space isomorphic to a Borel subset of a Polish space), which covers $\R^n$ , separable metric spaces, and most distributions of practical interest. For general measurable target spaces, RCDs need not exist.
Uniqueness. Two RCDs of $X$ given $\cG$ agree as measures for $\Pr$ -almost every $\omega$ . The proof mirrors the integration-identity argument above: the $\pi$ - $\lambda$ theorem upgrades equality on the rational half-lines to equality on all Borel sets, $\Pr$ -a.s. in $\omega$ .
Disintegration. When $\cG = \sigma(Y)$ for some random variable $Y$ taking values in a Borel space, RCDs assemble into a disintegration of the joint law of $(X, Y)$ : a Markov kernel $K(y, dx) := \mu(\omega, dx)$ for any $\omega$ with $Y(\omega) = y$ , well-defined up to a $\Pr_Y$ -null set of $y$ . Conditioning takes the elementary form $\E[f(X) \mid Y = y] = \int_\R f(x) \, K(y, dx)$ , closing the loop with the introduction’s discussion of the discrete and absolutely continuous cases.