Regular Conditional Distribution
The introduction chose Idea 2 (define conditional expectation directly) over Idea 1 (define the conditional distribution first, then derive the expectation as an integral) because the second route is harder. With the theory of conditional expectation in hand, we can now circle back and construct conditional distributions rigorously. The right object is the regular conditional distribution (RCD).
Motivation
Section titled “Motivation”How should we define the conditional distribution of given when is continuous? Concretely, what is ?
The elementary formula breaks down since .
A natural rephrasing uses conditional expectation:
But conditional expectation is only defined up to a -null set, and is itself a -null set. The value of on is unconstrained: it can be modified freely without changing the version. Worse, as ranges over Borel sets, the family of conditional probabilities so obtained need not assemble into a probability measure (countable additivity in holds only a.s., and the exceptional null set may depend on the chosen Borel cover).
The fix is to demand a single function that is simultaneously:
- a version of for each fixed ,
- a probability measure in for each fixed .
The second requirement is the regularity condition.
Definition
Section titled “Definition”Condition (1) is the “conditional probability” content: matches the conditional expectation of the indicator . Condition (2) is the regularity: for each typical , the slice is a genuine probability measure on , not just a collection of numbers indexed by Borel sets.
Recovering conditional expectation
Section titled “Recovering conditional expectation”Once an RCD exists, conditional expectation of any function of is computed by integrating against it.
The reading: once an RCD exists, conditional expectation is just integration against the RCD. This recovers the elementary picture from the introduction: conditioning on corresponds to integration against the conditional measure for any .
Existence
Section titled “Existence”The construction proceeds via conditional cumulative distribution functions on the rationals, then extends to , then converts to a measure.
Remarks
Section titled “Remarks”- The role of . The existence proof relied on in exactly one place: building a measure from its values on the countable -system of half-lines . The same machinery works for any random variable taking values in a Borel space (a measurable space isomorphic to a Borel subset of a Polish space), which covers , separable metric spaces, and most distributions of practical interest. For general measurable target spaces, RCDs need not exist.
- Uniqueness. Two RCDs of given agree as measures for -almost every . The proof mirrors the integration-identity argument above: the - theorem upgrades equality on the rational half-lines to equality on all Borel sets, -a.s. in .
- Disintegration. When for some random variable taking values in a Borel space, RCDs assemble into a disintegration of the joint law of : a Markov kernel for any with , well-defined up to a -null set of . Conditioning takes the elementary form , closing the loop with the introduction’s discussion of the discrete and absolutely continuous cases.