Let (Ω,F,P) be a probability space, G⊆F a sub-σ-field, and X an integrable random variable, E∣X∣<∞. Then a conditional expectation E(X∣G) exists: there is a G-measurable random variable Y such that
∫AXdP=∫AYdPfor every A∈G. Case 1: X≥0.
Let μ=P∣G, the original probability measure P restricted to the sub-σ-field G. So μ is a measure on (Ω,G). Define a second set function on G by
ν(A):=∫AXdP,A∈G.Since X≥0, the integral is non-negative; countable additivity of ν follows from the monotone convergence theorem. So ν is a measure on (Ω,G). It is finite because ν(Ω)=E[X]≤E∣X∣<∞.
Absolute continuity. For any A∈G,
μ(A)=0⟺P(A)=0⟹∫AXdP=0⟺ν(A)=0.The middle implication uses the fact that integrating any function over a P-null set gives zero. So ν≪μ.
Apply Radon-Nikodym. Both μ and ν are finite, hence σ-finite, measures on (Ω,G). By the Radon-Nikodym theorem, there is a G-measurable function Y=dμdν such that
ν(A)=∫AYdμfor every A∈G.Identify dμ with dP on G-events. Since μ=P∣G agrees with P on every set in G, and Y is G-measurable, integrating Y over A∈G against μ or against P gives the same value:
∫AYdμ=∫AYdP.Chaining the identities,
∫AXdP=ν(A)=∫AYdμ=∫AYdPfor every A∈G.So Y satisfies both conditions of the definition: it is G-measurable, and it has the same integral as X over every G-event. This finishes Case 1.
Case 2: general integrable X.
Split X into positive and negative parts,
X=X+−X−,X+=max(X,0),X−=max(−X,0).Both X+ and X− are non-negative and integrable, since E[X±]≤E∣X∣<∞. Case 1 applied separately to X+ and X− produces G-measurable
Y1=E(X+∣G),Y2=E(X−∣G).Define Y:=Y1−Y2. Then Y is G-measurable (a difference of G-measurable functions), and for every A∈G,
∫AXdP=∫AX+dP−∫AX−dP=∫AY1dP−∫AY2dP=∫AYdP.The first equality uses linearity of the integral on X=X+−X−; the second is Case 1 applied to X+ and X−; the third is linearity in reverse. So Y is a conditional expectation of X given G.