Let { X n } n ≥ 1 \{X_n\}_{n \ge 1} { X n } n ≥ 1 be a sequence of random variables such that X n → d X X_n \xrightarrow{d} X X n d X .
Then there exists a probability space ( Ω , F , P ) (\Omega, \mathcal{F}, \mathbb{P}) ( Ω , F , P ) and random variables on it, { Y n } n ≥ 1 \{Y_n\}_{n \ge 1} { Y n } n ≥ 1 and Y Y Y , such that:
Same Distributions : X n = d Y n X_n \stackrel{d}{=} Y_n X n = d Y n for all n n n , and X = d Y X \stackrel{d}{=} Y X = d Y .
Almost Sure Convergence :
Y n → a . s . Y as n → ∞ Y_n \xrightarrow{a.s.} Y \quad \text{as } n \to \infty Y n a . s . Y as n → ∞
Construction :
We use the standard uniform space as our common probability space:
( Ω , F , P ) = ( [ 0 , 1 ] , B , λ ) (\Omega, \mathcal{F}, \mathbb{P}) = ([0, 1], \mathcal{B}, \lambda) ( Ω , F , P ) = ([ 0 , 1 ] , B , λ ) where λ \lambda λ is the Lebesgue measure.
Let F n F_n F n be the CDF of X n X_n X n and F F F be the CDF of X X X .
We define Y n Y_n Y n and Y Y Y using the generalized inverse (quantile function):
Y n ( ω ) = F n − 1 ( ω ) = inf { x : F n ( x ) ≥ ω } Y ( ω ) = F − 1 ( ω ) = inf { x : F ( x ) ≥ ω } \begin{aligned}
Y_n(\omega) &= F_n^{-1}(\omega) = \inf \{x : F_n(x) \ge \omega\} \\
Y(\omega) &= F^{-1}(\omega) = \inf \{x : F(x) \ge \omega\}
\end{aligned} Y n ( ω ) Y ( ω ) = F n − 1 ( ω ) = inf { x : F n ( x ) ≥ ω } = F − 1 ( ω ) = inf { x : F ( x ) ≥ ω } (Note: The notes define F − 1 ( x ) = sup { y : F ( y ) < x } F^{-1}(x) = \sup\{y : F(y) < x\} F − 1 ( x ) = sup { y : F ( y ) < x } . This is equivalent for dist. functions).
To visualize the connection between the “flat parts” of F F F and the continuity of F − 1 F^{-1} F − 1 , consider the following definitions for any x ∈ [ 0 , 1 ) x \in [0,1) x ∈ [ 0 , 1 ) :
a x = sup { y : F ( y ) < x } = F − 1 ( x ) , b x = inf { y : F ( y ) > x } a_x = \sup \{y : F(y) < x\} = F^{-1}(x), \quad b_x = \inf \{y : F(y) > x\} a x = sup { y : F ( y ) < x } = F − 1 ( x ) , b x = inf { y : F ( y ) > x } If F F F has a flat part at height x x x , then a x ≠ b x a_x \neq b_x a x = b x . If strictly increasing, a x = b x a_x = b_x a x = b x .
From the properties of the generalized inverse (Inverse Transform Sampling), we know that if ω ∼ Unif [ 0 , 1 ] \omega \sim \text{Unif}[0,1] ω ∼ Unif [ 0 , 1 ] , then F − 1 ( ω ) F^{-1}(\omega) F − 1 ( ω ) has CDF F F F . Thus:
Y n = d X n and Y = d X Y_n \stackrel{d}{=} X_n \quad \text{and} \quad Y \stackrel{d}{=} X Y n = d X n and Y = d X Convergence Proof :
We want to show Y n ( ω ) → Y ( ω ) Y_n(\omega) \to Y(\omega) Y n ( ω ) → Y ( ω ) for almost all ω ∈ ( 0 , 1 ) \omega \in (0,1) ω ∈ ( 0 , 1 ) .
Let Ω 0 ⊂ ( 0 , 1 ) \Omega_0 \subset (0,1) Ω 0 ⊂ ( 0 , 1 ) be the set of points where F − 1 F^{-1} F − 1 is continuous (or related to the “flat parts” of F F F ).
Specifically, consider the intervals where F F F is constant. The set of values x x x where F ( x ) F(x) F ( x ) is flat corresponds to jumps in F − 1 F^{-1} F − 1 . Since F − 1 F^{-1} F − 1 is monotone, it has at most countably many discontinuities.
For any ω ∈ Ω 0 \omega \in \Omega_0 ω ∈ Ω 0 (which is almost everywhere), we show convergence.
Lower Bound (lim inf Y n ≥ Y \liminf Y_n \ge Y lim inf Y n ≥ Y )
Let ω ∈ Ω 0 \omega \in \Omega_0 ω ∈ Ω 0 .
Pick any y < Y ( ω ) = F − 1 ( ω ) y < Y(\omega) = F^{-1}(\omega) y < Y ( ω ) = F − 1 ( ω ) .
This implies F ( y ) < ω F(y) < \omega F ( y ) < ω (from the definition of inverse, if ω \omega ω is not in a flat gap).
Let’s choose y y y such that F F F is continuous at y y y (possible since discontinuities are countable).
Since X n → d X X_n \xrightarrow{d} X X n d X , we have F n ( y ) → F ( y ) F_n(y) \to F(y) F n ( y ) → F ( y ) .
Since F ( y ) < ω F(y) < \omega F ( y ) < ω , for sufficiently large n n n , we must have F n ( y ) < ω F_n(y) < \omega F n ( y ) < ω .
By definition of inverse:
F n ( y ) < ω ⟹ y ≤ F n − 1 ( ω ) = Y n ( ω ) F_n(y) < \omega \implies y \le F_n^{-1}(\omega) = Y_n(\omega) F n ( y ) < ω ⟹ y ≤ F n − 1 ( ω ) = Y n ( ω )
Thus, for large n n n , Y n ( ω ) ≥ y Y_n(\omega) \ge y Y n ( ω ) ≥ y .
Taking the liminf:
lim inf n → ∞ Y n ( ω ) ≥ y \liminf_{n \to \infty} Y_n(\omega) \ge y n → ∞ lim inf Y n ( ω ) ≥ y
Since this holds for any y < Y ( ω ) y < Y(\omega) y < Y ( ω ) (where F F F is continuous), let y ↑ Y ( ω ) y \uparrow Y(\omega) y ↑ Y ( ω ) :
lim inf n → ∞ Y n ( ω ) ≥ Y ( ω ) \liminf_{n \to \infty} Y_n(\omega) \ge Y(\omega) n → ∞ lim inf Y n ( ω ) ≥ Y ( ω )
Upper Bound (lim sup Y n ≤ Y \limsup Y_n \le Y lim sup Y n ≤ Y )
Similarly, for ω ∈ Ω 0 \omega \in \Omega_0 ω ∈ Ω 0 , pick z > Y ( ω ) z > Y(\omega) z > Y ( ω ) .
This implies F ( z ) > ω F(z) > \omega F ( z ) > ω . Choose z z z such that F F F is continuous at z z z .
Then F n ( z ) → F ( z ) > ω F_n(z) \to F(z) > \omega F n ( z ) → F ( z ) > ω .
For large n n n , F n ( z ) > ω ⟹ Y n ( ω ) ≤ z F_n(z) > \omega \implies Y_n(\omega) \le z F n ( z ) > ω ⟹ Y n ( ω ) ≤ z .
Taking the limsup:
lim sup n → ∞ Y n ( ω ) ≤ z \limsup_{n \to \infty} Y_n(\omega) \le z n → ∞ lim sup Y n ( ω ) ≤ z
Letting z ↓ Y ( ω ) z \downarrow Y(\omega) z ↓ Y ( ω ) :
lim sup n → ∞ Y n ( ω ) ≤ Y ( ω ) \limsup_{n \to \infty} Y_n(\omega) \le Y(\omega) n → ∞ lim sup Y n ( ω ) ≤ Y ( ω )
Combining steps 1 and 2, Y n ( ω ) → Y ( ω ) Y_n(\omega) \to Y(\omega) Y n ( ω ) → Y ( ω ) for all ω ∈ Ω 0 \omega \in \Omega_0 ω ∈ Ω 0 , which has probability 1.