Glivenko-Cantelli Theorem

The Glivenko–Cantelli theorem is sometimes called the Fundamental Theorem of Statistics: from i.i.d. samples of an unknown distribution, the empirical distribution function converges to the true distribution function uniformly (not just pointwise) and almost surely. It is the strongest possible “you can learn $F$ from a sample” statement at the level of CDFs.

Setup: Empirical distribution function

Let $X_1, X_2, \ldots$ be i.i.d. samples from an unknown distribution with distribution function $F$ . The empirical distribution function based on the first $n$ samples is

F_n(x) = \frac{1}{n} \sum_{m=1}^{n} \mathbb{1}_{\{X_m \le x\}}.

This estimates $\mathbb{P}(X_1 \le x) = F(x)$ by the sample fraction of observations at most $x$ .

By the SLLN applied to the bounded random variables $\mathbb{1}_{\{X_m \le x\}}$ (mean $F(x)$ ), for each fixed $x$ ,

F_n(x) \xrightarrow{a.s.} F(x).

The same argument applied to $\mathbb{1}_{\{X_m < x\}}$ gives the left-limit version

F_n(x-) = \frac{1}{n} \sum_{m=1}^{n} \mathbb{1}_{\{X_m < x\}} \xrightarrow{a.s.} F(x-).

Glivenko–Cantelli upgrades pointwise convergence to uniform convergence in $x$ .

As $n \to \infty$ ,

\sup_{x \in \mathbb{R}} \big|F_n(x) - F(x)\big| \xrightarrow{a.s.} 0.

Reading the result

Uniformity is what’s new. Pointwise a.s. convergence $F_n(x) \to F(x)$ for each $x$ is immediate from the SLLN. The work is in showing the worst $x$ also behaves, regardless of how badly $F$ may jump.
Distribution-free rate. The bound $\sup_x |F_n - F| \le 2/k$ depends only on the grid size $k$ , not on $F$ . This is the qualitative form of the Dvoretzky–Kiefer–Wolfowitz inequality, which gives the sharp quantitative rate $\sup_x |F_n - F| = O_p(n^{-1/2})$ .
Why “Fundamental Theorem of Statistics”. Without knowing $F$ , observing samples lets you reconstruct it uniformly well from $F_n$ . Every functional of $F$ that is continuous in the sup-norm topology can therefore be consistently estimated by the corresponding functional of $F_n$ (quantiles, expectations of bounded continuous functions, Kolmogorov–Smirnov-type statistics).