Skip to content

Glivenko-Cantelli Theorem

The Glivenko–Cantelli theorem is sometimes called the Fundamental Theorem of Statistics: from i.i.d. samples of an unknown distribution, the empirical distribution function converges to the true distribution function uniformly (not just pointwise) and almost surely. It is the strongest possible “you can learn FF from a sample” statement at the level of CDFs.

Let X1,X2,X_1, X_2, \ldots be i.i.d. samples from an unknown distribution with distribution function FF. The empirical distribution function based on the first nn samples is

Fn(x)=1nm=1n1{Xmx}.F_n(x) = \frac{1}{n} \sum_{m=1}^{n} \mathbb{1}_{\{X_m \le x\}}.

This estimates P(X1x)=F(x)\mathbb{P}(X_1 \le x) = F(x) by the sample fraction of observations at most xx.

By the SLLN applied to the bounded random variables 1{Xmx}\mathbb{1}_{\{X_m \le x\}} (mean F(x)F(x)), for each fixed xx,

Fn(x)a.s.F(x).F_n(x) \xrightarrow{a.s.} F(x).

The same argument applied to 1{Xm<x}\mathbb{1}_{\{X_m < x\}} gives the left-limit version

Fn(x)=1nm=1n1{Xm<x}a.s.F(x).F_n(x-) = \frac{1}{n} \sum_{m=1}^{n} \mathbb{1}_{\{X_m < x\}} \xrightarrow{a.s.} F(x-).

Glivenko–Cantelli upgrades pointwise convergence to uniform convergence in xx.

  • Uniformity is what’s new. Pointwise a.s. convergence Fn(x)F(x)F_n(x) \to F(x) for each xx is immediate from the SLLN. The work is in showing the worst xx also behaves, regardless of how badly FF may jump.

  • Distribution-free rate. The bound supxFnF2/k\sup_x |F_n - F| \le 2/k depends only on the grid size kk, not on FF. This is the qualitative form of the Dvoretzky–Kiefer–Wolfowitz inequality, which gives the sharp quantitative rate supxFnF=Op(n1/2)\sup_x |F_n - F| = O_p(n^{-1/2}).

  • Why “Fundamental Theorem of Statistics”. Without knowing FF, observing samples lets you reconstruct it uniformly well from FnF_n. Every functional of FF that is continuous in the sup-norm topology can therefore be consistently estimated by the corresponding functional of FnF_n (quantiles, expectations of bounded continuous functions, Kolmogorov–Smirnov-type statistics).