Glivenko-Cantelli Theorem
The Glivenko–Cantelli theorem is sometimes called the Fundamental Theorem of Statistics: from i.i.d. samples of an unknown distribution, the empirical distribution function converges to the true distribution function uniformly (not just pointwise) and almost surely. It is the strongest possible “you can learn from a sample” statement at the level of CDFs.
Setup: Empirical distribution function
Section titled “Setup: Empirical distribution function”Let be i.i.d. samples from an unknown distribution with distribution function . The empirical distribution function based on the first samples is
This estimates by the sample fraction of observations at most .
By the SLLN applied to the bounded random variables (mean ), for each fixed ,
The same argument applied to gives the left-limit version
Glivenko–Cantelli upgrades pointwise convergence to uniform convergence in .
Theorem
Section titled “Theorem”Reading the result
Section titled “Reading the result”-
Uniformity is what’s new. Pointwise a.s. convergence for each is immediate from the SLLN. The work is in showing the worst also behaves, regardless of how badly may jump.
-
Distribution-free rate. The bound depends only on the grid size , not on . This is the qualitative form of the Dvoretzky–Kiefer–Wolfowitz inequality, which gives the sharp quantitative rate .
-
Why “Fundamental Theorem of Statistics”. Without knowing , observing samples lets you reconstruct it uniformly well from . Every functional of that is continuous in the sup-norm topology can therefore be consistently estimated by the corresponding functional of (quantiles, expectations of bounded continuous functions, Kolmogorov–Smirnov-type statistics).