Examples
A few canonical settings where the CLT applies. Each example specifies and , then the standardization
gives the limit shape.
Interactive: Convolution → Gaussian
Section titled “Interactive: Convolution → Gaussian”Pick a base distribution and slide . The blue bars are the exact distribution of the -fold sum (computed by repeatedly convolving the base PMF with itself, not simulated). The red dashed curve is the Gaussian with mean and variance . Notice how rapidly the bars match the curve as grows, even when the base is asymmetric (Bernoulli(0.3)) or bimodal.
Binomial → Normal (de Moivre–Laplace)
Section titled “Binomial → Normal (de Moivre–Laplace)”The historical CLT: a sum of independent Bernoulli variables is Binomial. With and ,
Reading. For large , . This is the de Moivre–Laplace theorem (1733/1812), historically the first CLT, predating the i.i.d. CLT by over a century. The widget’s Bernoulli(0.3) option is exactly this setting: -fold convolution of Bernoulli is Binomial, and the dashed Gaussian overlay is the de Moivre-Laplace approximation . Slide to watch the binomial bars relax onto the bell curve.
Rule of thumb. The approximation is excellent when and . For small (rare events), the approximation degrades and the Poisson limit is more appropriate.
Uniform → Normal (Irwin–Hall)
Section titled “Uniform → Normal (Irwin–Hall)”For i.i.d. , and , so
Reading. The sum has the Irwin–Hall distribution, supported on . Even at , the distribution is already strikingly bell-shaped (this is the basis for the classic “sum of 12 uniforms minus 6” trick for crude Gaussian random number generation).
Exponential → Normal (Gamma)
Section titled “Exponential → Normal (Gamma)”For i.i.d. , and . The sum is :
Reading. The exponential distribution is heavily right-skewed (skewness ). The CLT applies, but convergence is slower than for symmetric distributions: the gamma’s skewness is , so even at a noticeable rightward bias remains. The widget above renders this exactly: select Exponential(1) and slide . At the curve is the bare exponential decay; the rightward tail visibly persists past , while the symmetric die has already locked onto the Gaussian by then.
Dice → Normal
Section titled “Dice → Normal”Rolling fair six-sided dice and summing gives with and . The standardized sum converges to .
Reading. Already by the histogram of the standardized sum is virtually indistinguishable from at the resolution typically used. This is why averaging dice rolls is the canonical introduction to CLT in undergraduate texts.
Non-example: Cauchy → Cauchy
Section titled “Non-example: Cauchy → Cauchy”The Cauchy distribution has density and no finite mean, let alone variance. The CLT does not apply.
In fact, for i.i.d. Cauchy variables,
not just in the limit. Averaging never narrows the distribution; the sample mean is no better an estimator of the (nonexistent) “center” than a single observation. This is the canonical heavy-tailed counterexample, ruled out by the finite-variance hypothesis of CLT 1.