CLT 2

The Lindeberg–Feller CLT extends the classical i.i.d. CLT to triangular arrays of independent but not necessarily identically distributed row entries. The price for this generality is a single technical condition (the Lindeberg condition) controlling how much mass the entries place in their tails.

The intuitive content: a large number of small independent random effects, regardless of how they are individually distributed, is approximately normal. This is the form of the CLT that justifies the working assumption in experimental physics and statistics that measurement errors are Gaussian.

For each $n \ge 1$ , let $X_{n,1}, \ldots, X_{n,n}$ be independent random variables with $\mathbb{E}[X_{n,m}] = 0$ . Suppose:

Convergence of variance.
$\sum_{m=1}^{n} \mathbb{E}[X_{n,m}^2] \;\longrightarrow\; \sigma^2 > 0.$
Lindeberg condition. For every $\epsilon > 0$ ,
$\lim_{n \to \infty} \sum_{m=1}^{n} \mathbb{E}\!\left[|X_{n,m}|^2 \,;\, |X_{n,m}| > \epsilon\right] = 0.$

Then, with $S_n = \sum_{m=1}^{n} X_{n,m}$ ,

S_n \xrightarrow{d} \sigma \cdot N(0, 1) \;\equiv\; N(0, \sigma^2) \quad \text{as } n \to \infty.

Conditions 1 and 2 together are called the Lindeberg–Feller conditions. The notation $\mathbb{E}[X; A] := \mathbb{E}[X \, \mathbb{1}_A] = \int_A X \, d\mathbb{P}$ is used throughout.

Reading the result

Lindeberg controls the tails uniformly across the row. It demands that, in the limit, no single $X_{n,m}$ dominates: the total second moment concentrated on the tails $\{|X_{n,m}| > \epsilon\}$ becomes negligible. This is what rules out heavy-tailed contributions that would break Gaussian convergence.
Asymptotic negligibility comes for free. Step 3 of the proof shows that the Lindeberg condition automatically forces $\max_m \sigma_{n,m}^2 \to 0$ . Operationally, no row entry is allowed to retain a non-vanishing fraction of the total variance. This is precisely the “many small effects” intuition.
Sufficient conditions for Lindeberg. Two practical ones:
- Bounded entries. If $|X_{n,m}| \le c_n$ uniformly with $c_n \to 0$ , then for any $\epsilon > 0$ , $\{|X_{n,m}| > \epsilon\}$ is eventually empty, so the Lindeberg sum is eventually zero.
- Lyapunov condition. If $\sum_m \mathbb{E}[|X_{n,m}|^{2+\delta}] \to 0$ for some $\delta > 0$ , then Lindeberg holds. (Markov’s inequality: $\mathbb{E}[X^2; |X| > \epsilon] \le \epsilon^{-\delta} \mathbb{E}[|X|^{2+\delta}]$ .)
Recovers CLT 1. Given i.i.d. $\{X_m\}$ with mean $\mu$ and variance $\sigma^2 < \infty$ , place them in a triangular array by setting
$X_{n,m} = \frac{X_m - \mu}{\sqrt{n}}, \quad 1 \le m \le n.$
Each row entry is centered, and condition 1 holds with the target variance $\sigma^2$ :
$\sum_{m=1}^{n} \mathbb{E}[X_{n,m}^2] = \sum_{m=1}^{n} \frac{\sigma^2}{n} = \sigma^2.$
For the Lindeberg condition, all $n$ entries on row $n$ share the same distribution, so the sum collapses to $n$ copies of a single expectation:
$\begin{aligned} \sum_{m=1}^{n} \mathbb{E}\!\left[X_{n,m}^2 \,;\, |X_{n,m}| > \epsilon\right] &= n \cdot \mathbb{E}\!\left[\frac{(X_1 - \mu)^2}{n} \,;\, |X_1 - \mu| > \epsilon \sqrt{n}\right] \\ &= \mathbb{E}\!\left[(X_1 - \mu)^2 \,;\, |X_1 - \mu| > \epsilon \sqrt{n}\right]. \end{aligned}$
The integrand is dominated by $(X_1 - \mu)^2$ (integrable) and the event $\{|X_1 - \mu| > \epsilon \sqrt{n}\}$ shrinks to $\emptyset$ as $n \to \infty$ , so by DCT the expectation tends to $0$ . Lindeberg–Feller now gives
$\sum_{m=1}^{n} X_{n,m} = \frac{S_n - n\mu}{\sqrt{n}} \xrightarrow{d} N(0, \sigma^2),$
which is exactly CLT 1.