Skip to content

CLT 2

The Lindeberg–Feller CLT extends the classical i.i.d. CLT to triangular arrays of independent but not necessarily identically distributed row entries. The price for this generality is a single technical condition (the Lindeberg condition) controlling how much mass the entries place in their tails.

The intuitive content: a large number of small independent random effects, regardless of how they are individually distributed, is approximately normal. This is the form of the CLT that justifies the working assumption in experimental physics and statistics that measurement errors are Gaussian.

  • Lindeberg controls the tails uniformly across the row. It demands that, in the limit, no single Xn,mX_{n,m} dominates: the total second moment concentrated on the tails {Xn,m>ϵ}\{|X_{n,m}| > \epsilon\} becomes negligible. This is what rules out heavy-tailed contributions that would break Gaussian convergence.

  • Asymptotic negligibility comes for free. Step 3 of the proof shows that the Lindeberg condition automatically forces maxmσn,m20\max_m \sigma_{n,m}^2 \to 0. Operationally, no row entry is allowed to retain a non-vanishing fraction of the total variance. This is precisely the “many small effects” intuition.

  • Sufficient conditions for Lindeberg. Two practical ones:

    • Bounded entries. If Xn,mcn|X_{n,m}| \le c_n uniformly with cn0c_n \to 0, then for any ϵ>0\epsilon > 0, {Xn,m>ϵ}\{|X_{n,m}| > \epsilon\} is eventually empty, so the Lindeberg sum is eventually zero.
    • Lyapunov condition. If mE[Xn,m2+δ]0\sum_m \mathbb{E}[|X_{n,m}|^{2+\delta}] \to 0 for some δ>0\delta > 0, then Lindeberg holds. (Markov’s inequality: E[X2;X>ϵ]ϵδE[X2+δ]\mathbb{E}[X^2; |X| > \epsilon] \le \epsilon^{-\delta} \mathbb{E}[|X|^{2+\delta}].)
  • Recovers CLT 1. Given i.i.d. {Xm}\{X_m\} with mean μ\mu and variance σ2<\sigma^2 < \infty, place them in a triangular array by setting

    Xn,m=Xmμn,1mn.X_{n,m} = \frac{X_m - \mu}{\sqrt{n}}, \quad 1 \le m \le n.

    Each row entry is centered, and condition 1 holds with the target variance σ2\sigma^2:

    m=1nE[Xn,m2]=m=1nσ2n=σ2.\sum_{m=1}^{n} \mathbb{E}[X_{n,m}^2] = \sum_{m=1}^{n} \frac{\sigma^2}{n} = \sigma^2.

    For the Lindeberg condition, all nn entries on row nn share the same distribution, so the sum collapses to nn copies of a single expectation:

    m=1nE ⁣[Xn,m2;Xn,m>ϵ]=nE ⁣[(X1μ)2n;X1μ>ϵn]=E ⁣[(X1μ)2;X1μ>ϵn].\begin{aligned} \sum_{m=1}^{n} \mathbb{E}\!\left[X_{n,m}^2 \,;\, |X_{n,m}| > \epsilon\right] &= n \cdot \mathbb{E}\!\left[\frac{(X_1 - \mu)^2}{n} \,;\, |X_1 - \mu| > \epsilon \sqrt{n}\right] \\ &= \mathbb{E}\!\left[(X_1 - \mu)^2 \,;\, |X_1 - \mu| > \epsilon \sqrt{n}\right]. \end{aligned}

    The integrand is dominated by (X1μ)2(X_1 - \mu)^2 (integrable) and the event {X1μ>ϵn}\{|X_1 - \mu| > \epsilon \sqrt{n}\} shrinks to \emptyset as nn \to \infty, so by DCT the expectation tends to 00. Lindeberg–Feller now gives

    m=1nXn,m=SnnμndN(0,σ2),\sum_{m=1}^{n} X_{n,m} = \frac{S_n - n\mu}{\sqrt{n}} \xrightarrow{d} N(0, \sigma^2),

    which is exactly CLT 1.