Skip to content

Wald's Extension

Wald’s identity gives the mean of a random sum Y=i=1NXiY = \sum_{i=1}^N X_i. The natural follow-up: what is the full distribution of YY? Conditioning on NN and combining with characteristic functions gives a clean answer: the characteristic function of YY is the composition of the generating function of NN with the characteristic function of X1X_1.

The generating function uniquely determines the distribution of ZZ (just as the characteristic function does for general distributions), and its derivatives at t=1t = 1 recover the moments: gZ(1)=E[Z]g_Z'(1) = \E[Z], etc.

The reading: the characteristic function of YY is the composition gNφX1g_N \circ \varphi_{X_1}. Random summation in time-domain becomes function composition in the transform domain.

Worked example: Geometric number of exponential summands

Section titled “Worked example: Geometric number of exponential summands”

Let X1,X2,X_1, X_2, \ldots be i.i.d. Exponential(λ)\text{Exponential}(\lambda), and let NGeometric(p)N \sim \text{Geometric}(p) on {1,2,}\{1, 2, \ldots\} with P(N=k)=(1p)k1p\Pr(N = k) = (1-p)^{k-1} p, independent of the XiX_i. The standard formulas:

φX1(t)  =  λλit,gN(t)  =  pt1(1p)t.\varphi_{X_1}(t) \;=\; \frac{\lambda}{\lambda - it}, \qquad g_N(t) \;=\; \frac{p t}{1 - (1 - p) t}.

Apply the theorem:

φY(t)  =  gN ⁣(φX1(t))=pλλit1(1p)λλit=pλ(λit)(1p)λ=pλpλit.\begin{aligned} \varphi_Y(t) \;=\; g_N\!\big( \varphi_{X_1}(t) \big) &= \frac{p \cdot \frac{\lambda}{\lambda - it}}{1 - (1 - p) \cdot \frac{\lambda}{\lambda - it}} \\ &= \frac{p \lambda}{(\lambda - it) - (1 - p) \lambda} \\ &= \frac{p \lambda}{p \lambda - it}. \end{aligned}

This is the characteristic function of Exponential(pλ)\text{Exponential}(p\lambda). So

Y  =  i=1NXi    Exponential(pλ).Y \;=\; \sum_{i=1}^N X_i \;\sim\; \text{Exponential}(p \lambda).

A geometric number of i.i.d. exponentials is itself exponential, with rate scaled by the success probability pp. The mean is consistent with Wald’s identity:

E[Y]  =  E[X1]E[N]  =  1λ1p  =  1pλ.\E[Y] \;=\; \E[X_1] \cdot \E[N] \;=\; \frac{1}{\lambda} \cdot \frac{1}{p} \;=\; \frac{1}{p \lambda}.

Why this is stronger than Wald’s Identity

Section titled “Why this is stronger than Wald’s Identity”

Wald’s identity gives only the first moment, E[Y]=E[X1]E[N]\E[Y] = \E[X_1] \, \E[N]. Wald’s extension gives the entire characteristic function, which by the continuity theorem determines the entire distribution. From it one can extract all moments (by differentiating), recognize known distributions (as above), and prove distributional convergence results for random sums.