The uniqueness follows from the fact that F 0 \cF_0 F 0 is a π \pi π -system (closed under finite intersections) and P \Pr P is a probability measure on F 0 \cF_0 F 0 . By the uniqueness theorem for measures, the extension to σ ( F 0 ) \sigma(\cF_0) σ ( F 0 ) is unique. More rigorous proof below.
Definition: π-system
A collection of sets P \cP P of Ω \Omega Ω is called a π-system if it is closed under finite intersections, i.e., for any A , B ∈ P A, B \in \cP A , B ∈ P , A ∩ B ∈ P A \cap B \in \cP A ∩ B ∈ P .
Here’s a theorem that relates π \pi π -systems and λ \lambda λ -systems.
π-λ Theorem
If P \cP P is a π \pi π -system and L \cL L is a λ \lambda λ -system such that P ⊆ L \cP \subseteq \cL P ⊆ L , then σ ( P ) ⊆ L \sigma(\cP) \subseteq \cL σ ( P ) ⊆ L .
Let L ( P ) L(\cP) L ( P ) be the smallest λ \lambda λ -system containing P \cP P (in other words, the intersection of all λ \lambda λ -systems containing P \cP P ). It is easy to check that L ( P ) L(\cP) L ( P ) is also a λ \lambda λ -system.
Hence, let L ( P ) L(\cP) L ( P ) be the smallest λ \lambda λ -system containing P \cP P .
Now define
G A : = { B ∈ L : A ∩ B ∈ L ( P ) } , \cG_A := \{B \in \cL : A \cap B \in L(\cP)\}, G A := { B ∈ L : A ∩ B ∈ L ( P )} ,
for any A ∈ P A \in \cP A ∈ P . We claim that G A \cG_A G A is a λ \lambda λ -system. To see this, let’s verify the three conditions of a λ \lambda λ -system:
Ω ∈ G A \Omega \in \cG_A Ω ∈ G A since A ∩ Ω = A ∈ L ( P ) A \cap \Omega = A \in L(\cP) A ∩ Ω = A ∈ L ( P ) .
If B ∈ G A B \in \cG_A B ∈ G A , then both A ∈ L ( P ) A\in L(\cP) A ∈ L ( P ) (assumption) and A ∩ B ∈ L ( P ) A \cap B \in L(\cP) A ∩ B ∈ L ( P ) (defintion of G A \cG_A G A ). This means A ∩ B c ∈ L ( P ) A \cap B^c \in L(\cP) A ∩ B c ∈ L ( P ) (by Fact (b) above).
By definition of G A \cG_A G A , we thus have B c ∈ G A B^c \in \cG_A B c ∈ G A .
If B 1 , B 2 , … ∈ G A B_1, B_2, \ldots \in \cG_A B 1 , B 2 , … ∈ G A (all disjoint sets),
then A ∩ B 1 , A ∩ B 2 , … A\cap B_1, A\cap B_2, \ldots A ∩ B 1 , A ∩ B 2 , … are disjoint sets in L ( P ) L(\cP) L ( P ) (by definition of G A \cG_A G A ). Since L ( P ) L(\cP) L ( P ) is a λ \lambda λ -system, we have
⋃ i = 1 ∞ ( A ∩ B i ) ∈ L ( P ) \bigcup_{i=1}^\infty (A\cap B_i) \in L(\cP) ⋃ i = 1 ∞ ( A ∩ B i ) ∈ L ( P ) .
⟹ A ⋂ ( ⋃ i = 1 ∞ B i ) ∈ L ( P ) \implies A \bigcap \left(\bigcup_{i=1}^\infty B_i\right) \in L(\cP) ⟹ A ⋂ ( ⋃ i = 1 ∞ B i ) ∈ L ( P ) (disjoint union property)
⟹ ⋃ i = 1 ∞ B i ∈ G A \implies \bigcup_{i=1}^\infty B_i \in \cG_A ⟹ ⋃ i = 1 ∞ B i ∈ G A (by definition of G A \cG_A G A ).
Therefore, G A \cG_A G A is a λ \lambda λ -system.
Now, if A ∈ P A\in \cP A ∈ P , then for any B ∈ P B\in \cP B ∈ P , A ∩ B ∈ P ⊆ L ( P ) A\cap B\in \cP\subseteq L(\cP) A ∩ B ∈ P ⊆ L ( P ) . Thus, B ∈ G A B\in \cG_A B ∈ G A for all B ∈ P B\in \cP B ∈ P , implying that P ⊆ G A \cP\subseteq \cG_A P ⊆ G A .
This means that L ( P ) ⊆ G A L(\cP)\subseteq \cG_A L ( P ) ⊆ G A , since G A \cG_A G A is a λ \lambda λ -system that contains P \cP P and L ( P ) L(\cP) L ( P ) is the smallest λ \lambda λ -system containing P \cP P .
Now assume that A ∈ P A\in \cP A ∈ P and B ∈ L ( P ) B\in L(\cP) B ∈ L ( P ) . Then B ∈ G A B\in \cG_A B ∈ G A because L ( P ) ⊆ G A L(\cP)\subseteq \cG_A L ( P ) ⊆ G A . By definition of G A \cG_A G A , we have A ∩ B ∈ L ( P ) A\cap B\in L(\cP) A ∩ B ∈ L ( P ) . This also means A ∈ G B A\in \cG_B A ∈ G B . This holds for any A ∈ P A\in \cP A ∈ P . We thus have
B ∈ L ( P ) ⟹ P ⊆ G B ⟹ L ( P ) ⊆ G B . B\in L(\cP) \implies \cP \subseteq \cG_B \implies L(\cP) \subseteq \cG_B. B ∈ L ( P ) ⟹ P ⊆ G B ⟹ L ( P ) ⊆ G B .
Therefore, for any A , B ∈ L ( P ) A,B\in L(\cP) A , B ∈ L ( P ) , we have from L ( P ) ⊆ G B L(\cP)\subseteq \cG_B L ( P ) ⊆ G B that A ∈ G B A\in \cG_B A ∈ G B
⟹ A ∩ B ∈ L ( P ) ⟹ L ( P ) is a π -system ⟹ L ( P ) is both a π -system and a λ -system ⟹ L ( P ) is a σ -field (by Fact (a)) . \begin{aligned}
&\implies A\cap B\in L(\cP)\\
&\implies L(\cP) \text{ is a } \pi\text{-system}\\
&\implies L(\cP) \text{ is both a } \pi\text{-system and a } \lambda\text{-system}\\
&\implies L(\cP) \text{ is a } \sigma\text{-field (by Fact (a))}.
\end{aligned} ⟹ A ∩ B ∈ L ( P ) ⟹ L ( P ) is a π -system ⟹ L ( P ) is both a π -system and a λ -system ⟹ L ( P ) is a σ -field (by Fact (a)) .
Hence, P ⊆ σ ( P ) ⊆ L ( P ) ⊆ L \cP \subseteq \sigma(\cP) \subseteq L(\cP) \subseteq \cL P ⊆ σ ( P ) ⊆ L ( P ) ⊆ L . In words,
P \cP P is a subset of σ ( P ) \sigma(\cP) σ ( P ) , the smallest σ \sigma σ -field containing P \cP P .
σ ( P ) \sigma(\cP) σ ( P ) is a subset of L ( P ) L(\cP) L ( P ) , the smallest λ \lambda λ -system containing P \cP P (the intersection of all λ \lambda λ -systems containing P \cP P ).
L ( P ) L(\cP) L ( P ) is a subset of L \cL L which is some arbitrary λ \lambda λ -system containing P \cP P .
This completes the proof.
Corollary
Let P 1 P_1 P 1 and P 2 P_2 P 2 be two probability measures that agree on the π \pi π -system P \cP P , i.e., P 1 ( A ) = P 2 ( A ) P_1(A) = P_2(A) P 1 ( A ) = P 2 ( A ) for all A ∈ P A\in \cP A ∈ P . Then they also agree on σ ( P ) \sigma(\cP) σ ( P ) , i.e., P 1 ( A ) = P 2 ( A ) P_1(A) = P_2(A) P 1 ( A ) = P 2 ( A ) for all A ∈ σ ( P ) A\in \sigma(\cP) A ∈ σ ( P ) .
The key idea is to define the set on which P 1 P_1 P 1 and P 2 P_2 P 2 agree and show that it is a λ \lambda λ -system. Then given that it is also a π \pi π -system, we can use the π \pi π -λ \lambda λ theorem to show that it is a σ \sigma σ -field.
Define L : = { A ∈ σ ( P ) : P 1 ( A ) = P 2 ( A ) } \cL := \{A \in \sigma(\cP) : P_1(A) = P_2(A)\} L := { A ∈ σ ( P ) : P 1 ( A ) = P 2 ( A )} . Check that L \cL L is a λ \lambda λ -system.
P 1 ( Ω ) = P 2 ( Ω ) = 1 P_1(\Omega) = P_2(\Omega) = 1 P 1 ( Ω ) = P 2 ( Ω ) = 1 .
We have that
A ∈ L ⟹ P 1 ( A ) = P 2 ( A ) ⟹ P 1 ( A c ) = 1 − P 1 ( A ) = 1 − P 2 ( A ) = P 2 ( A c ) ⟹ A c ∈ L . \begin{aligned}
A\in \cL &\implies P_1(A) = P_2(A)\\
&\implies P_1(A^c) = 1 - P_1(A) = 1 - P_2(A) = P_2(A^c)\\
&\implies A^c \in \cL.
\end{aligned} A ∈ L ⟹ P 1 ( A ) = P 2 ( A ) ⟹ P 1 ( A c ) = 1 − P 1 ( A ) = 1 − P 2 ( A ) = P 2 ( A c ) ⟹ A c ∈ L .
By countable additivity of P 1 P_1 P 1 and P 2 P_2 P 2 , we have for A 1 , A 2 , … ∈ L A_1, A_2, \ldots \in \cL A 1 , A 2 , … ∈ L disjoint sets that
⟹ P 1 ( ⋃ i A i ) = ∑ i P 1 ( A i ) = ∑ i P 2 ( A i ) = P 2 ( ⋃ i A i ) . ⟹ ⋃ i A i ∈ L . \begin{aligned}
&\implies P_1(\bigcup_i A_i) = \sum_i P_1(A_i) = \sum_i P_2(A_i) = P_2(\bigcup_i A_i).\\
&\implies \bigcup_i A_i \in \cL.
\end{aligned} ⟹ P 1 ( i ⋃ A i ) = i ∑ P 1 ( A i ) = i ∑ P 2 ( A i ) = P 2 ( i ⋃ A i ) . ⟹ i ⋃ A i ∈ L .
Therefore, L \cL L is a λ \lambda λ -system containing P \cP P (because P \cP P is the original set where P 1 P_1 P 1 and P 2 P_2 P 2 agree so P ⊆ L \cP \subseteq \cL P ⊆ L ). Then by the π \pi π -λ \lambda λ theorem, σ ( P ) ⊆ L \sigma(\cP) \subseteq \cL σ ( P ) ⊆ L . But L ⊆ σ ( P ) \cL \subseteq \sigma(\cP) L ⊆ σ ( P ) by definition. Hence, L = σ ( P ) \cL = \sigma(\cP) L = σ ( P ) , implying that P 1 ( A ) = P 2 ( A ) P_1(A) = P_2(A) P 1 ( A ) = P 2 ( A ) for all A ∈ σ ( P ) A\in \sigma(\cP) A ∈ σ ( P ) .