Let X X X be a random variable . Then X X X induces a probability measure μ \mu μ on ( R , B ) (\mathbb{R}, \mathcal{B}) ( R , B ) (where B \mathcal{B} B is the Borel σ \sigma σ -field ) by setting:
μ ( B ) = P ( X ∈ B ) for all B ∈ B \mu(B) = \mathbb{P}(X \in B) \quad \text{for all } B \in \mathcal{B} μ ( B ) = P ( X ∈ B ) for all B ∈ B
μ \mu μ is called the distribution of X X X .
Definition: Distribution Function
The function F F F given by
F ( x ) = P ( X ≤ x ) F(x) = \mathbb{P}(X \le x) F ( x ) = P ( X ≤ x ) is called the distribution function of X X X .
Note
The distribution is fully characterized by F F F .
F ( x ) = μ ( ( − ∞ , x ] ) F(x) = \mu((-\infty, x]) F ( x ) = μ (( − ∞ , x ]) Since the collection { ( − ∞ , x ] , x ∈ R } \{(-\infty, x], x \in \mathbb{R}\} {( − ∞ , x ] , x ∈ R } is a π \pi π -system and it generates B \mathcal{B} B (refer to how B \mathcal{B} B was generated), it follows that F ( x ) F(x) F ( x ) uniquely determines μ \mu μ .
We learned here that we don’t need a function defined on all sets in B \mathcal{B} B , as B \mathcal{B} B is quite large. We only need to define it on sets of the form ( − ∞ , b ] (-\infty, b] ( − ∞ , b ] for b ∈ R b \in \mathbb{R} b ∈ R .
F F F is non-decreasing.
lim x → ∞ F ( x ) = 1 , lim x → − ∞ F ( x ) = 0 \lim_{x \to \infty} F(x) = 1, \quad \lim_{x \to -\infty} F(x) = 0 lim x → ∞ F ( x ) = 1 , lim x → − ∞ F ( x ) = 0
F F F is right continuous: lim y ↓ x F ( y ) = F ( x ) = P ( X ≤ x ) \lim_{y \downarrow x} F(y) = F(x) = \mathbb{P}(X \le x) lim y ↓ x F ( y ) = F ( x ) = P ( X ≤ x )
lim y ↑ x F ( y ) = P ( X < x ) = F ( x − ) \lim_{y \uparrow x} F(y) = \mathbb{P}(X < x) = F(x-) lim y ↑ x F ( y ) = P ( X < x ) = F ( x − ) (notation)
P ( X = x ) = F ( x ) − F ( x − ) \mathbb{P}(X=x) = F(x) - F(x-) P ( X = x ) = F ( x ) − F ( x − )
Proofs
Monotonicity of probability:
x 1 ≤ x 2 ⟹ ( − ∞ , x 1 ] ⊆ ( − ∞ , x 2 ] ⟹ P ( X ∈ ( − ∞ , x 1 ] ) ≤ P ( X ∈ ( − ∞ , x 2 ] ) ⟹ F ( x 1 ) ≤ F ( x 2 ) \begin{aligned}
x_1 \le x_2 &\implies (-\infty, x_1] \subseteq (-\infty, x_2] \\
&\implies \mathbb{P}(X \in (-\infty, x_1]) \le \mathbb{P}(X \in (-\infty, x_2]) \\
&\implies F(x_1) \le F(x_2)
\end{aligned} x 1 ≤ x 2 ⟹ ( − ∞ , x 1 ] ⊆ ( − ∞ , x 2 ] ⟹ P ( X ∈ ( − ∞ , x 1 ]) ≤ P ( X ∈ ( − ∞ , x 2 ]) ⟹ F ( x 1 ) ≤ F ( x 2 ) Continuity from above or below of probability:
lim x → ∞ ( − ∞ , x ] = R ⟹ lim x → ∞ P ( X ≤ x ) = P ( X ∈ R ) = 1 lim x → − ∞ ( − ∞ , x ] = ∅ ⟹ P ( X ∈ ∅ ) = 0 \begin{aligned}
\lim_{x \to \infty} (-\infty, x] = \mathbb{R} &\implies \lim_{x \to \infty} \mathbb{P}(X \le x) = \mathbb{P}(X \in \mathbb{R}) = 1 \\
\lim_{x \to -\infty} (-\infty, x] = \emptyset &\implies \mathbb{P}(X \in \emptyset) = 0
\end{aligned} x → ∞ lim ( − ∞ , x ] = R x → − ∞ lim ( − ∞ , x ] = ∅ ⟹ x → ∞ lim P ( X ≤ x ) = P ( X ∈ R ) = 1 ⟹ P ( X ∈ ∅ ) = 0 Continuity from above:
lim y ↓ x F ( y ) = lim y ↓ x P ( X ≤ y ) \lim_{y \downarrow x} F(y) = \lim_{y \downarrow x} \mathbb{P}(X \le y) y ↓ x lim F ( y ) = y ↓ x lim P ( X ≤ y ) As y ↓ x y \downarrow x y ↓ x , { X ≤ y } \{X \le y\} { X ≤ y } is a decreasing sequence of events, and has limit:
lim y ↓ x { X ≤ y } = ⋂ y ↓ x { X ≤ y } = { X ≤ x } \lim_{y \downarrow x} \{X \le y\} = \bigcap_{y \downarrow x} \{X \le y\} = \{X \le x\} y ↓ x lim { X ≤ y } = y ↓ x ⋂ { X ≤ y } = { X ≤ x } ⟹ lim y ↓ x P ( X ≤ y ) = P ( X ≤ x ) = F ( x ) \implies \lim_{y \downarrow x} \mathbb{P}(X \le y) = \mathbb{P}(X \le x) = F(x) ⟹ y ↓ x lim P ( X ≤ y ) = P ( X ≤ x ) = F ( x ) Similar to 3, note that:
lim y ↑ x { X ≤ y } = ⋃ y ↑ x { X ≤ y } = { X < x } \lim_{y \uparrow x} \{X \le y\} = \bigcup_{y \uparrow x} \{X \le y\} = \{X < x\} y ↑ x lim { X ≤ y } = y ↑ x ⋃ { X ≤ y } = { X < x } ⟹ lim y ↑ x P ( X ≤ y ) = P ( X < x ) \implies \lim_{y \uparrow x} \mathbb{P}(X \le y) = \mathbb{P}(X < x) ⟹ y ↑ x lim P ( X ≤ y ) = P ( X < x ) Intuitively, this property tells us that the probability of X X X taking exactly the value x x x is equal to the size of the “jump” in the distribution function at x x x .
To prove this, we can simply take the difference of the results from Property 3 (F ( x ) F(x) F ( x ) ) and Property 4 (F ( x − ) F(x-) F ( x − ) ).
Formally, note that { X = x } = { X ≤ x } ∖ { X < x } \{X = x\} = \{X \le x\} \setminus \{X < x\} { X = x } = { X ≤ x } ∖ { X < x } .
Since { X < x } ⊆ { X ≤ x } \{X < x\} \subseteq \{X \le x\} { X < x } ⊆ { X ≤ x } , we have:
P ( X = x ) = P ( X ≤ x ) − P ( X < x ) = F ( x ) − F ( x − ) \begin{aligned}
\mathbb{P}(X=x) &= \mathbb{P}(X \le x) - \mathbb{P}(X < x) \\
&= F(x) - F(x-)
\end{aligned} P ( X = x ) = P ( X ≤ x ) − P ( X < x ) = F ( x ) − F ( x − )