Random Variables

Intuitively, a random variable represents a numerical value determined by the outcome of a random experiment. For example, if we roll a die, the outcome is “the face that shows up”, and a random variable $X$ could be “the number of dots on the face”.

In our formal framework, we define a random variable as a function mapping the sample space to real numbers. But not just any function — it must preserve the structure of our probability space.
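On a finite sample space this is easy to make concrete. Here is a minimal Python sketch (the outcome labels `face1`…`face6` are a made-up encoding, not anything from the text) treating the die-roll variable $X$ as an ordinary function on $\Omega$:

```python
# Sample space: the six faces of a die, as abstract outcome labels.
omega = ["face1", "face2", "face3", "face4", "face5", "face6"]

def X(outcome):
    """The random variable: number of dots shown on the face."""
    return int(outcome[-1])

# X assigns a real number to every outcome in Omega.
values = [X(w) for w in omega]
print(values)  # [1, 2, 3, 4, 5, 6]
```

The point is only that $X$ is deterministic: all the randomness lives in which outcome $\omega$ occurs, not in the function itself.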

[Figure: visualization of a measurable mapping]

A function $X : \Omega \to S$ is called measurable (with respect to $\cF$ and $\cA$) if $X^{-1}(A) \in \cF$ for every $A \in \cA$. If $(S, \cA) = (\R, \cB)$, then $X$ is called a random variable.

When we discuss events defined by the value of $X$, we often use the shorthand notation $\{X \in B\}$ for $\{\omega : X(\omega) \in B\}$.

We want to be able to answer probabilistic questions about $X$, such as “What is the probability that $X$ is greater than 5?”. In set notation, we are asking for $\Pr(\{\omega : X(\omega) > 5\})$.

For this probability to be defined, the subset $\{\omega : X(\omega) > 5\}$ must be an event (i.e., it must belong to $\cF$), because the probability measure $\Pr$ is only defined on $\cF$. The condition $X^{-1}(B) \in \cF$ ensures precisely this: for any “reasonable” question we ask about the value of $X$ (represented by a Borel set $B$), the set of outcomes satisfying it is an event we can measure.
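To see a preimage in action, here is a sketch on a finite space where every subset is an event: two fair dice, $X$ their sum, and the question “is $X > 5$?” answered by measuring the preimage $X^{-1}((5, \infty))$:

```python
from fractions import Fraction
from itertools import product

# Sample space: ordered pairs of two die rolls, with the uniform measure.
omega = list(product(range(1, 7), repeat=2))

def X(w):
    """Random variable: the sum of the two dice."""
    return w[0] + w[1]

# The event {X > 5} is the preimage of (5, infinity) under X.
event = {w for w in omega if X(w) > 5}

# Pr({omega : X(omega) > 5}) under the uniform measure on Omega.
p = Fraction(len(event), len(omega))
print(p)  # 13/18
```

On a finite $\Omega$ with $\cF = 2^\Omega$ measurability is automatic; the condition only has teeth on richer spaces, where $\cF$ may be strictly smaller than the power set.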

Every random variable $X$ naturally comes with a $\sigma$-field, denoted $\sigma(X)$, that describes the information contained in $X$.

Intuition: $\sigma(X)$ is the smallest $\sigma$-field on $\Omega$ that makes $X$ a random variable. It contains exactly the events whose occurrence (or non-occurrence) can be determined just by knowing the value of $X$. If $\sigma(X)$ is smaller than $\cF$, it means $X$ “forgets” some information about the outcome $\omega$.
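This “forgetting” can be computed explicitly when $\Omega$ is finite and $X$ takes finitely many values: the atoms of $\sigma(X)$ are the preimages $X^{-1}(\{v\})$, and $\sigma(X)$ consists of all unions of atoms. A sketch, assuming $X$ is the parity of a single die roll:

```python
from itertools import combinations

# Finite Omega: the outcomes of one die roll.
omega = frozenset(range(1, 7))

def X(w):
    """Random variable: parity of the roll (0 = even, 1 = odd)."""
    return w % 2

# Atoms of sigma(X): the preimage of each value X takes.
atoms = [frozenset(w for w in omega if X(w) == v) for v in {X(w) for w in omega}]

# On a finite space, sigma(X) is the set of all unions of atoms.
sigma_X = set()
for r in range(len(atoms) + 1):
    for combo in combinations(atoms, r):
        sigma_X.add(frozenset().union(*combo))

print(len(sigma_X))  # 4 events, versus 2**6 = 64 in the full power set
```

Knowing only the parity, we can decide whether “the roll is even” occurred, but not whether “the roll is a 6” occurred: $\{6\} \in \cF$ but $\{6\} \notin \sigma(X)$.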

A random variable $X$ transports the probability measure from $(\Omega, \cF)$ to $(\R, \cB)$. This “transported” measure is what we usually call the distribution of $X$.

Everything we compute about the distribution of $X$ (like its PDF, CDF, mean, variance) is essentially a property of this pushforward measure $\mu_X$, defined by $\mu_X(B) = \Pr(X^{-1}(B))$ for $B \in \cB$.
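A sketch of the pushforward for the two-dice sum: once $\mu_X$ is tabulated (here as a PMF on the values of $X$), quantities like the mean can be computed from it alone, with no further reference back to $\Omega$:

```python
from collections import Counter
from fractions import Fraction
from itertools import product

# Sample space: ordered pairs of two die rolls, uniform measure.
omega = list(product(range(1, 7), repeat=2))

def X(w):
    """Random variable: the sum of the two dice."""
    return w[0] + w[1]

# Pushforward measure mu_X on singletons, i.e. the PMF of X:
# mu_X({v}) = Pr(X^{-1}({v})).
counts = Counter(X(w) for w in omega)
mu_X = {v: Fraction(c, len(omega)) for v, c in counts.items()}

# The mean of X, computed purely from mu_X.
mean = sum(v * p for v, p in mu_X.items())
print(mean)  # 7
```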

We say random variables $X$ and $Y$ are independent if the information they generate is independent, i.e., if the $\sigma$-fields $\sigma(X)$ and $\sigma(Y)$ are independent.

This is equivalent to the condition that for any Borel sets $C, D \in \cB$:

$$\Pr(X \in C,\, Y \in D) = \Pr(X \in C)\Pr(Y \in D)$$

or in terms of CDFs: $F_{X,Y}(x,y) = F_X(x)F_Y(y)$.
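The product rule can be checked directly on a finite space. A sketch with $X$ the first die and $Y$ the second (the particular sets $C$ and $D$ below are arbitrary choices for illustration):

```python
from fractions import Fraction
from itertools import product

# Sample space: ordered pairs of two die rolls, uniform measure.
omega = list(product(range(1, 7), repeat=2))
n = len(omega)

def X(w):
    return w[0]  # first die

def Y(w):
    return w[1]  # second die

def pr(pred):
    """Probability of the event {w : pred(w)} under the uniform measure."""
    return Fraction(sum(1 for w in omega if pred(w)), n)

# Check Pr(X in C, Y in D) == Pr(X in C) * Pr(Y in D) for sample sets C, D.
C, D = {1, 2}, {4, 5, 6}
lhs = pr(lambda w: X(w) in C and Y(w) in D)
rhs = pr(lambda w: X(w) in C) * pr(lambda w: Y(w) in D)
print(lhs, rhs, lhs == rhs)  # 1/6 1/6 True
```

A full verification would range over all subsets $C, D$; this sketch checks one pair, which is enough to illustrate how the joint event factors.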