
Independence

The conditional probability of $A$ given $B$ is defined as

$\displaystyle P(A \vert B) := {P(A \cap B) \over P(B)}$

provided both $ A$ and $ B$ are events and $ P(B) \not= 0$.

One can say that $ A$ and $ B$ are independent if $ P(A \vert B) = P(A) $ and $ P(B \vert A) = P(B) $. Thus $ A$ and $ B$ are independent if and only if

$\displaystyle P(A \cap B) = P(A) P(B)$ (1.7)

Clearly, this last condition makes sense even if either $P(A)$ or $P(B)$ vanishes, hence it can be (and usually is) taken as the definition of independence of $A$ and $B$.
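As a simple illustration (an added example, not part of the original notes), consider one roll of a fair die and let $A$ = ``the outcome is even'' and $B$ = ``the outcome is at most 4''. Then $P(A) = 1/2$, $P(B) = 2/3$ and $P(A \cap B) = P(\{2, 4\}) = 1/3 = P(A)\, P(B)$, so $A$ and $B$ are independent in the sense of (1.7), even though they are neither disjoint nor nested.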

Two random variables $X$ and $Y$ are independent if the events $(X \le x)$ and $(Y \le y)$ are independent in the sense of (1.7) for each choice of $x, y \in \mathbb{R}$, i.e. if

$\displaystyle P(X \le x, Y \le y) = P(X \le x) P(Y \le y)$ (1.8)

In other words $ X$ and $ Y$ are independent if and only if

$\displaystyle F_{X,Y}(x,y) = F_X(x) F_Y(y).$ (1.9)

A random vector $X \in \mathbb{R}^n$ has independent components if all its marginals $F_{X_{i_1}, \ldots, X_{i_k}}$ have the multiplicative property

$\displaystyle F_{X_{i_1}, \ldots, X_{i_k}} = F_{X_{i_1}} \cdots F_{X_{i_k}},$

for each ordered $k$-tuple $(i_1, \ldots, i_k)$ with $\{i_1, \ldots, i_k\} \subset \{1, \ldots, n\}$ and $k \leq n$. N.B. It does not suffice to require that $F_{X_1, \ldots, X_n}$ have the multiplicative property for $k = n$ only.

If $ X$ and $ Y$ have a joint density $ f_{X Y} $, then both $ X$ and $ Y$ have a density ($ f_X$ and $ f_Y$, respectively), namely the marginals

$\displaystyle f_X(x) = \int_{-\infty}^{+\infty} f_{XY}(x, y)\, dy, \qquad f_Y(y) = \int_{-\infty}^{+\infty} f_{XY}(x, y)\, dx.$

If the vectors $X$ and $Y$ are independent and have a joint density $f_{XY}$, then

$\displaystyle f_{X Y}(x, y) = f_X(x) f_Y(y)$ (1.10)

Conversely, if the joint density factors into the product of the marginal densities, then (1.9) holds, i.e. $X$ and $Y$ are independent. Thus (1.10) is a necessary and sufficient condition for independence.
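As a quick numerical illustration of (1.10) (an added sketch, not part of the original notes), one can check the factorization for two independent standard Gaussian variables, whose joint density is the bivariate normal density with identity covariance; NumPy and SciPy are assumed to be available.

import numpy as np
from scipy.stats import norm, multivariate_normal

# Two independent N(0,1) variables: joint density = bivariate normal, identity covariance
x, y = 0.7, -1.2
joint = multivariate_normal(mean=[0.0, 0.0], cov=np.eye(2)).pdf([x, y])
product = norm.pdf(x) * norm.pdf(y)   # f_X(x) * f_Y(y)
print(joint, product)                 # the two values coincide up to rounding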

Let $ X$ and $ Y$ be independent and let $ \phi$ and $ \psi$ be ``regular'' functions. Then:

$\displaystyle E[\phi(X) \psi(Y)] = \int_{-\infty}^{+\infty} \int_{-\infty}^{+\infty} \phi(x) \psi(y)\, dF_{XY}(x, y) = \int_{-\infty}^{+\infty} \phi(x)\, dF_X(x) \int_{-\infty}^{+\infty} \psi(y)\, dF_Y(y),$

i.e. under independence of $ X$ and $ Y$,

$\displaystyle E[\phi(X) \psi(Y)]= E[\phi(X)] E[\psi(Y)]$ (1.11)
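Property (1.11) is easy to check by Monte Carlo simulation (an added sketch, not from the original notes; the particular distributions and functions below are arbitrary choices).

import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=1_000_000)     # X ~ N(0,1)
Y = rng.uniform(size=1_000_000)    # Y ~ U(0,1), generated independently of X
phi, psi = np.cos, np.square       # two "regular" functions
print(np.mean(phi(X) * psi(Y)))              # estimate of E[phi(X) psi(Y)]
print(np.mean(phi(X)) * np.mean(psi(Y)))     # estimate of E[phi(X)] E[psi(Y)]

For large samples the two estimates agree to within Monte Carlo error.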

In particular, if $ X, Y $ are independent, then

$\displaystyle \mbox{Cov}[X, Y] = \left( \begin{array}{cc} \mbox{Var}[X] & 0 \\ 0 & \mbox{Var}[Y] \end{array} \right)$

In fact, by (1.11)

$\displaystyle E[(X - E[X]) (Y - E[Y])] = E[(X - E[X])]\, E[(Y - E[Y])] = 0.$

Moreover

$\displaystyle \mbox{Var}[a X + b Y] = a^2\, \mbox{Var}[X] + 2 a b\, E[(X - E[X])(Y - E[Y])] + b^2\, \mbox{Var}[Y],$

i.e.

$\displaystyle \mbox{Var}[a X + b Y] = a^2\, \mbox{Var}[X] + b^2\, \mbox{Var}[Y]$

if $ X$ and $ Y$ are independent.
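The same identity can be checked numerically (again an illustrative sketch; the coefficients and distributions below are arbitrary choices).

import numpy as np

rng = np.random.default_rng(1)
a, b = 2.0, -3.0
X = rng.normal(loc=1.0, scale=2.0, size=1_000_000)   # Var[X] = 4
Y = rng.exponential(scale=0.5, size=1_000_000)       # Var[Y] = 0.25, independent of X
print(np.var(a * X + b * Y))                         # empirical Var[aX + bY]
print(a**2 * np.var(X) + b**2 * np.var(Y))           # a^2 Var[X] + b^2 Var[Y]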

The above results can be generalized for random vectors. If an $ n$-dimensional random vector $ X$ has independent components, then

$\displaystyle \mbox{Cov}[X] = \mbox{diag}(\mbox{Var}[X_1], \ldots, \mbox{Var}[X_n])$

and

$\displaystyle \mbox{Var}[c^T X] = c_1^2\, \mbox{Var}[X_1] + \cdots + c_n^2\, \mbox{Var}[X_n].$
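A small numerical check of the vector case (an added sketch; the weights and component variances are arbitrary choices):

import numpy as np

rng = np.random.default_rng(2)
c = np.array([1.0, -2.0, 0.5])
# three independent components with standard deviations 1, 3 and 0.2
X = rng.normal(scale=[1.0, 3.0, 0.2], size=(1_000_000, 3))
print(np.cov(X, rowvar=False))                   # approximately diag(1, 9, 0.04)
print(np.var(X @ c), c**2 @ np.var(X, axis=0))   # Var[c^T X] vs sum of c_i^2 Var[X_i]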

Let us apply the above results to a particular situation: sampling a given random variable, as when a measurement is repeated a certain number of times.

Suppose $X$ is a given random variable. A sample of length $n$ from $X$ is a sequence of independent random variables $X_1, \ldots, X_n$, each having the same distribution as $X$. The components of the sample are said to be independent and identically distributed (``iid'' for short). The sample mean

$\displaystyle M_n := { X_1 + \ldots + X_n \over n }$ (1.12)

is computed in order to estimate the ``value'' of $X$.

If

$\displaystyle E[X] = m, \qquad \mbox{Var}[X] = \sigma^2,$

then

$\displaystyle E[M_n] = m, \qquad \mbox{Var}[M_n] = {\sigma^2 \over n}$ (1.13)

and the advantage of forming the sample average becomes apparent: while the mean is unaltered, the variance decreases as the number $n$ of observations increases. Thus, forming the arithmetic mean of a sample of measurements yields a more precise estimate. The situation is depicted in figure 1.8.
Figure 1.8: Density of the sample mean when the number of observations $n$ increases.
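The variance reduction in (1.13) is easy to reproduce by simulation (an added sketch; the Gaussian measurement model and the constants below are arbitrary choices).

import numpy as np

rng = np.random.default_rng(3)
m, sigma = 5.0, 2.0
for n in (10, 100, 1000):
    # 20000 independent samples of length n, one sample mean per row
    samples = rng.normal(loc=m, scale=sigma, size=(20_000, n))
    M_n = samples.mean(axis=1)
    print(n, M_n.mean(), M_n.var(), sigma**2 / n)   # Var[M_n] tracks sigma^2 / n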

One feels tempted to assert that

$\displaystyle M_n \rightarrow m \quad \mbox{as} \quad n \rightarrow \infty.$

In fact it is true that

$\displaystyle P \left( \lim_{n \rightarrow \infty} M_n = m \right) = 1$ (1.14)

if $ X_1, X_2, \ldots$ are iid random variables with mean $ m$. This result is known as the Strong Law of Large Numbers.
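The almost sure convergence in (1.14) can be visualized by following the running mean of a single simulated iid sequence (an illustrative sketch; the exponential distribution below, with mean $m = 1.5$, is an arbitrary choice).

import numpy as np

rng = np.random.default_rng(4)
m = 1.5
X = rng.exponential(scale=m, size=100_000)             # iid sequence with mean m
running_mean = np.cumsum(X) / np.arange(1, X.size + 1)
print(running_mean[[99, 9_999, 99_999]])               # M_100, M_10000, M_100000 drift toward 1.5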

On the other hand, we know that

$\displaystyle E[M_n] = m, \qquad \mbox{Var}[M_n] = \sigma^2 / n,$

but we know nothing about the distribution of $ M_n$. It is true that

$\displaystyle E[Z_n] = 0, \qquad \mbox{Var}[Z_n] = 1,$

where

$\displaystyle Z_n := {M_n - m \over \sigma / \sqrt{n} }$

The Central Limit Theorem states that if $ X_1, \ldots, X_n$ are iid with mean $ m$ and variance $ \sigma^2$, then

$\displaystyle \lim_{n \rightarrow \infty} P(Z_n \le x) = {1 \over \sqrt{2 \pi}} \int_{-\infty}^{x} e^{-\xi^2 / 2}\, d\xi$

uniformly in $x \in \mathbb{R}$.

Thus, the sample mean is given by

$\displaystyle M_n = m + {\sigma \over \sqrt{n}}\, Z_n,$

where the normalized errors $Z_n$ are ``asymptotically Gaussian''.
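A simulation makes the ``asymptotically Gaussian'' statement concrete (an added sketch; the uniform distribution and the sample size below are arbitrary choices): the empirical distribution of $Z_n$ is compared with the standard normal CDF at a few points.

import numpy as np
from scipy.stats import norm

rng = np.random.default_rng(5)
n, reps = 50, 100_000
samples = rng.uniform(size=(reps, n))       # X_i ~ U(0,1): m = 1/2, sigma^2 = 1/12
M_n = samples.mean(axis=1)
Z_n = (M_n - 0.5) / (np.sqrt(1.0 / 12.0) / np.sqrt(n))
for x in (-1.0, 0.0, 1.5):
    print(x, np.mean(Z_n <= x), norm.cdf(x))   # empirical P(Z_n <= x) vs Phi(x)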