Random Vectors


Notation

| Symbol | Type | Description |
|---|---|---|
| $X$ | Random vector | A vector of jointly distributed random variables $X = [X_1, X_2, \dots, X_p]^T$ |
| $E[X]$ | Vector | Expectation of the random vector $X$ |
| $\mu_X$, $\mu$ | Vector | Alternative notations for the expectation $E[X]$ |
| $\operatorname{Var}(X)$ | Matrix | Variance-covariance matrix (or simply covariance matrix) of $X$ |
| $\Sigma_X$, $\Sigma$, $\operatorname{Cov}(X)$ | Matrix | Alternative notations for the variance-covariance matrix |
| $\operatorname{Cov}(X, Y)$ | Matrix | Covariance matrix between two random vectors $X$ and $Y$ |
| $AX$ | Linear transformation | Transformation of the random vector $X$ by a $k \times p$ matrix $A$ |
| $\operatorname{Cov}(X_i, Y_j)$ | Scalar | Covariance between the $i$-th component of $X$ and the $j$-th component of $Y$ |
| $E[X]$ | Column vector | Expectation of the random vector $X$ expressed as a column matrix |
| $E[XY^T]$ | Matrix | Matrix of expected pairwise products between components of $X$ and $Y$ |
| $X_1, X_2, \dots, X_p$ | Random variables | Components of the random vector $X$ |

Distinction: Variance-Covariance Matrix vs. Covariance Matrix:

The terms variance-covariance matrix and covariance matrix are often used interchangeably, but depending on context they can carry a subtle distinction:

  1. Variance-Covariance Matrix: Refers specifically to the covariance matrix of a single random vector.
  2. Covariance Matrix: A more general term that applies to the covariance between two random vectors.

Abbreviations

| Abbreviation | Description |
|---|---|
| r.v. | Random variable |

Notation

  • The expected value of a random vector $X$ is often denoted by $E(X)$, $E[X]$, or $EX$, with $E$ also often stylized as $\mathbb{E}$, or written symbolically as $\mu_X$ or simply $\mu$.
  • The variance of a random vector $X$ is typically designated as $\operatorname{Var}(X)$, or sometimes as $\operatorname{Cov}(X)$. Since the variance is a variance-covariance matrix, it is also denoted $\Sigma_X$ or $\Sigma$; the element in the $i$-th row and $j$-th column is $\Sigma_{ij}$.
  • The covariance of two random vectors $X$ and $Y$ is typically designated as $\operatorname{Cov}(X, Y)$. Since the covariance is itself a matrix, it is also denoted $\Sigma(X, Y)$.

Definition

Definition: A random vector $X$ is a vector $X = \begin{bmatrix} X_1 \\ X_2 \\ \vdots \\ X_p \end{bmatrix}$ of jointly distributed random variables $X_1, \dots, X_p$. As is customary in linear algebra, we will write vectors as column matrices whenever convenient.

Expectation of a random vector

Definition: The expectation $E[X]$ of a random vector $X = [X_1, X_2, \dots, X_p]^T$ is given by

$$E[X] = \begin{bmatrix} E[X_1] \\ E[X_2] \\ \vdots \\ E[X_p] \end{bmatrix}$$

It is also denoted $\mu_X$ or $\mu$.
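
As a quick numerical sketch (assuming numpy; the distribution and sample size below are arbitrary choices for illustration), the expectation of a random vector can be estimated componentwise by the sample mean:

```python
import numpy as np

rng = np.random.default_rng(0)

# Draw n samples of a 3-dimensional random vector X (rows = samples).
# The components are independent with means [1.0, 2.0, 3.0], purely for illustration.
n, mu_true = 100_000, np.array([1.0, 2.0, 3.0])
X = rng.normal(loc=mu_true, scale=1.0, size=(n, 3))

# E[X] is the vector of componentwise expectations; the sample mean estimates it.
mu_hat = X.mean(axis=0)
print(mu_hat)  # close to [1.0, 2.0, 3.0]
```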

Linearity of expectation

Recall that expectation is a linear operation for random variables; this linearity also holds for random vectors.

The linearity properties of the expectation can be expressed compactly by stating that for any $k \times p$ matrix $A$ and any $1 \times j$ matrix $B$,

$$E[AX] = A\,E[X] \quad \text{and} \quad E[XB] = E[X]\,B$$
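
A minimal check of $E[AX] = A\,E[X]$ on simulated data (again assuming numpy; the $2 \times 3$ matrix $A$ is an arbitrary choice):

```python
import numpy as np

rng = np.random.default_rng(1)

# n samples of a 3-dimensional random vector, stacked as rows.
X = rng.normal(loc=[1.0, 2.0, 3.0], scale=1.0, size=(100_000, 3))
A = np.array([[1.0, 0.0, -1.0],
              [2.0, 1.0, 0.0]])  # a 2x3 matrix

# Transform each sample: (A x_i) stacked as rows is X @ A.T, then average.
lhs = (X @ A.T).mean(axis=0)   # estimate of E[AX]
rhs = A @ X.mean(axis=0)       # A times the estimate of E[X]
print(np.allclose(lhs, rhs))   # True: equal up to floating-point error
```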

Variance of a random vector

The variance of a random vector $X$ is represented as a matrix, known as the variance-covariance matrix (often simply referred to as the covariance matrix in some literature):

$$\operatorname{Var}(X) = \operatorname{Cov}(X, X) = E\left[(X - E[X])(X - E[X])^T\right]$$

It is also denoted $\Sigma_X$, $\Sigma$, or $\operatorname{Cov}(X)$.

Expectation --> Variance

One important property is that

$$\operatorname{Var}(X) \equiv \operatorname{Cov}(X, X) = E\left[(X - E[X])(X - E[X])^T\right] = E[XX^T] - E[X]\,E[X]^T$$

The proof follows directly from the corresponding identity for the covariance $\operatorname{Cov}(X, Y)$, derived below, by setting $Y = X$.
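
A numerical sanity check of $\operatorname{Var}(X) = E[XX^T] - E[X]E[X]^T$ (a sketch assuming numpy; the correlated Gaussian below is an arbitrary test case):

```python
import numpy as np

rng = np.random.default_rng(2)

# Correlated 2-dimensional Gaussian samples, rows = samples.
Sigma_true = np.array([[2.0, 0.6],
                       [0.6, 1.0]])
X = rng.multivariate_normal(mean=[0.0, 1.0], cov=Sigma_true, size=200_000)

mu_hat = X.mean(axis=0)
# E[X X^T] estimated by averaging the outer products x_i x_i^T.
second_moment = (X.T @ X) / len(X)
var_hat = second_moment - np.outer(mu_hat, mu_hat)

print(var_hat)                  # close to Sigma_true
print(np.cov(X, rowvar=False))  # numpy's estimate (uses n-1 normalization)
```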

Covariance between two random vectors

For two jointly distributed real-valued random vectors $X$ and $Y$, the covariance is represented as a matrix, called the covariance matrix:

$$\operatorname{Cov}(X, Y) = E\left[(X - E[X])(Y - E[Y])^T\right]$$

It is also denoted $\Sigma(X, Y)$.

The covariance matrix

For two random vectors $X = [X_1, X_2, \dots, X_p]^T \in \mathbb{R}^p$ and $Y = [Y_1, Y_2, \dots, Y_q]^T \in \mathbb{R}^q$, their covariance matrix is a $p \times q$ matrix defined as:

$$\operatorname{Cov}(X, Y) = \begin{bmatrix} \operatorname{Cov}(X_1, Y_1) & \operatorname{Cov}(X_1, Y_2) & \cdots & \operatorname{Cov}(X_1, Y_q) \\ \operatorname{Cov}(X_2, Y_1) & \operatorname{Cov}(X_2, Y_2) & \cdots & \operatorname{Cov}(X_2, Y_q) \\ \vdots & \vdots & \ddots & \vdots \\ \operatorname{Cov}(X_p, Y_1) & \operatorname{Cov}(X_p, Y_2) & \cdots & \operatorname{Cov}(X_p, Y_q) \end{bmatrix}$$

Here:

  • $\operatorname{Cov}(X_i, Y_j)$ represents the covariance between the random variables $X_i$ (from $X$) and $Y_j$ (from $Y$).
  • If $X = Y$, this matrix reduces to the variance-covariance matrix of $X$, which is symmetric because $\operatorname{Cov}(X_i, X_j) = \operatorname{Cov}(X_j, X_i)$ by the definition of covariance for random variables.

Expectation --> Covariance

$$\begin{aligned} \operatorname{Cov}(X, Y) &= E\left[(X - E[X])(Y - E[Y])^T\right] \\ &= E\left[XY^T - X E[Y]^T - E[X] Y^T + E[X] E[Y]^T\right] \\ &= E[XY^T] - E\left[X E[Y]^T\right] - E\left[E[X] Y^T\right] + E\left[E[X] E[Y]^T\right] \\ &= E[XY^T] - E[X] E[Y]^T - E[X] E[Y]^T + E[X] E[Y]^T \\ &= E[XY^T] - E[X] E[Y]^T \end{aligned}$$
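
To make the $p \times q$ cross-covariance concrete, here is a sketch (assuming numpy; $Y$ is built from $X$ plus noise purely so that the two vectors are correlated):

```python
import numpy as np

rng = np.random.default_rng(3)

n = 200_000
X = rng.normal(size=(n, 3))                    # p = 3
Y = X[:, :2] + 0.5 * rng.normal(size=(n, 2))   # q = 2, correlated with X by construction

# Cov(X, Y) = E[XY^T] - E[X]E[Y]^T, estimated from samples: a 3x2 matrix.
mu_x, mu_y = X.mean(axis=0), Y.mean(axis=0)
cross_cov = (X.T @ Y) / n - np.outer(mu_x, mu_y)
print(cross_cov.shape)  # (3, 2)
print(cross_cov)        # close to [[1, 0], [0, 1], [0, 0]]
```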

Linear combinations of random variables

Consider random variables $X_1, \dots, X_p$. We want to find the expectation and variance of a new random variable $L(X_1, \dots, X_p)$ obtained as a linear combination of $X_1, \dots, X_p$; that is,

$$L(X_1, \dots, X_p) = \sum_{i=1}^{p} a_i X_i$$

Using vector-matrix notation we can write this in a compact way: $L(X) = a^T X$, where $a^T = [a_1, \dots, a_p]$. Then we get:

$$E[L(X)] = E[a^T X] = a^T E[X],$$

and

$$\begin{aligned} \operatorname{Var}[L(X)] &= E\left[L(X) L(X)^T\right] - E[L(X)]\,E[L(X)]^T \\ &= E\left[a^T X X^T a\right] - \left(a^T E[X]\right)\left(a^T E[X]\right)^T \\ &= a^T E[XX^T]\, a - a^T E[X]\,(E[X])^T a \\ &= a^T \left(E[XX^T] - E[X](E[X])^T\right) a \\ &= a^T \operatorname{Cov}(X)\, a \end{aligned}$$

Thus, knowing $E[X]$ and $\operatorname{Cov}(X)$, we can easily find the expectation and variance of any linear combination of $X_1, \dots, X_p$.
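
A quick numerical check of $\operatorname{Var}(a^T X) = a^T \operatorname{Cov}(X)\,a$ (a sketch assuming numpy; $a$ and the covariance below are arbitrary):

```python
import numpy as np

rng = np.random.default_rng(4)

Sigma = np.array([[2.0, 0.6],
                  [0.6, 1.0]])
X = rng.multivariate_normal(mean=[0.0, 0.0], cov=Sigma, size=300_000)
a = np.array([1.0, -2.0])

# Variance of the linear combination a^T X, computed two ways.
empirical = (X @ a).var()      # sample variance of a^T x_i
theoretical = a @ Sigma @ a    # a^T Sigma a
print(empirical, theoretical)  # both close to 2 - 2.4 + 4 = 3.6
```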

Corollary: $\Sigma$ is positive semi-definite

Corollary: If $\Sigma$ is the covariance matrix of a random vector $X$, then for any constant vector $a$ we have $a^T \Sigma a \geq 0$.

That is, $\Sigma$ satisfies the property of being a positive semi-definite (PSD) matrix.

Proof: By the previous section, $a^T \Sigma a = \operatorname{Var}(a^T X)$ is the variance of a random variable, and variance is always non-negative.

This suggests the converse question: given a symmetric, positive semi-definite matrix $\Sigma$, is it the covariance matrix of some random vector? The answer is yes: any symmetric PSD matrix admits a factorization $\Sigma = A A^T$ (for instance via the Cholesky decomposition or a matrix square root), and if $Z$ is a random vector with i.i.d. components of unit variance, then $X = AZ$ has $\operatorname{Cov}(X) = A \operatorname{Cov}(Z) A^T = A I A^T = \Sigma$, using the linear-transform property derived in the next section.
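A sketch of this construction in code (assuming numpy; the PSD matrix below is arbitrary, and `np.linalg.cholesky` plays the role of the factor $A$):

```python
import numpy as np

rng = np.random.default_rng(5)

# Any symmetric PSD matrix; here a hand-picked 3x3 example.
Sigma = np.array([[4.0, 2.0, 0.0],
                  [2.0, 3.0, 1.0],
                  [0.0, 1.0, 2.0]])

A = np.linalg.cholesky(Sigma)       # Sigma = A A^T
Z = rng.normal(size=(500_000, 3))   # i.i.d. components with unit variance
X = Z @ A.T                         # each row is A z_i

print(np.cov(X, rowvar=False))      # close to Sigma
```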

Linear transform of a random vector

Consider a random vector $X$ with covariance matrix $\Sigma$. Then, for any $k$-dimensional constant vector $c$ and any $p \times k$ matrix $A$, the $k$-dimensional random vector $c + A^T X$ has mean $c + A^T E[X]$ and has covariance matrix

$$\operatorname{Cov}(c + A^T X) = A^T \Sigma A$$

The proof is quite simple:

Let $Y = c + A^T X$. By the linearity of the expectation operator, its expectation is $E[Y] = E[c + A^T X] = c + A^T E[X]$. Thus,

$$Y - E[Y] = (c + A^T X) - (c + A^T E[X]) = A^T (X - E[X]).$$

Therefore,

$$\begin{aligned} \operatorname{Cov}(c + A^T X) = \operatorname{Cov}(Y) &= E\left[(Y - E[Y])(Y - E[Y])^T\right] \\ &= E\left[\left(A^T (X - E[X])\right)\left(A^T (X - E[X])\right)^T\right] \\ &= E\left[A^T (X - E[X])(X - E[X])^T A\right] \\ &= A^T E\left[(X - E[X])(X - E[X])^T\right] A \\ &= A^T \Sigma A \end{aligned}$$

Remember that $\Sigma \equiv \operatorname{Cov}(X)$.
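
Verifying $\operatorname{Cov}(c + A^T X) = A^T \Sigma A$ on simulated data (a sketch assuming numpy; $c$, $A$, and $\Sigma$ are arbitrary choices):

```python
import numpy as np

rng = np.random.default_rng(6)

Sigma = np.array([[2.0, 0.5],
                  [0.5, 1.0]])
X = rng.multivariate_normal(mean=[1.0, -1.0], cov=Sigma, size=300_000)

A = np.array([[1.0, 0.0, 2.0],
              [0.0, 1.0, -1.0]])  # p x k with p = 2, k = 3
c = np.array([10.0, 20.0, 30.0])  # shifting by c leaves the covariance unchanged

Y = c + X @ A                     # rows are c + A^T x_i
print(np.cov(Y, rowvar=False))    # close to A^T Sigma A
print(A.T @ Sigma @ A)
```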

What if all elements are independent?

If $X_1, X_2, \dots, X_p$ are i.i.d. (independent and identically distributed) with common variance $\sigma^2$, then $\operatorname{Cov}([X_1, X_2, \dots, X_p]^T)$, i.e., the covariance matrix $\Sigma$, is a diagonal matrix with $\sigma^2$ on the diagonal and zeros elsewhere:

$$\Sigma = \begin{bmatrix} \sigma^2 & 0 & \cdots & 0 \\ 0 & \sigma^2 & \cdots & 0 \\ \vdots & \vdots & \ddots & \vdots \\ 0 & 0 & \cdots & \sigma^2 \end{bmatrix} = \sigma^2 I_p$$

where $I_p$ is the $p \times p$ identity matrix.

Proof:

  • The diagonal elements $\Sigma_{ii}$ represent the variance of each $X_i$:

$$\Sigma_{ii} = \operatorname{Var}(X_i) = \sigma^2 \quad \text{for all } i$$

  • The off-diagonal elements $\Sigma_{ij}$ represent the covariance between different $X_i$ and $X_j$. Since $X_i$ and $X_j$ are independent, we have:

$$\Sigma_{ij} = \operatorname{Cov}(X_i, X_j) = 0 \quad \text{for } i \neq j$$
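
And a final numerical sketch (assuming numpy; $\sigma$ and $p$ below are arbitrary): the sample covariance of i.i.d. components is close to $\sigma^2 I_p$.

```python
import numpy as np

rng = np.random.default_rng(7)

sigma = 1.5
# n samples of a vector with p = 4 i.i.d. components, each with variance sigma^2.
X = rng.normal(loc=0.0, scale=sigma, size=(500_000, 4))

print(np.cov(X, rowvar=False))  # close to sigma^2 * I_4 = 2.25 * I_4
```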