1

Suppose $X$ is a $n \times p$ matrix, where $rank(X) < p$. Since $X$ is not full rank, then it is not invertible.

I'm trying to understand whether functions of $X$ are invertible:

  1. $X^TX$
  2. $X^TX + \lambda I_p$ ($\lambda > 0$ is some scalar, and $I_p$ is a $p \times p$ identity matrix)
  3. $X(X^TX + \lambda I_p)^{-1}X^T$

My intuition is that 1) is NOT invertible, but 2) IS invertible. Given that 2) is invertible, however, 3) is NOT invertible.

Is this correct? Can anyone help me understand why this is?

Adrian
  • 2,076

1 Answers1

1

$\DeclareMathOperator{\rank}{rank}$

$1$ is not invertible, and it is because $\rank X = \rank X^TX$. See Prove that $\text{rank}(X^TX)=\text{rank}(X)$.

$2$ is invertible, as long as $\lambda > 0$. See When is $\mathbf{X}^{T}\mathbf{X}+\lambda\mathbf{I}$ invertible?.

$3$ is known as the hat matrix for ridge regression. Thanks to Brian Borchers. If $n > p$,

$$\rank X(X^TX + \lambda I)^{-1}X^T \leq \min (\rank X, \rank (X^TX + \lambda I)^{-1}X^T)$$

We know that $\rank X <p$ and $(X^TX + \lambda I)^{-1}X^T$ is a $p \times n$ matrix. Thus $\rank (X^TX + \lambda I)^{-1}X^T \leq \min(n, p) = p$ and it must be the case that

$$\rank X(X^TX + \lambda I)^{-1}X^T\leq \min (\rank X, \rank (X^TX + \lambda I)^{-1}X^T) < p < n$$

hence $X(X^TX + \lambda I)^{-1}X^T$ (an $n \times n$ matrix) is not invertible.

Clarinetist
  • 20,278
  • 10
  • 72
  • 137