Let's say I have a "data" matrix $X$ with $N$ rows and $p$ columns, where $N \gg p$. PCA with $L$ components can be formulated as $$X_L = \operatorname*{arg\,min}_{Y:\, \operatorname{rank}(Y) = L} \|X - Y\|^2_F,$$ where $Y$ is a rank-$L$ approximation of $X$.
It's known that $X_L = X W_L W^T_L$ (see here e.g.), where $W_L$ is the $p \times L$ matrix of the first $L$ principal components of $X$. However, suppose I instead choose an arbitrary orthonormal basis $w_1, w_2, \dots, w_p$ and construct $W_L = [w_1\, w_2\, \dots\, w_L]$ for $L = 1, 2, \dots, p$. Am I guaranteed that the "unexplained" variance $\|X - X_L\|^2_F$ decreases monotonically in $L$, the number of dimensions I retain, if the matrix $W_L$ is built iteratively (that is, once we fix $w_1$ we may only add $w_2, w_3$, etc., but never throw away $w_1$)? It feels like the answer is "yes", but maybe I'm missing something? It certainly decreases from $L = 1$ to $L = p$.
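For what it's worth, here is a quick numerical sanity check I would run (a minimal NumPy sketch; the random $X$ and the QR-based orthonormal basis are just placeholder choices, not part of the question):

```python
import numpy as np

rng = np.random.default_rng(0)
N, p = 200, 8
X = rng.standard_normal((N, p))

# An arbitrary orthonormal basis w_1, ..., w_p (columns of Q), fixed in advance,
# deliberately not the PCA basis of X.
Q, _ = np.linalg.qr(rng.standard_normal((p, p)))

errs = []
for L in range(1, p + 1):
    W_L = Q[:, :L]            # keep the first L fixed directions, never dropping earlier ones
    X_L = X @ W_L @ W_L.T     # rank-L reconstruction of X in that basis
    errs.append(np.linalg.norm(X - X_L, "fro") ** 2)

print(np.round(errs, 3))
print("monotone non-increasing:", all(a >= b - 1e-9 for a, b in zip(errs, errs[1:])))
```

In my runs the unexplained variance never increases as $L$ grows, which is consistent with my intuition, but of course a simulation is not a proof.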