When calculating Weighted Least Square Solution, after taking the derivative, we will have the following equation:
$X^\top WX\beta=X^\top Wy$
where $X_{n\times m}$ is the data matrix, with $n\geq m$ and $X$ in full rank;
$W$ is the weight matrix, with $W$ being non-zero diagonal matrix.
I want to know how can we prove that $X^\top WX$ is non-singular, in which case we can derive the weighted least square solution $\beta=(X^\top WX)^{-1}X^\top Wy$