I have a few questions about the last part. How does applying the 8.11 make the rest of the terms 0?
and for the next equality, it looks like we move (T - $\lambda_1$I)$^k$ to operate on $v_1$ first. How can we do that without knowing all terms commute with each other?
