I am reading from Linear Algebra in Action by Dym and am working through the proof of the Birkhoff-von Neumann Theorem. For the sake of clarity, ill write up everything until the point where I get lost.
Theorem: Let $P \in \mathbb{R}^{n\times n}$ be a doubly stochastic matrix.Then $P$ is a convex combination of finitely many permutation matrices.
Proof: If $P$ is a permutation matrix, then the assertion is self-evident. If $P$ is not a permutation matrix, them, in the view of Lemma 23.13
Lemma 23.13: Let $A \in \mathbb{R}^{n \times n}$ be a doubly stochastic matrix. Then $\operatorname{per}(A)>0$.
and the fact that $P$ is doubly stochastic, there exists a permutation $\sigma$ of the integers $\{1,\dots,n\}$ such that $1 > p_{1\sigma(1)}p_{2\sigma(2)} \cdots p_{n\sigma(n)} > 0.$ Let $$\lambda_1=\min\{p_{1\sigma(1)},p_{2\sigma(2)},\dots, p_{n\sigma(n)}\}$$ and let $\Pi_1$ be the permutation matrix with 1's in the $i\sigma(i)$ position for $i=1,\dots,n$. Then it is readily checked that $$P_1 = \frac{P-\lambda_1\Pi_1}{1-\lambda_1}$$ is a doubly stochastic matrix with at least one more zero entry than $P$ and that ...
I do not see how that is clearly the case? Can someone explain that to me?