Let $T: L^2(\mathbb R) \to L^2(\mathbb R)$ be an operator that commutes with all translations $f(x) \mapsto f(x-y)$. Why does it follow that the Fourier transform of $Tf$ is then given by multiplication with a (bounded) function? In Bounded linear operators that commute with translation the user robjohn gives a seemingly easy to follow proof. But I have some problems understanding the calculation: let $\psi$ be the Gaussian such that $\hat \psi = \psi$. Then
\begin{align} \psi(\xi)\;(Tf)^\wedge(\xi) &=(\psi*Tf)^\wedge(\xi)\\ &=\left(\int_{\mathbb{R}^n}\psi(x-y)\;Tf(y)\;\mathrm{d}y\right)^\wedge(\xi)\\ &=\left(\int_{\mathbb{R}^n}\psi(y)\;Tf(x-y)\;\mathrm{d}y\right)^\wedge(\xi)\\ &=\left(\int_{\mathbb{R}^n}T^{\;*}\psi(y)\;f(x-y)\;\mathrm{d}y\right)^\wedge(\xi)\\ &=(T^{\;*}\psi)^\wedge(\xi)\hat{f}(\xi) \end{align}
1) Where exactly is the assumption that $T$ commutes with translations, is used?
In the literature i only found a more general proof, e.g., in Stein: Introduction to Fourier analysis on Euclidean spaces, Theorem 3.16, but I am interested in a simple proof in the above basic setting.
Also on more conceptual level I would like to understand how one can see this as the fact that Fourier transform diagonalizes translation invariant operators.