For an image denoising problem, the author has a functional $E$ defined
$$E(u) = \iint_\Omega F \;\mathrm d\Omega$$
which he wants to minimize. $F$ is defined as
$$F = \|\nabla u \|^2 = u_x^2 + u_y^2$$
Then, the E-L equations are derived:
$$\frac{\partial E}{\partial u} = \frac{\partial F}{\partial u} - \frac{\mathrm d}{\mathrm dx} \frac{\partial F}{\partial u_x} - \frac{\mathrm d}{\mathrm dy} \frac{\partial F}{\partial u_y} = 0$$
Then it is mentioned that gradient descent method is used to minimize the functional $E$ by using
$$\frac{\partial u}{\partial t} = u_{xx} + u_{yy}$$
which is the heat equation. I understand both equations, and have solved the heat equation numerically before. I also worked with functionals. I do not understand however how the author jumps from the Euler-Lagrange equations to the gradient descent method. How is the time variable $t$ included? Any detailed derivation, proof on this relation would be welcome. I found some papers on the Net, the one by Colding et al. looked promising.
References
Tobias H. Colding, William P. Minicozzi II, Minimal surfaces and mean curvature flow, 2011.
Leonid I. Rudin, Stanley Osher, Emad Fatemi, Nonlinear total variation based noise removal algorithms [ PDF ], Physica D: Nonlinear Phenomena, Volume 60, Issues 1–4, 1 November 1992.