This question addresses an argument in the convergence proof of an SQP method I found in a text book.
Let $f \colon \mathbb R^n \to \mathbb R$ be twice continuously differentiable. Let there be given sequences of directions $s_k \to \hat s$ in $\mathbb R^n$ and of step lengths $\sigma_k \to 0$ in $\mathbb R$. Also, let $x_k \to \hat x$ in $\mathbb{R}^n$ with $x_{k+1} = x_k + \sigma_k s_k$.
I wish to prove that
$$\begin{equation}\tag{1}\label{eq:lemma} \lim_{k\to\infty} \frac{1}{\sigma_k}\big( f(x_k + \sigma_k s_k) - f(x_k) \big) - \nabla f(x_k)^\top s_k = 0. \end{equation}$$
By definition of $\nabla f(x_k)^\top s_k$ and by continuity of $\nabla f,$ we have
$$\begin{align}\tag{2}\label{eq:limits} \lim_{x_k \to \hat x} \lim_{s_k \to \hat s} \lim_{\sigma_k \to 0} \frac{f(x_k + \sigma_k s_k) - f(x_k)}{\sigma_k} = \lim_{x_k \to \hat x} \lim_{s_k \to \hat s} \nabla f(x_k)^\top s_k = \nabla f(\hat x)^\top \hat s. \end{align}$$
I have a couple of questions:
In order to use \eqref{eq:limits} for \eqref{eq:lemma}, I need to argue why it is valid to compute the limits of $\sigma_k,s_k,x_k$ one after another, even though in \eqref{eq:limits} they are approached in parallel. Can you help?
Does \eqref{eq:lemma} hold true for any converging sequence $x_k \to \hat x$?
Does \eqref{eq:lemma} hold true for any sequence $x_k$, not necessarily converging?
If that approach works, I don't see why it wouldn't also give a positive answer for question 2.
– Sep 14 '18 at 10:41