3

Consider the problem

minimize $f(x)=||Ax-b||_2^2$,

where $A$ is an $m\times n$ matrix with $m\geq n$, and $b$ is a vector of length $m$. Assume that the rank of $A$ is equal to $n$.

We can write down the first-order necessary condition for optimality: If $x_*$ is a local minimizer, then $\triangledown f(x_*)=0$.

Is this also a sufficient condition?

Ian
  • 1,391

1 Answers1

2

Yes, this is also sufficient. The quick way to see this is simply to see that $f(x)$ is convex, so its extrema are its minimizers. Alternatively, you can calculate directly the derivative of $f$: if $f'(x) = 0$, then for every vector $v$,

\begin{align} 0 = f'(x) v & = \lim_{\epsilon \rightarrow 0} \frac{1}{\epsilon} [f(x + \epsilon v) - f(x)]\\ & = \frac{1}{\epsilon} [ \| A(x + \epsilon v) - b\|^2 - \|Ax - b\|^2 ] \\ & = \langle Ax - b, Av \rangle + \langle Av, Ax - b \rangle \\ & = \langle A^T A x - A^T b, v \rangle + \langle v, A^T Ax - A^T b \rangle \end{align} If this holds for every $v$, then $A^T Ax - A^T b = 0$. These are precisely the so-called normal equations that one can solve to find the minimizer for the linear least-squares problem.

Christopher A. Wong
  • 22,445
  • 3
  • 51
  • 82
  • I am asking a somewhat similar question here: https://math.stackexchange.com/questions/2502215/tangent-cone-to-a-set-at-a-given-point-and-first-order-necessary-optimality-cond in addition to something about the tangent cone and was wondering if it would be possible for you to take a look. Thank you! –  Nov 02 '17 at 23:55
  • there is a 100 point bounty on it. –  Nov 04 '17 at 23:51