In a book I'm reading (Convex Optimization by Boyd and Vandenberghe) it says

I'm struggling to understand the last sentence. Why can one conclude concavity from having a pointwise infimum of a family of affine functions?
Because the Lagrangian $L(x,\lambda,\nu)$ is affine in $\lambda$ and $\nu$, the Lagrange dual function $d(\lambda,\nu) = \inf_{x\in \mathcal{D}}L(x,\lambda,\nu)$ is always concave: it is the pointwise infimum of a family of affine functions, and such an infimum is always concave. (Dually, the pointwise supremum of a family of convex functions is convex.)
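To make this concrete, here is a minimal numerical sketch (my own toy problem, not from the book): minimize $x^2$ subject to $x \ge 1$, so $L(x,\lambda) = x^2 + \lambda(1-x)$, which is affine in $\lambda$ for each fixed $x$, and $d(\lambda) = \lambda - \lambda^2/4$ in closed form.

```python
import numpy as np

# Toy problem (an assumption for illustration): minimize x^2 subject to x >= 1.
# Lagrangian L(x, lam) = x**2 + lam*(1 - x); dual d(lam) = inf_x L(x, lam).
xs = np.linspace(-10.0, 10.0, 4001)   # grid standing in for x in D
lams = np.linspace(0.0, 8.0, 401)     # multiplier values

# Each fixed x gives an affine function of lam: slope (1 - x), intercept x**2.
L = xs[:, None] ** 2 + lams[None, :] * (1.0 - xs[:, None])
d = L.min(axis=0)                     # pointwise infimum over the family

# Closed form for this problem: d(lam) = lam - lam**2/4, attained at x = lam/2.
assert np.allclose(d, lams - lams**2 / 4, atol=1e-3)

# Concavity check: second differences of a concave function are <= 0.
assert np.all(np.diff(d, 2) <= 1e-9)
print("pointwise infimum of affine functions in lambda is concave")
```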
The book referenced is Convex Optimization by Boyd and Vandenberghe. To better see the "pointwise infimum", consider a slight change/abuse of notation: $L_x(\xi) = L(x, \lambda, \nu)$ where $\xi = (\lambda, \nu)$. For a fixed $x$, $L_x(\xi)$ is affine in $\xi$, so $\{L_x \,:\, x \in \mathcal{D}\}$ is a family of affine functions and its pointwise infimum is $$g(\xi) = \inf \,\{L_x(\xi)\,:\,x\in\mathcal{D}\}.$$ Now we can use @A.Γ.'s pointer to show that $g$ is concave by showing that the epigraph of $-g$ is convex. A point $(\xi, t)$ lies in $\operatorname{epi}(-g)$ iff $t \ge -g(\xi) = \sup_x -L_x(\xi)$, i.e. iff $t \ge -L_x(\xi)$ for every $x$. Hence $\operatorname{epi}(-g) = \bigcap_x \operatorname{epi}(-L_x)$, an intersection of convex sets (each $-L_x$ is affine), so $\operatorname{epi}(-g)$ is convex and $g$ is concave.
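For comparison, concavity also falls out of the definition directly, without epigraphs (a standard one-line argument, not specific to the book's notation): for any $\xi_1, \xi_2$ and $\theta \in [0,1]$, since each $L_x$ is affine,
$$g(\theta\xi_1 + (1-\theta)\xi_2) = \inf_{x\in\mathcal{D}} \bigl[\theta L_x(\xi_1) + (1-\theta) L_x(\xi_2)\bigr] \ge \theta \inf_{x\in\mathcal{D}} L_x(\xi_1) + (1-\theta) \inf_{x\in\mathcal{D}} L_x(\xi_2) = \theta g(\xi_1) + (1-\theta) g(\xi_2).$$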
Concavity of the dual function is very much a non-intuitive property.
One way to show it is to use the fact that a function is convex if and only if its epigraph is a convex set. The epigraph of a function $f(\vec x)$ is the set of points 'above' that function: $$\left\{(\vec x,y) \mid y \geq f(\vec x)\right\}$$
For the dual function we have the pointwise infimum of a family of affine functions: $$D(\vec \lambda, \vec \nu) = \inf_{\vec x} \mathcal{L}(\vec x,\vec \lambda, \vec \nu) = \inf_{\vec x} \left( \vec a(\vec x)^\top \begin{bmatrix} \vec \lambda \\ \vec \nu \end{bmatrix} + b(\vec x) \right)$$
That is, for each fixed $\vec x$ we can re-write the Lagrangian (a scalar) in the form $\vec a^\top \begin{bmatrix} \vec \lambda \\ \vec \nu \end{bmatrix} + b$ for some vector $\vec a$ and scalar $b$ that both depend on $\vec x$.
Loosely speaking, 'pointwise' means the infimum is taken separately at each point $(\vec \lambda, \vec \nu)$: we may pick a different minimizing $\vec x$ depending on the value of $\vec \lambda$ and $\vec \nu$.
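As a concrete instance (just the book's standard Lagrangian, written out): with $\mathcal{L}(\vec x, \vec\lambda, \vec\nu) = f_0(\vec x) + \sum_{i} \lambda_i f_i(\vec x) + \sum_{j} \nu_j h_j(\vec x)$, the affine form is
$$\mathcal{L}(\vec x,\vec\lambda,\vec\nu) = \underbrace{\begin{bmatrix} f_1(\vec x) & \cdots & f_m(\vec x) & h_1(\vec x) & \cdots & h_p(\vec x)\end{bmatrix}}_{\vec a(\vec x)^\top} \begin{bmatrix}\vec\lambda \\ \vec\nu\end{bmatrix} + \underbrace{f_0(\vec x)}_{b(\vec x)}.$$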
For any given value of $\vec x$, the epigraph of $\mathcal{L}$ (viewed as a function of $(\vec\lambda, \vec\nu)$) is a convex set: once $\vec x$ is fixed the function is affine, and affine functions are both convex and concave. If we flip this notion, we can look at negative epigraphs (hypographs), the set of points 'below' the function. The negative epigraph of $D$ is the intersection of the negative epigraphs of all the affine functions obtained by fixing each possible value of $\vec x$. The intersection of convex sets is a convex set, so this negative epigraph is convex. The negative epigraph of a function is a convex set if and only if the function is concave, so the dual function $D$ must be concave!
It helps a bit to draw this out on a sheet of paper.
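If you would rather have the computer draw it, here is a small matplotlib sketch (my own illustration, using the assumed toy problem minimize $x^2$ subject to $x \ge 1$, so there is no $\vec \nu$): each fixed $x$ gives a line in $\lambda$, and the dual is their concave lower envelope.

```python
import numpy as np
import matplotlib.pyplot as plt

# Toy problem (assumed for illustration): minimize x^2 subject to x >= 1.
# For each fixed x, L(x, lam) = x**2 + lam*(1 - x) is a line in lam.
lam = np.linspace(0.0, 4.0, 200)
for x in [0.0, 0.5, 1.0, 1.5, 2.0]:
    plt.plot(lam, x**2 + lam * (1.0 - x), "--", label=f"x = {x}")

# The dual D(lam) is the pointwise infimum over all x: a concave lower envelope.
xs = np.linspace(-5.0, 5.0, 2001)
D = (xs[:, None] ** 2 + lam[None, :] * (1.0 - xs[:, None])).min(axis=0)
plt.plot(lam, D, "k", linewidth=3, label="dual = pointwise inf")

plt.xlabel(r"$\lambda$")
plt.ylabel("value")
plt.legend()
plt.show()
```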
I think it's easier to visualize the maximization case, in which the dual is a sup and hence convex. Say you change a multiplier in the direction that relaxes its constraint: the Lagrangian, being affine in the multiplier, improves at least linearly with the change, and there is probably room for improvement beyond linear because of the relaxation, hence convexity. Indeed, the constraint function evaluated at the current optimum is a subgradient of the dual. In terms of the epigraph, after changing the multiplier you will stay at least on the same affine member of the family, but you may also move up to the next one.
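A one-line check of that subgradient claim (standard argument; I write $L(x,\lambda) = f_0(x) + \lambda^\top f(x)$ with $f$ collecting the constraint functions, and use the sup convention $g(\lambda) = \sup_x L(x,\lambda)$, so $g$ is convex): if $x^\star$ attains the sup at $\lambda_0$, then for any $\lambda$
$$g(\lambda) \ge L(x^\star, \lambda) = L(x^\star, \lambda_0) + f(x^\star)^\top(\lambda - \lambda_0) = g(\lambda_0) + f(x^\star)^\top(\lambda - \lambda_0),$$
which is exactly the statement that $f(x^\star)$, the constraint values at the current optimum, is a subgradient of $g$ at $\lambda_0$.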