18

I hear that $f(x)=x^4$ is a strictly convex function $\forall x \in \Re$. However, strict convexity condition is that the second derivative should be positive $\forall x \in \Re$. For the mentioned function, the second derivative is zero at $x=0$, which is in the domain of $f$. Therefore, it should not be a strict convex function. But I am pretty sure it is because I heard it in Prof. Boyd's lecture.

Am I missing something obvious?

  • 14
    If the second derivative is positive, then the function is strictly convex. The converse is not true. This is an example. – Julien Mar 05 '13 at 04:36
  • Some people call a function uniformly convex if it satisfies $f''\ge c$ for some positive constant $c$. This is a stronger property than strict convexity. –  Mar 05 '13 at 04:47
  • 3
    It's analogous to $x^3$: that function is an increasing function, even though it hesitates briefly at $x=0$. – André Nicolas Mar 05 '13 at 04:53
  • @5pm: Of course, uniform convexity is stronger yet than having a positive second derivative. (Consider the basic exponential function, for example.) – Cameron Buie Mar 05 '13 at 05:00
  • Note that the second derivative is positive almost everywhere. I'm pretty sure that this is also enough to get strict convexity. – Albert Dec 14 '13 at 09:54
  • There might be some confusion here between strict convexity and strong convexity. – Rodrigo de Azevedo Mar 25 '25 at 13:12

5 Answers5

17

If the second derivative is strictly positive, then the function is strictly convex. However, the converse need not be true. A function $f:\Bbb R\to\Bbb R$ is strictly convex if and only if for all $x,y\in\Bbb R$ with $x\neq y$ we have $$f\bigl(tx+(1-t)y\bigr)<tf(x)+(1-t)f(y)$$ for all $0<t<1$. $f(x)=x^4$ is indeed strictly convex.

This is a common misconception. Many make the same mistake regarding the relationship between positive first derivative and increasing functions (the former implies the latter, but not vice versa). See here and here, for examples of people making such errors.

Cameron Buie
  • 105,149
  • Ok. So the converse is not true... that is even if second derivative at x=0 is zero, we say that $x^4$ is strictly convex. Now my question is how to analytically see if the function is indeed strictly convex? (In case, we cannot plot the function and see if the line segment joining any two points lies above the function) – Autonomous Mar 05 '13 at 05:00
  • 1
    So is there any way to prove an arbitrary function (even or odd) is strictly convex? – Autonomous Mar 05 '13 at 05:06
  • 3
    You may find it easier to show that it is strictly midpoint convex--that is, that $$f\left(\frac{x+y}2\right)<\frac{f(x)+f(y)}2$$ whenever $x\neq y$. Midpoint convexity is equivalent to convexity for continuous functions. In general, though, you're probably going to have to do some grunt-work calculations. – Cameron Buie Mar 05 '13 at 05:09
6

Strict Convexity is when $f(tx+(1-t)y)<tf(x)+(1-t)f(y)$. The fact that $f''(x)\geq 0$ implies $f$ is convex, however, it does not necessarily imply that $f$ is not strictly convex. In the case of $f(x)=x^4$, you get $f''(x)=12x^2\geq 0$ but in fact $f$ is strictly convex. Indeed, any line segement lies above the curve.

Alex R.
  • 33,289
0

You can use the condition $\langle \nabla f(y) - \nabla f(x), y - x \rangle > 0$ to check the strict convexity. For $f(x)=x^4$, the above condition becomes $4(y-x)^2 \left[(y+ \frac{1}{2}x)^2 + \frac{3}{4}x^2\right] > 0$ with $x \neq y$. This example reminds us that the positive definite Hessian is sufficient, but not necessary, for strict convexity.

0

To check that a function is strictly convex, you can use the following criterion: the graph of a function that is convex but not strictly convex contains a straight line interval. In other words, such a function coincides with a linear function on some interval, or yet in other words, its second derivative vanishes identically on some interval. Since this is not the case for $x^4$, and it is convex, we can conclude that it is strictly convex.

Proof of the criterion: If a function is convex but not strictly convex, then if satisfies $$f(tx+(1-t)y)=tf(x)+(1-t)f(y)$$ for some $x,y$ and $t\in(0,1).$ Geometrically, it means that the point $Z=(z,f(z))$ lies on the straight line segment $[XY]$ connecting $X=(x,f(x))$ and $Y=(y,f(y))$, for some $z\in(x,y)$. Now, pick any $u,v$ with $x<u<z<v<y.$ By convexity, both points $U=(u,f(u))$ and $V=(v,f(v))$ are below or on $[XY]$, hence so is the whole segment $[UV]$ between them. Moreover, if at least one of $U,V$ were not on $[XY]$, then the whole open segment $(UV)$ would be strictly below $[XY]$. But by convexity, the graph of $f$ between $u$ and $v$ is below $[UV]$, in particular, so is the point $Z$ on that graph. Therefore, $Z$ could not be on $[XY]$, which is a contradiction showing that both $U$ and $V$ must be on $[XY]$. But since $u$ and $v$ were arbitrary, this means that the whole graph of $f$ between $x$ and $y$ is just the segment $[XY]$.

Kostya_I
  • 1,374
0

Second derivative being positive is for strong convexity. There is a subtle difference between strict convexity and strong convexity. A strongly convex function is strictly convex but the converse need not be true.

The condition for strict convexity is strict Jensen's inequality as pointed out by Alex R.

Hari
  • 239