64

Posted on MO since it is unanswered in MSE

It is known that the number of real roots of a random polynomial with real coefficients is much smaller than the number of complex roots. WLOG, assume that the coefficients are uniformly random in $(-1,1)$ for if not then we can divide each coefficient by the coefficient with the largest absolutely value to scale each coefficient to $(-1,1)$. Then the number of real roots of a polynomial of degree $n$ is asymptotic to $\displaystyle \frac{2\log n}{\pi} + o(1)$. Similar asymptotics hold for other distribution of the coefficients however for the rest of this post we assume that the coefficients are uniformly random in $(-1,1)$. This means that the number of complex roots is approximately $\displaystyle n - \frac{2\log n}{\pi}$.

Definition 1: The largest root of a polynomial is the root with the largest modulus. Definition 2: The smallest root of a polynomial is the root with the smallest modulus.

Roots scatter plot

The above graph shows the roots of a polynomial of degree $101$; the largest root is in the top right corner in green.

Is the largest or the smallest root more likely to be complex or real? The naive guess is that the largest or the smallest root is more likely to be complex than real because there are exponentially more complex roots than real roots as seen from the above asymptotic.

Probability that the largest root is real

However, experimental data shows that

  1. Probability that the largest root is real is equal to the probability that the smallest root is real and this probability is greater than that of either of them being complex.
  2. This probability decreases to $1/2$ as $n \to \infty$ as shown in the above graph (created using a Monte Carlo simulation with $10^5$ trails for each value of $n$).
  3. Note: Instead of uniform distribution, if we assume that the coefficients are normally distributed with mean $0$ and standard deviation $1$ and scaled to $(-1,1)$, the above observation and limiting probabilities hold.

It is counter intuitive that despite being much exponentially fewer in number, real roots are more likely to contain the largest as well as the smallest roots. In this sense, the largest as well as the smallest roots is biased towards reals.

Question 1: What is the reason for this bias?

Question 2: Does the probability that the largest (or the smallest) root of a polynomial of degree $n$ is real approach $\frac{1}{2}$ as $n \to \infty$?

Update 2-May-2024: We can quantify the observed bias as follows. Let $P(L|R)$ be the probability that a root is the largest given that it is real and let $P(L|C)$ be the probability that a root is the largest given that it is complex. Similarly, let $P(S|R)$ be the probability that a root is the smallest given that it is real and let $P(S|C)$ be the probability that a root is the smallest given that it is complex. Then the experimental data says that

$$ P(L|R) = P(S|R) \approx \frac{\pi}{4\log n}, $$

$$ P(L|C) = P(S|C) \approx \frac{\pi}{2n\pi - 4\log n}. $$

Related: What is the probability that the absolute value of the roots of a polynomial of degree $n$ is greater than $x$?

  • Comments have been moved to chat; please do not continue the discussion here. Before posting a comment below this one, please review the purposes of comments. Comments that do not request clarification or suggest improvements usually belong as an answer, on [meta], or in [chat]. Comments continuing discussion may be removed. – Xander Henderson May 10 '24 at 12:51

1 Answers1

26

Note: The main idea of this argument is basically the same as in the Persiflage blog post mentioned in charmd's comment, though I do get slightly better bounds.


Not a complete solution, but I can show that there is at least a $6\%$ chance that the largest root is real (regardless of degree).

We'll instead consider the smallest root, which is equivalent per Varun Vejalla's comment.

Consider the polynomial $p(z) = a_0 + a_1z + \ldots + a_n z^n$. The key idea is that when $|z|$ is small, the behavior of $p(z)$ is generally dominated by the first few low-degree terms. Since (for example) linear polynomials only have real roots, it is not too surprising that the smallest root of $p(z)$ is frequently real.

Concretely, suppose $|a_0| \in [0, \tfrac{1}{9}]$ and $|a_1| \in [\tfrac{5}{6}, 1]$. Then I claim that $p(z)$ has exactly one root inside the circle $|z| = \tfrac{1}{3}$ and hence its smallest root is real. Indeed, on this circle we have $$ |a_1 z | \ge \frac{5}{6}\cdot \frac{1}{3} = \frac{5}{18} $$ whereas

\begin{align*} |p(z)-a_1 z| &\le |a_0| + |a_2 z^2| + |a_3 z^3| + \ldots \\ &< \frac{1}{9} + \frac{\tfrac{1}{3^2}}{1-\tfrac{1}{3}} = \frac{5}{18} \end{align*}

so by Rouché's theorem $p(z)$ has the same number of roots in the circle as $|a_1 z|$, namely one. And since there's only one root, it must be real.

From the restrictions on $a_0$ and $a_1$, this situation happens $\tfrac{1}{9} \cdot \tfrac{1}{6} \approx 1.9\%$ of the time.


We can do slightly better by finding all pairs $(a_0, a_1)$ for which this Rouché argument works, on some circle of the form $|z| = R$.

Following the same reasoning as before, we have

$$|a_0 + a_2z^2 + a_3z^3 + \ldots| < |a_0| + \frac{R^2}{1-R}$$ and we want this to be less than $|a_1 z| = |a_1| R$. In other words, we want $$(|a_1|+1)R^2 - (|a_0| + |a_1|)R + |a_0| < 0 \tag{$*$}$$ to hold for some $0 < R < 1$. This quadratic is positive at $R=0$ and $R=1$, and its minimum occurs at $R = \frac{|a_0| + |a_1|}{2|a_1|+2}$ (which is always between $0$ and $1$), so $(*)$ holds for some $0 < R < 1$ if and only if the discriminant is nonnegative, i.e. $$ (|a_0| + |a_1|)^2 \ge 4|a_0|(|a_1|+1)$$ i.e. $$ |a_1| \ge |a_0| + 2\sqrt{|a_0|}.$$

So we have shown that if $|a_1| \ge |a_0| + 2\sqrt{|a_0|}$, then the smallest root of $p(z)$ is real.

What is the probability that this occurs? If $|a_0|$ is larger than $(\sqrt{2}-1)^2$, then $|a_1|$ would have to be larger than $1$, which can't happen. Otherwise, $|a_1|$ can be any value between $|a_0|+2\sqrt{|a_0|}$ and $1$, so the probability is

$$\int_0^{(\sqrt{2}-1)^2} 1-(x+2\sqrt{x})\,dx = \frac{23-16\sqrt{2}}{6} \approx 6.2 \%.$$

Jyrki Lahtonen
  • 140,891
Benjamin Wright
  • 2,825
  • 12
  • 15
  • 2
    A great start! And also accessible to most (for a suitable definition of "most"). – Jyrki Lahtonen May 10 '24 at 06:48
  • @ Benjamin Wright. Congratulations on winning the bounty. Probably we can extend similar argument to include $a_2$ for a cubic equation taking advantage of the fact that a polynomial of an odd degree has at least one real root; hence the other two roots are either both real or both complex. But $4$-th power and higher it will get complicated as there will be too many cases to consider. – Nilotpal Sinha May 10 '24 at 08:09
  • 1
    Varun Vejalla's comment has been lost, could you put it into your answer? – Vincent May 10 '24 at 13:17
  • 2
    @Vincent it’s moved to chat here: https://chat.stackexchange.com/transcript/message/65633472#65633472 – Luke Taylor May 10 '24 at 15:39
  • 3
    @Vincent I took the liberty of editing the two links to comments. Now they both lead to the chatroom, where the comments were relocated. – Jyrki Lahtonen May 10 '24 at 17:45