5

I'm trying to find conditions on the characteristic polynomial, $p$, of a matrix such that the pre-image of matrices with characteristic polynomial $p$ form a manifold.

More precisely, we can write down a map $\phi: \textrm{Mat}_n(\mathbb{R}) \to \mathbb{R}[x]$ given by $\phi(A) = \textrm{det}(xI-A)$ and so now the question reduces to finding $p \in \mathbb{R}[x]$ such that the derivative of $f$ evaluated at pre-images of $p$ is surjective.

Using the "Jacobi formula" for the derivative of a determinant, namely $T_Af(X) = \mathrm{det}(A)Tr(A^{-1}X)$ where $f(A) = \mathrm{det}(A)$, I think we can deduce that the determinant of $\phi$ is given by:

$$T_A(\phi)(X) = det(xI-A)tr((xI-A)^{-1}X)$$

This is where I first get in to trouble, how do we know in this case that $xI-A$ is even invertible? Surely this is only true in some finite interval (bounded by the norm of $A$)? Is this actually the correct formula for the derivative?

It also seems impossible to determine if this map is surjective or not? How can we understand which polynomials are hit by this map?

Edit: I've made some progress but am still struggling with this. Thanks to Qiaochu Yuan I'm now trying to show this when $p$ has distinct real roots.

To find the derivative we can write $\phi(A+tH) = det(xI - (A+tH))$ and attempt to evaluate $\frac{\phi(A+tH) - \phi(A)}{t}$ as $t \to 0$.

Now the numerator here gives:

$$x^n + ... - tr(A+tH)x + det(A+tH) - det(xI-A)$$

Expanding out the determinants:

$$x^n + ... - tr(A)x - t tr(H)x + det(A+tH) - x^n + ... + tr(A)x - det(A)$$

Now I'm left with the $x^n$ terms cancelling and the constant term will again just be the derivative of $det(A)$ which is $det(A)tr(A^{-1}H)$ and the $x$ term will be $-tr(H)$. The difficulty is, I have no idea what the $1<k<n-1$ terms should be, and these are the important ones because they describe the tangent space of the n monic polynomials!

Wooster
  • 3,885
  • 1
    $\phi$ surjects onto the monic polynomials; every monic polynomial is the characteristic polynomial of a matrix, namely its companion matrix (https://en.wikipedia.org/wiki/Companion_matrix). – Qiaochu Yuan May 04 '16 at 18:35
  • Thanks for your comment - If the map is surjective does this imply the derivative is surjective? – Wooster May 04 '16 at 19:37
  • No, certainly not. I think $d \phi$ will end up being surjective at the matrices whose characteristic polynomial have distinct roots (over $\mathbb{C}$), but not in general. – Qiaochu Yuan May 04 '16 at 19:39
  • What is the intuition for that? Do you know if my formula for the derivative is correct? – Wooster May 04 '16 at 19:42
  • The idea is that if you slightly deform a polynomial with distinct roots the result also has distinct roots, so all you've done is move each root a little. Any matrix with such a characteristic polynomial is diagonalizable (over $\mathbb{C}$), so you can slightly deform all the corresponding preimages by deforming the eigenvalues without changing the eigenvectors. I haven't thought about whether your formula is correct. – Qiaochu Yuan May 04 '16 at 19:46
  • Okay, I think I have the idea - intuitively the idea that you can continuously move the pre-images around means that you have a manifold structure? I'll have a go at making that precise. – Wooster May 04 '16 at 19:51
  • @QiaochuYuan I'm still having some difficulty with this, I've added a little more to the question and wondered if you might know where to take it from there? – Wooster May 14 '16 at 09:55
  • 1
    You can rewrite Jacobi's formula so that it is valid even when $xI-A$ is not invertible.
    Remember that if $\operatorname{adj}(A)$ is the adjugate of $A$ (the transpose of the cofactor matrix of $A$), then $A\operatorname{adj}(A) = \det(A)I$, so that if $A$ is invertible then $A^{-1} = \operatorname{adj}(A) / \det(A)$.
    So here, $T_A(\phi)(X) = \det(xI-A)\operatorname{tr}((xI-A)^{-1}X) = \operatorname{tr}(\operatorname{adj}(xI-A)X)$.
    – mercio May 17 '16 at 17:43
  • I expect that $T_A(\phi)$ won't be surjective when $A$ has eigenspaces of dimension larger than $1$. – mercio May 17 '16 at 17:44

1 Answers1

2

Firstly we see that $\phi $ maps $M_n({\bf R})$ to the space of polynomial of degree n with $x^n$ as a term of highest degree. So wee sould think of $\phi$ as a map from $M_n$ to the affine space of polynomila pf the form $x^n+Q$ with $deg(Q)\leq n-1$. Then $T_A(\phi)$ is a linear map from $M_n({\bf R})$ to the vector space $R_{n-1}[X]$

We keep the notation of Mercio for the adjugate matrix, and accept that $T_A(\phi)X=tr(adj(xI-a)X)$.

In particular :

Lemma 1 : $im(T_A(\phi)$ is the linear subspace space of ${\bf R}_{n-1}[X]$generated by the coefficients of $adj(xI-A)$

Recall that the transpose of cofactor matrix of a given matrix $A$, $adj(A)$ satisfies $adj(A)A=det(A) id$ (Cramer formula). If the rank of $A$ is $n$, $c(A)$ is invertible. Furthermore,one checks that if the rank of $A$ is $\leq n-2$, $adj(A)$ is $0$ (compute $c(A)$ in a base where the two first vector are in the kernel). In particular

Lemma 2 : the complex number $x$ being fixed $adj(xI-A)=0$ iff $x$ is a (complex) eigenvalue of $A$ such that the eigenspace is of dimension at least 2. then $x$ is a double root of the caracteristic polynomial of $A$.

Claim. The polynomial $P $ is a critical value of $\phi$ iff it has a double root.

Proof. If $P$ has a (perhaps complex) double root say $x_0$, consider $A$ a matrix with $P$ as a characteristic polynomial and $x_0$ as eigenvalue with eigenspace of dimension at least 2. Then the entries of the matrix $adj(xI-A)$ are polynomials which vanishes at $x=x_0$, so $T_A(\phi)$ is not surjective : it is contained in the subspace of polynomials which vanishes at $x=x_0$.

If all roots of $P$ are simple, $A$ is diagonalizable over ${\bf C}$,with eigenvalues $x_1,,...,x_n$. A simple computation in the (complex) digonalisation base, the coefficient of $ad(A-xI)$ are the complex Lagrange polynomials $L_k=\Pi _{i\not =k}(x-x_i)$. It is known that these polynomials is a base of ${\bf C}_{n-1}[X].$ So the complexification of $T_A(\phi)$ is a linear surjective map. But is is known that a linear map between two real vector spaces is surjective iff its complexification is : indeed a real $(n,m)$ matrix has the same rank over $\bf R$ and $\bf C$.

Thomas
  • 8,588