11

I'm reading *Semi-Riemannian Geometry with Applications to Relativity* by Barrett O'Neill, and I'm trying to understand the definition of the gradient of a function $f$ on a Riemannian manifold. I know that the motivation for defining the gradient in Riemannian geometry is to preserve the identity $ \langle grad \ f , X \rangle = df(X)$ from $\mathbb{R}^n$, where $X$ is a vector field. On the one hand, $df(X) = \sum_{i=1}^{n} \frac{\partial f}{\partial x^i} X^i$. On the other hand, $$\langle grad \ f , X \rangle = \left\langle \sum_{i=1}^{n} (grad \ f)^i \frac{\partial }{\partial x^i} , \sum_{j=1}^{n} X^j \frac{\partial }{\partial x^j} \right\rangle = \sum_{j=1}^{n} \sum_{i=1}^{n} (grad \ f)^i \ X^j \left\langle \frac{\partial }{\partial x^i} , \frac{\partial }{\partial x^j} \right\rangle = \sum_{j=1}^{n} \sum_{i=1}^{n} (grad \ f)^i \ X^j g_{ij}.$$ If $ \langle grad \ f , X \rangle = df(X)$, then $\sum_{i=1}^{n} \frac{\partial f}{\partial x^i} X^i = \sum_{j=1}^{n} \sum_{i=1}^{n} (grad \ f)^i \ X^j g_{ij} \ (*)$, but the author states that $$grad \ f := \sum_{j=1}^{n} \sum_{i=1}^{n} g^{ij} \frac{\partial f}{\partial x^i} \frac{\partial }{\partial x^j}.$$ I know that $g^{ij}$ denotes an entry of the matrix $G^{-1}$, where $G$ is the matrix of the metric tensor, but I don't understand how they conclude that $grad \ f$ has this form.

Thank you in advance for any help!

EDIT:

I tried to work out equation $(*)$, and I think I now see how $grad \ f$ is obtained. I will put my derivation here.

$\sum_{i=1}^{n} \frac{\partial f}{\partial x^i} X^i = \sum_{j=1}^{n} \sum_{i=1}^{n} (grad \ f)^i \ X^j g_{ij} \Longrightarrow$

$\sum_{j=1}^{n} \frac{\partial f}{\partial x^j} X^j = \sum_{i=1}^{n} (grad \ f)^i \left( \sum_{j=1}^{n} g_{ij} X^j \right)$

In matrix form, we have

$[\frac{\partial f}{\partial x^1} \cdots \frac{\partial f}{\partial x^n}] \cdot [X^1 \cdots X^n]^T = [(grad \ f)^1 \cdots (grad \ f)^n] \cdot G \cdot [X^1 \cdots X^n]^T$, where $[X^1 \cdots X^n]^T$ is the transpose of the matrix $[X^1 \cdots X^n]$, so

$[\frac{\partial f}{\partial x^1} \cdots \frac{\partial f}{\partial x^n}] \cdot [X^1 \cdots X^n]^T - [(grad \ f)^1 \cdots (grad \ f)^n] \cdot G \cdot [X^1 \cdots X^n]^T = 0 \Longrightarrow$

$\left( [\frac{\partial f}{\partial x^1} \cdots \frac{\partial f}{\partial x^n}] - [(grad \ f)^1 \cdots (grad \ f)^n] \cdot G \right) \cdot [X^1 \cdots X^n]^T = 0 \Longrightarrow$

Since the last equality holds for every vector field $X$, it follows that $[(grad \ f)^1 \cdots (grad \ f)^n] \cdot G = [\frac{\partial f}{\partial x^1} \cdots \frac{\partial f}{\partial x^n}] \Longrightarrow$

$[(grad \ f)^1 \cdots (grad \ f)^n] = [\frac{\partial f}{\partial x^1} \cdots \frac{\partial f}{\partial x^n}] \cdot G^{-1} \Longrightarrow$

$(grad \ f)^j = \sum_{i=1}^{n} \frac{\partial f}{\partial x^i} g^{ij}$. We can write $grad \ f = \sum_{j=1}^{n} (grad \ f)^j \frac{\partial }{\partial x^j}$, so $grad \ f = \sum_{j=1}^{n} \left( \sum_{i=1}^{n} \frac{\partial f}{\partial x^i} g^{ij} \right) \frac{\partial }{\partial x^j} = \sum_{j=1}^{n} \sum_{i=1}^{n} \frac{\partial f}{\partial x^i} g^{ij} \frac{\partial }{\partial x^j}$, which is exactly the author's definition.
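This derivation can be checked numerically at a single point. Here is a small sketch (my own addition, not from the book), assuming polar coordinates $(r, \theta)$ on the punctured plane, where the metric matrix is $G = \mathrm{diag}(1, r^2)$, and a hypothetical test function $f(r,\theta) = r^2\sin\theta$:

```python
import math

# Polar coordinates (r, theta) on the punctured plane: metric G = diag(1, r^2).
r, theta = 2.0, 0.7

g = [[1.0, 0.0], [0.0, r**2]]            # metric matrix G at the point
g_inv = [[1.0, 0.0], [0.0, 1.0 / r**2]]  # G^{-1}, with entries g^{ij}

# Hypothetical test function f(r, theta) = r^2 sin(theta); components of df:
df = [2 * r * math.sin(theta), r**2 * math.cos(theta)]

# (grad f)^j = sum_i (df/dx^i) g^{ij}
grad_f = [sum(df[i] * g_inv[i][j] for i in range(2)) for j in range(2)]

# Check the defining equation (*): sum_{i,j} (grad f)^i X^j g_{ij} = df(X)
X = [0.3, -1.2]  # an arbitrary tangent vector at the point
lhs = sum(grad_f[i] * X[j] * g[i][j] for i in range(2) for j in range(2))
rhs = sum(df[i] * X[i] for i in range(2))
assert abs(lhs - rhs) < 1e-12
```

Note that the second gradient component comes out as $\frac{1}{r^2}\frac{\partial f}{\partial\theta}$, not $\frac{\partial f}{\partial\theta}$: the inverse metric is what distinguishes the gradient from the differential.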

Arctic Char
  • 16,972
George
  • 3,957
  • 1
    The gradient is uniquely defined so you just need to plug that formula into the definition and check that the equation holds up. – user375366 Apr 11 '17 at 07:01
  • 1
    @George The point is, when you compute $g(\operatorname{grad}f, X)$ in coordinates, there will be a factor of the type $\sum_{k} g^{ik} g_{kj} = \delta^{i}_{j}$. – Andrew D. Hwang Apr 11 '17 at 11:45
  • @AndrewD.Hwang, can I assert this in the general case? Because I have read the result that a Riemannian manifold $M^n$ is locally isometric to $\mathbb{R}^n$ if and only if the curvature $R = 0$, and here I don't know anything about the curvature of the manifold. – George Apr 11 '17 at 12:14
  • @George: Not sure I understand your comment. The curvature of a manifold being zero has nothing to do with the product of the metric (in local coordinates) and its inverse matrix being the identity. – Andrew D. Hwang Apr 11 '17 at 15:35
  • @AndrewD.Hwang, sorry, I made a mess; I understand what you said now. I think I have understood how the gradient of $f$ is defined; I edited my question and included the reasoning. – George Apr 15 '17 at 17:45

2 Answers

23

I like to think of this in terms of matrix algebra. Let our vector fields and one-forms be column vectors, where one-forms act by transpose-multiplication.

In a coordinate chart, an arbitrary vector field $V$ pulls back to a vector field on our coordinate patch in $\mathbb{R}^n$, the metric tensor pulls back to a matrix field $g$ with inverse matrix field $g^{-1}$, and the differential of $f$ is a one-form $df$. The computation defining the gradient $\nabla f$ is: $$ \langle \nabla f, V\rangle = df(V), $$ which becomes in coordinates: $$ (\nabla f)^T g V = df^T V. $$ As this is true for any $V$, we have the matrix identity $$ (\nabla f)^T g = df^T, $$ which gives $$ g^T\nabla f = df, $$ and since $g$ is a symmetric matrix, inverting we have $$ \nabla f = g^{-1}df. $$

If you write it out with sums and coordinate vector fields, you will be performing this computation at the level of matrix entries, but I find this approach much cleaner.
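A minimal numerical sketch of the identity $\nabla f = g^{-1}df$ (my own illustration, not part of the original answer; the metric matrix and covector below are made up for a single point):

```python
# A made-up symmetric positive-definite 2x2 metric matrix g at one point
g = [[2.0, 0.5],
     [0.5, 1.0]]

# Explicit 2x2 inverse
det = g[0][0] * g[1][1] - g[0][1] * g[1][0]
g_inv = [[ g[1][1] / det, -g[0][1] / det],
         [-g[1][0] / det,  g[0][0] / det]]

# Made-up components of the one-form df at that point
df = [1.0, -0.5]

# The answer's identity: grad f = g^{-1} df
grad_f = [sum(g_inv[i][j] * df[j] for j in range(2)) for i in range(2)]

# Check the defining equation (grad f)^T g V = df^T V for a few vectors V
for V in ([1.0, 0.0], [0.0, 1.0], [0.7, -2.3]):
    lhs = sum(grad_f[i] * g[i][j] * V[j] for i in range(2) for j in range(2))
    rhs = sum(df[i] * V[i] for i in range(2))
    assert abs(lhs - rhs) < 1e-12
```

The check works because $(g^{-1}df)^T g V = df^T g^{-1} g V = df^T V$, using the symmetry of $g$.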

Neal
  • 34,012
1

What you are doing is (I think) correct. However, in my opinion you are doing it in an overly complicated manner. Here is one way to do it more simply. Let us write $grad \ f=\nabla f=\sum_{i=1}^nf^i\partial_i$.

1) "The coordinate way": It is easier not to use a general $X$, but a specific choice, namely $\partial_j$. We obtain $$\partial_j f= df(\partial_j)=g(\nabla f, \partial_j)=g\Big(\sum_{i=1}^nf^i\partial_i,\partial_j\Big)=\sum_{i=1}^nf^ig_{ij}.$$ Now fix some $k$, multiply the above equation by $g^{jk}$ and sum over all $j$. Thus $$ \sum_{j=1}^n \partial_j f \, g^{jk}= \sum_{j=1}^n \sum_{i=1}^nf^ig_{ij} g^{jk}=\sum_{i=1}^nf^i \sum_{j=1}^n g_{ij} g^{jk}=\sum_{i=1}^nf^i\delta_i^k=f^k.$$
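As a concrete illustration (my addition, assuming polar coordinates $(r,\theta)$ on the punctured plane with line element $ds^2 = dr^2 + r^2\,d\theta^2$): here $g_{11}=1$, $g_{22}=r^2$, $g_{12}=0$, hence $g^{11}=1$ and $g^{22}=1/r^2$, and the formula $f^k=\sum_{j=1}^n \partial_j f\, g^{jk}$ gives $$\nabla f = \frac{\partial f}{\partial r}\,\partial_r + \frac{1}{r^2}\frac{\partial f}{\partial \theta}\,\partial_\theta,$$ which is the familiar gradient in polar coordinates (written in the coordinate basis $\partial_r,\partial_\theta$, not the orthonormal one).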

2) "The matrix way": As you have realised yourself (I just switch the roles of $\nabla f$ and $X$), $$[X^1 \cdots X^n]\,G\, [f^1 \cdots f^n]^T=[X^1 \cdots X^n][\partial_1 f \cdots \partial_n f]^T,$$ and since this holds for every $X$, $$ G [f^1 \cdots f^n]^T=[\partial_1 f \cdots \partial_n f]^T.$$ Multiplying by $G^{-1}$ we obtain $$ [f^1 \cdots f^n]^T=G^{-1} [\partial_1 f \cdots \partial_n f]^T.$$ When you recall how a matrix and a vector are multiplied, you see that you get the same result as in 1).

Furthermore, you can see that 1) and 2) are essentially the same, because fixing $k$, multiplying by $g^{jk}$ and summing over all $j$ is essentially the same as multiplying by $G^{-1}$.

Frieder Jäckel
  • 1,937
  • 9
  • 15