I'm confused with the notations of partial derivative on manifolds in Tu's An Introduction to Manifolds..
Just to make clear the notations I'm using, what I've known and which part I'm confusing, please let me to restate the question formally. Let $(M, \mathcal A)$ be an $n$-dimensional smooth manifold, $(U, \phi)=(U, x^i)$ be the chart containing some $p\in M$ and $f:U\to \mathbb R$ be a smooth function. The basis for $T_{\phi(p)}\mathbb R^n$ is $\left\{\left.\frac{\partial}{\partial r^i} \right\vert_{\phi(p)} \right\}_{i=1}^n$ and $r^i:\mathbb R^n\to \mathbb R, (a^1, a^2, \cdots, a^n) \mapsto a^i$ are the canonical coordinate functions, which means $x^i=r^i\circ\phi$. With these notation, the basis for $T_p U=T_pM$ is the preimage of those for $T_{\phi(p)}\mathbb R^n$ under the isomorphism $\phi_{*,p}$, i.e.
$$\left.\frac{\partial }{\partial x^i} \right\vert_p = \left(\phi^{-1} \right)_{*,\phi(p)} \left( \left.\frac{\partial}{\partial r^i} \right\vert_{\phi(p)} \right), i=1, \cdots n $$
Acting on $f$, we have
$$\left.\frac{\partial f}{\partial x^i} \right\vert_p = \left.\frac{\partial (f\circ \phi^{-1})}{\partial r^i} \right\vert_{\phi(p)}$$
I'm clear about the procedure of getting the basis as the preimages, but quite confused with the notations and the actual meaning.
According to this question (if the answer is right), the canonical coordinate functions $r^i:\mathbb R^n\to \mathbb R$ are actually the projections, then what does the expression $\frac{\partial (f\circ \phi^{-1})}{\partial r^i}|_{\phi(p)}$ exactly mean by taking partial derivatives on functions (which lie in function space)? And similarly for $\frac{\partial f}{\partial x^i}|_p$? By the usual meaning in calculus, isn't that supposed to take partial derivatives on variables lying in $\mathbb R^n$?
Personally I got several interpretations for this, but not quite sure whether they were right and I still feel some parts about the interpretation that doesn't make sense.
My first interpretation is "by abuse of notation". That is, we let $r^i=r^i(r)$ for $r\in \mathbb R^n$ and then $\frac{\partial (f\circ \phi^{-1})}{\partial r^i}|_{\phi(p)}$ becomes what we’ve meant. But we still cannot explain $\frac{\partial f}{\partial x^i}|_p$, since by our abuse, we get $x^i(p)=r^i \circ \phi(p))=r^i$. If we insist that $\partial()$ in the denominator be variables, we have to rewrite the left part as $\frac{\partial f}{\partial r^i}|_p$. This is absurd because $f$ is not the function of $r^i$. However, if we give up explaining $\frac{\partial }{\partial x^i}|_p$ on the left and regard it as a symbolic notation, or even rewrite it as $\partial_i$, somehow it then make sense.
However, this means that we have different interpretation on the symbols $\frac{\partial }{\partial x^i}|_p$ and $\frac{\partial }{\partial r^i}|_{\phi(p)}$, which in I personnally think should be consistent, since $\frac{\partial }{\partial x^i}|_p$ w.r.t. manifolds $M$ is just the more general case for $\mathbb R^n$.
My second interpretation is by regarding the coordinates and coordinate functions as the same. First consider $\mathbb R^n$, it itself is a manifold. Thus we can interpret it as an underlying topological manifold $M_{\mathbb R^n}$ with no coordinates at first and then was assigned each point a coordinate by specifying a chart $(M_{\mathbb R^n}, r^i)$ and the relative atlas, where $(r^i)$ is the cartesian orthogonal coordinates. The process of assigning coordinates is actually a coordinate map from $M_{\mathbb R^n}$ to the math object $\mathbb R^n$. That is, whenever we say the coordinates of some point $p$, we are actually saying the coordinate maps $(r^i):M_{\mathbb R^n} \to \mathbb R^n$.
With this interpretation and back to $\left.\frac{\partial f}{\partial x^i} \right\vert_p = \left.\frac{\partial (f\circ \phi^{-1})}{\partial r^i} \right\vert_{\phi(p)}$, we can regard both $\frac{\partial }{\partial x^i} ,\frac{\partial }{\partial r^i}$ as symbolic notation and redefine the notations for partial derivatives as the symbolic form “functions on functions”.
This version somehow seems more reasonable, but you may have noticed that I’ve change the definition of $r^i:{\mathbb R^n} \to \mathbb R$ (projection map) to $r^i: M_{\mathbb R^n} \to \mathbb R$ here (coordinate map). So the question is still unsolved.
What’s more, I also attempted to adapt the notation $\left.\frac{\partial f}{\partial x^i} \right\vert_p = \left.\frac{\partial (f\circ \phi^{-1})}{\partial r^i} \right\vert_{\phi(p)}$ on a concrete example, say the unit circle $S^1 \subset \mathbb R^2$. Let a chart be $(U, \phi)=(U, \rho, \theta)$ with polar coordinates and $q\in U$ in the first quadrant. Let $r^i$ be the coordinate function (i.e. projection) on $\mathbb R^2 \supset \phi(U)$ . Let’s use $(\xi, \eta)$ to describe the locations of points in $S^1$and $(\hat\xi, \hat\eta)$ the locations of points in $\mathbb R^2 \supset \phi(U)$.
Consider the function $f:U\to \mathbb R, (\xi, \eta) \mapsto \xi+\eta$. We have
$$ \begin{align*} \phi:(\xi, \eta)&\mapsto (\sqrt{\xi^2+\eta^2},\arctan \frac \xi\eta)\\ \rho=r^1\circ \phi:(\xi, \eta) &\mapsto \sqrt{ \xi^2+\eta^2}\\ \theta=r^2\circ \phi:(\xi, \eta) &\mapsto\arctan \frac \xi\eta \\ \phi^{-1}:(\hat\xi, \hat\eta)&\mapsto (\hat\xi \cos \hat\eta, \hat\xi \sin \hat\eta)\\ f \circ \phi^{-1}: (\hat\xi, \hat\eta) &\mapsto \hat\xi (\cos \hat\eta+ \sin \hat\eta) \end{align*} $$
Things are going right so far but the next thing is how can we compute $\left.\frac{\partial (f\circ \phi^{-1})}{\partial r^1} \right\vert_{\phi(q)}$?
Is it right to interpret it as computing partial derivatives on $\hat \xi$ , since $\frac{\partial }{\partial r^1}$ actually means to compute partial derivatives on the 1st variable? Then why not just use $\frac{\partial }{\partial \hat \xi}$ ?
Or does that mean we are supposed to use $(r^1, r^2)$ to describe the location of $\phi(q)$'s, and that my first interpretation should be right?
Well, suppose it’s right. We resymbolize $\phi^{-1}:(r^1, r^2)\mapsto (r^1 \cos r^2, r^1 \sin r^2)\,,\, f \circ \phi^{-1}: (r^1, r^2) \mapsto r^1 (\cos r^2+ \sin r^2)$, and let $\left.\frac{\partial (f\circ \phi^{-1})}{\partial r^i} \right\vert_{\phi(q)}$ be the usual meaning in calculus and $\left.\frac{\partial }{\partial \rho} \right\vert_{q}$ be symbolic. Then we have
$$ \left.\frac{\partial f}{\partial \rho} \right\vert_p = \left.\frac{\partial (f\circ \phi^{-1})}{\partial r^1} \right\vert_{\phi(p)} = \cos r^2+\sin r^2\\ \left.\frac{\partial f}{\partial \theta} \right\vert_p = \left.\frac{\partial (f\circ \phi^{-1})}{\partial r^2} \right\vert_{\phi(p)} = r^1 (\cos r^2-\sin r^2) $$
But the role of $\rho, r^1$ and $\theta, r^2$ are quite same, why not just use the same symbol? And what’s more, in the equation above, it’s $\left.\frac{\partial (f\circ \phi^{-1})}{\partial r^2} \right\vert_{\phi(p)}$ that’s more symbolic while $\left.\frac{\partial f}{\partial \theta} \right\vert_p$ somehow seems to have some practical meaning.
So what exactly do $\frac{\partial }{\partial x^i} ,\frac{\partial }{\partial r^i}$ and the coordinate functions mean?