To motivate my question, let's start with an example:
Example: Find $\tfrac{df}{dt}$ and $\tfrac{\partial f}{\partial t}$ if $f(x,t)=xt$, where $x=x(t)$.
From my current understanding, the total derivative $\tfrac{df}{dt}$ cares about us substituting $t$ into $x$ to give us
$$\dfrac{df}{dt}=\dfrac{d}{dt}(xt)=\dfrac{d}{dt}(x(t)\cdot t)=\dfrac{dx}{dt}t+x$$
and the partial derivative $\tfrac{\partial f}{\partial t}$ doesn't care about the substitution as it sees $x$ and $t$ as two separate variables and keeps $x$ constant for the differentiation
$$\dfrac{\partial f}{\partial t} = \dfrac{\partial}{\partial t}(xt)=x$$
What throws me off is when we have a function such as $f(x(u,v),y(u,v))$. Finding $\tfrac{\partial f}{\partial u}$ for example would be found by doing
$$\dfrac{\partial f}{\partial u}=\dfrac{\partial f}{\partial x}\dfrac{\partial x}{\partial u}+\dfrac{\partial f}{\partial y}\dfrac{\partial y}{\partial u}$$
My question is this: Why doesn't $\tfrac{\partial f}{\partial u}=0$? The function $f$ is only a function of $x$ and $y$ originally and there are no $u$'s to be found in the original function. You may say we perform the substitution to get $x=x(u,v)$ and $y=y(u,v)$ making $f$ a function of $u$ and $v$ but from the example above, for $\tfrac{\partial f}{\partial t}$ we do not substitute $x=x(t)$ and kept $x$ as is. So why are functions of two variables different and require the use of the multivariable chain rule?