Questions tagged [chain-rule]

For questions involving the chain rule in analysis. The chain rule is a special rule to differentiate a composition (chain) of several functions. In single-variable calculus, we found that one of the most useful differentiation rules is the chain rule, which allows us to find the derivative of the composition of two functions. The same thing is true for multivariable calculus, but this time we have to deal with more than one form of the chain rule.

The chain rule provides us with a technique for finding the derivative of composite functions, with the number of functions that make up the composition determining how many differentiation steps are necessary.

In one-dimensional case takes the simplest form as $$f(g(x))'=f'(g(x))g'(x)$$ where both $f(x)$ and $g(x)$ are functions of one variable.

${}$

CHAIN RULE FOR ONE INDEPENDENT VARIABLE: Suppose that $x=g(t)$ and $y=h(t)$ are differentiable functions of $t$ and $z=f(x,y)$ is a differentiable function of $x$ and $y$ . Then $z=f(x(t),y(t))$ is a differentiable function of $t$ and

$$\frac{\mathrm dz}{\mathrm dt}=\frac{\partial z}{\partial x}\cdot \frac{\mathrm dx}{\mathrm dt}+\frac{\partial z}{\partial y}\cdot \frac{\mathrm dy}{\mathrm dt}$$ where the ordinary derivatives are evaluated at $t$ and the partial derivatives are evaluated at $(x,y)$ .

${}$

CHAIN RULE FOR TWO INDEPENDENT VARIABLE: Suppose $x=g(u,v)$ and $y=h(u,v)$ are differentiable functions of $u$ and $ v$ , and $z=f(x,y)$ is a differentiable function of $x$ and $y$ . Then, $z=f(g(u,v),h(u,v))$ is a differentiable function of $u$ and $v$ , and

$$\frac{\partial z}{\partial u}=\frac{\partial z}{\partial x}\cdot \frac{\partial x}{\partial u}+\frac{\partial z}{\partial y}\cdot \frac{\partial y}{\partial u}$$ and $$\frac{\partial z}{\partial v}=\frac{\partial z}{\partial x}\cdot \frac{\partial x}{\partial v}+\frac{\partial z}{\partial y}\cdot \frac{\partial y}{\partial v}$$

${}$

The Generalized Chain Rule: Let $w=f(x_1,x_2,…,x_m)$ be a differentiable function of $m$ independent variables, and for each $i \in 1, \ldots, m,$ let $x_i=x_i(t_1,t_2,\ldots,t_n)$ be a differentiable function of $n$ independent variables. Then $$\frac{\partial w}{\partial t_j}=\frac{\partial w}{\partial x_1}\cdot \frac{\partial x_1}{\partial t_j}+\frac{\partial w}{\partial x_2}\cdot \frac{\partial x_2}{\partial t_j}+\cdots+\frac{\partial w}{\partial x_m}\cdot \frac{\partial x_m}{\partial t_j}$$ for any $j \in 1,2, \ldots ,n$.


You can use and tags for multidimensional cases.

References:

https://en.wikipedia.org/wiki/Chain_rule

1661 questions
56
votes
8 answers

Chain Rule Intuition

We know that the chain rule is used to differentiate a composite function ,say $$f(x) = h(g(x))$$ It's defined as the derivative of the outside function times the derivative of the inner function or the other way around. $$\frac{\mathrm{dy}…
alok
  • 4,010
21
votes
1 answer

Understanding the chain rule in probability theory

When my teacher told us about the chain rule I found it quite easy, but when I am trying to prove something based on this rule I kind of get confused about what are the allowed forms of this rule. For example, I can't understand why I can…
Moj
  • 313
15
votes
7 answers

What's my confusion with the chain rule? (Differentiating $x^x$)

When deriving $x^x$, why can't you choose $u$ to be $x$, and find $\dfrac{d(x^u)}{du} \dfrac{du}{dx} = x^x$? Or you could go the other way and find $\dfrac{d(u^x)}{du}\dfrac{du}{dx}$, giving $\ln(x)\cdot{x^x}$? Both methods seem to be equally wrong.
jg mr chapb
  • 1,602
14
votes
4 answers

Derivative of an even function is odd and vice versa

This is the question: "Show that the derivative of an even function is odd and that the derivative of an odd function is even. (Write the equation that says $f$ is even, and differentiate both sides, using the chain rule.)" I already read numerous…
Py42
  • 628
14
votes
2 answers

Chain rule using tree diagram — why does it work?

In multivariable calculus, I was taught to compute the chain rule by drawing a "tree diagram" (a directed acyclic graph) representing the dependence of one variable on the others. I now want to understand the theory behind it. Examples: Let $y$ and…
13
votes
2 answers

Why is this famous proof of the chain rule called "technically incorrect" in this pdf?

So I was looking through various proofs of the chain rule...and I came across this paper. The first proof given is complete and quite well-explained. But another simplistic proof is given in the end...which is mentioned as "technically incorrect".…
SirXYZ
  • 960
13
votes
2 answers

Why can you mix Partial Derivatives with Ordinary Derivatives in the Chain Rule?

This question is a simplified version of this previous question asked by myself. The following is a short extract from a book I am reading: If $u=(x^2+2y)^2 + 4$ and $p=x^2 + 2y$ $\space$ then $u=p^2 + 4=f(p)$ therefore $$\frac{\partial u}{\partial…
12
votes
3 answers

About a chain rule for Wronskians

The Wronskian of $(n-1)$ times differentiable functions $f_1, \ldots, f_n$ is defined as the determinant $$ W(f_1, \ldots, f_n)(x) = \begin{vmatrix} f_1(x) & f_2(x) & \cdots & f_n(x) \\ f_1'(x) & f_2'(x) & \cdots & f_n'(x) \\ \vdots & \vdots &…
Martin R
  • 128,226
12
votes
1 answer

When is the derivative of $f(g(x))$ equal to $g(f'(x))$?

By the chain rule, we know that the derivative of $f(g(x))$ is $f'(g(x))g'(x)$. Question: When is the derivative of $f(g(x))$ equal to $g(f'(x))$? Trivial solutions include the following: Let $f$ be any differentiable function and $g$ be the…
12
votes
1 answer

Understanding the chain rule in the Wirtinger calculus

The Wirtinger differential operators are defined by: \begin{equation} \frac{\partial}{\partial z} = \frac{1}{2}\left(\frac{\partial}{\partial x} - i\frac{\partial}{\partial y}\right) \\ \frac{\partial}{\partial \bar{z}} =…
11Kilobytes
  • 1,149
12
votes
3 answers

Why is there only one term on the RHS of this chain rule with partial derivatives?

I know that if $u=u(s,t)$ and $s=s(x,y)$ and $t=t(x,y)$ then the chain rule is $$\begin{align}\color{blue}{\fbox{$\frac{\partial u}{\partial x}=\frac{\partial u}{\partial s}\times \frac{\partial s}{\partial x}+\frac{\partial u}{\partial t}\times…
12
votes
2 answers

Derivation of the multivariate chain rule

I can't believe I couldn't find this information online, but could someone provide me a good proof of the multivariate chain rule ? \begin{align} \frac{df}{dt} = \frac{df}{dx}\frac{dx}{dt} + \frac{df}{dy}\frac{dy}{dt} \end{align} I found multiple…
9
votes
2 answers

Derivative of norm 2

I'm struggling a bit using the chain rule. Given the function $\phi$ defined as: $\phi(x) = ||{A\bf{x}-b}||_2$ where $A$ is a matrix and $b$ is a vector. What is the gradient $\nabla\phi$ and how should I proceed to compute it?
f.saint
  • 103
9
votes
1 answer

Neural Networks and the Chain Rule

With neural networks, back-propagation is an implementation of the chain rule. However, the chain rule is only applicable for differentiable functions. With non-differentiable functions, there is no chain rule that works in general. And so, it…
9
votes
1 answer

Chain rule characterizes the derivative?

To what extend does the chain rule $(f\circ g)'=(f'\circ g)\cdot g'$ characterize the derivative $(\ldots)'$? More precisely: If a non-constant operator $T$ defined on all differentiable functions, satisfies the relation $$T(f\circ g)=((Tf)\circ…
Carucel
  • 1,203
1
2 3
99 100