probability function of a function of random variables

Question

For any random variable that is a function of other random variables, e.g. $Z = g(X_1,X_2,X_3) = 5X_1 + (X_2X_3)^2$, is there a general formula/method to find $f_Z(z)$ given one knows $X_1,X_2,X_3$ are all independent and one also knows $f_{X_1}(x_1),f_{X_2}(x_2),f_{X_3}(x_3)$?

I know the existence of convolution i.e. to find $f_Z(z)$ when $Z=X_1+X_2+...+X_n$ but apart from convolution I know no other formula/method to find the probability function of a function of random variables.

In principle, Yes. Consider the Jacobian formula. However, computing an explicit formula might not be easy, in general. — Aditya Ghosh, Dec 20 '19 at 13:21
@AdityaGhosh Does Jacobian formula refer to the change of variable technique? — cjkilimanjaro, Dec 20 '19 at 13:58
Yes. It is the change of variable technique, for functions of several variables. — Aditya Ghosh, Dec 20 '19 at 14:01
@AdityaGhosh Okey, nice. And it can be used to find $f_Z(z)$ for any linear combination of $X_1, X_2, ... , X_n$ as long as we know $f_{X_1}(x_1) , f_{X_2}(x_2), ... , f_{X_n}(x_n)$, right? — cjkilimanjaro, Dec 20 '19 at 14:03
Yes, in principle. But as I said earlier, the Integrals may be hard to compute and it is possible that there is no explicit closed-form expression. — Aditya Ghosh, Dec 20 '19 at 14:06
@AdityaGhosh I looked it up. I think you miss understood my question. My question is not in regards to a vector $Z$, but rather what $f_Z(z)$ would be given $Z=g(X_1,X_2,...,X_n)$ i.e. $Z$ is one random variable (not a vector of random variables) and $Z$ is a function of a bunch of other random variables. the change of variable technique can be used to find $f_{Z_1,Z_2,...,Z_n}$ when each $Z_i$ is a linear combination of $X_1,X_2,...,X_n$ i.e. we go from a joint distribution to another joint distribution only the variables are different. Please correct me if I have misunderstood. — cjkilimanjaro, Dec 20 '19 at 14:31

score 1 · Answer 1 · answered Dec 20 '19 at 13:19

If $Z=g(X_i)$, then general formula would be: $$ f_Z(z) = \int \delta\left(g(x_i)-z\right)f_{X_i}(x_i)dx_i, $$ where $\delta(x)$ is Kronecker delta function and $f_{X_i}(x_i)$ is probability density function of vector $x_i$.

If $X_i$ has independent components, then joint pdf is a product of pdfs: $f_{X_i}(x_i)=f_{X_1}(x_1)f_{X_2}(x_2)\ldots f_{X_n}(x_n)$.

Finally, if it's possible to express one of the components $X_i$ from the function $g$, you can contract one of the integral and delta function: $$ f_{Z}(z) = \int\limits_{-\infty}^\infty\int\limits_{-\infty}^\infty\int\limits_{-\infty}^\infty \delta\left(5x_1+(x_2x_3)^2-z\right)f_{X_1}(x_1)f_{X_2}(x_2)f_{X_3}(x_3)dx_1dx_2dx_3=\\ \int\limits_{-\infty}^\infty\int\limits_{-\infty}^\infty \frac15f_{X_1}\left(\frac{z-(x_2x_3)^2}5\right)f_{X_2}(x_2)f_{X_3}(x_3)dx_2dx_3, $$

score 0 · Answer 2 · answered Dec 20 '19 at 13:23

This is a good question. I'd say the most general purpose technique is to find the CDF and then differentiate. I.e., calculate: $$F_Z(z) = \int_{(x_1,\dots,x_n):g(x_1,\dots,x_n) \leq z} f_{X_1,\dots,X_n}(x_1,\dots,x_n) \ dx_1\cdots \ dx_n$$ and then $$f_Z(z) = \frac{d}{dz}F_Z(z).$$ I think this will work in your particular example as long as you can determine the region of $(x_1,x_2,x_3)$ where $5x_1+(x_2x_3)^2 \leq z$. However, one can certainly come up with functions $g$ for which this won't be possible.

probability function of a function of random variables

2 Answers2

Linked