good morning everyone.
I understand that kernel density estimation is a non-parametric technique used to estimate the probability density function of a random variable from a sample of data, with the kernel density defined as follows:
$$ \hat{f}(x) = \frac{1}{n h} \sum_{i=1}^{n} K\left(\frac{x - X_i}{h}\right) $$
where:
- $\hat{f}(x)$ is the density estimate at point $x$,
- $n$ is the number of observations,
- $h$ is the bandwidth
- ${K}(u)$ is the kernel function.
I assume that to obtain the cumulative distribution function, one must integrate $\hat{f}(x)$
$$ F(x) = \int_{-\infty}^{x} \hat{f}(t) \, dt $$
The thing is, I have read that the Epanechnikov kernel is the most efficient, which is defined as follows:
$$ K(u) = \frac{3}{4}(1 - u^2) \quad \text{if } \vert u \vert \leq 1 $$ $$ K(u) = 0 \quad \text{if } \vert u \vert > 1, $$
I would like to know how to calculate $F(x)$ using this type of kernel.
Thank you very much, and best regards.