1

Let $F$ be a CDF and $\hat F$ its empirical analogue. The Glivenko-Cantelli Theorem states that $\Vert \hat F - F\Vert_\infty\rightarrow 0$ almost surely. Can we say anything about the total variation of $\hat F - F$?

Part of the problem is that I don't even know what definition of total variation can be used in this case. I know the definition of bounded pointwise variation for a real-valued function $f$ defined on a bounded interval $[a,b]\subset\mathbb R$: $$V_\text{bpv}(f) := \sup_{P\in\mathcal P}\sum_{i=1}^{n_P}\vert f(x_{i+1}) - f(x_i)\vert,$$ where $\mathcal P$ is the collection of all partitions $P = \{x_1, x_2,\dots, x_{n_P}\}$ of $[a,b]$. There is another (more advanced) definition for a real-valued function defined on an open subset $\Omega\subseteq\mathbb R$: $$V_\text{tcv}(f) := \sup_{h\in\mathcal H} \int_\Omega f \operatorname{div}h,$$ where $\mathcal H$ is the set of all continuously differentiable real-valued functions of compact support contained in $\Omega$ with essential supremum $\leq 1$.

Both definitions are not equivalent. Since the second definition has some advantages (specifically, it allows all of $\mathbb R$), I would go with the second definition. CDFs are real-valued functions defined on $\mathbb R$, so we set $\Omega = \mathbb R$. (Sometimes CDFs are defined as real-valued functions defined on $[-\infty,\infty]$. Does this change anything of the following?) For any $h\in\mathcal H$, we have: $$\int_\mathbb R (\hat F-F)\operatorname{div}h \leq \Vert \hat F - F\Vert_\infty \int_{\mathbb R} \operatorname{div}h = \Vert \hat F - F\Vert_\infty\int_{\mathbb R} h'$$ with $h'$ denoting the derivative of $h$. Let $K$ denote the support of $h$. Then, by definition $h(x) = 0$ for all $x\in\mathbb R\setminus K$, and $h(x)\neq 0$ for $x\in K$. That is, $h'(x) = 0$ for all $x\in \mathbb R\setminus K$ and $h'(x)\neq 0$ for all $x\in K$. Therefore, $$R_h:=\int_\mathbb R h' = \int_K h'.$$ By definition, $h'$ is continuous. So we have a continuous function on a compact set $K$, which means that $h'$ must be bounded. Therefore, there is a $C\geq 0$ such that $\vert h'\vert \leq C$, and since $K$ is compact, it has finite Lebesgue measure. As a result, $R_h$ is bounded. Finally, the Glivenko-Cantelli Theorem implies $V_\text{tcv}(\hat F - F)\rightarrow 0$ almost surely.

Is my reasoning correct? I never worked with total variation before, so I may overlooked something.

Edit: I think that my approach has the following problems:

  • Derivative at the boundary: I claimed that $h'(x) = 0$ for all $x\in\mathbb R\setminus K$ and $h(x) \neq 0$ for all $x\in K$. But is this is really the case? One could think of a function that has very steep slope close to the boundary. Sure, $h'$ is still continuous on all of $\mathbb R$, so there has to be some regularity. However, I feel that my argument is not rigorous enough.

  • The upper bound depends on $h$ (resp. $h'$): I show that $R_h$ is bounded. But the supremum over all $h\in\mathcal H$ (of this bound) may not be bounded. That is, my ultimate statement may not be true.

I added a bounty to draw more attention to this question.

Context: In my earlier question, I wanted to control $$I:=\int f\,\mathrm dg$$ with $g = \hat F - F$. By the triangle inequality: $$\left\vert\int f\,\mathrm dg\right\vert \leq \int\vert f\vert\,\mathrm dV(g).$$ I know the result holds for $V = V_\text{bpv}$, but I am not sure whether it holds for $V = V_\text{tcv}$, too. Should this be the case, then, with the assumption of a bounded $f$ and all previous steps being correct, I would have not only determined a bound for $I$ but also showed that $I\rightarrow 0$.

  • 2
    Quertiloper, as you already guessed in your edit, the claim that $h^\prime(x)\equiv 0$ on $\Bbb R\setminus K$ is surely correct but the problem is that you can find functions $h\in\mathcal H$ whose first derivative grows unbounded. Think for example of a "trapezoidal" shaped $h$ whose flat top is such that $h=1$ and sufficiently "rounded": you can increase the steepness of the oblique sides as much as you want, seeing that $h^\prime$ grows unbounded. Thus the approach above is doomed to fail. – Daniele Tampieri Jun 24 '25 at 16:39
  • Thank you Daniele for confirming the validity of my doubts. Do you mean that there is no chance that the variation of $\hat F-F$ vanishes, or is it just my approach (too loose bounds) that is doomed to fail? – Quertiopler Jun 24 '25 at 22:34
  • It is just the approach based on the Tonelli-Cesari variation that does not work easily in this context. The proposition you are trying to prove seems correct to me i.e. $$| \hat F- F|_{BV}\to 0. $$ Note that I used the $BV$-norm instead of the $TV$-norm since $\hat F, F\in BV(\Bbb R)$ and so is $\hat F-F$ since $BV(\Bbb R)$ is a vector space. – Daniele Tampieri Jun 25 '25 at 04:26
  • Assume $P$ is continuous, that is, $P$ assigns all point masses $0$. Then the total variation distance $d_{TV}(\hat{F}n, F) = \sup{A \in \mathcal{B}(\mathbb{R})}|\hat{F_n}(A) - F(A)| \geq 1$ for all $n$ by using $A = {X_1, \dots, X_n}$. – Mason Jun 25 '25 at 20:04

0 Answers0