Questions tagged [robust-statistics]

Robust statistics are statistics with good performance for data drawn from a wide range of probability distributions, especially for distributions that are not normally distributed. Robust statistical methods have been developed for many common problems, such as estimating location, scale and regression parameters.

Robust statistics are insensitive to deviations from their underlying assumptions and outliers. Such methods are useful it is not possible to detect and remove outliers or to appropriately test the assumptions required by a given statistic. A robust statistic is meant to achieve three goals:

efficiency - it should have an optimal or nearly optimal efficiency as the assumed model stability - small deviations from the assumptions should have only a small influence on performance breakdown - larger deviations from the assumptions should not lead to a complete failure Examples of robust statistics are median regression as estimation technique, or Huber-White standard errors for statistical inference. Note that "robust" is not equivalent to "better". Robustness is always based on compromise as it sacrifices efficiency to ensure against larger deviations from the assumptions from the model (Anscombe, 1960).

For further reading see

Huber, P.J. and Ronchetti, E.M. (2009) "Robust Statistics", 2nd Edition, Wiley Series in Probability and Statistics, John Wiley & Sons, Inc., New Jersey Anscombe, F.J. (1960) "Rejection of Outliers", Technometrics, Vol. 2, pp. 123-147

67 questions
6
votes
1 answer

Divergence based robust inference

The term 'divergence' means a function $D$ which takes two probability distributions $g,f$ as input and puts out a non-negative real number $D(g,f)$. I have learnt that the inference based on minimizing the following divergence is robust against…
5
votes
1 answer

Least Absolute Deviation (LAD) Line Fitting / Regression

I want to implement robust line fitting over a set of $n$ points $(x_i,y_i)$ by means of the Least Absolute Deviation method, which minimizes the sum $$\sum_{i=1}^n |y_i-a-bx_i|.$$ As described for instance in Numerical Recipes, you can find the…
5
votes
2 answers

Robust orientation of a point cloud

I have 2D point clouds which are 4-way symmetrical (invariant by 90° rotation). The points are usually arranged on the nodes of a square grid, densely populated, but some cases can be more complicated. I do know the point pattern in advance. I need…
user65203
4
votes
0 answers

Is there a robust version of the moving least squares or of the Savitzky–Golay filter?

Is there a name for the following type of filter? I want to filter a noisy signal $f(x) = f_0(x) + noise(x)$ (where $f_0$ is a noiseless signal), to get a filtered signal $f_\text{F}(x)$ while preserving edges and first derivatives. If all I wanted…
HelloGoodbye
  • 645
  • 3
  • 14
4
votes
1 answer

Explaining the standard deviation formula

I'm revisiting standard deviation for the first time years, and I can't for the life of me recall the difference between two formulas. In particular, I'm also looking for how we arrived at these forumulas. Firstly we have for the sample standard…
3
votes
1 answer

How to find Influence function?

Derive $IF(x;T,F)$ when $$\displaystyle T(F)=\int_{F^{-1}(\alpha)}^{F^{-1}(1-\alpha)}x ~dF(x)$$ Here $IF$ stands for Influence function. Trial: Here $$\begin{align}IF(x;T,F) &=\lim_{t\to 0}\frac{T((1-t)F+t\Delta_x)-T(F)}t \\ &=\lim_{t\to…
Argha
  • 4,741
3
votes
3 answers

About the closest linear function to an arbitrary function in L1 norm

Let $\mu$ be a probability distribution over $\mathbb{R}^n$. All functions discussed henceforth are from $\mathbb{R}^n$ to $\mathbb{R}$. Let $l^\ast$ be a linear function and $f$ be a function such $f=l^\ast$ on a set that contains strictly larger…
3
votes
1 answer

Infimum of the set of medians.

Let $\{F_{n}\}_{n}$ be a sequence of cumulative distribution functions such that converge to $F$, in the sense that $F_{n}(x)\rightarrow F(x)$ for all $x\in\mathbb{R}$. We define the function infimum of the median by $MEi(F) = \inf\{x\in\mathbb{R} :…
3
votes
1 answer

Robust Optimization: Using Bertsimas and Sim approach for linear model formulation (maximization problem)

I tried to use Bertsimas and Sim approach for an uncertain linear model, but the thing is the answer I got for the Bertsimas and Sim linear model when Γ = 2 is different from Soyster model's result, I tried other uncertain models and I got the same…
3
votes
3 answers

Robust line segment fitting to a digital path

Robust line fitting to a set of 2D points is a well studied problem for which several approaches are known. They usually consider the point cloud as unstructured. I call a digital path a sequence of pixels (points with integer coordinates) that are…
user65203
3
votes
0 answers

Projection onto the Set of Orthogonal Matrices with the ${L}_{1}$ Norm

I want to find an orthogonal matrix $O\in SO(n)$ such that $\|Y - OX \|_1$ is minimized, where X and Y are matrices (of appropriate sizes). I know that there is a solution to this problem using SVD for $L^2$ for example,…
3
votes
1 answer

Time series determined by other time series

Intuitive Question Suppose I'm given a set of $k$ time-series $\{X_t^1,\dots, X_t^k\}$. Is there a way to determine how much of each series is dependent on the others. Formal Question More precisely if for each $j \in \{1,\ldots,k\}$, $X^j_t$ is…
3
votes
2 answers

What does saying that an estimator is robust mean?

In statistics, how can you tell whether an estimator is robust or not? I need to discuss whether the maximum likelihood estimators (MLE) of the normal distribution are robust or not. The MLE are $$\hat{\mu} = \begin{pmatrix} \sum_{i=1}^n x_i/n \\…
3
votes
1 answer

What are problems an $M$ - estimator is trying to solve?

Anyone here have any experience with $M$ - estimators and do you think you can give a brief explanation that the problem an $M$ - estimator is trying to solve ? Thanks.
2
votes
1 answer

What is the formal definition of the breakdown value of a statistic

On page 482 of Statistical Inference (Second Edition) by Casella & Berger, the authors define the breakdown value as follows: Defintion 10.2.2 Let $X_{(1)} < \dots < X_{(n)} $ be an ordered sample of size $n$, and let $T_n$ be a statistic based on…
1
2 3 4 5