Questions tagged [robust-statistics]

Robust statistics are statistics with good performance for data drawn from a wide range of probability distributions, especially for distributions that are not normally distributed. Robust statistical methods have been developed for many common problems, such as estimating location, scale and regression parameters.

Robust statistics are insensitive to deviations from their underlying assumptions and outliers. Such methods are useful it is not possible to detect and remove outliers or to appropriately test the assumptions required by a given statistic. A robust statistic is meant to achieve three goals:

efficiency - it should have an optimal or nearly optimal efficiency as the assumed model stability - small deviations from the assumptions should have only a small influence on performance breakdown - larger deviations from the assumptions should not lead to a complete failure Examples of robust statistics are median regression as estimation technique, or Huber-White standard errors for statistical inference. Note that "robust" is not equivalent to "better". Robustness is always based on compromise as it sacrifices efficiency to ensure against larger deviations from the assumptions from the model (Anscombe, 1960).

For further reading see

Huber, P.J. and Ronchetti, E.M. (2009) "Robust Statistics", 2nd Edition, Wiley Series in Probability and Statistics, John Wiley & Sons, Inc., New Jersey Anscombe, F.J. (1960) "Rejection of Outliers", Technometrics, Vol. 2, pp. 123-147

72 questions
1
vote
0 answers

Determining if a bag of wheat cent pennies has been searched - using statistics.

I have collected a sample of 5000 wheat cent pennies, and recorded the number of each year and mint mark $\in \{P, D, S\}$. I have also found out how many wheat cent pennies were minted at each mint, during each year. What would the formula be to…
0
votes
1 answer

Transform t-stat into skewness-adjusted t-stat

I'm trying to calculate a one-sample skewness-adjusted t-stat (the null hypothesis is a mean of 1) as proposed by Johnson (1978): $$ J = t + \frac{gt^2}{3\sqrt{n}} + \frac{g}{6\sqrt{n}}, $$ $t$ is the conventional t-statistic and $g$ is the skewness…
0
votes
0 answers

Query in "Certified adversarial robustness via randomized smoothing" Cohen et al paper

I am going through the proof sketch for Randomized Smoothing from paper: Cohen, Jeremy, Elan Rosenfeld, and Zico Kolter. "Certified adversarial robustness via randomized smoothing." In international conference on machine learning, pp. 1310-1320.…
user1953366
  • 191
  • 7
0
votes
0 answers

Estimate scale parameter from 5% contaminated mean-zero normal sample

I have a normal sample but that is contaminated in the left and the right tails (no more than 2.5% each). The contamination gives rise to high and low values. I wanted to know what methods I have in order to estimate ths sigma of the N(0, sigma^2)…
user3236841
  • 255
  • 1
  • 8