Outliers
Re: Outliers
In what model? For a linear regression, there have been many ways proposed to detect outliers. The Baltagi textbook example baltp193.rpf computes five of them. Most of these are refinements on the simpler |e|/sigma criterion, computing a different sigma for each data point rather than just using the regression sigma. For an ARIMA model, the BOXJENK instruction has automatic outlier detection which is implemented by testing the effect of adding various dummies to the model.
Re: Outliers
My whole idea is to find outliers(event) before choosing the model. In the data of monthly frequency, I just want to figure out the months as outliers. I do not like to eliminate them, but just to see which months appear as outliers.
Re: Outliers
However, an outlier is specific to a model. If you have Y,X pairs
Y X
0 0
1 1
2 2
50 50
looking at the Y values in isolation (in effect thinking of a "model" in which the Y's are i.i.d. N(mu,sigma^2)), the 4th observation appears to be an outlier. In the model Y=a+bX, it isn't; it's right on the regression line along with everyone else. Even if you assume i.i.d. data, what would be seen as an outlier in Normally distributed data might be perfectly reasonable for a fat-tailed distribution like a Cauchy.
Y X
0 0
1 1
2 2
50 50
looking at the Y values in isolation (in effect thinking of a "model" in which the Y's are i.i.d. N(mu,sigma^2)), the 4th observation appears to be an outlier. In the model Y=a+bX, it isn't; it's right on the regression line along with everyone else. Even if you assume i.i.d. data, what would be seen as an outlier in Normally distributed data might be perfectly reasonable for a fat-tailed distribution like a Cauchy.