Notice: On March 31, it was announced that Statalist is moving from an email list to a forum. The old list will shut down on April 23, and its replacement, statalist.org is already up and running.
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: st: outliers
Maarten buis <firstname.lastname@example.org>
Re: st: outliers
Fri, 27 Aug 2010 13:00:45 +0000 (GMT)
--- On Fri, 27/8/10, email@example.com wrote:
> More broadly: when would you suggest to use mmregress
> instead of regress (also with robust option)? Can we say
> that mmregress is always better than the simple OLS? Or it
> can be used only in the presence of a large number of
> outliers? and for how many outliers would you suggest the
> mmregres instaead of regress?
Unfortunately there can be no general recipe we can follow
here. Remember that what we are trying to do is the following:
We have a question, we observe stuff, we summerize the stuff
using a model, we answer our question based on that summary.
Outliers are just observations that don't fit well in our
model. This can mean two things, either there is something
wrong witht the observations or there is something wrong with
There are several ways in which a computer can quantify how
well an observation fits within the model, but there is no way
a computer can decide whether it is the model or the observation
that is to blame.
The solution is to know your data, figure out why a certain
observations have been classified as outliers. If you have many
of those, don't only focus on various forms of "robust"
regression, also consider that variables may have non-linear
effects, i.e. try transformations. That is the art of using
statistics for research.
Maarten L. Buis
Institut fuer Soziologie
* For searches and help try: