Re: Re-re-post: Stata 11 - Factor variables in a regression command
Date
Sat, 01 May 2010 14:13:11 -0400
At 12:20 PM 5/1/2010, Richard Williams wrote:
These factor variables are nice but make sure you understand what
parameterization you are getting and how to interpret it!!!
Personally I think there is much to be said for explicitly including
the main effects so I can make sure they are there and to make my
commands easier to read, i.e. I prefer
logit y i.a i.b a#b
over
logit y a##b
Backtracking a bit -- if the model is rather complicated (2 way
interactions, 3 way interactions, squared and cubic terms, whatever)
the ## notation may be good in that it seems to ensure that you don't
miss any of the lower level effects, e.g. if you have a 3 way
interaction, you will get the main effects, all the 2 way
interactions, and the 3 way interactions. This saves some typing and
also means you don't accidentally miss something. For example,
I would have preferred that the 1.white appear right after the 1.yr89
and the 1.male (i.e. have all the main effects together), but
everything you want is in there somewhere.