|  | 
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: st: Model selection using AIC/BIC and other information  criteria
At 06:07 PM 6/23/2009, kokootchke wrote:
Dear all,
I have a model that says that the return or yield spread of a bond 
issued by a country depends non-linearly on the country's 
probability of default. If I assume that this probability of default 
follows a logistic form, I get that the log spread depends linearly 
on "stuff" which I take to be macroeconomic variables. To choose the 
best model, I use AIC/BIC.
One interesting fact I observe is that in some cases, I see that 
both AIC and BIC select a model that contains some variable X even 
when a lot of data points are missing for that particular variable, 
which means I actually lose a lot of observations when I include 
such variable X.
More specifically, I have:
MODEL 1
regress log_spread a b c X
estat ic
which gives AIC = 915
then,
MODEL 2
regress log_spread a b c
estat ic
which gives AIC = 1500
but the OLS in model 1 uses 1200 observations while the OLS in model 
2 uses 2800 observations (because 1600 observations are missing in 
variable X)!!
You would think that this would be because X is very relevant to 
explain the spread, but in fact I see some cases when this variable 
is statistically insignificant!!
Somebody can correct me if I am wrong, but I don't think it is legit 
to compare BIC and AIC statistics that have been estimated on 
different samples.  I don't think these stats are totally immune to 
differences in sample size -- and even if they were the two samples 
used might be very different, e.g. maybe those 1600 missing cases are 
all bonds from the US.
I'm guessing a fairer comparison would be
nestreg, lr: reg log_spread (a b c) X
The same sample will be used for both regressions and you will get 
BIC and AIC stats at the end.
I think your bigger concern, though, is losing more than half your 
cases when you include X.  You need to find out why those data are 
missing and then decide what to do about it.
-------------------------------------------
Richard Williams, Notre Dame Dept of Sociology
OFFICE: (574)631-6668, (574)631-6463
HOME:   (574)289-5227
EMAIL:  [email protected]
WWW:    http://www.nd.edu/~rwilliam
*
*   For searches and help try:
*   http://www.stata.com/help.cgi?search
*   http://www.stata.com/support/statalist/faq
*   http://www.ats.ucla.edu/stat/stata/