Notice: On March 31, it was **announced** that Statalist is moving from an email list to a **forum**. The old list will shut down on April 23, and its replacement, **statalist.org** is already up and running.

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

From |
William Hauser <whauseriii@gmail.com> |

To |
statalist@hsphsun2.harvard.edu |

Subject |
st: coldiag and pertub |

Date |
Sun, 13 Nov 2011 13:28:13 -0500 |

Hi all, I'm running a logit model with a single interaction term and factor variable notation. I'm having trouble understanding the collinearity diagnostics. The model is of the form: logit y c.var1##c.var2 i.var3 c.var4 There are 11 variables in the model in total but I have omitted them above for brevity. Stata is version 12. coldiag2 won't run after logit but I did plug in my independent variables into the command and got a condition index number of 397 which is, as you would expect, driven by the association between a, b, and the interaction term aXb. I'm not sure what to make of this. My instinct is that the high condition index is ok because it is the result of the interaction term but I'm unsure. More troubling, without the interaction term the index is still 98 but this appears driven by the association of of var1 with the "constant term." I'm not sure what the constant term is, the coldiag2 help file is vague about this. I know the old coldiag program defaulted to no constant but the newer version includes the constant by default. If I omit the constant using the "nocons" option then the condition index is a more reasonable value of 20. What is the constant term and what do I make of all this? I've also experimented with the perturb command but can't find any guidance about how to interpret the results or how to specify the perturbations. For example, how large should the perturbation be for the continuous variables? 1 standard deviation? Surely the results are a function of how large the perturbation is. It's also not clear to me what's reported in the summary table from the command. Is the "mean" the average coefficient observed across the 100 iterations? Or is it the mean variation in the coefficient across the 100 iterations? The help file for perturb is rather vague and I can't find any examples using perturb on the UCLA Stata website either. Any guidance you could offer would be greatly appreciated. Will * * For searches and help try: * http://www.stata.com/help.cgi?search * http://www.stata.com/support/statalist/faq * http://www.ats.ucla.edu/stat/stata/

- Prev by Date:
**st: Normal CDF** - Next by Date:
**st: rearranging cross-section to longitudinal cross-section** - Previous by thread:
**st: Normal CDF** - Next by thread:
**st: rearranging cross-section to longitudinal cross-section** - Index(es):