Stata: Data Analysis and Statistical Software

Notice: On April 23, 2014, Statalist moved from an email list to a forum, based at statalist.org.

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: st: Stepwise Regression using Multiple Imputed Data - stata 11

From	Richard Williams <[email protected]>
To	[email protected], <[email protected]>
Subject	Re: st: Stepwise Regression using Multiple Imputed Data - stata 11
Date	Tue, 09 Aug 2011 00:44:52 -0500

At 09:34 PM 8/8/2011, Russell Simco wrote:

Hi,
The "stepwise" model selection program is not available when usingthe "mi estimate:" prefix in stata-11. I would like to do thismanually using "mi estimate: ologit".

Besides all the usual concerns with stepwise, with MI there would bethe added problem that arbitrary or minor changes in your imputationprocedure, e.g. using a different random number seed, would producedifferent imputed data sets and hence could lead to differentvariable selection. It wouldn't surprise me if there are otherreasons why the use of sw with mi is invalid. You might take a lookat the svy & stepwise FAQ for other ideas on how to proceed:


http://www.stata.com/support/faqs/stat/stepsvy.html

But, if you are bound and determined to do stepwise manually anyway,it doesn't seem that hard (other than possibly being extremelytedious). Suppose you have x1-x10. Run 10 bivariate regressions of ywith each X. The X with the largest T value wins the first round(assuming it meets your criterion for entry, e.g. is significant atthe .05 level). Lets say x1 wins.

In the 2nd round, run 9 regressions, e.g. y with x1 and x2, y with x1and x3, etc. Winner (i.e. the added X with the largest significant Tvalue) advances to the 3rd round.


Repeat process until no more x's are statistically significant.

Or, work backwards. Regress Y on x1-x10. The least significant x(assuming at least one meets your criteria for removal) gets dropped.Repeat as needed.


The manual (at least in Stata 12) also describes the logic behind stepwise.

Overall, I am not sure why you would want to do this, I think it isprobably a bad idea to do it, and if I was going to do it I'd betempted to do it with the original un-imputed data (at least then myfate isn't being decided by an arbitrary random number seed). But ifyou really really want to do it seems straightforward albeit tedious.



-------------------------------------------
Richard Williams, Notre Dame Dept of Sociology
OFFICE: (574)631-6668, (574)631-6463
HOME:   (574)289-5227
EMAIL:  [email protected]
WWW:    http://www.nd.edu/~rwilliam

*
*   For searches and help try:
*   http://www.stata.com/help.cgi?search
*   http://www.stata.com/support/statalist/faq
*   http://www.ats.ucla.edu/stat/stata/

References:
- st: Stepwise Regression using Multiple Imputed Data - stata 11
  - From: Russell Simco <[email protected]>

Prev by Date: Re: st: Granger causality test in Stata10
Next by Date: Re: st: xtabond with constraints
Previous by thread: st: Stepwise Regression using Multiple Imputed Data - stata 11
Next by thread: st: interpreting quantile regression with categorical predictors
Index(es):
- Date
- Thread