Stata The Stata listserver
[Date Prev][Date Next][Thread Prev][Thread Next][Date index][Thread index]

st: Re: imputing categorical variables


From   "Scott Merryman" <[email protected]>
To   <[email protected]>
Subject   st: Re: imputing categorical variables
Date   Tue, 21 Oct 2003 19:14:45 -0500

----- Original Message ----- 
From: "Bellessa Frost, Michelle" <[email protected]>
To: <[email protected]>
Sent: Monday, October 20, 2003 12:36 PM
Subject: st: imputing categorical variables


> Hello,
>
> Is there any way to impute a value for data missing from categorical variables
in Stata? The Stata "impute" command uses OLS to estimate missing values,
appropriate only for continuous variables. I need to deal with missing data for
non-continuous variables.
>
> Theoretically, I could use logit and multinomial logit models, with the
predict command, to obtain predicted values for missing cases. But, when there
are missing cases in any of the variables I condition on (the independent
variables in the regression), I get a missing for the predicted variable in that
case.
>
> Has anyone dealt with something like this before?
>
> Thanks--Michelle
>
>

You may want to take a look at -whotdeck- and -hotdeckvar-  From the help file
of hotdeckvar:

"Hotdeck imputation is especially useful for discrete variables (e.g 0/1 dummy
variable) where the imputed values shouldn't take any other values. Regression
imputation with impute would result into intermediate values (e.g. 0.56 for 0/1
dummy variables)."

Use -findit hotdeck- to locate and download.

Hope this helps,
Scott



*
*   For searches and help try:
*   http://www.stata.com/support/faqs/res/findit.html
*   http://www.stata.com/support/statalist/faq
*   http://www.ats.ucla.edu/stat/stata/



© Copyright 1996–2024 StataCorp LLC   |   Terms of use   |   Privacy   |   Contact us   |   What's new   |   Site index