Stata: Data Analysis and Statistical Software

Notice: On April 23, 2014, Statalist moved from an email list to a forum, based at statalist.org.

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

st: MI and z-standardisation

From	"J.B. Kirkbride" <[email protected]>
To	[email protected]
Subject	st: MI and z-standardisation
Date	14 Apr 2011 12:53:38 +0100

Dear Stata Users

I would appreciate your guidance on the following topic regarding multipleimputation (MI) and z-standardisation. I am currently learning MI using theexcellent stata help resources, but have an issue I can't find much supportfor.

I have a small dataset of 54 subjects, 4 of whom have missing data on avariable which measures social capital in their neighbourhood, let's callthis variable "sc". It is a continuous variable with an approximate normaldistribution. I wish to use this variable in the substantive analysis(eventually, a cox regression) as a predictor, using MI to estimate missingvalues. The best way to include this in such an analysis is as az-standardised variable with a mean of 0 and sd of 1, to make parameterestimates more interpretable.

I have followed the MI commands and can obtain MI estimates for sc. Myquestion is as follows:

I am unclear how/when/if to perform z-transformation on the multiplyimputed data. I have considered two options:

1. Prior to MI, generate "zsc" using the "egen zsc=std(sc)" command andthen run the appropriate MI commands, including "mi impute" on "zsc" toobtain direct estimates of the missing zsc values under an MI scenario.

2. Estimate missing values of "sc" using "mi impute" and then transform thevariable after imputation using the command "mi passive: egen zsc=std(sc)".(An aside, I am assuming here that this is the correct way to specify"zsc", as it is a function of "sc"; your input would be welcome).

Either way, when I check the summary distribution of zsc for the Mthimputation ("mi xeq 0 1 20: summ zsc"), I do not quite get back the zscvariable with a mean of 0 & sd of 1, obviously, as the imputed values arejust that, though the summaries for each imputative are reasonably close tothis value (i.e. mean~-.03, sd~.99).


So my questions are really:

A. Can I still use the zsc variable in my substantive analysis and make theassumption it still has a mean of 0 / sd of 1?


B. Is either method (1 vs 2) preferable?

C. Is there another, preferable, way of achieving z-standardisationbefore/after MI?

D. Should I be using z-standardisations at all with MI?Many thanks in advance for your help with this matter.Best wishesJames*

*   For searches and help try:
*   http://www.stata.com/help.cgi?search
*   http://www.stata.com/support/statalist/faq
*   http://www.ats.ucla.edu/stat/stata/

Prev by Date: st: survival analysis query regarding stsplit, tvc or using enter and exit options
Next by Date: Re: st: save9 and c(changed)=1 status
Previous by thread: st: survival analysis query regarding stsplit, tvc or using enter and exit options
Next by thread: st: Rectifying y-axis labels using a tiny .scheme
Index(es):
- Date
- Thread