Stata: Data Analysis and Statistical Software

Notice: On April 23, 2014, Statalist moved from an email list to a forum, based at statalist.org.

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: st: Missing Values in Multiple Imputation Data Sets

From	Richard Williams <[email protected]>
To	[email protected], statalist <[email protected]>
Subject	Re: st: Missing Values in Multiple Imputation Data Sets
Date	Tue, 05 Feb 2013 10:09:06 -0500

One easy thing to check: Stata has both hard andsoft missings. Soft missings (coded .) can beimputed, hard missings (coded .a through .z)cannot. So check to see if you are using hardmissings, and if so determine whether it isreasonable to recode them as soft missings. Youmay not want to recode if the hard missingsreflect things like Not Applicable responses.

Beyond that the messages are pretty clear. Youcan increase the matsize, make sure you aren'ttreating a continuous variable as categorical, etc.

Especially if you have only been using Stata for3 weeks, I would recommend starting simple andbuilding up. This looks like a pretty complicatedmodel. Start with fewer variables and build up.It may be easier to identify which variables are causing you grief.


At 06:55 AM 2/5/2013, Patti Fritz wrote:

Hello All,
I am a novice to Stata, having just begun usingit 3 weeks ago. Therefore, my questions arelikely very naive. I am trying to conductmultiple imputation of a data set that primarilycontains positively skewed/frequentlyover-dispersed ordinal predictor variables(i.e., history of various forms offamily-of-origin aggression assessed on 3- or4-point scales from not at all to a lot); myother predictors consist of demographicvariables (e.g., age, gender, income, number ofmarriages, marital status). My outcome variablesare dichotomous (partner violence) and thesample size is N = 3,635. There is substantialmissing data on some of the items (e.g., 21%),and complete data across all variables is onlyfound among 556 cases). When I attempt toconduct mi impute chained commands (I've triedpoisson, and nbreg), there are still somemissing values in some of the imputed data sets.Specifically, every variable still has somemissing data across the 20 imputed data sets. Is there!
  a way to alleviate this problem?
I can get the mi impute mvn command to work, butafter reading some of the literature, I'm notsure this is the best approach. In addition, Ihaven't been able to get the mi chained (ologit)command to work because I get the following error:
matsize too small
Â Â Â You have attempted to create a matrixwith too many rows or columns or attempted to fit aÂ Â Â model with too many variables.Â You needto increase matsize; it is currently 800.Â Use set
Â Â Â  matsize; see help matsize.
Â Â Â If you are using factor variables andincluded an interaction that has lots of missingÂ Â Â cells, either increase matsize or setemptycells drop to reduce the required matrix size;
Â Â Â  see help set emptycells.
Â Â Â If you are using factor variables, youmight have accidentally treated a continuous variableÂ Â Â as a categorical, resulting in lots ofcategories.Â Use the c. operator on such variables.
error occurred during imputation of g2sabuse_s05 ipvabuse_s05 s2gabuse
_g04 ipvabuse_g04
f2sslap_s94 m2sslap_s94 f2gslap_s94 m2gslap_s94s2gslap_s94 g2sslap_s94 p2pslap_s94 f2sslap_s05f2sabuse_s05 m2sslap_s05 m2sabuse_s05f2gslap_g04 f2gabuse_g04 cp260hec cb103redr gb103redr on m
= 1
r(908);
I only have Stata/IC 12.1 on my laptop, but I'mgoing to see if the computer labs at work havethe SE version. If yes, I'll see if I can getthe ologit command to work. Regardless, I'minterested in finding out why missing values arefound in the imputed data sets.
Thanks in advance for your time and assistance. They are greatly appreciated!

Cheers,

Patti



Patti A. Timmons Fritz, Ph.D., C. Psych.
Associate Professor
Department of Psychology
University of Windsor
Windsor, Ontario N9B 3P4
Canada



*
*   For searches and help try:
*   http://www.stata.com/help.cgi?search
*   http://www.stata.com/support/faqs/resources/statalist-faq/
*   http://www.ats.ucla.edu/stat/stata/


-------------------------------------------
Richard Williams, Notre Dame Dept of Sociology
OFFICE: (574)631-6668, (574)631-6463
HOME:   (574)289-5227
EMAIL:  [email protected]
WWW:    http://www.nd.edu/~rwilliam


*
*   For searches and help try:
*   http://www.stata.com/help.cgi?search
*   http://www.stata.com/support/faqs/resources/statalist-faq/
*   http://www.ats.ucla.edu/stat/stata/

References:
- st: Missing Values in Multiple Imputation Data Sets
  - From: Patti Fritz <[email protected]>

Prev by Date: st: nl command - error#130 expression too long
Next by Date: Re: st: nl command - error#130 expression too long
Previous by thread: st: Missing Values in Multiple Imputation Data Sets
Next by thread: st: Stata13 Wishlist- dealing with large number of fixed effects and dummy variables
Index(es):
- Date
- Thread