Notice: On April 23, 2014, Statalist moved from an email list to a forum, based at statalist.org.
From | Sara Head <sara.head@gmail.com> |
To | statalist@hsphsun2.harvard.edu |
Subject | st: svyset with DHS subsample |
Date | Tue, 17 May 2011 13:47:45 -0400 |
Hi there, I am setting up Demographic and Health Survey data (from Bangladesh 2007, women's survey) for analysis in Stata 11.1. The survey is based on a two-stage stratified sample of households. Additionally, households were preselected for domestic violence questions (outcome variables in my analysis; if there was more than one eligible female per household, a respondent was randomly selected through a simple selection procedure based on the Kish Grid). I've written the svyset commands as: gen psu = v021 gen strata = v023 gen sampwt = (v005/1000000) //per DHS instruction// gen dvsampwt = d005 //no DHS instruction to adjust// svyset psu [pweight=dvsampwt], strata(strata) where : summ psu strata sampwt dvsampwt Variable | Obs Mean Std. Dev. Min Max -------------+-------------------------------------------------------- psu | 10146 180.7909 104.1327 1 361 strata | 10146 10.66499 6.26573 1 22 sampwt | 10146 1.004513 .5912652 .13565 3.592687 dvsampwt | 4195 996578.1 764690.7 110423 1.08e+07 I am unsure if this code is correct. 1) Since this is a two-stage stratified sample with further selection for violence questions, it seems the svyset command should be more along the lines of : svyset su1 [pweight=pw], strata(strata) || _n, fpc(fpc2) ? 2) I used dvsampwt instead of the sampwt variable; I can't find information in the survey report / recode map about how the dv weight was calculated. I'd like to assume it took the larger sampling design into account. Any thoughts greatly appreciated, Sara * * For searches and help try: * http://www.stata.com/help.cgi?search * http://www.stata.com/support/statalist/faq * http://www.ats.ucla.edu/stat/stata/