Bookmark and Share

Notice: On March 31, it was announced that Statalist is moving from an email list to a forum. The old list will shut down on April 23, and its replacement, statalist.org is already up and running.


[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

st: svyset with DHS subsample


From   Sara Head <sara.head@gmail.com>
To   statalist@hsphsun2.harvard.edu
Subject   st: svyset with DHS subsample
Date   Tue, 17 May 2011 13:47:45 -0400

Hi there,

I am setting up Demographic and Health Survey data (from Bangladesh
2007, women's survey) for analysis in Stata 11.1.

The survey is based on a two-stage stratified sample of households.
Additionally, households were preselected for domestic violence
questions (outcome variables in my analysis; if there was more than
one eligible female per household, a respondent was randomly selected
through a simple selection procedure based on the Kish Grid).

I've written the svyset commands as:

gen psu = v021
gen strata = v023
gen sampwt = (v005/1000000) //per DHS instruction//
gen dvsampwt = d005 //no DHS instruction to adjust//

svyset psu [pweight=dvsampwt], strata(strata)

where :
summ psu strata sampwt dvsampwt

    Variable |       Obs        Mean    Std. Dev.       Min        Max
-------------+--------------------------------------------------------
         psu |     10146    180.7909    104.1327          1        361
      strata |     10146    10.66499     6.26573          1         22
      sampwt |     10146    1.004513    .5912652     .13565   3.592687
    dvsampwt |      4195    996578.1    764690.7     110423   1.08e+07



I am unsure if this code is correct.
1) Since this is a two-stage stratified sample with further selection
for violence questions, it seems the svyset command should be more
along the lines of : svyset su1 [pweight=pw], strata(strata) || _n,
fpc(fpc2) ?
2) I used dvsampwt instead of the sampwt variable; I can't find
information in the survey report / recode map about how the dv weight
was calculated. I'd like to assume it took the larger sampling design
into account.

Any thoughts greatly appreciated,
Sara

*
*   For searches and help try:
*   http://www.stata.com/help.cgi?search
*   http://www.stata.com/support/statalist/faq
*   http://www.ats.ucla.edu/stat/stata/


© Copyright 1996–2014 StataCorp LP   |   Terms of use   |   Privacy   |   Contact us   |   Site index