Bookmark and Share

Notice: On April 23, 2014, Statalist moved from an email list to a forum, based at

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

st: RE: Probability and non-response weights; how can I create a composite weight?

From   "Scholes, Shaun" <>
To   "" <>
Subject   st: RE: Probability and non-response weights; how can I create a composite weight?
Date   Mon, 21 Nov 2011 11:20:43 +0000

Arturo, here are my thoughts.

1) Any adjustments to sampling weights (whether through direct standardisation or poststratification) will change both the estimates and standard errors.
2) I cannot see any objection to the composite weight you suggest. I would be very interested to know why such a weight is not recommended.
3) If using individual level data then the sum of the composite weight over all individuals should equal the population total of individuals (if all women in responding households are surveyed then 
the responding households need to be weighted up to compensate for the non-responding HHs in each sampled village; and the sampled number of villages need to be weighted up to the total number of villages).
This you seem to do in your composite weight.

Hope this helps

-----Original Message-----
From: [] On Behalf Of Arturo Rodriguez
Sent: 18 November 2011 21:14
Subject: st: Probability and non-response weights; how can I create a composite weight?

Hi everybody,

I am working with survey data which was collected through a one-stage cluster sample with stratification (i.e. health clinics in a region were selected to be part of the study first and then villages surrounding those clinics were randomly sampled, all households in the selected villages and all women in those households were then surveyed).

My question is regarding the proper use of a composite design weight.
Here are the details:

I have calculated probability weights for each h_th stratum ... (i.e.
pw = total_villages_h / sampled_villages_h) I have also calculated unit non-response weights for each h_th stratum ... (i.e. unrw = total_eligible_households_h / sampled_households_h)

I then created a composite weights (sw) by multiplying pw * unrw = sw and used the following command to define my survey data:
svyset village_number [pweight = sw], strata(clinic) fpc(total_villages_h)

I have read that using a composite weight (sw = pw*unrw) is not recommended and that I should weight my sample first by pw and then weight again by unrw.
My first question is: Can STATA do this automatically? How? If not, do I have to multiply each of my 300+ variables by pw and then by unrw?
What is the most efficient way
to do this?

I asked STATA to calculate estimated population totals for each of the areas surrounding the clinics but I am getting figures that are very low compared to what is known to be true. Is there anything I can do to correct this statistically? Or is it just that my sample is not representative?

Will adding poststrata and postweight to my svyset make any difference in obtaining better "population around clinic" point estimates? Or will it only change my STD ERRORS?

Thanks in advance for all the wisdom.
*   For searches and help try:

*   For searches and help try:

© Copyright 1996–2018 StataCorp LLC   |   Terms of use   |   Privacy   |   Contact us   |   Site index