[Date Prev][Date Next][Thread Prev][Thread Next][Date index][Thread index]

Re: st: GEE and weighting

From	"Stas Kolenikov" <[email protected]>
To	[email protected]
Subject	Re: st: GEE and weighting
Date	Thu, 5 Apr 2007 09:42:34 -0500

I guess if you are able to track the selection probabilities for
individuals really well, you could use

svyset health_center [pw=weight]
svy: logit whatever

Or, if you have selection probabilities for both health centers and
individuals within those, you could use -gllamm- for two-level
modeling with weights for both levels.

-xtgee- would give you some gain in precision over -logit- if the
correlation structure is specified correctly, but other than that,
-logit- should do fine, too. The -svyset- here is effectively
analogous to specifying -, cluster(cpihfano)- with the
(quasi-)likelihood commands, which is another way to correct for
correlation/clustering in your data.

On 4/4/07, Gwyneth Vance <[email protected]> wrote:

What does one do with binary, clustered data that must be weighted?

I am working on a project using Stata 9.  The goal is to develop models
of various binary outcome measures pertaining to improved counseling by
health providers.  I am, however, running into several challenges.  The
first is that the sample taken was a cluster sample.  Individuals were
interviewed at various health centers, so the health center was the
primary sampling unit-health centers were selected for participation and
then the individual study participants.  Originally, I thought that I
could use regression with GEE to account for the clustering in the data;
however, I discovered a second problem that may limit my ability to do
so.  Within each cluster, the individuals sampled were not sampled in
equal proportion on an important variable, which was provider training.
In other words, clients who received counseling from trained providers
were over-represented in the sample.

I thought the solution would be to apply a sample weight (pweight
command in Stata), but Stata does not allow the pweight to vary by unit
within a panel.  That is, the individuals within a cluster are not
allowed to have their own weight, only the panel or cluster may be
weighted.  Below are the commands I keyed in, and the error message that
I received.

Command:
iis cpihfano
xtgee cpi41 cpi2 ce39 [pweight = weight], family(binomial 1) link(logit)
corr(exchangeable)

Error Message:
weight must be constant within cpihfano
r(199);

I have done a bit of research on the topic, but am getting no where
other to discover that this problem may be; as yet, unsolved (please
refer to this link for further explanation
http://www.stata.com/support/faqs/stat/xtweight.html ).

So, what can one do with binary, clustered data that should be weighted?
Does anyone know if progress has been made on this front?  What
solutions have others devised in similar situations? I can provide more
detail if necessary.

Gwyneth

*
*   For searches and help try:
*   http://www.stata.com/support/faqs/res/findit.html
*   http://www.stata.com/support/statalist/faq
*   http://www.ats.ucla.edu/stat/stata/


--
Stas Kolenikov
http://stas.kolenikov.name
*
*   For searches and help try:
*   http://www.stata.com/support/faqs/res/findit.html
*   http://www.stata.com/support/statalist/faq
*   http://www.ats.ucla.edu/stat/stata/

Follow-Ups:
- RE: st: GEE and weighting
  - From: "Gwyneth Vance" <[email protected]>

References:
- st: GEE and weighting
  - From: "Gwyneth Vance" <[email protected]>

Prev by Date: Re: st: RE: RE: histogram
Next by Date: RE: st: dataset containing duplicate variables names
Previous by thread: st: GEE and weighting
Next by thread: RE: st: GEE and weighting
Index(es):
- Date
- Thread