[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: st: svyset when looking at children of respondents

From	Steven Samuels <[email protected]>
To	[email protected]
Subject	Re: st: svyset when looking at children of respondents
Date	Sat, 25 Oct 2008 18:26:16 -0400

On Oct 25, 2008, at 5:11 PM, Ergo, Alex wrote:

Thanks a lot for your advice, Steve! It's extremely helpful.
There's just one point in your response I'm not sure I understand.What do you mean when you write that each child will inherit theprobability weight of his/her mother? Do I not need to adjust theweights to account for the fact that the number of childrenincluded in the analysis will vary across women?

No. The survey programs will automatically take the number ofchildren into account. If you were selecting a child at random fromeach woman's birth history you might want to re-weight.

Also, would you agree that, in your first suggested approach, Ishould keep women without children in the dataset and use the'subpop' option with the svyset command?

I disagree. I would keep only the women with children in the dataset, and I would not use -subpop-. I -subpop- only for descriptivestudies; your study has an analytic purpose. (See WE Deming, SomeTheory of Sampling", Dover Reprint, 1984). In essence, you areconditioning on the population of women with births and, also, ontheir covariate values.

Once again, many thanks for your help.
Alex

You are welcome. You haven't told us much about your study question.However, if I were doing your analysis, I would first ignore thesurvey design and fit multi-level models with -xtmelogit-. I wouldthen use results from these analyses as initial parameter estimatesin -gllamm-. With -svy: logistic- approach, you will get noinformation about between-woman variation in the probability of aninfant death. (This might be available from other sources). You runthe risk that the number of births is related to the probability ofinfant death. Sample sizes should be uninformative, but pregnancyoutcomes may not be.

I also withdraw my suggestion to include prior birth outcomes ascovariates. Higher numbers may be predictive of greater risk solelybecause they reflect the mother's inherently greater risk. Also, theeffects of exposures may be may be changed. If an exposure hasalready increased the risk of a prior infant death, its *additional*effect on a current pregnancy will be reduced. This is a tricky areato interpret: Is risk related to birth order? To the number ofchildren already in the house? Perhaps. But,these predictors maythemselves be outcomes related to infant death. Luckily, JHSPH hasfine reproductive epidemiologists and statisticians to advise you.


-Steve

________________________________________

From: [email protected] [owner-[email protected]] On Behalf Of Steven Samuels[[email protected]]

Sent: Friday, October 24, 2008 2:21 PM
To: [email protected]
Subject: Re: st: svyset when looking at children of respondents

Alex:

Too start off,  each child will inherit the probability weight,
stratum, and cluster assignments of its mother.

I see two ways of accounting for clustering by mother:

1. Add mother as a last stage of "sampling" in your -svyset- command,
but with no finite population correction. See the -svyset- help or
the Stata Survey manual.  Use Stata's -svy: logit- command.

2. Fit a multilevel logistic model (level 1: child, level 2: mother)
with -gllamm- (downloadable from SSC, manual http:// www.bepress.com/
ucbbiostat/paper160). -gllamm- accepts probability weights and a
cluster unit above than the highest level in your model. -gllamm- is
not a -svy- enabled program, so you cannot use the stratum
information in the survey design. However you can use stratum-level
covariates. Stas Kolenikov has a demo at http://www.unc.edu/~skolenik/
stata/gllamm-demo.html

If you are interested in conditional, rather than marginal,
predictions, you might choose to ignore the survey sampling weights
altogether.

I recommend the -glamm- option. With -glamm- you will be able to
model woman-level effects as fixed and random.

You should be aware of a potential bias in selecting the births for
your study data. Women may prefer to end their pregnancies with a
successful one (in some places, perhaps, with a successful male
birth). If this is the case, you should exclude a woman's last birth
from your data.   To guard against this problem, you may also include
as a covariate the outcomes of prior pregnancies and births. I would
not recommend this if you are interested in marginal, rather than
conditional, prediction. (-gllamm- will do both kinds.)

If you want to use Stata's -svy- commands, and you are combining
multiple surveys, there are other issues. I suggest that you create
"super-strata" which cross countries or survey periods with the
within-survey strata.

Good luck!

-Steve

On Oct 24, 2008, at 12:44 PM, Ergo, Alex wrote:

Dear A
I'm working with large population surveys. The individuals
interviewed are women of
reproductive age. Among many other things, the respondents provide
information relating
to their children. All this information is stored in the
respondent's record.
I would like to run some logistic regressions with infant mortality
as dependent
variable (1 if child died within the month following birth; 0
otherwise). In order
to create this dependent variable, I need to reshape the dataset
from wide to long
so as to have one live birth per record. I do not consider all the
children for
which information is available, but only those born up to 10 years
before the
mother's interview date.
In this situation, what is the most appropriate approach to account
for the complex
survey design? I thought of using the svyset command, but I'm not
sure how.  More
particularly, should I account for the clustering of live births at
the level of the
respondent and for the fact that respondents who did not have any
live birth in the
last 10 years are omitted from the regression analysis? If so, how?
Should I adjust
the sample weights when more than one child is from the same mother?
I'm using STATA 9.2.
I hope someone can help me with this.  Thanks in advance!
Alex

*
*   For searches and help try:
*   http://www.stata.com/help.cgi?search
*   http://www.stata.com/support/statalist/faq
*   http://www.ats.ucla.edu/stat/stata/


*
*   For searches and help try:
*   http://www.stata.com/help.cgi?search
*   http://www.stata.com/support/statalist/faq
*   http://www.ats.ucla.edu/stat/stata/

*
*   For searches and help try:
*   http://www.stata.com/help.cgi?search
*   http://www.stata.com/support/statalist/faq
*   http://www.ats.ucla.edu/stat/stata/


*
*   For searches and help try:
*   http://www.stata.com/help.cgi?search
*   http://www.stata.com/support/statalist/faq
*   http://www.ats.ucla.edu/stat/stata/

Follow-Ups:
- RE: st: svyset when looking at children of respondents
  - From: "Ergo, Alex" <[email protected]>

References:
- st: svyset when looking at children of respondents
  - From: "Ergo, Alex" <[email protected]>
- Re: st: svyset when looking at children of respondents
  - From: Steven Samuels <[email protected]>
- RE: st: svyset when looking at children of respondents
  - From: "Ergo, Alex" <[email protected]>

Prev by Date: st: RE: re: problem using predictnl after obtaining non-linear estimates
Next by Date: st: Re: statalist-digest V4 #3224
Previous by thread: RE: st: svyset when looking at children of respondents
Next by thread: RE: st: svyset when looking at children of respondents
Index(es):
- Date
- Thread