Stata: Data Analysis and Statistical Software

Notice: On April 23, 2014, Statalist moved from an email list to a forum, based at statalist.org.

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: st: Comparing Chi2/L2 in different samples using bootstrap

From	Steven Samuels <[email protected]>
To	[email protected]
Subject	Re: st: Comparing Chi2/L2 in different samples using bootstrap
Date	Mon, 6 Dec 2010 09:22:33 -0500

Dmitry:

Contrary to your belief, it is very likely that the data sets can bepooled: just incorporate survey year into the stratum definition.Write a command like "gen new_stratum = group(year stratum)". Then -svyset- the combined samnple with the new stratum variable, but withPSUs and weights from from the individual years.

Differing yearly design effects and sample sizes have no bearing onthe validity of this approach. There might be some difficulty if thetypes and sizes of PSUs changed greatly between years. Also, you willhave to take special steps if the surveys were rotating panels.


Steve

Steven J. Samuels
[email protected]
18 Cantine's Island
Saugerties NY 12477
USA
Voice: 845-246-0774
Fax:    206-202-4783




On Dec 6, 2010, at 8:44 AM, Dmitriy Poznyak wrote:

Hello all,

I am estimating three identical multinomial models with bootstrap forthe different years of survey data, for instance. 1991, 1999 and 2007.Aside from comparing predicted probabilities, which I assume shouldn'tpose any problem, I need to compare Chi2/L2 coefficients for thedifferent variables in the model. The rationale for doing this, isthat the fit of the individual predictors (e.g. social-demographicstuff) declines through time. Here's where the question arises.Clearly, samples in different years have different size, and perhapsdifferent design effects, and so on.

In order to possibly address these issues I ran the bootstrappedmodels with the same number of iterations in each case:bootstrap, reps(2000) force: mlogit vote5 x y z ... ,base(1) cl(zip),[pweight=weight1], rrrNext, I test the effect of the predictors: test x; test z, etc.Again, the models' specification is identical for all years; whatdiffers is the sample size and design.

Considering the bootstrap method being used, will it be possible tocompare Chi2/L2 and perhaps pseudo R2 coefficients for differentsamples in this case, and, if not, what would be my strategy. Notethat pooling datasets is not feasible due to several reasons, likeweighting, etc.


Thanks for your suggestions,
Dmitriy
*
*   For searches and help try:
*   http://www.stata.com/help.cgi?search
*   http://www.stata.com/support/statalist/faq
*   http://www.ats.ucla.edu/stat/stata/

*
*   For searches and help try:
*   http://www.stata.com/help.cgi?search
*   http://www.stata.com/support/statalist/faq
*   http://www.ats.ucla.edu/stat/stata/

References:
- st: Comparing Chi2/L2 in different samples using bootstrap
  - From: Dmitriy Poznyak <[email protected]>

Prev by Date: st: Comparing Chi2/L2 in different samples using bootstrap
Next by Date: Re: st: Comparing Chi2/L2 in different samples using bootstrap
Previous by thread: st: Comparing Chi2/L2 in different samples using bootstrap
Next by thread: Re: st: Comparing Chi2/L2 in different samples using bootstrap
Index(es):
- Date
- Thread