From   Steven Samuels <>
Date   Thu, 8 Feb 2007 14:03:53 -0500

I think that Stata will also work if strata and PSUs do overlap across years. For example, ordinarily "YEAR" would be a first-level stratum, and PSUs would be sampled independently within each year. However, if a PSU sampled in one year is re-used in subsequent years, then the YEAR of PSU selection would still be the highest-level stratum, but the year in which the observation is made would be the one to use for the analysis of time trends.
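A rough sketch of how that re-use case might be declared, assuming hypothetical variable names: selyear (year the PSU was selected), stratum and psu (the original design codes), wt (the sampling weight), obsyear (year the observation was made), and y (the outcome). None of these names come from the actual data.

```stata
* Hypothetical variable names throughout; adapt to the real data set.
* Strata are defined by year of PSU selection crossed with the original stratum.
egen design_strata = group(selyear stratum)

* A re-used PSU keeps a single identifier across all the years it is observed.
egen design_psu = group(selyear stratum psu)

svyset design_psu [pw=wt], strata(design_strata)

* Time trends are then analysed by the year of observation, not of selection.
svy: mean y, over(obsyear)
```

Because a re-used PSU keeps one identifier, its observations from different years stay in the same cluster, so the variance estimates allow for the correlation between them.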


On Feb 8, 2007, at 1:22 PM, Austin Nichols wrote:

Hillel Alpert --
Stata should work fine, assuming your stratum and psu codes do not
overlap across years (I assume you are discussing repeated cross
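sections, not panel data). If they do, you will want to redefine the svy
variables by appending the year to the stratum and psu codes. Run svyset
with no arguments first so that r() holds the current settings, and save
what you need later in locals, since other commands can overwrite r(), e.g.
svyset
local psu `r(su1)'
local wvar `r(wvar)'
egen newstrata=group(`r(strata1)' year)
egen newpsu=group(`psu' year)
svyset newpsu [pw=`wvar'], strata(newstrata)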
The weights should be appropriate without adjustment, I think.
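
Once the pooled data are svyset, the time-trend analysis itself might look something like the following. This is only a sketch: y is a hypothetical outcome variable, and year is assumed to be a numeric survey-year variable.

```stata
* Assumes the pooled data have already been svyset; y is a hypothetical outcome.
svy: mean y, over(year)      // design-based estimate of the mean in each year
svy: regress y year          // linear trend in y with design-based standard errors
```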

On 2/8/07, Hillel Alpert <> wrote:

Could someone please advise whether and how Stata can handle national survey data with multiple years and complex survey sample design?

A single year of data can readily be handled in Stata using the weight and strata variables. Can we analyze the appended data over two or more years the same way, or is a program such as SUDAAN needed? The analysis will be of time trends, and we assume the sampling was not identical from year to year.

Hillel Alpert
