
st: Question about svyset command

From	[email protected]
To	[email protected]
Subject	st: Question about svyset command
Date	Thu, 19 Feb 2009 00:04:19 -0500

I'm a beginner Stata user and have a question about the svyset command in Stata that I hope someone can help me with.

For some background, I'm fitting a logistic regression model that examines the likelihood of either a plaintiff or defendant filing a post-trial motion. The database I'm working with is the Civil Justice Survey of State Courts (CJSSC). The CJSSC provides case-level data for all tort, contract, and real property trials concluded in a sample of 46 of the nation's 75 most populous counties in 2005. Data are collected on about 8,000 trials in these 46 counties, which are weighted to represent about 10,500 trials concluded in the nation's 75 most populous counties. I understand that one of the nice features of Stata is that it allows you to take into account the sampling structure of a dataset when doing logistic regression modeling. Here is the Stata code that I used to take into account the sampling structure of these civil trial data:

svyset sitecode [pweight=bwgt0], strata(strata) fpc(fpc1) || su2, fpc(fpc2)


Where
sitecode = County where the civil trial took place

bwgt0 = Weights to weight the data from the 46 sampled counties to the 75 most populous counties

strata = Strata where the counties are located. The dataset has 5 strata

fpc1 = The probability of a county appearing in the sample. For example, a county with a weight of 2 would have a 50% probability of appearing in the sample

su2 = Unique identifier for each trial that occurred in the 46 counties

fpc2 = 1 for all 8,000 trials disposed in the 46 counties. I gave fpc2 a value of 1 because I wanted to tell Stata that the trials had a 100% probability of showing up in these 46 counties.

I think that I got the part of this programming that deals with the first level of the sample design correct. It's the second level that I'm having some problems with. At the second level of the sample design, I'm trying to correct for the fact that I have data for every civil trial concluded in the 46 counties. Basically, I want to tell Stata that part of this sample is actually a census of all trials concluded in the 46 counties in 2005. I understand Stata has a finite population correction option that takes into account the census-like format of these data. The logistic regression results were the same whether or not I included the 2nd stage of the sample design. I think this is telling me that Stata is not correcting for the census-like aspect of this sample.

Can anyone give me some guidance as to whether I'm correctly taking into account the sampling structure of these data? In particular, I would like to know whether I'm using the fpc2 factor correctly. Any assistance you could give on this matter would be very much appreciated.
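
For readers who want to reproduce the comparison described above, here is a minimal sketch of fitting the same model under the one-stage and two-stage declarations and putting the standard errors side by side. The outcome postmotion and the covariate x1 are hypothetical names, not variables from the CJSSC file; the design variables are the ones defined above. As far as I know, fpc() treats values at or below 1 as sampling rates, so fpc2 = 1 would be read as a 100% sampling rate at the second stage. In that case the factor (1 - fpc2) is zero and the second stage would add nothing to the variance, so identical results may be what the census-like design implies rather than a sign that the correction was ignored.

```stata
* Sketch only: postmotion and x1 are hypothetical variable names.

* Design A: first stage only, with its finite population correction
svyset sitecode [pweight=bwgt0], strata(strata) fpc(fpc1)
svydescribe
svy: logit postmotion x1
estimates store stage1

* Design B: add the second stage; fpc2 = 1 is assumed to be read
* as a sampling rate of 100% (a census within each sampled county)
svyset sitecode [pweight=bwgt0], strata(strata) fpc(fpc1) || su2, fpc(fpc2)
svydescribe
svy: logit postmotion x1
estimates store stage2

* Compare coefficients and standard errors from the two declarations
estimates table stage1 stage2, se
```

The svydescribe output after each svyset is a quick way to confirm how many stages, strata, and sampling units Stata thinks the design has.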

Thanks
Thomas Cohen


