st: conditional estimates for survey data

Subject   st: conditional estimates for survey data
Date   Thu, 24 Oct 2002 14:38:59 -0400 (EDT)


I am analyzing a data set containing national survey data with over 7.5
million observations. According to the Stata manual and Cochran's book on
sampling, I should use my entire data set to obtain unconditional
estimates for the standard error. Any splitting, including the use of
commands such as "if", would generate conditional estimates. Given that
running the analysis with over 7.5 million observations seems to be
computationally challenging, I am now trying to understand the meaning of
the conditional estimates I would obtain by restricting my population only
to those with the condition of interest. Here are my questions:

1. How do I interpret conditional estimates of standard error? Are they
generalizable to the target population (individuals with the condition of
interest in the entire country)?

2. Would these estimates be smaller than the ones obtained if it were
possible to make the estimates based on the entire patient population?

3. Is a conditional standard error necessarily biased relative to
the unconditional one?

4. Can weights be adjusted for a given subpopulation? If so, I would
appreciate any references on the subject.
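
To make the comparison concrete, here is a minimal sketch of the two
approaches as I understand them, assuming the svy prefix syntax of recent
Stata releases. The variable names (psu, stratum, wt, hascond, outcome)
are hypothetical placeholders, not the names in my data set.

```stata
* Declare the survey design (all variable names are hypothetical)
svyset psu [pweight = wt], strata(stratum)

* Conditional approach: the "if" qualifier drops observations without
* the condition before the variance is estimated
svy: mean outcome if hascond == 1

* Subpopulation approach: every observation stays in the design;
* hascond is a 0/1 indicator marking the subpopulation of interest
svy, subpop(hascond): mean outcome
```

My understanding is that both commands should give the same point
estimate and that only the standard errors may differ, which is what
questions 1 to 3 are about.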

Many thanks,


Ricardo Pietrobon, MD
Duke University Medical Center

