Bookmark and Share

Notice: On April 23, 2014, Statalist moved from an email list to a forum, based at

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: st: svy subpop option and e(sample)

From   Hitesh Chandwani <>
Subject   Re: st: svy subpop option and e(sample)
Date   Thu, 26 May 2011 20:37:32 -0400


You said in an earlier message: For a large enough subpopulation, the
correct standard error for the ratio is indistinguishable from the
standard error that assumes that the sample size was fixed (Lohr,
2009, p. 135, shows the formula for a SRS).

How large is large enough? I am facing a similar problem. I extracted
my subpopulation of interest and have 300,000 observations. My
original data had 75 million observations with 61 variables. I cannot
use the entire data due to insufficient RAM on my computer (I will
need about 30-odd GB of RAM to analyze the data as a whole). I had to
ask someone with access to such a powerful machine to extract the data
for me.

If the standard errors for data this large are not going to be very
biased, I can report the variance estimation issue as a limitation of
the analysis. If the data are not large enough, then I will need to
compute dummy variables for all PSUs not represented in the extracted

I would appreciate any help on the matter.

Hitesh S. Chandwani
University of Texas at Austin
*   For searches and help try:

© Copyright 1996–2017 StataCorp LLC   |   Terms of use   |   Privacy   |   Contact us   |   Site index