Dear Phil

On Mar 18, 2010, at 6:20 PM, Michael Mitchell wrote:

Here is the tabulation of race and sex by race.

<snip>

. tab sex race, missing

    1=male, |        1=white, 2=black, 3=other
   2=female |     White      Black      Other          . |     Total
-----------+--------------------------------------------+----------
      male |     1,676        193         35         34 |     1,938
    female |     1,824        238         34         37 |     2,133
-----------+--------------------------------------------+----------
     Total |     3,500        431         69         71 |     4,071

<snip>

But now I want to analyze just the sub-population of males (sex==1)
and it shows that the number of obs is now 4037 (see below). How can
the number of observations increase when adding a -subpop()- option?
There are suddenly 37 extra observations. Note this corresponds to
the number of females with a missing race.

. svy , subpop(if sex==1): tab race, count format(%13.2fc)
(running tabulate on estimation sample)

Number of strata   =         1        Number of obs      =      4037
Number of PSUs     =      4037        Population size    = 7932333.9
                                      Subpop. no. of obs =      1904
                                      Subpop. size       = 3780355.3
                                      Design df          =      4036

This is as it should be, since information about race is not required
on those observations outside of the subpopulation. Remember,
observations outside the subpopulation are relevant only insofar as
they reflect the variability in the proportion(s) of sampled PSUs with
at least one observation in the subpopulation.

In fact, at one point Stata did not behave properly in this regard;
this was fixed in an update to Stata 10 on 02apr2008 (see -help
whatsnew10- and search for "02apr2008").

-- Phil
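
The counts above can be checked by hand. The following is only a sketch, assuming the variables are named sex and race as in the tabulation and that sex has no missing values (the table shows none). The estimation sample should consist of every observation outside the subpopulation plus the males with a nonmissing race:

    * all females (2,133) plus males with nonmissing race (1,938 - 34 = 1,904)
    count if sex != 1 | !missing(race)

    * the subpopulation itself: males with nonmissing race
    count if sex == 1 & !missing(race)

With the table above, the first count should be 2,133 + 1,904 = 4,037 (equivalently 4,071 - 34), and the second should be 1,904, matching "Number of obs" and "Subpop. no. of obs" in the -svy- output.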

