|  |  | 
[Date Prev][Date Next][Thread Prev][Thread Next][Date index][Thread index]
st: Why does one command "lose" observations compared to another command?
Dear Stata Listmembers:
I have come across a problem that I can't figure out. I don't know why I 
am "losing" 873 observations with one stata command versus the other -  
for mean age by race, if age>=18. Both age and race are 100% present in 
the total dataset - total (N=21004) and for age>=18 (N=11,441).
This is complex survey data and of course, I would use the survey 
commands for analysis, but it still doesn't make sense why the 
non-survey command would use less observations (even so, the point 
estimates are identical.
Any insight into this would be greatly appreciated.
Best regards,
Suzy
Here is the output for both:
svy, subpop(if age>=18): mean age, over(race)
(running mean on estimation sample)
Survey: Mean estimation
Number of strata =      28          Number of obs    =   21004
Number of PSUs   =      57          Population size  = 2.8e+08
                                   Subpop. no. obs  =   11441
                                   Subpop. size     = 2.1e+08
                                   Design df        =      29
           1: race = 1
           2: race = 2
           3: race = 3
           4: race = 4
           5: race = 5
--------------------------------------------------------------
            |             Linearized
       Over |       Mean   Std. Err.     [95% Conf. Interval]
-------------+------------------------------------------------
age          |
          1 |   46.87004   .3829944      46.08673    47.65335
          2 |   42.29357   .4316838      41.41068    43.17646
          3 |   36.81336   .5968429      35.59268    38.03404
          4 |   41.05272   .9506344      39.10846    42.99699
          5 |   41.83675   1.234968      39.31096    44.36254
--------------------------------------------------------------
mean age [pweight = mecwt4] if age>=18, over(race)
Mean estimation                     Number of obs    =   10568
           1: race = 1
           2: race = 2
           3: race = 3
           4: race = 4
           5: race = 5
--------------------------------------------------------------
       Over |       Mean   Std. Err.     [95% Conf. Interval]
-------------+------------------------------------------------
age          |
          1 |   46.87004   .2514365      46.37718     47.3629
          2 |   42.29357    .343518      41.62021    42.96693
          3 |   36.81336   .2793463      36.26579    37.36094
          4 |   41.05272   .9437899      39.20272    42.90273
          5 |   41.83675   .7637752      40.33961    43.33389
--------------------------------------------------------------
.
*
*   For searches and help try:
*   http://www.stata.com/support/faqs/res/findit.html
*   http://www.stata.com/support/statalist/faq
*   http://www.ats.ucla.edu/stat/stata/