Stata: Data Analysis and Statistical Software

Notice: On April 23, 2014, Statalist moved from an email list to a forum, based at statalist.org.

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: st: subpops vs. over & lincom t vs. regress t in svyset data

From	Nick Cox <[email protected]>
To	[email protected]
Subject	Re: st: subpops vs. over & lincom t vs. regress t in svyset data
Date	Mon, 23 Jan 2012 09:40:25 +0000

The original post was on 3 January

http://www.stata.com/statalist/archive/2012-01/msg00350.html

and Shaun Scholes replied on 10 January:

http://www.stata.com/statalist/archive/2012-01/msg00348.html
http://www.stata.com/statalist/archive/2012-01/msg00350.html

Nick

On Mon, Jan 23, 2012 at 3:13 AM, Michael Costello
<[email protected]> wrote:
> I originally sent this e-mail three weeks ago, but didn't receive a
> response.  I was very much hoping for one, so I thought I would
> repost.
> -M.
>
> I'm working with many many similar survey weighted datasets of
> international education data.  Often I am tasked with creating tables
> of statistics (means, variances, counts, t-statistics, effect size,
> etc.) for many subpopulations and over several phases (baseline,
> midterm, final).
>
> We had been calculating our statistics using -svy: varname,
> over(subpops)- rather than using many -svy, subpop(subpops): mean
> varname- functions in quick succession, as the returned values were
> equal.  In a more recent database, the values are not equal, and I'm
> wondering why that is.  The subpopulation I was working with was
> gender (female=1, male=0).  Could the discrepancies be due to the
> handful of observations with gender = . (missing), or is there some
> other difference in the calculations?  It appears that using the
> -subpop- option treats those observations as non-existent.  How does
> -over- treat them?
>
> I'm also trying to find out the difference between the t-statistic
> that is printed when I do a -lincom- function and the t-statistic that
> is printed when I do a regress function.  For example:
>
> svy: regress score gender
> vs.
> svy: mean score, over(gender)
> lincom [score]Male - [score]Female
>
> I believe that the regression function uses a pooled standard error
> SE, while the -lincom- uses an unpooled calculation, but I was hoping
> for some confirmation on that.
>
> Thanks so much for all your help and advice!  You folks are always so
> helpful and informative.
>

*
*   For searches and help try:
*   http://www.stata.com/help.cgi?search
*   http://www.stata.com/support/statalist/faq
*   http://www.ats.ucla.edu/stat/stata/

References:
- st: subpops vs. over & lincom t vs. regress t in svyset data
  - From: Michael Costello <[email protected]>
- Re: st: subpops vs. over & lincom t vs. regress t in svyset data
  - From: Michael Costello <[email protected]>

Prev by Date: Re: st: How to loop a <clean> script over multiple files?
Next by Date: Re: st: Calculating an Index
Previous by thread: Re: st: subpops vs. over & lincom t vs. regress t in svyset data
Next by thread: Re: st: subpops vs. over & lincom t vs. regress t in svyset data
Index(es):
- Date
- Thread