Stata: Data Analysis and Statistical Software

Notice: On April 23, 2014, Statalist moved from an email list to a forum, based at statalist.org.

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

RE: st: Difference of means and t-test

From	Richard Williams <[email protected]>
To	[email protected]
Subject	RE: st: Difference of means and t-test
Date	Tue, 15 Jun 2010 08:43:22 -0500

At 02:25 PM 6/14/2010, Nick Cox wrote:

I don't think our views are contradictory. It is clearly true that you
can get results from summary statistics alone. But erecting fake
Gaussians with those summaries is not equivalent to reconstructing the
original data. That is my point, and no more. It is akin to arguments at
a higher level about "sufficient statistics". If something is normal,
then it is sufficient to know mean and sd, but there isn't a reverse
argument.

At 11:19 AM 6/14/2010, Nick Cox wrote:
>-- except that will surely overstate the strength of the conclusions,
in
>so far as the real distributions are unlikely to be exactly Gaussian.

Still, it is incorrect to say that constructing fake Gaussians "willsurely overstate the strength of the conclusions." The p values arebased on various assumptions, e.g. normally distributed,homoskedastic errors. If the assumptions are wrong, the p values arewrong. But, whether the assumptions are correct or not, thecalculation of the test statistics and coefficients are the same,i.e. for regression-type problems if you've got the means,correlations and standard deviations there are all sorts of thingsyou can compute without having the rest of the data. You run aregression or Anova with the "fake" data and you'll get the exactsame results as with the real data.

Of course, without having the original data, you can't, say, dodiagnostic tests of assumptions, analyze subsets of the data, add anx^2 term, etc. So, yes, you greatly prefer having the real data! Butif the real data aren't available there is still a lot you can do. Idon't know why the original poster was using ttesti instead of ttest,but if it was because he only had summary statistics available to himthen it would be possible for him to run an Anova the way I suggestedand the numbers he would get would be the same as if he had the realdata. There probably wouldn't be a whole lot else he could dothough, e.g. the predict command and most other post-estimationcommands won't be of much use without the real data.



-------------------------------------------
Richard Williams, Notre Dame Dept of Sociology
OFFICE: (574)631-6668, (574)631-6463
HOME:   (574)289-5227
EMAIL:  [email protected]
WWW:    http://www.nd.edu/~rwilliam

*
*   For searches and help try:
*   http://www.stata.com/help.cgi?search
*   http://www.stata.com/support/statalist/faq
*   http://www.ats.ucla.edu/stat/stata/

Follow-Ups:
- RE: st: Difference of means and t-test
  - From: "Nick Cox" <[email protected]>

References:
- st: Difference of means and t-test
  - From: [email protected]
- Re: st: Difference of means and t-test
  - From: Richard Williams <[email protected]>
- RE: st: Difference of means and t-test
  - From: "Nick Cox" <[email protected]>
- RE: st: Difference of means and t-test
  - From: Richard Williams <[email protected]>
- RE: st: Difference of means and t-test
  - From: "Nick Cox" <[email protected]>

Prev by Date: Re: st: Estimation results
Next by Date: Re: st: RE: interacting two covariates
Previous by thread: RE: st: Difference of means and t-test
Next by thread: RE: st: Difference of means and t-test
Index(es):
- Date
- Thread