Stata The Stata listserver
[Date Prev][Date Next][Thread Prev][Thread Next][Date index][Thread index]

st: Corr2data questions

From   Richard Williams <>
Subject   st: Corr2data questions
Date   Mon, 22 Dec 2003 09:17:53 -0500

1. The corr2data command is handy if, say, there is a published analysis that includes the means, correlations and sds, and you want to replicate or modify the work (e.g. add or drop variables). I do this in some classroom exercises. At the same time, you have to remember that these are not the original data, and you are very limited in what you can do, e.g. you can't analyze subsets of the data, compute interaction terms, etc. All you can do is basic correlational and regression analysis with no modifications of the data (correct?). If I had to invent a term, I would call a data set created by corr2data a pseudo-replication of the original data, but is there a standard term already in use?

2. Is there any reason the N for the corr2data command has to be the same as in the original data? I did a little experiment where I created a data set with 200,000 cases and ran a regression. I then created a 2nd data set with N = 200 and ran a regression specifying fw=1000. Results were virtually identical. Anyway, if the original data set was monstrous, this might be a way of saving disk space and computing time.

Richard Williams, Associate Professor
OFFICE: (574)631-6668, (574)631-6463
FAX: (574)288-4373
HOME: (574)289-5227
EMAIL: Richard.A.Williams.5@ND.Edu
WWW (personal):
WWW (department):

* For searches and help try:

© Copyright 1996–2017 StataCorp LLC   |   Terms of use   |   Privacy   |   Contact us   |   What's new   |   Site index