[Date Prev][Date Next][Thread Prev][Thread Next][Date index][Thread index]

Re: st: statsby slowness

From	"E. Paul Wileyto" <[email protected]>
To	[email protected]
Subject	Re: st: statsby slowness
Date	Tue, 14 Aug 2007 06:40:00 -0400

For large numbers of "by groups," I have found it works better/faster to write my own script that repeatedly pulls the original file from a tempfile, and then uses a keep statment to restrict the analysis. Statsby spends too much time looking to see who is in.

P

David Airey wrote:

.

At what point does one give up using statsby? With just three variables in my data set,

ssrownum, iso_VSV, expression

the following command does OK with 1000 by groups (< 20 cases in a group), but is not useable with 20,000 by groups.

statsby n=r(N) spearman=r(rho) p=r(p), by(ssrownum): spearman iso_VSV expression

Why?

I posted something similar a long time ago compared speeds of ttest with if versus in and versus regress, but I'm not happy at the moment.

--
David C. Airey, Ph.D.
Research Assistant Professor

*
* For searches and help try:
* http://www.stata.com/support/faqs/res/findit.html
* http://www.stata.com/support/statalist/faq
* http://www.ats.ucla.edu/stat/stata/

--
E. Paul Wileyto, Ph.D.
Assistant Professor of Biostatistics
Tobacco Use Research Center
School of Medicine, U. of Pennsylvania
3535 Market Street, Suite 4100
Philadelphia, PA 19104-3309

215-746-7147
Fax: 215-746-7140
[email protected]
http://mail.med.upenn.edu/~epw/
*
* For searches and help try:
* http://www.stata.com/support/faqs/res/findit.html
* http://www.stata.com/support/statalist/faq
* http://www.ats.ucla.edu/stat/stata/

References:
- st: statsby slowness
  - From: David Airey <[email protected]>

Prev by Date: Re: st: Effects of Restrictions on R2
Next by Date: Re: st: Spline regressions and 2sls Estimators
Previous by thread: st: statsby slowness
Next by thread: st: statsby slowness
Index(es):
- Date
- Thread