Stata: Data Analysis and Statistical Software

Notice: On April 23, 2014, Statalist moved from an email list to a forum, based at statalist.org.

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

st: RE: Outputs as inputs - how to efficiently process a series of routines?

From	Nick Cox <[email protected]>
To	"'[email protected]'" <[email protected]>
Subject	st: RE: Outputs as inputs - how to efficiently process a series of routines?
Date	Tue, 31 Aug 2010 10:05:18 +0100

-statsby- is your friend. See the manual entry in [D] and also 

SJ-10-1 gr0045  . . . . . . . . . . . . . Speaking Stata: The statsby strategy
        . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .  N. J. Cox
        Q1/10   SJ 10(1):143--151                                (no commands)
        demonstrates the use of statsby to prepare a reduced
        dataset for subsequent graphing

Nick 
[email protected] 

Philip Burgess

This is a data management question rather than a statistical issue....

I have a dataset which is stratified by 4 variables:

1. Sample - kids, adults, or older persons;
2. Year - 2006, 2007, 2008 or 2009;
3. Treatment setting - inpatient, residential or ambulatory;
4. Status - baseline, follow-up or change.

Thus, the overall structure is a 3 x 4 x 3 x 3 = 108 unique strata.

The outcome variable is usually a summary score and I need to estimate
various statistics (say mean, SD, percentiles) for each of the strata;
and I also need to estimate the internal consistency of the outcome
measure with Cronbach's Alpha.

I need to use the estimated statistic(s) as 'input' in a variety of
other calculations (e.g., calculate overall Effect Size using the
mean, the SD; other calculations require Alpha).

I know these statistics are available immediately after execution - I
can get these using the command - return list - . After that, I can
generate a new variable - gen double alpha = r(alpha)- and then run -
collapse (first) alpha - to get the required statistic(s).

The problem is that I have 108 strata and whereas I can 'manually'
code each of these variants (and save as temp files, then - append -
all 108 to save a single file - this is both inefficient and the risk
of error (i.e., me!) is high.

Is there a way around this?

I should add that I have mainly used SPSS for these kinds of data
management issues. Theoretically, using SPSS commands that 'split' the
data file by the required partitions and then using its Output
Management System will achieve the required output. This used to work
with earlier versions but not the current release - hence my efforts
with Stata.


*
*   For searches and help try:
*   http://www.stata.com/help.cgi?search
*   http://www.stata.com/support/statalist/faq
*   http://www.ats.ucla.edu/stat/stata/

Follow-Ups:
- Re: st: RE: Outputs as inputs - how to efficiently process a series of routines?
  - From: Philip Burgess <[email protected]>

References:
- st: Outputs as inputs – how to efficiently process a series of routines?
  - From: Philip Burgess <[email protected]>

Prev by Date: Re: st:about the fixed-effects model
Next by Date: RE: st: Re: FORTRAN
Previous by thread: st: Outputs as inputs – how to efficiently process a series of routines?
Next by thread: Re: st: RE: Outputs as inputs - how to efficiently process a series of routines?
Index(es):
- Date
- Thread