[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

st: Re: increasing time burden during resampling

From   "Joseph Coveney" <>
To   <>
Subject   st: Re: increasing time burden during resampling
Date   Tue, 29 Dec 2009 12:12:03 +0900

Al Feiveson wrote:

Hi - I am running Stata 11 on Windows XP and I am implementing a form of
multiple testing using a resampling method as described in Westfall & Young's
text. Basically, for each iteration the method is: 

1)resample the data modified to a joint null situation (I use the Stata -bs-
command for this) 
2) fit a model to the resampled modified data 
3) do a bunch of tests using -test-  
4) save the test results.

These operations are identical for each iteration, yet the time per iteration
increases roughly linearly until it becomes prohibitive to continue. 

If I stop after a given number of iterations, I must close out all Stata
processes before starting again, else the bogged down state still holds.

Previous versions of this that used the estimated coefficients and standard
errors without -test- didn't seem to have this problem. So I suspect it has
something to do with repeated use of the -test- command. Maybe something needs
to be reset or cleared after each iteration?

I tried increasing the memory allocated to Stata, but that didn't seem to help.
I would appreciate any suggestions for improving efficiency here.


Could the problem lie with -bs-?  I think that -bs- keeps track of the original
configuration of the dataset with pointers or hidden variables or something.
If, between iterations, you're not allowing -bs- to restore the original
configuration, then these pointers, hidden variables or whatever will cumulate
linearly with each iteration and tie up resources.

It looks like you've already done the Westfall-Young "maxT" (previous versions
using only coefficients and their standard errors) and are now going for their
"minP" (current version using -test- results).  Is adding -test- to the routine
the only change, or could there also be some difference in how -bs- is used,
e.g., by making changes to the dataset (saving model results in the dataset via,
say, -estimates-) before -bs- has had the chance to restore the dataset and
release resources used in keeping track of its original configuration?

Joseph Coveney

*   For searches and help try:

© Copyright 1996–2017 StataCorp LLC   |   Terms of use   |   Privacy   |   Contact us   |   What's new   |   Site index