Stata: Data Analysis and Statistical Software

Notice: On April 23, 2014, Statalist moved from an email list to a forum, based at statalist.org.

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

st: bootstrap test, combined bootstrap datasets, statistical properties of the bootstrap

From	Gianluca Cafiso <[email protected]>
To	Stata List <[email protected]>
Subject	st: bootstrap test, combined bootstrap datasets, statistical properties of the bootstrap
Date	Tue, 04 May 2010 10:43:39 +0200

Dear Statalisters,

I have a question about a bootstrap test I am developing. My doubtconcerns the statistical properties of the bootstrap test as I haveenvisaged it, and not how to technically implement it into Stata.


____________________________________

My statistic of interest is the product of two sub-statistics; it is:
dif_L= dif_TF * dif_GCI

where the ?dif? suffix denotes a time difference, TF is a mean value,and GCI an Entropy Index.


I am interested in testing ?Ho: dif_L>0?, against ?H1: dif_L<=0?.

The series of data used to compute ?dif_TF? has a different size (N2observations) than the series used for ?dif_GCI?(N1 observations).

If the two sub-statistics TF and GCI were generated from two series ofthe same size,this would bring me to the general case and I would simply write aprogramme togenerate the ?dif_L? overall statistic and bootstrap it as usual. Butsince this is not the case, I have thought to do the following:

1- First Bootstrap for ?dif_TF?: I generate the ?dif_TF? statistic,bootstrap it (R repetitions) and store the dataset generated by thebootstrap (generates R samples of size N2, but one dataset with Robservations for the estimated dif_TF, 1st dataset).

2- Second Bootstrap for ?dif_GCI?: I generate the ?dif_GCI?statistic, bootstrap it (R repetitions) and store the datasetgenerated by the bootstrap (generates R samples of size N1, but onedataset with R observations for the estimated dif_GCI, 2nd dataset).

3- Multiply the two dataset: Since each of the R observations in eachdataset (1 and 2) is an estimate of the statistic dif_TF and dif_GCI,by multiplying the two dataset I generate a unique dataset with Robservations for the combined statistics:


dif_Lj= dif_TFj * dif_GCIj  where j=1,?,R.

4- Use the Percentile Bootstrap of the dataset generated at point 3for the combined statistic ?dif_Lj? to the test the null hypothesis.


My doubt is the following:

Is the test based on the unique dataset (as generated at point 3)still valid? Or, for the so-generated dataset, do the usualdistributional properties -on which Bootstrap-based tests are based-not hold?



Any help, suggestion, reference on this is really welcome.

Many thanks. Gianluca Cafiso

___________________________________________________________
Dr. Gianluca Cafiso
Research Fellow, Economics Department-University of Catania.
Corso Italia 55, Catania, Italy.

e-mail: [email protected]
tel.:  +39 0957537745


----------------------------------------------------------------
Universita' di Catania - A.P.Se.Ma.
Servizio di Posta Elettronica


*
*   For searches and help try:
*   http://www.stata.com/help.cgi?search
*   http://www.stata.com/support/statalist/faq
*   http://www.ats.ucla.edu/stat/stata/

Follow-Ups:
- Re: st: bootstrap test, combined bootstrap datasets, statistical properties of the bootstrap
  - From: Stas Kolenikov <[email protected]>

Prev by Date: st: pajek2stata available from ssc
Next by Date: st: -rtnorm()- and -runningprod()- available from SSC
Previous by thread: st: pajek2stata available from ssc
Next by thread: Re: st: bootstrap test, combined bootstrap datasets, statistical properties of the bootstrap
Index(es):
- Date
- Thread