[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

st: Matching, bootstrapping, sub-sampling

From	Joachim Wagner <[email protected]>
To	[email protected]
Subject	st: Matching, bootstrapping, sub-sampling
Date	Wed, 22 Apr 2009 08:22:07 +0200


Dear List:

This is both a Stata related and a statistics question.

Short version:

If bootstrapping is invalid for estimating the standard errors forthe ATT after nearest neighbor matching, does sub-sampling help, andif so, how?


Long version:

psmatch2 is rather popular among many of us (according to downloadstatistics). Although the help file warns that it is "unclear whetherthe bootstrap is valid in this context" bootstrapping is popular toestimate the standard errors of the Average Treatment Effect on theTreated (ATT), too. But the times they are (expected to be)a-changin' : In the November 2008 issue of the Econometrica AlbertoAbadie and Guido Imbens published a paper entitled "On the failure ofthe bootstrap for Matching Estimators" arguing that bootstrapstandard errors are not valid as a basis for inference with simplenearest-neighbor matching estimators with replacement and a fixednumber of neighbors. This result is popularized in a recent survey byImbens and Jeffrey Wooldridge (Recent developments in theeconometrics of program evaluation, published in the Journal ofEconomic Literature in March 2009). (For those of you who are workingin different fields let me add that both journals are among the topjournals in economics/econometrics.)

What is to be done? One suggestion found in both articles goes likethis (Imbens and Wooldridge, p. 42): "In cases where bootstrapping isnot valid, often subsampling (..) remains valid, but this has notbeen applied in practice." The authors refer to Dimitris N. Politiset al., Subsampling, New York: Springer 1999. Subsampling means usingonly a fraction, say, 75 percent, of the sample for a bootstrap draw.

Contrary to what Imbens and Wooldridge say there are some (working)papers using sub-sampling and bootstrapping to compute the standarderrors of the ATT. They use ca. 75 percent of the sample in doingso. Nobody (as yet) told me why - the authors argue that others do soas well, or they do not reveal the somewhat secret formula, or ruleof thumb, applied.


Two questions:

1. Can someone please explain in (more or less) plain English whysubsampling is a solution?

2. How large should the subsamples be, and why?

Many thanks in advance for any comments etc.

Joachim




*
*   For searches and help try:
*   http://www.stata.com/help.cgi?search
*   http://www.stata.com/support/statalist/faq
*   http://www.ats.ucla.edu/stat/stata/

Prev by Date: Re: st: Passing varlist to a Program
Next by Date: st: AW: Re: Combining Heckman and Tobit Models
Previous by thread: st: mim: xtmelogit & mim: xtmixed
Next by thread: st: Generating comparison tables
Index(es):
- Date
- Thread