Bookmark and Share

Notice: On April 23, 2014, Statalist moved from an email list to a forum, based at

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

st: Regional dummies in cross section treatment effects

From   Henrik Wiig <[email protected]>
To   "[email protected]" <[email protected]>
Subject   st: Regional dummies in cross section treatment effects
Date   Mon, 26 Sep 2011 17:04:15 +0200


I got a cross section dataset of 1200 households were we assume households in a specific type of community was imposed a treatment T more or less as a natural experiement.

The survey was collected with the idea of comparing the outcome variable Y (% share between 0 and 1) from households in such treatment communities with households in controll communities.

1) If the assumption of natural experiment holds, then a ttest comparing means between HH in treatment and control should be enough to indicate whether treatment has had a significant impact or not on Y, correct? What other types of test could I use. The dependent variable Y is a share truncated on both sides, i.e. 0 and 1.

2) Sceptics would argue treatment might be correlated with variable X1 and X2. I hence run a Propensity Score Matching model in the following way

pscore T  X1 X2, pscore(ps1) comsup
attk Y T, pscore(ps1) logit comsup boot reps(500) detail

I then get the following messeage:

The balancing property is not satisfied
Try a different specification of the propensity score

but the numbers keep pouring out anyway...

Her comes the main questions: What does the balancing requierement really do? My original understanding is as follows:

Compare Y in household with treatment to Y in household in control group if the following conditions are satisfied:

1) pscore_T is more or less equal to pscore_C
2) X1_T is more or less equal to X1_C
3) X2_T is more or less equal to X2_C

However, now someone tells me that common support implies that more extreme HH are thrown out until the mean of X1_T is more or less equal to the mean of X1_C, and similar for X2_T and X2_C.

To my understanding this implies that we do not compare households with extreme values of X1 and X2 in T and C, even though they are comparable between eachother while not the rest of the households in the sample. Is this not throwing away useful information?

If I want to keep such extreme pairs, what do I do?

3) The communities are taken from 10 different districts, which might have different unobservable cultural features that might affect the outcome variable Y. The original idea was hence just to compare households T with C within each district, not between districts using PSM. Then, I supposed that all comparable pairs was aggregated into one file, giving one global result of difference in means of Y between treated and control group.

How do I do this?

Hopefully someone out there can help so that I canstart writing my paper....



*   For searches and help try:

© Copyright 1996–2018 StataCorp LLC   |   Terms of use   |   Privacy   |   Contact us   |   Site index