Bookmark and Share

Notice: On April 23, 2014, Statalist moved from an email list to a forum, based at statalist.org.


[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

st: Losing Observations in Logit


From   Isobel Williams <[email protected]>
To   "[email protected]" <[email protected]>
Subject   st: Losing Observations in Logit
Date   Sat, 1 Mar 2014 13:48:32 +0000

Dear All,

I am running a logistic regression, and then pairing them using propensity score matching. Within the dataset, I have divided the data into groups A, B, and C:

generate groupA=1 if zona==1 
replace groupA=1 if cla_tam==1
replace groupA=1 if h_selec==1
replace groupA=0 if zona==0
replace groupA=0 if cla_tam==2
replace groupA=0 if cla_tam==3
replace groupA=0 if h_selec==0
replace groupA=0 if h_selec==9 


generate groupB=1 if zona==1
replace groupB=1 if cla_tam==1
replace groupB=1 if h_selec==0
replace groupB=0 if zona==0
replace groupB=0 if cla_tam==2
replace groupB=0 if cla_tam==3
replace groupB=0 if h_selec==1
replace groupB=0 if h_selec==9

generate groupC=1 if zona==0
replace groupC=1 if cla_tam==1
replace groupC=1 if h_selec==0
replace groupC=0 if zona==1
replace groupC=0 if cla_tam==2
replace groupC=0 if cla_tam==3
replace groupC=0 if h_selec==1
replace groupC=0 if h_selec==9

when I run the tab command for all groups, Stata tells me that:

. tab groupA

     groupA |      Freq.     Percent        Cum.
------------+-----------------------------------
          0 |     45,316       58.47       58.47
          1 |     32,187       41.53      100.00
------------+-----------------------------------
      Total |     77,503      100.00

. tab groupB

     groupB |      Freq.     Percent        Cum.
------------+-----------------------------------
          0 |     63,277       81.64       81.64
          1 |     14,226       18.36      100.00
------------+-----------------------------------
      Total |     77,503      100.00


. tab groupC

     groupC |      Freq.     Percent        Cum.
------------+-----------------------------------
          0 |     56,780       73.26       73.26
          1 |     20,723       26.74      100.00
------------+-----------------------------------
      Total |     77,503      100.00


However, when I run a logistic regression on propensity score matching between groups A and B, Stata tells me that I only have 7099 observations. 

Furthermore, when I wrote the code for keep if, the programme says "63227 observations deleted".

Here is what I tried to do in estimating a logit propensity score and match (nearest neighbor) between groups A & B:

preserve
keep if groupA==1 & groupB==1 
logit treat floor fmiss wall hhinc2....
predict double ps1
psmatch2 treat, outcome (S06A20) pscore (ps1) caliper(0.2) common logit
restore

The objective is to estimate a logit/propensity score and match observations from group A with group B. Any help on this matter would be very much appreciated.

Many thanks,
Isobel Williams 		 	   		  
*
*   For searches and help try:
*   http://www.stata.com/help.cgi?search
*   http://www.stata.com/support/faqs/resources/statalist-faq/
*   http://www.ats.ucla.edu/stat/stata/


© Copyright 1996–2018 StataCorp LLC   |   Terms of use   |   Privacy   |   Contact us   |   Site index