# st: How to extract matched sample in psmatch2

| From | To | Subject | Date |
|---|---|---|---|
| "Wald,Kenneth D" | "statalist@hsphsun2.harvard.edu" | st: How to extract matched sample in psmatch2 | Sun, 30 Oct 2011 21:02:58 +0000 |

```I'm using psmatch2 to generate a comparison sample of one group of survey respondents with another, defined by religious affiliation. I've run a basic analysis but can't figure out how the program gets to the next step--creating the matched sample so I can confirm that the covariates are indeed balanced. For the moment, I don't want to run multivariate analysis or use weighting. I just want to examine the values of certain other variables in the two groups, the base and the matched subsample. I've read the documentation and Guo and Fraser, but nothing walks me through the program. So I'd be grateful if somebody could recommend something that really explains the Stata routine to do that.

Here are the command I've used and its output. The program generates pscores (_pscore). I did randomize the order of cases prior to estimating the equation, as advised. Two other questions, if you're inclined. With a dichotomous grouping variable, why did Stata use probit instead of logit, which I thought was the default? And, given the distinctive geographical distribution of the base group, I used two location variables--state and region. They are nominal variables and so don't really belong in this kind of model. (I also need the year variable so the sample is balanced on that as well.)

Thanks.

psmatch2 jewish citysize nielsen employ year race3 region9 sex state agematch faminc educ3, caliper(11.8)

Probit regression                                 Number of obs   =      91143
                                                  LR chi2(11)     =    3863.45
                                                  Prob > chi2     =     0.0000
Log likelihood =  -8533.966                       Pseudo R2       =     0.1846

------------------------------------------------------------------------------
      jewish |      Coef.   Std. Err.      z    P>|z|     [95% Conf. Interval]
-------------+----------------------------------------------------------------
    citysize |   .1399825   .0074226    18.86   0.000     .1254345    .1545305
     nielsen |     .26099    .023412    11.15   0.000     .2151035    .3068766
      employ |  -.0502017   .0233954    -2.15   0.032    -.0960559   -.0043475
        year |  -.0005044   .0023767    -0.21   0.832    -.0051626    .0041538
       race3 |   1.132129    .069947    16.19   0.000     .9950352    1.269222
     region9 |  -.0962822   .0044346   -21.71   0.000    -.1049738   -.0875906
         sex |   .0162921   .0209977     0.78   0.438    -.0248626    .0574468
       state |  -.0041037    .000857    -4.79   0.000    -.0057833    -.002424
    agematch |   .0741419   .0133769     5.54   0.000     .0479236    .1003602
      faminc |   .0929483   .0111876     8.31   0.000      .071021    .1148755
       educ3 |   .3533498   .0197301    17.91   0.000     .3146796      .39202
       _cons |  -4.425644   4.717873    -0.94   0.348     -13.6725    4.821217
------------------------------------------------------------------------------
There are observations with identical propensity score values.
The sort order of the data could affect your results.
Make sure that the sort order is random before calling psmatch2.

