From
Andrea Rispoli <andrea.rspl@gmail.com>

To
statalist@hsphsun2.harvard.edu

Subject
st: random sampling matching the characteristics of the sample

Date
Tue, 1 May 2012 00:44:57 +0100

Dear Statalisters,

I am running a regression model, y = f(x, age, size), where x is a dummy variable equal to 1 or 0. In my sample, x=1 for 10% of the observations and x=0 for the remaining 90%, so I would like to draw a random subsample from the x=0 group that is comparable in number of observations to the x=1 group.

My problem is that I would also like the selected x=0 subsample to match the characteristics of the x=1 subsample on the other dimensions (e.g., age and size). For instance, if in the x=1 subsample the mean of age is 37 and the mean of size is 45, I would like to randomly select the x=0 subsample so that its mean of age is also 37 and its mean of size is also 45.

Do you have any suggestions on how I could achieve this in Stata?

Thank you very much in advance for all your help!

Kind regards
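
To make the request concrete, here is a minimal sketch of one possible approach: stratified random sampling on coarsened age and size. It is not a definitive solution. It assumes the variables are named y, x, age, and size as in the model above. The number of bins (4), the seed, and the helper variables age_bin, size_bin, stratum, n_x1, u, draw, and insample are hypothetical choices made only for illustration.

```stata
* Sketch only: assumes y, x, age, and size are in memory.
* The seed and the number of bins (4) are arbitrary illustrative choices.
set seed 12345

* Coarsen age and size into quantile bins, then cross them into strata
egen age_bin  = cut(age),  group(4)
egen size_bin = cut(size), group(4)
egen stratum  = group(age_bin size_bin)

* Count the x=1 observations in each stratum
bysort stratum: egen n_x1 = total(x == 1)

* Put the observations of each stratum-by-x cell in random order
gen double u = runiform()
bysort stratum x (u): gen long draw = _n

* Keep every x=1 observation and, within each stratum,
* at most as many randomly chosen x=0 observations
gen byte insample = (x == 1) | (x == 0 & draw <= n_x1)

* Compare means across the two groups, then estimate on the subsample
tabstat age size if insample, by(x) statistics(mean n)
regress y x age size if insample
```

Under this scheme the means of age and size in the two groups would agree only approximately, not exactly, and a stratum with fewer x=0 than x=1 observations contributes all of its x=0 observations. Finer bins tighten the match on means but leave fewer x=0 candidates per stratum.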

