Dear Stas,

Thank you. This is the request of a reviewer. Would you recommend that
I simply choose a random sample?

On Tue, May 1, 2012 at 3:13 AM, Stas Kolenikov <skolenik@gmail.com> wrote:
> So why exactly do you want to do this? You will only lose in
> precision, provided your model is OK; if it is badly misspecified,
> then God only knows how your coefficients could jump around, so you
> probably should not trust either specification, anyway.
>
> On Mon, Apr 30, 2012 at 6:44 PM, Andrea Rispoli <andrea.rspl@gmail.com> wrote:
>> Dear Statalisters,
>> I am running a regression model: y=f(x, age, size) where x is a dummy
>> variable that can take value 1 or 0.
>> Since in my sample x=1 for 10% of the sample and x=0 for 90% of the
>> sample, I would like to identify a random subsample among the group
>> x=0 so that it is more "comparable" in terms of size with the
>> subsample for which x=1.
>>
>> My problem is that I would like the selected subsample (in which
>> x=0) to match the characteristics of the first subsample (x=1) on the
>> other dimensions (e.g., age and size).
>> For instance, if I take the subsample x=1, mean of age = 37, mean of size = 45.
>> I would like to randomly select the second subsample (x=0), so that
>> mean of age = 37, mean of size = 45, as is the case in the first
>> subsample (x=1).
>>
>> Do you have any suggestions on how I could achieve such a result in Stata?
>>
>> Thank you very much in advance for all your help!!!
>> Kind Regards
>
>
>
> --
> Stas Kolenikov, also found at http://stas.kolenikov.name
> Small print: I use this email account for mailing lists only.
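For reference, the plain random-subsample option asked about above could be sketched in Stata roughly as follows. This is only a sketch under stated assumptions, not a recommendation from the thread: the variable names y, x, age and size follow the original post, the data are simulated purely so the example runs, and the helper names u and insample and the seed are hypothetical. It draws as many x=0 observations at random as there are x=1 observations; it does not match on age or size.

```stata
* Illustrative, simulated data only (hypothetical); replace with your own dataset.
clear
set seed 20120501
set obs 1000
generate byte x = runiform() < 0.10          // roughly 10% of cases have x = 1
generate age  = 37 + 5*rnormal()
generate size = 45 + 10*rnormal()
generate y    = 1 + 0.5*x + 0.02*age + 0.01*size + rnormal()

* Count the x = 1 observations.
count if x == 1
local n1 = r(N)

* Put observations in random order within each level of x.
generate double u = runiform()
sort x u

* Keep every x = 1 case plus the first n1 (randomly ordered) x = 0 cases.
by x: generate byte insample = (x == 1) | (_n <= `n1')

* Compare the full-sample fit with the subsample fit.
regress y x age size
regress y x age size if insample
```

Comparing the two sets of standard errors shows the loss of precision mentioned in the reply: the subsample regression discards most of the x=0 observations.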

