Notice: On March 31, it was **announced** that Statalist is moving from an email list to a **forum**. The old list will shut down on April 23, and its replacement, **statalist.org** is already up and running.

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

From |
Andrea Rispoli <andrea.rspl@gmail.com> |

To |
statalist@hsphsun2.harvard.edu |

Subject |
st: random sampling matching the characteristics of the sample |

Date |
Tue, 1 May 2012 00:44:57 +0100 |

Dear Statalisters, I am running a regression model: y=f(x, age, size) where x is a dummy variable that can take value 1 or 0. Since in my sample x=1 for 10% of the sample and x=0 for 90% of the sample, I would like to identify a random subsample among the group x=0 so that it is more "comparable" in terms of size with the subsample for which x=1. My problem is that I would like that the selected subsample (in which x=0) matched the characteristics of the first subsample (x=1) on the other dimensions (e,g age and size). For instance, if I take the subsample x=1, mean of age = 37, mean of size=45. I would like to randomly select the second subsample (x=0), so that mean of age = 37, mean of size=45 as it is the case in the first subsample (x=1). Do you have any suggestions on how I could achieve such result in stata? Thank you very much in advance for all your help!!! Kind Regards * * For searches and help try: * http://www.stata.com/help.cgi?search * http://www.stata.com/support/statalist/faq * http://www.ats.ucla.edu/stat/stata/

**Follow-Ups**:**Re: st: random sampling matching the characteristics of the sample***From:*Stas Kolenikov <skolenik@gmail.com>

- Prev by Date:
**st: cmp and condition numbers** - Next by Date:
**st: RE: Correlation Coefficients** - Previous by thread:
**st: cmp and condition numbers** - Next by thread:
**Re: st: random sampling matching the characteristics of the sample** - Index(es):