Notice: On April 23, 2014, Statalist moved from an email list to a forum, based at statalist.org.

# Re: st: drawnorm for discrete variables

| Field | Value |
|---|---|
| From | Tyler Frazier |
| To | statalist@hsphsun2.harvard.edu |
| Subject | Re: st: drawnorm for discrete variables |
| Date | Fri, 30 Apr 2010 15:57:56 +0200 |

```
Hi, thank you for your response and observations.

I am looking for a relatively simple answer, which I plan to improve
upon in time.

The data will be used in an urban simulation which incorporates a
number of location choice models, regression, and other simple models.
For now, I would like to establish a method for synthetically
generating the data for use in the larger spatial framework. At
this point, I am seeking a plausible solution for running the urban
simulation in order to observe its behavior.

I want to generate a population of approximately 200,000
persons/households. Ideally, the persons/households datasets for this
African city would have the following variables (see below).

As indicated, some of these variables are continuous (income), while
others are categorical (tribal affiliation) or binary (sex).

The survey itself is a fairly large sample, with about 3500 persons in
approximately 1100 households, for the applicable metropolitan area.
There is also a question of how to iteratively sample household and
person data.

Any suggestions on the direction I should take are greatly appreciated.

Best regards,
Ty

. summarize

    Variable |       Obs        Mean    Std. Dev.       Min        Max
-------------+--------------------------------------------------------
    hhrelate |      3447    2.714244    1.887394          1         10
         sex |      3447    1.506818    .5000261          1          2
-------------+--------------------------------------------------------
         age |      3447    27.49956    17.14807          3         90
    ethnicit |      3291    22.25433    16.06378          1         90
    religion |      3445    5.190131    4.651516          1         96
    marstatu |      2808    3.746439    2.297089          1          6
    educatio |      3181    3.600126    3.494621          1         16
-------------+--------------------------------------------------------
      income |      3444     6046706    2.21e+07          0   5.76e+08
    occupati |      1382    561.4363    245.7125         11        933
    industry |      1376    5334.578    2261.297        111       9900
      sector |       757    6.015852    2.695727          1         10

On Fri, Apr 30, 2010 at 3:39 PM, Maarten buis <maartenbuis@yahoo.co.uk> wrote:
> --- On Fri, 30/4/10, Tyler Frazier wrote:
>> How to synthetically generate a population from a
>> sample where the variables are continuous, discrete
>> and binary?
>
> The answer can range from very simple to very hard
> (impossible), and it depends on what you want to
> use the sample for and on what kind of information
> you have to base your sampling on.
>
> -- Maarten
>
> --------------------------
> Maarten L. Buis
> Institut fuer Soziologie
> Universitaet Tuebingen
> Wilhelmstrasse 36
> 72074 Tuebingen
> Germany
>
> http://www.maartenbuis.nl
> --------------------------
>
> *
> *   For searches and help try:
> *   http://www.stata.com/help.cgi?search
> *   http://www.stata.com/support/statalist/faq
> *   http://www.ats.ucla.edu/stat/stata/
>

```
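One simple starting point for the question above, offered only as a sketch and not as a method proposed in this thread, is to resample whole households with replacement until the synthetic population reaches the target size. Because complete survey records are copied, categorical, binary and continuous variables keep their observed joint distribution without any normality assumption, and household members stay together. The example below is in Python for illustration; the household identifier `hhid`, the function name `synthesize_population`, the added `source_hhid` field and the toy records are all assumptions, since the summary above does not show a household identifier.

```python
import random
from collections import defaultdict


def synthesize_population(persons, target_persons, hh_key="hhid", seed=12345):
    """Resample whole households with replacement until the synthetic
    population holds at least `target_persons` person records.

    `persons` is a list of dicts, one per surveyed person. `hh_key` names
    the household identifier (hypothetical: not shown in the summary).
    """
    rng = random.Random(seed)

    # Group survey records by household so members are always drawn together.
    households = defaultdict(list)
    for person in persons:
        households[person[hh_key]].append(person)
    hh_ids = sorted(households)

    synthetic = []
    new_hhid = 0
    while len(synthetic) < target_persons:
        new_hhid += 1
        source = rng.choice(hh_ids)  # every household is equally likely
        for person in households[source]:
            clone = dict(person)           # copy, so the survey is not modified
            clone[hh_key] = new_hhid       # fresh id for the synthetic household
            clone["source_hhid"] = source  # remember the donor household
            synthetic.append(clone)
    return synthetic


# Toy survey with made-up values, only to show the shape of the data.
survey = [
    {"hhid": 1, "sex": 1, "age": 40, "income": 5000000},
    {"hhid": 1, "sex": 2, "age": 38, "income": 0},
    {"hhid": 2, "sex": 2, "age": 25, "income": 7000000},
]
population = synthesize_population(survey, target_persons=10)
```

With the real survey, `target_persons` would be about 200,000. The draw here gives every household the same probability; if the survey carries sampling weights, a weighted draw would be more appropriate. Pure resampling also never produces a person or household that was not observed in the survey, which may or may not be acceptable for the simulation.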