Bookmark and Share

Notice: On March 31, it was announced that Statalist is moving from an email list to a forum. The old list will shut down on April 23, and its replacement, statalist.org is already up and running.


[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: st: drawnorm for discrete variables


From   Tyler Frazier <tyfrazier@gmail.com>
To   statalist@hsphsun2.harvard.edu
Subject   Re: st: drawnorm for discrete variables
Date   Fri, 30 Apr 2010 15:57:56 +0200

Hi, thank you for your response and observations.

I am looking for a relatively simple answer, which I plan to improve
upon in time.

The data will be used in an urban simulation which incorporates a
number of location choice models, regression, and other simple models.
 For now I would like to establish a method for synthetically
generating the data, for use in the larger spatial, framework.  At
this point, I am seeking a plausible solution for running the urban
simulation in order to observe its behavior.

I want to generate a population of approximately 200,000
persons/households. Ideally, the persons/households datasets for this
african city would have the following variables (see below).

As indicated, some of these variable are continuous (income), while
others are categorical (tribal affiliation) or binary (sex)

the survey itself is a fairly large sample, with about 3500 persons in
approximately 1100 households, for the applicable metropolitan area.
there is also a question of how to iteratively sample household and
person data

Any suggestions on the direction I should take are greatly appreciated.

Best regards,
Ty

. summarize

    Variable |       Obs        Mean    Std. Dev.       Min        Max
-------------+--------------------------------------------------------
    hhrelate |      3447    2.714244    1.887394          1         10
         sex |      3447    1.506818    .5000261          1          2
-------------+--------------------------------------------------------
         age |      3447    27.49956    17.14807          3         90
    ethnicit |      3291    22.25433    16.06378          1         90
    religion |      3445    5.190131    4.651516          1         96
    marstatu |      2808    3.746439    2.297089          1          6
    educatio |      3181    3.600126    3.494621          1         16
-------------+--------------------------------------------------------
      income |      3444     6046706    2.21e+07          0   5.76e+08
    occupati |      1382    561.4363    245.7125         11        933
    industry |      1376    5334.578    2261.297        111       9900
      sector |       757    6.015852    2.695727          1         10




On Fri, Apr 30, 2010 at 3:39 PM, Maarten buis <maartenbuis@yahoo.co.uk> wrote:
> --- On Fri, 30/4/10, Tyler Frazier wrote:
>> How to synthetically generate a population from a
>> sample where the variables are continuous, discrete
>> and binary?
>
> The answer can range from very simple to very hard
> (impossible), and it depends on what you want to
> use the sample for and on what kind of information
> you have to base your sampling on.
>
> -- Maarten
>
> --------------------------
> Maarten L. Buis
> Institut fuer Soziologie
> Universitaet Tuebingen
> Wilhelmstrasse 36
> 72074 Tuebingen
> Germany
>
> http://www.maartenbuis.nl
> --------------------------
>
>
>
>
> *
> *   For searches and help try:
> *   http://www.stata.com/help.cgi?search
> *   http://www.stata.com/support/statalist/faq
> *   http://www.ats.ucla.edu/stat/stata/
>

*
*   For searches and help try:
*   http://www.stata.com/help.cgi?search
*   http://www.stata.com/support/statalist/faq
*   http://www.ats.ucla.edu/stat/stata/


© Copyright 1996–2014 StataCorp LP   |   Terms of use   |   Privacy   |   Contact us   |   Site index