# Re: st: R: Estimating the number of workers in each industry in each district - flag: Stata 9/2 SE

From: Steve Samuels
To: statalist@hsphsun2.harvard.edu
Subject: Re: st: R: Estimating the number of workers in each industry in each district - flag: Stata 9/2 SE
Date: Wed, 15 Sep 2010 10:16:35 -0400

```
Arka-

following question..

1. Were all districts in the target population included in the sample?
Or, were districts sampled?

2. Are the final sampling weights the probability sampling weights? Or
was there adjustment to the probability weights (post-stratification,
"raking") so that the sample results will better reflect population
census proportions? If the weights are so adjusted, are the original
sampling weights available to you?

Steve

Steven J. Samuels
sjsamuels@gmail.com
18 Cantine's Island
Saugerties NY 12477
USA
Voice: 845-246-0774
Fax:    206-202-4783

On Wed, Sep 15, 2010 at 4:07 AM, Carlo Lazzaro <carlo.lazzaro@tin.it> wrote:
> Arka wrote:
> "Now I want to estimate the number of workers
> belonging to each industry in a particular district"
>
> A quite trivial example of Arka's issue might be the following (setting
> aside survey technicalities):
>
> ---------------------code begins------------------------------------
> drop _all
> set obs 100
> g Workers=_n
> g District="East" in 1/50
> replace District="West" in 51/100
> g Industry="Concrete" in 1/30
> replace  Industry="Steel" in 31/100
> g A= 1 if  District=="East" &  Industry=="Steel"
> g B= 1 if  District=="West" &  Industry=="Steel"
> g C= 1 if  District=="East" &  Industry=="Concrete"
> ---------------------code ends------------------------------------
>
> HTH and Kind Regards,
> Carlo
> -----Original Message-----
> From: owner-statalist@hsphsun2.harvard.edu
> [mailto:owner-statalist@hsphsun2.harvard.edu] On Behalf Of Arka Roy
> Chaudhuri
> Sent: Wednesday, 15 September 2010 9:24
> To: statalist@hsphsun2.harvard.edu
> Subject: st: Estimating the number of workers in each industry in each
> district
>
> Dear All,
>        I have a data set which has information at the individual
> level. I have variables which record the district of residence of the
> individual, the industry of employment of the individual and other
> demographic characteristics. The data set also comes with weights which
> represent the probability that a particular household is included in
> the sample. Thus all individuals belonging to a particular household
> get the same weight. Now I want to estimate the number of workers
> belonging to each industry in a particular district. Could anyone
> please advise on the correct Stata code that I should write to get my
> desired estimates? Also I would be grateful if somebody could advise me
> on the possible biases that might affect my estimates at the
> industry-district level. I would really appreciate any help in this
> regard. Thanks
>
> Regards,
> Arka
> --
> Arka Roy Chaudhuri
> PhD Student
> University of British Columbia
> 997-1873 East Mall
> Vancouver
> Ph: +1 (604) 349-8283
> Email: gabuisi@gmail.com

```
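For readers following the thread, here is a minimal sketch of how the weighted district-by-industry counts Arka asks about might be obtained with Stata's survey commands. It is not taken from any of the posts above. Every variable name (`hhid`, `district`, `industry`, `hhprob`, `wt`, `worker`) and every value is hypothetical, and the design is assumed to be a single-stage sample of households with no stratification, which is exactly what Steve's questions are meant to establish. It also assumes that the supplied weight variable holds the household inclusion probability, so the sampling weight is its inverse, and that district and industry are stored as numeric codes.

```stata
* Toy data in the spirit of Carlo's example; all names and values are hypothetical.
clear
set obs 100
gen hhid     = ceil(_n/2)                       // two individuals per household
gen district = cond(_n <= 50, 1, 2)             // 1 = East, 2 = West
gen industry = cond(_n <= 30, 1, 2)             // 1 = Concrete, 2 = Steel
gen hhprob   = cond(district == 1, 0.01, 0.02)  // assumed household inclusion probability
gen wt       = 1/hhprob                         // sampling weight = inverse of that probability
gen worker   = 1                                // indicator whose total is the number of workers

* Assumed design: households are the primary sampling units, no strata.
svyset hhid [pweight = wt]

* Estimated number of workers in each district-by-industry cell, with standard errors
svy: tabulate district industry, count se format(%12.0fc)

* The same estimates expressed as domain totals
svy: total worker, over(district industry)
```

If the real design has strata or sampled districts, those would need to be declared in `svyset` before the standard errors can be trusted. Cells with few sampled households will have imprecise estimates whatever the design.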