st: RE: creating variables using 'by' for subsets of records

From   "Martin Weiss"
Subject   st: RE: creating variables using 'by' for subsets of records
Date   Tue, 23 Feb 2010 21:19:49 +0100


Without example data it is hard to give specific advice, but this
calculation of regional unemployment rates on simulated data shows the
general pattern:


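clear

// 10 regions
set obs 10
gen byte region=_n

// 50 individuals per region
expand 50
bys region: gen byte id=_n
// indicator: 1 for roughly 10% of individuals
gen byte unemployed=runiform()>.9

// region size and number of unemployed, repeated on every record
bys region: gen number=_N
by region: egen numofunempl=total(unemployed)

// ratio of indicator count to head count
gen unemprate=numofunempl/number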
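
A shorter route, sketched under the same simulated setup: the mean of a 0/1
indicator within a region is already the ratio of the indicator count to the
head count, and -collapse- reduces the data to one record per region, which
seems to be what the question is after. The seed value and the variable names
unemprate and number are assumptions for illustration only.

```stata
clear
set seed 12345              // arbitrary seed, only for reproducibility

// simulated data: 10 regions with 50 individuals each
set obs 10
gen byte region = _n
expand 50
gen byte unemployed = runiform() > .9

// one record per region: mean of the 0/1 indicator = unemployment rate,
// count of nonmissing indicator values = number of individuals
collapse (mean) unemprate = unemployed (count) number = unemployed, by(region)

list, clean
```

Note that -collapse- replaces the data in memory, so -preserve- first (or
save the file) if the individual-level records are still needed.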


-----Original Message-----
On Behalf Of John Westbury
Sent: Tuesday, 23 February 2010 20:55
Subject: st: creating variables using 'by' for subsets of records


I have records for individuals by geographic region and wish to aggregate
them into one record per region.  I believe I should create the regional
variables using 'by', e.g. by Region: gen x = <expression>, but I am having
difficulty with the expression for x.  For example, I wish to create a
region-level variable that is the ratio of the count of individuals in a
region with a given indicator value to the count of all individuals in the
region, and I am unsure how to code this.


