Notice: On April 23, 2014, Statalist moved from an email list to a forum, based at statalist.org.
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
st: RE: collapsing rows and generating new smaller dataset
From 
 
Nick Cox <[email protected]> 
To 
 
"'[email protected]'" <[email protected]> 
Subject 
 
st: RE: collapsing rows and generating new smaller dataset 
Date 
 
Fri, 8 Jun 2012 11:12:01 +0100 
Assuming that age groups 40-44 ... 75-79 are 1 ... 8 underneath, then you need a new grouping variable that is 
gen new_age_group = cond(age_grp <= 2, 1, cond(age_grp <= 6, 2, 3)) 
or you can do something similar with -recode-. 
The assumption won't be correct, but the principle will be the same. That is, -collapse- needs to be fed a grouping variable created in advance; I don't think there is an easier way to do it. But the difficult ways to do it are all one line long. 
Nick 
[email protected] 
Tim Evans
I have a dataset in Stata 11.2 that contains the following:
age_grp	f_pop	m_pop	tot_pop
40-44		7000	7000	14000
45-49		7000	7000	14000
50-54		7000	7000	14000
55-59		6000	6000	12000
60-64		5000	5000	10000
65-69		4000	4000	8000
70-74		3000	3000	6000
75-79		2000	2000	4000
What I want to do is collapse this down so that I have the following three categories which have the sum of each level of age-grp:
age_grp2	f_pop m_pop tot_pop
40-49
50-69
70-79
What I would like to know is how I can collapse the above, but sum the populations of each variable.
I.e 40-49 for f_pop will be the sum of 40-44 and 45-49 and so on.
*
*   For searches and help try:
*   http://www.stata.com/help.cgi?search
*   http://www.stata.com/support/statalist/faq
*   http://www.ats.ucla.edu/stat/stata/