# RE: AW: st: AW: generating count and sum variable over two different categorical variables

 From "Martin Weiss" <[email protected]> To <[email protected]> Subject RE: AW: st: AW: generating count and sum variable over two different categorical variables Date Mon, 11 Jan 2010 20:56:02 +0100

```<>

Nick seems to understand your intentions, I honestly do not. That could be
entirely my problem. Anyway, could you provide the number that you want to
come out of the calculation in your example?

HTH
Martin

-----Original Message-----
From: [email protected]
[mailto:[email protected]] On Behalf Of moleps
Sent: Montag, 11. Januar 2010 20:32
To: [email protected]
Subject: Re: AW: st: AW: generating count and sum variable over two
different categorical variables

Thx, Martin. However I need to sum the total number of people in each region
comprised of from 3-10 different districts. Using

> by region yr: egen totpop=total(distr_pop)

entails summing the district_population the same number of times as the
number of cases. What I want is to sum the different districts for each year
for a total regional population.

+--------------------------------------------------------------+
| pid   distr_~p   district   region     yr   number    totpop |
|--------------------------------------------------------------|
| 221     440674          3        1   1953        7   2802725 |
| 684     440674          3        1   1953        7   2802725 |
| 574     158681          6        1   1953        7   2802725 |
| 770     440674          3        1   1953        7   2802725 |
| 869     440674          3        1   1953        7   2802725 |
|--------------------------------------------------------------|
| 454     440674          3        1   1953        7   2802725 |
| 497     440674          3        1   1953        7   2802725 |
| 790     444041          3        1   1954        1    444041 |
| 802     112982         13        2   1954        1    112982 |
| 767     227937         18        4   1954        1    227937 |
|--------------------------------------------------------------|
|   .     139172          8        .   1953        0    139172 |
+--------------------------------------------------------------+

//M

On 11. jan. 2010, at 17.24, Martin Weiss wrote:

>
> <>
>
>
>
> *************
> clear*
>
> input   pid   distr_pop   district   region    yr
> 221     440674          3          1   1953
> 869     440674          3          1   1953
>   .     139172          8          .   1953
> 497     440674          3          1   1953
> 684     440674          3          1   1953
> 574     158681          6          1   1953
> 770     440674          3          1   1953
> 454     440674          3          1   1953
> 767     227937         18          4   1954
> 802     112982         13          2   1954
> 790     444041          3          1   1954
> end
>
> compress
>
> bys region yr: egen number=count(pid)
> by region yr: egen totpop=total(distr_pop)
>
> li, noo
> *************
>
>
>
> HTH
> Martin
>
>
> -----Ursprüngliche Nachricht-----
> Von: [email protected]
> [mailto:[email protected]] Im Auftrag von moleps
> Gesendet: Montag, 11. Januar 2010 17:19
> An: [email protected]
> Betreff: Re: st: AW: generating count and sum variable over two different
> categorical variables
>
> Sorry...
>
>
> . list pid distr_pop district region yr in 50/60
>     | pid   distr_pop   district   region    yr |
>     |---------------------------------------------|
> 50. | 221     440674          3          1   1953 |
> 51. | 869     440674          3          1   1953 |
> 52. |   .     139172          8          .   1953 |
> 53. | 497     440674          3          1   1953 |
> 54. | 684     440674          3          1   1953 |
>     |---------------------------------------------|
> 55. | 574     158681          6          1   1953 |
> 56. | 770     440674          3          1   1953 |
> 57. | 454     440674          3          1   1953 |
> 58. | 767     227937         18          4   1954 |
> 59. | 802     112982         13          2   1954 |
>     |---------------------------------------------|
> 60. | 790     444041          3          1   1954 |
>     +---------------------------------------------+
>
> So what I need to do is generate a variable counting the number of cases
> from each region for each year and also a variable containing the sum of
the
> population for each region for each year. There are between 3 and 10
> districts in each region.
>
> Any idea or do I have to program it from scratch?
>
> Regards,
> M
>
>
>
>
>
> On 11. jan. 2010, at 17.06, Martin Weiss wrote:
>
>>
>> <>
>>
>>
>>
>> As always: Show an excerpt of your data!
>>
>>
>>
>> HTH
>> Martin
>>
>>
>> -----Ursprüngliche Nachricht-----
>> Von: [email protected]
>> [mailto:[email protected]] Im Auftrag von moleps
>> Gesendet: Montag, 11. Januar 2010 16:58
>> An: [email protected]
>> Betreff: st: generating count and sum variable over two different
>> categorical variables
>>
>> Dear listers,
>> I´m doing a survival analysis, but also need to present some graphs on a
>> regional level. In other words my data is setup on an individual level
> with
>> categorical variables for year and region. What I need to do is generate
a
>> count variable for the cases counted over year AND region. Using the -
> egen
>> total- command I´m only able to sum over either year or region. Not both
> as
>> far as I understand. Also is there a way to sum over two categories, not
>> just one.
>>
>> ie my data is setup according to
>>
>> case year region
>>
>> and i´d like it setup acording to
>>
>> case year region #cases/year/region
>>
>>
>> Regards,
>> M
>>
>>
>> *
>> *   For searches and help try:
>> *   http://www.stata.com/help.cgi?search
>> *   http://www.stata.com/support/statalist/faq
>> *   http://www.ats.ucla.edu/stat/stata/
>>
>>
>> *
>> *   For searches and help try:
>> *   http://www.stata.com/help.cgi?search
>> *   http://www.stata.com/support/statalist/faq
>> *   http://www.ats.ucla.edu/stat/stata/
>
>
> *
> *   For searches and help try:
> *   http://www.stata.com/help.cgi?search
> *   http://www.stata.com/support/statalist/faq
> *   http://www.ats.ucla.edu/stat/stata/
>
>
> *
> *   For searches and help try:
> *   http://www.stata.com/help.cgi?search
> *   http://www.stata.com/support/statalist/faq
> *   http://www.ats.ucla.edu/stat/stata/

*
*   For searches and help try:
*   http://www.stata.com/help.cgi?search
*   http://www.stata.com/support/statalist/faq
*   http://www.ats.ucla.edu/stat/stata/

*
*   For searches and help try:
*   http://www.stata.com/help.cgi?search
*   http://www.stata.com/support/statalist/faq
*   http://www.ats.ucla.edu/stat/stata/
```

• Follow-Ups:
• References: