Bookmark and Share

Notice: On April 23, 2014, Statalist moved from an email list to a forum, based at statalist.org.


[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: st: Identify observations by occupation


From   Oliver Jones <[email protected]>
To   [email protected]
Subject   Re: st: Identify observations by occupation
Date   Tue, 17 Apr 2012 11:24:05 +0200

Hi Nick,

thanks for your help.

-total_je_beruf- should have been --total_occupation-
(I forgot to translate the variable name from German to English.)

Best
Oliver

Am 17.04.2012 11:05, schrieb Nick Cox:
You want largest to have rank 1, but need to take account of ties.
This is a bit shorter. I don't know what -total_je_beruf- is in your
code.

bysort occupation: egen total_occupation = total(n)
bysort total_occupation occupation : gen rank = _n == 1
replace rank = sum(rank)
replace rank = rank[_N] - rank + 1

bysort total_occupation occupation : gen rank = _n == 1

is also

egen rank = tag(total_occupation occupation)

Nick

On Tue, Apr 17, 2012 at 9:45 AM, Oliver Jones
<[email protected]>  wrote:

I have a dataset containing the number of employed people
for 180 cities and 300 occupations.

Now I want to generate a variable that shows the ranking
of the occupations by the total number of people working
in that occupation.

I have managed to create this variable. But I bet there is
a more elegant and easier way to do it and I hope some one
here can show me how or tell me the appropriate command(s).

Here is what I have done ...

The data looks like this:

city  |  occupation  |  n
-----------------------------
001   |  nurse       |  1222
002   |  nurse       |   345
.     |   .          |     .
.     |   .          |     .
.     |   .          |     .
180   |  nurse       |    94
.............................
001   |  lawyer      |   308
002   |  lawyer      |    55
.     |   .          |     .
.     |   .          |     .
.     |   .          |     .
180   |  lawyer      |    15
.............................
001   |  teacher     |   930
.     |   .          |     .
.     |   .          |     .
.     |   .          |     .


bysort occupation: egen total_occupation = total(n)
bysort total_occupation: gen index = (_n == 1)
sort index total_je_beruf
bysort index: gen top_occupation = _N + 1 - _n
replace top_occupation = . if index == 0
sort top_occupation

* List the biggest 10 occupations
list occupation total_occupation top_occupation if top_occupation<= 10

*
*   For searches and help try:
*   http://www.stata.com/help.cgi?search
*   http://www.stata.com/support/statalist/faq
*   http://www.ats.ucla.edu/stat/stata/
*
*   For searches and help try:
*   http://www.stata.com/help.cgi?search
*   http://www.stata.com/support/statalist/faq
*   http://www.ats.ucla.edu/stat/stata/


© Copyright 1996–2018 StataCorp LLC   |   Terms of use   |   Privacy   |   Contact us   |   Site index