Statalist


[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: st: Create a flag variable for 10 most frequent values


From   Jeph Herrin <[email protected]>
To   [email protected]
Subject   Re: st: Create a flag variable for 10 most frequent values
Date   Tue, 17 Nov 2009 09:54:49 -0500

how about

 bys dx  : gen num=_N
 bys num : gen tmp= num if _n==1
 gsort -tmp
 gen top10num=tmp[10]
 gen byte top10=num>=top10num
 drop tmp top10num

?

J




Cohen, Elan wrote:
Hi all,

I have a string variable dx that represents a patient's diagnosis
(about 5,000 unique values).  I'd like to create a "top 10 flag" that
equals 1 if dx is one of the top 10 most frequent diagnoses and 0
otherwise.

I'm not even sure where to begin.  If someone could point me in the
right direction, I'd be grateful.  Stata 10, Windows XP

Thank you,

- Elan

* *   For searches and help try: *
http://www.stata.com/help.cgi?search *
http://www.stata.com/support/statalist/faq *
http://www.ats.ucla.edu/stat/stata/

*
*   For searches and help try:
*   http://www.stata.com/help.cgi?search
*   http://www.stata.com/support/statalist/faq
*   http://www.ats.ucla.edu/stat/stata/



© Copyright 1996–2024 StataCorp LLC   |   Terms of use   |   Privacy   |   Contact us   |   What's new   |   Site index