From: Paul Novosad <novosad@gmail.com>
To: statalist@hsphsun2.harvard.edu
Subject: st: Using egen and by efficiently when some observations are missing
Date: Thu, 22 Apr 2010 11:24:48 -0400

Dear list,

It often takes me three lines to generate variables based on conditional group operations using egen. For example, I want to run some egen operation on a subset of the data, such as a count. But I want the count to exist even when the condition does not hold. I use the following:

    by country: egen tmp = count(i) if condition == 1
    by country: egen count_i = max(tmp)
    drop tmp

I write code like this all over the place, and each time it makes my heart sink. It feels inefficient but I do not have another solution. Can someone recommend a more efficient practice?

Thanks,
Paul
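
For illustration only, here is the pattern described above run on a small made-up dataset, next to one possible single-line form. This is a sketch, not part of the original post. The data values and the name count_alt are hypothetical, and bysort is swapped in for by so the example runs on unsorted data. The single-line form relies on division by zero producing a missing value in Stata, so count() only sees observations where the condition holds. One difference to be aware of: for a group where the condition never holds, the three-line version should leave count_i missing (max() of all-missing values), while the single-line version should give 0.

```stata
* Toy data: variable names follow the post; the values are made up.
clear
input str2 country i condition
"A" 1 1
"A" 2 0
"A" . 1
"B" 5 0
"B" 6 0
end

* The three-line pattern from the post.
* (bysort is an assumption here, used so the sketch works on unsorted data.)
bysort country: egen tmp = count(i) if condition == 1
by country: egen count_i = max(tmp)
drop tmp

* Hypothetical single-line alternative: i / (condition == 1) evaluates to
* missing wherever the condition is false, so count() skips those
* observations, yet the result is filled in for every row of the group.
bysort country: egen count_alt = count(i / (condition == 1))

list, sepby(country)
```

Whether the 0-versus-missing difference matters depends on how groups with no qualifying observations should be treated downstream.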