Notice: On April 23, 2014, Statalist moved from an email list to a forum, based at statalist.org.
From | "Marie-Luise Schmitz" <querida-ise@gmx.de> |
To | statalist@hsphsun2.harvard.edu |
Subject | st: sum over variables for determinate observations |
Date | Sun, 26 Jan 2014 16:36:07 +0100 (CET) |
Dear Stata Users, I have a data set that looks like this: province_name province_code_107 license_number ateco_section ateco_section2002 numero_contribuenti... AGRIGENTO 84 AG A A 100 AGRIGENTO 84 AG A B 50 AGRIGENTO 84 AG B C 12 AGRIGENTO 84 AG C D 79 AGRIGENTO 84 AG O P 34 AGRIGENTO 84 AG P Q 0 AGRIGENTO 84 AG Z Z 1 ALESSANDRIA 6 AL A A 29 ALESSANDRIA 6 AL A B 12 ALESSANDRIA 6 AL B C 0 ALESSANDRIA 6 AL C D 5 It contains numerous numeric variables following the variable numero_contribuenti. The variable ateco_section is a redefined version of the variable ateco_section2002 and shows sectors of economic activity. For instance, A = agriculture, B = fishery, etc. In the redefined variable ateco_section, sectors A and B are summarzied by A. However, the problem is that I want only one entry for sector A for each province that is, for numeric variables as numero_contribuenti I want the sum of previous A and B, hence: province_name province_code_107 license_number ateco_section numero_contribuenti ......... AGRIGENTO 84 AG A 150 AGRIGENTO 84 AG B 12 I want to apply that to each province. I guess this problem may be solved with collapse (sum) but I am totally lost. Any help is highly appreciated. Marie-Luise * * For searches and help try: * http://www.stata.com/help.cgi?search * http://www.stata.com/support/faqs/resources/statalist-faq/ * http://www.ats.ucla.edu/stat/stata/