Notice: On April 23, 2014, Statalist moved from an email list to a forum, based at statalist.org.

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

From |
Ophelie Desmarais <ophelie.desmarais@googlemail.com> |

To |
statalist@hsphsun2.harvard.edu |

Subject |
Re: st: RE: correct egen command to calculate means - please help |

Date |
Wed, 18 May 2011 19:55:36 +0200 |

Hi Nick, Thanks a lot for the reply. Sorry for not providing more information: Let's say I have this dataset: UniqueID ID ParentID Value 1 1 10 2 1 20 3 2 1 5 4 2 2 5 Basically, I want to add another variable (say "mean_value") that calculates the mean for (10+20)/2 and assigns the value to UniqueID=3 (since UniqueID=3's ParentID is 1 and this links to UniqueID=1 and UniqueID=2's ID variable). For UniqueID=4, similarly, the mean_value would be = 5 (the average value of the observations where ID=ParentID=3). egen mean_value=mean(Value), by(ParentID) would give me 5 and 5 for UniqueID=3 and UniqueID=4, which is not what I try to get. Again, thanks for your kind answer. o. On 18 May 2011 19:44, Nick Cox <n.j.cox@durham.ac.uk> wrote: > I don't understand your pseudocode. It looks like > > Y = mean(X) where A = B > > to me. > > Nor is it "obvious" that > > egen MeanRevenueParent=mean(Revenue), by(ParentIndustry) > > is wrong. From your word description it sounds exactly right to me! > > Please give a worked example of what you want for a minimal dataset. > > Note that explanations that assume proficiency in Excel _and_ Stata catch a much smaller set of people than explanations that assume proficiency in Stata. > > (Can we assume that income == revenue?) > > Nick > n.j.cox@durham.ac.uk > > Ophelie Desmarais > > I have tried to find an answer for this on the web but to no avail. > > Simplified, I have following dataset where the unit of observation is > the industry (IndustryID) with the respective revenue. > > IndustryID, ParentIndustryID, Revenue > > > The ParentIndustryID links to IndustryID and reflects the hierachical > structure of the industries. For each industry, I want to add a > variable reflecting the mean income of the ParentIndustryID. How do I > do this in Stata? In Excel, this would be simply using AVERAGEIFS. > egen MeanRevenueParent=mean(Revenue), by(ParentIndustry) obviously > does not give the right mean. In terms of pseudo code, this should be > something like: > > MeanRevenueParent = Mean(Revenue) WHERE ParentIndustry=IndustryID > > Could anyone help me out? I don't want to switch around with Excel all the time. > > > * > * For searches and help try: > * http://www.stata.com/help.cgi?search > * http://www.stata.com/support/statalist/faq > * http://www.ats.ucla.edu/stat/stata/ > * * For searches and help try: * http://www.stata.com/help.cgi?search * http://www.stata.com/support/statalist/faq * http://www.ats.ucla.edu/stat/stata/

**Follow-Ups**:**RE: st: RE: correct egen command to calculate means - please help***From:*Nick Cox <n.j.cox@durham.ac.uk>

**References**:**st: correct egen command to calculate means - please help***From:*Ophelie Desmarais <ophelie.desmarais@googlemail.com>

**st: RE: correct egen command to calculate means - please help***From:*Nick Cox <n.j.cox@durham.ac.uk>

- Prev by Date:
**st: RE: correct egen command to calculate means - please help** - Next by Date:
**RE: st: RE: using loop to generate distributions with different means and standard deviations** - Previous by thread:
**st: RE: correct egen command to calculate means - please help** - Next by thread:
**RE: st: RE: correct egen command to calculate means - please help** - Index(es):