
Notice: On April 23, 2014, Statalist moved from an email list to a forum, based at statalist.org.



re: st: data management question


From   "Airey, David C" <[email protected]>
To   "[email protected]" <[email protected]>
Subject   re: st: data management question
Date   Fri, 22 Apr 2011 18:31:52 -0500


I'm guessing a statement like,

bysort var1-var10 (year): egen mean = mean(var11) if inlist(year,2001,2002,2003)

should get the means for those years for each subset.

Now, how do we get the 2004 values replaced with those means for each subset?
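
One possible way to do that, offered as an untested sketch rather than a definitive answer: an -if- qualifier on -egen- restricts which rows receive the result, so the 2004 rows would not get the mean. Wrapping the variable in cond() inside mean() instead limits which years enter the mean while still filling every row of the group. The grouping variables (sch_id group category grade) and score names (rex mex sex) are assumed from the sample data quoted below, and the m_ prefix is a hypothetical name for a temporary helper variable.

```stata
* Sketch only: variable names are assumed from the quoted sample data,
* and m_ is a hypothetical prefix for a temporary helper variable.
foreach v of varlist rex mex sex {
    * cond() sets years outside 2001-2003 to missing, and egen's mean()
    * ignores missing values, so the 2001-2003 mean is stored in every
    * row of the group, including the 2004 row.
    bysort sch_id group category grade: egen m_`v' = mean(cond(inrange(year, 2001, 2003), `v', .))

    * Overwrite only the unreliable 2004 values.
    replace `v' = m_`v' if year == 2004
    drop m_`v'
}
```

The varlist after -foreach- would need to list all 20 score variables. If a group has no observations in 2001-2003, the helper mean is missing and the 2004 value would become missing as well.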


> We have data from 12 schools, grouped into two conditions, for 100 categories, for 9 grades, for each of 10 years. This data is for 20 different variables.
> 
> We want to replace the scores for year 2004, with the mean of the score for 2001-2003 for each variable, for each grade, for each category, for each condition, for each school. The current values for 2004 are not reliable and so we are making this replacement.
> 
> How can we do this?
> 
> Here is an example of the data
> 
> sch_id group category grade year rex mex sex . . .
>  1       2     all      3   2001  1   2   0 . . .
>  1       2     all      3   2002  2   1   1 . . .
>  1       2     all      3   2003  4   1   1 . . .
>  2       2     all      3   2004  8   55  6 . . .
> ...
> 
> 17       1    blkfem    8   2010  1   2   1 . . .
> 
> --Alan Acock


