



re: st: data management question


From   "Airey, David C" <david.airey@vanderbilt.edu>
To   "statalist@hsphsun2.harvard.edu" <statalist@hsphsun2.harvard.edu>
Subject   re: st: data management question
Date   Fri, 22 Apr 2011 18:31:52 -0500


I'm guessing a statement like,

bysort var1-var10 (year): egen mean = mean(var11) if inlist(year,2001,2002,2003)

should get the means for those years for each subset.

Now how to get the 2004 year replaced with the means for each subset?
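
One possible way to do the replacement, offered as an untested sketch rather than a definitive answer: compute the 2001-2003 mean with cond() inside egen's mean(), so that the result is filled into every row of the group, including 2004, and then overwrite the 2004 values. The variable names come from the quoted example (sch_id, group, category, grade, year, rex, mex, sex); the other score variables are elided in the example and would need to be added to the varlist. The temporary names m_rex, m_mex, and m_sex are hypothetical.

```stata
* Sketch only: assumes one row per school/condition/category/grade/year,
* with the layout shown in the quoted example.
foreach v of varlist rex mex sex {
    * Mean of `v' over 2001-2003 within each subset; cond() turns all
    * other years into missing, so they are ignored by mean(), but the
    * result is still stored on every row of the subset.
    bysort sch_id group category grade: egen double m_`v' = mean(cond(inrange(year, 2001, 2003), `v', .))

    * Overwrite the unreliable 2004 score with that mean.
    replace `v' = m_`v' if year == 2004

    drop m_`v'
}
```

If a subset has no nonmissing scores for 2001-2003, the mean is missing and the 2004 value becomes missing as well; adding `& !missing(m_`v')` to the replace condition would keep the original 2004 value in that case.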


> We have data from 12 schools, grouped into two conditions, for 100 categories, for 9 grades, for each of 10 years. This data is for 20 different variables.
> 
> We want to replace the scores for year 2004, with the mean of the score for 2001-2003 for each variable, for each grade, for each category, for each condition, for each school. The current values for 2004 are not reliable and so we are making this replacement.
> 
> How can we do this?
> 
> Here is an example of the data
> 
> sch_id group category grade year rex mex sex . . .
>  1       2     all      3   2001  1   2   0 . . .
>  1       2     all      3   2002  2   1   1 . . .
>  1       2     all      3   2003  4   1   1 . . .
>  2       2     all      3   2004  8   55  6 . . .
> ...
> 
> 17       1    blkfem    8   2010  1   2   1 . . .
> 
> --Alan Acock


