Notice: On April 23, 2014, Statalist moved from an email list to a forum, based at statalist.org.

# Re: st: taking the average of duplicate observations

 From Nick Cox To "statalist@hsphsun2.harvard.edu" Subject Re: st: taking the average of duplicate observations Date Fri, 3 May 2013 12:01:37 +0100

```Your reference to another post lacks a URL, nor can we comment on code
that you don't show us, but there is a one-word solution: -collapse-.

collapse rainfall, by(station year month)

But I've worked a lot with rainfall data, and I'm puzzled at what you
are doing here. If these are daily data, the convention is to use
totals, not means. -collapse- can do that too.

Nick
njcoxstata@gmail.com

On 3 May 2013 11:48, Michael Tekle Palm <mickilinen@hotmail.com> wrote:
> Hello Statalist!
>
> I have observations with identical time values but different outcome values. Instead of dropping all but the first observations for every two/three duplicates, I want to calculate and replace with the average of the observations, and then drop the duplicates.
>
> So my data is on rainfall for a given location and is disaggregated by year and month. E.g:
>
> Station  |  Year  |  Month  |  Rainfall
> ---------------------------------------
>     1       1980       1          5
>     1       1980       1          3
>     1       1980       2          4
>     1       1980       3          8
>     1       1980       3          1
>
>
> So for each duplicate by station year month, I would like to calculate the average value for the rainfall outcomes, use this value and drop all duplicates. I think the solution suggested in this ["RE: st: questions about duplicate observations"] Statalist reply may work, but I wasn't quite able to make it work.

*
*   For searches and help try:
*   http://www.stata.com/help.cgi?search
*   http://www.stata.com/support/faqs/resources/statalist-faq/
*   http://www.ats.ucla.edu/stat/stata/
```