Statalist


[Date Prev][Date Next][Thread Prev][Thread Next][Date index][Thread index]

st: RE: duplicates in timevar in panels


From   "Dherani, Mukesh" <[email protected]>
To   <[email protected]>
Subject   st: RE: duplicates in timevar in panels
Date   Fri, 27 Jun 2008 09:47:25 +0100

Dear Mahesh,
One command that I often use to find duplicates in panel data is:

sort panel_id timevar,stable
br panel_id timevar if (panel_id== panel_id[_n-1]| panel_id== panel_id[_n+1]) & (timevar==timevar[_n-1]|timevar==timevar [_n+1])

this is for identification of duplicates.  
If both the rows are absolutely similar then you may delete any of the row 

drop if if panel_id== panel_id[_n-1] & timevar==timevar[_n-1]



BW,
mukesh

-----Original Message-----
From: [email protected] [mailto:[email protected]] On Behalf Of Mahesh Ramachandran
Sent: 26 June 2008 19:37
To: [email protected]
Subject: st: duplicates in timevar in panels

hello all,

I am working on a panel data set : 2000 households over 10 years(120 months)

It has a household identifier as panel id and year/month variable as timevar.

when I tried to tsset it, I get the following result:

. tsset panel_id timevar
repeated time values within panel
r(451);

My goal is to identify and isolates those panels which has repeated time values and remove the duplicates.
So I tried :
by panel_id: duplicates re timevar,

and response was that duplicates can not be combined with "by" command. Can you suggest me a way to remove
the duplicates.

Thanks,
MR
*
*   For searches and help try:
*   http://www.stata.com/support/faqs/res/findit.html
*   http://www.stata.com/support/statalist/faq
*   http://www.ats.ucla.edu/stat/stata/

*
*   For searches and help try:
*   http://www.stata.com/support/faqs/res/findit.html
*   http://www.stata.com/support/statalist/faq
*   http://www.ats.ucla.edu/stat/stata/



© Copyright 1996–2024 StataCorp LLC   |   Terms of use   |   Privacy   |   Contact us   |   What's new   |   Site index