Notice: On April 23, 2014, Statalist moved from an email list to a forum, based at statalist.org.
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: st: issues with tsset based on more than a time variable and duplicates conflicting results
From
Nick Cox <[email protected]>
To
"[email protected]" <[email protected]>
Subject
Re: st: issues with tsset based on more than a time variable and duplicates conflicting results
Date
Mon, 20 Jan 2014 18:16:41 +0000
Please don't direct further questions on this as if I will answer
them. I've already suggested that you should let Stata technical
support look at this (and they will want to see you your data). I have
already commented that -egen, group()- is a poor way to get a time
identifier.
Otherwise you seem to be going round and round the same questions and
although I sympathise greatly with your puzzlement I am (as already
said) out of ideas on what is going on.
Naturally, the field is wide open for ideas from anyone else.
Nick
[email protected]
On 20 January 2014 18:11, Abdalla, Ahmed <[email protected]> wrote:
> Nick, to follow up. I tried to play around with it again.
> I have -firmid=group(permno) and timeid=group(yr mth)
> I have duplicates when I run-duplicates report firmid timeid
> and I have no duplicates when I run -duplicates report permno yr mth
>
> Since I can't -tsset- based on two time variables,i.e. tsset permno yr mth - is incorrect and tsset firmid timeid-doesn't work UNTIL I drop the duplicates on firmid and timeid that don't show up at the same time in duplicates report permno yr mth, I tried the command- tsset permno timeid- i.e. I thought to group only the time variables in one variable and no need to do the same with the panel id variable(permno). Surprisingly, though I don't know if this is correct, it worked properly and no duplicates based on permno timeid.
>
> Is this correct? Can I do that and proceed with the analysis in my panel dat or do I mix things up that way? do you know why this might even happen?
>
>
>
> ________________________________________
> From: [email protected] <[email protected]> on behalf of Nick Cox <[email protected]>
> Sent: 20 January 2014 17:55
> To: [email protected]
> Subject: Re: st: issues with tsset based on more than a time variable and duplicates conflicting results
>
> OK; that rules out that idea. No more ideas from me, sorry.
> Nick
> [email protected]
>
>
> On 20 January 2014 17:47, Abdalla, Ahmed <[email protected]> wrote:
>> Sorry I forgot to write that I get
>>
>> . tabmiss mth if mth<1
>> Variable | Obs Missings Feq.Missings NonMiss Feq.NonMiss
>> -------------+---------------------------------------------------------------
>> mth | 0 0 . 0 .
>>
>> . tabmiss mth if mth>12
>> Variable | Obs Missings Feq.Missings NonMiss Feq.NonMiss
>> -------------+---------------------------------------------------------------
>> mth | 0 0 . 0 .
>>
>> Is that what you mean ?
>> Nick, do you think I can any more investigation to track why these results are conflicting ?
>>
>>
>>
>> ________________________________________
>> From: [email protected] <[email protected]> on behalf of Nick Cox <[email protected]>
>> Sent: 20 January 2014 17:43
>> To: [email protected]
>> Subject: Re: st: issues with tsset based on more than a time variable and duplicates conflicting results
>>
>> You haven't reported on whether there are non-missing values of -mth-
>> other than 1 to 12, i.e. <1 and >12.
>>
>> There is only one way to -tsset- panel data, with a panel identifier
>> and a time identifier. There is no syntax for three variables; the
>> -help- tells you that.
>>
>> Nick
>> [email protected]
>>
>>
>> On 20 January 2014 17:38, Abdalla, Ahmed <[email protected]> wrote:
>>> I run- tabmiss firmid timeid- and -tabmiss permno yr mth- and find no missing values in both cases.
>>> I run again duplicates report permno yr mth-and-duplicates report firmid timeid- in the fist case, I get no duplicates, however in the second case I get 510,000 duplicates !!
>>>
>>> Can I tsset my panel based on permno yr mth, and avoid the grouping I have done?
>>>
>>>
>>>
>>> ________________________________________
>>> From: [email protected] <[email protected]> on behalf of Nick Cox <[email protected]>
>>> Sent: 20 January 2014 17:28
>>> To: [email protected]
>>> Subject: Re: st: issues with tsset based on more than a time variable and duplicates conflicting results
>>>
>>> A wild guess is to check for missing values on these variables, and
>>> for rogue values of -mth- (missing, <1, >12).
>>>
>>> Nick
>>> [email protected]
>>>
>>>
>>> On 20 January 2014 17:10, Abdalla, Ahmed <[email protected]> wrote:
>>>> Dear Statalist
>>>> I want to tsset my data based on permno yr mth:
>>>> I tried - tsset permno yr mth - I get the error message "too many varaibles specified"
>>>> I tried - gen firmid= group(permno)
>>>> gen timeid = ym(yr, mth)
>>>> tsset firmid timeid - I get the error " repeated time values with panel
>>>> So I tried to investigate my duplicates :
>>>> I run the command:
>>>> duplicates report firmid timeid
>>>> I get
>>>> copies | observations surplus
>>>> ----------+---------------------------
>>>> 1 | 2181223 0
>>>> 2 | 53712 26856
>>>> 3 | 16515 11010
>>>> 4 | 9556 7167
>>>> 5 | 5510 4408
>>>> 6 | 1698 1415
>>>> 7 | 196 168
>>>> 8 | 48 42
>>>> 9 | 18 16
>>>>
>>>> I drop my duplicates and tsset my data, it works properly. But I though to investigate the duplicates again and run this code (of course before dropping my duplicates):
>>>> duplicates report permno mth yr, I get:
>>>>
>>>> copies | observations surplus
>>>> ----------+---------------------------
>>>> 1 | 2268476 0
>>>>
>>>>
>>>> Why both duplicates drop based on firmid and timeid versus permno yr mth are different though the firm id groups permno and the timeid groups yr and mth ?
>>>> Is there any other way to tsset my data based on permno yr mth rather than the grouping I have done (firmid and timeid) ?
>>>>
>>>>
>>>> Thanks
>>>>
>>>>
>>>>
>>>>
>>>>
>>>>
>>>> *
>>>> * For searches and help try:
>>>> * http://www.stata.com/help.cgi?search
>>>> * http://www.stata.com/support/faqs/resources/statalist-faq/
>>>> * http://www.ats.ucla.edu/stat/stata/
>>> *
>>> * For searches and help try:
>>> * http://www.stata.com/help.cgi?search
>>> * http://www.stata.com/support/faqs/resources/statalist-faq/
>>> * http://www.ats.ucla.edu/stat/stata/
>>>
>>> *
>>> * For searches and help try:
>>> * http://www.stata.com/help.cgi?search
>>> * http://www.stata.com/support/faqs/resources/statalist-faq/
>>> * http://www.ats.ucla.edu/stat/stata/
>> *
>> * For searches and help try:
>> * http://www.stata.com/help.cgi?search
>> * http://www.stata.com/support/faqs/resources/statalist-faq/
>> * http://www.ats.ucla.edu/stat/stata/
>>
>> *
>> * For searches and help try:
>> * http://www.stata.com/help.cgi?search
>> * http://www.stata.com/support/faqs/resources/statalist-faq/
>> * http://www.ats.ucla.edu/stat/stata/
> *
> * For searches and help try:
> * http://www.stata.com/help.cgi?search
> * http://www.stata.com/support/faqs/resources/statalist-faq/
> * http://www.ats.ucla.edu/stat/stata/
>
> *
> * For searches and help try:
> * http://www.stata.com/help.cgi?search
> * http://www.stata.com/support/faqs/resources/statalist-faq/
> * http://www.ats.ucla.edu/stat/stata/
*
* For searches and help try:
* http://www.stata.com/help.cgi?search
* http://www.stata.com/support/faqs/resources/statalist-faq/
* http://www.ats.ucla.edu/stat/stata/