Statalist


[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

AW: st: AW: forvalues & replace not working under two 'not equal to' conditions


From   "Martin Weiss" <[email protected]>
To   <[email protected]>
Subject   AW: st: AW: forvalues & replace not working under two 'not equal to' conditions
Date   Tue, 10 Nov 2009 22:44:23 +0100

<> 

My code is slightly more readable, though...

I am still not sure I understand your definitions, but try:


*************

clear*
inp byte(contract_id firm_id) /*
*/ nation_id:mylabel, auto
1   2   "US"
1   2   "US"
4   3   "UK"
4   3   "US"
8   4   "US"
8   4   "UK"
8   3   "US"
9   5   "US"
9   4   "UK"
9   3   "US"
10   5   "US"
10   5   "US"
10   6   "NL"
10   7   "NL"
10   4   "UK"
10   4   "CH"
end

bys contract_id firm_id nation_id: /*
*/ egen mycount=count(nation_id)

gen byte group_d=mycount>1 & !mi(mycount)
drop mycount

bys contract_id: egen sdnat=sd(nation_id)
bys contract_id: egen sdfir=sd(firm_id)

gen byte nongroup_f=min(sdnat,sdfir)>0
drop sdnat sdfir

l, sepby(contract_id) noo
*************



HTH
Martin


-----Ursprüngliche Nachricht-----
Von: [email protected]
[mailto:[email protected]] Im Auftrag von joe j
Gesendet: Dienstag, 10. November 2009 22:32
An: [email protected]
Betreff: Re: st: AW: forvalues & replace not working under two 'not equal
to' conditions

Thanks again Martin. With your approach one could get the same results
as with my original codes (in the original post) for the first three
variables. My problem is with the last variable. I don't understand
why the following code does not work (which aims variable 'nongroup_f'
to take the value 1 only when 'firm_id' as well as 'nation_id' of
preceding and/or following observations are different ). Instead the
variable takes the value 1 for all observations:

gen nongroup_f=.

forvalues i=1/`=_N'{
bys contract_id: replace nongroup_f=1 if (firm_id~=firm_id[_n-`i']) &
(nation_id~=nation_id[_n-`i'])
}

forvalues i=1/`=_N'{
bys contract_id: replace nongroup_f=1 if (firm_id~=firm_id[_n+`i']) &
(nation_id~=nation_id[_n+`i'])
}

On Tue, Nov 10, 2009 at 6:25 PM, Martin Weiss <[email protected]> wrote:
>
> <>
>
> So a revised version for your first dummy is:
>
>
> *************
> clear*
> inp byte(contract_id firm_id)  /*
> */ nation_id:mylabel, auto
> 1   2   "US"
> 1   2   "US"
> 4   3   "UK"
> 4   3   "US"
> 8   4   "US"
> 8   4   "UK"
> 8   3   "US"
> 9   5   "US"
> 9   4   "UK"
> 9   3   "US"
> 10   5   "US"
> 10   5   "US"
> 10   6   "NL"
> 10   7   "NL"
> 10   4   "UK"
> 10   4   "CH"
> end
>
> bys contract_id firm_id nation_id:  /*
> */ egen mycount=count(nation_id)
>
> gen byte group_d=mycount>1 & !mi(mycount)
> drop mycount
>
> l, sepby(contract_id) noo
>
> *************
>
>
>
> HTH
> Martin
>
>
> -----Ursprüngliche Nachricht-----
> Von: [email protected]
> [mailto:[email protected]] Im Auftrag von joe j
> Gesendet: Dienstag, 10. November 2009 18:08
> An: [email protected]
> Betreff: Re: st: AW: forvalues & replace not working under two 'not equal
> to' conditions
>
> Thanks. The last 4 columns (group_d; group_f; nongroup_d; nongroup_f)
> are the final output variables. Their definitions are below the table.
>
> ******
> contract_id; firm_id; nation_id; group_d; group_f; nongroup_d; nongroup_f
> 1       2       US      1       0       0       0
> 1       2       US      1       0       0       0
> 4       3       UK      0       1       0       0
> 4       3       US      0       1       0       0
> 8       3       US      0       0       1       1
> 8       4       UK      0       1       0       1
> 8       4       US      0       1       1       1
> 9       3       US      0       0       1       1
> 9       4       UK      0       0       0       1
> 9       5       US      0       0       1       1
> 10      4       CH      0       1       0       1
> 10      4       UK      0       1       0       1
> 10      5       US      1       0       0       1
> 10      5       US      1       0       0       1
> 10      6       NL      0       0       1       1
> 10      7       NL      0       0       1       1
> ******
> 1. group_d = 1 when both firm_id and nation_id are same for two or
> more observations with the same contract id
> 2. group_f = 1  when firm_id is same but nation_id is different for
> two or more observations with the same contract id
> 3. nongroup_d = 1  when firm_id is different but nation_id is same for
> two or more observations with the same contract id
> 4 .nongroup_f = 1  when both firm_id and nation_id are different for
> two or more observations with the same contract id
>
>
> On Tue, Nov 10, 2009 at 5:47 PM, Martin Weiss <[email protected]>
wrote:
>>
>> <>
>>
>>
>> For clarification, you could provide the solution, i.e. the dummies that
> you
>> actually want to see as your final output, for your chosen example. Makes
> it
>> considerably easier to work towards code for you...
>>
>>
>>
>> HTH
>> Martin
>>
>>
>> -----Ursprüngliche Nachricht-----
>> Von: [email protected]
>> [mailto:[email protected]] Im Auftrag von joe j
>> Gesendet: Dienstag, 10. November 2009 17:39
>> An: [email protected]
>> Betreff: Re: st: AW: forvalues & replace not working under two 'not equal
>> to' conditions
>>
>> Thanks Martin. I think I wasn't clear enough in the last mail. I was
>> not looking at various combinations of firm_id, nation_id and
>> contract_id 'for each observation'. Rather I was looking at the
>> similarity or difference of firm_id/nation_id 'between two or more
>> observations' under each contract_id.
>>
>> Based on Martin's suggestion I could derive group_d (see below). But I
>> still can't get right nongroup_f, which equals 1 (for all
>> observations) if firm_id and nation_id are different for two or more
>> observations under each contract_id (but it takes a value 1, wrongly,
>> for all observations in the data)
>>
>> *deriving group_d (this works)
>> egen groups=group(firm_id nation_id)
>>
>> bys contract_id (groups):  /*
>> */ gen byte distinctcount_group_d= /*
>> */ (groups[_n]==groups[_n+1])
>>
>> bys contract_id (groups):  /*
>> */ replace distinctcount_group_d=1 /*
>> */ if (groups[_n]==groups[_n-1])
>>
>> *2 deriving nongroup_f doesnt work (e.g. it should be 0 for
contract_id=1)
>> bys contract_id (groups):  /*
>> */ gen byte distinctcount_nongroup_f= /*
>> */ (groups[_n]~=groups[_n+1]) & (nation_id[_n]~=nation_id[_n+1])
>>
>> bys contract_id (groups):  /*
>> */ replace distinctcount_nongroup_f=1 /*
>> */ if (groups[_n]~=groups[_n-1]) & (nation_id[_n]~=nation_id[_n-1])
>>
>> On Tue, Nov 10, 2009 at 4:14 PM, Martin Weiss <[email protected]>
> wrote:
>>>
>>> <>
>>>
>>> I think a variable denoting the combinations between the three ids is a
>> good
>>> place to start for you:
>>>
>>>
>>>
>>> *************
>>> clear*
>>> inp byte(contract_id firm_id) nation_id:mylabel, auto
>>> 1   2   "US"
>>> 1   2   "US"
>>> 4   3   "UK"
>>> 4   3   "US"
>>> 8   4   "US"
>>> 8   4   "UK"
>>> 8   3   "US"
>>> 9   5   "US"
>>> 9   4   "UK"
>>> 9   3   "US"
>>> 10   5   "US"
>>> 10   5   "US"
>>> 10   6   "NL"
>>> 10   7   "NL"
>>> 10   4   "UK"
>>> 10   4   "CH"
>>> end
>>>
>>> egen groups=group(contract_id firm_id nation_id)
>>>
>>> l, sepby(con) noobs
>>> *************
>>>
>>>
>>>
>>> HTH
>>> Martin
>>>
>>>
>>> -----Ursprüngliche Nachricht-----
>>> Von: [email protected]
>>> [mailto:[email protected]] Im Auftrag von joe j
>>> Gesendet: Dienstag, 10. November 2009 16:04
>>> An: [email protected]
>>> Betreff: st: forvalues & replace not working under two 'not equal to'
>>> conditions
>>>
>>> My dataset has three variables 1. contract_id, 2. firm_id and 3.
>>> nation_id. I want to create 4 variables, each of which gets a value of
>>> 1 if certain conditions are met. The variables I want to create are
>>> specific to the contract id, and are:
>>>
>>> 1. group_d = 1 when both firm_id and nation_id are same for two or
>>> more firms with the same contract id
>>> 2. group_f = 1  when firm_id is same but nation_id is different for
>>> two or more firms with the same contract id
>>> 3. nongroup_d = 1  when firm_id is different but nation_id is same for
>>> two or more firms with the same contract id
>>> 4 .nongroup_f = 1  when both firm_id and nation_id are different for
>>> two or more firms with the same contract id
>>>
>>> The following code works well for the first three variables, but not
>>> for the last, nongroup_f; the value is 1 for all observations. I can't
>>> figure out why.
>>>
>>> This is a sample code:
>>>
>>> clear
>>> inp str10(contract_id firm_id   nation_id)
>>> 1   2   "US"
>>> 1   2   "US"
>>> 4   3   "UK"
>>> 4   3   "US"
>>> 8   4   "US"
>>> 8   4   "UK"
>>> 8   3   "US"
>>> 9   5   "US"
>>> 9   4   "UK"
>>> 9   3   "US"
>>> 10   5   "US"
>>> 10   5   "US"
>>> 10   6   "NL"
>>> 10   7   "NL"
>>> 10   4   "UK"
>>> 10   4   "CH"
>>> end
>>>
>>>
>>> *1.group_d . WORKS!
>>> gen group_d=.
>>> forvalues i=1/`=_N'{
>>> bys contract_id: replace group_d=1 if firm_id==firm_id[_n-`i'] &
>>> nation_id==nation_id[_n-`i']
>>> }
>>> forvalues i=1/`=_N'{
>>> bys contract_id: replace group_d=1 if firm_id==firm_id[_n+`i'] &
>>> nation_id==nation_id[_n+`i']
>>> }
>>>
>>> *2.group_f  WORKS!
>>> gen group_f=.
>>> forvalues i=1/`=_N'{
>>> bys contract_id: replace group_f=1 if firm_id==firm_id[_n-`i'] &
>>> nation_id!=nation_id[_n-`i']
>>> }
>>> forvalues i=1/`=_N'{
>>> bys contract_id: replace group_f=1 if firm_id==firm_id[_n+`i'] &
>>> nation_id!=nation_id[_n+`i']
>>> }
>>>
>>> *3. nongroup_d  WORKS!
>>> gen nongroup_d=.
>>> forvalues i=1/`=_N'{
>>> bys contract_id: replace nongroup_d=1 if firm_id!=firm_id[_n-`i'] &
>>> nation_id==nation_id[_n-`i']
>>> }
>>> forvalues i=1/`=_N'{
>>> bys contract_id: replace nongroup_d=1 if firm_id!=firm_id[_n+`i'] &
>>> nation_id==nation_id[_n+`i']
>>> }
>>>
>>> *4.nongroup_f DOESN'T WORK!!
>>> gen nongroup_f=.
>>> forvalues i=1/`=_N'{
>>> bys contract_id: replace nongroup_f=1 if (firm_id~=firm_id[_n-`i']) &
>>> (nation_id~=nation_id[_n-`i'])
>>> }
>>> forvalues i=1/`=_N'{
>>> bys contract_id: replace nongroup_f=1 if (firm_id~=firm_id[_n+`i']) &
>>> (nation_id~=nation_id[_n+`i'])
>>> }
>>> *
>>> *   For searches and help try:
>>> *   http://www.stata.com/help.cgi?search
>>> *   http://www.stata.com/support/statalist/faq
>>> *   http://www.ats.ucla.edu/stat/stata/
>>>
>>>
>>> *
>>> *   For searches and help try:
>>> *   http://www.stata.com/help.cgi?search
>>> *   http://www.stata.com/support/statalist/faq
>>> *   http://www.ats.ucla.edu/stat/stata/
>>>
>>
>> *
>> *   For searches and help try:
>> *   http://www.stata.com/help.cgi?search
>> *   http://www.stata.com/support/statalist/faq
>> *   http://www.ats.ucla.edu/stat/stata/
>>
>>
>> *
>> *   For searches and help try:
>> *   http://www.stata.com/help.cgi?search
>> *   http://www.stata.com/support/statalist/faq
>> *   http://www.ats.ucla.edu/stat/stata/
>>
>
> *
> *   For searches and help try:
> *   http://www.stata.com/help.cgi?search
> *   http://www.stata.com/support/statalist/faq
> *   http://www.ats.ucla.edu/stat/stata/
>
>
> *
> *   For searches and help try:
> *   http://www.stata.com/help.cgi?search
> *   http://www.stata.com/support/statalist/faq
> *   http://www.ats.ucla.edu/stat/stata/
>

*
*   For searches and help try:
*   http://www.stata.com/help.cgi?search
*   http://www.stata.com/support/statalist/faq
*   http://www.ats.ucla.edu/stat/stata/


*
*   For searches and help try:
*   http://www.stata.com/help.cgi?search
*   http://www.stata.com/support/statalist/faq
*   http://www.ats.ucla.edu/stat/stata/



© Copyright 1996–2024 StataCorp LLC   |   Terms of use   |   Privacy   |   Contact us   |   What's new   |   Site index