Bookmark and Share

Notice: On March 31, it was announced that Statalist is moving from an email list to a forum. The old list will shut down at the end of May, and its replacement, statalist.org is already up and running.


[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: st: copying a string variable to all rows within a group


From   <Seyi.Soremekun@lshtm.ac.uk>
To   <statalist@hsphsun2.harvard.edu>
Subject   Re: st: copying a string variable to all rows within a group
Date   Wed, 29 Feb 2012 11:38:52 +0000

Hi Again, 
you below explanation may go some way to explaining my previous reply to your response to the original query. 
However I did do a gsort of the string variable ("gsort groupvar -oldvar" using my original variable names) to address the problem of strings being sorted with missing first....

>>> Nick Cox <njcoxstata@gmail.com> 2/29/2012 10:40 AM >>>
Seyi did say "not necessarily on the first row". Only the first of
these is a good solution given that warning. The logic is that -sort-
sorts all non-empty strings to the end of any block of observations.

(Perhaps Seyi started out with the identifiers In the first
observation of each block, but they got scrambled a bit by some later
-sort-. -sort groupvar- makes no guarantees about other variables
unless you specify a stable sort.

Nick

On Wed, Feb 29, 2012 at 9:01 AM, Nick Cox <njcoxstata@gmail.com> wrote:
> For "row" read "observation". We know what you mean, but that is the
> correct Stata terminology.
>
> bysort groupvar (oldvar) : replace oldvar = oldvar[_N]
>
> Alternatively,
>
> bysort groupvar : replace oldvar = oldvar[1]
>
> will probably work too. Alternatively,
>
> bysort groupvar : replace oldvar = oldvar[_n-1] if missing(oldvar) & _n > 1
>
> will probably work too. This last is an FAQ
>
> FAQ     . . . . . . . . . . . . . . . . . . . . . . . Replacing missing values
>        . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .  N. J. Cox
>        2/03    How can I replace missing values with previous or
>                following nonmissing values?
>                http://www.stata.com/support/faqs/data/missing.html 
>
> On Wed, Feb 29, 2012 at 8:28 AM,  <Seyi.Soremekun@lshtm.ac.uk> wrote:
> ,
>> If I have a numeric variable "oldvar" which appears on only the first row of a set of rows defined by a group variable "groupvar" (rest of oldvar rows are blank), it is easy enough to copy oldvar down all rows within groupvar:
>>
>> bysort groupvar: egen newvar=max(oldvar) // can equally use =min(oldvar) as there is only 1 value of oldvar.
>>
>> How might I do the same to copy a string oldvar to all rows - again it only appears once within the group, but not necessarily on the first row.
>>

*
*   For searches and help try:
*   http://www.stata.com/help.cgi?search 
*   http://www.stata.com/support/statalist/faq 
*   http://www.ats.ucla.edu/stat/stata/


*
*   For searches and help try:
*   http://www.stata.com/help.cgi?search
*   http://www.stata.com/support/statalist/faq
*   http://www.ats.ucla.edu/stat/stata/


© Copyright 1996–2014 StataCorp LP   |   Terms of use   |   Privacy   |   Contact us   |   Site index