Statalist The Stata Listserver


[Date Prev][Date Next][Thread Prev][Thread Next][Date index][Thread index]

[no subject]



id where1 start1 end1 where2 ... where10 start10 end10 

so that 

reshape long where start end, i(id) 
drop if missing(where) & missing(start) & missing(end) 

would then be what you want. 

In short, this is a standard -reshape- problem, so that
detailed study of the help and manual entry for -reshape-
should give you enough background. The most likely 
small problem may be unsuitable variable names, in which 
case some prior use of -rename- is in order. Some 
common problems are discussed at 

FAQ     . . . . . . . . . . . . . . . . . . . . . . . .  Problems with reshape
        12/03   I am having problems with the reshape command. Can
                you give further guidance?
                http://www.stata.com/support/faqs/data/reshape3.html

Nick 
[email protected] 

Buszin, Justin
  
> Below are two rows of data, each representing an observation 
> in a STATA dataset (NOT raw data). The first number is the 
> variable person ID, the first word is a variable which 
> represents where that person lived at age 0 (thus, where they 
> were born). Person 1 was born in central region, person 2 
> born in western region. The number after represents the start 
> date of the spell, and then the next number is the end date 
> of the spell. So person 1 lived in central region until she 
> was 10 years old, person 2 lived in western region until she 
> was 30 years old. The next word is where that person moved, 
> if ever. Person 1 moved to Accra at age 11 and stayed there 
> until she was 20, and then moved to Ashanti region when she 
> was 21 and stayed there until she was 27, which is her 
> current age at time of survey. 
>  
> Each of these columns represents a variable and follows a 
> series (word, number, number). Since person 2 moved no more 
> than twice, she will be missing in future variable series 
> (thus why you see nothing for person 2 while person 1 shows 
> her Ashanti move) until we come upon the next attribute 
> (rural/urban). A person could have moved up to 10 times, so 
> if she moved less than that number of times, she will be 
> missing on subsequent variables and series until the next 
> attribute. So person 1 is missing on the series for 4th move, 
> 5th move, etc, until we come to the ten-series of rural/urban 
> life. Person 1 lived in a rural area from 0-3, person 2 lived 
> in an urban area from 0-30, a rural area from 31-32, and so forth. 
>  
> 1 central  0 10  accra  11 20 ashanti 21 27 rural  0 3    
> urban 4 6    rural 7 15 urban 16 20 rural 21 27
> 2 western 0 30 central 31 32                     urban 0 30  
> rural  31 32         
> 
> The words are string variables and the number numeric. In 
> case there is a problem reading in e-mail, the line breaks 
> are after person ID, the word, the start date, the end date, 
> etc (each column represents a new variable). Again, if we get 
> past the first variable, which is person ID, the sequence of 
> variables is: place of residence, age started at that place, 
> age ended at that place, 2nd place of residence, age started 
> at that place, age ended at that place, 3rd place of residence, etc. 
> 
> The big question is: How can STATA take this spell file and 
> change it into a person-year file?

*
*   For searches and help try:
*   http://www.stata.com/support/faqs/res/findit.html
*   http://www.stata.com/support/statalist/faq
*   http://www.ats.ucla.edu/stat/stata/



© Copyright 1996–2024 StataCorp LLC   |   Terms of use   |   Privacy   |   Contact us   |   What's new   |   Site index