Notice: On March 31, it was **announced** that Statalist is moving from an email list to a **forum**. The old list will shut down on April 23, and its replacement, **statalist.org** is already up and running.

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

From |
Robson Glasscock <glasscockrc@vcu.edu> |

To |
statalist@hsphsun2.harvard.edu |

Subject |
Re: st: Change variable |

Date |
Thu, 15 Sep 2011 09:47:51 -0400 |

I understand that the id number is the number of the respondent and not the number of children in the house. To me, the unique id is a combination of the household and the respondent id. File 1 is panel data that tracks educational level and school type changes for each respondent in a particular household. But you're tracking these changes for a particular respondent in a particular household- so that's your unique identifier. What you're trying to do here is merge in the "ethnicity" and "livings" data for each child along with the data in File 1 that has information on how that child's education changed over time. What I'm guessing is that in File 2 you do not actually only have one observation for each respondent in each household. If there are multiple observations in File 2, then a m:1 merge won't work. If you have multiple observations in File 2 for the same respondent, then you may be able to retain only one observation for each respondent in File 2. The "ethnicity" variable won't vary in File 2 even if you have multiple observations. What about "livings"? Does that change? If it doesn't and there are multiple observations per respondent, then you can simply retain only one observation per individual in File 2 and redo the merge we discussed previously. best, Robson Glasscock On Thu, Sep 15, 2011 at 9:25 AM, Ay <yuniphone@yahoo.co.id> wrote: > Dear Robson, > > Thank you for replying. > I've tried m:1, it showed that ...is not unique id. > When i check it using isid, file1 includes educlevel as unique id. Hence, there would be the same child with different educlevel. So, this child has 2 obs in the file1. > Actually, the child (id2) does not mean the number of child in the household, but number of respondent. > I have searched online about this, but still does not get an answer. > > Thank you, > > Yun > > > > > On 15 Sep 2011, at 22:06, Robson Glasscock <glasscockrc@vcu.edu> wrote: > >> Hi Ay, >> >> No need for an apology. This is a m:1 merge using household(id1) and >> child(id2). >> >> Sort File 1 on household and child. Save it. >> Sort File 2 on household. Save it. >> >> Go into file_1 and type: >> >> merge m:1 household child using File_2.dta >> >> Note that it looks to me from the data you provided that household 2 >> has child 4 and child 3 in File 1, but File 2 only has child 1. Also, >> household 3 in File 1 has child 5, but household 3 in File 2 has child >> 2. These will show up as observations where _merge != 3 after you >> merge the files. >> >> best, >> Robson Glasscock >> >> On Thu, Sep 15, 2011 at 6:02 AM, Ay <yuniphone@yahoo.co.id> wrote: >>> Dear Robson, >>> >>> My apologize about my unclear question. >>> >>> I try to list the variables and data in rows rather than in colomn. >>> >>> File 1 >>> Household(id1)-> >>> 1, 1, 1, 2, 2, 2, 3, ... >>> >>> Child(id2)-> >>> 2, 2, 3, 4, 3, 3, 5, ... >>> >>> Educlevel-> >>> 2, 1, 1, 2, 2, 1, 1, ... >>> >>> X3(schooltype)-> >>> a, b, a, b, a, a, b, ... >>> >>> X4(repetition) >>> 2, 2, 2, 2, 2, 2, 2, ... >>> >>> Educlevel in strings. >>> Educlevel=1 is primary school. >>> Educlevel=2 is secondary level. >>> A child may have secondary level and thus have records for his primary level along with other variables in file1. >>> >>> >>> File 2 >>> Household(id1)-> >>> 1, 2, 3, ... >>> >>> Child(id2)-> >>> 3, 1, 2, ... >>> >>> X3(ethnicity) >>> 1, 3, 2, ... >>> >>> X4(livings)-> >>> 1, 2, 2, ... >>> >>> In file 2 only single obs. >>> >>> I hope this is clearer. >>> >>> Thank you. >>> >>> Regards, >>> >>> Yun >>> >>> >>> On 11 Sep 2011, at 01:53, Robson Glasscock <glasscockrc@vcu.edu> wrote: >>> >>>> So File 1 is a panel data set that records each individual's >>>> educational level over time? If that is the case, and if File 2 is >>>> also panel data, then a 1:1 merge based on x1(unique id) and x3(time >>>> period) could be done. >>>> >>>> If File 2 is not panel data, then you're looking at a m:1 merge based >>>> on x1(unique id). >>>> >>>> If this doesn't solve the problem, I think you should include an >>>> example of the observations in each of your files on your next post. >>>> I'm still struggling a bit to picture the contents of your files. >>>> >>>> On Sat, Sep 10, 2011 at 9:20 AM, Ay <yuniphone@yahoo.co.id> wrote: >>>>> Dear Robson, >>>>> >>>>> Thank you for replying. >>>>> >>>>> Actually, >>>>> File 1: >>>>> x1 (unique id) >>>>> x2 (educational level) >>>>> >>>>> File 2 >>>>> x1 (unique id) >>>>> >>>>> >>>>> In file 2, x1 & x2 together are unique id. All of the variables are in strings. >>>>> >>>>> I could not merge file1 and file 2 since x2 is the educational level in which an individual would have more than 1 level/type. >>>>> >>>>> Thank you, >>>>> >>>>> Yun >>>>> >>>>> >>>>> >>>>> >>>>> >>>>> >>>>> On 10 Sep 2011, at 22:56, Robson Glasscock <glasscockrc@vcu.edu> wrote: >>>>> >>>>>> I'm having a hard time understanding the structure of your "first >>>>>> file" and your "second file. Do your files looks like this: >>>>>> >>>>>> File 1: >>>>>> x1 >>>>>> (unique id + educational level) >>>>>> >>>>>> File 2: >>>>>> x1 >>>>>> (unique id) >>>>>> >>>>>> so that File 1 contains a concatenated variable? >>>>>> >>>>>> If File 1 has one variable, x1, that is concatenated, then I don't >>>>>> think you need to merge the files together to get what you want. You >>>>>> would just need to split x1(unique id + educational level) into >>>>>> x1(unique id) and x2(educational level). The substr function may be >>>>>> used to do this. >>>>>> >>>>>> -help substr- >>>>>> >>>>>> best, >>>>>> Robson Glasscock >>>>>> >>>>>> On Fri, Sep 9, 2011 at 5:27 PM, Ay <yuniphone@yahoo.co.id> wrote: >>>>>>> >>>>>>> I am using stata 11.0 >>>>>>> >>>>>>> How can i generate or change a variable that contains for instance, educational level or father and mother as 'Type'? >>>>>>> The variables are in strings format. >>>>>>> >>>>>>> The reason i am asking this question is that, i want to merge these 'type' variable (var B) with other variable. >>>>>>> I could not do this since the second file only has variable A as individual id variable. While in the first file, it has var A and var B ('type' or level variable) as unique Id. >>>>>>> >>>>>>> If anyone knows other ways besides generating the type variable before merging, I really appreciate it. >>>>>>> >>>>>>> Variable a >>>>>>> 1,2,1,1,2,3,3 >>>>>>> >>>>>>> Variable b(type) >>>>>>> A,A,B,C,B,C,A >>>>>>> >>>>>>> Many thanks. >>>>>>> >>>>>>> Regards, >>>>>>> >>>>>>> Yun >>>>>>> >>>>>>> >>>>>>> >>>>>>> >>>>>>> >>>>>>> * >>>>>>> * For searches and help try: >>>>>>> * http://www.stata.com/help.cgi?search >>>>>>> * http://www.stata.com/support/statalist/faq >>>>>>> * http://www.ats.ucla.edu/stat/stata/ >>>>>> >>>>>> * >>>>>> * For searches and help try: >>>>>> * http://www.stata.com/help.cgi?search >>>>>> * http://www.stata.com/support/statalist/faq >>>>>> * http://www.ats.ucla.edu/stat/stata/ >>>>> >>>>> * >>>>> * For searches and help try: >>>>> * http://www.stata.com/help.cgi?search >>>>> * http://www.stata.com/support/statalist/faq >>>>> * http://www.ats.ucla.edu/stat/stata/ >>>>> >>>> >>>> * >>>> * For searches and help try: >>>> * http://www.stata.com/help.cgi?search >>>> * http://www.stata.com/support/statalist/faq >>>> * http://www.ats.ucla.edu/stat/stata/ >>> >>> * >>> * For searches and help try: >>> * http://www.stata.com/help.cgi?search >>> * http://www.stata.com/support/statalist/faq >>> * http://www.ats.ucla.edu/stat/stata/ >>> >> >> * >> * For searches and help try: >> * http://www.stata.com/help.cgi?search >> * http://www.stata.com/support/statalist/faq >> * http://www.ats.ucla.edu/stat/stata/ > > * > * For searches and help try: > * http://www.stata.com/help.cgi?search > * http://www.stata.com/support/statalist/faq > * http://www.ats.ucla.edu/stat/stata/ > * * For searches and help try: * http://www.stata.com/help.cgi?search * http://www.stata.com/support/statalist/faq * http://www.ats.ucla.edu/stat/stata/

**Follow-Ups**:**Re: st: Change variable***From:*Ay <yuniphone@yahoo.co.id>

**References**:**st: Change variable***From:*Ay <yuniphone@yahoo.co.id>

**Re: st: Change variable***From:*Robson Glasscock <glasscockrc@vcu.edu>

**Re: st: Change variable***From:*Ay <yuniphone@yahoo.co.id>

**Re: st: Change variable***From:*Robson Glasscock <glasscockrc@vcu.edu>

**Re: st: Change variable***From:*Ay <yuniphone@yahoo.co.id>

**Re: st: Change variable***From:*Robson Glasscock <glasscockrc@vcu.edu>

**Re: st: Change variable***From:*Ay <yuniphone@yahoo.co.id>

- Prev by Date:
**Re: st: Change variable** - Next by Date:
**Re: st: Weighted counts with "svy" command** - Previous by thread:
**Re: st: Change variable** - Next by thread:
**Re: st: Change variable** - Index(es):