Statalist


[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

st: RE: DHS data


From   "Nick Cox" <n.j.cox@durham.ac.uk>
To   <statalist@hsphsun2.harvard.edu>
Subject   st: RE: DHS data
Date   Thu, 5 Nov 2009 14:49:33 -0000

My guess is that you are losing uniqueness in your concatenation of
identifiers. 

See for example the thread starting 

http://www.stata.com/statalist/archive/2009-10/msg01256.html

An alternative approach is to work with the string identifier 

egen hhid1 = concat(v001 v002) 

Nick 
n.j.cox@durham.ac.uk 

Bolanle Bukoye

I am writing to inquire if you could help me with some issues that I am
having with merging the dhs data. I am using the data from Angola and
Tanzania for my thesis. In merging the child and household data for
Angola, I keep getting error messages on stata. 
I used the household dataset as the base dataset and cluster number and
household number as the key variables as suggested on the DHS website. 

My stata code is 
. gen hhid1 = v001 + v002
. sort hhid1
. merge unique using "C:\Users\bola\Documents\Fall
2009\Thesis\AOKR51FL.DTA", uniqmaster


While the household data set is open, I use the merge drop down menu-->
select one to many key variables, choose child data set and select the
hhid1 variable that I just created. 

Here is the error message that I received on stata : "variable hhid1
does not uniquely identify observations in the master data"


*
*   For searches and help try:
*   http://www.stata.com/help.cgi?search
*   http://www.stata.com/support/statalist/faq
*   http://www.ats.ucla.edu/stat/stata/



© Copyright 1996–2014 StataCorp LP   |   Terms of use   |   Privacy   |   Contact us   |   What's new   |   Site index