Statalist


[Date Prev][Date Next][Thread Prev][Thread Next][Date index][Thread index]

Re: st: RE: Hotdeck problem


From   anders35@uwm.edu
To   statalist@hsphsun2.harvard.edu, "Rodrigo Alfaro A." <ralfaro@bcentral.cl>
Subject   Re: st: RE: Hotdeck problem
Date   Wed, 2 Apr 2008 08:11:20 -0500

Rodrigo,

Thank you for your response. I tried your suggested technique, and came up with
the same results. The issue must lie within my variables...

Joelle

Quoting "Rodrigo Alfaro A." <ralfaro@bcentral.cl>:

> Joelle,
>
> It seems that you have missings in the by-variables. Consider the
> following sentence: hotdeck y, store by(x1 x2) keep(id). You will have
> troubles if x1 or x2 have missing, we "solved" the problem using the -9.
> Suppose x1 = {1, 2, 3} and x2={1, 2, 3, 4, 5}, then we created x1=-9 if
> x1==. and x2=-9 if x2==., we hotdeck in that way (with the news x1 and
> x2) and then we put as missings the cases where x1 or x2 ==-9. We did
> that with a simple loop.
>
> Rodrigo.
>
>
>
>
>
> -----Mensaje original-----
> De: owner-statalist@hsphsun2.harvard.edu
> [mailto:owner-statalist@hsphsun2.harvard.edu] En nombre de Joelle M
> Anderson
> Enviado el: Domingo, 30 de Marzo de 2008 01:38 p.m.
> Para: statalist@hsphsun2.harvard.edu
> Asunto: st: Hotdeck problem
>
> For my thesis, I am using the hotdeck program to impute values for
> missing cases in my income variable. Currently, I am trying to hotdeck
> my income variable (176 missing) using 3 variables (age=27 missing;
> education=13 missing; gender=0 missing; although with 9 overlapping
> missing values the combination of these three variables only has 31
> missing cases total). Yet when Stata creates my new, hotdecked income
> variable, there are an additional 19 missing cases that I can't account
> for (missing=50). Does anyone know why this might be? Another strange
> thing is that, when I try to rename my hotdecked income measure before
> merging it with my full dataset, all 50 missing cases remain missing
> after merging; when I do not rename my hotdecked income measure before
> merging, only 42 missing cases remain missing after merging. I have
> pasted my Stata output below. Any help would be greatly appreciated!
>
> Joelle Anderson
> Graduate Student, Sociology
> University of Wisconsin-Milwaukee
> anders35@uwm.edu
>
> //First hotdeck imputation, renaming the income variable BEFORE merging
> with full dataset
>
> . hotdeck incomeR using IncomeHD, store by(education sex ageR) keep(resp
> incomeR)
> DELETING all matrices....
>
> Table of the Missing data patterns
>  * signifies missing and - is not missing
>
> Varlist order: incomeR
>
>     pattern |      Freq.     Percent        Cum.
> ------------+-----------------------------------
>           * |        176       11.72       11.72
>           - |      1,326       88.28      100.00
> ------------+-----------------------------------
>       Total |      1,502      100.00
> WARNING: When the <command> option is not selected then no analysis is
> performed on the imputed datasets
>
>
> . use "C:\data\IncomeHD1.dta", clear
>
> . tab incomeR
>
>   RECODE of |
>      income |
>    (income. |
>  last year, |
>  that is in |
>  2004, what |
>    was your |
> total famil |      Freq.     Percent        Cum.
> ------------+-----------------------------------
>           1 |        103        7.09        7.09
>           2 |        164       11.29       18.39
>           3 |        222       15.29       33.68
>           4 |        148       10.19       43.87
>           5 |        162       11.16       55.03
>           6 |        265       18.25       73.28
>           7 |        178       12.26       85.54
>           8 |        124        8.54       94.08
>           9 |         86        5.92      100.00
> ------------+-----------------------------------
>       Total |      1,452      100.00
>
> . rename incomeR incomez
>
> . merge resp using "C:\Documents and Settings\anders35\My
> Documents\Thesis_3_29.dta", unique sort
>
> . tab incomez
>
>   RECODE of |
>      income |
>    (income. |
>  last year, |
>  that is in |
>  2004, what |
>    was your |
> total famil |      Freq.     Percent        Cum.
> ------------+-----------------------------------
>           1 |        103        7.09        7.09
>           2 |        164       11.29       18.39
>           3 |        222       15.29       33.68
>           4 |        148       10.19       43.87
>           5 |        162       11.16       55.03
>           6 |        265       18.25       73.28
>           7 |        178       12.26       85.54
>           8 |        124        8.54       94.08
>           9 |         86        5.92      100.00
> ------------+-----------------------------------
>       Total |      1,452      100.00
>
> //Second hotdeck imputation, renaming the hotdecked income variable
> AFTER merging with full dataset
>
> . hotdeck incomeR using IncomeHotD, store by(education sex ageR)
> keep(resp incomeR) DELETING all matrices....
>
> Table of the Missing data patterns
>  * signifies missing and - is not missing
>
> Varlist order: incomeR
>
>     pattern |      Freq.     Percent        Cum.
> ------------+-----------------------------------
>           * |        176       11.72       11.72
>           - |      1,326       88.28      100.00
> ------------+-----------------------------------
>       Total |      1,502      100.00
> WARNING: When the <command> option is not selected then no analysis is
> performed on the imputed datasets
>
> . clear
>
> . use "C:\data\IncomeHotD1.dta", clear
>
> . tab incomeR
>
>   RECODE of |
>      income |
>    (income. |
>  last year, |
>  that is in |
>  2004, what |
>    was your |
> total famil |      Freq.     Percent        Cum.
> ------------+-----------------------------------
>           1 |         98        6.75        6.75
>           2 |        162       11.16       17.91
>           3 |        220       15.15       33.06
>           4 |        153       10.54       43.60
>           5 |        159       10.95       54.55
>           6 |        267       18.39       72.93
>           7 |        178       12.26       85.19
>           8 |        126        8.68       93.87
>           9 |         89        6.13      100.00
> ------------+-----------------------------------
>       Total |      1,452      100.00
>
> . merge resp using "C:\Documents and Settings\anders35\My
> Documents\Thesis_3_29.dta", unique sort
>
> . rename incomeR incomey
>
> . tab incomey
>
>   RECODE of |
>      income |
>    (income. |
>  last year, |
>  that is in |
>  2004, what |
>    was your |
> total famil |      Freq.     Percent        Cum.
> ------------+-----------------------------------
>           1 |         98        6.71        6.71
>           2 |        164       11.23       17.95
>           3 |        222       15.21       33.15
>           4 |        153       10.48       43.63
>           5 |        159       10.89       54.52
>           6 |        270       18.49       73.01
>           7 |        178       12.19       85.21
>           8 |        126        8.63       93.84
>           9 |         90        6.16      100.00
> ------------+-----------------------------------
>       Total |      1,460      100.00
>
> *
> *   For searches and help try:
> *   http://www.stata.com/support/faqs/res/findit.html
> *   http://www.stata.com/support/statalist/faq
> *   http://www.ats.ucla.edu/stat/stata/
>
>
********************************************************************************
> ADVERTENCIA: La  información  contenida  en  esta  transmisión, y  en
> cualquier archivo  adjunto, está  sujeta a reserva legal conforme a la
> normativa aplicable  al  Banco  Central  de  Chile, y  no  puede  ser usada o
> difundida  por personas distintas  de  su o sus destinatarios. Si usted ha
> recibido esta transmisión por error,  por  favor  notifique  inmediatamente
> al remitente respondiendo por este mismo medio y elimínela de su sistema.
> El  Banco Central de Chile no se hará responsable de la exactitud y veracidad
> de la información contenida en este mensaje, así  como  de su  modificación,
> copia, divulgación  o  reenvío,  total  o  parcial.   Su  uso  no  autorizado
> puede ser sancionado de conformidad con las leyes chilenas.
> El  Banco  Central  de  Chile  transmite  sus decisiones a través de
> comunicados oficiales, los  que  pone  a  disposición  del público en su
> página de Internet: www.bcentral.cl
>
>
> DISCLAIMER: The information  contained  in  this  email or any attached file,
> is subject to legal  privilege  pursuant  to the laws and regulations
> applicable to the Central  Bank  of  Chile , and may not be used or
> disseminated by any person other  than  its  intended recipients. If you have
> received this transmission in error, please  notify  the sender immediately
> by reply to this email address and delete it from your system.
> The Central Bank  of  Chile shall not be liable for the accuracy or
> authenticity of the contents of this message, whether amended, copied,
> forwarded or disclosed in  any  form, in  whole  or  in part.  Please note
> that unauthorized use may be penalized  in  conformity  with  the  Chilean
> law.
> The Central  Bank of Chile communicates its decisions by  official releases,
> and
> makes them available to the public in its WebPages: www.bcentral.cl
>
> *
> *   For searches and help try:
> *   http://www.stata.com/support/faqs/res/findit.html
> *   http://www.stata.com/support/statalist/faq
> *   http://www.ats.ucla.edu/stat/stata/
>



*
*   For searches and help try:
*   http://www.stata.com/support/faqs/res/findit.html
*   http://www.stata.com/support/statalist/faq
*   http://www.ats.ucla.edu/stat/stata/



© Copyright 1996–2014 StataCorp LP   |   Terms of use   |   Privacy   |   Contact us   |   What's new   |   Site index