Bookmark and Share

Notice: On April 23, 2014, Statalist moved from an email list to a forum, based at statalist.org.


[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: st: Re:using a flag variable for missing values in a regression


From   Nick Cox <[email protected]>
To   [email protected]
Subject   Re: st: Re:using a flag variable for missing values in a regression
Date   Wed, 27 Mar 2013 14:14:24 +0000

I agree with Maarten.

The principle is easy: replacing missing with zero is valid whenever
missing really does mean zero. Otherwise, you're just fooling yourself
that any problem is solved. Stata will take zeros literally (here
meaning, numerically). It will have absolutely no sense that you might
mean something else.

Nick

On Wed, Mar 27, 2013 at 1:41 PM, Maarten Buis <[email protected]> wrote:
> On Wed, Mar 27, 2013 at 2:28 PM, Frank D Lopresti wrote:

>> I am working with a student whose prof said in a memo "Note: when
>> creating new variables, missing values should be coded as some
>> arbitrary numerical value (e.g., 0) so that cases aren’t dropped in
>> the regression . You can also generate a missing flag for each
>> variable, coded 1 for missing and 0 otherwise, and then include the
>> flags in tandem with the variables for unbiased estimates."  Is this a
>> valid method for dealing with missing data  I've missed?
>
> In general, this is not a valid way of dealing with missing values.
> Here is an explanation of what happens when you use this method:
> <http://www.stata.com/statalist/archive/2006-09/msg00117.html>
>
> However, there is an exception where this method can be useful, which
> is explained here:
> <http://www.stata.com/statalist/archive/2011-07/msg00456.html>
>

*
*   For searches and help try:
*   http://www.stata.com/help.cgi?search
*   http://www.stata.com/support/faqs/resources/statalist-faq/
*   http://www.ats.ucla.edu/stat/stata/


© Copyright 1996–2018 StataCorp LLC   |   Terms of use   |   Privacy   |   Contact us   |   Site index