
Notice: On April 23, 2014, Statalist moved from an email list to a forum, based at statalist.org.


Re: st: Re:using a flag variable for missing values in a regression

From	Nick Cox <[email protected]>
To	[email protected]
Subject	Re: st: Re:using a flag variable for missing values in a regression
Date	Wed, 27 Mar 2013 14:14:24 +0000

I agree with Maarten.

The principle is easy: replacing missing with zero is valid whenever
missing really does mean zero. Otherwise, you're just fooling yourself
that any problem is solved. Stata will take zeros literally (here
meaning, numerically). It will have absolutely no sense that you might
mean something else.

Nick
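
A minimal sketch of what "taking zeros literally" looks like in practice, assuming the auto dataset that ships with Stata (it is not part of the original question). The variable name rep78_0 is made up for illustration.

```stata
* Hypothetical illustration; the dataset and new variable name are assumptions.
sysuse auto, clear

* rep78 is missing for 5 of the 74 cars, so regress uses only the
* 69 complete cases (listwise deletion).
regress price mpg rep78

* Copy rep78 and recode missing to 0.
generate rep78_0 = rep78
replace rep78_0 = 0 if missing(rep78_0)

* All 74 cases are used now, but Stata treats 0 as a genuine repair
* record, one unit below the smallest observed value of 1.
regress price mpg rep78_0

* Compare the two versions: the recoded copy has a minimum of 0.
summarize rep78 rep78_0
```

The second regression keeps every case, but only by asserting that the five cars with unknown repair records have a record of 0, which is not a value rep78 can take in these data.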

On Wed, Mar 27, 2013 at 1:41 PM, Maarten Buis <[email protected]> wrote:
> On Wed, Mar 27, 2013 at 2:28 PM, Frank D Lopresti wrote:

>> I am working with a student whose prof said in a memo "Note: when
>> creating new variables, missing values should be coded as some
>> arbitrary numerical value (e.g., 0) so that cases aren’t dropped in
>> the regression. You can also generate a missing flag for each
>> variable, coded 1 for missing and 0 otherwise, and then include the
>> flags in tandem with the variables for unbiased estimates." Is this a
>> valid method for dealing with missing data that I've missed?
>
> In general, this is not a valid way of dealing with missing values.
> Here is an explanation of what happens when you use this method:
> <http://www.stata.com/statalist/archive/2006-09/msg00117.html>
>
> However, there is an exception where this method can be useful, which
> is explained here:
> <http://www.stata.com/statalist/archive/2011-07/msg00456.html>
>
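
For readers who have not seen the method from the memo written out, here is a rough sketch of its mechanics, again assuming the auto dataset that ships with Stata. The names rep78_miss and rep78_fill are hypothetical. This only shows what the recipe does; it is not an endorsement, given Maarten's point that it is not valid in general.

```stata
* Hypothetical illustration of the missing-flag recipe from the memo.
sysuse auto, clear

* Flag: 1 if rep78 is missing, 0 otherwise.
generate byte rep78_miss = missing(rep78)

* Filled copy: the arbitrary value 0 where rep78 is missing.
generate rep78_fill = cond(missing(rep78), 0, rep78)

* Include the filled variable and its flag together.
regress price mpg rep78_fill rep78_miss

* Check that the flag marks exactly the filled cases.
tabulate rep78_fill rep78_miss
```

The flag absorbs the mean difference for the filled cases, so the choice of 0 versus any other fill value changes the flag's coefficient rather than the slope on rep78_fill. That does not by itself make the remaining estimates unbiased.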
