Bookmark and Share

Notice: On March 31, it was announced that Statalist is moving from an email list to a forum. The old list will shut down on April 23, and its replacement, is already up and running.

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: st: Re:using a flag variable for missing values in a regression

From   Nick Cox <>
Subject   Re: st: Re:using a flag variable for missing values in a regression
Date   Wed, 27 Mar 2013 14:14:24 +0000

I agree with Maarten.

The principle is easy: replacing missing with zero is valid whenever
missing really does mean zero. Otherwise, you're just fooling yourself
that any problem is solved. Stata will take zeros literally (here
meaning, numerically). It will have absolutely no sense that you might
mean something else.


On Wed, Mar 27, 2013 at 1:41 PM, Maarten Buis <> wrote:
> On Wed, Mar 27, 2013 at 2:28 PM, Frank D Lopresti wrote:

>> I am working with a student whose prof said in a memo "Note: when
>> creating new variables, missing values should be coded as some
>> arbitrary numerical value (e.g., 0) so that cases aren’t dropped in
>> the regression . You can also generate a missing flag for each
>> variable, coded 1 for missing and 0 otherwise, and then include the
>> flags in tandem with the variables for unbiased estimates."  Is this a
>> valid method for dealing with missing data  I've missed?
> In general, this is not a valid way of dealing with missing values.
> Here is an explanation of what happens when you use this method:
> <>
> However, there is an exception where this method can be useful, which
> is explained here:
> <>

*   For searches and help try:

© Copyright 1996–2016 StataCorp LP   |   Terms of use   |   Privacy   |   Contact us   |   Site index