Notice: On March 31, it was **announced** that Statalist is moving from an email list to a **forum**. The old list will shut down on April 23, and its replacement, **statalist.org** is already up and running.

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

From |
Nick Cox <njcoxstata@gmail.com> |

To |
statalist@hsphsun2.harvard.edu |

Subject |
Re: st: Re:using a flag variable for missing values in a regression |

Date |
Wed, 27 Mar 2013 14:14:24 +0000 |

I agree with Maarten. The principle is easy: replacing missing with zero is valid whenever missing really does mean zero. Otherwise, you're just fooling yourself that any problem is solved. Stata will take zeros literally (here meaning, numerically). It will have absolutely no sense that you might mean something else. Nick On Wed, Mar 27, 2013 at 1:41 PM, Maarten Buis <maartenlbuis@gmail.com> wrote: > On Wed, Mar 27, 2013 at 2:28 PM, Frank D Lopresti wrote: >> I am working with a student whose prof said in a memo "Note: when >> creating new variables, missing values should be coded as some >> arbitrary numerical value (e.g., 0) so that cases aren’t dropped in >> the regression . You can also generate a missing flag for each >> variable, coded 1 for missing and 0 otherwise, and then include the >> flags in tandem with the variables for unbiased estimates." Is this a >> valid method for dealing with missing data I've missed? > > In general, this is not a valid way of dealing with missing values. > Here is an explanation of what happens when you use this method: > <http://www.stata.com/statalist/archive/2006-09/msg00117.html> > > However, there is an exception where this method can be useful, which > is explained here: > <http://www.stata.com/statalist/archive/2011-07/msg00456.html> > * * For searches and help try: * http://www.stata.com/help.cgi?search * http://www.stata.com/support/faqs/resources/statalist-faq/ * http://www.ats.ucla.edu/stat/stata/

**References**:**st: Re:using a flag variable for missing values in a regression***From:*Frank D Lopresti <fl2@nyu.edu>

**Re: st: Re:using a flag variable for missing values in a regression***From:*Maarten Buis <maartenlbuis@gmail.com>

- Prev by Date:
**st: How to put max and min values in a loop** - Next by Date:
**Re: st: Using natural logs on RHS of maximum likelihood models** - Previous by thread:
**Re: st: Re:using a flag variable for missing values in a regression** - Next by thread:
**st: sequential subscript processing** - Index(es):