Notice: On March 31, it was **announced** that Statalist is moving from an email list to a **forum**. The old list will shut down at the end of May, and its replacement, **statalist.org** is already up and running.

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

From |
Steve Samuels <sjsamuels@gmail.com> |

To |
statalist@hsphsun2.harvard.edu |

Subject |
Re: st: diagnostic plots with svy:reg |

Date |
Tue, 24 Apr 2012 20:47:27 -0400 |

You misunderstand the documentation's point: it's not that diagnostics are unimportant for survey regression, but that the standard measures are based on formulas that don't apply to weighted, clustered data. Prof. Richard Valliant and his students have done a lot of work in this area. See 1. Stata Conference presentation: http://econpapers.repec.org/paper/bocdcon09/15.htm 2. http://www.jos.nu/Articles/abstract.asp?article=271099 3. www.pakjs.com/journals/27(4)/27(4)12.pdf 4. ssc.ca/webfm_send/540 In correspondence with a student last year, Professor Valliant wrote: "The standard packages will compute certain things that are still informative, although not exactly right: Cook's D, DFBETAS, DFFITS from wtd least squares (WLS): these are off because the wrong variance estimates are used. But if you have any reallly extreme points, the standard diegnostics should identify them anyway. Leverages from WLS: these are ok from the standard pkgs. Collinearity diagnostics: VIFs from WLS can be too large or small, but if you have extreme collinearity between two x's in a model, the standard VIFs should tell you that. Condition indexes and variance decompositions for collinearity: These are probably pretty close to right from the standard WLS output. These allow you to diagnose which x's" I would supplement these suggestions: 1. Run -mmregress- (SSC), which does not take weights or clusters, but is otherwise excellent at identifying outliers _and_ high leverage observations that would otherwise mask one another. or 2. Run -qreg- with aweights to identify outliers better than the standard leave-one- out standard algorithms. Steve sjsamuels@gmail.com On Apr 24, 2012, at 6:28 PM, Lee Grenon wrote: Hello, in the Stata documentation on regression postestimation, it states that diagnostic plots such as rvpplots, rvfplots, dfbetas, crpplots, etc are not appropriate with the svy: prefix. I am interested in understanding why these diagnostic plots are not appropriate when using the design-based regression procedure. Can anyone explain this to me/ Does anyone have a suggestion for producing appropriate diagnostic plots for design-based regression. I am working with population survey data which has bootstrap replicate weights provided. Have a good day * * For searches and help try: * http://www.stata.com/help.cgi?search * http://www.stata.com/support/statalist/faq * http://www.ats.ucla.edu/stat/stata/ * * For searches and help try: * http://www.stata.com/help.cgi?search * http://www.stata.com/support/statalist/faq * http://www.ats.ucla.edu/stat/stata/

**References**:**st: diagnostic plots with svy:reg***From:*Lee Grenon <lgrenon@sfu.ca>

- Prev by Date:
**st: how to use command to obtain the coefficient in a regression?** - Next by Date:
**st: Re: how to use command to obtain the coefficient in a regression?** - Previous by thread:
**st: diagnostic plots with svy:reg** - Next by thread:
**st: how to use command to obtain the coefficient in a regression?** - Index(es):