Bookmark and Share

Notice: On April 23, 2014, Statalist moved from an email list to a forum, based at

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: st: GLM with spatial correlation among error terms?

From   Austin Nichols <>
Subject   Re: st: GLM with spatial correlation among error terms?
Date   Tue, 10 Jan 2012 14:39:48 -0500

Clyde B Schechter <>:
I don't think you need to worry about the random effects since you are
willing to believe they are uncorrelated with the regressors, and you
can use a cluster-robust SE to deal with the spatial correlation of
errors (cluster by MSA or what have you).  But you have real problems
with unobserved attributes of families that determine where people
live and therefore what neighborhood and school characteristics are
attached to them, and that also directly affect both their health
outcomes and behavior.  Are you familiar with the Moving to
Opportunity experiment?

On Tue, Jan 10, 2012 at 9:47 AM, Clyde B Schechter
<> wrote:
> A colleague and I are trying to develop a research project wherein we will collect information on certain health outcomes in adolescents.  We expect these outcomes to be influenced both by personal attributes, attributes of the school they attend, and attributes of the neighborhood in which they live.  Some of the outcomes are dichotomous, some are counts, and some are "continuous" variables.  We would like to estimate the effects of several of the individual, school, and neighborhood attributes on these outcomes, and we are thinking of a multi-level GLM with individual nested in school X neighborhood.
> But, for a number of reasons, we don't want to define "neighborhood" to be a postal zone or census tract or other administrative area.  Rather, for our purposes, the most salient definition of neighborhood is defined as the geographic area within walking distance of the person's home (we have a working definition of walking distance--I don't think the details are relevant here).  There are a number of attributes of the "neighborhood" that we can measure, but they will not comprehensively explain all "neighborhood" effects, so we want to include a random effect at the neighborhood level.
> The problem is that, except for the relatively infrequent circumstance of two people who live at the same address, the "neighborhood" effect is completely confounded with the individual-level error term, so the model will not be identifiable.  Fair enough, and so the individual-level error term will have to serve for both.   But my real concern is this: people who live near each other will have overlapping neighborhoods, often extensively overlapping.  From this, I infer that these individual level error terms cannot reasonably be considered independent.   Rather, there will be correlation among error terms that is some (decreasing, or at least non-increasing) function of the distance between the corresponding homes.
> Can anyone point me to how one might estimate such a model?  Stata solutions are preferred, of course, but I'm willing to use other platforms to solve the problem.

*   For searches and help try:

© Copyright 1996–2018 StataCorp LLC   |   Terms of use   |   Privacy   |   Contact us   |   Site index