Stata: Data Analysis and Statistical Software

Notice: On April 23, 2014, Statalist moved from an email list to a forum, based at statalist.org.

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

random effects logit model [was: st: From: Emma Gorman ...]

From	Nick Cox <[email protected]>
To	[email protected]
Subject	random effects logit model [was: st: From: Emma Gorman ...]
Date	Wed, 24 Aug 2011 07:27:21 +0100

Please use meaningful titles for your postings and full references.
Both are recommended in the Statalist FAQ.

-xt- is set up to be general enough to apply to all sorts of problems,
including but not only those with longitudinal data. -xt- is in fact
quite often applied to problems without a time variable. It is a
user's decision whether their problem requires panels of a certain
minimum length.

In any case singleton panels with known time can contribute
information about the overall change.

Nick

Emma Gorman

On Wed, Aug 24, 2011 at 3:02 AM,  <[email protected]> wrote:

> I am estimating a random effects logit model, augmented with cluster
> means  to account for correlation between effects and varying
> covariates (a la Mundlak 1978), using xtlogit.
>
> xtlogit WKRT $tmeans if allyears==1, or intpoints(20) re ;
>
> Where tmeans is regular covariates + cluster means.
>
> I have three waves of data (individuals over time) and would ideally like to
> use all longitudinal respondents (those who are in all three waves in this
> analysis). However, a fair few individuals have missing data for some
> covariates, so these observations are dropped from the regression model.
>
> I initially ended up with  *minimum observation per group: 1*
>                                                     avg obs per group: 2.5
>                                                 max obs per group: 3
>
> I found that there were many individuals who were observed in all waves, but
> only had non-missing information for all covariates (and dep variable) in
> one wave. I had assumed such people, who have only one usable wave of
> observation, would be automatically dropped form estimation by Stata as they
> provide no longitudinal info for the model.
>
> So I isolated and removed these people from the estimation command manually,
> to end up with:
>
>
> Random effects u_i ~ Gaussian                   Obs per group: min =2
>                                                              avg = 2.7
>                                                              max = 3
>
>
> My question is essentially: why is it that such cases, which only provide
> cross-sectional information, not dropped automatically (/should / they be
> dropped) ? Or is there an option to only use longitudinal information? How
> is this consistent with theory? It seems strange that the default should be
> to include everyone.
>
> My understanding of random effects models is that they use the most
> efficient combination of between and within variation, the time invariant
> individual effects are integrated out of the likelihood function and are
> assumed to be independent (in the non-linear case).
>
> So we don't want to know about those who don't have longitudinal information
> for estimation. (??)
>
> NB a complication with inclusion of cluster means is that  if there
> are individuals who only have one usable wave of information due to a
> missing dependent variable for the other waves, these guys still have
> valid cluster means for the explanatory variables, so in some sense
> there is still within and between information even with just one
> 'wave' of usable information. So perhaps these guys should not be
> gotten rid of...
>
>

*
*   For searches and help try:
*   http://www.stata.com/help.cgi?search
*   http://www.stata.com/support/statalist/faq
*   http://www.ats.ucla.edu/stat/stata/

Prev by Date: st: Panel cointegration
Next by Date: Re: st: How to insert decimal point?
Previous by thread: st: Panel cointegration
Next by thread: st: post hoc test categorical survey data
Index(es):
- Date
- Thread