Stata: Data Analysis and Statistical Software

Notice: On April 23, 2014, Statalist moved from an email list to a forum, based at statalist.org.

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: st: Sequential Probit

From	Elin Vimefall <[email protected]>
To	[email protected]
Subject	Re: st: Sequential Probit
Date	Wed, 16 Mar 2011 08:50:34 +0100

Thanks again for the helpful discussion.
Just to make sure that I understand you correctly:

If I want to do the sequential probit and control for the correlatedunobserved heterogenity (for example the ability of the child), then itis the same thing as doing a trivariat probit with sample selection?If so; is there some way to do this in stata? I assume the heckprobcommand is what I would use if I had two steps, but is there some way toextend this into three steps?


//Elin


Maarten buis skrev:

--- On Fri, 4/3/11, Stephen Jenkins wrote:
Just to confirm Maarten's remark that tastes differ:
(i) I am less confident than he is that logit estimates are
easier to interpret than probit estimates. Much of this
depends on whether you are sure that you (and your target
audience) understand what odds ratios are. In my opinion,they are more poorly understood than most quantitative
sociologists hope or assume.
We agree that one needs to be careful to make sure that oddsratios are properly understood when you present them and thatthere are many examples where that has not happened. A trickthat often works for me is to also include the baseline odds in
the table of results, and start the discussion with that. That
is a natural way of "refreshing" the audience's memory of what
an odds is (expected number of "successes" per "failure"). Thenext coefficient you discuss will allow you to extend it bysaying that and odds ratio is a very well chosen name (unusualin statistics) as it literaly is a ratio of odds. After thatyou can move more quickly through your results. (A moredetailed discussion of this trick and how to do it in Stata is
here: <http://www.stata.com/statalist/archive/2011-02/msg00785.html>)
It's not that probit parameter estimates are easier to
understand; rather I suggest working in the probability
metric. That is, look at the implications of the estimates
using marginal effects, average marginal effects or
predicted probabilities more generally (in Stata, think
-margins-).
There is no doubt that the various forms of marginal effects
and predicted probabilties are useful. I have, however, tworeservations. First, if all you are going to do is interpret
a linear approximation of a non-linear model, then why not
cut out the middle man and directly estimate a linearprobabilty model? Second, marginal effects are only easy in
relatively simple models. As soon as you add things like
interaction terms, odds ratios tend be a lot simplerbecause the logit model is linear in the log(odds), e.g.:<http://www.maartenbuis.nl/publications/interactions.html>
(ii) how to treat unobserved heterogeneity is of course
difficult -- it is unobserved!  A multivariate probit
model with sample selection (cf. Cappellari and Jenkins
article in Stata Journal (2006), 6(2), free
download) is one way to proceed. The cost is the
assumption of joint normality (trivariate normal in the
poster's case).
I agree, though given my tastes I would have put theemphasis a bit differently. The link to that article is<http://www.stata-journal.com/article.html?article=st0101>
(iii) This way of modelling the heterogeneity is
conventional, but of course the specification is a> maintainedassumption (as Maarten stresses). On the other hand, the
implicit heterogeneity model that he assumes in his own
sequential logit package is unclear to me from his paper.The model that he implicitly tests against is also a
maintained assumption. (I think it's a single factor model
-- i.e. with the latent errors perfectly correlated and with
a Normal marginal distribution. No doubt Maarten can correct
me.)
There are two options in -seqlogit-:
By default it estimates a regular sequential logit, whichmeans that the error terms across transitions are uncorrelated.
I think of that as literaly modelling the odds given only the
variables in the model. This is a slightly different way ofsaying what was my first strategy in this post:
<http://www.stata.com/statalist/archive/2011-03/msg00231.html>.
Alternatively one can estimate the model while assuming acertain scenario for the unobserved heterogeneity. This assumes
that there is one unobserved variable (which one could think of
as a composite of a set of variables) that influences eachtransition. One sets the distribution at the first transition(normal or a discrete distribution). Due to selection thedistribution at later transitions will deviate from thedistribution at the first transition and selection will resultin a negative correlation between the observed and unobservedvariables (these changes in the distribution and correlationare derived from the model not specified by the user). Finally,the distribution of the unobserved variable at the firsttransitions is standardized to have a standard deviation of 1,and one needs to choose the size of the effect at eachtransition. The logic behind this is that this set-up allowsfor a range of scenarios that can help investigate what thepotential influence of unobserved heterogeneity could be, ina way similar to the robustness check you propose below.
(iv) Whatever, it is now relatively straightforward to
explore in Stata what happens using either approach. That sort
of robustness checking is useful.
I agree.

-- Maarten

--------------------------
Maarten L. Buis
Institut fuer Soziologie
Universitaet Tuebingen
Wilhelmstrasse 36
72074 Tuebingen
Germany

http://www.maartenbuis.nl
--------------------------
*
*   For searches and help try:
*   http://www.stata.com/help.cgi?search
*   http://www.stata.com/support/statalist/faq
*   http://www.ats.ucla.edu/stat/stata/

*
*   For searches and help try:
*   http://www.stata.com/help.cgi?search
*   http://www.stata.com/support/statalist/faq
*   http://www.ats.ucla.edu/stat/stata/

Follow-Ups:
- Re: st: Sequential Probit
  - From: Maarten buis <[email protected]>

References:
- Re: st: Sequential Probit
  - From: Maarten buis <[email protected]>

Prev by Date: st: y-standardisation in logistic multilevel models with gllamm
Next by Date: Re: st: Sequential Probit
Previous by thread: Re: st: Sequential Probit
Next by thread: Re: st: Sequential Probit
Index(es):
- Date
- Thread