[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: st: event history analysis with years clustered in individuals

From	Steven Samuels <[email protected]>
To	[email protected]
Subject	Re: st: event history analysis with years clustered in individuals
Date	Sun, 15 Feb 2009 16:07:09 -0500

--

Hilde-

You might explain to the professor that, with survival data, thenumber of years of observation is itself the (posssibly censored)outcome. Therefore "year" cannot be a level 1 effect in a multilevelmodel;


-Steve

On Feb 15, 2009, at 3:43 PM, Hilde Karlsen wrote:

Ah. Ok, I see I have to do some serious rethinking when it comes tothis essay, then. I guess this to a certain degree explains why Ihave trouble understanding what sigma_u refers to in this specificanalysis. I am wondering if I should forward this e-mailcorrespondance to the professor who held the course in multileveltechniques, because what I've learned from you today are not inline with what we were told at the course when it comes to thismatter. Anyway. Thank you so much for the advice and for answering me.
Regards,
Hilde

Quoting Steven Samuels <[email protected]>:
I agree with Austin. Just to be clear: sigma_u is a parameter thatis meaningless for this problem, No interpretation is possible.
On Feb 15, 2009, at 9:22 AM, Austin Nichols wrote:
Hilde Karlsen <[email protected]>:
If you have to use a mixed model as an exercise, and you have no
compelling reason to choose a particular research question, youshould
ask a different research question where a mixed model is a more
appropriate model, not apply it blindly to data you know is better
suited to a survival model. Why not use the attrition dummy youhavemade as the explanatory variable in a mixed model instead--whatother
variables do you have on the data?
On Sun, Feb 15, 2009 at 8:26 AM, Hilde Karlsen<[email protected]> wrote:
Thank you both for the advice. However, I don't think I can doas yousuggest because I have to use a multilevel approach for thisessay (it is anessay for a multilevel course I followed a while ago). I shouldprobablyhave been more clear on this issue, and on what my problemreally is. What Iam wondering is not which method/command I should use, but how Iam going tointerprete the sigma_u estimate when my level 1 variable isyears and my
level 2 variable is individuals.
As mentioned, I find it more intuitive to grasp the point ofseparatevariance estimates when the levels are schools, classes etc, butfor somereason I have a hard time understanding how I should interpreatethevariance estimate sigma_u when the years are clustered inindividuals. Howshould I interpreate sigma_u when years are clustered inindividuals.
I asked the professor who was leading the course which command Ishould use,and he told me I should use xtmelogit (my advicor told me thesame thing).As he is the one who is going to judge wheter I pass or not onthis essay,
it is probably best to follow his advice.
I agree that it is a survival model, and I have designed my datafor thistype of analysis (i.e. all individuals in the file start outwith 0 on thedependent variable, and when/if they drop out of the nursingoccupation,they receive 1 on the dependent variable. I have no info onwhich date/monthpeople drop out; I only have information on which year they dropout).
Regards,
Hilde


Quoting Steven Samuels <[email protected]>:
Hilde, I agree with Austin's approach. Even if you have onlymonths, notdays, of starting and quitting, use that time unit in asurvival or discretesurvival model. I recommend Stephen Jenkins's -hshaz- (get itfrom SSC);his "model 1" (the "Prentice-Gloeckler model" is the same asthat fit by-cloglog-. His model 2 adds unobserved heterogeneity and so maybe more
realistic (Heckman and Singer, 1984).
I would not be surprised if prediction equations for of earlyand laterquitting differed. If so, time-dependent covariates or separatemodels for
early and later quitting, would be informative.

-Steve
Prentice, R. and Gloeckler L. (1978). Regression analysis ofgroupedsurvival data with application to breast cancer data.Biometrics 34 (1):
57-67.
Heckman, J.J. and Singer, B. (1984). A Method for minimizingthe impact ofdistributional assumptions in econometric models forduration data,
Econometrica,         52 (2): 271-320.
Hilde Karlsen <[email protected]>:
Attrition from nursing sounds like a survival model, probably in
discrete time, using -logit- or -cloglog- with time dummies, not
-xtmelogit- (see
http://www.iser.essex.ac.uk/iser/teaching/module-ec968 for atextbookand self-guided course on discrete time survival models). Ifyou haveT years of data on each individual, all of whom are first-yearnursesin period 1, and some of whom quit nursing in each of thesubsequentyears, with a variable nurse==1 when a nurse (and zerootherwise), an
individual identifier id, a year variable year, and a bunch of
explanatory variables x*, you can just:

tsset id year
bys id (year): g quit=(l.nurse==1 & nurse==0)
by id: replace quit=. if l.quit==1 | (mi(l.quit)&_n>1)
tab year, gen(_t)
drop _t1
logit quit _t* x*

and then work up to more complicated models with heterogeneous
frailty, etc. The main issues are that someone who quitnursing lastyear cannot quit nursing again this year, and people who neverquitnursing might at some future point that you don't observe,which is
why you use survival models...
If you know the day they started work and the day they quit,you might
prefer a continuous-time model (help st).

I've been assuming you had data on people working as nurses, but
rereading your email, maybe you have data on breastfeedingmothers,though I suppose the same considerations apply (though withmultiple
years of data on breastfeeding mothers, there is probably no
censoring).
On Fri, Feb 13, 2009 at 9:19 AM, Hilde Karlsen<[email protected]>
wrote:
Dear statalisters,
This is probably a stupid question, but I've been searchingaround the
nets
and in books and articles, and I've still not grasped theconcept: When
I'm
performing a multilevel analysis of attrition from nursing using
xtmelogit,
and time (year) is the level 1 variable and individuals (id)is the
level 2
variable (i.e. years are clustered within individuals; I have a
person-year
file), how do I formulate the expectation related to thismodel? Why is
it
important to separate between these two levels?

I find it more intuitive to grasp the fact that individuals are
clustered
within schools, and that variables on the school level - aswell asvariables on the individual level - may influence e.g. whichgrades a
student gets.
I understand (at least I hope I understand) the point thatwhen the sameindividuals are followed over a period of time, theindividual's
responses
are probably highly correlated, and that this implies aviolation to
the
assumption about the heteroskedastic error-terms. As I seeit, I could
have
used the cluster() - command (cluster(id))to 'avoid' thisviolation;however, I have to write an essay using multilevel analysis,so this is
not
an option.
I don't know if I'm being clear enough about what my problemis, but anyinformation regarding this topic (how to grasp the concept ofyears
clustered in individuals) will be greatly appreciated.
I'm really sorry for having to ask you such an infantilequestion.. Mycolleagues and friends are not familiar with multilevelanalyses, so I
don't
know who to turn to.

Best regards,
Hilde
*
*   For searches and help try:
*   http://www.stata.com/help.cgi?search
*   http://www.stata.com/support/statalist/faq
*   http://www.ats.ucla.edu/stat/stata/
*
*   For searches and help try:
*   http://www.stata.com/help.cgi?search
*   http://www.stata.com/support/statalist/faq
*   http://www.ats.ucla.edu/stat/stata/
*
*   For searches and help try:
*   http://www.stata.com/help.cgi?search
*   http://www.stata.com/support/statalist/faq
*   http://www.ats.ucla.edu/stat/stata/


*
*   For searches and help try:
*   http://www.stata.com/help.cgi?search
*   http://www.stata.com/support/statalist/faq
*   http://www.ats.ucla.edu/stat/stata/

Follow-Ups:
- Re: st: event history analysis with years clustered in individuals
  - From: Hilde Karlsen <[email protected]>

References:
- st: event history analysis with years clustered in individuals
  - From: Hilde Karlsen <[email protected]>
- Re: st: event history analysis with years clustered in individuals
  - From: Austin Nichols <[email protected]>
- Re: st: event history analysis with years clustered in individuals
  - From: Steven Samuels <[email protected]>
- Re: st: event history analysis with years clustered in individuals
  - From: Hilde Karlsen <[email protected]>
- Re: st: event history analysis with years clustered in individuals
  - From: Austin Nichols <[email protected]>
- Re: st: event history analysis with years clustered in individuals
  - From: Steven Samuels <[email protected]>
- Re: st: event history analysis with years clustered in individuals
  - From: Hilde Karlsen <[email protected]>

Prev by Date: Re: st: event history analysis with years clustered in individuals
Next by Date: st: stata question?
Previous by thread: Re: st: event history analysis with years clustered in individuals
Next by thread: Re: st: event history analysis with years clustered in individuals
Index(es):
- Date
- Thread