Statalist The Stata Listserver


RE: st: left censoring in discrete-time duration model

From   "Jason Yackee" <>
To   <>
Subject   RE: st: left censoring in discrete-time duration model
Date   Thu, 16 Nov 2006 10:37:37 -0800

I am not Stephen Jenkins and my own practical experience is with
right-censored data, which is obviously much less problematic.

But in your case, and wearing my hat as a non-statistical-specialist
reviewer of your paper, I would be concerned that those companies which
have websites so early in the Internet era, and which you will probably
have to drop from your analysis as others have said, are systematically
different from latecomers to the Age of the Web.  I would very much want
to know if there are statistically significant differences between the
dropped group of companies and the non-dropped group of companies in
terms of your key explanatory variables.  So beyond pointing out that
the results don't really differ, it would be even more convincing if you
could say that dropped companies look a lot like non-dropped companies.
This is probably an obvious point, but you don't mention it, so maybe
you haven't done this sort of poking around.
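
A minimal sketch of the comparison suggested above, written in plain Python so that it is self-contained. The panel, the firm labels, and the covariate `log_employees` are all made up for illustration and are not from the poster's data. It flags firms that already had a website in 1996, drops them in every year, and computes a Welch t statistic for the difference in the 1996 covariate between dropped and retained firms.

```python
from math import sqrt
from statistics import mean, variance

# Hypothetical firm-year panel: (firm, year, has_website, log_employees).
# Every value here is invented purely to illustrate the check.
panel = [
    ("A", 1996, 1, 6.1), ("A", 1997, 1, 6.2),
    ("B", 1996, 1, 5.8), ("B", 1997, 1, 5.9),
    ("C", 1996, 0, 4.0), ("C", 1997, 1, 4.2),
    ("D", 1996, 0, 4.5), ("D", 1997, 0, 4.4),
    ("E", 1996, 0, 3.9), ("E", 1997, 0, 4.1),
]

# Firms observed with a website in 1996: their adoption date is unknown,
# so they are the left-censored group.
censored = {firm for firm, year, web, _ in panel if year == 1996 and web == 1}

# Estimation sample: drop those firms in all years, keep everyone else.
sample = [row for row in panel if row[0] not in censored]


def welch_t(a, b):
    """Welch's t statistic for a difference in means (unequal variances)."""
    se = sqrt(variance(a) / len(a) + variance(b) / len(b))
    return (mean(a) - mean(b)) / se


# Compare the two groups on the 1996 value of the covariate.
x_dropped = [x for firm, year, _, x in panel if year == 1996 and firm in censored]
x_kept = [x for firm, year, _, x in panel if year == 1996 and firm not in censored]
t_stat = welch_t(x_dropped, x_kept)

print(f"dropped mean = {mean(x_dropped):.2f}, "
      f"kept mean = {mean(x_kept):.2f}, Welch t = {t_stat:.2f}")
```

A large t statistic on a key explanatory variable would suggest that the dropped firms are not like the retained ones. In Stata itself, a two-group -ttest- with the by() option on the same flag would presumably do the same job.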

-----Original Message-----
[] On Behalf Of Daniel Simon
Sent: Thursday, November 16, 2006 10:15 AM
Subject: Re: st: left censoring in discrete-time duration model

Austin - thanks for the response.  In fact, I have done what you [suggest].
My point (3) where I said "I drop all observations from 1996 for firms [that]
had websites" is actually what you are proposing, because once a firm
established a website (transitioned), they are excluded from the
sample thereafter.  I did not think about that when I wrote the [message].
My apologies for not being clear. Thanks again for your response, which
reminded me of what I was actually doing.

Moreover, as Austin says, I would also like to hear whether others also [think]
this is a sensible approach. Thanks.


At 12:00 PM 11/16/2006 -0500, Austin Nichols wrote:
>Daniel Simon--
>It seems to me that your estimates can only apply to introductions
>("failures") after 1996, since you cannot distinguish pre-1996 and
>1996 introductions, and you should drop firms (in all years) that had
>websites in 1996, while keeping data on all other firms from 1996,
>though I would be interested to hear from someone who actually runs
>these sorts of models, e.g. Stephen Jenkins.
>On 11/16/06, Daniel Simon <> wrote:
>>Hi - I'm using -hshaz- to estimate a discrete-time hazard model. I [have]
>>some left censoring that I'm not sure how to deal with. I am looking [at]
>>firms establishing websites. I can only observe the introduction of
>>websites from 1996 onwards.  However, I know that some firms [had]
>>websites prior to 1996, but I'm not sure which ones. Currently, I have
>>tried three approaches: (1) Treat all firms that had a website in 1996 [as]
>>if they adopted in 1996 (the first year of the sample period), whether [they]
>>adopted in 1996 or adopted earlier; (2) Exclude 1996 from the sample [(start]
>>the analysis with 1997); (3) Drop all observations from 1996 for firms [that]
>>had websites.
>>All three approaches give me quite similar results, so it does not [seem]
>>that the censoring is a major issue. But, I'm wondering if there is a
>>better way to deal with it. Thanks. Daniel