Bookmark and Share

Notice: On April 23, 2014, Statalist moved from an email list to a forum, based at

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

st: RE: Truncated sample or Heckman selection‏

From   "Millimet, Daniel" <>
To   "" <>
Subject   st: RE: Truncated sample or Heckman selection‏
Date   Thu, 4 Oct 2012 18:50:09 +0000

1. A fractional logit model is more appropriate when modeling percentages.
2. The data set up is in between the usual Heckman vs. truncated model setup.  With the typical Heckman approach, X's are observed for all observations and there is no information on the missing outcome.  With a truncated setup, we observe no X's, but have information at least on the range of values for the outcome.  Here, you do know something about the value of Y for the censored observations as in the truncated setup, but you only observe a subset of the X's.  To me, it sounds you could perhaps "invent" a new model that is a zero-inflated fractional logit model, since you have 1 set of regressors that impact perhaps the probability of no innovation, and then a second set of regressors that impacts the amount of innovation conditional on this being positive.

Anyway, perhaps not the best answer.

Daniel L. Millimet
Research Fellow, IZA
Professor, Department of Economics
Box 0496
Dallas, TX 75275-0496
phone: 214.768.3269
fax: 214.768.1821

-----Original Message-----
From: [] On Behalf Of Ebru Ozturk
Sent: Thursday, October 04, 2012 11:28 AM
Subject: st: Truncated sample or Heckman selection‏

Dear All,
I have a question that I cannot decide whether I should use truncated regression or Heckman sample selection. 
For instance, in the dataset, firms that produce any type of innovation (process or product) give information about other 'x' variables. In other words, firms that do not produce any innovation do not answer other questions as these questions are directly related to firms' innovation activities. So, the 'x' variables that I am interested in have no values only for those firms that do not produce innovation. But, I know the dependent (y) variable in both case, either firms produce innovation or not produce.

I am planning to run tobit regression as the dependent variable is percentage between 0 - 100 and Heckman sample selection model to check selection bias. But, I can not decide whether it is truncated sample or Heckman sample selection.
So, what do you think?
Thank you very much, Ebru. 		 	   		  
*   For searches and help try:

*   For searches and help try:

© Copyright 1996–2018 StataCorp LLC   |   Terms of use   |   Privacy   |   Contact us   |   Site index