[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

RE: st: imputing continuous values when respondents select categories, e.g., income category

From	Richard Williams <[email protected]>
To	"[email protected]" <[email protected]>, "[email protected]" <[email protected]>
Subject	RE: st: imputing continuous values when respondents select categories, e.g., income category
Date	Sat, 25 Apr 2009 00:51:57 -0500

At 11:20 PM 4/24/2009, Roy Wada wrote:

ystar(a,b) will still give you censored predictions, which
may not be a good idea as Richard indicated.

Anyone knows if it's okay to use non-censored predictions from
-intreg- as a part of 2SLS and bootstrap the standard error,
assuming we have identifying instruments in the first stage?

Roy

These possibilities are starting to make my head hurt. :) To backup, I've never heard of anyone computing the predicted values fromintreg and then using them as an independent variable in subsequentanalyses. That may just reflect my ignorance, but it seems like at aminimum your standard errors would be too optimistic.

For that matter, there are concerns about using intreg for dependentvariables - if the assumptions of the method are not met (e.g.normality) the estimates may be wrong. And, as the manual pointsout, for something like income, you may want to use the logged valuesof the interval endpoints. See the manual for an example. So, youhave to be careful that your use of intreg is legit in the first place.

Remember, too, that unlike missing data fill in the blank techniques,with intreg you aren't just imputing some values, you are imputingall of them. And, if you are computing this y-hat from x1, x2 andx3, why not just use x1, x2 and x3 in your other models and leave outthe y-hat? Remember, the y-hat will be perfectly correlated with x1,x2 and x3 because it is computed from them.

I'm just improvising here, but this doesn't seem like the way intregshould be used. If its assumptions are met, it can be a very nicealternative to other ordinal methods. But, trying to use theestimated values from it as independent variables seemsproblematic. It is like the problems you have with single imputationof missing data, but even worse since every value is being imputed.

I keep wishing Scott Long or somebody like that would write moreabout intreg, so if we have any experts out there on it feel free to chime in!



-------------------------------------------
Richard Williams, Notre Dame Dept of Sociology
OFFICE: (574)631-6668, (574)631-6463
HOME:   (574)289-5227
EMAIL:  [email protected]
WWW:    http://www.nd.edu/~rwilliam

*
*   For searches and help try:
*   http://www.stata.com/help.cgi?search
*   http://www.stata.com/support/statalist/faq
*   http://www.ats.ucla.edu/stat/stata/

Follow-Ups:
- RE: st: imputing continuous values when respondents select categories, e.g., income category
  - From: Roy Wada <[email protected]>

References:
- st: imputing continuous values when respondents select categories, e.g., income category
  - From: Alan Acock <[email protected]>
- Re: st: imputing continuous values when respondents select categories, e.g., income category
  - From: Richard Williams <[email protected]>
- Re: st: imputing continuous values when respondents select categories, e.g., income category
  - From: Alan Acock <[email protected]>
- RE: st: imputing continuous values when respondents select categories, e.g., income category
  - From: Roy Wada <[email protected]>

Prev by Date: Re: st: Comparing two models
Next by Date: RE: st: imputing continuous values when respondents select categories, e.g., income category
Previous by thread: RE: st: imputing continuous values when respondents select categories, e.g., income category
Next by thread: RE: st: imputing continuous values when respondents select categories, e.g., income category
Index(es):
- Date
- Thread