Stata The Stata listserver
[Date Prev][Date Next][Thread Prev][Thread Next][Date index][Thread index]

st: Re: collinear categorical variable identification

From   Buzz Burhans <>
Subject   st: Re: collinear categorical variable identification
Date   Fri, 13 Jun 2003 17:23:46 -0400

Thank's Scott. your list of considerations is helpful and food for thought, especially, in my case, 2b, which would perhaps allow the non redundant information in somewhat redundant variables to be captured...

Thanks for the ideas.

Buzz Burhans

While not specifically addressing collinear categorical variables, Peter
Kennedy 's "Guide to Econometrics" presents two basic options to deal with

1. Do nothing.

2. Incorporate Additional Information.
a. Obtain more data

b. Formalize relationships among regressors and estimate in a simultaneous

c. Specify a relationship among some parameters.  Theory may suggest that
two coefficients should be equal or sum to one, for example.

d. Drop a variable.  However, omitting a relevant variables biases the
remaining coefficients unless they are uncorrelated with the omitted
variable.  As noted by Dreze (1983) "setting a coefficient equal to zero
because it is estimated with poor precision amounts to elevating ignorance
to arrogance."

e. Incorporate estimates form other studies

f. Form a principal component.

g. Shrink the OLS estimates - a ridge or Stein estimator.

Hope this helps,

Dreze, J (1983). "Nonspecialist Teaching of Econometrics: A Personal Comment
and Personalistic Lament"  Econometric Reviews 2, 291-9.

*   For searches and help try:
Buzz Burhans

*   For searches and help try:

© Copyright 1996–2019 StataCorp LLC   |   Terms of use   |   Privacy   |   Contact us   |   What's new   |   Site index