[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: st: multicollinearity

From	"Michael S. Hanson" <[email protected]>
To	[email protected]
Subject	Re: st: multicollinearity
Date	Thu, 20 Nov 2008 00:11:14 -0500

Chris:


On Nov 19, 2008, at 3:06 PM, Chris Witte wrote:
On Nov 19, 2008, at 5:01 PM, Chris Witte wrote:
On Nov 19, 2008, at 8:11 PM, Chris Witte wrote:

1) The Statalist FAQ strongly suggests not posting the same messagemultiple times. You may find it useful to review the FAQ at <http://www.stata.com/support/faqs/res/statalist.html>.

Is there another way to get the following module (the link isn'tworking for me)?
Example . Stata learning module on regression diagnostics:Multicollinearity. . . . . . . . . . . . . . . . . . UCLA AcademicTechnology Services12/03 http://www.ats.ucla.edu/stat/stata/modules/reg/multico.htm

2) I suspect that page just doesn't exist anymore. Not toosurprising: it is from December 2003 -- almost 5 years ago, whichwas also a few versions of Stata ago. If you poke around the UCLAATS web site, you might find related materials. Also, Google is yourfriend (TM).

Also, I have read that -anova- and -regress- will drop variablesthat have collinearity problems, but I have never had Stata dropvariables on me. For example:
sysuse auto
reg price headroom trunk weight length turn displacement gear_ratio


	[snip]

and the correlation between weight and length is 0.9460. Whyaren't one of these variables dropped? Does there have to beperfect correlation for dropping variables?

3) In a nutshell, yes. Multicollinearity and perfect collinearityare not the same thing. Indeed, they are conceptually ratherdifferent. (Your (sub)discipline may use slightly different termsfor these two concepts.) Kennedy's "A Guide to Econometrics" (forexample, the 5th edition, MIT Press, 2003) dedicates a whole chapterto multicollinearity, and has a decent discussion of thisdistinction. The explanation I have often given to my students isthat multicollinearity is a sample problem -- which in many casescould conceptually be avoided by collecting more or "better" data --whereas perfect collinearity is a model or specification problem --in which no amount of additional data will resolve your specificationerror. Mathematically, with perfect collinearity, the (X'X) matrixis rank deficient and therefore not invertible: the OLS estimatorsimply does not exist in this case. Stata thus drops each collinearvariable until (X'X) is of full rank, and the regression then can beestimated on the remaining variables. Other members of Statalistsuggested to you a few synthetic examples in earlier replies.Multicollinearity inflates variances, thereby complicating inference,but it does not preclude estimation.

In Wooldridge's "Introductory Econometrics" textbook (for example,pp. 102-4 of the 3rd edition, Thomson South-Western, 2006) there is avery informative discussion of multicollinearity, which contains thefollowing useful insight:

"Worrying about high degrees of correlation among the independentvariables in the sample is really no different from worrying about asmall sample size: both work to increase [the variance of beta hat].The famous University of Wisconsin econometrician Arthur Goldberger,reacting to econometricians' obsession with multicollinearity, has(tongue in cheek) coined the term MICRONUMEROSITY, which he definesas the 'problem of small sample size.'"


Best,
Mike

*
*   For searches and help try:
*   http://www.stata.com/help.cgi?search
*   http://www.stata.com/support/statalist/faq
*   http://www.ats.ucla.edu/stat/stata/

Follow-Ups:
- Re: st: multicollinearity
  - From: Chris Witte <[email protected]>

References:
- st: esttab question
  - From: Chris Witte <[email protected]>
- Re: st: esttab question
  - From: Neil Shephard <[email protected]>
- st: multicollinearity
  - From: Chris Witte <[email protected]>

Prev by Date: Re: st: combining many data files
Next by Date: st: Bivariate density contours
Previous by thread: st: Re: multicollinearity
Next by thread: Re: st: multicollinearity
Index(es):
- Date
- Thread