Stata: Data Analysis and Statistical Software

Notice: On April 23, 2014, Statalist moved from an email list to a forum, based at statalist.org.

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: st: Can Spearman's rho be used to measure of the degree of association between two binary variables ?

From	Richard Williams <[email protected]>
To	[email protected], [email protected]
Subject	Re: st: Can Spearman's rho be used to measure of the degree of association between two binary variables ?
Date	Mon, 21 May 2012 07:55:05 -0500

At 01:56 AM 5/21/2012, Maarten Buis wrote:

On Mon, May 21, 2012 at 12:19 AM, Marcos Vinicius wrote:
> I was conducting a multicollinearity diagnostic analysis for alogistic regression using spearman correlation and VIF. Importantdetail:All the covariates are binary variables.
Multicollinearity is never a problem, see e.g.:
<http://www.stata.com/statalist/archive/2010-07/msg00675.html>, so
there is nothing to diagnose. If you want to inspect the association
between binary covariates I would look at a table of odds ratios.

-- Maarten

I would have to disagree with that a bit. Sometimes multicollinearitymight reflect a mistake on the researchers part. For example, yourmodel includes education, income, and then you decide to include thisSES measure you find at the end of the codebook. If SES was computedusing income and education, you may have extreme or even perfectmulticollinearity.

Or, suppose you have a categorical variable, and you create dummiesout of it. If some categories have extremely small Ns (e.g. 2 cases)you will get near-perfect collinearity. You may have to combinecategories or else drop some cases.

Suppose, too, that you have several items that basically measure thesame concept. You may be better off creating a scale from the itemsor constraining them all to have the same effects.

I don't think I have ever seen it happen with Stata, but there mightbe situations where multic makes it difficult for the model toconverge. If so, doing things like centering a variable before yousquare it might help.

If you happen to be at the design stage of the study and you areworried about multic, you may wish to collect a larger sample aslarger samples will reduce the standard errors.

I do think the problem is exaggerated. But, the researcher should beaware that they may have done something stupid, that there may bebetter ways to set the problem up, and that they may be able to avoidthe problem in the first place when they design their study.

Also, I would discourage simply dropping variables that seem to becausing you problems, as that could lead to specification error,which may be an even worse problem.



-------------------------------------------
Richard Williams, Notre Dame Dept of Sociology
OFFICE: (574)631-6668, (574)631-6463
HOME:   (574)289-5227
EMAIL:  [email protected]
WWW:    http://www.nd.edu/~rwilliam

*
*   For searches and help try:
*   http://www.stata.com/help.cgi?search
*   http://www.stata.com/support/statalist/faq
*   http://www.ats.ucla.edu/stat/stata/

References:
- Re: st: Can Spearman's rho be used to measure of the degree of association between two binary variables ?
  - From: "Richard J. Stoll" <[email protected]>
- Re: st: Can Spearman's rho be used to measure of the degree of association between two binary variables ?
  - From: Marcos Vinicius <[email protected]>
- Re: st: Can Spearman's rho be used to measure of the degree of association between two binary variables ?
  - From: Maarten Buis <[email protected]>

Prev by Date: st: generating event windows from event dates
Next by Date: Re: st: generating event windows from event dates
Previous by thread: Re: st: Can Spearman's rho be used to measure of the degree of association between two binary variables ?
Next by thread: Re: st: Can Spearman's rho be used to measure of the degree of association between two binary variables ?
Index(es):
- Date
- Thread