Stata: Data Analysis and Statistical Software

Notice: On April 23, 2014, Statalist moved from an email list to a forum, based at statalist.org.

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: st: Factor variable notation vs. hand made dummy vars

From	Richard Williams <[email protected]>
To	[email protected], [email protected]
Subject	Re: st: Factor variable notation vs. hand made dummy vars
Date	Mon, 06 Feb 2012 10:53:55 -0500

At 10:41 AM 2/6/2012, Brendan Halpin wrote:

To put the "why" back one step, the immediate reason is evident from the
output

| . logit for mpg d2-d5
|
| note: d2 != 0 predicts failure perfectly
|       d2 dropped and 8 obs not used
|
| [...]
|
| . logit for mpg ib1.rep78
|
| note: 1.rep78 != 0 predicts failure perfectly
|       1.rep78 dropped and 2 obs not used
|
| note: 2.rep78 != 0 predicts failure perfectly
|       2.rep78 dropped and 8 obs not used
|
| note: 5.rep78 omitted because of collinearity
|
| [...]

You end up fitting different models on different data.

The question is now why do the formulations behave differently, and
which is the better default?

Brendan

I would use factor variable notation. -logit- is doing some errorchecking and some errors are creeping through with the firstnotation. Note that the seemingly identical


glm for mpg d2-d5, link(logit) family(binomial)
glm for mpg ib1.rep78, link(logit) family(binomial)

doesn't produce any error message, nor does it drop any cases, butthe standard errors are monstrous. glm is not doing the same checkslogit is. But then again, you probably shouldn't be using rep78 thisway in the first place, as category Ns are way too small. If you mustuse a var like this you probably want to combine some categories.



-------------------------------------------
Richard Williams, Notre Dame Dept of Sociology
OFFICE: (574)631-6668, (574)631-6463
HOME:   (574)289-5227
EMAIL:  [email protected]
WWW:    http://www.nd.edu/~rwilliam

*
*   For searches and help try:
*   http://www.stata.com/help.cgi?search
*   http://www.stata.com/support/statalist/faq
*   http://www.ats.ucla.edu/stat/stata/

References:
- st: Factor variable notation vs. hand made dummy vars
  - From: Ulrich Kohler <[email protected]>
- Re: st: Factor variable notation vs. hand made dummy vars
  - From: [email protected] (Brendan Halpin)

Prev by Date: RE: st: Concatenating two numerical variables with labels
Next by Date: st: Accessing to a subvector of vector like
Previous by thread: Re: st: Factor variable notation vs. hand made dummy vars
Next by thread: Re: st: Factor variable notation vs. hand made dummy vars
Index(es):
- Date
- Thread