[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

RE: st: Model selection using AIC/BIC and other information criteria

From	Richard Williams <[email protected]>
To	"[email protected]" <[email protected]>, statalist <[email protected]>
Subject	RE: st: Model selection using AIC/BIC and other information criteria
Date	Tue, 23 Jun 2009 22:20:36 -0500

At 08:39 PM 6/23/2009, kokootchke wrote:

Thank you, Richard. This was exactly what I thought... but Iremember from my metrics classes long time ago that both AIC and BICdepend on N (sample size)... and I confirmed this by simply lookingat these wikipedia entries... but, just like you, I also fearedthat, even though both criteria adjust for the sample size, maybeyou can't compare between AICs and BICs when the models usedifferent # of observations...

Here is a simple example that shows the sensitivity of BIC and AIC tosample size:


. sysuse auto, clear
(1978 Automobile Data)

. quietly reg  price mpg trunk weight

. estat ic

-----------------------------------------------------------------------------
       Model |    Obs    ll(null)   ll(model)     df          AIC         BIC
-------------+---------------------------------------------------------------
           . |     74   -695.7129   -682.6073      4     1373.215    1382.431
-----------------------------------------------------------------------------
               Note:  N=Obs used in calculating BIC; see [R] BIC note

. expand 2
(74 observations created)

. quietly reg  price mpg trunk weight

. estat ic

-----------------------------------------------------------------------------
       Model |    Obs    ll(null)   ll(model)     df          AIC         BIC
-------------+---------------------------------------------------------------
           . |    148   -1391.426   -1365.215      4     2738.429    2750.418
-----------------------------------------------------------------------------
               Note:  N=Obs used in calculating BIC; see [R] BIC note

So, even if data are missing at random with your X variable, thesmaller sample sizes that result from its inclusion will drive downthe BIC and AIC stats quite a bit.



-------------------------------------------
Richard Williams, Notre Dame Dept of Sociology
OFFICE: (574)631-6668, (574)631-6463
HOME:   (574)289-5227
EMAIL:  [email protected]
WWW:    http://www.nd.edu/~rwilliam

*
*   For searches and help try:
*   http://www.stata.com/help.cgi?search
*   http://www.stata.com/support/statalist/faq
*   http://www.ats.ucla.edu/stat/stata/

References:
- st: Analyze a subpopulation of survey data in Stata 10.1
  - From: "Karadogan, Figen" <[email protected]>
- st: Model selection using AIC/BIC and other information criteria
  - From: kokootchke <[email protected]>
- Re: st: Model selection using AIC/BIC and other information criteria
  - From: Richard Williams <[email protected]>
- RE: st: Model selection using AIC/BIC and other information criteria
  - From: kokootchke <[email protected]>

Prev by Date: RE: st: Model selection using AIC/BIC and other information criteria
Next by Date: Re: st: Analyze a subpopulation of survey data in Stata 10.1
Previous by thread: RE: st: Model selection using AIC/BIC and other information criteria
Next by thread: Re: st: Analyze a subpopulation of survey data in Stata 10.1
Index(es):
- Date
- Thread