Model selection for ARIMA and ARFIMA

Order

<- See Stata 18's new features

Highlights

Model selection for ARIMA and ARFIMA models
AIC, BIC, and HQIC information criteria

Want to find the best ARIMA or ARFIMA model for your data? Compare potential models using AIC, BIC, and HQIC. Use the new arimasoc and arfimasoc commands to select the best number of autoregressive and moving-average terms.

Researchers using autoregressive moving-average (ARMA) models must decide on the proper number of lags to include for the autoregressive and moving-average parameters in their models. Information criteria, which balance model fit against model parsimony, often guide the choice of the maximum number of lags.

arimasoc and arfimasoc assist in model selection by fitting a collection of autoregressive integrated moving average (ARIMA) or autoregressive fractionally integrated moving average (ARFIMA) models and computing information criteria for each model. arimasoc and arfimasoc compute the Akaike information criterion (AIC), the Bayesian information criterion (BIC), and the Hannan–Quinn information criterion (HQIC). The selected model is the one with the lowest value of the information criterion.

Let's see it work

We would like to fit an ARMA model for the output gap. We use arimasoc to fit candidate models with a maximum autoregressive lag of 4 and a maximum moving average lag of 3.

 . webuse usmacro 
(Federal Reserve Economic Data - St. Louis Fed)

 . arimasoc ogap, maxar(4) maxma(3)
Fitting models (20): .................... done

Lag-order selection criteria

Sample: 1954q3 thru 2010q4                        Number of obs = 226


                                                           
       Model           LL         df        AIC        BIC       HQIC


   ARMA(0,0)   -549.4036          2   1102.807   1109.648   1105.568
   ARMA(0,1)   -435.0753          3   876.1507   886.4123   880.2919
   ARMA(0,2)    -361.249          4   730.4981   744.1802   736.0196
   ARMA(0,3)    -330.844          5   671.6879   688.7906   678.5898
   ARMA(1,0)   -292.3313          3   590.6625   600.9241   594.8037
   ARMA(1,1)   -281.5762          4   571.1524   584.8345   576.6739
   ARMA(1,2)   -275.3697          5   560.7395   577.8422   567.6414
   ARMA(1,3)    -274.029          6    560.058   580.5812   568.3403
   ARMA(2,0)   -276.5956          4   561.1912   574.8733   566.7127
   ARMA(2,1)   -273.9052          5   557.8104   574.9131   564.7123
   ARMA(2,2)   -273.2799          6   558.5599   579.0831   566.8422
   ARMA(2,3)   -273.2587          7   560.5174   584.4611   570.1801
   ARMA(3,0)   -273.2421          5   556.4843    573.587   563.3862
   ARMA(3,1)   -273.1883          6   558.3766   578.8998   566.6589
   ARMA(3,2)   -273.0747          7   560.1494   584.0931   569.8121
   ARMA(3,3)   -272.9944          8   561.9888   589.3531   573.0319
   ARMA(4,0)   -273.2006          6   558.4012   578.9244   566.6835
   ARMA(4,1)   -273.0027          7   560.0055   583.9492   569.6682
   ARMA(4,2)    -273.071          8    562.142   589.5063   573.1851
   ARMA(4,3)   -272.9868          9   563.9735   594.7584    576.397
Selected (max) LL:   ARMA(4,3)
Selected (min) AIC:  ARMA(3,0)
Selected (min) BIC:  ARMA(3,0)
Selected (min) HQIC: ARMA(3,0)

The output table provides information about each model, including the maximized log likelihood, the number of parameters estimated, and the AIC, BIC, and HQIC.

Below the output table, the selected model from each criterion is listed. The log-likelihood is maximized for the model with the most parameters, the ARMA(4,3). The AIC, BIC, and HQIC all select the more parsimonious ARMA(3,0) model for the output gap. We can now fit our selected model

. arima ogap, arima(3,0,0)
  (output omitted)

and proceed to investigate model predictions, forecasts, etc.

Fitting an ARFIMA model instead of an ARIMA model? Instead of typing

. arimasoc y, maxvar(4) maxma(3)

you type

. arfimasoc y, maxvar(4) maxma(3)

Tell me more

Read more about ARIMA and ARFIMA model selection in the Stata Time-Series Reference Manual; see [TS] arimasoc and [TS] arfimasoc.

View all the new features in Stata 18 and, in particular, New in time series.

Made for data science.

Get started today.

Order

Upgrade

We use cookies

We use cookies to ensure that we give you the best experience on our website—to enhance site navigation, to analyze usage, and to assist in our marketing efforts. By continuing to use our site, you consent to the storing of cookies on your device and agree to delivery of content, including web fonts and JavaScript, from third party web services.

Cookie Settings

Last updated: 16 November 2022

StataCorp LLC (StataCorp) strives to provide our users with exceptional products and services. To do so, we must collect personal information from you. This information is necessary to conduct business with our existing and potential customers. We collect and use this information only where we may legally do so. This policy explains what personal information we collect, how we use it, and what rights you have to that information.

Advertising and performance cookies

This website uses cookies to provide you with a better user experience. A cookie is a small piece of data our website stores on a site visitor's hard drive and accesses each time you visit so we can improve your access to our site, better understand how you use our site, and serve you content that may be of interest to you. For instance, we store a cookie when you log in to our shopping cart so that we can maintain your shopping cart should you not complete checkout. These cookies do not directly store your personal information, but they do support the ability to uniquely identify your internet browser and device.

Please note: Clearing your browser cookies at any time will undo preferences saved here. The option selected here will apply only to the device you are currently using.

2024 Stata Conference · 1-2 August · Portland, OR

View the program →

View the program →