Model Identification and Parameter Estimation¶

Model-Free Techniques¶

Several tools for model identification do not require models. The following are common model free tools

Sample spectral density
Sample ACF
Filtering

Model-Dependent Techniques - Stationary Models¶

The following model identification methods assume the time series is stationary.

Model Identification Flow¶

1. Test $X_t$ for White Noise¶

$$ H_o: X_t \, is \, white \, noise $$$$ H_o: X_t \, is \, not \, white \, noise $$

If $X_t$ is white noise, we have the following expectations

$E [\hat{\rho}_k] = 0$ for $k>0$
$var \left( \hat{\rho}_k \right) \sim 1 / n $
$ cov \left( \hat{\rho}_k, \hat{\rho}_{k + 1} \right) \sim 0$
$ \hat{\rho}_k \sim N(\mu, \sigma) $

Therefore, we reject $H_o$ at a 5% type 1 error rate if $ |\hat{\rho}_k| > 2 \left( 1/ \sqrt{n} \right) $

2. Estimate Orders p, q¶

Generally, the model orders p, q are selected by a maximum likelihood estimate with AIC, AICc, or BIC.

For the stationary case, AIC is calculated as

$$ AIC = ln \left( \hat{\sigma}_a^2 \right) + 2 \left( \frac{p + q + 1}{n} \right) $$

The idea is to explain as much of the variation as reasonably possible.

3. Get Parameter Estimates for the Selected Model Order¶

Once a model order is select, the model parameters can be estimated. Maximum likelihood (ML) estimates are typically used unless ML produces a model with root(s) in the unit circle. Other methods can be used to estimate AR-only models; these are discussed below.

Note: There is an alternative model ID method called the Box-Jenkins method. However, selecting a model by AIC is preferred of the Box-Jenkins method.

AR Model Parameter Estimation¶

There are three methods for estimating AR paramters given the model order.

Yule-Walker
- Verfy fast calculation.
- The models produced will always be stationary.
- Models are low quality when roots are near the unit circle (should not be used if this is suspected).
Burg
- The models produced will always be stationary.
- An improvement on Yule-Walker because it makes use of more of the data.
Maximum Likelihood
- Best mathematical properties.
- May find models with roots inside the unit circle (these models cannot be used).

Model-Dependent Techniques - Non-Stationary Models¶

ARIMA¶

In the case of ARIMA, the $\left( 1 - B^d \right)$ factors will dominate. The expected behavior is slowly damped exponential autocorrelations and wandering.

Box-Jenkins Model Identification Flow¶

1. Stationarize the Data

Take differences of the data until the data appears to be stationary. The number of differences taken to stationarize the data is the order of $d$.

2. Model the Stationarized Data

Model the stationarized data with the stationary modeling process described above.

More General Model Identification Flow¶

The Tiao-Tsay result states that if a high order AR(p) model is fit to a realization from a non-stationary process, the factors associated with the roots on the unit circle will show up in the factor table.

1. Fit a High Order AR Model

Fit a high order AR model to the realization. Get the factor table and find all the roots near unity. These likely represent non-stationary factors associated with process.

2. Remove the Effects of the Non-Stationary Factors

Remove the effects of the non-stationary factors from the realization by differencing. Note that these factors maybe seasonal, ARIMA, or non-conforming (ARUMA).

3. Model the Stationarized Data

Model the stationarized data with the stationary modeling process described above.

Seasonal¶

Use the general approach to assess whether a seasonal model will be useful. Fit a high order AR(p) model then matchup roots in the overfit table that are close to the roots of the seasonal model. If many of the roots match, then a seasonal model may be useful.

Deterministic - Linear Trend¶

The typicall approach of fitting an OLS model to $X_t$ vs time may lead to bad conclusions. The correlated errors lead to an inflated type 1 error rate.

The Cochrane-Orcutt procedure can be used to account for this. This fits an OLS model to the data, then fits an AR(1) to the noise. This is used to adjust the p-value of the slope.

Fit a linear regression to $X_t$, getting $\hat{a}, \,\, \hat{b}$
Calculate $Z_t$ as $Z_t = X_t - \hat{a} - \hat{b} t$
Fit an AR(1) to $Z_t$, getting an estimate for $\hat{\phi}_1$
Calculate $Y_t$ as follows

$$ Y_t = \left( 1 - \hat{\phi}_1 B \right) \left( \hat{a} + \hat{b} t + Z_t\right) = \left( 1 - \hat{\phi}_1 B \right) \hat{a} + \hat{b} \left[ t - \hat{\phi}_1 ( t- 1) \right] + \left( 1 - \hat{\phi}_1 B \right) Z_t $$

Letting

$$ c = \left( 1 - \hat{\phi}_1 B \right) \hat{a} \\ t_{\phi_1} = t - \hat{\phi}_1 ( t- 1) \\ g_t = \left( 1 - \hat{\phi}_1 B \right) $$

Then,

$$ Y_t = c + b t_{\phi_1} + g_t $$

We expect that $g_t$ is probably fairly uncorrelated noise.

Model Assessment¶

The central assessment of model is whether it whitens the residuals. A model may be sufficient if the resulting residuals appear to be white noise. For an AR(p) model, there are $n-p$ conditional residuals. However, backcasting can be used to calculate all $n$ unconditional residuals.

Residual Check 1

Start by checking the residuals visually. Create the typical sample plots of the residuals (realization, autocorrelations, etc). These plots should be consistent with white noise.

Residual Check 2

The Ljung-Box test can be used to test if all the sample autocorrelations are significantly different than 0. The hypotheses are

$$ H_0: \,\,\, \rho_1 = \rho_2 = ... \rho_k = 0 $$$$ H_a: \,\,\, at \, least \, one \, \rho_k \, is \, not \, zero $$

The test is

$$ L = n \left( n + 1 \right) \sum_{k = 1}^{K} \frac{\hat{\rho}^2_k}{n-k} $$

which follows a $\chi^2$ distribution with degrees of freedom equal to $K - p - q$.

Two value of K should be used, 24 adn 48 are suggested.

References¶

[1] W. Woodward and B. Salder, "Model Identification for Stationary Models", SMU, 2019
[2] W. Woodward and B. Salder, "Model Identification for Nonstationary Models", SMU, 2019
[3] W. Woodward and B. Salder, "Model Building", SMU, 2019