Autoregressive fractionally integrated moving average


In statistics, autoregressive fractionally integrated moving average models are time series models that generalize ARIMA models by allowing non-integer values of the differencing parameter. These models are useful in modeling time series with long memory—that is, in which deviations from the long-run mean decay more slowly than an exponential decay. The acronyms "ARFIMA" or "FARIMA" are often used, although it is also conventional to simply extend the "ARIMA" notation for models, by simply allowing the order of differencing, d, to take fractional values.

Basics

In an ARIMA model, the integrated part of the model includes the differencing operator raised to an integer power. For example,
where
so that
In a fractional model, the power is allowed to be fractional, with the meaning of the term identified using the following formal binomial series expansion

ARFIMA(0,''d'',0)

The simplest autoregressive fractionally integrated model, ARFIMA, is, in standard notation,
where this has the interpretation
ARFIMA is similar to fractional Gaussian noise : with d = H−½, their covariances have the same power-law decay. The advantage of fGn over ARFIMA is that many asymptotic relations hold for finite samples. The advantage of ARFIMA over fGn is that it has an especially simple spectral density—
—and it is a particular case of ARFIMA, which is a versatile family of models.

General form: ARFIMA(''p'',''d'',''q'')

An ARFIMA model shares the same form of representation as the ARIMA process, specifically:
In contrast to the ordinary ARIMA process, the "difference parameter", d, is allowed to take non-integer values.

Enhancement to ordinary ARMA models

The enhancement to ordinary ARMA models is as follows:
1. take original data series and high-pass filter it with fractional differencing enough to make the result stationary, and remember the order d of this fractional difference, d usually between 0 and 1... possibly up to 2+ in more extreme cases. Fractional difference of 2 is the 2nd derivative or 2nd difference.
1a. note: applying fractional differencing changes the units of the problem. If we started with Prices then take fractional differences, we no longer are in Price units.
1b. determining the order of differencing to make a time series stationary may be an iterative, exploratory process.
2. compute plain ARMA terms via the usual methods to fit to this stationary temporary data set which is in ersatz units.
3. forecast either to existing data or "ahead" with these ARMA terms.
4. apply the reverse filter operation to the forecasted series, to return the forecast to the original problem units.
4a. Fractional differencing and fractional integration are the same operation with opposite values of d: e.g. the fractional difference of a time series to d = 0.5 can be inverted by applying the same fractional differencing operation but with fraction d = -0.5. See GRETL fracdiff function: http://gretl.sourceforge.net/gretl-help/funcref.html#fracdiff
The point of the pre-filtering is to reduce low frequencies in the data set which can cause non-stationarities in the data, which non-stationarities ARMA models cannot handle well... but only enough so that the reductions can be recovered after the model is built.
Fractional differencing and the inverse operation fractional integration can be thought of as digital filtering and "unfiltering" operations. As such, it is useful to study the frequency response of such filters to know which frequencies are kept and which are attenuated or discarded, viz: https://github.com/diffent/fracdiff/blob/master/freqrespfracdiff.pdf
Note that any filtering that would substitute for fractional differencing and integration in this ARMA model should be similarly invertible as differencing and integration to avoid information loss. E.g. a high pass filter which completely discards many low frequencies may not work so well, because after fitting ARMA terms to the filtered series, the reverse operation to return the ARMA forecast to its original units would not be able re-boost those attenuated low frequencies, since the low frequencies were cut to zero.
Such frequency response studies may suggest other similar families of filters that might be useful replacements for the "FI" part of the ARFIMA modeling flow, such as the well-known, easy to implement, and minimal distortion high-pass Butterworth filter or similar: https://link.springer.com/chapter/10.1007/978-3-319-55789-2_13