3.3 Simple forecasting methods | Forecasting and Analytics with the Augmented Dynamic Adaptive Model (ADAM) (2024)

\(\newcommand{\mathbbm}[1]{\boldsymbol{\mathbf{#1}}}\)

3.3 Simple forecasting methods

Now that we understand that time series might contain different components and that there are approaches for their decomposition, we can introduce several simple forecasting methods that can be used in practice, at least as benchmarks. Their usage aligns with the idea of forecasting principles discussed in Section 1.2. We discuss the most popular methods for the following specific types of time series:

  1. Level time series: Naïve (Subsection 3.3.1), Global Average (Subsection 3.3.2), Simple Moving Average (Subsection 3.3.3), Simple Exponential Smoothing (Section 3.4);
  2. Trend time series: Random Walk with drift (Subsection 3.3.4), Global Trend (Subsection 3.3.5);
  3. Seasonal time series: Seasonal Naïve (Subsection 3.3.6).

3.3.1 Naïve

Naïve is one of the simplest forecasting methods. According to it, the one-step-ahead forecast is equal to the most recent actual value:\[\begin{equation} \hat{y}_t = y_{t-1} . \tag{3.6}\end{equation}\]Using this approach might sound naïve indeed, but there are cases where it is very hard to outperform. Consider an example with temperature forecasting. If we want to know what the temperature outside will be in five minutes, then Naïve would be typically very accurate: the temperature in five minutes will be the same as it is right now.

The statistical model underlying Naïve is called the “Random Walk” and is written as:\[\begin{equation} y_t = y_{t-1} + \epsilon_t. \tag{3.7}\end{equation}\]The variability of \(\epsilon_t\) will impact the speed of change of the data: the higher it is, the more rapid the values will change. Random Walk and Naïve can be represented in Figure 3.8. In the example of R code below, we use the Simple Moving Average (discussed later in Subsection 3.3.3) of order 1 to generate the data from Random Walk and then produce forecasts using Naïve.

sim.sma(1, 120) |> sma(order=1, h=10, holdout=TRUE) |> plot(which=7, main="")

3.3 Simple forecasting methods | Forecasting and Analytics with the Augmented Dynamic Adaptive Model (ADAM) (1)

Figure 3.8: A Random Walk example.

As shown from the plot in Figure 3.8, Naïve lags behind the actual time series by one observation because of how it is constructed via equation (3.6). The point forecast corresponds to the straight line parallel to the x-axis. Given that the data was generated from Random Walk, the point forecast shown in Figure 3.8 is the best possible forecast for the time series, even though it exhibits rapid changes in the level.

Note that if the time series exhibits level shifts or other types of unexpected changes in dynamics, Naïve will update rapidly and reach the new level instantaneously. However, because it only has a memory of one (last) observation, it will not filter out the noise in the data but rather copy it into the future. So, it has limited usefulness in demand forecasting (although it has applications in financial analysis). However, being the simplest possible forecasting method, it is considered one of the basic forecasting benchmarks. If your model cannot beat it, it is not worth using.

3.3.2 Global Mean

While Naïve considers only one observation (the most recent one), the Global Mean (aka “global average”) relies on all the observations in the data:\[\begin{equation} \hat{y}_t = \bar{y} = \frac{1}{T} \sum_{t=1}^T y_{t} , \tag{3.8}\end{equation}\]where \(T\) is the sample size. The model underlying this forecasting method is called “global level” and is written as:\[\begin{equation} y_t = \mu + \epsilon_t, \tag{3.9}\end{equation}\]so that the \(\bar{y}\) is an estimate of the fixed expectation \(\mu\). Graphically, this is represented with a straight line going through the series as shown in Figure 3.9.

rnorm(120, 100, 10) |> es("ANN", persistence=0, h=10, holdout=TRUE) |> plot(which=7, main="")

3.3 Simple forecasting methods | Forecasting and Analytics with the Augmented Dynamic Adaptive Model (ADAM) (2)

Figure 3.9: A global level example.

The series shown in Figure 3.9 is generated from the global level model, and the point forecast corresponds to the forecast from the Global Mean method. Note that the method assumes that the weights of the in-sample observations are equal, i.e.the first observation has precisely the exact weight of \(\frac{1}{T}\) as the last one (being as important as the last one). Suppose the series exhibits some changes in level over time. In that case, the Global Mean will not be suitable because it will produce the averaged out forecast, considering values for parts before and after the change. However, Global Mean works well in demand forecasting context and is a decent benchmark on intermittent data (discussed in Chapter 13).

3.3.3 Simple Moving Average

Naïve and Global Mean can be considered as opposite points in the spectrum of possible level time series (although there are series beyond Naïve, see for example ARIMA(0,1,1) with \(\theta_1>0\), discussed in Chapter 8). The series between them exhibits slow changes in level and can be modelled using different forecasting approaches. One of those is the Simple Moving Average (SMA), which uses the mechanism of the mean for a small part of the time series. It relies on the formula:\[\begin{equation} \hat{y}_t = \frac{1}{m}\sum_{j=1}^{m} y_{t-j}, \tag{3.10}\end{equation}\]which implies going through time series with something like a “window” of \(m\) observations and using their average for forecasting. The order \(m\) determines the length of the memory of the method: if it is equal to 1, then (3.10) turns into Naïve, while in the case of \(m=T\) it transforms into Global Mean. The order \(m\) is typically decided by a forecaster, keeping in mind that the lower \(m\) corresponds to the shorter memory method, while the higher one corresponds to the longer one.

Svetunkov and Petropoulos (2018) have shown that SMA has an underlying non-stationary AR(m) model with \(\phi_j=\frac{1}{m}\) for all \(j=1, \dots, m\). While the conventional approach to forecasting from SMA is to produce the straight line, equal to the last fitted value, Svetunkov and Petropoulos (2018) demonstrate that, in general, the point forecast of SMA does not correspond to the straight line. This is because to calculate several steps ahead point forecasts, the actual values in (3.10) are substituted iteratively by the predicted ones on the observations for the holdout.

y <- sim.sma(10,120)par(mfcol=c(2,2), mar=c(2,2,2,1))# SMA(1)sma(y$data, order=1, h=10, holdout=TRUE) |> plot(which=7)# SMA(10)sma(y$data, order=10, h=10, holdout=TRUE) |> plot(which=7)# SMA(20)sma(y$data, order=20, h=10, holdout=TRUE) |> plot(which=7)# SMA(110)sma(y$data, order=110, h=10, holdout=TRUE) |> plot(which=7)

3.3 Simple forecasting methods | Forecasting and Analytics with the Augmented Dynamic Adaptive Model (ADAM) (3)

Figure 3.10: Examples of SMA time series and several SMA models of different orders applied to it.

Figure 3.10 demonstrates the time series generated from SMA(10) and several SMA models applied to it. We can see that the higher orders of SMA lead to smoother fitted lines and calmer point forecasts. On the other hand, the SMA of a very high order, such as SMA(110), does not follow the changes in time series efficiently, ignoring the potential changes in level. Given the difficulty with selecting the order \(m\), Svetunkov and Petropoulos (2018) proposed using information criteria for the order selection of SMA in practice.

Finally, an attentive reader has already spotted that the formula for SMA corresponds to the procedure of CMA of an odd order from equation (3.4). They are similar, but they have a different purpose: CMA is needed to smooth out the series, and the calculated values are inserted in the middle of the average, while SMA is used for forecasting, and the point forecasts are inserted at the last period of the average.

3.3.4 Random Walk with drift

So far, we have discussed the methods used for level time series. But as mentioned in Section 3.1, there are other components in the time series. In the case of the series with a trend, Naïve, Global Mean, and SMA will be inappropriate because they would be missing the essential component. The simplest model that can be used in this case is called “Random Walk with drift”, which is formulated as:\[\begin{equation} y_t = y_{t-1} + a_1 + \epsilon_t, \tag{3.11}\end{equation}\]where \(a_1\) is a constant term, the introduction of which leads to increasing or decreasing trajectories, depending on the value of \(a_1\). The point forecast from this model is calculated as:\[\begin{equation} \hat{y}_{t+h} = y_{t} + a_1 h, \tag{3.12}\end{equation}\]implying that the forecast from the model is a straight line with the slope parameter \(a_1\). Figure 3.11 shows what the data generated from Random Walk with drift and \(a_1=10\) looks like. This model is discussed in Subsection 8.1.5.

sim.ssarima(orders=list(i=1), lags=1, obs=120, constant=10) |> msarima(orders=list(i=1), constant=TRUE, h=10, holdout=TRUE) |> plot(which=7, main="")

3.3 Simple forecasting methods | Forecasting and Analytics with the Augmented Dynamic Adaptive Model (ADAM) (4)

Figure 3.11: Random Walk with drift data and the model applied to that data.

The data in Figure 3.11 demonstrates a positive trend (because \(a_1>0\)) and randomness from one observation to another. The model is helpful as a benchmark and a special case for several other models because it is simple and requires the estimation of only one parameter.

3.3.5 Global Trend

Continuing the discussion of the trend time series, there is another simple model that is sometimes used in forecasting. The Global Trend model is formulated as:\[\begin{equation} y_t = a_0 + a_1 t + \epsilon_t, \tag{3.13}\end{equation}\]where \(a_0\) is the intercept and \(a_1\) is the slope of the trend. The positive value of \(a_1\) implies that the data exhibits growth, while the negative means decline. The size of \(a_1\) characterises the steepness of the slope. The point forecast from this model is:\[\begin{equation} \hat{y}_{t+h} = a_0 + a_1 (t+h), \tag{3.14}\end{equation}\]implying that the forecast from the model is a straight line with the slope parameter \(a_1\). Figure 3.12 shows how the data generated from a Global Trend with \(a_0=10\) and \(a_1=5\) looks like.

xreg <- data.frame(y=rnorm(120, 10+5*c(1:120), 10))alm(y~trend, xreg, subset=c(1:100)) |> forecast(tail(xreg, 20), h=20) |> plot(main="")

3.3 Simple forecasting methods | Forecasting and Analytics with the Augmented Dynamic Adaptive Model (ADAM) (5)

Figure 3.12: Global Trend data and the model applied to it.

The data in Figure 3.12 demonstrates a linear trend with some randomness around it. In this situation, the slope of the trend is fixed and does not change over time in contrast to what we observed in Figure 3.11 for a Random Walk with drift model. In some cases, the Global Trend model is the most suitable for the data. Some of the models discussed later in this monograph have the Global Trend as a special case when some restrictions on parameters are imposed (see, for example, Subsection 4.4.1).

3.3.6 Seasonal Naïve

Finally, in the case of seasonal data, there is a simple forecasting method that can be considered as a good benchmark in many situations. Similar to Naïve, Seasonal Naïve relies only on one observation, but instead of taking the most recent value, it uses the value from the same period a season ago. For example, for producing a forecast for January 1984, we would use January 1983. Mathematically this is written as:\[\begin{equation} \hat{y}_t = y_{t-m} , \tag{3.15}\end{equation}\]where \(m\) is the seasonal frequency. This method has an underlying model, Seasonal Random Walk:\[\begin{equation} y_t = y_{t-m} + \epsilon_t. \tag{3.16}\end{equation}\]Similar to Naïve, the higher variability of the error term \(\epsilon_t\) in (3.16) is, the faster the data exhibits changes. Seasonal Naïve does not require estimation of any parameters and thus is considered one of the popular benchmarks to use with seasonal data. Figure 3.13 demonstrates how the data generated from Seasonal Random Walk looks and how the point forecast from the Seasonal Naïve applied to this data performs.

y <- sim.ssarima(orders=list(i=1), lags=4, obs=120, mean=0, sd=50)msarima(y$data, orders=list(i=1), lags=4, h=10, holdout=TRUE) |> plot(which=7, main="")

3.3 Simple forecasting methods | Forecasting and Analytics with the Augmented Dynamic Adaptive Model (ADAM) (6)

Figure 3.13: Seasonal Random Walk and Seasonal Naïve.

Similarly to the previous methods, if other approaches cannot outperform Seasonal Naïve, it is not worth spending time on those approaches.

References

• Svetunkov, I., Petropoulos, F., 2018. Old Dog, New Tricks: a Modelling View of Simple Moving Averages. International Journal of Production Research. 56, 6034–6047. https://doi.org/10.1080/00207543.2017.1380326

3.3 Simple forecasting methods | Forecasting and Analytics with the Augmented Dynamic Adaptive Model (ADAM) (2024)

FAQs

What is the simplest forecasting model? ›

For naïve forecasts, we simply set all forecasts to be the value of the last observation. That is, ^yT+h|T=yT. y ^ T + h | T = y T . This method works remarkably well for many economic and financial time series.

What are the three 3 forecasting approach under the time series model? ›

Time series models used for forecasting include decomposition models, exponential smoothing models and ARIMA models.

What is the Adam model of regression? ›

ADAM is Augmented Dynamic Adaptive Model. It is a model that underlies ETS, ARIMA and regression, connecting them in a unified framework. The underlying model for ADAM is a Single Source of Error state space model, which is explained in detail separately in an online textbook.

What is the SMA model for forecasting? ›

To get the simple moving average (SMA) you would divide the total sales from January – March by the number of periods, which in this case would be 3 (3 months), giving you a simple average number of sales per month. This number can be used to forecast the sales of the upcoming months or period.

What is the simplest forecasting method? ›

Naïve is one of the simplest forecasting methods. According to it, the one-step-ahead forecast is equal to the most recent actual value: ^yt=yt−1.

What are the 3 major approaches for forecasting? ›

There are three basic types—qualitative techniques, time series analysis and projection, and causal models. The first uses qualitative data (expert opinion, for example) and information about special events of the kind already mentioned, and may or may not take the past into consideration.

What are the four types of forecasting models? ›

Time Series Model: good for analyzing historical data to predict future trends. Econometric Model: uses economic indicators and relationships to forecast outcomes. Judgmental Forecasting Model: leverages human intuition and expertise. The Delphi Method: forms a consensus based on expert opinions.

Which forecast model is most accurate? ›

Numerical Weather Prediction (NWP) modeling is the most widely used and accurate method for weather forecasting. NWP involves solving a set of mathematical equations that represent the fundamental laws of physics governing the atmosphere.

What is the Adam model? ›

Adaptive Moment Estimation is an algorithm for optimization technique for gradient descent. The method is really efficient when working with large problem involving a lot of data or parameters. It requires less memory and is efficient.

What is the Adams theory in a nutshell? ›

Conceived by John Stacey Adams in the 1960s, Adam's Equity Theory (see the research paper at the link) proposes that individuals assess their motivation at work by comparing their inputs (efforts) and outputs (rewards) with those of their peers. It underscores the innate human desire for fairness in the workplace.

What is the Adam theory? ›

Adams' Equity Theory calls for a fair balance to be struck between an employee's "inputs" (hard work, skill level, acceptance, enthusiasm, and so on) and their "outputs" (salary, benefits, intangibles such as recognition, and more).

What is Arima forecast modeling? ›

Autoregressive integrated moving average (ARIMA) models predict future values based on past values. ARIMA makes use of lagged moving averages to smooth time series data. They are widely used in technical analysis to forecast future security prices.

What is basic forecasting model? ›

What is a forecasting model ? A forecasting model is a statistical tool designed to predict future trends and outcomes based on historical data. It involves analyzing past patterns and trends to make informed predictions about future events, sales, demand, or inventory levels.

What is SMA analysis? ›

The simple moving average is a popular tool that can benefit both short-term traders and long-term investors. The SMA smooths out price data by averaging a security's price over a certain length of time. It is drawn as a single line on a chart and is helpful in identifying trends.

Which is most basic forecasting method? ›

The straight-line method is one of the simplest and easy-to-follow forecasting methods. A financial analyst uses historical figures and trends to predict future revenue growth.

Which is the simplest flow model used for forecasting? ›

Flow models are very frequently associated with forecasting personnel needs. The simplest one is called the Markov model.

What is the simple average method for forecasting? ›

In simple averages, the next period's forecast is the average of all previous actual values. In this case, the underlying assumption is that all history has a bearing on the most recent events.

What is the simple average forecasting technique? ›

Average method

Here, the forecasts of all future values are equal to the average (or “mean”) of the historical data. If we let the historical data be denoted by y1,…,yT y 1 , … , y T , then we can write the forecasts as ^yT+h|T=¯y=(y1+⋯+yT)/T.

References

Top Articles
Latest Posts
Article information

Author: Dong Thiel

Last Updated:

Views: 6503

Rating: 4.9 / 5 (79 voted)

Reviews: 94% of readers found this page helpful

Author information

Name: Dong Thiel

Birthday: 2001-07-14

Address: 2865 Kasha Unions, West Corrinne, AK 05708-1071

Phone: +3512198379449

Job: Design Planner

Hobby: Graffiti, Foreign language learning, Gambling, Metalworking, Rowing, Sculling, Sewing

Introduction: My name is Dong Thiel, I am a brainy, happy, tasty, lively, splendid, talented, cooperative person who loves writing and wants to share my knowledge and understanding with you.