Time Series Analysis and Prediction of Electricity Consumption of Health Care Institution Using ARIMA Model

Kaur, Harveen; Ahuja, Sachin

doi:10.1007/978-981-10-3325-4_35

Harveen Kaur²¹ &
Sachin Ahuja²²

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 547))

1111 Accesses
18 Citations

Abstract

The purpose of this research is to find a best fitting model to predict the electricity consumption in a health care institution and to find the most suitable forecasting period in terms of monthly, bimonthly, or quarterly time series. The time series data used in this study has been collected from a health care institution Apollo Hospital, Ludhiana for the time period of April 2005 to February 2016. The analysis of the time series data and prediction of electricity consumption have been performed using ARIMA (Autoregressive Integrated Moving Average) model. The most suitable candidate model for the three time series is selected by considering the lowest value of two relative quality measures i.e. AIC (Akaike Information Criterion) and SBC (Schwarz Bayesian Criterion). The appropriate forecasting period is selected by considering the lowest value of RMSE (Root Mean Square Error) and MPE (Mean Percentage Error). After building the final model a two-year prediction of electricity consumption of the health care institution is performed.

Access provided by CONRICYT-eBooks. Download conference paper PDF

Electricity Consumption Forecasting Using Time Series Analysis

Predictive Analysis of Energy Use Based on Some Forecasting Models

Predicting the Residential Energy Consumption in Morocco Based on Time Series Forecasting Models

Keywords

1 Introduction

It is critical for electricity traders [1] to predict electricity consumption for balancing the electricity purchase and sales portfolios. Electricity consumption is a significant commodity to promote society’s economic development and raise people’s standard of living [2]. Difficulties associated with electricity storage necessitate accurate prediction of its consumption. This further emphasizes the use of apt and correct approaches for electricity consumption prediction. Amongst the methods for performing the prediction, ARIMA is a statistical method for analyzing and building the forecasting model which best represents a time series by modeling the correlations in the Data.

1.1 Autocorrelation Functions

Autocorrelation: Autocorrelation, also called a lagged correlation or serial correlation is the way in which the observations in time series are related to each other and is calculated as the simple correlation between current observation y_t and the observation from p periods before the current one i.e. y_t−p for a given interval.
Partial Autocorrelation: Partial autocorrelation is the correlation calculated after removing the linear relationship between the two consecutive observations. Basically, it is the correlation between y_t and y_t−p when the effect of y at other time lags 1, 2, 3, …, p−1 is removed.
Both autocorrelations and partial autocorrelations are computed for lags in sequence for the given time series. The autocorrelation at lag 1 is between Y_t−1 and Y_t, the autocorrelation and partial autocorrelation at lag 2 is between Y_t−2 and Y_t and so on for n lags.
Autocorrelation function (ACF) and Partial autocorrelation function (PACF): The identification of the basic model is done by observing the patterns of ACFs and PACFs (Autocorrelations versus lags). ACFs and PACFs act as a tool for finding the values of the orders of autoregressive and moving average components i.e. p and q of ARIMA(p, d, q) respectively.

1.2 Autoregressive Integrated Moving Average Model

The notation for ARIMA model is ARIMA(p, d, q) where p, q, d denotes the order of autoregression, the order of moving average and the degree of differencing respectively. The final prediction depends on these parameters. Hence, the prediction equation for ARIMA (p, d, q) process is denoted by:

$$ \Delta {\text{y}}_{\text{t}} =\upbeta_{0} +\upbeta_{ 1}\Delta {\text{y}}_{{{\text{t}} - 1}} +\upvarepsilon_{\text{t}} +\uplambda_{ 1}\upvarepsilon_{{{\text{t}} - 1}} $$

(1)

Here β₀, β₁ and λ₁ are the parameters of the model and ε_t is the white noise error term (the unidentifiable form of data which doesn’t exhibit any patterns).

2 Literature Review

Currently, ARIMA model is avidly used for building a forecasting model for the time series of the electricity consumption. Katara et al. [3] used this method to forecast electricity demand in Tamale, Ghana using data from the Northern Electricity Department Tamale during 1990 to 2013. The model provides a seven-year forecast of the electricity demand in the city. Kandananond [4] implemented three forecasting techniques— autoregressive integrated moving average (ARIMA), multiple linear regression (MLR) and artificial neural network (ANN) to built forecasting model of the electricity demand in Thailand. Yasmeen and Sharif [5] studied, (the least out of sample forecast performance) i.e. the minimum forecast standard deviation value and Mean Absolute Percentage Error (MAPE) value of the four competing time series models. On the basis of these accuracy measures, the most suitable model to predict electricity consumption in Pakistan was built. Erdogdu [6] used co-integration analysis and implemented ARIMA modeling to estimate electricity demand and prediction respectively. Bianco et al. [7] evolved a long-term forecasting time series model by developing different regression models using historical electricity consumption, gross domestic product per capita (GDP per capita), gross domestic product (GDP) and population for the time period 1970 to 2007.

3 Methodology

The objective of this research is to develop a set of programs using SAS University Edition [8, 9] to find the best fitted model out of monthly, bimonthly and quarterly time series data [10,11,12] to predict the Electricity Consumption using ARIMA [13,14,15,16] model by considering the lowest value of Root Mean Square Error and Mean Percentage error for all three models. The selected forecasting model should have the lowest prediction error.

This work is purely based on ARIMA Model. The time series variable being used is the electricity consumption of the health care institution.

3.1 Data Preprocessing

Monthly data over a long period of time from April 2005 to February 2016 has been used for this research. SAS University Edition is used for building the model. Initially, data was not in a proper format and consisted of some missing values and so wasn’t ready for building the forecasting model.

The data is monthly, but we have to find the suitable time period to predict the future values among three time periods i.e. monthly, bimonthly and quarterly time series.

The following pre-processing steps has been performed:

3.1.1 Converting the Data in Proper Format

By default, SAS displays the date values in a numeric format which is difficult to recognize.

Table 1 represents the sample of the raw data displaying the date values in numeric format, when it is imported to SAS. Hence, a proper date format is assigned to the variable. Table 2 represents a sample of the final data ready to be analyzed on the SAS.

Table 1. Sample of raw data displaying the date values in numeric format

Full size table

Table 2. Sample of final data ready to be analyzed

Full size table

3.1.2 Filling the Missing Information

In a particular observation if no value exists for a particular variable, then a missing value is said to occur. Missing data are very common and can have a huge effect on the data analysis. If not handled properly, these can lead to unstable or inaccurate results. There are various methods to fill the missing data with mean, median or previous value. Table 3 represents the sample of the current data set with some missing data. In this paper, previous values have been used to fill the current value by assuming that current value will be similar to the previous value.

Table 3. Sample of data with missing values

Full size table

3.2 Seasonal Adjustment of Data

For the seasonal adjustment of the data, a time series consists of three components: trend-cycle, combined seasonal effects, and irregular component. Here, Fig. 1 shows the seasonally adjusted data, while Figs. 2 and 3 depict the trend-cycle and irregular data in the time series respectively. The main purpose of the seasonal adjustment of the data is to identify the seasonal effects and remove them from the time series data.

The time series is then called seasonally adjusted series and constitutes the trend-cycle component and irregular component.

3.3 Steps for Building the Model

Following are the steps for building the forecasting model using ARIMA:

3.3.1 Stationarity Checking

The Electricity Consumption of the health care institution is modeled variable. The time series is monthly time series from April 2005 to February 2016. The Fig. 4 represents the property of stationarity of the monthly data i.e. stationary or nonstationary times series. After the visual inspection of the graphs displayed in the figure, we can see that the time series is nonstationary and needs to be transformed into stationary time series; which is the first condition for applying ARIMA model. Here dB is the stationary time series. This is achieved by taking the first difference of the time series variable in the data. The blue line represents the nonstationary time series and redline represents the stationary time series. Data is prepared into two more formats i.e. bimonthly and quarterly time series.

The Fig. 5 given below represents the property of stationarity of the bimonthly data while Fig. 6 represents the property of stationarity of the quarterly data.

3.3.2 Model Identification and Model Estimation

By examining the Autocorrelation Function (ACF) plot and Partial Autocorrelation Function (PACF) plot of the stationary time series, the basic ARIMA(p, d, q) model can be identified, but final models can be estimated by conditional least squares estimation method. In this estimation method if there is unit root in any of the AR (autoregressive) or MA (Moving Average) terms i.e. sum of parameter estimates is 1 or close to 1 then the current model is stabilized by adding or removing the number of AR or MA terms so that the unit roots can be removed. By considering the lowest value of AIC and SBC of the candidate models the most suitable model for the three time series can be decided. PACF and ACF plot of stationary monthly time series have been shown in Fig. 7. By examining the PACF plot the value of “p” - the order of autoregressive component – can be estimated. As PACF function plot drops to zero after first two lags, the value of “p” is 2. By examining the ACF plot we can estimate the value of “q” - the order of moving average component. ACF function plot drops to zero after first three lags and hence, the value of “q” is 3. Since, the time series is differenced once, hence the value of “d” is 1. Basic model for the monthly time series of electricity consumption is ARIMA (2, 1, 3).

Similarly, referring to Fig. 8 the values of “p” and “q” can be estimated for differenced bimonthly time series data as 1 and 2 respectively. As illustrated in Fig. 9 “p” and “q” values for differenced quarterly time series data can be ascertained as 2 and 1 respectively.

The basic models are estimated by examining the ACF and PACF plots for all the three models. However, for bringing stability in the models, the existence of the unit roots is checked by conditional least squares estimation method.

If any unit root is found, then the current model is stabilized by removing or adding the number of AR or MA terms. The estimates for the parameters of all three models are calculated using the conditional least squares estimation method.

Tables 4, 5 and 6 display the parameter estimates for monthly, bimonthly and quarterly series respectively. It is worthwhile to note that in SAS, the estimate for moving average is always taken with opposite sign as displayed in table. Hence, after analysis of time series and performing the conditional least squares estimation methods, some possible models for the three series has been found. The possible candidate models for monthly, bimonthly and quarterly time series are depicted below in Tables 7, 8, and 9, respectively:

Table 4. Conditional least squares estimation for monthly time series.

Full size table

Table 5. Conditional least squares estimation for bimonthly time series.

Full size table

Table 6. Conditional least squares estimation for quarterly time series.

Full size table

Table 7. Possible models for monthly series.

Full size table

Table 8. Possible models for bimonthly series.

Full size table

Table 9. Possible models for quarterly series.

Full size table

The values of the relative quality measures AIC and SBC are also mentioned. The best model is selected by considering the lowest value of AIC and SBC for all the time series. Hence the final model selected for monthly series is ARIMA(2, 1, 3)

$$ \Delta y_{t} = \, 1588.1 + 1.72513\Delta y_{t - 1} - \Delta y_{t - 2} + \varepsilon_{t} - 1.99693\varepsilon_{t - 1} + 1.42805\varepsilon_{t - 2} - 0.32658\varepsilon_{t - 3} $$

(2)

The final model selected for bimonthly series is ARIMA(2, 1, 1)

$$ \Delta y_{t} = \, 3638.5 \, + \, 1.00597\Delta y_{t - 1} - 0.95325\Delta y_{t - 2} + \varepsilon_{t} - 0.68181\varepsilon_{t - 1} $$

(3)

The final model selected for quarterly series is ARIMA(2, 1, 1)

$$ \Delta y_{t} = 14328.6 \, - \, 0.99001\Delta y_{t - 1} - \Delta y_{t - 2} + \varepsilon_{t} + 0.17119\varepsilon_{t - 1} $$

(4)

4 Results and Discussion

The actual electricity consumption and that forecasted by ARIMA method are compared from April 2005 to February 2016. An ARIMA based forecasting of electricity consumption has been performed and applied to the practical power system of the health care institution. ARIMA based method is more reliable and better than the traditional forecasting methods.

The data has been analyzed for all the three series. Table 10 shows the calculated RMSE (Root Mean Square Error) and MPE (Mean Percentage Error) of the best ARIMA models selected for monthly, bimonthly and quarterly time series

Table 10. RMSE and MPE for monthly, bimonthly and quarterly time series.

Full size table

As depicted in Table 10, the most suitable period for forecasting is the monthly time series model with the lowest value of RMSE and MPE. On the basis of these forecasting accuracy measures of RMSE and MPE, the monthly forecasting model will be used to perform the final prediction of the electricity consumption of the health care institution.

Figure 10 shows the comparison of the actual and forecasted observations for the time period of April 2005 to February 2016, while, Fig. 11 illustrates the prediction of electricity consumption for 2 years ahead. Hence, the resultant prediction of electricity consumption of the health care institution has been performed for the time period of March 2016 to February 2018.

5 Conclusion

The ARIMA based forecasting of electricity consumption has been performed and applied to the practical power system of the health care institution. From the results it can be concluded that this methodology is very efficient, and is more accurate and reliable than any other single forecasting methods which bring out imprecise predictions. Electricity being a crucial requirement of society needs to be predicted in advance as it is difficult to store. This forecasting will benefit the health care institution in many ways. It will help the management to anticipate the increase in capacity of the power system to cater to the futuristic needs, in addition to allocating the budget for planning and updating of their power system.

References

Sauhats, A., Varfolomejeva, R., Lmkevics, O., Petrecenko, R., Kunickis, M., Balodis, M.: Analysis and prediction of electricity consumption using smart meter data. In: IEEE 5th International Conference on Power Engineering, Energy and Electrical Drives (POWERENG) (2015)
Google Scholar
Ahuja, D., Tatsutani, M., Schaffer, D.: Sustainable energy for developing countries. S.A.P.I. EN. S. (2.1) (2009)
Google Scholar
Katara, S., Faisal, A., Engmann, G.: A time series analysis of electricity demand in Tamale, Ghana. Int. J. Stat. Appl. 4(6), 269–275 (2014). Scientific & Academic Publishing
Google Scholar
Kandananond, K.: Forecasting electricity demand in Thailand with an artificial neural network approach. Energies 4(12), 1246–1257 (2011)
Article Google Scholar
Yasmeen, F., Sharif, M.: Forecasting electricity consumption for Pakistan. Int. J. Emerg. Technol. Adv. Eng. 4(4), 496–503 (2015)
Google Scholar
Erdogdu, E.: Electricity demand analysis using cointegration and ARIMA modeling: a case study of Turkey. Energ. Policy 35(2), 1129–1146 (2007)
Article Google Scholar
Bianco, V., Manca, O., Nardini, S.: Electricity consumption forecasting in Italy using linear regression models. Energy 34(9), 1413–1421 (2009)
Article Google Scholar
Analytics, Business Intelligence and Data Management (n.d.). http://www.sas.com/en_us/software/university-edition.html.
Discover SAS® Customer Resources (n.d.). https://support.sas.com
Ghosh, S.: Univariate time-series forecasting of monthly peak demand of electricity in Northern India. Int. J. Indian Cult. Bus. Manag. 1(4), 466 (2008)
Article Google Scholar
Janacek, G.: Time series analysis forecasting and control. J. Time Ser. Anal. 31(4), 303 (2010)
Google Scholar
Dordonnat, V., Pichavant, A., Pierrot, A.: GEFCom2014 probabilistic electric load forecasting using time series and semi-parametric regression models. Int. J. Forecast. 2(3), 1005–1011 (2016)
Article Google Scholar
Camara, A., Feixing, W., Xiuqin, L.: Energy consumption forecasting using seasonal ARIMA with artificial neural networks models. Int. J. Bus. Manag. 11(5), 231 (2016)
Article Google Scholar
Razak, A.F., Shitan, M., Hashim, A.H., Abidin, I.: Load forecasting using time series models. Jurnal Kejuruteraan 21(1), 53–62 (2009)
Article Google Scholar
Nie, H., Liu, G., Liu, X., Wang, Y.: Hybrid of ARIMA and SVMs for short-term load forecasting. Energ. Procedia 16, 1455–1460 (2012)
Article Google Scholar
Lee, C.-M., Ko, C.-N.: Short-term load forecasting using lifting scheme and ARIMA models. Expert Syst. Appl. 38(5), 5902–5911 (2011)
Article Google Scholar

Download references

Author information

Authors and Affiliations

School of Computer Sciences, Chitkara University, Rajpura, India
Harveen Kaur
Chitkara University Research and Innovation Network, Rajpura, India
Sachin Ahuja

Authors

Harveen Kaur
View author publications
You can also search for this author in PubMed Google Scholar
Sachin Ahuja
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Harveen Kaur .

Editor information

Editors and Affiliations

Department of Mathematics, Indian Institute of Technology Roorkee, Roorkee, India
Kusum Deep
Department of Mathematics, South Asian University, New Delhi, India
Jagdish Chand Bansal
Department of Mathematics, National Institute of Technology, Silchar, Silchar, Assam, India
Kedar Nath Das
School of Mathematics, Thapar Institute of Engineering and Technology University, Patiala, Punjab, India
Arvind Kumar Lal
School of Mathematics, Thapar Institute of Engineering and Technology University, Patiala, Punjab, India
Harish Garg
Department of Mathematics and Computer Science, Liverpool Hope University, Liverpool, United Kingdom
Atulya K. Nagar
Department of Paper Technology, Indian Institute of Technology Roorkee, Roorkee, India
Millie Pant

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Kaur, H., Ahuja, S. (2017). Time Series Analysis and Prediction of Electricity Consumption of Health Care Institution Using ARIMA Model. In: Deep, K., et al. Proceedings of Sixth International Conference on Soft Computing for Problem Solving. Advances in Intelligent Systems and Computing, vol 547. Springer, Singapore. https://doi.org/10.1007/978-981-10-3325-4_35

Download citation

DOI: https://doi.org/10.1007/978-981-10-3325-4_35
Published: 13 April 2017
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-3324-7
Online ISBN: 978-981-10-3325-4
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics

Time Series Analysis and Prediction of Electricity Consumption of Health Care Institution Using ARIMA Model

Abstract

Similar content being viewed by others

Electricity Consumption Forecasting Using Time Series Analysis

Predictive Analysis of Energy Use Based on Some Forecasting Models

Predicting the Residential Energy Consumption in Morocco Based on Time Series Forecasting Models

Keywords