District Heating System Load Prediction Using Machine Learning Method

Jia, Meng; Sun, Chunhua; Cao, Shanshan; Qi, Chengying

doi:10.1007/978-981-13-9524-6_61

Part of the book series: Environmental Science and Engineering (ENVENG)

Included in the following conference series:

The International Symposium on Heating, Ventilation and Air Conditioning

Abstract

Accurate prediction of heating load can help improve the operational efficiency of district heating systems (DHSs). The selection of feature variables is of great significance to prediction performance. Most existing methods use only meteorological data and historical thermal demand data. In this study, a correlation analysis method is employed to identify the predominant variables affecting prediction accuracy. The correlations of supply/return temperature, outdoor temperature, and historical load data with the heating load were examined. The results were used to select a minimal subset of input variables and so avoid redundant inputs. The extreme learning machine (ELM) was used to predict the energy consumption of the next 6, 12, and 24 h. The approach was applied to predict the heating load of a DHS in Changchun, China. Historical heating load data proved to be the most essential prediction input. The results show that the mean absolute percentage error (MAPE) of the ELM model can be as low as 4.1%.

1 Introduction

Accurate prediction of the short-term heat load is a prerequisite for the efficient and stable operation of a district heating system (DHS). Most existing thermal load prediction methods consider only a limited set of influencing factors, such as meteorological and historical parameters, and their prediction accuracy is unstable [1,2,3,4,5,6]. These models usually assume a smooth linear relationship between load and weather variables, whereas the actual relationship is highly nonlinear and complex [1]. A. Kusiak et al. used weather forecast data to predict steam load [2]. Nicolas Perez-Mora et al. used historical heat demand data to predict and manage DHS loads [3]. E. Dotzauer took weather forecasting and social component modeling into account [4]. H. A. Nielsen et al. obtained a regression equation between meteorological parameters (i.e., outdoor temperature, solar radiation, relative humidity, and wind speed) and building heat consumption [5]. O. Yetemen et al. found that the monsoon circulation has some influence on long-term energy consumption prediction [6].

With the continuous development of machine learning theory, nonlinear prediction methods have been successfully applied in the field of load forecasting. Huang et al. [7] developed the extreme learning machine (ELM), a feedforward neural network method with good generalization ability. Sajjadi et al. established a DHS thermal load prediction model using the ELM method and demonstrated its robustness [8].

This paper studies the correlation between heating load and historical heating load, historical secondary supply/return temperature, and outdoor temperature. The selected input variables are used to predict the heat load for the next 6, 12, and 24 h using the ELM method. The proposed method is applied to and analyzed for a DHS in Changchun, China.

2 Data Preprocessing

2.1 Data Outlier Elimination

Measured values containing gross errors are called outliers; they are undesirable and should be removed from the measured data [9]. The PauTa criterion is commonly used to identify gross errors. Its basic idea is that any error beyond three standard deviations is considered a gross error rather than a random error.

When using the PauTa criterion to identify and eliminate outliers, the mean value $\overline{X}$ and the residual errors $V_{i} = X_{i} - \overline{X}$ of the independent measurement series $X_{i}$ ($i = 1, 2, 3, \ldots, n$) are calculated first. The standard deviation $S$ of the measurement series is then calculated. If the residual error $V_{d}$ of a measured value $X_{d}$ satisfies $|V_{d}| > 3S$, then $X_{d}$ is considered an outlier and is rejected.
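The rejection rule above can be sketched in a few lines of NumPy. This is a minimal illustration, not the authors' implementation: the function name `pauta_filter`, the use of the sample standard deviation (`ddof=1`), and the made-up readings are all assumptions.

```python
import numpy as np

def pauta_filter(x):
    """Apply the PauTa (3-sigma) rule; return kept values and an outlier mask."""
    x = np.asarray(x, dtype=float)
    residual = x - x.mean()        # V_i = X_i - mean(X)
    s = x.std(ddof=1)              # sample standard deviation S (assumed form)
    outlier = np.abs(residual) > 3.0 * s
    return x[~outlier], outlier

# Made-up readings: 19 plausible values and one gross error.
readings = np.array([50.0] * 19 + [500.0])
kept, mask = pauta_filter(readings)
print(len(kept), int(mask.sum()))
```

Because the mean and standard deviation are themselves affected by the outliers, the rule is sometimes applied repeatedly until no further values are rejected; the sketch performs a single pass.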

2.2 Correlation Analysis

The selection of the characteristic variables plays a crucial role in the thermal load prediction model. Through correlation analysis, the factors that have a great influence on load can be chosen as the inputs of the prediction model to improve accuracy. In this study, the correlation coefficient method was used to analyze the correlation between two variables. The correlation coefficient r can be calculated by Eq. (1):

$$r = \frac{{\mathop \sum \nolimits_{i = 1}^{n} (X_{i} - \overline{X} )(Y_{i} - \overline{Y} )}}{{\sqrt {\mathop \sum \nolimits_{i = 1}^{n} (X_{i} - \overline{X} )^{2} } }{\sqrt {\mathop \sum \nolimits_{i = 1}^{n} (Y_{i} - \overline{Y} )^{2} } }}$$

(1)

where X and Y represent the two variables. The value of r lies in the interval [−1, 1]. A positive value of r indicates a positive correlation, and a negative value indicates a negative correlation. The greater the absolute value of r, the stronger the correlation.
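As a rough illustration of Eq. (1), the coefficient can be computed term by term as below. The function name `pearson_r` and the sample values are hypothetical and are not taken from the measured data.

```python
import numpy as np

def pearson_r(x, y):
    """Correlation coefficient r of Eq. (1)."""
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    dx = x - x.mean()              # X_i - mean(X)
    dy = y - y.mean()              # Y_i - mean(Y)
    return float(np.sum(dx * dy) / (np.sqrt(np.sum(dx**2)) * np.sqrt(np.sum(dy**2))))

# Hypothetical values: load rises linearly as outdoor temperature falls.
outdoor_temp = [-2.0, -5.0, -8.0, -11.0]
heat_load = [10.0, 13.0, 16.0, 19.0]
r = pearson_r(outdoor_temp, heat_load)
print(round(r, 6))
```

For this perfectly linear, decreasing relationship r is −1 up to rounding; `np.corrcoef(x, y)[0, 1]` gives the same quantity.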

3 Prediction Methods

3.1 Extreme Learning Machine (ELM)

ELM is an artificial neural network model developed as an improvement on single-hidden-layer feedforward networks (SLFNs) [10], as shown in Fig. 1.

For N arbitrary samples $(x_{j}, t_{j})$, in which $x_{j} = [x_{j1}, x_{j2}, \ldots, x_{jn}]^{\text{T}} \in R^{n}$ and $t_{j} = [t_{j1}, t_{j2}, \ldots, t_{jm}]^{\text{T}} \in R^{m}$, the standard SLFN model with $\widetilde{N}$ hidden layer nodes and an activation function g(x) is as follows:

$$\mathop \sum \limits_{i = 1}^{\widetilde{N}} \beta_{i} g_{i} \left( {x_{j} } \right) = \mathop \sum \limits_{i = 1}^{\widetilde{N}} \beta_{i} g_{i} \left( {a_{i} \cdot x_{j} + b_{i} } \right),j = 1, \ldots ,N$$

(2)

where $a_{i} = [a_{i1}, a_{i2}, \ldots, a_{in}]^{\text{T}}$ is the weight vector connecting the input nodes to the ith hidden layer node; $b_{i}$ is the threshold of the ith hidden layer node; $\beta_{i} = [\beta_{i1}, \beta_{i2}, \ldots, \beta_{im}]^{\text{T}}$ is the output weight vector connecting the ith hidden layer node to the output nodes; and $a_{i} \cdot x_{j}$ represents the inner product of $a_{i}$ and $x_{j}$.

The ELM model can approximate the output values $t_{j}$ of the N training samples with zero error, which gives:

$$\mathop \sum \limits_{i = 1}^{\widetilde{N}} \beta_{i} g_{i} \left( {a_{i} \cdot x_{j} + b_{i} } \right) = t_{j} ,j = 1, \ldots ,N$$

(3)

Equation (3) can be written in matrix form as follows:

$$\varvec{H}\varvec{\beta} = \varvec{T}$$

(4)

where H is the hidden layer output matrix of the network; the ith column represents the output vector of the ith hidden layer node associated with the inputs $x_{1}, x_{2}, \ldots, x_{N}$, and the jth row represents the hidden layer output vector associated with the input $x_{j}$. Once the hidden layer parameters are fixed, H is a determined matrix, so training the SLFN is equivalent to finding a least-squares solution of Hβ = T, which is expressed as follows:

$$\widehat{\varvec{\beta}} = \mathop {\arg \min }\limits_{\varvec{\beta}} \left\| {\varvec{H}\left( {a_{1} , \ldots ,a_{\widetilde{N}} ,b_{1} , \ldots ,b_{\widetilde{N}} } \right)\varvec{\beta} - \varvec{T}} \right\|$$

(5)

The solution of Eq. (5) can be expressed as follows:

$$\widehat{\varvec{\beta}} = \varvec{H}^{{\mathbf{ + }}} \varvec{T}$$

(6)

where $\varvec{H}^{ + }$ is the Moore-Penrose generalized inverse of the hidden layer output matrix.
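The training procedure described by Eqs. (2) to (6) can be sketched as follows. This is an illustrative sketch under stated assumptions, not the authors' code: the class name `ELMRegressor`, the sigmoid activation, the uniform random initialization of $a_{i}$ and $b_{i}$ in [−1, 1], and the random demonstration data are all choices made here for the example.

```python
import numpy as np

class ELMRegressor:
    """Minimal single-hidden-layer ELM (illustrative sketch)."""

    def __init__(self, n_hidden=20, seed=0):
        self.n_hidden = n_hidden
        self.rng = np.random.default_rng(seed)

    def _hidden(self, X):
        # H[j, i] = g(a_i . x_j + b_i), with a sigmoid g (assumed activation)
        return 1.0 / (1.0 + np.exp(-(X @ self.A + self.b)))

    def fit(self, X, T):
        X = np.asarray(X, dtype=float)
        T = np.asarray(T, dtype=float)
        # Hidden-layer weights and thresholds are drawn at random and never trained.
        self.A = self.rng.uniform(-1.0, 1.0, size=(X.shape[1], self.n_hidden))
        self.b = self.rng.uniform(-1.0, 1.0, size=self.n_hidden)
        H = self._hidden(X)
        # Eq. (6): least-squares output weights via the Moore-Penrose pseudoinverse
        self.beta = np.linalg.pinv(H) @ T
        return self

    def predict(self, X):
        return self._hidden(np.asarray(X, dtype=float)) @ self.beta

# Random demonstration data: 8 samples, 3 inputs, 1 output.
rng = np.random.default_rng(1)
X_demo = rng.uniform(size=(8, 3))
T_demo = rng.uniform(size=(8, 1))
model = ELMRegressor(n_hidden=30).fit(X_demo, T_demo)
max_residual = float(np.max(np.abs(model.predict(X_demo) - T_demo)))
print(model.beta.shape, max_residual)
```

With more hidden nodes than training samples, as in this demonstration, the training residual is numerically close to zero, which mirrors the zero-error statement behind Eq. (3). In practice fewer hidden nodes than samples are used, and Eq. (6) then gives the least-squares fit.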

3.2 Prediction Model Performance Evaluation Criteria

The mean absolute percentage error (MAPE) and the root-mean-square error (RMSE), which are relative and absolute indicators, respectively, are used to evaluate the performance of the thermal load prediction model. They can be calculated by Eq. (7):

$$\left\{ {\begin{array}{*{20}c} {{\text{MAPE}} = \frac{1}{n}\mathop \sum \limits_{t = 1}^{n} \left| { \frac{{{\text{observed}}_{t} - {\text{predicted}}_{t} }}{{{\text{observed}}_{t} }}} \right| \times 100\% } \\ {{\text{RMSE}} = \sqrt {\frac{1}{n}\mathop \sum \limits_{t = 1}^{n} \left( {{\text{observed}}_{t} - {\text{predicted}}_{t} } \right)^{2} } } \\ \end{array} } \right.$$

(7)

where $\text{observed}_{t}$ is the actual heat load, $\text{predicted}_{t}$ is the predicted heat load, and n is the number of samples.
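The two indicators of Eq. (7) translate directly into code. The sketch below uses hypothetical function names and made-up load values purely to show the arithmetic.

```python
import numpy as np

def mape(observed, predicted):
    """Mean absolute percentage error of Eq. (7), in percent."""
    o = np.asarray(observed, dtype=float)
    p = np.asarray(predicted, dtype=float)
    return float(np.mean(np.abs((o - p) / o)) * 100.0)

def rmse(observed, predicted):
    """Root-mean-square error of Eq. (7), in the units of the load."""
    o = np.asarray(observed, dtype=float)
    p = np.asarray(predicted, dtype=float)
    return float(np.sqrt(np.mean((o - p) ** 2)))

# Made-up values: relative errors of 10%, 5%, and 0%.
observed = [100.0, 200.0, 400.0]
predicted = [110.0, 190.0, 400.0]
print(mape(observed, predicted), rmse(observed, predicted))
```

Here the MAPE is (10% + 5% + 0%)/3 = 5%, and the RMSE is the square root of (100 + 100 + 0)/3, about 8.16. MAPE is undefined when an observed value is zero, which the sketch does not guard against.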

4 Results and Discussion

To verify the feasibility and effectiveness of the proposed prediction algorithm, a field test was conducted at a DHS station in Changchun from October 21 to December 7, 2018. The outdoor temperature t_w, supply temperature t_g, return temperature t_h, and heating load q were recorded every 10 min, giving a total of 6840 data points, as shown in Fig. 2. It can be seen that t_g and t_h are relatively stable, whereas t_w and q fluctuate more severely, which may affect the subsequent prediction accuracy. The measured variables were averaged over 6, 12, and 24 h to study heat load prediction at different timescales.
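The averaging step can be sketched as below, assuming non-overlapping windows and 6840 samples per series; the text does not state whether the windows overlap or how an incomplete final window was handled, so both choices here are assumptions, as is the function name `block_average`.

```python
import numpy as np

SAMPLES_PER_HOUR = 6  # one reading every 10 min

def block_average(series, hours):
    """Average a 10-min series over consecutive, non-overlapping windows."""
    series = np.asarray(series, dtype=float)
    block = hours * SAMPLES_PER_HOUR
    n_blocks = len(series) // block          # drop an incomplete last window
    return series[: n_blocks * block].reshape(n_blocks, block).mean(axis=1)

# Placeholder series with the same length as the measured data.
raw = np.arange(6840, dtype=float)
sizes = {h: len(block_average(raw, h)) for h in (6, 12, 24)}
print(sizes)
```

Under these assumptions a 6840-point series yields 190, 95, and 47 averaged samples for the 6-, 12-, and 24-h timescales, so the longer horizons have far fewer samples available for training.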

4.1 Correlation Analysis

The measured factors were normalized, and their correlation coefficients with heat consumption were then calculated according to Eq. (1). The results are shown in Tables 1, 2, and 3.

Table 1 Correlation coefficient of 6-h averaged heat load and measured data

Table 2 Correlation coefficient of 12-h averaged heat load and measured data

Table 3 Correlation coefficient of 24-h averaged heat load and measured data

As shown in Tables 1, 2, and 3, for prediction periods of 6, 12, and 24 h, the historical heat consumption and the historical secondary return temperature are strongly correlated with the heating load. Their correlation coefficients with the heating load are highest when the prediction period is 12 h.

When the prediction period is 6, 12, and 24 h, the correlation coefficient between heating load and outdoor temperature is −0.485, −0.523, and −0.561, respectively. Although the correlation between outdoor temperature and heating load is comparatively weak, outdoor temperature is the key factor in updating the heating load prediction model. Therefore, historical heating load, secondary return temperature, and outdoor temperature are used as the input variables for heating load prediction with periods of 6, 12, and 24 h.
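One way to assemble the three selected inputs into a supervised data set is sketched below. The text does not specify how many past periods are used or how the inputs are aligned with the target, so the one-period lag, the function name `make_lagged_dataset`, and the sample values are all assumptions.

```python
import numpy as np

def make_lagged_dataset(load, return_temp, outdoor_temp):
    """Pair each period's load with the previous period's three inputs."""
    load = np.asarray(load, dtype=float)
    return_temp = np.asarray(return_temp, dtype=float)
    outdoor_temp = np.asarray(outdoor_temp, dtype=float)
    # Inputs from period k-1 (assumed one-step lag) ...
    X = np.column_stack([load[:-1], return_temp[:-1], outdoor_temp[:-1]])
    # ... are used to predict the load of period k.
    y = load[1:]
    return X, y

# Made-up period-averaged values.
load = [10.0, 12.0, 11.0, 13.0]
return_temp = [40.0, 41.0, 40.5, 42.0]
outdoor_temp = [-3.0, -5.0, -4.0, -6.0]
X, y = make_lagged_dataset(load, return_temp, outdoor_temp)
print(X.shape, y)
```

Each row of `X` holds the previous period's load, return temperature, and outdoor temperature, and the matching entry of `y` is the load to be predicted.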

4.2 Prediction Analysis

The data set is divided into a test set and a training set, with the ratio of the number of test samples to the number of training samples set to 7:3. The ELM method is used to predict the heating load for periods of 6, 12, and 24 h, and the results are shown in Figs. 3, 4, and 5, respectively. For prediction periods of 6, 12, and 24 h, the corresponding MAPE values are 4.1%, 6.8%, and 9.3%, and the corresponding RMSE values are 0.941, 1.459, and 2.063. Comparing the prediction results shows that the heating load prediction model agrees best with the measurements for the 6-h period, followed by the 12-h period, and worst for the 24-h period.

When the prediction period is 6 h, the trend of the predicted load curve is similar to that of the actual load. At sample points 1–20 and 35–40, the predicted values are closer to the true values. The prediction results show that the ELM method is effective for short-term heating load prediction.

As the prediction period lengthens, the accuracy of the heating load prediction decreases gradually. The main reason may be that the data were collected in the early stage of the heating season, when the heating load fluctuates greatly, and that the amount of heating load and other data collected is insufficient.

5 Conclusions

In this paper, an ELM-based heating load prediction method is studied and verified on a heating network in Changchun. From the ELM prediction model, the following conclusions can be drawn:

(1) With input variables selected by studying the influence of different characteristic variables on heat load prediction, the MAPE values of the predicted heating loads for the next 6 and 12 h are 4.1% and 6.8%, respectively. This indicates that the model with the optimized feature set has good prediction performance.
(2) In this study, the accuracy of the heating load prediction for the next 24 h is lower than that for the next 12 and 6 h, and measures to improve it require further research.

References

1. Islam, S.M., et al.: Forecasting monthly electric load and energy for a fast growing utility using an artificial neural network. Electr. Power Syst. Res. 34(1), 1–9 (1995)
2. Kusiak, A., et al.: A data-driven approach for steam load prediction in buildings. Appl. Energy 87(3), 925–933 (2010)
3. Perez-Mora, N., et al.: DHC load management using demand forecast. Energy Procedia 91, 557–566 (2016)
4. Dotzauer, E.: Simple model for prediction of loads in district-heating systems. Appl. Energy 73(3–4), 277–284 (2002)
5. Nielsen, H.A., et al.: Modelling the heat consumption in district heating systems using a grey-box approach. Energy Build. 38(1), 63–71 (2006)
6. Yetemen, O., et al.: Climatic parameters and evaluation of energy consumption of the Afyon geothermal district heating system, Afyon, Turkey. Renew. Energy 34(3), 706–710 (2009)
7. Huang, G.B., et al.: Extreme learning machine: theory and applications. Neurocomputing 70(1–3), 489–501 (2006)
8. Sajjadi, S., et al.: Extreme learning machine for prediction of heat load in district heating systems. Energy Build. 122, 222–227 (2016)
9. Zhang, M., Yuan, H.: PauTa criteria and data outlier elimination. J. Zhengzhou Univ. (Eng. Sci.) 1, 87–91 (1997)
10. Bilhan, O., et al.: The evaluation of the effect of nappe breakers on the discharge capacity of trapezoidal labyrinth weirs by ELM and SVR approaches. Flow Meas. Instrum. 64, 71–82 (2018)

Author information

Authors and Affiliations

School of Energy and Environment, Hebei University of Technology, Tianjin, 300401, China
Meng Jia, Chunhua Sun, Shanshan Cao & Chengying Qi

Corresponding author

Correspondence to Shanshan Cao.

Editor information

Editors and Affiliations

Department of Building Thermal Engineering, School of Architecture, Harbin Institute of Technology, Harbin, Heilongjiang, China
Zhaojun Wang
Department of Building Science, School of Architecture, Tsinghua University, Beijing, China
Yingxin Zhu
Department of Building Thermal Engineering, School of Architecture, Harbin Institute of Technology, Harbin, Heilongjiang, China
Fang Wang
Department of Building Thermal Engineering, School of Architecture, Harbin Institute of Technology, Harbin, Heilongjiang, China
Peng Wang
Department of Building Thermal Engineering, School of Architecture, Harbin Institute of Technology, Harbin, Heilongjiang, China
Chao Shen
Department of Building Thermal Engineering, School of Architecture, Harbin Institute of Technology, Harbin, Heilongjiang, China
Jing Liu

About this paper

Cite this paper

Jia, M., Sun, C., Cao, S., Qi, C. (2020). District Heating System Load Prediction Using Machine Learning Method. In: Wang, Z., Zhu, Y., Wang, F., Wang, P., Shen, C., Liu, J. (eds) Proceedings of the 11th International Symposium on Heating, Ventilation and Air Conditioning (ISHVAC 2019). ISHVAC 2019. Environmental Science and Engineering. Springer, Singapore. https://doi.org/10.1007/978-981-13-9524-6_61

DOI: https://doi.org/10.1007/978-981-13-9524-6_61
Published: 20 March 2020
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-9523-9
Online ISBN: 978-981-13-9524-6
eBook Packages: Earth and Environmental Science (R0)
