1 Introduction

While designing smart buildings, optimal measures should be taken so that energy is used efficiently to safeguard the environment [1]. Studies by researchers all over the world show that the highest percentage of energy is consumed by multi-storey buildings [2,3,4,5]: buildings account for about 40% of the total energy consumed worldwide, followed by industry with about 32% and transport with about 28%. These figures motivate the search for energy optimization solutions in buildings. Further studies show that, within buildings, the heating, ventilation and air conditioning (HVAC) system is one of the major energy consumers [6, 7]. HVAC consumes energy to maintain the desired temperature within a building and to control humidity; it is responsible for meeting the heating load and cooling load of the building. The heating load is the amount of heat energy that must be added to a space to maintain a desired temperature, whereas the cooling load is the amount of heat energy that must be removed from a space to keep the temperature within desired limits. Both relate to the thermal load of the building: when the building is cold, the thermal load appears as heating load, and when the building is hot, it appears as cooling load [8]. The heating and cooling loads of a building directly affect its energy performance, which calls for an analysis of the factors that influence them. Studies reveal that various characteristics of a building and its structure affect the heating and cooling loads to a large extent [9, 10]. Predicting energy consumption in buildings gives insight into future energy demand, and if more energy is being consumed than expected, appropriate measures can be adopted to stabilize energy use.

This paper focuses on several important building features, e.g. relative compactness, surface area, wall area, roof area, overall height, orientation, glazing area and glazing area distribution. Feature selection techniques were employed to derive the features most relevant for predicting the heating load and cooling load. Three machine learning algorithms, namely multiple linear regression, K-nearest neighbours and support vector regression, and three ensemble learning algorithms, namely random forests, gradient boosting machines and extreme gradient boosting, were then used to build prediction models. These models were evaluated using several performance measures, e.g. RMSE, MSE, MAE, R squared and accuracy.

The rest of the paper is organized as follows: Sect. 1 introduces the problem. The literature review is presented in Sect. 2. Section 3 describes the machine learning techniques and ensemble techniques used. The methodology adopted for the work is described in Sect. 4. The experiments performed and the results obtained are presented in Sect. 5. Finally, the conclusions of this research are presented in Sect. 6.

2 Literature Survey

Several researchers worldwide have worked in the field of energy consumption and optimization in buildings. An important aspect in this domain is the analysis of the energy performance gap. In this context, a study on German households [11] introduces the prebound effect, in which occupants actually consume about 30% less energy than calculated in the standard ratings. This gap can be due to incorrect assumptions made during energy rating. Another study [12] reviews common usages of the term rebound effect, which describes situations where actual energy consumption exceeds the calculated ratings. Some of the important studies in the field of energy prediction using individual machine learning and ensemble techniques are analysed in Table 1.

Table 1 Survey of literature

3 Machine Learning

In this research, the applied machine learning techniques belong to two categories: traditional machine learning techniques and ensemble techniques.

3.1 Traditional Machine Learning Techniques

Three traditional machine learning techniques have been applied in our research work, namely multiple linear regression, K-nearest neighbours and support vector regression.

3.1.1 Multiple Linear Regression

Multiple linear regression (MLR) is quite similar to simple linear regression, with one significant difference: in an MLR model, a single response variable Y depends on multiple independent variables X1, X2, …, Xn. The relationship between the predictor variables and the response variable can be expressed as a conditional expectation, as shown in Eq. (1).

$$ E\left( {Y|X} \right) = \beta_{0} + \mathop \sum \limits_{i = 1}^{n} \beta_{i} X_{i} $$
(1)

\( \beta_{i} \) is the slope that depicts the change in the response variable Y when the predictor variable \( X_{i} \) is varied by one unit while the other predictors are kept constant. Interpreting the results becomes more complex in this model when the independent variables are correlated with one another [5]. The concept of MLR is shown graphically in Fig. 1 [35].

Fig. 1
figure 1

Multiple linear regression
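A minimal scikit-learn sketch of fitting Eq. (1) is given below; it uses randomly generated placeholder data rather than the actual building dataset, and all values are illustrative assumptions.

```python
# Minimal MLR sketch: fits the intercept beta_0 and slopes beta_i of Eq. (1)
# on randomly generated placeholder data standing in for the eight features.
import numpy as np
from sklearn.linear_model import LinearRegression

rng = np.random.default_rng(0)
X = rng.random((100, 8))                                      # placeholder feature matrix
y = 3.0 * X[:, 0] - 2.0 * X[:, 4] + rng.normal(0, 0.1, 100)   # synthetic response

mlr = LinearRegression().fit(X, y)
print(mlr.intercept_, mlr.coef_)    # fitted intercept beta_0 and slopes beta_i
```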

3.1.2 K-Nearest Neighbours

Also known as a “lazy learner”, this technique considers the “k” closest instances in the training dataset to predict the value of an unknown instance. These k instances are found by applying a distance metric such that they are the k-nearest neighbours of the unknown instance [36]. The returned value is the average of the values of the k-nearest neighbours [37].

If D is a dataset consisting of training instances xi and the value of an unknown instance p is to be predicted, the distance between p and xi can be obtained with Eq. (2).

$$ d\left( {p,x_{i} } \right) = \mathop \sum \limits_{f \in F} w_{f} \delta \left( {p_{f} ,x_{if} } \right) $$
(2)

where \( \delta \left( {p_{f} ,x_{if} } \right) = \left| {p_{f} - x_{if} } \right| \) for continuous attributes, F is the set of features and \( w_{f} \) is the weight assigned to feature f.

Graphical representation of KNN regression can be seen in Fig. 2 [38]. In our experiments, the value of k is taken as 4.

Fig. 2
figure 2

K-nearest neighbours [38]
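A brief sketch of KNN regression with k = 4, the value used in our experiments, is shown below; the data is synthetic placeholder data, not the building dataset.

```python
# KNN regression sketch with k = 4 (illustrative placeholder data).
import numpy as np
from sklearn.neighbors import KNeighborsRegressor

rng = np.random.default_rng(0)
X = rng.random((100, 8))
y = X.sum(axis=1) + rng.normal(0, 0.1, 100)

knn = KNeighborsRegressor(n_neighbors=4)   # k = 4 nearest neighbours
knn.fit(X, y)
print(knn.predict(X[:3]))                  # each prediction averages the 4 closest training targets
```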

3.1.3 Support Vector Regression

It aims at finding a function f(x) that deviates from the obtained target values yi in the training data by at most ε, while remaining as flat as possible. A linear function [39] can be described as:

$$ f\left( x \right) = \left\langle {w,x} \right\rangle + b\quad {\text{with}}\quad w \in {\mathbf{\mathcal{X}}},\;\;b \in {\mathbb{R}} $$
(3)

where \( {\mathcal{X}} \) represents input pattern space such that \( {\mathcal{X}} = {\mathbb{R}}^{\text{d}} \).

Figure 3 [40] shows the graphical representation of SVR. The legend in the figure shows the results of various SVR kernel functions applied to a sample dataset of 40 random numbers; the x-axis represents the data points and the y-axis the target values. A radial basis function (RBF) kernel can be described as:

$$ K\left( {X_{1} , X_{2} } \right) = \exp \left( { - \gamma \left| {\left| {X_{1} - X_{2} } \right|} \right|^{2} } \right) $$
(4)

where \( \left| {\left| {X_{1} - X_{2} } \right|} \right| \) is the Euclidean distance between points X1 and X2.

Fig. 3
figure 3

Support vector regression [40]
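The sketch below illustrates ε-SVR with the RBF kernel of Eq. (4) on synthetic placeholder data; the hyperparameter values are illustrative assumptions, not those tuned in this work.

```python
# epsilon-SVR sketch with an RBF kernel (illustrative placeholder data and parameters).
import numpy as np
from sklearn.svm import SVR

rng = np.random.default_rng(0)
X = rng.random((100, 8))
y = np.sin(6 * X[:, 0]) + rng.normal(0, 0.05, 100)

svr = SVR(kernel="rbf", C=10.0, epsilon=0.1, gamma="scale")  # epsilon-insensitive tube
svr.fit(X, y)
print(svr.predict(X[:3]))
```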

3.2 Ensemble Techniques

The basic idea of ensemble learning is to integrate the results of several individual machine learning models so that the combined prediction is more accurate and robust. Bagging and boosting are two popular ensemble methods. The ensemble techniques used in this paper are explained below.

3.2.1 Random Forests

It is a tree-based ensemble technique that can be applied to both classification and regression. Some of the features that make random forests appealing are prediction efficiency, suitability for highly multi-dimensional problems, handling of missing values, robustness to outliers, etc. [41, 42].

In regression with random forests, the trees are grown using a random vector θ so that the tree predictor h(x, θ) takes on numeric values rather than class labels.

The response values are numeric, and the training sample is assumed to be drawn independently from the joint distribution of the random vector (X, Y).

The RF predictor is formed by averaging the predictions of the k trees

$$ \left\{ {h\left( {x,\theta_{k} } \right)} \right\} $$
(5)

The mean square generalization error for a numeric predictor h(x) is given by

$$ E_{X,Y} \left( {Y - h\left( X \right)} \right)^{2} $$
(6)

As the number of trees in the forest tends to infinity, the generalization error of the RF predictor converges:

$$ E_{X,Y} \left( {Y - av_{k} h\left( {X,\theta_{k} } \right)} \right)^{2} \to E_{X,Y} \left( {Y - E_{\theta } h\left( {X,\theta } \right)} \right)^{2} $$
(7)

The schematic diagram of Random Forests is shown in Fig. 4.

Fig. 4
figure 4

Random forests
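A short sketch of the averaging described in Eqs. (5)–(7) using scikit-learn is given below; the number of trees and the placeholder data are illustrative assumptions.

```python
# Random forest regression sketch: the forest prediction is the average of the
# individual tree predictors h(x, theta_k) (illustrative placeholder data).
import numpy as np
from sklearn.ensemble import RandomForestRegressor

rng = np.random.default_rng(0)
X = rng.random((200, 8))
y = 5 * X[:, 0] + 2 * X[:, 4] + rng.normal(0, 0.1, 200)

rf = RandomForestRegressor(n_estimators=100, random_state=0)  # k = 100 trees
rf.fit(X, y)
print(rf.predict(X[:3]))   # mean over the 100 tree predictions
```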

3.2.2 Gradient Boosting Machines

GBM is also an ensemble learning technique whose base learner is a decision tree. In GBM, an additive regression model is built by iteratively fitting a simple base learner to the current pseudo-residuals by least squares at each iteration [43]. Gradient boosting produces a function F*(x) that maps x to y such that, over the joint distribution of all (y, x) values, the expected value of some specified loss function Ψ(y, F(x)) is minimized. This relation is depicted in Eq. (8).

$$ F^{*} \left( x \right) = \mathop {\arg \hbox{min} }\limits_{F} E_{y,x}\Psi \left( {y,F\left( x \right)} \right) $$
(8)

where y is the random output or response variable and x = {x1, x2, …, xn} is the set of random input variables.

Figure 5 shows the scheme behind gradient boosting machines.

Fig. 5
figure 5

Gradient boosting machines
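The sketch below shows this stage-wise fitting to pseudo-residuals using scikit-learn's GradientBoostingRegressor; the data and hyperparameters are placeholder assumptions.

```python
# Gradient boosting sketch: trees are added stage-wise, each fit to the current
# pseudo-residuals under a squared-error loss (illustrative placeholder data).
import numpy as np
from sklearn.ensemble import GradientBoostingRegressor

rng = np.random.default_rng(0)
X = rng.random((200, 8))
y = 5 * X[:, 0] - 3 * X[:, 2] + rng.normal(0, 0.1, 200)

gbm = GradientBoostingRegressor(n_estimators=200, learning_rate=0.05, random_state=0)
gbm.fit(X, y)
print(gbm.predict(X[:3]))
```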

3.2.3 Extreme Gradient Boosting

Apart from performance and speed as its key features, this technique also emphasizes scalability; several optimizations have been performed on the basic gradient boosting algorithm to ensure that the model scales [44]. Figure 6 shows the schematic diagram of XGBoost.

Fig. 6
figure 6

Extreme gradient boosting
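A minimal sketch using the xgboost library's scikit-learn interface follows; the library is assumed to be installed, and the hyperparameters and data are illustrative only.

```python
# XGBoost regression sketch (illustrative placeholder data and parameters).
import numpy as np
from xgboost import XGBRegressor

rng = np.random.default_rng(0)
X = rng.random((200, 8))
y = 4 * X[:, 0] + X[:, 5] + rng.normal(0, 0.1, 200)

xgb = XGBRegressor(n_estimators=200, learning_rate=0.05, max_depth=4, random_state=0)
xgb.fit(X, y)
print(xgb.predict(X[:3]))
```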

4 Method and Data

This section explains the workflow followed in this research. Figure 7 shows the steps of the methodology employed. The methodology starts with data collection, followed by data analysis and pre-processing, data partitioning and model construction using various machine learning and ensemble learning algorithms. Finally, the models are evaluated on various parameters. Each phase of the process is explained below.

Fig. 7
figure 7

Research approach

4.1 Data Set Collection and Preparation

The dataset used in this research is a standard dataset that has been collected from the University of California, Irvine (UCI) repository [45].

This dataset relates to energy efficiency in buildings and consists of eight building characteristics acting as input variables (X1, X2, …, X8) and two output variables: the heating load (Y1) and the cooling load (Y2) of the buildings.

A detailed description of the dataset parameters, along with their symbols and types, is given in Table 2.

Table 2 Dataset parameter description

4.2 Data Analysis and Pre-processing

Data pre-processing consists of checking the dataset for missing values and filling them with appropriate values, detecting and removing outliers, converting the data into a form suitable for applying the algorithms, selecting attributes, etc.

4.2.1 Statistical Analysis

The statistics of input parameters like minimum value, maximum value, mean, standard deviation, and variance were derived and are described in Table 3.

Table 3 Parameter statistics

The probability distributions of all input variables X1 to X8 and both output variables Y1 and Y2, estimated using histograms, are shown in Fig. 8. The distribution graphs show that none of the input or output variables follows a normal distribution.

Fig. 8
figure 8

Probability density estimates [12]

4.2.2 Feature Selection

Feature selection is an important step in machine-learning-based prediction because not all features are equally important for predicting the response value. Some features carry more weight than others in deriving a particular value and are thus more important, whereas others contribute little or nothing to the result. Such irrelevant features need to be excluded from the input to save training and computation time. Additionally, applying an algorithm only to the important and relevant features may yield more accurate predictions and reduce overfitting. In this research, feature selection has been performed in the following two ways:

4.2.2.1 Filter Feature Selection

It is a univariate method in which statistical techniques are used to derive the relationship between each input variable and the target variable. Features that are strongly related to the response variable are selected as input for the algorithms, and features that are weakly related to the response variable can be eliminated. In our research, the Spearman correlation coefficient was calculated to measure the strength of the relationship between each independent variable in the dataset and each of the response variables. Spearman's method was used because the distribution of the dataset is non-Gaussian. A correlation coefficient of zero means the variables are not correlated, i.e., they are independent, whereas a value closer to 1 in magnitude indicates a strong correlation [46]. A strong positive correlation between two variables means that one varies in accordance with the other: if one increases, the other also increases, and a reduction in one tends to reduce the other. The correlation coefficients between the independent variables X1–X8 and Y1 are shown in Fig. 9, and those with Y2 are shown in Fig. 10.

Fig. 9
figure 9

Correlation of input variables with Y1

Fig. 10
figure 10

Correlation of input variables with Y2
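A small sketch of how such correlations can be computed with pandas is given below; the DataFrame `df` and the column names X1…X8, Y1, Y2 are assumptions about how the UCI data is loaded.

```python
# Filter feature selection sketch: Spearman correlation of each input with a target.
# `df` is assumed to hold the UCI data with columns X1..X8, Y1 and Y2.
import pandas as pd

def spearman_with_target(df: pd.DataFrame, target: str, outputs=("Y1", "Y2")) -> pd.Series:
    corr = df.corr(method="spearman")[target]            # rank-based correlations
    corr = corr.drop(labels=[c for c in outputs if c in corr.index])
    return corr.sort_values(key=abs, ascending=False)    # strongest relationships first

# Example: spearman_with_target(df, "Y1") ranks the inputs for the heating load.
```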

4.2.2.2 Feature Importance

As mentioned earlier, features play a very important role in prediction, and some features tend to be more important than others. In this context, feature importance graphs were generated to obtain the degree of influence of each independent parameter, so that the contribution of each feature to the prediction is known and the selection can be made accordingly. Figures 11 and 12 show the feature importance graphs generated using random forests and gradient boosting machines, respectively. Each technique produced a similar importance ranking for both response variables, Y1 and Y2. According to random forests, overall height has the maximum importance, followed by relative compactness, surface area, wall area, glazing area, roof area, glazing area distribution and orientation. The sequence of features derived by gradient boosting machines, in decreasing order of importance, is relative compactness, surface area, roof area, overall height, glazing area, wall area, orientation and glazing area distribution.

Fig. 11
figure 11

Feature importance using RF

Fig. 12
figure 12

Feature importance using GBM
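Rankings such as those in Figs. 11 and 12 can be obtained from the impurity-based feature_importances_ of fitted models, as in the hedged sketch below; X and y are assumed to be the prepared features and one response variable.

```python
# Feature importance sketch: impurity-based importances from RF and GBM.
# X is assumed to be a DataFrame of features X1..X8 and y one response (Y1 or Y2).
import pandas as pd
from sklearn.ensemble import RandomForestRegressor, GradientBoostingRegressor

def importance_table(X: pd.DataFrame, y) -> pd.DataFrame:
    rf = RandomForestRegressor(n_estimators=100, random_state=0).fit(X, y)
    gbm = GradientBoostingRegressor(random_state=0).fit(X, y)
    table = pd.DataFrame({"RF": rf.feature_importances_,
                          "GBM": gbm.feature_importances_}, index=X.columns)
    return table.sort_values("RF", ascending=False)
```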

Figures 13 and 14 show the graphs generated by applying the LASSO technique for feature importance for Y1 and Y2, respectively. According to LASSO, relative compactness, overall height, glazing area and glazing area distribution are the most important features for predicting the heating load of a building, while relative compactness, surface area, overall height, orientation and glazing area are the most important for predicting the cooling load. Therefore, in the experiments, X1, X5, X7 and X8 were selected as input parameters for predicting Y1, and X1, X2, X5, X6 and X7 were selected for predicting Y2.

Fig. 13
figure 13

Feature importance for Y1 using LASSO

Fig. 14
figure 14

Feature importance for Y2 using LASSO
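A selection of this kind can be approximated with a standardized LASSO fit that keeps only the features with non-zero coefficients; the sketch below is an assumption about the procedure, and the alpha value is illustrative, not the one used in the paper.

```python
# LASSO feature selection sketch: features whose coefficients are shrunk to zero
# are dropped. The alpha value here is illustrative only.
from sklearn.linear_model import Lasso
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

def lasso_selected(X, y, feature_names, alpha=0.01):
    model = make_pipeline(StandardScaler(), Lasso(alpha=alpha))
    model.fit(X, y)
    coefs = model.named_steps["lasso"].coef_
    return [name for name, c in zip(feature_names, coefs) if abs(c) > 0]
```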

4.3 Data Partitioning

The dataset was partitioned according to the 70–30% rule into two subsets: a training dataset and a testing dataset. Random sampling without replacement was applied, giving 70% training data, on which the algorithms were trained, with the remaining 30% used for testing the algorithms.
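A minimal sketch of this 70–30 split with scikit-learn is given below; `df` is assumed to be the full 768-row DataFrame and the random seed is illustrative.

```python
# 70-30 split by random sampling without replacement (illustrative seed).
from sklearn.model_selection import train_test_split

train_df, test_df = train_test_split(df, test_size=0.30, random_state=42)
# roughly 538 training rows and 230 testing rows out of 768
```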

4.4 Model Construction

Models were constructed by applying three machine learning techniques, namely multiple linear regression, K-nearest neighbours and support vector regression, and three ensemble techniques, namely random forests, gradient boosting machines and extreme gradient boosting. The models were trained on the training dataset and evaluated on the testing dataset.

4.5 Model Evaluation

The results obtained after applying the algorithms were evaluated using five well-known performance measures, namely root mean square error, mean square error, mean absolute error, R squared and accuracy. These measures are calculated with the following formulae, where \( Y_{i} \) is the observed value of the ith observation, \( \hat{Y}_{i} \) is the predicted value and N is the sample size.

The original dataset of 768 instances was partitioned by random sampling into a 70% training subset and a 30% testing subset, so N = 538 for model construction on the training dataset and N = 230 for testing.

Root mean square error The following equation defines the formula for RMSE:

$$ {\text{RMSE}} = \sqrt[2]{{\sum\limits_{i = 1}^{N} {\frac{{\left( {\hat{Y}_{i} - Y_{i} } \right)^{2} }}{N}} }} $$
(9)

Mean square error The following equation defines the formula for MSE:

$$ {\text{MSE}} = \frac{1}{N}\mathop \sum \limits_{i = 1}^{N} \left( {Y_{i} - \hat{Y}_{i} } \right)^{2} $$
(10)

Mean absolute error MAE can be defined by the following equation:

$$ {\text{MAE}} = \mathop \sum \limits_{i = 1}^{N} \frac{{\left| {Y_{i} - \hat{Y}_{i} } \right|}}{N} $$
(11)

R Squared R squared can be defined by the following equation:

$$ R^{2} = 1 - \frac{{\sum \left( {Y_{i} - \hat{Y}_{i } } \right)^{2} }}{{\sum \left( {Y_{i} - \bar{Y}} \right)^{2} }} $$
(12)

Accuracy Accuracy of a model can be calculated using the following formula:

$$ {\text{Accuracy}} = \left( {1 - \frac{{\left| {V_{\text{A}} - V_{\text{O}} } \right|}}{{V_{\text{A}} }}} \right)*100 $$
(13)

where VA is the actual value and VO is the obtained (predicted) value.

Sample calculations performed on the dataset using the above equations are shown in “Appendix”.
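A compact sketch of how these measures can be computed with scikit-learn is shown below; the accuracy line follows Eq. (13) and is an assumption about how the percentage is derived.

```python
# Evaluation sketch: RMSE, MSE, MAE, R squared and the Eq. (13) accuracy.
import numpy as np
from sklearn.metrics import mean_squared_error, mean_absolute_error, r2_score

def evaluate(y_true, y_pred):
    y_true, y_pred = np.asarray(y_true), np.asarray(y_pred)
    mse = mean_squared_error(y_true, y_pred)
    return {
        "RMSE": float(np.sqrt(mse)),
        "MSE": float(mse),
        "MAE": float(mean_absolute_error(y_true, y_pred)),
        "R2": float(r2_score(y_true, y_pred)),
        # Eq. (13): 100 * (1 - mean relative deviation) -- an assumption
        "Accuracy %": float(100 * (1 - np.mean(np.abs(y_true - y_pred) / np.abs(y_true)))),
    }
```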

5 Results

All the experiments in this research were performed using the Python programming language. Three machine learning algorithms, namely MLR, KNN and SVR, and three ensemble techniques, namely RF, GBM and XGBoost, were evaluated on the collected dataset. The results of the ML and ensemble experiments are presented in Tables 4 and 5, respectively.

Table 4 Results of ML algorithms
Table 5 Results of ensemble algorithms

5.1 Results of Classical ML Techniques

The results obtained after applying the three aforementioned classical machine learning algorithms to the dataset are summarized in Table 4 in terms of the performance measures. The RMSE values range between 3.13 and 4.22 for both output variables Y1 and Y2 with MLR and SVR, whereas they are lower with KNN: 2.86 for Y1 and 2.25 for Y2. Correspondingly, the MSE values for KNN are also lower than those for MLR and SVR. Similarly, the MAE values range between 2.25 and 3.19 with MLR and SVR, versus 1.96 and 1.54 with KNN. The R squared values are better for KNN (0.90 and 0.94) than for MLR (0.87 and 0.83) and SVR (0.76 and 0.84). KNN also outperforms the other two algorithms in terms of accuracy.

5.2 Results of Ensemble Techniques

Table 5 summarizes the results of the experiments performed with the ensemble techniques. The RMSE for output variable Y1 is 0.50 for RF and XGBoost and 0.52 for GBM; for Y2 it varies slightly, at 2.18 for RF, 1.86 for GBM and 1.93 for XGBoost. Correspondingly, the MSE for Y1 is 0.25 with RF and XGBoost and 0.27 with GBM, while for Y2 it is 4.78, 3.47 and 3.72 with RF, GBM and XGBoost, respectively. The MAE for Y1 varies only slightly, in the range 0.36–0.38 for all three algorithms, whereas for Y2 the range is 1.25–1.39. The R squared values for Y1 are the same (0.99) for all three algorithms, and for Y2 they vary from 0.94 to 0.96. Accuracy is likewise above 99% for Y1 with all three algorithms, whereas for Y2 it is 94.79%, 96.22% and 95.94% with RF, GBM and XGBoost, respectively.

Figures 15 and 16 show the graphs plotted for results obtained in Table 4 for ML algorithms and Table 5 for ensemble algorithms, respectively. Figure 17 represents combined results for all six algorithms (ML and Ensemble) for both response variables Y1 and Y2.

Fig. 15
figure 15

Graphical representation of Table 4 results

Fig. 16
figure 16

Graphical representation of Table 5 results

Fig. 17
figure 17

Combined graph for all six algorithm results

5.3 Comparative Analysis of Machine Learning and Ensemble Learning Algorithms

In this section, the results of experiments are represented graphically based on various performance measures used for results evaluation. The graphs for RMSE, MAE, R squared and accuracy are shown in Figs. 18, 19, 20 and 21, respectively.

Fig. 18
figure 18

Graph for RMSE for Y1 and Y2

Fig. 19
figure 19

Graph for MAE for Y1 and Y2

Fig. 20
figure 20

Graph for R squared for Y1 and Y2

Fig. 21
figure 21

Graph for accuracy for Y1 and Y2

It can be observed from Fig. 18 that the RMSE is high (more than 3.0) for the SVR and MLR algorithms for both output variables Y1 and Y2, and comparatively lower (approximately 2.0) for KNN, whereas with all the ensemble algorithms (RF, GBM and XGBoost) the error is extremely low for Y1 (below 0.5) and between 1.8 and 1.9 for Y2.

A similar pattern can be observed in Fig. 19 for MAE. The values for the MLR and SVR algorithms range between 2.32 and 2.63 for both Y1 and Y2. They are lower with KNN: 1.50 for Y1 and 1.35 for Y2. The ensemble algorithms perform better still, with MAE between 1.19 and 1.27 for Y2 and even lower values (0.35–0.36) for Y1.

For R squared (Fig. 20), values approaching 1 are better. Here again the ensemble techniques outperform the traditional ML techniques. The lowest R squared values are obtained for SVR, 0.82 and 0.79 for Y1 and Y2, respectively. Slightly higher values are obtained for MLR (0.88 and 0.83) and higher still for KNN (0.94 and 0.96). The R squared results obtained with the ensemble algorithms are significantly better than those of the traditional algorithms, ranging from 0.94 up to 0.99.

In Fig. 21, the plot of the accuracy scores likewise shows that the ensemble techniques perform significantly better than the traditional algorithms, with accuracies ranging between 96 and 99.76%. Among the classical ML techniques, KNN performs best with accuracy scores of 95.12% and 96.47%, while the accuracy scores for MLR and SVR range between 85.75 and 89.63%.

Ensemble techniques have become popular over the last two decades in the area of classification and prediction. The idea behind ensemble methods can be compared to real-life situations: when critical decisions have to be taken, the opinions of several experts are often taken into account rather than relying on a single judgment. Ensembles have been shown to be more accurate in many cases than individual models. Ideal ensembles consist of models that are individually accurate yet differ from each other as much as possible: if each model makes different mistakes, the total error is reduced, whereas if the models are identical, combining them is useless since the results remain unchanged. It is evident from the survey of literature in Table 1 [13, 14, 17, 18, 23, 25, 26, 29] that ensemble techniques are far better in terms of prediction performance than traditional machine learning algorithms. In line with this, the results of the experiments performed in this research also show that the predictions made by the ensemble models yield much lower error values (RMSE, MSE and MAE), better R squared values and improved accuracy compared with the traditional machine learning models used.

6 Conclusion and Future Scope

The rapid and large-scale consumption of energy demands solutions that help use energy efficiently. Globally, buildings are the largest energy consumers, accounting for nearly 40% of energy consumption. Analysis of the various energy-consuming components of a building reveals that the HVAC system consumes a large percentage of the building's total energy; it needs this energy to meet the heating load and cooling load requirements of the building. Heating and cooling loads are largely affected by various attributes of a building. This research shows that relative compactness, surface area, overall height, orientation and glazing area are the most important attributes for predicting the heating load and cooling load of buildings. Furthermore, the experimental results show that ensemble techniques perform better than traditional machine learning techniques.

In this research, only one dataset was used. In future work, experiments can be performed on multiple datasets with a larger number of instances to better establish the accuracy of the models. Apart from the models applied in this research, more advanced ensemble strategies such as stacking and voting can be applied for further analysis.