Abstract
In this study, a hybrid support vector machine–firefly optimization algorithm (SVM-FFA) model is proposed to estimate monthly mean horizontal global solar radiation (HGSR). The merit of SVM-FFA is assessed statistically by comparing its performance with three previously used approaches. Using each approach and long-term measured HGSR, three models are calibrated by considering different sets of meteorological parameters measured for Bandar Abbass situated in Iran. It is found that the model (3) utilizing the combination of relative sunshine duration, difference between maximum and minimum temperatures, relative humidity, water vapor pressure, average temperature, and extraterrestrial solar radiation shows superior performance based upon all approaches. Moreover, the extraterrestrial radiation is introduced as a significant parameter to accurately estimate the global solar radiation. The survey results reveal that the developed SVM-FFA approach is greatly capable to provide favorable predictions with significantly higher precision than other examined techniques. For the SVM-FFA (3), the statistical indicators of mean absolute percentage error (MAPE), root mean square error (RMSE), relative root mean square error (RRMSE), and coefficient of determination (R 2) are 3.3252 %, 0.1859 kWh/m2, 3.7350 %, and 0.9737, respectively which according to the RRMSE has an excellent performance. As a more evaluation of SVM-FFA (3), the ratio of estimated to measured values is computed and found that 47 out of 48 months considered as testing data fall between 0.90 and 1.10. Also, by performing a further verification, it is concluded that SVM-FFA (3) offers absolute superiority over the empirical models using relatively similar input parameters. In a nutshell, the hybrid SVM-FFA approach would be considered highly efficient to estimate the HGSR.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Avoid common mistakes on your manuscript.
1 Introduction
Currently, solar energy is being broadly harnessed in various locations across the globe to enhance the sustainability and abate the prevalent environmental problems such as global warming and air pollution. On this account, various technologies have been invented in which solar energy can be utilized either directly or indirectly. Nevertheless, the availability of precise solar radiation data is a fundamental requirement for solar system specialists to successfully simulate, operate, and monitor the solar energy technologies for a variety of applications (Bannani et al. 2006; Mubiru et al. 2007; Mubiru and Banda 2007; Benghanem and Mellit 2014; Flores et al. 2015). Unfortunately, the reliable measured solar radiation data, even in the form of global solar radiation, are not accessible in many sites due to a series of obstacles including the required costs for purchasing, maintaining, and calibrating the measurement equipment (Wu et al. 2012; Shamim et al. 2015). Thus, this has necessitated the development of proper models for accurate prediction of global solar radiation using a considerable number of input elements (Gueymard 2014; Yadav and Chandel 2014). These parameters include meteorological and geographical variables such as sunshine duration, ambient temperatures, relative humidity, water vapor and sea level pressures, cloud cover, altitude, latitude, longitude, and extraterrestrial radiation. Nonetheless, although numerous studies have been conducted to estimate global solar radiation in various regions, developing new techniques and models with high level of reliability and adaptability to achieve further accuracy would be still a main challenge.
Recently, the artificial intelligence and computational intelligence techniques are extensively utilized to solve real problems where conventional methodologies are inadequate or further accuracy is required. Application of such approaches in the realm of solar radiation estimation has received specific attention in recent years.
Tulcan-Paulescu and Paulescu (2008) employed the fuzzy set theory to estimate the global solar radiation from air temperatures. By testing the developed fuzzy-based model using the data of many European stations, they found that the model would provide favorable estimations which are comparable with existing models. Moghaddamnia et al. (2009) provided a comparison between different nonlinear models such as adaptive neuro-fuzzy inference system (ANFIS) to estimate the daily global solar radiation using extraterrestrial radiation, precipitation, air temperature, and wind speed in Brue catchment, UK. Chen et al. (2011) examined the possibility of utilizing the support vector machines (SVMs) for estimating the monthly mean global solar radiation utilizing maximum and minimum air temperatures at Chongqing station, China. They applied three different equations such as linear, polynomial, and radial basis function as kernel functions. They found more preciseness for the SVM model developed using polynomial kernel function. Ozgoren et al. (2012) developed an artificial neural network (ANN) model on the basis of multi-nonlinear regression (MNLR) method for estimation of the monthly global solar radiation over Turkey. They used various variables and then employed the stepwise MNLR method to determine the most proper input values. Their results showed that the ANN model can predict the values with acceptable errors compared with the actual data. Linares-Rodriguez et al. (2013) developed an optimized ANN model to calculate the daily global solar radiation over Andalusia, Spain. In the model, they utilized both clear-sky estimates and satellite images as input elements and also applied genetic algorithm to optimize the selection of inputs. They found that the predicted values by the model are relatively precise. Chen and Li (2014) assessed the performance of SVM for estimation of global solar radiation using measured data of 15 stations in China. They established 20 SVM models based on different combinations of meteorological parameters. Their results indicated that SVM models show remarkable superiority over empirical models with an average of 14 % more precision. Rizwan et al. (2014) applied fuzzy logic (FL) technique to model monthly mean global solar radiation in four Indian stations by different input data. They found that the developed FL-based model is accurate since the amounts of obtained errors are limited. Ramedani et al. (2014) employed support vector regression (SVR) technique to develop a model for prediction of global solar radiation in Tehran, Iran. They used two SVR models of radial basis function (SVR-rbf) and polynomial function (SVR-poly). They found more superiority for SVR-rbf technique. Dahmani et al. (2014) evaluated the capability of ANN method to estimate the 5 min tilted horizontal global solar radiation from horizontal ones in Bouzareah, Algeria. They concluded that very favorable precision can be achieved by ANN since the attained relative root mean square error is around 8 %.
In the last few years, many authors have aimed at enhancing the accuracy of solar radiation estimation by combining some approaches.
Mostafavi et al. (2013) developed a hybrid approach for estimation of the solar global radiation by combining genetic programming (GP) and simulated annealing (SA). They also performed a sensitivity analysis to assess the influence of different meteorological parameters on solar radiation estimation. Their results showed that the suggested model provide precise predictions. Salcedo-Sanz et al. (2014) assessed the capability of a novel coral reefs optimization–extreme learning machine (CRO–ELM) algorithm to predict the global solar radiation at Murcia (southern Spain) using different meteorological data. They concluded that the CRO–ELM approach can predict the daily global radiation accurately with further preciseness than the classical ELM and the SVR algorithm. Wu et al. (2014) developed a genetic algorithm combing multi-model framework to predict solar radiation. By comparing the prediction performance of the proposed technique with some other algorithms, they found higher accuracy and consistency for their approach. Bhardwaj et al. (2013) proposed a hybrid approach which includes hidden Markov models and generalized fuzzy models to estimate solar irradiation in India. They assessed the influence of different meteorological parameters for estimation of solar radiation using the developed model. Their results showed that the predicted values by the proposed model are in favorable agreements with the measured data. Huang et al. (2013) developed a hybrid autoregressive and dynamical system (CARDS) model to forecast hourly global solar radiation in Mildura, Australia. Their results indicated that the CARDS model can forecast hourly solar radiation favorably. Wu and Chan (2011) combined the autoregressive and moving average (ARMA) model with the controversial time delay neural network (TDNN) to predict hourly solar radiation. The achieved results revealed that the hybrid model has a higher capability than both ARMA and TDNN.
The utilization of hybrid models for solar radiation estimation has gained immense popularity since it takes the advantages of different approaches. As a consequence, in this research work, a new model is proposed to estimate monthly mean daily horizontal global solar radiation by hybridizing the SVMs and firefly optimization algorithm (FFA). Basically, SVMs are a type of soft computing technique that has lately obtained importance in the variety of applications such as solar radiation estimation. The exactness of a SVM model is chiefly reliant upon the determination of its model parameters; thus, the FFA is applied to boost the performance of SVMs. To verify the capability of the developed hybrid SVM-FFA model, long-term measured databases including horizontal global solar radiation and different meteorological parameters for port of Bandar Abbass located in south part of Iran are utilized. To ensure the accuracy and adaptability of the proposed model, its prediction performance is appraised against ANN, GP, and ARMA. Various combinations of meteorological parameters are used as inputs in order to establish three models based upon each technique. The hybrid approach proposed in this study is new and differs from the SVMs reported in literature in that it utilizes the firefly optimization algorithm to select its parameters in a more appropriate manner.
The organization of the reminder of the paper is as follows: Section 2 explains the data sets utilized for the analysis. Section 3, which offers the utilized methodology, is divided into two parts: While in section 3.1, the support vector machine is described, in section 3.2 the firefly optimization algorithm is explained. The utilized statistical indicators for models’ performance assessment are introduced and reviewed in section 4. The results and discussion are brought forward in section 5. Finally, the conclusions are presented in section 6.
2 Data description
To evaluate the adaptability and accuracy of the proposed hybrid SVM-FFA approach, the long-term measured global solar radiation along with many meteorological parameters for port of Bandar Abbass, located in Iran, have been utilized. Port of Bandar Abbas, the capital city of the Hormozgan province, is situated in the southern part of Iran at geographical location of 27° 13′ N and 56° 22′ E, and its elevation is 9.8 m above the sea level. Long warm season and cool short season are the climatic characteristics of the region. Basically, the region is a desert zone with extremely low level of atmospheric precipitation (http://en.wikipedia.org/wiki/Bandar Abass>Accessed August 20, 2014). Based upon Köppen classification, the climate condition of Bandar Abbas is categorized as BWh, which relates to arid desert hot (Kottek et al. 2006).
For this research work, long-term measured data consisting the daily horizontal global solar radiation (RS); sunshine duration (n); minimum, maximum, and average air temperatures (Tmin, Tmax, and Tavg); relative humidity (Rh); and water vapor pressure (Vp) provided by Iranian Meteorological Organization (IMO) for the period of 14 years from January 1992 to December 2005 were utilized.
Prior to performing any computational process, a preliminary test was conducted to improve the quality of raw data. The data cleaning procedure generally aims at enhancing the data quality by checking and filtering them from any uncertainty or erroneous. In horizontal global solar radiation data used in this study, there were some missing and also unreliable values possibly due to instruments’ malfunction. In this research work, an approach same as the previous studies was applied to achieve further accuracy and consistency in the quality of data (Mohammadi et al. 2015a; Mohammadi et al. 2015b). After conducting the quality control test, the daily data of each month were averaged to obtain the monthly mean daily values.
To model the horizontal global solar radiation (RS) via the proposed approach, different combinations of data consisting relative sunshine duration defined as the ratio of sunshine duration to the maximum possible sunshine duration (n/N), difference between maximum and minimum ambient temperatures (Tmax − Tmin), relative humidity (Rh), water vapor pressure (VP), average ambient temperature (Tavg), and extraterrestrial solar radiation on a horizontal surface (Ra) are used as inputs. It is worth mentioning that the values of Ra and N were computed by the equations presented in the Appendix.
To achieve reliable evaluation and comparison, the developed hybrid model is tested with data set that has not been used during the training process. For this aim, the obtained monthly mean daily data were divided into two parts of training and testing data sets. The first set of 10 years from 1992–2001 (10 × 12) were used for training phase while the second set of 4 years from 2002–2005 (4 × 12) were utilized for testing phase.
Figure 1a–f illustrates the variation of monthly mean daily values of RS (MJ/m2), n/N (dimensionless), Tmax − Tmin (°C), Rh (%), VP (mb), Tavg (°C), respectively. The periods considered as training and testing phases have been shown in each figure.
3 Methodology
In this study, a hybrid approach named SVM-FFA is developed by coupling the SVM with FFA for prediction of horizontal global solar radiation. The potential and precision of the SVM-FFA approach is compared with ANN, GP, and ARMA. This section aims at describing briefly the support vector machine and firefly optimization algorithm as well as the encoding and methodology carried out to estimate the monthly mean daily global solar radiation with the proposed hybrid SVM-FFA approach. The description of ANN, GP, and ARMA can be found in the literature (Mora-López and Sidrach-de-Cardona 1998; Alam et al. 2009; Şenkal and Kuleli 2009; Voyant et al. 2012; Russo et al. 2014).
3.1 SVM
SVM is one of the soft computing learning algorithms which has recently applied in the variety of fields such as computing, hydrology, and environmental researches (Lu and Wang 2005; Asefa et al. 2006; Ji and Sun 2013; Sun 2013). It has mainly utilized in pattern recognition, forecasting, classification, and regression analysis. It has been proved that its applications show superior performance compared to prior developed methodologies such as neural network and other conventional statistical models (Vapnik et al. 1996; Joachims 1998; Collobert and Bengio 2000; Mukkamala et al. 2002; Huang et al. 2002; Sung and Mukkamala 2003). The details of theory and evolution of SVM developed by Vapnik can be found in (Vapnik and Vapnik 1998; Vapnik 2000).
SVM was developed according to the statistical machine learning development as well as structural risk minimization to reduce the upper bound generalization error compared to local training error, which is a common technique in the previously used machine learning methodologies. The mentioned technique proved advantages over other soft computing learning algorithms. Additional advantages provided in this methodology include (1) applying high dimensional spaced set of kernel equations, which discreetly include nonlinear transformation; thus, there is no assumption in functional transformation which makes data linearly separable indispensable and (2) unique solution due to the convex nature of the optimal problem.
SVM functions according to Vapnik’s theory are represented in Eqs. (1–4). R = {xi, di}ni is used to assume a set of data points. xi indicates the input space vector of the data sample. Also, di and n are the target value and data size, respectively. SVM approximates the function as represented in Eqs. (1) and (2):
In Eq. (1), φ(x) indicates high dimensional space characteristic that mapped the input space vector x. Also, w and b are a normal vector and scalar, respectively. In addition, \( C\frac{1}{n}{\displaystyle \sum_{i=1}^nL\left({x}_i,{d}_i\right)} \) stands error or risk. Factors b and w are measured by minimization of regularized risk equation following by introduction of positive slack variables ξi and ξ*i that indicate upper and lower excess deviation (Vapnik and Vapnik 1998):
where \( \frac{1}{2}{\left\Vert w\right\Vert}^2 \) is the regularization term, C represents the error penalty feature utilized to control the trade-off between the empirical error (risk) and regularization term, \( \varepsilon \) represents the loss function associated to approximation accuracy of the trained data point and the number of factors in the training data set which is defined as the l.
Optimality constraints and Lagrange multiplier which can be used to solve Eq. (1) are consequently obtained using a generic function as follows:
In Eq. (4), K(x, xi) = φ(xi)φ(xj) and the term K is defined as the kernel function, which is dependent on the two inner vector xi and xj in the feature space φ(xi) and φ(xj), respectively.
The main objective of SVMs is to determine data correlation through nonlinear mapping methodology. The kernel function, denoted by K, as a straight-forward computation technique (hereafter) can be used to generate a nonlinear learning machine. The method is employed to calculate the inner product in a feature space that serve as a function to original input points. The adaptability of SVM to use kernel functions is important where it discreetly alters the information into a higher dimensional feature space. The obtained results in such a space typify the outcomes of the lower dimensional, original input space.
Sigmoid, lineal, polynomial, and radial basis functions are the four basic kernel functions which are provided by SVM. Over time, the radial basis function (RBF) has been repeatedly proven to be the ideal function in its category due to its ability for efficient, simple, reliable, and adaptable computation for the purpose of optimization especially for adaptability in handling the parameters which are complex (Rajasekaran et al. 2008; Yang et al. 2009; Wu and Wang 2009). Only the solution of a set of linear functions are required for the training of RBF kernel equation rather than the lengthy and complicated demanding quadratic programming problem (Shamshirband et al. 2014; Mohammadi et al. 2015c). Accordingly, the radial basis equation with parameter σ is adopted. The nonlinear radial basis kernel function is defined as
where xi and xj are vectors in the input space, i.e., vectors of features computed from training or testing samples. In addition, the accuracy of predictions using RBF kernel function depends on the selection of its three factors (γ, ε, and C). In this study, the optimal values of these factors are established using firefly optimization algorithm, which is described in the following subsection.
3.2 SVM parameter selection using firefly optimization algorithm
Over the years, biological inspired metaheuristic optimization algorithms such as ant colony optimization (ACO), genetic algorithm (GA), particle swarm optimization (PSO), cuckoo search (CS), FFA, and many more have found wide applications in the fields of optimization (Kisi 2014; Kıran et al. 2012; Sudheer et al. 2014; Bojic et al. 2012). A more recent approach in biological inspired metaheuristic optimization algorithms is FFA developed by Yang (2010). This approach is on the basis of the certain behavioral pattern, particularly the flashing characteristic of fireflies. A firefly is a kind of insects that utilize the principle of bioluminescence to attract mates or prey. The luminance produced by a firefly enables other fireflies to trail its path in search of their prey. This concept of luminance production is useful to develop algorithms for solving many optimization problems. FFA proves to be more promising, robust, and efficient in finding both local and global optimal compared to other existing metaheuristic algorithms (Mohammadi et al. 2013; Amiri et al. 2013).
The fundamental rules in FFA development are as follows: (1) all fireflies are assumed unisex; thus, each has the opportunity to attract another one irrespective of their sex; (2) the attractiveness of one firefly to another is proportional to the amount of luminance produce (luminous intensity) which is declined with increasing the distance between them; consequently, the ones with less brightness will always move toward the ones with higher brightness; and (3) the brightness of the individual firefly is affected by the nature of the encoded cost function, simply say, the brightness is proportional to the value of the fitness or objective function (Poursalehi et al. 2013; Olatomiwa et al. 2015). The major issues in FFA development are the formulation of the objective function (attractiveness) and the variation of the light intensity. As an instance, in the optimal design problem involving the maximization of objective function, the fitness function is proportional to the brightness or the amount of light emitted by the firefly. Therefore, decrement of the light intensity due to more distance between the fireflies will lead to the variations of intensity and thereby lessen the attractiveness among them. Equation (6) can be used to represent the light intensity with varying distance.
where I is the light intensity at distance r from a firefly, Io represents initial light intensity, i.e., when r = 0 and γ is the light absorption coefficient which can be taken as a constant value varying between 0.1 and 10 (Sudheer et al. 2014). As a firefly’s attractiveness is proportional to the light intensity observed by adjacent fireflies, we can represent the attractiveness β at a distance r from the firefly as
where βo shows the attractiveness at distance r = 0.
Equation (8) represents the Cartesian distance between any two fireflies i and j:
The movement of firefly i as attracted to another brighter firefly j can be represented as
The first term appeared in the Eq. (9) is due to the attraction, while the second term represents the randomization with α as randomization coefficient whose value is between 0 and 1 (Sudheer et al. 2014) and εi is the random number vector derived from a Gaussian distribution. The next movement of firefly i is updated as
4 Performance assessment criteria
The robustness of the proposed hybrid SVM-FFA approach to estimate the monthly mean daily horizontal global solar radiation is evaluated via different statistical indicators of mean absolute percentage error (MAPE), root mean square error (RMSE), relative root mean square error (RRMSE), and coefficient of determination (R 2).
The MAPE, as an accuracy level estimator, shows the mean absolute percentage difference between the estimated and the measured data. The MAPE is obtained by
where Hi,c is the ith calculated solar radiation value by predictive techniques and Hi,m is the ith measured solar radiation value. Also, x is the total number of observations.
The RMSE determines the precision of the model by comparing the deviation between the estimated and the measured data. The RMSE has always a positive value and is calculated by
The RRMSE in percent is achieved by dividing the RMSE to the average of measured values, which is defined by
According to Li et al. (2013), different ranges of RRMSE can be defined to show the models’ capability such that a model precision is
-
Excellent for RRMSE <10 %;
-
Good for 10 % < RRMSE <20 %;
-
Fair for 20 % < RRMSE <30 %;
-
Poor for RRMSE >30 %.
The R 2 provides a measure of the linear relationship between the estimated and the measured values. The R 2 is obtained by
where Hm,a v g is the average of measured values.
It is worth mentioning that the smaller values of MAPE, RMSE, and RRMSE represent further preciseness of the global solar radiation estimation and in an ideal case they are zero. The R 2 ranges between 0 and +1. The R 2 value around +1 indicates that there is a perfect linear relationship between the estimated values and measured ones whereas R 2 around zero shows that there is no linear relationship.
5 Results and discussion
In this study, as mentioned earlier, the RBF was applied as the kernel function for the prediction of monthly mean global solar radiation. The three parameters associated with RBF kernels are C, γ, and ε. The optimal values of these parameters were obtained using firefly algorithm. Table 1 provides the achieved optimal values of user-defined parameters of C, γ, and ε.
Generally, the capability of each model and technique to offer accurate estimations is contingent upon proper input parameter selection. Various predictive variables described in section 2 with eight different possible combinations have been considered to find a more suitable set based upon a primary analysis of input parameter selection. It was found that combination of relative sunshine duration, difference between maximum and minimum air temperatures, relative humidity, and water vapor pressure is more effective to obtain acceptable estimation. For this aim, according to the examination conducted, three models with different combinations of input elements as presented in Table 2 are established via four approaches of SVM-FFA, ANN, GP, and ARMA and later explored to determine the most precise one.
Through different widely utilized statistical parameters of MAPE, RMSE, RRMSE, and R 2, the potential of the proposed hybrid model as well as ANN, GP, and ARMA models were assessed. The results are offered in Table 3 for both training and testing phases. According to the statistical indicators and one by one comparison of models (1)–(3), it is apparently found that SVM-FFA approach enjoys superior performance compared to the ANN, GP, and ARMA techniques. Besides, model (3) established based on each approach utilizing relative sunshine duration, difference between air temperatures, relative humidity, water vapor pressure, average temperature, and extraterrestrial solar radiation as inputs provides more precision compared to models (1) and (2). Therefore, it can be concluded that for favorable predictions of the horizontal global solar radiation in the considered case study, the presence of extraterrestrial solar radiation plays a remarkable role in attaining further accuracy as achieved by model (3).
Thus, to draw more appropriate conclusions, in the following, the proficiency of the SVM-FFA (3) model is more assessed compared to the ANN (3), GP (3), and ARMA (3) models.
The capability of the SVM-FFA (3) for monthly mean global solar radiation estimation in comparison with ANN (3), GP (3), and ARMA (3) can be shown by depicting the predicted values against the measured data. Figure 2a–d illustrates the scatterplots between the measured and the computed global solar radiation values via SVM-FFA (3), ANN (3), GP (3), and ARMA (3), respectively, for the training data set. It is observed that for SVM-FFA (3) as the slope of the straight line, according to Fig. 2(a), is nearly close to one, the number of either overestimated or underestimated values produced are really limited. Consequently, it is obvious that the predicted values by SVM-FFA (3) enjoy the highest level of precision. Whereas Fig. 2b–d shows that the amount of deviations of predicted data points by ANN (3), GP (3), and ARMA (3) are really higher which demonstrate the lower rate of correlation between the measured and the estimated values.
Figure 3a–d, in the form of scatterplot, shows the predicted horizontal daily global solar radiation values, respectively by SVM-FFA (3), ANN (3), GP (3), and ARMA (3) against the measured ones for the testing data set. It is clear that there are very favorable agreements between the estimated values by SVM-FFA (3) and the measured global solar radiation data. This proves the great merit of the SVM-FFA approach for prediction of monthly mean horizontal global solar radiation.
To provide more assessments on the accuracy of SVM-FFA approach, the ratios of estimated global solar radiation by SVM-FFA (3), ANN (3), GP (3), and ARMA (3) to the measured data were computed for the testing data set and the achieved results are presented as histogram plots in Fig. 4a–d, respectively. Histogram is a useful diagram to represent the probability occurrence of a given variable in any specific interval. Figure 4a–d shows the histogram of the number of months in different intervals of the computed ratios of data. It is observed that for SVM-FFA (3), 47 out of 48 months considered as the testing data set fall in the range of 0.9 to 1.1 which is a further validation to show the low errors and high potential of SVM-FFA approach in estimating the monthly mean horizontal global solar radiation.
In this part, to further verify the potential of the developed SVM-FFA (3) model to predict monthly mean global solar radiation, its capability is compared with the two well-known and widely used empirical models using relatively similar input parameters as inputs. For this aim, the Abdalla (1994) and Ododo et al. (1995) models have been established utilizing the traditional statistical regression technique and the used data sets of this study, respectively as
It is noticed that extraterrestrial solar radiation as a significant parameter plays a role in both models. For the Abdalla (1994) model (i.e., Eq. (15)), the attained statistical indicators are MAPE = 6.8004 %, RMSE = 0.4118 kWh/m2, RRMSE = 8.2738 %, and R 2 = 0.8436. Also, for the Ododo et al. (1995) model (i.e., Eq. (16)), the statistical parameters are achieved as MAPE = 6.7960 %, RMSE = 0.4050 kWh/m2, RRMSE = 8.1371 %, and R 2 = 0.8475.
Comparing these statistical indicators with those presented in Table 3 reveals that the predicted global solar radiation values by the SVM-FFA (3) are much closer to the measured data than those obtained by these two empirical models. In fact, based on the values of MAPE, RMSE, and RRMSE, it is noticed that more than two times more accuracy can be achieved by SVM-FFA (3) compared to these two empirical models. These comparisons prove the merit of the SVM-FFA (3) over the traditional empirical models using relatively similar input parameters.
The month by month comparison between the measured and the estimated global solar radiation on a horizontal surface via SVM-FFA (3) for all 48 months used as the testing data set is illustrated in Fig. 5.
6 Conclusions
The application of hybrid approaches to predict the global solar radiation is being growing rapidly owing to the fact that they take the advantages of different approaches, which eventuates in boosting the accuracy. In this study, using the combination of the SVM and FFA, a new model named SVM-FFA is proposed for prediction of monthly mean daily horizontal global solar radiation. As a case study, long-term measured horizontal global solar radiation and different meteorological parameters for port of Bandar Abbass situated in south costal region of Iran were used to evaluate the suitability of the new hybrid approach. The performance of the proposed approach was assessed by comparing its capability with ANN, GP, and ARMA approaches via different statistical techniques. By analyzing the possibility of utilizing various combinations of meteorological parameters as inputs, three metrological-based models were established using each approach. The results indicated that the model (3) using the combination of relative sunshine duration, difference between maximum and minimum air temperatures, relative humidity, water vapor pressure, average temperature as well as extraterrestrial solar radiation as inputs performed best based upon all approaches. This analysis proved the indispensible significance of extraterrestrial solar radiation to obtain higher accuracy in estimation.
It was conclusively found that the proposed hybrid SVM-FFA approach is highly efficient in estimating the monthly mean daily horizontal global solar radiation. According to the statistical indicators and one by one comparison of models (1)–(3), it was apparently found that SVM-FFA approach enjoys superior performance compared to the ANN, GP, and ARMA techniques. The order of model’s accuracy based on the model (3) as the best model of each approach was SVM-FFA (3) > GP (3) > ANN (3) > ARMA (3). In fact, the hybrid SVM-FFA represented very higher preciseness compared to others while the performance’s difference between GP, ANN, and ARMA was insignificant. The achieved statistical indicators for SVM-FFA (3) were MAPE = 3.3252 %, RMSE = 0.1859 kWh/m2, RRMSE = 3.7350 %, and R 2 = 0.9737. On the basis of RRMSE, the SVM-FFA (3) showed an excellent performance. Furthermore, by computing the ratio of estimated to the measured solar radiation values, it was found that for SVM-FFA (3), 47 out of 48 months considered as testing data set fall in the range of 0.9 to 1.1 which is a further verification for the merit of SVM-FFA approach. In the final analysis, two widely used empirical models of Abdalla (1994) and Ododo et al. (1995), using relatively similar input parameters, were established based on used data series of this study. By providing statistical comparisons, it was concluded that SVM-FFA (3) shows absolute superiority over empirical models.
To summarize, the study results strongly advocate the feasibility of utilizing the new hybrid SVM-FFA model to obtain further accuracy in estimating the monthly mean horizontal global solar radiation.
Change history
09 March 2020
The Editor-in-Chief has retracted this article [1] because validity of the content of this article cannot be verified. This article showed evidence of peer review and authorship manipulation.
References
Abdalla YAG (1994) New correlation of global solar radiation with meteorological parameters for Bahrain. Int J Sol Energy 16:111–120
Alam S, Kaushik SC, Garg SN (2009) Assessment of diffuse solar energy under general sky condition using artificial neural network. Appl Energy 86:554–564
Amiri B, Hossain L, Crawford JW, Wigand RT (2013) Community detection in complex networks: multi-objective enhanced firefly algorithm. Knowl-Based Syst 46:1–11
Asefa T, Kemblowski M, McKee M, Khalil A (2006) Multi-time scale stream flow predictions: the support vector machines approach. J Hydrol 318:7–16
Bannani FK, Sharif TA, Ben-Khalifa AOR (2006) Estimation of monthly average solar radiation in Libya. Theor Appl Climatol 83:211–215
Benghanem M, Mellit A (2014) A simplified calibrated model for estimating daily global solar radiation in Madinah, Saudi Arabia. Theor Appl Climatol 115:197–205
Bhardwaj S, Sharma V, Srivastava S, Sastry OS, Bandyopadhyay B, Chandel SS et al (2013) Estimation of solar radiation using a combination of hidden markov model and generalized fuzzy model. Sol Energy 93:43–54
Bojic I, Podobnik V, Ljubi I, Jezic G, Kusek M (2012) A self-optimizing mobile network: auto-tuning the network with firefly-synchronized agents. Inform Sci 182:77–92
Chen JL, Li GS (2014) Evaluation of support vector machine for estimation of solar radiation from measured meteorological variables. Theor Appl Climatol 115:627–638
Chen JL, Liu HB, Wu W, Xie DT (2011) Estimation of monthly solar radiation from measured temperatures using support vector machines—a case study. Renew Energy 36:413–420
Collobert R, Bengio S (2000) Support vector machines for large-scale regression problems. Institut Dalle Molle d’Intelligence Artificelle Perceptive (IDIAP), Martigny, Switzerland, Tech. Rep. IDIAP-RR-00-17.
Dahmani K, Dizene R, Notton G, Paoli C, Voyant C, Nivet ML (2014) Estimation of 5-min time-step data of tilted solar global irradiation using ANN (artificial neural network) model. Energy 70:374–381
Duffie JA, Beckman WA (2006) Solar engineering of thermal processes, 3rd edn. John Wiley & Son, New York
Flores JL, Karam HA, Filho EPM, Filho AJP (2015) Estimation of atmospheric turbidity and surface radiative parameters using broadband clear sky solar irradiance models in Rio de Janeiro-Brasil. Theor Appl Climatol. doi:10.1007/s00704-014-1369-7
Gueymard CA (2014) A review of validation methodologies and statistical performance indicators for modeled solar radiation data: towards a better bankability of solar projects. Renew Sustain Energy Rev 39:1024–1034
http://en.wikipedia.org/wiki/Bandar Abass. Accessed 20 Aug 2014
Huang C, Davis L, Townshend J (2002) An assessment of support vector machines for land cover classification. Int J Remote Sen 23(4):725–749
Huang J, Korolkiewicz M, Agrawal M, Boland J (2013) Forecasting solar radiation on an hourly time scale using a coupled auto regressive and dynamical system (CARDS) model. Sol Energy 87:136–149
Ji Y, Sun S (2013) Multitask multiclass support vector machines: model and experiments. Pattern Recogn 46(3):914–924
Joachims T (1998) Text categorization with support vector machines: learning with many relevant features. Springer
Kalogirou SA (2009) Solar energy engineering: processes and systems. 1st ed. Elsevier Inc
Kıran MS, Özceylan E, Gündüz M, Paksoy T (2012) A novel hybrid approach based on Particle Swarm Optimization and Ant Colony Algorithm to forecast energy demand of Turkey. Energy Convers Manage 53:75–83
Kisi O (2014) Modeling solar radiation of Mediterranean region in Turkey by using fuzzy genetic approach. Energy 64:429–436
Kottek M, Grieser J, Beck C, Rudolf B, Rubel F (2006) World map of the Koppen-Geiger climate classification updated. Meteorol Z 15(3):259–263
Li MF, Tang XP, Wu W, Liu HB (2013) General models for estimating daily global solar radiation for different solar radiation zones in mainland China. Energy Convers Manage 70:139–148
Linares-Rodriguez A, Ruiz-Arias JA, Pozo-Vazquez D, Tovar-Pescador J (2013) An artificial neural network ensemble model for estimating global solar radiation from Meteosat satellite images. Energy 61:636–645
Lu WZ, Wang WJ (2005) Potential assessment of the “support vector machine” method in forecasting ambient air pollutant trends. Chemosphere 59:693–701
Moghaddamnia A, Remesan R, Hassanpour Kashani M, Mohammadi M, Han D, Piri J (2009) Comparison of LLR, MLP, Elman, NNARX and ANFIS Models—with a case study in solar radiation estimation. J Atmos Sol-Terr Phys 71:975–982
Mohammadi S, Mozafari B, Solimani S, Niknam T (2013) An Adaptive Modified Firefly Optimisation Algorithm based on Hong's Point Estimate Method to optimal operation management in a microgrid with consideration of uncertainties. Energy 51:339–348
Mohammadi K, Shamshirband S, Anisi MH, Alam KA, Petkovic D (2015a) Support vector regression based prediction of global solar radiation on a horizontal surface. Energy Convers Manage 91:433–441
Mohammadi K, Shamshirband S, Tong CW, Alam KA, Petkovic D (2015b) Potential of adaptive neuro-fuzzy system for prediction of daily global solar radiation by day of the year. Energy Convers Manage 93:406–413
Mohammadi K, Shamshirband S, Tong CW, Arif M, Petkovic D, Ch S (2015c) A new hybrid support vector machine–wavelet transform approach for estimation of horizontal global solar radiation. Energy Convers Manage 92:162–171.
Mora-López L, Sidrach-de-Cardona M (1998) Multiplicative ARMA models to generate hourly series of global irradiation. Sol Energy 63:283–291
Mostafavi ES, Saeidi Ramiyani S, Sarvar R, Izadi Moud H, Mousavi SM (2013) A hybrid computational approach to estimate solar global radiation: an empirical evidence from Iran. Energy 49:204–210
Mubiru J, Banda E (2007) J K B (2007) Performance of empirical correlations for predicting monthly mean daily diffuse solar radiation values at Kampala. Uganda Theor Appl Climatol 88:127–131
Mubiru J, Banda EJKB, D’Ujanga F, Senyonga T (2007) Assessing the performance of global solar radiation empirical formulations in Kampala, Uganda. Theor Appl Climatol 87:179–184
Mukkamala S, Janoski G, Sung A (2002) Intrusion detection using neural networks and support vector machines. in Neural Networks IJCNN'02. Proceedings of the 2002 International Joint Conference on IEEE
Ododo JC, Sulaiman AT, Aidan J, Yguda MM, Ogbu FA (1995) The importance of maximum air temperature in the parameterization of solar radiation in Nigeria. Renew Energy 6:751–763
Olatomiwa L, Mekhilef S, Shamshirband S, Mohammadi M, Petkovic D, Sudheer Ch (2015) A support vector machine–firefly algorithm-based model for global solar radiation prediction. Sol Energy 115:632–644
Ozgoren M, Bilgili M, Sahin B (2012) Estimation of global solar radiation using ANN over Turkey. Expert Syst Appl 39:5043–5051
Poursalehi N, Zolfaghari A, Minuchehr A, Moghaddam HK (2013) Continuous firefly algorithm applied to PWR core pattern enhancement. Nucl Eng Des 258:107–115
Rajasekaran S, Gayathri S, Lee TL (2008) Support vector regression methodology for storm surge predictions. Ocean Eng 35(16):1578–1587
Ramedani Z, Omid M, Keyhani A, Shamshirband S, Khoshnevisan B (2014) Potential of radial basis function based support vector regression for global solar radiation prediction. Renew Sustain Energy Rev 39:1005–1011
Rizwan M, Jamil M, Kirmani S, Kothari DP (2014) Fuzzy logic based modeling and estimation of global solar energy using meteorological parameters. Energy 70:685–691
Russo M, Leotta G, Pugliatti PM, Gigliucci G (2014) Genetic programming for photovoltaic plant output forecasting. Sol Energy 105:264–273
Salcedo-Sanz S, Casanova-Mateo C, Pastor-Sánchez A, Sánchez-Girón M (2014) Daily global solar radiation prediction based on a hybrid Coral Reefs Optimization–Extreme Learning Machine approach. Sol Energy 105:91–98
Şenkal O, Kuleli T (2009) Estimation of solar radiation over Turkey using artificial neural network and satellite data. Appl Energy 86:1222–1228
Shamim MA, Bray M, Remesan R, Han D (2015) A hybrid modelling approach for assessing solar radiation. Theor Appl Climatol. doi:10.1007/s00704-014-1301-1
Shamshirband S, Petković D, Saboohi H, Anuar NB, Inayat I, Akib S et al (2014) Wind turbine power coefficient estimation by soft computing methodologies: comparative study. Energy Convers Manage 81:520–526
Sudheer C, Sohani SK, Kumar D, Malik A, Chahar BR, Nema AK et al (2014) A support vector machine-firefly algorithm based forecasting model to determine malaria transmission. Neurocomputing 129:279–288
Sun S (2013) A survey of multi-view machine learning. Neural Comput Appl 23:2031–2038
Sung AH, Mukkamala S (2003) Identifying important features for intrusion detection using support vector machines and neural networks. in Applications and the Internet, 2003. Proceedings. 2003 Symposium on IEEE
Tulcan-Paulescu E, Paulescu M (2008) Fuzzy modelling of solar irradiation using air temperature data. Theor Appl Climatol 91:181–192
Vapnik V (2000) The nature of statistical learning theory: springer
Vapnik VN, Vapnik V (1998) Statistical learning theory. Wiley, New York
Vapnik V, Golowich SE, Smola A (1996) Support vector method for function approximation, regression estimation, and signal processing. Advances in neural information processing systems 281–87
Voyant C, Muselli M, Paoli C, Nivet ML (2012) Numerical weather prediction (NWP) and hybrid ARMA/ANN model to predict global radiation. Energy 39:341–355
Wu J, Chan CK (2011) Prediction of hourly solar radiation using a novel hybrid model of ARMA and TDNN. Sol Energy 85:808–817
Wu KP, Wang SD (2009) Choosing the kernel parameters for support vector machines by the inter-cluster distance in the feature space. Pattern Recogn 42(5):710–717
Wu Z, Du H, Zhao D, Li M, Meng X, Zong S (2012) Estimating daily global solar radiation during the growing season in Northeast China using the Ångström–Prescott model. Theor Appl Climatol 108:495–503
Wu J, Chan CK, Zhang Y, Xiong BY, Zhang QH (2014) Prediction of solar radiation with genetic approach combing multi-model framework. Renew Energy 66:132–139
Yadav AK, Chandel SS (2014) Solar radiation prediction using Artificial Neural Network techniques: a review. Renew Sustain Energy Rev 33:772–781
Yang XS (2010) Firefly algorithm, stochastic test functions and design optimisation. Int J Bio-Inspired Comput 2:78–84
Yang H, Huang K, King I, Lyu MR (2009) Localized support vector regression for time series prediction. Neurocomputing 72(10):2659–2669
Acknowledgments
The authors would like to thank the University of Malaya for the research grants allocated for this project, i.e. the University of Malaya Research Grant (RP015C-13AET). Special appreciation is also credited to the Malaysian Ministry of Education (MOE) for the Fundamental Research Grant Scheme (FP053-2013B).
Author information
Authors and Affiliations
Corresponding author
Additional information
The Editor-in-Chief has retracted this article because validity of the content of this article cannot be verified. This article showed evidence of peer review and authorship manipulation. Shahaboddin Shamshirband disagrees with this retraction. Authors Abdullah Kasra Mohammadi, Chong Wen Tong, Mazdak Zamani, Shervin Motamedi, and Sudheer Ch have not responded to correspondence about this retraction.
Appendix
Appendix
The extraterrestrial solar radiation on a horizontal surface (Ra) is expressed as (Duffie and Beckman 2006; Kalogirou 2009)
where Gsc is the solar constant which based upon the new assessments reported by Intergovernmental Panel on Climate Change (IPCC) is assumed equal to 1361.5 W/m2 (www.ipcc.ch/report/ar5/wg1) and nday is the average day of each month (Duffie and Beckman 2006). δ and ωs are the daily solar declination and sunset hour angles, respectively, as (Duffie and Beckman 2006)
The maximum possible sunshine duration (N) is (Duffie and Beckman 2006; Kalogirou 2009)
About this article
Cite this article
Shamshirband, S., Mohammadi, K., Tong, C.W. et al. RETRACTED ARTICLE: A hybrid SVM-FFA method for prediction of monthly mean global solar radiation. Theor Appl Climatol 125, 53–65 (2016). https://doi.org/10.1007/s00704-015-1482-2
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00704-015-1482-2