Keywords

1 Introduction

Renewable energy resources are the major competitor of the fossil fuels such as coal, gas, and petroleum. Fossil fuel depletes with time, moreover, the resources are available in some regions around the world. Wind power is an indirect solar potential, which is clean, freely available, environmentally friendly, widely distributed, and naturally abundant almost anywhere around the globe. It has been applied decades ago for sailing ships, machine grinding, windmills, and crop handling. Recently, it has become popular for electrical power generation.

The development of wind energy has reached a large scale in terms of annual installed capacity. Large-scale wind farms are linked to the electrical power transmission lines; meanwhile, small wind turbines rated from few watts up to 10 kW are being used for stand-alone application to provide electricity to the isolated, remote, and rural locations [13]. Notwithstanding, many professionals have claimed that the wind potential remains unharnessed. In fact, 2014 is another remarkable year for wind turbine installation. Currently, wind turbine producers are yet to meet up with the present demands [4, 5].

Wind resource assessment (WRA), micrositing and sizing of wind turbines are prerequisite requirements that must be performed during the technical feasibility stage. The essential aspect that needs to be considered is the stochastic and unpredictable nature of wind speed. It is well known that a small deviation of wind speed will lead to a large error in the wind power output [6]. Because of this, wind speed prediction is essential for analyzing the performance of wind turbine system.

Many published studies have demonstrated the usefulness of wind speed and direction prediction models and this helps in siting of wind energy systems. Those techniques are usually classified into persistence, numeric weather prediction (NWP), statistical, stochastic, and method based on soft computing (artificial neural network (ANN) and fuzzy logic).

In the persistence approach, the accuracy drops with an increase in prediction time horizons. NWP method was based on the kinematics equations, which involve meteorological parameters. Examples, AIOLOS mass-consistent code numerical models [7]. COMPLEX and NOABL models were tested in the united kingdom (UK) [8]. Modified AIOLOS based on thermal stratification [9]. Multigrid solution via three-dimensional solver [10]. Moreover, according to [7], on the account of unpredictable nature of wind speed, it is difficult to generate a reliable algorithm that will take into account of all those irregularities, with acceptable accuracy. Furthermore, it has found that no mathematical model either physical or numeric will give a perfect definitive solution [11]. Meteorologist ordinarily uses the method to predict future samples.

A statistical and stochastic strategy mainly aims short samples. In the era of soft computing, neural network (NN), support vector machine (SVM), simulated annealing (SA) and fuzzy logic are found to be more appropriate [1216].

On the other hand, in order to guarantee the prediction reliability for nonlinear time series, these techniques have some difficulties such as the problems in selecting input parameters, the complexity of computation time, and so on. In particular, one of the issues of fuzzy logic is the difficulty to determine the precise weights and the degree of each rule; moreover, it involves fuzzy sets and interval numbers. The application of ANN presents a series of advantages such as self-adaptive, parallelism, simple structure, rapid training speed, fast convergence fault tolerance. Based on these advantages, the ANN model has been widely formulated for various nonlinear applications [17]. The application of NN for the prediction of wind speed varies depending on the prediction timescale, that is, from short term up to long term. Many researchers [4, 1821] have applied different forms of NN for wind speed prediction using different meteorological and geographical data. However, none of the listed studies have considered the influence of terrain shape and roughness changes. Nowadays, special attention has been put into the prediction of nonlinear and nonstationary time-varying data using ANNs, because of their widespread approximation functionality. It is generally acknowledged that ANN is the most suitable for the wind speed prediction [2, 22, 23]. The main objective of this paper is to present a novel ANN models named topographical feedforward neural network (T-FFNN) for predicting monthly wind speed and direction for Sarawak is proposed.

This research was carried out based upon the obtainable average hourly data at eight ground stations under the control of the Malaysia Meteorological Department (MMD). The rest of the paper is structured as follows. First, in depth wind speed and direction prediction using an extended proposed ANN and geographical information system (GIS) assisted methodology are discussed. Subsequently, the predicted and developed wind map results are analyzed. Lastly, purposeful findings are drawn.

2 Wind Speed and Direction Prediction Using ANN

Wide application of wind energy system necessitates accurate and precise estimation of wind speed/direction, which has a direct effect on the energy output. Traditionally, this can only be possible by using anemometer and wind vane to carry out experiments. However, in rural and remote areas, where obstacle-free wind resources are available, these devices or wind monitoring stations are not available. The goal of this paper is to provide a solution for estimating the abundant resource based on the knowledge of existing wind station in Sarawak, Malaysia. In fact, prediction tool is one of the best approaches for estimating wind speed and direction in non-monitored locations, which can help in designing of the wind energy systems, prior to microsizing and siting.

In wind engineering, ANNs have been widely used for the prediction of wind speed where a wind turbine is expected to operate [2, 22, 23].

ANNs are mathematical techniques constructed to perform a variety task, such as incremental learning, pattern recognition, process control data mining prediction, and financial modeling. They learn by examples and produce a future unseen data. ANN mimics human being brain system, which consist of layers of parallel element unit called neurons.

The neurons are connected to large number of weights over which the signal is passing; the neuron receives the input through the input layer and multiple by a weight generally performs nonlinear operations and produce an output. The most successful ANN used for the prediction is feedforward neural network (FFNN) using log-sigmoid [14, 15]. For these reasons, one hidden layer FFNN with backpropagation was adopted in this paper to examine the complex dependency of wind speed in the locations where wind station is not available, taking terrain shape and roughness height into consideration.

Theoretically, it has been validated that a single layer design is satisfactory to approximate any nonlinear problems [5, 24]. Figure 1 shows a fully connected three layers topology of the proposed model for wind speed and direction prediction. Fully connected implies that the output from each of the input and hidden layer is distributed to all of the neurons in the subsequent layer. However, feedforward signifies that the network has not any directed cycle.

Fig. 1
figure 1

T-FFNN for wind speed/direction prediction

Fig. 2
figure 2

Log-Sigmoid and purelin activation functions

Although, there is no restriction regarding the adoption of transfer functions, it can be any mathematical function such as tangent, hyperbolic, logarithm, or combination of both like log-sigmoid, hyperbolic tangent, or Purelin. In this paper, log-sigmoid and Purelin (Fig. 2, [2]), activation functions were selected in order to obtain the differential function between the output and input variables, whose mathematical formulas are expressed in Eqs. 1 and 2 accordingly [25].

$$\begin{aligned} f(x)=\frac{1}{1+e^{-x}} \end{aligned}$$
(1)
$$\begin{aligned} f(x)=x \end{aligned}$$
(2)

Yet another most important phase in the course of designing an ANN is the training phase, simply because ANN is trained to solve a problem instead of programmed to accomplish this. The supervised training was adopted in this study. Monthly average data were feed in and the network learned by comparing the measured with the estimated. The error difference is propagated back from the output layer via hidden layer to the input layer and the weights on the connection between the neurons are updated as the error is backpropagated using Levenberg–Marquardt (LM). The goal of selecting LM was to assure speedy processing and furthermore to overwhelm the slow convergence associated with the conventional algorithm such as descent gradients and resilient propagation.

FFNN can mathematically approximate multivariate function to any level of accuracy if an adequate number of the hidden layer neurons are available. To avoid underfitting and overfitting in determining the number of optimal neurons in the hidden layer. This research work considered that the number of weights must not exceed the data used for the training. Hence, 9-7-1 topology was used with a step of five in all the designed models.

A MATLAB editor was used to the write the scripts file and implemented in the NN Toolbox. The developed models have 9 inputs latitude, longitude, altitude, and month of the year, temperature, atmospheric pressure, temperature and relative humidity, terrain shape and roughness height. Meanwhile, the output is the monthly wind speed for the wind speed prediction. In the case of wind direction, as the objection function (model 2). The network has latitude, longitude, altitude and month of the year, temperature, atmospheric pressure, temperature, and wind speed, terrain shape and roughness height. After the ANN training, the network was simulated to get the weights /biases, and can be applied to formulate the mathematical function.

The appropriateness of the designed models was assessed according to two statistical methods, the correlation coefficient (R) and mean absolute percentage error (MAPE). These values are mathematically described by the following equation [18, 26].

$$\begin{aligned} R=\frac{\displaystyle \mathop {\sum }_{i=1}^N {\left( {t_i -\overline{t} } \right) } .\left( {o_i -\overline{o} } \right) }{\sqrt{\displaystyle \mathop {\sum }_{i=1}^N {\left( {t_i -\overline{t} } \right) } ^{2}.\sqrt{\displaystyle \mathop {\sum }_{i=1}^N {\left( {o_i -\overline{o} } \right) ^{2}} }}} \end{aligned}$$
(3)
$$\begin{aligned} MAPE(\% )=\left( {\frac{1}{N}\displaystyle \mathop {\sum }_{i=1}^N {\left| {\frac{t_i -o_i }{t_i }} \right| } } \right) 100 \end{aligned}$$
(4)

where N is the number of data, and \(t_{i}\), \(o_{i}\) are target value and ANN predicted value, respectively, of one data point i. The bars indicate the average value.

3 Isovent Lines Mapping and Maps Production

Wind energy mapping of the study area was generated using ARCGIS 9.3 software. The base maps of Sarawak at 1: 1,25,000 were digitized in order to obtain the shape files. The coordinate of the ground and predicted station were converted from degrees, minutes, seconds to the decimal unit. The World Geodetic System (WGS) of 1984 was applied for the definition of the coordination system and it was used in generating contour lines. Series of interpolation were conducted to select the best method that fits the wind speed and energy data of the studied area. It was found that Kriging method provides better accuracy for the spatial analysis carried out. This methodology has been verified in various studies [27, 28].

4 Results and Discussion

The employed data consist of 3650 daily records from Kuching, Miri, Sibu, Bintulu, and Sri Aman for the period of ten years (2003–2012). In addition, for the remaining stations, Kapit, Limbang and Mulu, the period of observation is 5 years (2008–2012). For the first five stations, the data were segmented from 2003–2009, 2010–2011, and 2012. For the last three stations, 2008–2009, 2010–2011, and 2012 for the training, testing, and validation. Prior to the training, all the data employed were scaled to the range of [–1, 1].

4.1 Topographical Simulation Models

Forty networks were designed and trained. The training was performed according to the mean squared error (MSE). It was identified that the optimum network in terms of fast convergence and lower MSE was 9-152-1 and 9-94-1 for the wind speed and direction models. The number of epochs was varied from 0 to 1000 in a step of one. It was realized that no further improvement could be made once the training reaches between 996 and 1000 epochs. Hence, the training was stopped. Figure 3 shows the error model function obtained during the training of one station. It can be noted from the figure that the after 1000 epochs the MSE reduces drastically between the ANN and measured data. The realized MSE was 0.043821. It should be observed that extended training would result in the ANN to remember training data, which ends up in poor generalization capability of the ANN model.

Pertaining to the data normalization, activation function used in the hidden layer and by using the weights and biases generated after the training, the equation for calculating wind speed and direction in the case of one station becomes:

$$\begin{aligned} v_m =1.098\left[ {\left\{ {\left( {\frac{2.907}{1+e^{-2.907(XW_1 +B_1 )}}} \right) W_2 +B_2 } \right\} +1} \right] +0.337 \end{aligned}$$
(5)
$$\begin{aligned} w_D =0.035\left[ {\left\{ {\left( {\frac{0.072}{1+e^{-0.062(XW_1 +B_1 )}}} \right) W_2 +B_2 } \right\} +1} \right] +10.789 \end{aligned}$$
(6)

where \(V_{m}\), \(W_{D}\): are the monthly wind speed/direction, \(W_{1}\): Weight between the input and hidden layer, \(W_{2}\): Weight between the hidden layer and output layer, \(B_{1}\): Biases of the hidden layer, \(B_{1}\): Biases in the output layer \(B_{2}\), X: is the column vector, which contains normalized values of nine input variables.

Fig. 3
figure 3

Sample of the error function during the training (color figure online)

In accordance with the results displayed in Table 1. The suggested models have the satisfactory precision for calculating wind speed and direction. It is interested to observe that, the T-FFNN models showed improved outcomes on the test data, which verifies the high generalization of the development process. For this reason, the formulated models can be utilized efficiently to examine the relation between geographical, topographical, and meteorological variables on the wind speed and direction.

To this aim, Eq. 5 was plotted to generate a contours map response of wind speed for change in terrain condition (the shape of the terrain and roughness class) see Fig. 4. It is clear that, the terrain varies depending on the location, and wind speed is affected by the terrain and changes with the roughness of the terrain such as forest, lake, city, etc. From the figure, it is noticeable that the changes are more pronounce at higher wind speed, however, wind speed in the range of 1 m/s to about 2.2 m/s (class from 0–5 and 5–10) has limited effect on the wind flow within the studied area.

Table 1 T-FFNN prediction accuaracy
Fig. 4
figure 4

Interaction effect between wind speed and terrain elevation at \({{\varvec{v}}}={{\varvec{1}}}{-}{{\varvec{3}}}\) m/s

4.2 Wind Atlas Map of Sarawak at Various Elavations

The isovents map of Sarawak was developed based on the procedures discussed in the previous section. The generated map was based on the long-term measured and predicted wind speed at 10–40 m in heights. Sample of the wind speed, power, and energy density maps are depicted in Figs. 5, 6 and 7 at 10 m in elevation. In all the listed figures, the areas marked red and blue represent highest and lowest potentials respectively.

Fig. 5
figure 5

Long-term wind speed map of Sarawak (color figure online)

Fig. 6
figure 6

Long-term power density map of Sarawak (color figure online)

Fig. 7
figure 7

Long-term energy density map of Sarawak (color figure online)

5 Conclusion

In this paper, the idea of a new wind speed and direction prediction modeling based on the NN considering terrain shape is introduced. The proposed T-FFNN has been analyzed mathematically. The prediction models were used in conjunction with ground-based wind station to construct an isovents wind map of Sarawak that will be useful to policymakers, wind turbine manufacturers, potential investor and structural and bridge engineers.