1 Introduction

Forecasting seasonal rainfall a few months in advance is potentially valuable for a vast range of decision-makers and users, including irrigators and urban and rural water managers, to devise risk management strategies. It is also essential for the regions with high spatial and temporal rainfall variability. However, forecasting rainfall is complex due to participating in many variables such as temperature, pressure, climate drivers, etc. In the past decades, Artificial Intelligence (AI) based methods were introduced to overcome the complexity of rainfall forecasting. ANNs are one of the AI-based methods that have been broadly used for forecasting rainfall (Pakdaman et al. 2020; Poonia and Tiwari 2020; Yadav and Sagar 2019). For example, Babel et al. (2015) developed three ANN models with various explanatory variables for forecasting rainfall in Mumbai, India. ANN was used for post-processing the monthly precipitation forecast by Pakdaman et al. (2020). They showed that the performance of ANN is superior compared to the multi-model ensemble (NMME) models for all months.

However, sometimes, the ANN is integrated with other techniques to improve the results and accuracy of the models. There have been many attempts to improve rainfall prediction by developing novel hybrid techniques. For example, recently, a novel post-processing technique was proposed by Yazdandoost et al. (2020) that can improve the conventional method's performance over the monthly and seasonal predictions. There is another successful effort in using hybrid techniques in forecasting rainfall. A new hybrid ANN was developed to improve the performance of the ANN in forecasting rainfall by Abdel-Kader et al. (2021). A Hybrid Adaptive Neuro-Fuzzy Inference System was suggested to evaluate temporal rainfall variability by Farrokhi et al. (2020). Diop et al. (2020) established a hybrid Multilayer Perceptron-Whale Optimization Algorithm (MLP-WOA) model for forecasting annual rainfall. They found that the proposed method improves the performance of the Multilayer Perceptron (MLP).

The wavelet transform is one of the techniques that help researchers to investigate climatological events precisely and help them have better insight into these phenomena. This technique can capture useful hidden information of weather parameters such as rainfall and climate drivers, which cannot be easily seen in the main time series. He et al. (2015) conducted appreciable research to forecast monthly rainfall in Australia using a hybrid wavelet ANN. Their study included the processing and input selection methodologies. Sharghi et al. (2018) applied a wavelet ANN technique for forecasting daily and monthly rainfall in two watersheds. They showed that the WANN could enhance the prediction of the simple feed-forward neural network up to 50%. A model using the integration of wavelet transform and convolutional neural network (CNN) was developed for forecasting monthly and daily rainfall by Chong et al. (2020). They showed that their proposed model captures patterns of the rainfall time series satisfactorily. Recently, Ghamariadyan and Imteaz (2021a) developed a wavelet-ANN model for forecasting monthly rainfall in Queensland, Australia, with up to twelve-month lead time.

The Australian climate is variable compared to other parts of the world (Bagirov and Mahmood 2018). It has been changing in recent decades, and researches indicate that it has undergone higher temperature and less rainfall. The El Nino–Southern Oscillation (ENSO) and Indian Ocean Dipole (IOD) are the most significant anomalies that cause rainfall variability in Australia (Hasan and Dunn 2012). The ENSO is mainly measured using the southern oscillation index (SOI) (Abtew and Trimble 2010). In addition, IPO is another influential index that is often defined as exhibiting an “ENSO-like” decadal pattern (Power et al. 1999). The IPO is measured by variations in the sea surface temperature in the Pacific Ocean.

There are few studies on combined and lagged effects of climate indices on rainfall (Ghamariadyan and Imteaz 2021b; Mekanik et al. 2013; Schepen et al. 2012). It is noteworthy that understanding the relationships between rainfall and different climate indices can help to have a better insight to predict future rainfall.

The contribution of this study is to develop a coupled wavelet-ANN for predicting seasonal rainfall in Queensland. Also, results are compared with the conventional methods (such as MLR, ARIMA), the current prediction system of the Bureau of Meteorology (BOM) of Australia, and climatology forecasts.

2 Methodology

A summary of the procedure used in the study is shown in Fig. 1. A correlation analysis is conducted in the first step, and then the best input set is selected according to the correlation analysis. Next, the various models are developed, and the best input set for each model is identified based on the RMSE value. In the last step, the performance of WANN is compared with other methods using various evaluation criteria.

Fig. 1
figure 1

Schematic diagram of different models development and comparison

The methodology for forecasting seasonal rainfall in this study is mainly based on the WANN and ANN methods. To date, seasonal rainfall forecasting in Australia has been investigated for some regions (Mekanik et al. 2016; Tozer et al. 2017). Nevertheless, no study has been conducted to predict the seasonal rainfall in Queensland. Besides, there is no versatile model to forecast the seasonal rainfall one year ahead with reasonable accuracy.

2.1 Wavelet Analysis

The concept of the wavelet theorem, primarily, was introduced as a substitute for the Fourier transforms (FT). In wavelet analysis, the primary signal is split into various sub-signals with lower resolution using shifting and scaling processes. The wavelet is shifted to align with the feature we are looking for in the signal. Then, the similarity of the shifted or stretched signal through the scaling process is measured using a factor called the wavelet coefficient (Mallat 1999).

It is noteworthy that selecting the appropriate wavelet function (mother wavelet) affects the precision of the results. The proper wavelet function is acquired through the trial and error process (Nourani et al. 2012). In the current study, Dmey (Discrete Meyer) was chosen as the main wavelet function.

In wavelet transform analysis, the signal is converted from the time domain into the time/frequency domain. It is mathematically defined as (Mallat 1999):

$$f\left(a,b\right)=\frac{1}{\sqrt{a}}{\int }_{-\infty }^{\infty }f(t)\varphi \left(\frac{t-b}{a}\right)dt$$
(1)

where, \(f(t)\) stands for the main signal, \(\varphi (t)\) is wavelet function with length (t), \(a\) and \(b\) are scale factors, and time-like translation variables, correspondingly. In the high-scale band, the wavelet is greatly stretched, while the wavelet is immensely compacted in the low-scale band.

The approximation and detail components are obtained after decomposition of the main signal at different levels (identifying levels number is based on trial and error) and are defined by

$$f\left(t\right)={D}_{1}+{D}_{2}+{D}_{3}+\dots +{D}_{i}+{A}_{i}$$
(2)

where, \(A\) and \(D\) stand for “approximation” and “detail”, respectively. \(i\) shows the level of decomposition. In this definition, the low-frequency signal (\(A\)) consists of a high-frequency and low-frequency signal. In other words, \({A}_{1}\) is a low-frequency signal that can create \({A}_{2}\) (a low-frequency signal) and \({D}_{2}\) (a high-frequency signal).

2.2 Artificial Neural Networks

ANNs are computing systems that try to imitate human brain properties by learning and training rules. An ANN comprises nodes connected by some weighted synaptic connections. An MPL neural network contains an input and output layer with one or more hidden layers. The hidden nodes’ size is obtained through trial and error, and it relies on the complexity of the problem. An MLP is mathematically defined as (Kim and Valdés 2003):

$${\widehat{y}}_{k}={f}_{0}\left[{\sum}_{j=1}^{M}{w}_{kj}.{f}_{h}\left({\sum}_{i=1}^{N}{w}_{ji}{x}_{i}+{w}_{jo}\right)+{w}_{ko}\right]$$
(3)

where, \({w}_{ji}\) is the weights of the hidden layer that links the \({i}^{th}\) neuron in the input layer with the \({j}^{th}\) neuron in the hidden layer;\({f}_{o}\) stands for the output layer’s activation function; \({w}_{jo}\) is the bias for the \({j}^{th}\) hidden neuron; \({w}_{kj}\) denotes the weight of the output layer that connects \({j}^{th}\) neuron with the \({k}^{th}\) neuron in the output layer; \({w}_{ko}\) is the bias for the \({k}^{th}\) output neuron; \({f}_{h}\) is the activation function of hidden neurons and \({\widehat{y}}_{k}\) is the neural network’s output.

For the hidden layer, the tan-sigmoid transfer function, and for the output layer, the linear purlin (\(g\)) is used. The use of sigmoidal and linear form transfer functions for hidden and output layers is recommended, respectively (Maier and Dandy 2000).

In this study, the training algorithm is the Levenberg–Marquardt (LM) algorithm. The early stop technique is used to eliminate overtraining. In this process, the training is stopped when the validation error begins to increase while the training set error is still declining. Hence, by using this procedure, overfitting does not happen. Moreover, an out-of-sample data set is selected to evaluate the performance of the model accurately.

2.3 Integrating the Wavelet with ANN

For connecting the ANN with the wavelet, the approximations (A) and details (D) acquired through the wavelet analysis at each level are used as inputs of the ANN. That means that the time series of the approximation and detail at each level is used as the input of the ANN. For instance, through the wavelet analysis, if there is only one time series as input set (model 1) and three decomposition levels are used, then the six sub-time series is applied as the network's input.

Figure 2 shows the process of connecting the wavelet and ANN. In this figure, ‘A’ and ‘D’ stand for the approximation and detail at each level. Also, ‘m’ and ‘n’ are numbers of inputs and levels, correspondingly.

Fig. 2
figure 2

A multi-level Wavelet ANN architecture with various input sets

2.4 MLR and ARIMA

In this study, the classical methods of the ARIMA and MLR are also used for further comparison. The MLR is a linear method to create a correlation between the dependent and independent variables (Brown 2014). In MLR, the stepwise technique is used, and the F-test is employed to identify the best-fitted model (Galvao et al. 2008).

The ARIMA comprises autoregressive (AR) and moving average (MA) models (Box et al. 2015). This method is a widely used and popular model in time series prediction. However, sometimes identifying the typical inherent parameters [p, d, q] is not straightforward (Islam and Imteaz 2020). In addition, The ARIMA cannot be applied directly to the non-stationary time series, and it must be converted into a stationary time series by using the time series differencing technique. In the current study, since the time series is stationary, applying the differencing technique is unnecessary. The ARIMA modeling in this study is conducted using the SPSS software. The “expert modeler” is used as a feature to determining the order of the parameters and the best model. Thereby, the values of ARIMA parameters are different for each station. Further background and information about ARIMA are available in (Box et al. 2015).

3 Data and Study Area

Queensland, in the northeast of Australia, is the second-largest and third-most-populous state of Australia. The location of the study area, along with selected rainfall stations, is shown in Fig. 3. The data of rainfall and the SOI are obtained from the Australian BOM website (http://www.bom.gov.au/climate/data/). Ten weather stations with a long record of data were chosen. The details of the selected weather stations are shown in Table 1. The Nino3.4 data is collected from the Royal Netherlands Meteorological Institute (KNMI) Climate Explorer (http://climexp.knmi.nl). Also, the data related to IPO is obtained from Chris Folland, Met Office, Hadley Centre, UK (https://www.metoffice.gov.uk).

Fig. 3
figure 3

Schematic map of Queensland State and weather stations

Table 1 Geographical location of selected rainfall sites of the study

Queensland’s rainfall is spatially and temporally variable (Knip et al. 2011). However, the wet season is considered from November to March in north Queensland (Sumner and Bonell 1988). In southeastern and central Queensland, wet and dry seasons are considered from October to March and September to April, respectively (Wilson et al. 2013). Moreover, most of the flood events in Queensland occur in summer. In this study, rainfall of summer (December to February) as the wet season is considered.

Total 109 years of data from 1908 to 2016 is used in this study. For training the models, data of the first 94 years (1908 to 2001) are used, and the remaining 15 years (2002-2016) are assigned for verifying the models to avoid any potential overfitting. In other words, the testing data set are new data that were not used previously during the training process.

In this study, eight different model sets comprising rainfall and climate indices are defined to identify the best model for each station (Table 2). Notably, using a single climate driver might be unsuitable for forecasting rainfall accurately because of the complications of the relationships between climate drivers and rainfall. Thereby, combined inputs comprised of various effective climate indices plus rainfall values are employed. Also, the individual input sets (Rainfall, IPO, SOI, and Nino3.4) are defined to investigate better the impact of a single predictor on the prediction of summer rainfall in Queensland.

Table 2 The input sets defined for each station

The models’ performances during calibration and testing are evaluated using statistical parameters such as RMSE, mean absolute error (MAE), refined index of agreement (\({d}_{r}\)) and correlation coefficient (R). According to Willmott et al. (2012), this index is more efficient for evaluating the model's accuracy than other available statistical parameters. Moreover, the Nash-Sutcliffe coefficient of efficiency (NSE) is another practical factor that reasonably measures the hydrological models’ prediction capabilities (Nash and Sutcliffe 1970). It is bounded between \(-\infty\) to \(1\) and is presented to different forms of mathematical models (Gupta and Kling 2011).

For further assessment of the models, a statistical score is used to assess the skills of the predicted model against a reference forecast like climatology prediction. The skill score, which is used for ACCESS-S, WANN, and ANN is as follows (Hawthorne et al. 2013)

$$Skill score=\frac{{RMSE}_{Ref.}-{RMSE}_{For.}}{{RMSE}_{Ref.}}*100\mathrm{\%}$$
(4)

where, \({RMSE}_{For.}\), and \({RMSE}_{Ref.}\) stand for score (RMSE) of the model forecast and a reference model (climatology), respectively. The 100% skill score shows a perfect prediction, while a zero skill score indicates that the skills of the forecasted model are the same as the climatology (reference forecast). A score skill less than zero denotes that the prediction skill score is worse than the climatology.

4 Results and Discussion

The correlation of summer rainfall with each monthly climate index is investigated to understand better the influence of an individual climate index on summer rainfall in each station. Inputs of the final models are selected based on their correlations with summer rainfall. For example, according to the correlation analysis for Cairns station, the summer rainfall is correlated with Nino3.4 from January to April (Table 3). For the remaining months, the correlation is insignificant. Therefore, the input sets comprised of Nino3.4 from January to April (four inputs for Nino3.4) are considered input values. It is apparent that correlation is insignificant between different monthly Nino3.4 with summer rainfall for Ayrshire Downs, Boulia, and Cunnamulla. However, Pitssworth’s summer rainfall correlation with Nino3.4 covers a broader range of months compared to other stations, though the correlations are not too strong.

Table 3 Pearson correlation coefficient of Nino 3.4, SOI, and IPO antecedents and summer rainfall

Moreover, the effect of SOI on the summer rainfall of Queensland is investigated in Table 3. It is evident that the summer rainfall is correlated with SOI in all stations, at least in one of the months. In Cairns, Richmond, Boulia, Isisford, and Cunnamulla, the summer rainfalls are correlated with SOI only in February.

Nonetheless, the correlation coefficients are significant for most of the months for Calliope and Pittsworth. The highest correlation value can be found in Pittsworth in April with 0.33. Similarly, the correlation analysis is done for IPO and summer rainfall in Table 3. It is evident that the correlations cover a broader range of months compared to Nino3.4 and SOI. It is seen that only in one station (Boulia), the summer rainfall is not correlated with IPO. However, for other stations, correlations are significant at least at three months. The observed summer rainfall with a one-year lead time is allocated to forecast the next year's seasonal rainfall for target values. In other words, the target values are considered as the next year’s observed summer rainfall time series in the calibration process.

Various input sets are applied to the models to assess their performance. Table 4 compares the input matrix for ANN and WANN for Cairns station as a sample. The structure of each input set for each station depends on the correlation analysis and the number of levels in the WANN method. As in this study, one lag time is considered; two time series are generated (one main time series plus one lagged time series). After considering the lagged time series, the total number of time series applied to the network would be 12, with 93 values for each one during the calibration process. For instance, in model 8, 108 time series are generated through the wavelet analysis, while only 18 time series is applied to the ANN model. This technique provides very meticulous information for training the network, which is not identifiable by ANN solely.

Table 4 Input structure of WANN and ANN for Cairns station during the calibration

4.1 Comparisons of WANN with ANN

The prediction skills of the ANN and WANN models during the calibration and testing periods are presented in Table 5. It is found that compared to the other stations, Cairns and Pleystowe show high RMSE values for both the ANN and WANN methods. However, the lower RMSEs are observed for Cunnamulla, Boulia, and Isisford through ANN and WANN, respectively. During the testing process, the RMSE values through the ANN are 78.8mm and 95.6mm for Cunnamulla and Boulia, while the RMSE values for the Cunnamulla and Isisford stations are 55.3mm and 54.6mm for WANN. It is evident that RMSE values are remarkably higher for the tropical east coast region (Cairns, Calliope, and Pleystowe weather stations), resulting from a high magnitude of summer mean rainfall. The average RMSE values through the ten stations are obtained 169.5mm and 178.9mm using ANN during calibration and testing periods, respectively. Nonetheless, the corresponding average RMSE values are 108.7mm and 112.2mm, using the WANN during calibration and testing periods. The WANN improves the prediction accuracy of the ANN by 37% over the ten selected stations in terms of RMSE values. Moreover, it is worthwhile noting that the average MAE through ANN during the testing period is 141.7mm, while it is 90.5mm through the WANN.

Table 5 Best ANN and WANN models performance

The correlation coefficients (R) through the WANN are not less than 0.72 (Boulia station), whereas those values range between 0.14 and 0.61 through the ANN. It can be seen that for Calliope, the forecasted rainfall using WANN is well correlated with observed values for the fourth model (R=0.91). It is evident that the best model through the WANN is developed using the IPO index, which is applicable in all the stations except Pleystowe and Boulia. It is evident that the IPO is a determinative factor in predicting the summer rainfall in this region. However, the ANN does not follow a specific trend regarding the suitable input set for each station.

It is observed that the values of \({\mathrm{d}}_{\mathrm{r}}\) for WANN are relatively higher (average \({\mathrm{d}}_{\mathrm{r}}\)=0.68) than the values for ANN (average \({\mathrm{d}}_{\mathrm{r}}\)=0.51) in all stations. According to the RMSE and MAE values, the superior performance of the WANN compared to the ANN is notable in all districts of Queensland.

Figure 4 shows the comparisons between WANN, ANN, and observed summer rainfall for the selected three regions of Queensland through testing sets. It is observed that the WANN models can predict the summer rainfall accurately, while the ANN is unable to forecast the observed values with the same accuracy. In Pleystowe station, the predicted rainfall through ANN shows a smooth trend, and it is hardly able to predict the seasonal rainfall values correctly. Nonetheless, the results obtained by WANN ascertain that this method can calculate extreme summer rainfall with excellent accuracy. The extreme summer rainfall throughout the testing period for Ayrshire station is obtained as 302.6mm through the ANN, while it is acquired as 423.9mm by WANN when the observed value was 572.6mm. Such accuracy benefits researchers and water resources managers to plan appropriate solutions before the occurrence of extreme seasonal rainfall.

Fig. 4
figure 4

Observed and modeled rainfall using ANN and WNN during the verification period at one-year lead time

4.2 Comparisons of WANN’s Forecast Accuracy with ACCESS-S, Climatology, ARIMA, and MLR Predictions

The accuracy of the WANN, ACCESS-S, and climatology are presented in Table 6. It is evident that ACCESS-S and climatology forecasts produce much higher RMSE and MAE values for all the stations. The highest RMSE value among all stations is related to Cairns for WANN (RMSE=307.9mm), whereas the highest values of 564.5mm and 932.3mm are acquired for the same station through ACCESS-S and climatology models, respectively. It can be seen that the WANN outperforms the ACCESS-S in forecasting the seasonal rainfall by 60% over the ten selected stations in terms of RMSE values.

Table 6 Comparison of predictions of WANN with ACCESS-S, and climatology during the testing period

The NSE values are negative through ACCESS-S and climatology for all the stations. For the WANN model, the NSE values are positive (from 0.45 to 0.79), which denotes the perfect fit of predicted and observed values. In other words, the NSE values of WANN indicate that WANN is an effective method to predict rainfall with lower errors. Figure 5 illustrates the time series values of WANN and ACCESS-S rainfall predictions along with the observed rainfalls. It is evident that the WANN prediction results are remarkably accurate than the ACCESS-S forecast results. Also, the WANN performs better in predicting the extreme seasonal rainfall, while ACCESS-S is incapable of generating exact peak values. For example, the ACCESS-S underestimates the extreme rainfall over the testing period by 73% in the Ayrshire Downs station. Also, its performances for different stations can be summarised as 62% overestimation for Calliope, 77% underestimation for Cunnamulla, 80% underestimation for Isisford, and 66% underestimation for Richmond in regards to the extreme seasonal rainfall.

Fig. 5
figure 5

Comparison of WANN and ACCESS-S predictions with observed rainfall at one-year lead time during the verification period

Table 7 shows the prediction accuracies of the ARIMA and MLR in terms of RMSE, MAE, \({\mathrm{d}}_{\mathrm{r}}\), and R are presented. It is evident that the values of RMSE through WANN modeling are much lower than the RMSEs’ by ARIMA and MLR models. The percentages of RMSE improvements by WANN are remarkably high; for Ayrshire Downs, 68% and 47% compared to MLR and ARIMA, respectively. The lowest improvements through WANN are 24% and 27% compared to MLR and ARIMA for Cairns, respectively. For all the stations, the WANN provides better results compared to the MLR and ARIMA.

Table 7 Comparison of predictions of WANN with MLR, and ARIMA during the testing period

The skill score values for the ACCESS-S, WANN, ANN, MLR, and ARIMA predictions are shown in Table 8. The skills of all three models for Cairns and Richmond have positive values, which means that those models' RMSE values are lower than the climatology forecast’s values. However, for all the stations (except Cairns and Richmond with 39% and 15%, respectively), the skill scores are negative through the ACCESS-S. This indicates that the RMSEs of the ACCESS-S are higher than the RMSE values of climatology prediction, and the prediction skill of the ACCESS-S is worse than the climatology. However, the prediction capability of the WANN is higher than the climatology forecast (lower RMSE values) for all the stations. The ANN, MLR, and ARIMA prediction skills are very close to the climatology skill for Pittsworth with 1%, 2%, and 1% of skill scores, respectively. In general, the WANN renders an excellent accuracy in terms of the average skill score for all the stations with a value of 53%, and ACCESS-S provides worse prediction compared to the climatology forecast with an average skill score of -22%. Overall, the achieved average skill scores through the ANN, ARIMA, and MLR are 24%, 20%, and 15%, respectively.

Table 8 Percentage of forecast skill score for ACCESS-S, WANN, ANN, MLR, and ARIMA during the testing period

5 Conclusion

In this study, a model called WANN was developed to forecast Queensland’s seasonal rainfall one year in advance. The results were compared with the conventional ANN, MLR, ARIMA, and the current prediction system of the BOM (ACCESS-S) and climatology forecasts. It was found that the WANN improves the ANN and ACCESS-S prediction results in terms of RMSE over the ten selected stations by 37% and 60%, respectively. However, the WANN could improve the predictions by ARIMA and MLR by 40% and 42%, correspondingly. In terms of skill score, the WANN with 53% outperforms ANN, ACCESS-S, ARIMA, and MLR with 24%, -22%, 20%, and 15%, respectively. It was also found that ACCESS-S's performance is not satisfactory regarding the average NSE (-1.8), compared to the average NSE value of 0.61 achieved by WANN. While the skill score values are negative for ACCESS-S for most stations, they are positive (higher than 0.37) through the WANN.

Since the MLR and ARIMA are linear methods, they are incapable of properly capturing the non-linear characteristic of the rainfall time series, and they are unable to accurately replicate a great range of data. Nonetheless, the WANN exceptionally outperforms the MLR and ARIMA because it can appropriately capture the non-linearity of the subject time series. It should be noted that the ANN cannot improve the prediction skills of the ARIMA and MLR. Thereby, it is seen that combining the wavelet analysis with ANN should be taken into account for the prediction of the seasonal rainfall with a low number of available data. However, in WANN implementation, the mother wavelet must be carefully selected through the trial and error process as it highly impacts the model's performance. Also, selecting the appropriate decomposition level in wavelet analysis affects the performance of the model. Therefore, careful steps must be taken in determining the number of levels and type of wavelet function in wavelet analysis.

To the best of our knowledge, no model can predict the seasonal rainfall one year in advance with such a level of accuracy. Thus, this capability can be beneficial for better management of unusual events such as flooding and drought, especially for regions with extreme weather conditions. This study opened an opportunity to predict seasonal rainfall one year ahead with excellent precision using various effective climate indices. It should be noted that the outcomes of this study show that this technique can be applied to forecast rainfalls of other seasons, and application of this method in longer lead times could be attempted for those seasons.