1 Introduction

Knowledge of bulk wave parameters is essential in engineering activities over the continental shelf from offshore to the surf zone. Predicting wind-induced waves is highly associated with the accuracy of the wind data forced into the model (Zhao and Toba 2001). The wind forcing can be provided from the measurement systems or global weather datasets. Engineers prefer to use the result of numerical weather prediction model because of scarce and irregular measurement network and exorbitant costs of field measurements. The major efforts are concentrated to use the global atmospheric model databases such as European Centre for Medium-Range Weather Forecasts (ECMWF) and Global Forecast System (GFS). These databases cover more than 50 years of met-oceanic information; however, there is still some hesitation for choosing them in the Caspian Sea (Hartgerink 2005; Mazaheri et al. 2013).

It is important to achieve an accurate prediction of nearshore waves in the southern part of the Caspian Sea where rip current generated by waves is drowning up to 500 people per year. The weather climate substantially changes from south to north of the Sea. The Southern part is affected by Mediterranean tropical fronts, which cause hot and dry summers and warm winters. Over the north of Caspian Sea, the weather is dominated by the Azores high pressure that causes summer-time northerly winds and by the Asiatic anticyclone which creates a winter-time easterly cold winds (Graham et al. 2002). In the southern part of the Caspian Sea, the annual cycle of the monthly mean wind produced from the global atmospheric models represents northward winds during December and January, south-southwestward divergence winds during February–July, and westward winds through August-November (Ibrayev et al. 2010). The reversal wind field as a result of the reversed land-sea temperature differences between the austral summer and winter (Ibrayev et al. 2010) leads to seasonal changes in wave pattern. Therefore, a precise knowledge of wind field data is required for modeling the bulk wave parameters.

In contrast to studies of other coastal processes, there are few studies focused on wave modeling in the southern part of the Caspian Sea. In most of these cases, data assimilation was a major concern. Lopatoukhin et al. (2004) offered some results for obtaining the wind and wave climate close to Russia from the National Centers for Environmental Prediction-National Center for Atmospheric Research (NCEP/NCAR) re-analysis dataset. They modified the dataset according to the Markov processes (Parzen 1999) using 280 thousand ships observations in the Caspian Sea, which were available since 1948. Moreover, an improved wind data set was utilized as the input of SWAN (Simulating Wave Nearshore) model (Booij et al. 1996). Their results showed a periodic spatiotemporal variability in the weighting coefficients. Hartgerink (2005) also applied the NCEP wind database in combination with SWAN model and discovered NCEP wind database does not contain sufficient information on the high-frequency wind speed variations which are needed to determine the design wave height. Additionally, he found that unlike the southern basin of the Caspian Sea (SBC), swells do not play an important role in the Northern part. Furthermore, it was proposed that the measured spectra from the Caspian Sea may be adequately described by the theoretical JONSWAP and TMA spectra. Golshani et al. 2007 did the 11 years (1992–2002) of wave simulation in the Caspian Sea to achieve the wave climate atlas of the basin. ECMWF operational wind field with spatial resolution of 0.5° and temporal resolution of 6 h was modified based on comparison with in situ and satellite input. This modified ECMWF operational wind input was used in Kamranzad et al. (2016) study as well. Hadadpour et al. (2013) simulated the wave bulk parameters using SWAN in the Anzali port region, situated at the southwestern part of the Caspian Sea. They forced the model using QuikSCAT wind field and applied an artificial neural network to the simulated wave parameters to improve model performance. Mazaheri et al. (2013) verified ERA-40 wind components using QuikSCAT satellite measurements and applied ERA-40 wind field with a 0.5° × 0.5° horizontal resolution over the Caspian Sea for a period of 32 years. They proposed correction factors for wind to improve simulation results from SWAN. Their results implied that the correction of eastward wind components is needed over the entire basin whereas for the northward components, the modification is only needed for the SBC. Although they reported some differences between the ECMWF wind field and QuikSCAT wind measurements, they did not provide any information about the corrected and pure ECMWF wind field. Rusu and Onea (2013) evaluated the wind and wave energy resources along the Caspian Sea for allocating turbine farm. Satellite data were provided from Archiving, Validation and Interpretation of Satellite Oceanographic (AVISO) data and model results were provided by SWAN model using a staggered grid which was forced by 1.5° × 1.5° ECMWF wind data. Hadadpour et al. (2014) used SWAN model for simulating the wave characteristics over a nested grid system to describe the existence and variability of wave energy along the Anzali coasts. Similar to the present study, they used a nested procedure to obtain optimal results at both global Caspian Sea domain and the coastal Anzali domain. Kamranzad et al. (2016) evaluated the wave energy potential and its spatial and temporal variations in the SBC. They used SWAN model to hindcast wave characteristics for a period of 11 years using the modified ECMWF operational wind data (Golshani et al. 2007) and found that southwestern part of the Caspian Sea is less energetic that other areas.

The latest released dataset from ECMWF which called ERA-Interim (ERAI) is probably more interesting for researchers than NCEP/NCAR and ERA-40 datasets. Evaluation of the wave characteristics using this dataset is still an open issue in many water bodies around the world. The main objective of this study is to evaluate ERAI over the southern part of the Caspian Sea.

To accomplish the present work, SWAN model was used in two-layer nested grids by the ERAI forcing in order to determine the bulk wave parameters in SBC. The field of study is described in the following section. Data sets, underlying physics, model configuration and statistical indices are presented in Sect. 3. Validating wind datasets based on remotely sensed data, analyzing ERAI and GFS wind datasets based on buoy measurements, and also bulk wave characteristics based on buoy measurements. The modeling results are presented in Sect. 4 and followed by the conclusions of the study.

2 Field of study

Caspian Sea extends between 46.67′E to 54.80′E and 36.56′N to 47.1′N with three meridional interconnected regions (Fig. 1): the shallow Northern Caspian Basin (NCS), the deep Middle Caspian Basin (MCB) and the deeper Southern basin of the Caspian Sea (SBC) with the maximum depth of 25, 778 and 1025 m, respectively. In the northern part, the depth noticeably increases towards MCB, where the average depth is 190 m. The SBC is being separated from the MCB by the Apsheron Rift. The optimum values for the geometry of the continental shelf are found to be up to 11 km and 1° for the width and slope close to the Derbent and Divichi, expanding to the south where the continental shelf slope becomes sloping downward (Naseka and Bogutskaya 2009). Based on the bathymetric data in each part of the Caspian Sea, the mean slope of sea-bed locally changes from mid-shelf to the coastline. Forty-six percent of sea area lies on the shelf with depths up to 100 m, while the continental slope of the SCB is indeed quite steep, with the eastern side slope running deeper and wider than the corresponding values on the western side (Naseka and Bogutskaya 2009). Historically, the Caspian Sea level (CSL) has experienced long- and short-time fluctuations; however, these have been limited between − 25 to − 29 m below the open-ocean surface since 1995 (Arpe and Leroy 2007). In 2011, the CSL was − 28 m below open sea datum. Bathymetry data for simulations in this study were generated from General Bathymetric Chart of the Oceans (GEBCO) database with 30 s horizontal resolution which is available from http://www.gebco.net.

Fig. 1
figure 1

Bathymetrical map of parent and child grids implemented for the coupled model. The position of deep and shallow buoys and ADCP station are shown. Contour levels are given in meters

3 Data and methodology

3.1 Datasets

Historical met-ocean data measurements are scarce in SBC along the Iranian coasts. Port and Maritime Organization (PMO) have implemented a national monitoring project for the southerly coast of the Caspian Sea since 2007. In addition, a set of ADCP measurements has been conducted along its inner-shelf by Iranian National Institute for Oceanography and Atmospheric Sciences (INIOAS).

For this study, wind measurements are collected from two oceanic buoy which have been deployed in center of SBC (DB) from February to November 2011 and in mid-shelf near Kiashahr Port (KB) from March 2010 to May 2011. The wave characteristics have been measured by abovementioned buoys (DB and KB) and an ADCP which have been conducted from 27 June to 10 July 2011 close to the Anzali Port (AA). The spectral wave data were not available from the buoys due to memory limitation of the instrument. Deployment depth and geographic positions are presented in Table 1 and shown in Fig. 1.

Table 1 Deployment depth and geographic position of measurement instruments (after PMO and INIOAS)

In addition to buoys observation, the atmospheric forcing is taken from ECMWF and GFS datasets. Both of them are released in re-analysis (RE-ANL) and combination of re-analysis and forecast (MIXED) forms on multiple scales. The re-analysis data include results at 0000, 0600, 1200 and 1800 UTC, and the mixed one includes both re-analysis fields (at 0000, 0600, 1200 and 1800 UTC) and forecast fields (at 0300, 0900, 1500, 2100 UTC).

ECMWF provides weather information from 1979 to 2017 using six different re-analysis products. The first, ERA-15, is 15 years generated re-analyses data which covers from 1979 to 1993. The second one, ERA-40, covers up to 45 years from the International Geophysical Year (1957) to 2002. The third, ERA-Interim Re-Analysis (ERAI_RE-ANL), is a re-analysis product for replacing ERA-40, which is available since 1979. ECMWF also released three new re-analysis datasets, spanning 1900–2010 which known as ERA-20C, ERA-20CM and CERA-20C datasets. The ERA-Interim atmospheric model is natively running under 256 × 512 latitude/longitude horizontal grid, which means about 0.75° horizontal grid resolution at the equator. ERA-Interim Ocean-Wave model results are stored on the 1.0° latitude/longitude grid. ERA-Interim data can optionally interpolated to a custom grid and same horizontal resolution in both directions. The default interpolation method is bilinear for continuous fields (e.g., wind velocity components) and by nearest-neighbor for discrete fields (e.g., wave 2D spectra).

For obtaining wind data, re-analysis and the mixed re-analysis/forecast datasets, ERAI_RE-ANL with 0.5°, ERAI_MIXED with 0.125°, and GFS_MIXED with 0.5° horizontal resolutions are considered. Also, we examined ERAI_MIXED-COR with 0.125° resolution which has been corrected the northward and eastward wind components as suggested by Mazaheri et al. (2013).

All wind datasets provide the wind components at 10 m above the sea surface, which are spatially distributed on a regular rectangular grid. The near-surface wind transferred to the wave model curvilinear grids using ROMS MATLAB scripts distributed along with the pre-/post-processing ROMS MATLAB toolbox, available from https://www.myroms.org/svn-/src/matlab.

For choosing the best dataset, all wind datasets and the corresponding simulated wave characteristics were compared with remotely sensed data from CERSAT. These data were obtained from the Centre de Recherche et d’Exploitation Satellitaire (CERSAT), at Ifremer, Plouzané (France). They were produced in the framework of GlobWave project, funded by the European Space Agency (ESA). It is combination of satellite wind data from ERS2, ENVISAT, JASON1, JASON2 and CRYOSAT1 during 2011. Figure 2 shows satellite track paths over the Caspian Sea including 27,852 data points in 2011. Since data from neighboring points are not statistically independent, they cannot be used for statistical analysis; therefore, all data points cannot be used to calculate statistical indices. To remedy this problem, only points which are measured at least 20 min apart in time and 25 km apart in space were used, reducing the dataset to 6977 points. We considered data in SBC with latitudes less than 40.5 and, as a result, there were 2877 data points available. The data from ERAI dataset were temporally and spatially interpolated at those points.

Fig. 2
figure 2

Combination of track paths of ERS2, ENVISAT, JASON1, JASON2 and CRYOSAT1 satellites during 2011 over the Caspian Sea

3.2 Model configuration

The SWAN model is used for evaluating the response of the wave model to ERAI wind forcing. The model can employ a rectilinear, a curvilinear or an unstructured grid. Furthermore, nested computing can be supported for structured grids. To provide a better spatial resolution close to the observation stations, a two-level composite nesting curvilinear grid system was created. The nesting technique provides two benefits: it reduces the computational cost in unconcerned areas and it provides accurate results close to in situ measurements. The regional (parent) and local (child) domains are built based on the GEBCO 30 arcsec data. Since the resolution of GEBCO data is not sufficient for modeling coastal waters, sounding bathymetric data surveys—provided by INIOAS—merged into the GEBCO bathymetric data. Horizontal resolution is 2  × 2 km for the parent grid and it is defined with 7:1 refinement factor for the child grid. The child grid resolution is 300  ×  300 m, encompassing the Guilan province (Fig. 1), and spans from 49.4°E to 50.1°E and from 37.35°N to 37.85°N.

SWAN model solves the spectral wave action balance equation (Hasselmann et al. 1973) with sources or sinks terms for o wave energy density, including generation, dissipation, and transfer processes. It represents the linear and exponential growth by the wind, dissipation due to whitecapping and bottom friction, depth-induced wave breaking and energy transfer due to quadruplet and the triad wave–wave interaction (Booij et al. 1999). The following configuration has been used for this study: linear wind wave generation was incorporated using the formulation of Cavaleri and Rizzoli (1981). The third-generation mode (GEN3) for exponential growth of waves under wind force was employed using WAM cycle 3 formulation. Calculation of whitecapping was based on formulation provided by Hasselmann (1974) and can be applied according to Komen et al. (1984). Different values for the whitecapping coefficient (Cds) were used for tuning the wave parameters. The treatment of the energy dissipation due to the depth-induced breaking which has a significant effect on wave properties in nearshore areas was based on Battjes and Janssen (1978). In the present study, a constant breaker index was used with values of 0.73 and 1.0 for the ratio of maximum individual wave height over depth and tuning coefficient on the rate of dissipation, respectively. Wave energy is dissipated by the bottom friction according to the following formulation:

$$S_{{{\text{ds}},{\text{b}}}} = - \;C_{\text{b}} \frac{{\sigma^{2} }}{{g^{2} \sinh^{2} kd}}E(\sigma ,\theta ),$$
(1)

where Cb is the bottom-friction coefficient. Based on JONSWAP experiments, Hasselmann et al. (1973) estimated a constant value of 0.038 m2/s3 for swell dissipation. Bouws and Komen (1983) suggested a value of 0.067 m2/s3 for fully developed wind seas. For a more realistic condition, bottom-friction coefficient is related to bottom-orbital velocity Urms:

$$U_{\text{rms}}^{2} = \int_{0}^{2\pi } {\int_{0}^{\infty } {\frac{{\sigma^{2} }}{{g^{2} \sinh^{2} kd}}E(\sigma ,\theta )} } {\text{d}}\sigma {\text{d}}\theta ,$$
(2)

Based on Collins description (Collins 1972):

$$C_{\text{b}} = C_{\text{f}} gU_{\text{rms}} .$$
(3)

Therefore, the bottom-friction coefficient is not constant according to Eq. (3) and it depends on the speed of the orbital wave. For including the effects of bottom friction dissipation, expression (3) was activated using Cf = 0.015 for the bottom friction coefficient.

The shape of the wave spectrum is mainly controlled by nonlinear wave–wave processes (Hasselmann et al. 1985). Wave–wave interactions illustrate different behavior in deep and shallow waters. In deep water condition, quadruplet wave–wave interaction transfers the wave energy from the spectral peak to lower and higher frequencies, especially during wave growth. SWAN computes quadruplet wave–wave interaction using the discrete interaction approximation (DIA) proposed by Hasselmann et al. (1985) for multidirectional waves. Close to the shore and in very shallow water, triad wave–wave interactions become important for steep waves. This process transfers energy from the peak frequency to its harmonics (Beji and Battjes 1993) which is computed in SWAN by the lumped triad approximation (LTA).

3.3 Statistical indices

Statistical parameters can be used for evaluating the quality of wind datasets and the performance of wave model. Following Wilks (2011), in this study, mean bias error (MBE), root-mean squared error (RMSE), correlation coefficient (CC) and scatter index (SI) were used:

$${\text{MBE}} = \left[ {\mathop \sum \nolimits \left( {A - A_{\text{obs}} } \right)} \right]/n,$$
(4)
$${\text{RMSE}} = \sqrt {\left[ {\mathop \sum \nolimits \left( {A - A_{\text{obs}} } \right)^{2} } \right]/n} ,$$
(5)
$${\text{CC}} = \frac{{n\mathop \sum \nolimits A_{{}} A_{\text{obs}} - \mathop \sum \nolimits A_{{}} \mathop \sum \nolimits A_{\text{obs}} }}{{\sqrt {\left[ {n\mathop \sum \nolimits A_{{}}^{2} - \left( {\mathop \sum \nolimits A_{{}} } \right)^{2} } \right]\left[ {n\mathop \sum \nolimits A_{\text{obs}}^{2} - \left( {\mathop \sum \nolimits A_{\text{obs}} } \right)^{2} } \right]} }},$$
(6)
$${\text{SI}} = \frac{\text{RMSE}}{{\mathop \sum \nolimits A_{\text{obs}} }}.$$
(7)

The index of agreement (IA) is another useful statistical parameter used for quantifying the agreement between simulated and observed values (Willmott et al. 2012):

$${\text{IA}} = 1 - \frac{{\left[ {\mathop \sum \nolimits \left( {A_{{}} - A_{\text{obs}} } \right)^{2} } \right]}}{{\left[ {\left| {\mathop \sum \nolimits A_{{}} - \frac{{\left( {\mathop \sum \nolimits A_{\text{obs}} } \right)}}{n}} \right| + \left| {\mathop \sum \nolimits A_{\text{obs}} - \frac{{\left( {\mathop \sum \nolimits A_{\text{obs}} } \right)}}{n}} \right|} \right]^{2} }}.$$
(8)

The MBE describes the average difference between the ERAI and buoy data. Here, the negative MBE values suggest the averages amount of underestimation in the ERAI values, while the positive values indicate overestimation. The RMSE describes how close measurements and model data are, and a lower value of RMSE implies a better agreement between the ERAI and buoy data. The ideal values of MBE and RMSE are zero. The SI determines the percentage of expected error for the model parameter, and a smaller value indicates a better agreement. Finally, the closer the value of IA to 1, the better the agreement between model results and observations.

4 Results and discussion

4.1 Validating wind datasets

European Centre for Medium-Range Weather Forecasts has produced the ERA-Interim dataset in multiple scale in space and time interval as well as GFS from NCEP/NCAR. To obtain more accurate wind dataset, the ERAI_RE-ANL, the ERAI_MIXED and ERAI_MIXED-COR datasets are compared with CERSAT wind data (U10SAR) in SBC. The results are also compared with data from GFS_MIXED wind dataset.

The statistical parameters used for validating wind datasets are presented in Table 2 and shown in Fig. 3a–d. The correlation coefficient (CC) for re-analysis and mixture of re-analysis/forecast dataset are very close together at 0.7. Based on the negative biases, the ERAI products underestimated wind velocities. Applying modification factors suggested by Mazaheri et al. (2013) has improved mean bias value. However, the mixed version of re-analysis/forecast wind data provided better statistical parameters. The lowest root means square errors (RMSE) and scatter index (SI) are found as 1.60 m/s and 0.28 for ERAI_MIXED datasets. Nevertheless, this implies that ERAI_MIXED dataset outperforms other ERAI datasets. The GFS_MIXED wind dataset overestimated wind values throughout the SBC.

Table 2 Statistical comparison of CERSAT dataset (U10SAR) vs. (a) ERAI_RE-ANL, (b) ERAI_MIXED, (c) ERAI_MIXED-COR, and (d) GFS_MIXED datasets in SBC, in terms of mean bias error (MBE) and root mean square error (RMSE), scatter index (SI), correlation coefficient (CC) and index agreement (IA)
Fig. 3
figure 3

Quantile–quantile scatter plot of wind speed from CERSAT dataset (U10SAR) vs. a ERAI_RE-ANL, b ERAI_MIXED, c ERAI_MIXED-COR, and d GFS_MIXED datasets

Rusu and Onea (2013) compared the AVISO remotely sensed dataset with ERAI at fifteen fixed points along the Caspian Sea where five points were in the SBC. They found the values of MBE, RMSE, SI and CC as 0.12 m/s, 1.26 m/s, 0.28 and 0.83 in a position which are similar to values found in at DB buoy. However, those results were based on one station point in SBC while spatially extensive area was used for comparisons in the present work.

4.2 Wind data analysis

Time series of wind velocity are presented in Fig. 4 for offshore station and in Fig. 5 for nearshore buoy. At offshore location (Fig. 4), the wind speed varied from zero to 15.94 m/s with a mean of 3.56 m/s. The corresponding values were from zero to 13.04 m/s with a mean of 3.64 for the ERAI_RE-ANL, from 0.02 up to 13.46 m/s with a mean of 4.36 m/s for the ERAI_MIXED, from 0.02 to 14.81 m/s with a mean of 4.80 m/s for the ERAI_MIXED-COR, and from 0.10 to 17.33 m/s with a mean of 5.43 m/s for the GFS_MIXED datasets. The nearshore buoy (KB) has been located nearly 150 km far from DB station, 2 km away from the coastline. Although the buoy has been operated from January 2010 to June 2011, but the last 5-month measurements were available for this study. At the nearshore buoy location (Fig. 5), where the wind speed measurements ranged from 0.2 to 12.1 m/s with a mean of 3.72 m/s, the corresponding values were from zero to 12.14 m/s with a mean of 3.32 from the ERAI_RE-ANL, from 0.18 to 9.65 m/s with a mean of 3.88 m/s from the ERAI_MIXED, from 0.21 to 11.58 m/s with a mean of 4.66 m/s for the ERAI_MIXED-COR, and from 0.21 to 10.83 m/s with a mean of 4.12 m/s for the GFS_MIXED datasets, respectively.

Fig. 4
figure 4

Comparison of the wind components of u and v, and U10 from the observation and ERAI_ER-ANL, ERAI_MIXED, ERAI_MIXED-COR and GFS_MIXED data offshore buoy (DB)

Fig. 5
figure 5

Comparison of the wind components of u and v, and U10 from the observation and ERAI_RE-ANL, ERAI_MIXED, ERAI_MIXED-COR and GFS_MIXED data nearshore buoy (KB)

The statistical indices for the ERAI_RE-ANL, ERAI_MIXED, ERAI_MIXED-COR, and GFS_MIXED wind datasets are presented in Table 3 and shown in Figs. 6 and 7. For offshore location (DB), according to the negative biases, it can be noticed that all ERAI and GFS datasets underestimated the wind velocity components. The RMSE varies in the range of 1.35–1.57 for u component, and 1.43–2.06 for v component, and 1.48–2.03 for U10. The best index of agreements, IA (0.93, 0.94 and 0.80 for u, v and U10), was resulted for the ERAI_MIXED dataset. The ERAI_MIXED also provided the lowest RMSE and SI values among all datasets at the offshore position, which implies its higher accuracy. Generally, datasets overestimated the wind velocity at the DB buoy location, especially before June. For the nearshore buoy location (Fig. 7), the RMSE and SI values implied that ERAI_MIXED provided a more accurate estimate of wind velocity than other datasets. The RMSE values for KB location are slightly less than DB, and GFS_MIXED and ERAI_MIXED datasets were the most successful datasets.

Table 3 Statistical parameters in terms of mean bias error (MBE) and root mean square error (RMSE), scatter index (SI), correlation coefficient (CC) and index agreement (IA) at offshore buoy (DB) and nearshore buoy (DB)
Fig. 6
figure 6

Scatter plot for wind components of u and v, and U10 from the observation and a ERAI_RE-ANL, b ERAI_MIXED, c ERAI_MIXED-COR and d GFS_MIXED data at offshore buoy (DB)

Fig. 7
figure 7

Scatter plot for wind components of u and v, and U10 observation and a ERAI_RE-ANL, b ERAI_MIXED, c ERAI_MIXED-COR and d GFS_MIXED data at nearshore buoy (KB)

The overall conclusion is the fact that ERAI significantly underestimates the wind speed when compared to measured data at buoys locations. Mazaheri et al. (2013) also obtained the similar conclusion for the ECMWF wind field. However, in the present study, the result of the comparison does not support the good performance of the ERAI_MIXED-COR wind field.

The wind frequency distributions from measurements and datasets are presented in Table 4. Hourly records during Feb–Nov 2011 for offshore location and during Jan–May 2011 for nearshore stations are used. According to Beaufort Wind Scale, the dominant condition is between light air and gentle breeze conditions. However, the most frequent wind is light breeze condition based on the observation and GFS_MIXED data, and gentle breeze from ERAI datasets.

Table 4 Beaufort wind force scale from the buoy wind data and (a) ERAI_RE-ANL, (b) ERAI_MIXED, (c) ERAI_MIXED-COR and (d) GFS_MMIXED datasets at offshore and nearshore buoy locations

The directional distribution of wind vectors is depicted in Fig. 8a, b for offshore and nearshore buoys. For the point located in the center of SBC (DB), it can be noticed that the northwesterlies and easterlies are dominant (Fig. 8a). However, the strongest winds are coming from northeast. For KB location, wind rose highlighted the importance of the northwest to east wind directions (Fig. 8b). The winds never exceed 10 m/s in both locations; the frequency of wind speed higher than 9 m/s was 2–6%. Rusu and Onea (2013) suggested north-eastern dominant wind conditions for north and west of the SBC location. Regarding the location of the buoy (KB), the winds are strongly influenced by the orographic feature of SBC, including Alborz and Caucasus mountains in the south and west of SBC.

Fig. 8
figure 8

Wind roses for the DB and KB location based on buoy data. Analysis performed for February to November 2011 in DB location and for February to May 2011 in KB location

Comparison of time series of wind speed and direction obtained from ERAI and GFS dataset with remotely sensed data from CERSAT and observations at offshore and nearshore buoys supported the use of ERAI_MIXED dataset for wave simulations in SBC.

4.3 Wave modeling

Conventional wave data are scarce in the southern part of Caspian Sea. Fortunately, in the last decade, a few field investigations were carried out using the buoy and bottom-mounted acoustic instruments. For this study, measured wave data have been collected from one offshore buoy (DB) located in the middle of SBC operated from February to November 2011, one mid-shelf buoy (KB) near Kiashahr Port deployed from January to May 2011 and an ADCP bottom mounted (AA) near Anzali port from 27 June to 10 July 2011. Two observation data from buoys were used for calibrating the regional SWAN model and the last is used in the local nested grid. For both buoys, only bulk wave parameters including significant wave height (Hs), peak period (Tp) and direction were available at 1 h intervals.

As mentioned in Sect. 4.2, evaluation of different wind datasets implied that the ERAI_MIXED dataset was more consistent with the measurement data. It can also be confirmed by wave simulation, when different wind data are forced to model. To avoid any complexity, SWAN model using default setting was evaluated: Two different exponential wind growth formulations of Komen (WAM cycle 3) and Janssen (WAM cycle 4) were considered for wave generation in combination with the Komen whitecapping dissipation. Statistical parameters presented in Table 5 showed that the Komen formulation was more successful than Janssen formulation. Since the ERAI_MIXED dataset resulted in lower values of RMSE and SI for wave at both DB and KB stations, it can be concluded that this dataset should be used as the main forcing over the Caspian Sea.

Table 5 Wave statistical parameters in terms of mean bias error (MBE), root mean square error (RMSE), scatter index (SI), correlation coefficient (CC) and index agreement (IA) at offshore buoy (DB) and nearshore buoy (DB) using (a) ERAI_RE-ANL, (b) ERAI_MIXED, (c) ERAI_MIXED-COR and (d) GFS_MIXED wind data

In the next step, model calibration was carried out in order the tune wave generation and whitecapping parameters. Hartgerink (2005) used the Komen formulation with Cds2 values of 2.38e−5 (default), 1e−5 and 5e−5 and found that the default value outperforms the others when δ = 1 was used. Rusu and Onea (2013) used the Janssen formulation for exponential wind growth with Cds1 = 3.2 and δ = 0.55, and Van der West-Huysen formulation for whitecapping. Hadadpour et al. (2014) and Kamranzad et al. (2016) used the Komen formulation with default values. The effects of changing the whitecapping coefficients at DB station are shown in Fig. 9 as direct comparison (Fig. 9a) and scatter plots (Fig. 9b, c) (ERAI_MIXED wind data are forced to model).

Fig. 9
figure 9

a Comparison and scatter plot for wave characteristics with different rate of whitecapping dissipation using Komen formulation and ERAI_MIXED wind data, b scatter plot from simulated and observation wave height and c scatter plot from simulated and observation peak period

Evaluation of different whitecapping dissipation coefficients shows that Komen formulation with default value generates convenient results; however, the use of Cds = 1.3e−5 resulted in a better agreement to measurements. The statistical indices for model performance using different whitecapping dissipation (cds) are presented in Table 6.

Table 6 Statistical parameters for whitecapping calibration on Hs and Tp, in terms of mean bias error (MBE), root mean square error (RMSE), scatter index (SI), correlation coefficient (CC) and index agreement (IA), mean, and standard deviation (SD) in the offshore location

The comparison between SWAN results with measurements is shown in Figs. 10, 11, and 12. At the offshore location, the results revealed consist trends of the model results and the measurements but also suggest a poor performance of model in capturing Hs and Tp values correctly. The wave heights values produced by the model at DB vary in the range of 0.07–4.66 m with the mean and standard deviations of 0.68 and 0.52 m. In contrast, the maximum measured Hs from the buoy exceeds 4.88 m with the mean value of 0.74 m and the standard deviation of 0.56 m. Also, the peak wave period varies from 1.26 to 9.98 s in model results. The mean and standard deviation are 4.63 and 1.5 s, respectively. The measured values for Tp vary from 0.34 to 14.2 s with mean values of 4.9 s and standard deviation of 1.5 s.

Fig. 10
figure 10

Direct comparison of the wave parameters Hs and Tp, and wave direction between the observation (solid, bobble) and ERAI_MIXED (dotted and plus symbols) data at offshore buoy (DB)

Fig. 11
figure 11

Direct comparison of the wave parameters Hs and Tp, and wave direction between the observation (solid, bobble) and ERAI_MIXED (dotted and plus symbols) data on offshore buoy (KB)

Fig. 12
figure 12

Direct comparison of the wave parameters Hs and Tp, and wave direction between the observation (solid, bobble) and ERAI_MIXED (dotted and plus symbols) data on ADCP location (AA)

Statistical wave parameters and directional spreading are presented in Table 7 and shown in Fig. 13. The bias errors for Hs varied from − 2.82 to − 1.26 m, with value of − 0.12 m for the MBE. The negative values of the mean bias error for both significant wave height and wave period suggest a significant underestimation at the offshore buoy. This is consistent with the results of Jafari et al. (2014). The discrepancy was more significant in early July when v-component of wind forcing was significantly different from the measurements. For the root means square (RMSE) and SI, the related values are found as 0.25 m and 0.36 with the correlation coefficients (CC) of 0.88. For the peak period (Tp), the MBE was found as − 0.36 s, while the RMSE value was 0.98 s, and the SI and the CC values were 0.21 s and 0.79, respectively.

Table 7 Hourly statistics for the wave characteristics in terms of mean bias error (MBE), root mean square error (RMSE), scatter index (SI), correlation coefficient (CC) and index agreement (IA) on offshore buoy (DB), nearshore buoy (KB) and ADCP (AA) locations
Fig. 13
figure 13

Scatter plot for wave characteristics by comparing the observation data and simulated results using ERAI_MIXED wind data forcing a on offshore buoy (DB), b on nearshore buoy and c ADCP station (AA)

The cumulative frequencies of the wave results at offshore station implied that the simulated wave height about 88% of time was less than 1 m, where only 2.1% was higher than 2 m. Also, the wave period was less than 47.8% below 3 s and 52.2% above 3 s. At the nearshore buoy station, about 5.2% of time, the wave height was higher than 1 m and 10% of the waves occurs with less than 3 s periods and slightly less than 5% occurs with higher than 8 s.

Comparison of wave characteristics from buoy data with model results (Fig. 13a) showed the values of linear regression (R2) found at 0.8 and 0.6 for Hs and Tp and the slopes of the regression lines are lower than 1. These are implied that the model underestimated Hs and Tp; however, the RSME values suggested a good consistency for wave height and wave period except for some specific events.

At the Kiashahr buoy (KB), the values of linear regression (R2) were found at 0.6 and 0.3 for Hs and Tp (Fig. 13b). In similarity of offshore location, the slopes of the regression lines are slightly lower than 1, suggesting that model result underestimate the nearshore location data.

Direct comparisons of the wave characteristics at ADCP location revealed a poor trend consistency between the model results and observation. In both locations, waves are distributed with the values of Hs less than 1 m. Over 80% of the time, the waves are observed at wave height of less than 1 m. The observations always are found at 80% with lower than 1 m wave height and nearly up to 3% are found above 2 m. These conditions indicate that the SBC usually has moderate to low conditions. Rusu and Onea (2013) found the contribution of 98% with less than 2 m height for summer and 87% during winter. Waves in the enclosed water bodies are dominantly generated by winds and Caspian Sea.

Observed and simulated wave roses are presented in Fig. 14. At offshore location, waves are coming from the north to east direction and, in fact, they are mainly coming from the north sector (Fig. 14a). Simulation results (Fig. 14b) revealed the same pattern at DB point, which is also confirmed by observations. At nearshore buoy location, the waves are dominantly coming from the north and east (Fig. 14c). The model overestimated the north-east components and underestimated the north and east components as presented in Fig. 14d.

Fig. 14
figure 14

Wave roses for the a offshore (DB) and b nearshore (KB) location based on buoy data. Analysis performed for Feb–Nov 2011 in DB location and for Jan–May 2011 in KB location

The spectral information during two high energy events is shown in Fig. 15. It is clear from both wave spectrum and its direction spreading that high energy events are not bimodal at SBC; therefore, the model performance can be evaluated effectively using bulk wave parameters such as Hs and Tp.

Fig. 15
figure 15

Wave spectra for two consecutive events occurred at a March and b August at the offshore buoy location (DB)

5 Conclusions

High resolution ERA-Interim (ERA-I) wind forcing is evaluated in the southern basin of Caspian Sea (SBC) for the wave generation. The SWAN model was used to simulate the waves using two nested layer grids. The accuracy of the wind forcing significantly affects the performance of the model. Such wind data can be achieved from several resources such as remotely sensed data and global weather forecast system datasets. In the present study, mixed re-analysis and forecast ERA-Interim (ERA-I) dataset are evaluated to simulate wave condition in the SBC. The wind data were compared with CERSAT wind data, and the same process was repeated for GFS mixed re-analysis/forecast wind data. The results revealed a fairly good agreement between satellite-derived data and ERAI-MIXED datasets. However, underestimation of the wind speed was quiet evidence. This wind dataset was used as forcing for SWAN model to simulate the wave characteristics. Three measurement stations including two met-ocean buoys and one ADCP were used to evaluate the wave model performance. The latest measurement data are published for the first time in this study.

Since ERAI dataset has been critiqued for its underestimation of the wind velocity, some local correction coefficients were suggested in previous studies. The direct comparison of the wind data with satellite-derived wind speed, and the performance of SWAN using such wind data did not support using such correction factors.

Although the uncertainty in the wind data affects the wave model results, the SWAN provided acceptable bulk wave characteristics using ERAI-MIXED wind data. The analysis results showed a good agreement between observation and model results for both significant wave height and wave period except during some extreme events. The results calculated from the observation and ERAI data showed that ERAI slightly overestimated the bulk wave parameters in the offshore station and underestimated it close to nearshore buoy. Finally, the quantitative evaluation of directional distribution of wind conditions indicated the moderate to low energy conditions at SBC.