1 Introduction

Regarding the climate change induced by both the effects of natural and anthropogenic variabilities, the global mean sea level is rapidly rising in the twenty-first century (Woodworth et al. 2011; IPCC 2013, 2014; Cazenave and Cozannet 2014; Arns et al. 2015; Hamlington et al. 2019). The extreme sea levels arising from climate-related drivers such as storm surges and wind waves over astronomical tides are also similarly increasing with the mean sea level (Lowe et al. 2010; Woodworth et al. 2011; IPCC 2013; Arns et al. 2015). In addition, the typhoons generated by the transformation of a vast amount of heat between oceans and the atmosphere tend to increase in intensity and genesis frequency with increasing global energy under the global warming effect (Knutson et al. 2010). Typhoon-induced surges, which are defined as an abnormal sea level rise (SLR) by typhoon-induced strong winds blowing over shallow and continental shelves, push water to the coast and further lead to an inundation risk in coastal areas, particularly where the population and urbanization has increased (Lowe et al. 2010; Arns et al. 2015; Kim and Cho 2016; Weather Meteorological Organization 2017; Talke and Jay 2020). To prevent meteorological SLR-related risks or to establish climate change adaptation plans, local (extreme) SLR estimation is required as a first step.

The SLR magnitude has been calculated as the rate of change using tidal gauges or satellite altimetry measurements (Woodworth et al. 2011; IPCC 2013; Cazenave and Cozannet 2014; Hay et al. 2015; Hamlington et al. 2019). The data revised by removing the vertical land motion from tide gauge measurements and correcting the effects of the ionospheric and tropospheric delays, instrumental biases, etc., in satellite measurements have been replotted via regression (IPCC 2013; Cazenave and Cozannet 2014; Hay et al. 2015; Kim and Cho 2016). Then, a single SLR rate of change has been mostly obtained and employed in future SLR projection and estimation. The Korea Hydrographic and Oceanographic Agency (KHOA) has reported that the sea level of the seas covering the Korean Peninsula has increased by 2.97 mm/year over the past 30 years (1989–2018). This rate is higher than the global mean SLR rate (KHOA; IPCC 2013).

Even though the extreme SLR pattern follows that of the mean sea level rise (MSLR), the former is mainly driven by meteorological components and bathymetry. In particular, the typhoon-induced surge, which contains uncertainties regarding various processes from its genesis to its extinction, is greatly amplified by waves propagating through narrow and shallow continental shelves (Ku et al. 2019b). For the management of natural hazards such as coastal flooding and strong winds, relevant standards have been mostly estimated in the form of return period levels, with a particular emphasis on the increasing uncertainty due to climate change and human activities (Butler et al. 2007; Toro et al. 2010; Goring et al. 2011; Alam et al. 2018; Ke et al. 2018; Rao et al. 2020).

Frequency analysis has been conducted to relate the magnitude of extreme events such as coastal floods and extreme (the lowest and highest) water levels to their occurrence frequency at given locations. This is based on the extreme value theory that the largest or smallest random variables tend to follow an asymptotic distribution. Various statistical distributions, such as the generalized extreme value (GEV), Weibull, and Gumbel distributions, have been applied to estimate the return period levels of high water levels in rivers, estuaries, and coastal regions, and of flooding and precipitation levels (Kwon et al. 2009; Butler et al. 2007; Arns et al. 2015; Alam et al. 2018). Specifically, the GEV and Weibull distributions have been the most widely employed to fit stationary data sets of wave heights, precipitation, and lowest and highest water levels (Kottegoda and Rosso 1997; Kwon et al. 2009; Butler et al. 2007). In studies estimating the return period levels of the maximum monthly rainfall (Alam et al. 2018) and maximum water level in rivers (Ke et al. 2018), the GEV and Weibull distributions have been adopted to describe the patterns of these extreme values. These estimations have been validated by adopting goodness-of-fit tests such as the chi-squared (Chi-S) and Kolmogorov–Smirnov (K–S) tests and the root-mean-square error (RMSE), which test and reveal whether the data set follows the hypothesized probabilistic distribution.

As a part of assessing the future SLR impact on coastal regions with respect to climate change, this research aims to estimate and update the extreme typhoon-induced SLR in terms of the return period level (or the chance of a certain level being exceeded in any one year) in the seas around the Korean Peninsula. In general, the stationary data for the aforementioned extreme value analysis consist of observations such as those measured at tidal stations for at least 50 years. South Korea possesses a total of 44 tidal stations operated by the KHOA, some of which have observed water levels lower than the threshold. Additionally, estimation of the return period levels by using the above observational data requires spatial interpolation, and regional characteristics such as geographical features cannot be considered. To overcome these drawbacks, this study adopts meteorological extreme sea level data, i.e., the typhoon-induced surge height, obtained with the deterministic Sea, Lake, and Overland Surges from Hurricanes (SLOSH) model after model validation on Korean seas (Jelesnianski et al. 1984; Shaffer et al. 1986; Taylor and Glahn 2008; Glahn et al. 2009; Seo et al. 2018). The return period levels of local extreme sea levels are estimated by adopting the GEV and Weibull distributions. Fitting curves are validated by conducting Chi-S and K–S goodness-of-fit tests. This research also provides the best 50- and 100-year return period levels of local typhoon-induced SLR values and demonstrates the validity of the adaptation of numerically obtained SLR data from the SLOSH model on Korean seas.

2 Methodologies

2.1 Deterministic SLOSH model

In this research, the typhoon-induced surge height above the stationary ocean surface was calculated by the deterministic numerical SLOSH model. This model was developed with the purpose of forecasting real-time storm surges by the National Oceanic and Atmospheric Administration (NOAA), and it has been applied to establish evacuation plans in coastal areas by implementing probabilistic or composite approaches for varying typhoons (Jelesnianski et al. 1984; Shaffer et al. 1986; Taylor and Glahn 2008; Glahn et al. 2009). On estimating the typhoon-induced surge heights, wave–tide–surge nonlinear interaction has been considered for increasing the accuracy (Chen et al. 2017; Hsiao et al. 2019, 2020). Even though the SLOSH model calculates only the wind effect, the SLOSH model has been providing the suitable accuracy in estimating the maximum values of typhoon-induced surges and the wave patterns over the seas around Korean Peninsula (Seo et al. 2018). We adopted the model SLOSH to organize a random variable set of the annual maximum typhoon-induced surges for the probabilistic approach.

The SLOSH model consists of two parts: wind and storm surge ones. The wind model first computes the pressure and wind direction for a stationary, circularly symmetric storm, and these data are used as driving forces in the surge model. (See Seo et al. 2018; Ku et al. 2019a for the details on verifying the suitability of the wind and storm surge parts on the seas around Korean Peninsula.) The surge model derives the typhoon-induced surge height by solving shallow-water equations conformally transformed from z = (x, y) into ζ = (P, Q) and then particularized onto a polar frame of reference given by (please refer to Jelesnianski et al. 1992; Kim et al. 1996 for details):

$$\zeta = P + iQ = \ln \left( {\frac{r}{{R_{0} }}} \right) + i\theta$$
(1)

where R0 is a convenient scale. The final form of the governing equations is expressed as:

$$\frac{\partial U}{{\partial t}} = - g\left( {D + h} \right)\left[ {B_{r} \frac{\partial h}{{\partial P}} - B_{r} \frac{\partial h}{{\partial Q}}} \right] + f\left( {A_{r} V + A_{i} U} \right) + \left[ {{\text{Re}} \left( {\frac{\partial z}{{\partial \zeta }}} \right)^{*} } \right]T_{x} - \left[ {{\text{Re}} \left( {\frac{\partial z}{{\partial \zeta }}} \right)^{*} } \right]T_{y} ,$$
(2)
$$\frac{\partial V}{{\partial t}} = - g\left( {D + h} \right)\left[ {B_{r} \frac{\partial h}{{\partial Q}} - B_{r} \frac{\partial h}{{\partial P}}} \right] + f\left( {A_{r} V + A_{i} U} \right) + \left[ {{\text{Re}} \left( {\frac{\partial z}{{\partial \zeta }}} \right)^{*} } \right]T_{x} + \left[ {{\text{Re}} \left( {\frac{\partial z}{{\partial \zeta }}} \right)^{*} } \right]T_{y} ,$$
(3)

subject to:

$$\frac{\partial h}{{\partial t}} = - \left| {\frac{\partial \zeta }{{\partial z}}} \right|^{2} \cdot \left( {\frac{\partial U}{{\partial P}} + \frac{\partial V}{{\partial Q}}} \right)$$
(4)

where U and V are volume transport components on the polar frame of reference, D is the quiescent water level relative to a common datum, h is the wave height above the reference datum, h0 is the hydrostatic water level, and f is the Coriolis parameter. Parameters Tx and Ty indicate the atmospheric forcing terms defined on the Cartesian coordinate system as follows:

$$T_{x} = g\left( {D + h} \right)\left( {B_{r} \frac{{\partial h_{0} }}{\partial x} - B_{i} \frac{{\partial h_{0} }}{\partial y}} \right) + C_{r} \tau_{x} - C_{i} \tau_{y} ,$$
(5)
$$T_{y} = g\left( {D + h} \right)\left( {B_{r} \frac{{\partial h_{0} }}{\partial y} - B_{i} \frac{{\partial h_{0} }}{\partial x}} \right) + C_{r} \tau_{y} - C_{i} \tau_{x} .$$
(6)

In Eqs. (3)–(6), the bottom stress coefficients of Ar, Ai, Br, Bi, Cr and Ci consist of real and imaginary components with subscripts r and i, respectively. These surface stresses over water bodies are usually obtained from meteorological sources, i.e., the wind at a height of 10 m above the sea, by computing the pressure and wind direction for a stationary and circularly symmetric storm defined as:

$$\frac{1}{{\rho_{a}} }\frac{{{\text{d}}p\left( {r_{t} } \right)}}{{{\text{d}}r_{t} }} = \frac{{k_{{\text{s}}} V^{2} }}{\sin \phi } - V\frac{{{\text{d}}V}}{{{\text{d}}r_{{\text{t}}} }},$$
(7)
$$\frac{1}{{\rho_{a}} }\frac{{{\text{d}}p}}{{{\text{d}}r_{t} }}\cos \phi = fV + \frac{{V^{2} }}{{r_{t} }}\cos \phi - V^{2} \frac{{{\text{d}}\phi }}{{{\text{d}}r_{t} }}\sin \phi + k_{n} V^{2} ,$$
(8)

where the wind speed profile is given by:

$$V\left( {r_{t} } \right) = V_{{\text{R}}} \frac{{2Rr_{t} }}{{R^{2} + r_{t}^{2} }},$$
(9)

where rt is the distance from the storm center to any given location within the circularly symmetric storm, p(rt) is the pressure, ϕ(rt) is the inflow angle across circular isobars toward the storm center, V(rt) is the wind speed, and ks and kn are empirical surface friction coefficients. The maximum wind speed and its radius are written as CR and R, respectively.

2.2 Probability distribution function

2.2.1 Generalized extreme value (GEV) distribution

The GEV distribution is a flexible three-parameter model that identifies the frequency distribution of extreme events, i.e., a set of random variables \(q_{i} \left( {i = 1, \ldots ,l} \right)\). The cumulative distribution function (CDF) of the GEV distribution is given by:

$$F_{{\text{Q}}} \left( q \right) = \exp \left\{ { - \left[ {1 - \frac{{\kappa \left( {q - \epsilon} \right)}}{\alpha }} \right]^{1/\kappa } } \right\}.$$
(10)

The three parameters are α, \(\epsilon\), and κ, which correspond to scale, location, and shape parameters, respectively. These are estimated by applying maximum likelihood estimation (MLE). The given return period values of exceedance that are on the inverse GEV CDF function (also as known as quantile function) are as follows:

$$q\left( T \right) = \xi_{{1 - \frac{1}{T}}} = \epsilon + \frac{\alpha }{\kappa }\left[ {1 - \left( {\ln \frac{T}{T - 1}} \right)^{\kappa } } \right],$$
(11)

where qmax(T) is the required design value over return period T in year and ξ are the inverse values of the GEV CDF form.

2.2.2 Weibull distribution

The Weibull distribution is one of the Type III extreme value distributions (also written as EV3) and is the most widely used with respect to smallest-extreme problems, such as stream flows and water levels (Kottegoda and Rosso 1997; Alam et al. 2018; Ke et al. 2018). Its CDF is given by:

$$F_{{\text{Q}}} \left( q \right) = 1 - \exp \left[ { - \left( {\frac{q}{\lambda }} \right)^{\beta } } \right],\quad q \ge 0$$
(12)

where λ and β are location and shape parameters, respectively. As the application of the EV3 (third asymptotic) distribution, these parameters are λ ≥ 0 and β > 0 (Kottegoda and Rosso 1997), and they are estimated via MLE. From the inverse form of the Weibull distribution, the quantile function q(T) is represented by:

$$q\left( T \right) = \xi_{{1 - \frac{1}{T}}} = - \lambda \left[ {\ln \left( \frac{1}{T} \right)} \right]^{{\frac{1}{\beta }}} .$$
(13)

2.3 Statistical performance indication

In this study, we estimated the return period values using two types of probability distribution functions: the GEV and Weibull distributions. It is necessary to ascertain whether these distributions are suitable. In this study, we adopted Chi-S and K–S goodness-of-fit tests. Here, in this subsection, these goodness-of-fit tests are briefly explained.

2.3.1 Chi-square goodness-of-fit test

The Chi-S goodness-of-fit test is one main method of comparing observed (or numerically estimated) and hypothetical (or statistically obtained) frequencies through full or specified classes (Kottegoda and Rosso 1997). The weighted sum of the squared differences between these frequencies is as follows:

$$\chi^{2} = \mathop \sum \limits_{i = 1}^{l} \frac{{\left( {O_{i} - E_{i} } \right)^{2} }}{{E_{i} }}.$$
(14)

In Eq. (14), Oi is the observed frequency and Ei is the expected frequency. To satisfy the hypothetical frequencies that are acceptable, the weighted sum χ2 should be less than that of a critical value of a \(\chi_{\nu }^{2}\) distribution. This distribution depends on the degrees of freedom, \(\nu = l - 1 - k\), where k is the number of estimation parameters.

2.3.2 Kolmogorov–Smirnov goodness-of-fit test

The K–S goodness-of-fit test is a nonparametric, empirical distribution function test based on the CDF. It involves the supreme class of the distances between the theoretical and empirical CDFs (Kottegoda and Rosso 1997; Dufour et al. 1998; Alam et al. 2018).

$$D_{n} = \mathop {\sup }\limits_{i} \left| {F_{n} \left( q \right) - F_{Q} \left( q \right)} \right|,$$
(15)

where FQ(q) is a completely specified theoretical continuous CDF and Fn(q) is the empirical distribution function given as follows:

$$F_{n} \left( q \right) = \left\{ {\begin{array}{*{20}l} 0 \hfill & {\quad {\text{for}}\,q < q_{1} } \hfill \\ {k/l} \hfill & {\quad {\text{for}}\,q_{i} \le q < q_{i + 1} } \hfill \\ 1 \hfill & {\quad {\text{for}}\,q \ge q_{l} .} \hfill \\ \end{array} } \right.$$
(16)

Here, i means the sequence of the random variables \(q_{i} \left( {i = 1, \ldots ,l} \right)\).

For the K–S goodness-of-fit test, the critical values correspond to the level of significance φ. In this paper, the level of significance is set as φ = 0.05, and the critical value of the K–S goodness-of-fit test is defined as \(D_{n,\varphi } = 1.36/\sqrt {\left( l \right)}\), where the sample size l is larger than 40.

2.4 Data for the extreme value analysis

As previously mentioned in Sect. 2.1, the deterministic SLOSH model is employed to establish a data set of the meteorological SLR induced by typhoons over the Korean input basin covering the Korean Peninsula (Seo et al. 2018; Ku et al. 2019a). According to previous studies on the application of the SLOSH model to the Korean Peninsula (Seo et al. 2018; Ku et al. 2019a), the estimation of local and meteorological SLR values with the SLOSH model falls within approximately ± 20 to ± 50% error bounds over and under the threshold of the typhoon-induced surge height, which is approximately 3 m (10 ft). Again, these error bounds are based on previous studies on the SLOSH model accuracy (Jelesnianski et al. 1992; Glahn et al. 2009).

The meteorological SLR values for the extreme value analysis were obtained in regard to the historical typhoons affecting the Korean Peninsula from 1951 to 2017. A total of 199 typhoons impacted the Korean Peninsula or the Seas of Korea. These typhoons are listed in Table 1, with the same ID as that assigned by the Regional Specialized Meteorological Center (RSMC). The first two digits indicate the calendar year, followed by a 2-digit serial number only for those storms stronger than a tropical storm. No typhoon affected Korea in 1988, 2001, and 2009. The historical typhoon input data for the SLOSH model were established by adopting the best-track data of the RSMC. Their 6-hourly data were converted into hourly tracks for 100 h, at least 48 and 12 h before and after typhoon landing, respectively. The typhoon input data consisted of the latitude and longitude of the typhoon center in degrees, the moving speed in mph, the moving direction in degrees, the minimum sea level pressure (MSLP) in mb, and the radius of the maximum sustained wind speed (RMW) in mph.

Table 1 Historical typhoons considered to calculate the typhoon-induced surge heights

For the extreme value analysis, the annual maximum values of the typhoon-induced surge heights were sampled at each grid point of the Korean input basin for the SLOSH model. The goodness-of-fit tests were conducted at a total of 44 tidal stations of the KHOA including those shown in Fig. 1. The tidal stations are shown in Fig. 1 with respect to the seas: the red asterisks, magenta squares, coral diamonds, and green circles represent the Yellow Sea, South Sea, East Sea, and Jeju Island, respectively. The identification numbers attached to the tidal stations indicate their location, i.e., Incheon (78), Yeosu (9), Masan (219), Ulsan (5), Pohang (4), and Seogwipo (14), which were chosen to visualize the results such as the fitting curves and CDFs. Note that the Busan and Tongyeong tidal stations are depicted to examine the suitability of the return period levels of the typhoon-induced surges obtained from the deterministic SLOSH model in Sect. 4.

Fig. 1
figure 1

Locations of the tidal stations (operated by the Korea Hydrographic and Oceanographic Agency) in South Korea. The red asterisks, magenta squares, coral diamonds, and green circles are the tidal stations located in the Yellow Sea, South Sea, East Sea, and Jeju Island, respectively

3 Results

3.1 Maximum envelope of water (MEOW)

The MEOW is a set of maximum water heights at each grid point. Previous studies (Seo et al. 2018; Ku et al. 2019a, b) have calculated the typhoon-induced surges on the Korean input basin; the MEOW occurs when a typhoon approaches the Korean Peninsula by piling up the water surface along shallow and continental shelves. Figure 2 shows the MEOWs for the historical typhoons of Sarah (5914), Thelma (8705), Maemi (0314), and Bolaven (1215), which penetrated the south coast of Korea, Straits of Korea, and the Yellow Sea. Additionally, these typhoons reached the 4–5 Saffir–Simpson scale in their life cycle, as underlined in Table 1. As typhoons Sarah (5914), Thelma (8705), and Maemi (0314) passed directly through the southern coastal shelves and Straits of Korea, the highest SLR occurred along both sides of the typhoon path, which was then amplified on the southern coasts. In contrast, typhoon Bolaven (1215) penetrated the Yellow Sea, and the highest sea level occurred along the west coast. The waves generated by the strong winds within the typhoon (at times decreasing to a tropical depression while approaching Korea) were transported from their origin (Boccotti 2000) toward the western coast of Korea.

Fig. 2
figure 2

Maximum envelope of water of typhoons a Sarah (5914), b Thelma (8705), c Maemi (0314), and d Bolaven (1215). The typhoon paths are shown with white straight lines and hourly ‘x’ positioning marks

Figure 2 also shows the highest SLR due to a single typhoon event on the coastal shelves along the typhoon path. Even though the MEOWs of all 199 historical typhoons are not addressed in this article, it is supposed that the maximum values of the MEOWs would reach approximately 3 m due to typhoons of 4 scale or greater, especially along the west and south coasts of Korea. In this research, for the extreme value analysis, these MEOWs are classified by the year and sorted in ascending order. These annual maximum values of the typhoon-induced surge heights are shown in Fig. 3, and they are adopted as random variables to fit curves with the GEV and Weibull probability distribution functions.

Fig. 3
figure 3

Best-fit curves for the generalized extreme value (GEV) and Weibull distributions at six tidal stations, a Incheon, b Yeosu, c Masan, d Ulsan, e Pohang, and f Seogwipo. The red and blue straight lines are the GEV and Weibull distributions, respectively. The circles are the annual maximum values of the MEOWs of all 199 historical typhoons

3.2 Probability distributions of the typhoon-induced surge heights

Figure 3 depicts the best-fit curves of the GEV and Weibull distributions at the six tidal stations of Incheon (78), Yeosu (9), Masan (219), Ulsan (5), Pohang (4), and Seogwipo (14) with 95% confidence intervals corresponding to the lower and upper confidence limits Cl and Cu which satisfy \(\Pr \left[ {C_{{\text{l}}} < \theta < C_{{\text{u}}} } \right] = 1 - \alpha\)) for α = 0.05 by using Walda method. (See Kottegoda and Rosso 1997 for details.) Both the GEV and Weibull distributions follow the annual maximum typhoon-induced surge heights well, especially for typhoon-induced surges below 0.8 m, i.e., at the Pohang and Seogwipo tidal stations with the converging pattern. On the GEV distribution, these converging patterns are depicted by negative shape parameters κ shown in Table 2 satisfying the sufficient condition for convergence of bounded random variables (Kottegoda and Rosso 1997). The relatively large shape parameters β of the Weibull distribution of the Pohang and Seogwipo tidal stations resulted in decreasing rates of changes in the typhoon-induced surge heights. But these converging patterns of the GEV distribution do not capture the highest typhoon-induced surges.

Table 2 Estimate of parameters for GEV and Weibull distribution at Incheon(78), Yeosu(9), Masan(219), Ulsan(5), Pohang(4), and Seogwipo(14) tidal stations

At the other tidal stations, the annual maximum values beyond the threshold of a 1-m typhoon-induced surge were recorded, i.e., at the Incheon, Yeosu, Masan, and Ulsan tidal stations in Fig. 3. The positive shape parameters κ and β in GEV and Weibull distributions in Table 2 describe that these stations have recorded strictly monotonic and continuous typhoon-induced surge heights up to the largest values. But only on the GEV fitting curve, the rate of change has increased and led to a sharp increase in its fitting curve. This resulted in different patterns of the GEV and Weibull distribution curves. Even though both curves underestimate the annual maximum typhoon-induced surge for return periods shorter than 10 years, the increasing rate of change of the GEV distribution results in its curve approaching the annual maximum values at long return period. These differences are also observed in Figs. 4 and 5. Figure 4 shows the empirical CDFs Fn(q) and those of the probability distributions FQ(q) with 95% confidence intervals. The increasing rate of the change of the GEV distribution leads to an overestimation of the typhoon-induced surge for long return periods due to the convergence point (i.e., where the slope equals 0) occurring later on the x-axis than that of the Weibull distribution. Figure 5 indicates the 50- and 100-year return period levels at 44 tidal stations. Some of the estimated frequencies of the 50- and 100-year return periods are much higher in the GEV distribution, especially at the tidal stations along the west coast. Additionally, these values result in much higher estimations of the annual maximum variables than those with the Weibull distribution. The decreasing or converging rate of change, particularly of the Weibull distribution along most coasts, results in a few gaps between the 50- and 100-year return period levels.

Fig. 4
figure 4

Cumulative distribution function (CDF) of the GEV and Weibull distributions at the 95% confidence level based on the empirical CDF (the black straight line). The blue and red straight lines indicate the GEV and Weibull distributions, respectively. The dotted and dashed lines indicate their 95% confidence levels

Fig. 5
figure 5

Estimated frequencies of the 50- and 100-year return period levels by the generalized extreme value (GEV) and Weibull distributions at 44 tidal stations. The blue diamonds and squares and red pentagrams and circles represent the 50- and 100-year return period levels, respectively, obtained by the GEV and Weibull distributions, respectively

3.3 Statistical performance indicators

The hypothesis testing procedure related to the form of the probability distribution is the first step assessing acceptance or rejection of the null hypothesis that the random variables have the GEV or Weibull distribution with the specified coefficients. Here, this is quantified by statistical performance indicators such as the weighted sum χ2 of the Chi-S test and supremum Dn,φ of the K–S test. There are two objectives to the performance examination of statistical analysis. First, it is verified whether the probabilistic distributions are suitable. The second objective is to provide criteria and guidance for the determination of better fitting curves.

Here, we first examine the performance of the statistical analysis in verifying the suitability of the two probabilistic distributions. The Chi-S and K–S goodness-of-fit tests ascertain the acceptance of the null hypothesis by weighted sum χ2 and supremum Dn,φ values that are lower than their critical values. In this research, the critical values for the Chi-S and K–S goodness-of-fit tests are \(\chi_{\nu }^{2} = 81.4\) and Dn,φ = 0.17, respectively, at a significance level of φ = 0.05 (Kottegoda and Rosso 1997; Panik 2005). Figures 6, 7 and Table 3 show the values of the Chi-S and K–S goodness-of-fit tests. At all tidal stations, the values of both goodness-of-fit tests are lower than their critical values. Therefore, the null hypothesis is accepted at all tidal stations regarding both the GEV and Weibull distributions.

Fig. 6
figure 6

Chi-squared goodness-of-fit test results at 44 tidal station in South Korea

Fig. 7
figure 7

Kolmogorov–Smirnov goodness-of-fit test results at 44 tidal station in South Korea

Table 3 Statistical performance indicators, i.e., the weighted sum χ2 of the chi-squared goodness-of-fit test and the supremum Dn of the Kolmogorov–Smirnov goodness-of-fit test at 44 tidal stations

In regard to the critical values, the lower the weighted sum χ2 and supremum Dn values are, the better the agreement between the curves is. Figure 6 shows that the Weibull distribution is better in terms of the Chi-S goodness-of-fit test, while Fig. 7 shows that the GEV results are better by the K–S test. Table 3 also summarizes the differences in the goodness-of-fit test values between the two probabilistic distributions. In the Chi-S test, the differences are in the range of \(0.03 \le \chi^{2} \le 0.69\), and the Weibull distribution is better except for tidal station 282, which is located on the northernmost coast of South Korea in the Yellow Sea. The K–S test values are in the range of 0 ≤ Dn ≤ 0.03, and the GEV distribution attains a better performance in estimating the return period level. To select the best distribution for variable fitting purposes, Alam et al. (2018) established a ranking system that ranked goodness-of-fit test results in descending order with respect to the probabilistic distribution and summed these ranked values. The smaller the sum is, the better the curve fit is. Even though this research evaluated just two probabilistic distributions by two goodness-of-fit tests, in applying the above ranking system (Alam et al. 2018), the GEV distribution is better to estimate the probability of exceeding a certain level in any given year because of the ranking score of tidal station 282.

4 Discussion of the return period levels of the typhoon-induced surges

Figures 8 and 9 show a bird’s-eye view of the 50- and 100-year return period levels, respectively, of the typhoon-induced surge height. The highest and relatively higher return period levels of the typhoon-induced surge height occur along the west and south coasts, respectively, based on the GEV distribution. The increasing rates of change shown in Figs. 3 and 4 resulted in higher return period levels. Additionally, comparing the 50- and 100-year return period levels of the typhoon-induced surge height, the converging pattern of the Weibull distribution resulted in a relatively gradual increase. The differences obtained from the probability distribution reached approximately 4 m (please refer to Fig. 5 for details) at the 100-year recurrence level.

Fig. 8
figure 8

Bird’s-eye view of the 50-year return period levels of the typhoon-induced surge height by the a GEV and b Weibull distributions around the Korean Peninsula

Fig. 9
figure 9

Bird’s-eye view of the 100-year return period levels of the typhoon-induced surge height by the a GEV and b Weibull distributions around the Korean Peninsula

The relatively higher return levels of the typhoon-induced surge height occurring along the south coast resulted from the historical typhoons frequently penetrating the south coast and Straits of Korea and the geographical features of the narrow and long channels. In particular, Masan Bay with the Masan (219) tidal station (please refer to Fig. 1 for the locations of the tidal stations) exhibits these geographical features. These factors amplified the waves generated by the typhoon-induced strong winds and delivered them to the inside of the bay (Ku et al. 2019a).

In the case of the east coast, the return period water levels are much lower than those along the west and south coasts. The typhoon-induced SLR values of the MEOW shown in Fig. 2 even smoothly decrease along the north direction. In general, typhoons lost energy when moving on land in the east direction, and they faded out. This resulted in relatively lower typhoon-induced winds and surge heights. In most cases, the east coast is not directly affected by typhoons.

In Sect. 3.3, the comparison of the weighted sum χ2 of the Chi-S goodness-of-fit test and the supremum Dn of the K–S test revealed that the GEV distribution attained a better performance in estimating the return period level of local typhoon-induced SLR values because of its lower goodness-of-fit test values. Particularly, the GEV distribution, i.e., the increasing slopes of the best-fit curves, as shown in Fig. 3, leads to higher typhoon-induced surge heights along the west and south coasts. Additionally, comparing the return levels to those reported in previous studies, whereby extreme local SLR values were statistically obtained by using long-term observational data at the Yeosu, Tongyeong, and Busan tidal stations on the south coast (Kwon et al. 2009; Kang and Kim 2018; Kim and Suh 2018; respectively), the return period levels of the typhoon-induced surge height by both the GEV and Weibull distributions are overestimated. This comparison is summarized in Table 4. The return period levels by the GEV distribution are closer but higher than the mean values of the 100-year return period level. The Weibull distribution was applied to estimate the 50-year return period levels in previous studies (Kwon et al. 2009; Kang and Kim 2018). These values are also included in the ± 90% confidence intervals (CIs). However, as shown in Fig. 3, compared to the numerically obtained local extreme events, the return period levels by the GEV distribution tend to be underestimated. The estimated return period levels of the typhoon-induced surge height are much higher than the mean values of the observation data, which strongly supports the application of SLOSH model results in estimating the return period level of the typhoon-induced surge height. It is expected that this will promote the adoption of the return period level by policymakers and planners who require extreme values when making decisions for the prevention of coastal flooding, the management of coastal regions, or in response to climate change.

Table 4 Return period levels of the typhoon-induced surge height by the GEV and Weibull distributions and previous studies for verification purposes (unit: m)

5 Conclusion

Climate change has led to increases in both the global mean and extreme sea levels. Although SLR values are increasing with similar patterns, the SLR rates of the seas around South Korea are much higher than the global mean SLR rate (KHOA; IPCC 2013). The typhoon-induced surge height is a main driver of extreme sea levels. As South Korea is increasingly exposed to typhoons that are becoming stronger and more frequently generated due to climate change, its coastal area is increasingly subject to local, meteorological SLR-related risks such as coastal flooding and inundation. Coastal management, policymaking, and planning with respect to SLR and climate change require the estimation of the probability of exceeding a certain level in any given year as the first step.

This research aims to estimate and update the local extreme SLR values induced by typhoons by implementing extreme value analysis with the GEV and Weibull distributions. To validate and select the best fitting curves of the annual maximum values obtained from the deterministic numerical SLOSH model, the Chi-S and K–S goodness-of-fit tests are employed. The statistical performance indicators, i.e., the weighted sum χ2 of the Chi-S goodness-of-fit test and the supremum Dn of the K–S test, ascertain the acceptance of the null hypothesis, which validates the statistical analysis. These results also provide guidance for better estimations. The lower the statistical performance indicators are, the better the fitted curves are. As a result, the GEV distribution attained a better estimation of the return period levels of the typhoon-induced SLR. The GEV distribution tends to capture extreme values (please refer to Fig. 3 for details) and provides higher return period levels along the west and south coasts.

In addition, the return period levels of the typhoon-induced SLR by the GEV distribution conform better to the observation data than do the numerically obtained typhoon-induced surge heights in terms of extreme events. Even though its error ranges from ± 20 to ± 50%, the SLOSH model tends to overestimate the maximum wave height. (See Ku et al. 2019a, b for details.) The SLOSH model results based on the Korea input basin are suitable to estimate the return period level for coastal and risk management purposes. Additionally, the above numerical model covers all oceans of South Korea, and it provides the return period level at many more locations than just tidal stations recording the tidal level. This could be a useful tool or map for other purposes such as coastal management and related policymaking and planning.