1 Introduction

The effect of the spatial distribution of rainfall on basin discharge has been reviewed by Singh (1997), who suggested that the spatial distribution of rainfall affected the runoff timing rather than the peak discharge. Nevertheless, some studies have found that the spatial distribution of rainfall could affect the flood peak if (i) the rainfall duration was shorter than the time to equilibrium (Ogden and Julien 1993) and (ii) the basin characteristics were nonuniform (Freeze 1980; Singh 1997). Recent studies have simulated discharges from spatially distributed rainfalls over nonuniform basins and found that uniform and spatially distributed rainfall could lead to different rainfall thresholds (Golian et al. 2010, 2011), flood sources (Lee et al. 2009; Saghafian et al. 2013) and flood peaks (Chotpantarat and Chanyotha 2003; Saghafian et al. 2014; Tramblay et al. 2011; Zoccatelli et al. 2011). The basin characteristics that affected the flood peak mentioned in these studies were the hydraulic conductivity, which controlled the amount of excess rainfall, and topography, which controlled the attenuation. Other characteristics that could influence the flood peak were the shape, storage and roughness of the area. The spatial data have been taken into account in flood simulations in many studies (Hamidi et al. 2018; Klongvessa and Chotpantarat 2014; Meng et al. 2017; Nkiaka et al. 2018; Samadi et al. 2018).

However, prioritization of the influences of each characteristic is still poorly understood. If the influences of these characteristics were prioritized, it would be easier to determine the locations where rainfall has a strong impact on the peak discharge from these basin characteristics; moreover, it would be easier to create a numerical model that focuses on only the important characteristics. In this study, the peak discharges from various spatially distributed rainfalls were simulated, similar to the approach used in previous studies, but then the response of the peak discharge to the rainfall at each location were further investigated. Finally, the characteristics of the location where the response was most pronounced were identified, and the influence of each characteristic was then analyzed. With this approach, the influence of each characteristic was prioritized.

The Yom River basin, northern Thailand (see Sect. 2) was chosen for this study because it is large and long, leading to a longer time to equilibrium, and there are clearly different characteristics between the upstream and downstream areas of this basin. The upstream area is a forested mountainous region, while the downstream area is an agricultural plain area. Moreover, the flow in this basin is primarily natural, since there are no large-scale control structures along the river. Furthermore, the long shape of the basin drives a clear variability in the rainfall locations. For these reasons, the Yom River basin was deemed to be appropriate for studying the response of the flood peak to the spatial distribution of rainfall.

2 Study area

The Yom River basin is in northern Thailand and covers an area of approximately 24,000 km2 (Fig. 1). The length of the main stream, the Yom River, is approximately 700 km. The basin was divided into the three parts: the upper, middle and lower Yom basin. However, the upper part was excluded from this study because there is a weir near the boundary between the upper and middle Yom basin (Fig. 1), which can affect the natural flow. Apart from this weir, there are no large-scale control structures in the basin. Several small floodgates for irrigation have been constructed but do not significantly affect the flow.

Fig. 1
figure 1

Location of the study area (Yom River basin, Northern Thailand)

The study area of the middle and lower Yom basin covers an area of approximately 19,000 km2, and the length of the main stream was approximately 600 km. The downstream part of the area is a plain that is mostly agricultural with some surface water storage. The soil in this part of the study area is mainly a fine-textured soil. The upstream part of the study area is a mountainous region and mostly covered by forest with a coarser soil texture than that downstream. Most of the flood simulations over the Yom River basin and the Chao Praya basin, in which the Yom River basin is located, have used two-dimensional models (Hunukumbura and Tachikawa 2012; Kotsuki and Tanaka 2013; Sayama et al. 2015, 2017; Sriariyawat et al. 2013) or a one-dimensional model at a sub-basin scale (Klongvessa et al. 2017b; Kure and Tebakari 2012; Tingsanchali and Karim 2010), where nonuniform rainfall or basin characteristics can be considered.

Rainfall in the study area is high during May–October, but floods mostly occur during September–October when the rainwater from rainfalls during the previous months saturates the soil and decreases the rainwater storage capacity for subsequent rainfalls (Chotpantarat and Boonkaewwan 2018; Kotsuki and Tanaka 2013). The city of Sukhothai, which is located within the study area (Fig. 1), is frequently damaged by floods due to the low retention capability of the upstream area and the narrow basin cross section in the downstream area (Sriariyawat et al. 2013).

3 Methodology

3.1 Model description

This study used the HEC-HMS model (Feldman 2000) to simulate rainfall runoff. However, due to the long main river, the river flow could play an important role on traveling of water. Therefore, the runoff simply predicted by the transformation from excess rainwater in the HEC-HMS model might not be very accurate. We improved the accuracy by using the physically-based HEC-RAS model (Brunner 2010) to simulate the river flow. The runoff simulated by the HEC-HMS model determined the lateral flow to the river, and the river flow was routed by the HEC-RAS model. The timestep for the simulation was 15 min and the distance between calculation points along the river was set to be closet to but not more than 5 km.

In the HEC-HMS model, the excess rainwater was controlled by a constant percolation loss and surface water storage volume. In this study, the infiltration storage was assumed to be full because flooding in the study area is mostly associated with a saturated soil condition (Kotsuki and Tanaka 2013). The excess rainwater was transformed to runoff by Snyder’s unit hydrograph method, which is characterized by the standard lag time and peak coefficient. The standard lag time varied upon the shape and characteristics of the drainage area, as shown in Eq. (1) (Feldman 2000);

$$t_{p} = 0.75C_{t} \left( {LL_{c} } \right)^{0.3} ,$$
(1)

where tp is the standard lag time (h); Ct is the non-physically based basin coefficient, which is generally affected by topography and was low over the mountainous region; L is the distance from an outlet to the farthest boundary (km); and Lc is the centroid distance (km). The peak coefficient is a non-physical parameter that determines the peak height, and is generally high when Ct is low.

In the HEC-RAS model, the flow was routed along the river by the principles of conservation of mass and conservation of momentum using a four-point implicit scheme (Brunner 2010). The river was characterized by Manning’s roughness coefficient (n).

The conceptual model of the study area was developed in Klongvessa et al. (2017b). The study area was divided into 13 subareas, in which the rainfall was transformed into lateral flow to the main stream (Fig. 2 and Table 1). The upper four subareas represent the mountainous region and were calibrated separately from the lower nine subareas that represent the plain. However, in the experimental simulation, since the water level in the city of Sukhothai (stream gauge Y4) did not respond to the rainfall in the five downstream subareas, the rainfall could be simulated over only the other eight subareas. The rainfall was spatially distributed across four zones (Fig. 2 and Table 1), following the methods of Klongvessa et al. (2017b), who had shown that distributing the rainfall across these four zones could still drive accurate simulation results that were similar to the simulation results when the rainfall was distributed across eight subareas. These four zones are referred to as zones 1–4, according to their locations: the most upstream zone was zone 1 and the most downstream zone was zone 4. Zones 1 and 2 represent the mountainous region, while zones 3 and 4 represent the plain area. The result of the model validation is shown in Fig. 3. It can be seen that the model could simulate the traveling of flood peak accurately with root-mean-square errors (RMSE) of at most approximately 1.0 m for the recent years (2012–2014). For the year 2011, the RMSE’s were large because the river cross sections were changed by the big flood.

Fig. 2
figure 2

Digitized study area for the simulations, showing also the location of the 13 outlet stream gauges (Y1–Y43)

Table 1 Details of the 13 subareas within the study area
Fig. 3
figure 3

Result of model validation at the stream gauges along the Yom River (Y37, Y14, Y6, Y3A, Y33 and Y4) and root-mean-square error (RMSE) between observed and expected water levels

The calibrated parameters are shown in Table 2. Zones 1 and 2 were characterized by a high percolation rate due to the coarser soil texture observed in these zones than in zones 3 and 4, a high channel Manning’s n due to the vegetation over the forestland (Chow et al. 1988), a low basin coefficient [Ct in Eq. (1)] due to the steepness (Feldman 2000) and a low surface water storage. The relative magnitude of the values of these four parameters were reversed for zones 3 and 4, which had a different soil texture, land use and steepness than those of zones 1 and 2.

Table 2 Model parameters (Klongvessa et al. 2017b)

3.2 Rainfall design

The total rainfall amount and temporal distribution used in this study were from Klongvessa et al. (2017b). That study determined the amounts of 2-, 5- and 10-y rainfalls under durations of 24, 48 and 72 h and designed the hyetograph from the data during May–October 1981–2010 collected every 3 h at 11 high quality rain gauges operated by Thai Meteorological Department (TMD). The locations of these rain gauges are shown in the Fig. 1. Even though most of the rain gauges were located outside the Yom River basin, the rainfall amounts collected at these rain gauges have been found to be consistent with rainfall amounts collected at 24 local rain gauges inside the basin (Chotpantarat et al. 2013). Therefore, they were capable of representing the rainfall in our study area. Because the rainfall amounts at the local rain gauges were collected on a daily basis, not every 3 h, they could not be used for the rainfall design in the study. Hence, only the rainfall data at the rain gauges shown in the Fig. 1 were used. The amounts of 2-, 5- and 10-y rainfalls were determined using the Weibull equation (Singh 1992) and generalized extreme value (GEV) distribution (Martins and Stedinger 2000), while the rainfall temporal distribution was determined from the observed areal rainfalls over each of the zones 1–4 with the timestep of 3 h by the alternating block method (Chow et al. 1988). Mostly, the duration of these observed rainfalls ranged from a few hours to 3 days. These calculations were shown to accurately represent the rainfall in the study area. The rainfall amount at each return period and duration is shown in Table 3.

Table 3 Amount of the 2-, 5- and 10-y rainfalls with 24-, 48- and 72-h durations (Klongvessa et al. 2017b)

A total of 1000 spatial distributions of rainfall were created according to the observed data and simulated using a Monte Carlo analysis (Golian et al. 2010, 2011) and Cholesky randomization approach (Kreyszig 1999). First, observed rainfall events were selected; in this study, the spatial distribution characteristics of the 24-, 48- and 72-h rainfalls were distinguished. The 24-h rainfall was determined from observed events that had days with rainfall greater than 35.0 mm, while the 48-h and 72-h rainfalls were determined from the observed events that had 2 and 3 consecutive days rainfall greater than 35.0 mm, respectively. The rainfall data were observed at the 11 rain gauges during May–October 1981–2010 (Fig. 1), and the zonal rainfalls were calculated from the Thiessen weighted average for the zones 1-4. A total of 728, 249 and 104 events were used to determine the characteristics of the 24-, 48- and 72-h rainfalls, respectively. The criterion of greater than 35.0 mm/day rainfall was used because it is the same criterion that the TMD used to classify heavy rainfall days. Then, for each of the selected events, the rainfall was converted to dimensionless rainfall by dividing the zonal rainfall amount (rainfall amount in each of the four zones) with the average areal rainfall amount. Next, the probability distribution of the dimensionless rainfall in each zone (Sect. 3.2.1) and the spatial correlations of the dimensionless rainfall among the zones were determined. Subsequently, the dimensionless rainfall in each of the four zones was randomized according to the determined probability distributions and spatial correlations (Sect. 3.2.2). Finally, the dimensionless random rainfall distributions were converted to the dimensional rainfall by multiplying the dimensionless amounts with the determined amount (Table 3).

3.2.1 Probability distribution of the zonal dimensionless rainfall

In the Monte Carlo analysis, the probability distribution of the zonal dimensionless rainfall was determined by fitting the probability of the observed zonal dimensionless rainfall amount with a statistical distribution before generating the random rainfalls according to the same statistical distribution (Golian et al. 2010, 2011). First, the Weibull equation (Singh 1992) was applied to assign the probabilities to the observed data. Next, the normal distribution was used to fit the observed data using Eq. (2);

$$x = \left\{ {\begin{array}{*{20}l} {\mu + \sigma \sqrt 2 erf^{ - 1} \left( {1 - 2P} \right), \mu + \sigma \sqrt 2 erf^{ - 1} \left( {1 - 2P} \right) \ge 0} \hfill \\ {0, \mu + \sigma \sqrt 2 erf^{ - 1} \left( {1 - 2P} \right) < 0} \hfill \\ \end{array} } \right.,$$
(2)

where x is the expected zonal dimensionless rainfall amount, μ is a location parameter, σ is a scale parameter, erf is an error function and P is the exceedance probability.

It should be noted that the rainfall amount cannot be negative; therefore, the dimensionless rainfall amount should be set to 0 if the expected amount determined by the normal distribution was negative.

3.2.2 Rainfall randomization

The zonal dimensionless rainfalls were randomized according to (i) the location parameter (μ) and scale parameter (σ) of the normal distribution, which was used to fit the observed data with Eq. (2), and (ii) the spatial correlations of the data in zones 1–4. This process was performed with four series of normally distributed random numbers (which represent the rainfall in the four zones); the series have specified correlation coefficients. These series of random numbers were generated by Cholesky randomization (Kreyszig 1999).

Let \(\varvec{V}_{1} = \left[ {\begin{array}{*{20}c} {v_{1,1} } & {v_{2,1} } & \ldots & {v_{1000,1} } \\ \end{array} } \right]\), \(\varvec{V}_{2} = \left[ {\begin{array}{*{20}c} {v_{1,2} } & {v_{2,2} } & \ldots & {v_{1000,2} } \\ \end{array} } \right]\), \(\varvec{V}_{3} = \left[ {\begin{array}{*{20}c} {v_{1,3} } & {v_{2,3} } & \ldots & {v_{1000,3} } \\ \end{array} } \right]\) and \(\varvec{V}_{4} = \left[ {\begin{array}{*{20}c} {v_{1,4} } & {v_{2,4} } & \ldots & {v_{1000,4} } \\ \end{array} } \right]\) be vectors of independent random numbers with a standard normal distribution, and let ci,j be a correlation coefficient between the dimensionless rainfalls in zones i and j. The matrix of random numbers \(\varvec{V}\) and matrix of correlations \(\varvec{C}\) can be written as Eqs. (3) and (4), respectively;

$$\varvec{V} = \left[ {\begin{array}{*{20}c} {v_{1,1} } & {v_{1,2} } & {v_{1,3} } & {v_{1,4} } \\ {v_{2,1} } & {v_{2,2} } & {v_{2,3} } & {v_{2,4} } \\ \vdots & \vdots & \vdots & \vdots \\ {v_{1000,1} } & {v_{1000,2} } & {v_{1000,3} } & {v_{1000,4} } \\ \end{array} } \right],$$
(3)
$$\varvec{C} = \left[ {\begin{array}{*{20}c} 1 & {c_{1,2} } & {c_{1,3} } & {c_{1,4} } \\ {c_{1,2} } & 1 & {c_{2,3} } & {c_{2,4} } \\ {c_{1,3} } & {c_{2,3} } & 1 & {c_{3,4} } \\ {c_{1,4} } & {c_{2,4} } & {c_{3,4} } & 1 \\ \end{array} } \right].$$
(4)

The Cholesky component \(\varvec{Z}\) of the matrix \(\varvec{C}\) is defined as the triangular matrix that satisfied Eq. (5);

$$\varvec{C} = \varvec{ZZ}^{\text{T}} .$$
(5)

The matrix V* is defined by Eq. (6);

$$\varvec{V}^{\varvec{*}} = \varvec{VZ} = \left[ {\begin{array}{*{20}c} {v^{*}_{1,1} } & {v^{*}_{1,2} } & {v^{*}_{1,3} } & {v^{*}_{1,4} } \\ {v^{*}_{2,1} } & {v^{*}_{2,2} } & {v^{*}_{2,3} } & {v^{*}_{2,4} } \\ \vdots & \vdots & \vdots & \vdots \\ {v^{*}_{1000,1} } & {v^{*}_{1000,2} } & {v^{*}_{1000,3} } & {v^{*}_{1000,4} } \\ \end{array} } \right].$$
(6)

The correlation between the components of vectors \(\varvec{V}^{\varvec{*}}_{\varvec{i}} = \left[ {\begin{array}{*{20}c} {v^{*}_{1,i} } & {v^{*}_{2,i} } & \ldots & {v^{*}_{1000,i} } \\ \end{array} } \right]\) and \(\varvec{V}^{\varvec{*}}_{\varvec{j}} = \left[ {\begin{array}{*{20}c} {v^{*}_{1,j} } & {v^{*}_{2,j} } & \ldots & {v^{*}_{1000,j} } \\ \end{array} } \right]\) where \(i,j \in \left\{ {1, 2, 3, 4} \right\}\) is ci,j, and these components have a standard normal distribution. Therefore, in the event \(r \in \left\{ {1, 2, \ldots , 1000} \right\}\), the dimensionless rainfall amount in the zone \(c \in \left\{ {1, 2, 3, 4} \right\}\) can be calculated from Eq. (7);

$$x = \left\{ {\begin{array}{*{20}l} {\mu + \sigma v^{*}_{r,c} ,\quad \mu + \sigma v^{*}_{r,c} \ge 0} \hfill \\ {0,\quad \mu + \sigma v^{*}_{r,c} < 0} \hfill \\ \end{array} } \right.,$$
(7)

where x is the dimensionless rainfall amount and μ and σ are also used in Eq. (2). However, for some matrix \(\varvec{C}\), there is no matrix \(\varvec{Z}\) that can satisfy Eq. (5). In these cases, some values of ci,j were slightly adjusted (Higham 2002).

It should be noted that in the observed data, the areal average of the dimensionless rainfall for each event was 1, but for random events, the areal average dimensionless rainfall might not be equal to 1 due to the randomness of the inputs. Therefore, before the conversion of the random dimensionless rainfall to dimensional rainfall by using a determined amount, the random zonal dimensionless rainfalls must be reweighted to calibrate the areal average values to 1.

4 Results and discussion

4.1 Spatial rainfall

The parameters of the normal distribution used to fit the zonal dimensionless rainfall, as in Eq. (2), are shown in Table 4, and the plots of the distributions are shown in Fig. 4. The Chi square goodness of fit test confirmed that Eq. (2) can fit the data at the 0.05 significance level (Table 4). Comparisons of the dimensionless rainfalls generated by the Monte Carlo analysis with Cholesky randomization and the observed dimensionless rainfalls are shown in Fig. 5, which provides the averages and standard deviations, and Fig. 6, which provides the spatial correlations; the generated and observed dimensionless rainfalls were similar. The positive correlation between rainfalls in zones 2 and 3 and the negative correlations for the other zones suggest that the rainfall location can be classified into 3 groups, zone 1, zones 2–3 and zone 4. This classification coincides with the study by Klongvessa et al. (2017a) who have divided the northern Thailand into 3 parts according to the characteristics of consecutive rainfall days, the mountainous region (zone 1 in our study) with low rainfall amount, the plain area (zone 4 in our study) with higher rainfall amount than the mountainous region and the joint area (zones 2–3 in our study) with high probability of consecutive heavy rainfall days. Unlike the short-duration (24-h) rainfall, the long-duration (48- and 72-h) rainfalls tended to be more concentrated upstream, since the 48- and 72-h rainfalls were more common than the 24-h rainfall in zones 1–3, while the 24-h rainfall were more common than the 48- and 72-h rainfalls in zone 4. However, all short- and long-duration rainfalls tended to be more concentrated over the downstream part of the study area than the upstream part.

Table 4 Parameters of the normal distribution used to fit the zonal dimensionless rainfall, RMSE between the observed and expected values, and test and critical values of the Chi square goodness of fit test
Fig. 4
figure 4

Zonal dimensionless rainfall probabilities calculated from all rainfalls, 2-day or longer rainfalls and 3-day or longer rainfalls

Fig. 5
figure 5

Averages and standard deviations (whiskers) of simulated and observed zonal dimensionless rainfalls

Fig. 6
figure 6

Spatial correlation of the simulated and observed zonal dimensionless rainfalls

4.2 Flood peak

Flood peaks in the city of Sukhothai from the 2-, 5- and 10-y rainfalls with 24-, 48- and 72-h durations are shown in Fig. 7. The flood peak from a 24-h rainfall was usually low but highly variable, while those from 48 to 72-h rainfalls were high and fairly uniform. From Fig. 7, it can be seen that in some rare cases, a 24-h rainfall can produce an extremely high flood peak that is even higher than the highest flood peaks from the 48- and 72-h rainfalls.

Fig. 7
figure 7

Simulated peak discharges at Sukhothai city from 1000 simulated rainfalls of 2-y, 5-y and 10-y return periods and 24-, 48- and 72-h durations

The spatial distribution of the rainfall that caused the high flood peaks was evaluated by determination of the correlation coefficient between the flood peak and peak rainfall intensity in each of the four zones (Fig. 8). The flood peak from a 24-h rainfall was high when the rainfall was concentrated over zones 1 and 2 (upstream area), while the flood peaks from the 48- and 72-h rainfalls were high when the rainfalls were concentrated over zones 2 and 3 (joint area). The negative correlation for the zone 4 suggests that the high peak discharge at the Sukhothai city is caused by discharge from the upstream rainfall rather than the local rainfall (rainfall in zone 4). Given a fixed areal rainfall amount, the higher rainfall amount in the upstream area means the lower rainfall amount in the local area. With this reason, the correlations between the peak discharges and peak rainfall intensities in the zone 4 are negative.

Fig. 8
figure 8

Correlation coefficient between the peak discharge and peak rainfall intensity in each zone

The flood peak from a 24-h rainfall was low because the rainfall duration was shorter than the concentration times, which was 48–72 h depending on the rainfall location (Klongvessa et al. 2017b); moreover, 24-h rainfalls were rarely concentrated over zones 1 and 2 (Fig. 5). The high variability in the flood peaks from the 24-h rainfall was because the rainfall duration was shorter than the concentration time, and the equilibrium point, where the incoming water from the rainfall is balanced with the outgoing discharge, was not reached (Ogden and Julien 1993). Conversely, the flood peaks from the 48- and 72-h rainfalls were high because the rainfall durations were similar to the concentration time, and these rainfalls tended to concentrate over zones 2 and 3. The flood peaks from those rainfalls were not considerably variable because the equilibrium point had been reached.

4.3 Influence of basin characteristics

In this section, the physical mechanism behind the correlation between the flood peak and the rainfall intensity was investigated in each zone.

4.3.1 Short-duration rainfall

The flood peak from a short-duration rainfall appeared to respond to the rainfall in zones 1 and 2 more strongly than the rainfall in zones 3 and 4. The differences in the basin parameters between zones 1 and 2 and zones 3 and 4 included the percolation rates, surface water storage, basin constants [Ct in Eq. (1)], peak coefficients for the Snyder unit hydrograph transformation and channel Manning’s n. These characteristics were evaluated by performing simulations with uniform parameters to determine which characteristic had the most influence on the peak discharge. For each parameter, the value over zones 1 and 2 was used for the first simulation, and the value over zones 3 and 4 was used for the second simulation; the influence of each parameter can then be determined by evaluating the difference between the simulated peak discharges from these two simulations.

The influence of each parameter appeared to depend on the rainfall amount and whether the rainfall was concentrated over the mountainous region or the plain area. Table 5 shows the results from the simulations of two samples of 2-y rainfalls with 24-h duration when each parameter was adjusted to be uniform. One sample was concentrated over the mountainous region, while another sample was concentrated over the plain area. Table 6 is similar to Table 5 but for samples of 10-y rainfalls. The rainfall hyetographs and discharges at the outlets of the zones 1–4 for the usual case (no adjustment of parameter) are shown in Figs. 9 and 10 for the samples of 2-y and 10-y rainfalls, respectively.

Table 5 Peak discharge (m3/s) from one sample of 24-h rainfalls with a 2-y return period that concentrated over the mountainous region and one sample of those over the plain area when the basin coefficient, surface storage, Channel Manning’s n and percolation rate were adjusted to be uniform
Table 6 Peak discharge (m3/s) from one sample of 24-h rainfalls with a 10-y return period that concentrated over the mountainous region and one sample of those over the plain area, when the basin coefficient, surface storage, Channel Manning’s n and percolation rate were adjusted to be uniform
Fig. 9
figure 9

Discharge from the sample of 24-h rainfalls with a 2-y return period that concentrated over the mountainous region (upper) and one sample of those over the plain area (lower)

Fig. 10
figure 10

Discharge from the sample of 24-h rainfalls with a 10-y return period that concentrated over the mountainous region (upper) and one sample of those over the plain area (lower)

From Table 5, when the amount of rainfall was small, the flood peak appeared to respond to the soil percolation rate and the surface storage rather than the steepness. The responses to these two characteristics are apparent when the rainfall was concentrated over the mountainous region. The flood peak responded to the rainfall in zones 1 and 2 rather than that in zones 3 and 4 because zones 3 and 4 had high surface water storage capabilities to retain the rainwater. Conversely, Table 6 shows that when the amount of rainfall was large, the flood peak appeared to respond to the steepness rather than the soil percolation rate and surface storage. This response to the steepness was apparent when the rainfall was concentrated over the plain area. The flood peak responds to the rainfall in zones 1 and 2 because these zones are steep, which causes the rainwater to be quickly drained to the stream. However, with both small and large rainfalls, the channel roughness had only a very small influence on the flood peaks.

4.3.2 Long-duration rainfall

The flood peak from a long-duration rainfall appeared to be mainly affected by the soil percolation rate at the rainfall location, because the long-duration rainfalls had lower intensities than the short-duration rainfalls. Apart from the soil percolation rate, there were effects of the surface storage and terrain steepness that depended on the rainfall magnitude, similar to the results of short-duration rainfall (Sect. 4.3.1). Tables 7 and 8 show the flood peaks from the 2-y and 10-y simulations, respectively, for samples 48-h rainfalls with each parameter adjusted to be uniform. The rainfall hyetographs and discharges at the outlets of the zones 1–4 for the usual case (no adjustment of parameter) are shown in Figs. 11 and 12 for the samples of 2- and 10-y rainfalls, respectively. It can be seen than the flood peak responded most strongly to the soil percolation rate at the rainfall location, followed by the surface water storage if the amount of rainfall was small and by the terrain steepness if the amount of rainfall was large. For the 72-h rainfall, a stronger influence of the soil percolation rate was expected because of the lower rainfall intensity than that in the 48-h rainfall.

Table 7 Peak discharge (m3/s) from one sample of 48-h rainfalls with a 2-y return period that concentrated over the mountainous region and one sample of those over the plain area, when the basin coefficient, surface storage, Channel Manning’s n and percolation rate were adjusted to be uniform
Table 8 Peak discharge (m3/s) from one sample of 48-h rainfalls with a 10-y return period that concentrated over the mountainous region and one sample of those over the plain area, when the basin coefficient, surface storage, Channel Manning’s n and percolation rate were adjusted to be uniform
Fig. 11
figure 11

Discharge from the sample of 48-h rainfalls with a 2-y return period that concentrated over the mountainous region (upper) and one sample of those over the plain area (lower)

Fig. 12
figure 12

Discharge from the sample of 48-h rainfalls with a 10-y return period that concentrated over the mountainous region (upper) and one sample of those over the plain area (lower)

Both zones 1 and 2 had a high soil percolation rate, but zone 1 had a lower rainfall magnitude than zone 2. In zone 1, the low rainfall intensity caused the influence of the percolation to be strong; therefore, most of the rainwater was lost to percolation. In zone 2, a higher rainfall intensity reduced the influence of the percolation. Thus, the correlation between the flood peak and rainfall was weak in zone 1 and strong in zone 2. Figure 13 shows the average of the peak excess rainfall intensity in each zone. Zone 1 had a lower peak excess rainfall intensity than zone 2, and the correlation between the flood peak and rainfall in zone 1 (Fig. 8) varied with the peak excess rainfall intensity, while that in zone 2 was always strong.

Fig. 13
figure 13

Average peak excess rainfall intensity in each zone from the simulated rainfalls given a fixed rainfall amount for each return period and duration

Compared to zone 3, zone 4 had a very long centroid distance (Table 1), which resulted in a very long standard lag time, tp in Eq. (1); therefore, the flood peak from the rainfall in zone 4 was low. The standard lag time was 51 h for zone 4 but at most 32 h for the other zones. Thus, the correlation between the flood peak and the rainfall in zone 4 was always weak while that in zone 3 was always strong. This finding notes that the distance between the rainfall location and the main stream had a strong effect on the peak discharge, and so the presence of the main river in the model is necessary for the simulation of the flood peak from rainfall.

5 Conclusions

Although previous studies have reported the response of flood peaks to the spatial distribution of rainfall in large and nonuniform basins, the importance of each characteristic is still poorly understood. Thus, this study aimed to determine which characteristics are important for the response of the flood peak. The Yom River basin was chosen for this study because it has clearly different characteristics between the upstream forested mountainous region with a small surface water storage and the downstream agricultural plain area with a large surface water storage. Moreover, since there are no large-scale control structures, the flow in this basin is nearly natural. The study area contains the city of Sukhothai, which is well known for being inundated by floods.

Floods in the Sukhothai area were simulated from rainfalls with various spatial distributions using the HEC-HMS and HEC-RAS models. The area which contributed the water to the Sukhothai city was divided into 8 subareas for routing the rainfall to the main stream, the Yom River, and the rainfalls were spatially distributed into four zones. Zones 1 and 2 represented the mountainous region with a high terrain steepness, high soil percolation rate and small surface water storage. Zone 3 was a plain area with a low steepness, low soil percolation rate and large surface water storage, while zone 4 was similar to zone 3 but was farther from the main stream. For each of 2-, 5- and 10-y rainfalls with 24-, 48- and 72-h durations, 1000 rainfall simulations with various spatial distributions were performed. The rainfall amounts in each of these four zones were spatially randomized according to the observed means, standard deviations and spatial correlations from a Monte Carlo analysis and Cholesky randomization. The rainfall tended to concentrate over the downstream part rather than the upstream part of the study area, and the 48- and 72-h rainfalls tended to concentrate upstream more than the 24-h rainfalls.

The simulations revealed that the flood peaks in the city of Sukhothai from a 24-h (short-duration) rainfall were usually low but highly variable, while the flood peaks from the 48- and 72-h (long-duration) rainfalls were high and fairly uniform. The correlation between the flood peak and rainfall in each zone suggested that a short-duration rainfall produced a high flood peak if it was concentrated over the upstream area (zones 1 and 2), while a long-duration rainfall produced a high flood peak if it was concentrated over the joint area (zones 2 and 3). The flood peak from a 24-h rainfall was usually low because this rainfall period is shorter than the concentration time; furthermore, 24-h rainfalls were rarely concentrated over the upstream area. Conversely, the flood peaks from 48 to 72-h rainfalls were high because the rainfall duration was similar to the concentration time, and these rainfalls tended to concentrate over the joint area. Usually, a high flood peak is generated from a long-duration rainfall, but it is possible for short-duration rainfalls to produce an extremely high flood peak if they are concentrated over the upstream area.

The basin characteristics that affect the response of the flood peak to the spatial distribution of the rainfall varied and depended upon the rainfall duration and amount. For a short-duration event with a small amount of rainfall, the flood peak was influenced by the surface water storage of the rainfall location. For a short-duration event with a large amount of rainfall, the flood peak is influenced by the steepness of the rainfall location. For a long-duration rainfall, the flood peak was mainly influenced by the percolation rate; in addition, the surface storage and steepness can be influential depending on the rainfall amount, similar to the case of the short-duration rainfall. Apart from the surface storage, steepness and soil percolation rate, the flood peak responded to the distance from the rainfall location to the stream for both the short- and long-duration rainfalls. Therefore, the presence of the stream, especially a large stream, in the model is necessary for the flood peak simulation from the rainfall. Nevertheless, the channel roughness did not seem to influence the flood peak.

It should be noted that the influence of basin characteristics can vary upon the area and the result of the study strongly depends on the model parameterization. Therefore, the parameters should be carefully determined and the model validation is required from upstream to downstream areas, not only at the target area. For our study, because of long and narrow basin shape with a main river following the length direction, the parameterization in 2 parts (upstream mountainous region and downstream plain area) can make the model valid. For the complicate case when the area is wide, has many river branches and has variety of characteristics, the parameterization may be required in many parts of the area until the model is valid.