1 Introduction

There are a lot of parameters in the conceptual hydrological models (CHM), and parameters value will influence the forecast effect directly. Manual parameters calibration of CHM requires great experience and time of the operator. The development of computer skills during the last decades has contributed to implementation of the automatic calibration (Dakhlaoui et al. 2012). Most parameters of CHM have clear physics meaning, which, in principle, can directly determinate the values of parameters. However, due to the lack of measurement and experiment of all facts in the process of rainfall-runoff formation, in reality, we can merely deduce the parameter value by system identification methods. The definition of the value still faces great difficulties because of the huge amount of parameters while little information. In practice, we usually work it out by concept analysis, which means we can get the preliminary range of the values according to measurement and physics meaning, then input them and gain the output through model calculation, compare the output progress with the measurement and optimize it to decide the optimal value of the parameters under specific condition.

Traditional calibration of CHM mainly is done by considering single objective. The optimal methods include the Rosenbrock’s method (Rosenbrock 1960), shuffled complex evolution (SCE-UA) algorithm (Wang et al. 2013), genetic algorithm (GA) (Sahay 2012), particle swarm optimization (PSO) (Afshar et al. 2013), and so on. However, the engineering practice shows that the traditional calibration scheme is hard to reflect the dynamic behavior characteristics of the hydrologic system because the parameter optimization calibration only considers one aspect of the features of hydrological process (Vrugt et al. 2003a). Therefore, it is necessary to develop the multi-objective model calibration of CHM so that different aspects of the system characteristics can take a full consideration. Multi-objective optimization method will produce a series of optimization schemes, which will not only allow an analysis of the trade-offs among the different objective functions but also enable hydrologists to better understand the limitations of current hydrologic model structure.

In recent years, researchers have begun to apply multi-objective evolutionary algorithms to multi-objective model calibration problems and achieved various degrees of success. Yapo et al. (1998) demonstrated that the multi-objective calibration approach is practical and relatively simple to implement and can help to better understand the different aspects of CHM, see also Cheng et al. (2002); Deckers et al. (2010); Dumedah et al. (2010); Dumedah (2012); Boyle et al. (2013). However, the optimization problem quickly becomes a decision-making problem because the multi-objective functions have been incorporated in CHM, and the users of hydrological model calibration have to face the task of selecting a set of suitable model parameters from the numerous Pareto-optimal sets (Khu and Madsen 2005).

In this paper, we propose a multi-objective cultural self-adaptive electromagnetism-like Mechanism (MOCSEM), which is an improvement over the Electromagnetism-like Mechanism (EM) algorithm. EM algorithm is proposed by Birbil and Fang (2003) and has been proven to be a flexible and effective global optimization algorithm for solving various optimization problems successfully (Chang et al. 2009; Wei et al. 2012). The method is first implemented in solving CHM problem. In MOCSEM, a self-adaptive parameter is applied in local search operation for adjusting the values of parameters dynamically to help the proposed method escape from local Pareto optimal front. In addition, cultural algorithm (CA) is adopted to keep a good diversity and uniformity of Pareto-optimal solutions (POS). MOCSEM is tested, firstly, by several benchmark test problems and compared with some previous research. After that, a case study is implemented for parameter calibration of a CHM by comparing the properties of POS obtained by MOCSEM method and those of the state-of-art algorithm SPEA2 (Zitzler et al. 2001), MOSCEM-UA (Vrugt et al. 2003a), and MOSCDE (Guo et al. 2013). Finally, the Fuzzy Technique for Order Preference by Similarity to an Ideal Solution (fuzzy TOPSIS) method has been used to rank the POS and select the optimal scheme.

This paper is organized as follows: Section 2 we briefly describe the CA scheme, detail the procedure of MOCSEM and fuzzy TOPSIS method. Section 3 introduces the multi-objective parameter calibration; in Section 4, the performance of the MOCSEM is first tested, and the computational results of a practical CHM problem are shown; while conclusions are made in Section 5.

2 Methodology

2.1 Framework of Cultural Algorithm (CA)

Cultural algorithm (CA), proposed by Reynolds in 1994, is a computational model inspired by the principle of cultural evolution that exceeds the rate of biological evolution based upon a process of dual inheritance (Reynolds 1994). The cultural evolution consists of three major processes: individuals’ evolution in the population space, macro-evolutionary in the belief space and the communication between the population space and belief space. In CA, population space consists of a set of individuals, one of which corresponds to a solution of the problem, and it can be optimized by using any population-based algorithm, such as genetic algorithms (Reynolds 1994), differential evolution (Qin et al. 2010), particle swarm optimization (Zhang et al. 2012). As the evolutionary process unfolds, the belief space is a collectively information library where the knowledge acquired from individuals is stored. To communicate knowledge throughout the two spaces, a communication channel is established. In belief space, the function update() is the function to refine population experience by classifying the belief space. The belief space is established by using the function accept() to collect the population experience from the top individuals of the population space. In turn, the cultural knowledge controls the evolution of the population by a function influence() to improve the algorithm’s searching efficiency. The function objective() is used to evaluate the fitness of every individual and the function select() selects the individuals for next generation evolved by function generate(). The framework of cultural algorithm is illustrated by (Qin et al. 2010).

2.2 Electromagnetism-Like Mechanism (EM) Algorithm

EM algorithm is a population-based stochastic algorithm which is quite simple, strong robust, significantly fast and effective. EM originates from the electromagnetism theory of physics by considering each sample point as a charged particle spread around the solution space (Tsou and Kao 2008). The fundamental procedures of EM include initialization of population, local search, calculation of total force, and movement of particles. The basic strategy of EM is detailed by Birbil and Fang (2003).

2.3 MOCSEM

As mentioned earlier, EM is primordially proposed for solving a single objective optimization problem. Thus, EM needs to make some modifications of its operations for dealing with multi-objective problems (MOP). Hence, in MOCSEM, we mainly focus on the preservation of POS, the modification of EM’s operations and avoiding premature convergence effectively to try to achieve a successful application in dealing with MOP.

2.3.1 Knowledge Structures Defined in Belief Space

In Cultural Algorithm framework, There are at least five basic categories of cultural knowledge such as situational, normative, history or temporal, domain and topographic knowledge (Saleem 2001). In MOCSEM, we redefine three knowledge structures (Situational Knowledge, Normative Knowledge and History Knowledge). These knowledge structures are introduced as follows:

  1. (1)

    Situational Knowledge

    Situational knowledge is composed of outstanding individuals along the population evolution process. These individuals are nearest to the true Pareto optimal front of multi-objective optimization. The structure of the redesigned situational knowledge is [P 1,…,P i ,…,P NQ ]. P i is an outstanding Pareto solution; i is the index of the generation, and NQ is the size of situational knowledge.

    Because of the limitation of computation source, the size N Q of situational knowledge is usually a constant. Crowding distance, which was introduced by Deb et al. (2002) is often used to calculate the sharing fitness of POS. Moreover, the following updating strategy is implemented to maintain the size of situational knowledge while keeping POS spreading uniformly: (1) if situational knowledge is empty, P i will be joined in situational knowledge directly; (2) if P i is not dominated by any individual in situational knowledge, the individuals dominated by P i are deleted, and P i will be joined in situational knowledge; (3) if the number of POS is greater than N Q , the cut operation is executed to reject redundant individuals according to the crowding distance.

  2. (2)

    Normative Knowledge

    Normative knowledge saves the feasible region of decision variables on each dimension where POS have been found. The structure of normative knowledge is [L 1,…,L i ,…,L NQ ; U 1,…,U i ,…,U NQ ]. L j and U j are the minimum and maximum values on the i-th dimension of the individuals. When the boundary is acquired from situational knowledge, normative knowledge needs to be reformed as the situational knowledge has changed. The upper and lower limits of individuals in new situational knowledge are introduced as new normative knowledge. When the original variables’ boundaries are replaced by normative knowledge, the searching direction may be directed toward the optimization direction. The strategy can be shown as follow:

    $$ {X}_k^i=\left\{\begin{array}{ll}\begin{array}{c}\hfill {L}_k,\kern1em if\ {X}_k^i\prec {L}_k\hfill \\ {}\hfill {U}_k,\kern1em if\ {X}_k^i\succ {U}_k\hfill \end{array}\hfill & \left(k=1,2,\cdots, D\right)\hfill \end{array}\right. $$
    (1)

    Where D is the dimensionality of decision variable X i; X i k denotes the k-th of the X i; U k and L k are the upper and lower bound of X i k , respectively.

  3. (3)

    History Knowledge

    This knowledge source is used to monitor searching process of evolution algorithm and keep the distribution characteristics of decision variables. The structure of history knowledge is [S 1,…,S i ,…,S NQ ]. S i is a convergence performance metric, which evaluates convergence of current POS to a reference set. We calculate S k as follow:

    $$ {S}_k=\sqrt{\frac{1}{N_Q}{\displaystyle \sum_{i=1}^{N_Q}{\left(\frac{X_k^i-\overline{X_k}}{U_k-{L}_k}\right)}^2}} $$
    (2)

    Where \( \overline{X_k} \) is the average value of the k-th dimension variables. For updating the history knowledge, S k of k-th dimensions in situational knowledge is recalculated when situational knowledge and normative knowledge are evolved.

2.3.2 Modification of EM Operators

  1. (1)

    Local search

The procedure of local search is going to move the points of solutions toward the local minimums that are near them. The method used in this procedure is very simple. In this paper, a novel self-adaptive mechanism is added to the local search operation, which improves the accuracy of solution and avoids the premature convergence. The modification of this step revises the evolution step of EM, as follow:

$$ {X}_k^i=\left\{\begin{array}{lll}{X}_k^i+\lambda \cdot decaypara(g)\cdot \left( \max \left({U}_k-{L}_k\right)\right)\hfill & \hfill & if\kern0.5em rnd\left(\right)>0.5\hfill \\ {}{X}_k^i-\lambda \cdot decaypara(g)\cdot \left( \max \left({U}_k-{L}_k\right)\right)\hfill & \hfill & otherwise\hfill \end{array}\right. $$
(3)

Where δ is local search parameter; λ is a uniform distributed random parameter; rnd() is a uniform random number between [0, 1]; decaypara(g) is the g-th self-adaptive function; g is the index of the generation:

$$ decaypara(g)=\left\{\begin{array}{l}\begin{array}{cc}\hfill \delta \cdot \exp \left(-\alpha \cdot count\cdot g/G\right)\hfill & \hfill if\kern0.5em count>r\hfill \end{array}\hfill \\ {}\begin{array}{cc}\hfill \delta \cdot \exp \left(-g/G\right)\hfill & \hfill otherwise\hfill \end{array}\hfill \end{array}\right. $$
(4)

Where G denotes the total evolution number; α is the self-adaptive parameter; count and r are the number and threshold value of stagnation, respectively.

  1. (2)

    Calculation of total force

In general, MOP don’t have a single solution that could optimize all objectives simultaneously. To solve this problem, the calculation for the charge q i of i-th particle is revised by Tsou and Kao (2008). The technology determines q i of X i by its minimum distance to the non-dominated front:

$$ {q}^i= \exp \left(-D\frac{ Aprox\left({X}^i\right)- Aprox\left({X}^{best}\right)}{{\displaystyle \sum_{k=1}^N\left( Aprox\left({X}^k\right)- Aprox\left({X}^{best}\right)\right)}}\right),\forall i $$
(5)

Where Aprox(X i) is the minimum distance of X i from belief space; X best is the nearest one to the belief space, as follows:

$$ Aprox\left({X}^i\right)=\underset{Y\in Qset}{ \min}\left(\left\Vert f\left({X}^i\right)-f(Y)\right\Vert \right) $$
(6)
$$ {X}^{best}= \arg \min \left( Aprox\left({X}^i\right)\right),\kern1em i=1,2,\dots, N $$
(7)

Where f(X i) is the objective function value of X i. The dominance relationship of two particles can be judging by the distance of X i to belief space: (1) if Aprox(X j) < Aprox(X i), X i is attracted by X j; (2) if Aprox(X j) > Aprox(X i), the relationship change. The function for calculate total force F i is modified to the following equation.

$$ {F}^i={\displaystyle \sum_{j\ne i}^N\left\{\begin{array}{ll}\left({X}^j-{X}^i\right)\frac{q^i{q}^j}{{\left\Vert {X}^j-{X}^i\right\Vert}^2}\hfill & if\kern0.5em Aprox\left({X}^j\right)< Aprox\left({X}^i\right)\hfill \\ {}\left({X}^i-{X}^j\right)\frac{q^i{q}^j}{{\left\Vert {X}^j-{X}^i\right\Vert}^2}\hfill & if\kern0.5em Aprox\left({X}^j\right)> Aprox\left({X}^i\right)\hfill \end{array}\right\},\forall i} $$
(8)

2.3.3 Procedures of MOCSEM

The flow chart of MOCSEM is shown as Fig. 1.

Fig. 1
figure 1

The flow chart of MOCSEM

2.4 Fuzzy Technique for Order Preference by Similarity to an Ideal Solution (Fuzzy TOPSIS)

For selecting a set of suitable model parameters from Pareto-optimal sets, fuzzy TOPSIS (Baykasoglu et al. 2013) method has been implemented to sort the schemes of solution set. TOPSIS, proposed by (Hwang and Yoon 1981), is an ordering method according to proximity of evaluation objects and idealized goal. There are two types of ideal solutions, the positive ideal solution (or optimal solution) and negative ideal solution (or worst solution). The chosen one of the evaluation objects should be the nearest to positive ideal solution and is far away from negative ideal solution. To adapt the fuzzy set, Hausdorff distance (Rucklidge 1997) is used to measure the proximity of the evaluation objects to the idealized goal. The detailed steps of fuzzy TOPSIS are as follows:

  1. Step1

    Synthesize attribute weights \( \tilde{\omega} \) to decision matrix \( \tilde{R} \), and structure weight normalized matrix.

    $$ \tilde{D}=\tilde{\omega}\otimes \tilde{R}=\left[\begin{array}{cccc}\hfill {\tilde{d}}_{11}\hfill & \hfill {\tilde{d}}_{12}\hfill & \hfill \cdots \hfill & \hfill {\tilde{d}}_{1n}\hfill \\ {}\hfill {\tilde{d}}_{21}\hfill & \hfill {\tilde{d}}_{21}\hfill & \hfill \cdots \hfill & \hfill {\tilde{d}}_{2n}\hfill \\ {}\hfill \vdots \hfill & \hfill \vdots \hfill & \hfill \vdots \hfill & \hfill \vdots \hfill \\ {}\hfill {\tilde{d}}_{m1}\hfill & \hfill {\tilde{d}}_{m2}\hfill & \hfill \cdots \hfill & \hfill {\tilde{d}}_{mn}\hfill \end{array}\right]\begin{array}{cc}\hfill, \hfill & \hfill {\tilde{d}}_{ij}={\tilde{\omega}}_j{\tilde{r}}_{ij}\hfill \end{array} $$
    (9)

    Where \( {\tilde{r}}_{ij} \) is j-th attribute value of i-th solution; \( {\tilde{d}}_{ij} \) is the normalized value of i-th solution on j-th attribute; \( {\tilde{\omega}}_j \) is weight value of i-th attribute.

  2. Step2

    Determine the fuzzy ideal solution and fuzzy negative ideal solution by using fuzzy maximum set and fuzzy minimum set.

    Fuzzy ideal solution \( {\tilde{C}}^{+} \) and fuzzy negative ideal solution \( {\tilde{C}}^{-} \) are defined as follows:

    $$ {\tilde{C}}^{+}=\left({\tilde{C}}_1^{+},{\tilde{C}}_2^{+},\cdots, {\tilde{C}}_n^{+}\right) $$
    (10)
    $$ {\tilde{C}}^{-}=\left({\tilde{C}}_1^{-},{\tilde{C}}_2^{-},\cdots, {\tilde{C}}_n^{-}\right) $$
    (11)

    Where \( {\tilde{C}}_j^{+} \) is the fuzzy maximum set on the j-th attribute.

    $$ {\tilde{C}}^{+}=\underset{j}{ \max}\left\{{\tilde{d}}_{ij}\right\} $$
    (12)

    Its subordinating degree function is formulated as:

    $$ {\mu}_{{\tilde{C}}^{+}}(x)=\underset{x= \max \left({x}_1,{x}_2\cdots {x}_m\right)}{ \sup } \min \left\{{\mu}_{{\tilde{d}}_{1j}}\left({x}_1\right),{\mu}_{{\tilde{d}}_{2j}}\left({x}_2\right),\cdots, {\mu}_{{\tilde{d}}_{mj}}\left({x}_m\right)\right\} $$
    (13)

    Where \( {\tilde{C}}_j^{-} \) is the fuzzy minim set on the j-th attribute.

    $$ {\tilde{C}}^{-}=\underset{j}{ \min}\left\{{\tilde{d}}_{ij}\right\} $$
    (14)

    Its subordinating degree function is formulated as:

    $$ {\mu}_{{\tilde{C}}^{+}}(x)=\underset{x= \max \left({x}_1,{x}_2\cdots {x}_m\right)}{ \sup } \min \left\{{\mu}_{{\tilde{d}}_{1j}}\left({x}_1\right),{\mu}_{{\tilde{d}}_{2j}}\left({x}_2\right),\cdots, {\mu}_{{\tilde{d}}_{mj}}\left({x}_m\right)\right\} $$
    (15)
  3. Step3

    Calculate fuzzy distance scale between \( {\tilde{C}}^{+} \) and \( {\tilde{C}}^{-} \).

    A i is the i-th alternative in solution set, its Hausdauff distances to \( {\tilde{C}}^{+} \) and \( {\tilde{C}}^{-} \), \( {\tilde{d}}_i^{+} \) and \( {\tilde{d}}_i^{-} \) respectively, are calculated as:

    $$ {\tilde{d}}_i^{+}={\displaystyle \sum_{j=1}^n{d}_{\lambda}\left({\tilde{d}}_{ij},{\tilde{C}}_j^{+}\right)=}{\displaystyle \sum_{j=1}^n{\vee}_{\lambda \in \left[0,1\right]}\lambda \vee \left(\left|{\tilde{d}}_{ij}^L-{\tilde{C}}_j^{+L}\right|,\left|{\tilde{d}}_{ij}^R-{\tilde{C}}_j^{+R}\right|\right)} $$
    (16)
    $$ {\tilde{d}}_i^{-}={\displaystyle \sum_{j=1}^n{d}_{\lambda}\left({\tilde{d}}_{ij},{\tilde{C}}_j^{-}\right)=}{\displaystyle \sum_{j=1}^n{\vee}_{\lambda \in \left[0,1\right]}\lambda \vee \left(\left|{\tilde{d}}_{ij}^{-L}-{\tilde{C}}_j^L\right|,\left|{\tilde{d}}_{ij}^R-{\tilde{C}}_j^{-R}\right|\right)} $$
    (17)
  4. Step4

    The close degree d i of alternatives in solution set can be obtained by follow:

    $$ {d}_i=\frac{{\tilde{d}}_i^{-}}{{\tilde{d}}_i^{-}+{\tilde{d}}_i^{+}} $$
    (18)

    Then the schemes of solution set can be descending sorted by the value of their d i .

3 Study Area and Hydrologic Model

To implement a multi-objective parameters calibration of CHM, there are many selections of: a multi-objective algorithm to search the parameter space; a decision-making method to choose a scheme from solution set; a CHM; a period of historical data against which to calibrate CHM; multiple objective functions used to represent different characteristics of the runoff hydrograph and multiple evaluation indexes introduced to measure the merits of schemes. The multi-objective algorithm and decision-making optimization method are detailed in the previous section, and the other selections will appear in the following sections.

3.1 Selections of Study Area and the Data Used

Leaf River, a principal tributary of Pascagoula River, is located at 42°7′35″N and 89°24′11″W (42.126350, −89.402976). Pascagoula River is a river, about 80 miles long, in southeastern Mississippi in the United States, which flows into Mississippi Sound of Gulf of Mexico. Leaf River basin, draining an area of about 1,944 km2, belongs to the typical wet basin. A long series of hydrological data about 11 years (30 September 1952 to 30 September 1962), which are investigated intensively in previous studies (Vrugt et al. 2003a, b; Blasone et al. 2008), are used for model parameter calibration. To eliminate the influence of initial condition, we set a 65-day warm-up period before the model calibration.

3.2 Selections of Conceptual Hydrological Model

The hydrological model (HYMOD), a five-parameter conceptual rainfall-runoff model, is used to illustrate the advantage of MOCSEM algorithm for multi-objective parameter calibration. In a catchment, the five parameters of HYMOD include maximum storage capacity Cmax, degree of spatial variability of soil moisture capacity bexp, factor distributing the flow between the two series of reservoirs Alpha, and residence times of the linear quick and slow response reservoirs Rq and Rs, respectively. This five parameters is the decision variable of multi-objective parameters calibration. HYMOD consists of a simple rainfall excess model and two series of linear reservoirs: three identical reservoirs for quick flow response and a single reservoir for slow flow response. This model, described in detail by Moore (1985), and recently used by Guo et al. (2013).

3.3 Selections of the Objective Functions

In general terms, to simulate the hydrological behavior of the catchment as closely as possible, different characteristics of the runoff hydrograph should be considered in a multi-objective framework. The following objective functions, mean squared logarithmic error (MSLE) (Hogue et al. 2000; Guo et al. 2013) and mean fourth-power error (M4E) (de Vos and Rientjes 2008), are selected during the optimization process for HYMOD model:

$$ \mathrm{MSLE}=\sqrt{\frac{1}{T}{\displaystyle \sum_{t=1}^T{\left( \log {Q}_{t, sim}- \log {Q}_{t, obs}\right)}^2}} $$
(19)
$$ \mathrm{M}4\mathrm{E}=\frac{1}{T}{\displaystyle \sum_{t=1}^T{\left({Q}_{t, sim}-{Q}_{t, obs}\right)}^4} $$
(20)
$$ Q=\operatorname{HYMOD}\left({\mathrm{C}}_{\max },{\mathrm{b}}_{\exp },\mathrm{Alpha},{\mathrm{R}}_{\mathrm{q}},{\mathrm{R}}_{\mathrm{s}}\right) $$
(21)

Where T is the number of the samples; Q t,sim and Q t,obs are the simulated and observed runoff value of t-th sample, respectively; the range of Cmax, bexp, Alpha, Rq and Rs are [1.0,500.0], [0.1,2.0], [0.1,0.99], [0, 0.1], [0.1,0.99].

Because of the logarithmic transformation, MSLE function emphasizes fitting of low flows. M4E function pays more emphasizes on high flows events. It means trade-offs exist between MSLE and M4E. Therefore, MSLE and M4E are adopted as the objective functions for the multi-objective parameter optimization of a hydrological model.

3.4 Selections of the Multiple Evaluation Indexes

In this respect, selections of multiple evaluation indexes are intended to measure the solution quality. In order to obtain an optimal scheme, it is significative to formulate the multiple evaluation indexes that reflect different characteristics of Leaf River. The following four evaluation indexes (MSLE, M4E, coefficient of determination (R2) and qualified rate (QR)) are used here for this study. MSLE and M4E are appeared in the previous section.

R2, another index of how well parameters calibration using MOCSEM, is computed as:

$$ {\operatorname{R}}^2=1-\frac{ Dos}{ Dom}=1-\frac{{\displaystyle \sum_t^T{\left({Q}_{t, obs}-{Q}_{t, sim}\right)}^2}}{{\displaystyle \sum_t^T{\left({Q}_{t, obs}-{Q}_{t, obs}^{ave}\right)}^2}} $$
(22)

Dos is the deviations of observations from forecasted runoff values; Dom means the deviations of observations from their mean values.

QR, the fourth index, is formulated as follow:

$$ \operatorname{QR}=\frac{M}{T}\times 100\% $$
(23)

M said the number of the eligible sample, whose relative error of runoff is less than 20 %.

4 Result and Decision Analysis

4.1 Numerical Simulation

4.1.1 Test Functions and Performance Measures

In this paper, the optimization performance of MOCSEM is measured by adopting four well known test functions (denotes as ZDT2, ZDT3, ZDT4 and ZDT6) of the Zitzler–Deb–Thiele (ZDT) series. It is different to the main properties of their true optimal Pareto fronts, and the true fronts of these functions are known. These advantages for testing the optimization ability of MOCSEM are expatiated by Deb et al. (2002).

Generally, there are two types of measures in multi-objective optimization: convergence measure and diversity measure, for verifying the performance of multi-objective optimization algorithms. This paper applies two widely used measures: Convergence metric CM and Diversity metric DM (Deb et al. 2002), and their expressions are as follows:

$$ \operatorname{CM}={\displaystyle {\sum}_{i=1}^n{D}_i}/n $$
(24)
$$ \operatorname{DM}=\frac{d_f+{d}_l+{\displaystyle {\sum}_{i=1}^{n-1}\left|{d}_i-{d}_{ave}\right|}}{d_f+{d}_l+\left(n-1\right){d}_{ave}} $$
(25)

where n is the number of POS, D i is the minimum Euclidean distance between the i-th optimal solution and its corresponding point in true Pareto optimal front, d i is the Euclidean distance between the i-th and the i + 1-th solution, and d ave is the average distance of all d i , d f and d l are the Euclidean distance between upper and lower boundary solutions and its corresponding boundary points in true Pareto optimal front. From the description mentioned above, the desired value for CM is zero, which means obtained POS coincide with true Pareto optimal front perfectly. Meanwhile, the obtained POS will distribute equidistantly if the value of DM is zero.

4.1.2 Experimental Results and Comparison

To solve the four test functions, main parameters of MOCSEM are set as follows in this paper: the population size N = 100, the size of belief space N Q  = 50, the maximum number of generation G is selected as 2000, the maximum iteration number for local search operation L max is set as 10, threshold value of stagnation r is set as 5, the local area parameter δ = 0.1 and the self-adaptive parameter α is selected as 0.8955. Moreover, trying to prove the optimizing performance of the proposed MOCSEM, the experimental results of these four test functions obtained by NSGA-II (Deb et al. 2002), SPEA2 (Zitzler et al. 2001), ADEA (Qian and Li 2008) and MOSCDE (Guo et al. 2013).

Figure 2 and Table 1 show the optimal solutions and its technical indexes, the CM and DM, obtained by MOCSEM. Meanwhile, the true Pareto optimal fronts of four ZDT test functions are also displayed in these figures to check the properties of the proposed method. Moreover, the mean and variance values of the technical indexes, which are acquired averaged over 10 runs by MOCSEM and other comparative algorithms, are displayed in Table 1. Among, the mean and variance values are shown in upper and lower rows respectively, and the value which is smaller than 10−6 will be denoted as 0.

Fig. 2
figure 2

Pareto frontiers for ZDT2, ZDT3, ZDT4 and ZDT6

Table 1 Statistics of results on CM and DM

From Fig. 2, we can see that the results acquired by MOCSEM are excellent for the four test problems. Because the results attain the true Pareto optimal front accurately. While the convergence and distribution performance of results converges well to the true Pareto frontiers and distributes uniformly.

From Table 1, MOCSEM can achieve better performance than other comparative methods for these ZDT test problems except ZDT3. About ZDT3, the result of MOCSEM is better than NSGA-II, SPEA2 and ADEA, but the result of MOSCDE is equally good on this problem. Besides, NSGA-II and SPEA2 may have some difficulties in dealing with ZDT2 and ZDT6 while MOCSEM has much better convergence performance for these two test problems.

All in all, MOCSEM can obtain excellent convergence performance and diversity property compared to other comparative methods.

4.2 Application to Multi-Objective Parameter Optimization of HYMOD

4.2.1 Parameter Settings of the Algorithms

MOCSEM is implemented to solve the multi-objective parameters calibration problems of CHM, and parameters settings of MOCSEM are as follows: D = 5, N y  = 2, N = 100, the size of belief space N Q  = 30, G is selected as 1000, LocalNum is set as 100, r is set as 5, δ = 0.29 and α is selected as 0.8955. To verify the effectiveness of MOCSEM, the famous SPEA2, MOSCEM-UA and MOSCDE method is implemented with solving the same case. The parameter settings of the remnant algorithms are done according to Guo et al. (2013).

4.2.2 Results and Discussion

In this section, MOCSEM is implemented to solve model parameters calibration problem of HYMOD. Figure 3 displays the optimal solutions obtained by the algorithms: MOCSEM, MOSCDE, SPEA2 and MOSCEM-UA.

Fig. 3
figure 3

The set of non-dominated solutions generated by MOCSEM, MOSCDE, SPEA2 and MOSCEM-UA methods

From Fig. 3, the POS verifies that it is hardly to get an optimal solution for MSLE and M4E corresponding to the lower and higher runoff respectively. When comparing Fig. 3, it is obvious that MOCSEM can generate more uniform Pareto solutions than SPEA2, MOSCEM-UA and MOSCDE. The ranges of the MSLE and M4E obtained by MOCSEM are [0.4288, 0.5069] and [301.58, 403.92] which is a broader scale than MOSCDE and other methods. In addition, we can see that although a small part of the solutions are missed by MOSCEM-UA and SPEA2. The MSLE value of these solutions is in the range [0.465, 0.480]. This is because the belief space is a collectively information library where the knowledge acquired from the comprehensive top individuals of the population space is stored. Moreover, the self-adaptive local search is used for escaping the local minimums of the problem. In Fig. 3d, we can infer that M4E reduces faster when the value of MSLE is increasing in the range from 0.4288, 0.45. While, the value of MSLE mushroom within the range [0.45, 0.50] as the value of M4E is reducing from 310 to 301.

4.3 Optimal Selection of Calibrated Parameters Schemes

Moreover, to clearly measure the solution quality of runoff forecasting and to further verify the trade-off relationship between objective functions MSLE and M4E in parameter calibration of HYMOD, we use fuzzy TOPSIS method in two types of decision-making scenarios, emphasizing low flows (scenario 1) and emphasizing peak flows (scenario 2). The above mentioned four evaluation indexes (MSLE, M4E, R2 and QR) of calibration solutions obtained by MOCSEM are calculated here and displayed in the Table 2 for this study. According to two different decisions, various attributes predilections of the experts are determined to evaluate the importance of each attribute. Table 3 illuminates the predilection of five specialists for the four evaluation indexes and specifies the specialist weights, objective weights and comprehensive weights of this four indexes on the two decision-making scenarios. Then the schemes in Table 2 are graded and sorted using fuzzy TOPSIS method. The sequences are shown in Table 4, and the chosen two schemes are marked by bold in Table 2. The parameters values (Cmax, bexp, Alpha, Rq, Rs) of this two schemes, scheme 11 and scheme 45, are (215.4852, 0.2559, 0.7819, 0.0024, 0.4586) and (210.7721, 0.2235, 0.9900, 0.0000, 0.4700) respectively.

Table 2 Four evaluation indexes value of calibration solutions obtained by MOCSEM
Table 3 the predilection of experts and weights for the indexes on two decision-making scenarios
Table 4 Sequencing optimization result of scheme set

From Table 4, in scenario 1, the evaluation will be more focused on MSLE, the rank of schemes is generally following the change trend of MSLE. On another hand, the evaluation optimization is emphasizing M4E to consider peak flows in scenario 2. The pros and cons of sorting result for schemes are based on the value of M4E. Except the inclination, the choice of schemes is also influenced by three other properties. This two scenarios are the extreme situations, and the weights of four decision index will be changed to adapt to different forecast premises. Besides, to specifically display the performance of runoff forecasting of the chosen schemes on objective functions MSLE and M4E in parameter calibration of HYMOD model, the calibration results associated with the two solutions are plotted in Fig. 4 for a 300-day portion of the calibration period (21 May 1960 to 16 March 1961) which contain low flows period and peak flows period. The Dev value is the difference between runoffs of scheme 11 minus runoffs of scheme 45.

Fig. 4
figure 4

Calibration results of two extreme Pareto solutions for a 300-day portion of the calibration period (21 May 1960 to 16 March 1961)

The above results indicate that expert’s subjective preferences and objective decision-making information can be embodied in the parameters scheme as well. The parameters schemes calibrated by purposed method can provide a feasible set of model parameters to policymakers for different operating situations of CHM.

5 Conclusions

Parameters calibration of hydrological model is one of the most important works in the field of hydrology. To obtain a successful parameters calibration, in this paper, we propose a multi-objective cultural self-adaptive electromagnetism-like Mechanism (MOCSEM) algorithm, which is first implemented in solving the parameters calibration problem of the conceptual hydrological model (CHM). In this algorithm, a self-adaptive parameter is applied in the local search operation for adjusting the values of parameters dynamically. Meanwhile, considering the complicated constraints and objectives of CHM problem, cultural algorithm (CA) is adopted to keep a good diversity and uniformity of POS. MOCSEM is tested, firstly, by several benchmark test problems and the results show that MOCSEM algorithm outperforms other algorithms proposed in previous research. After that, a case study is implemented for parameters calibration of a CHM by comparing the convergence properties and diversification of POS obtained by MOCSEM, SPEA2, MOSCEM-UA and MOSCDE method. Finally, when the optimization problem quickly becomes a decision-making problem because of the multiple objectives in CHM, fuzzy technique for order preference by similarity to an ideal solution (fuzzy TOPSIS) method has been used to rank the POS and select the optimal scheme on decision-making scenarios. It is found that MOCSEM algorithm can provide high-accuracy parameters of CHM on various decision-making scenarios.