1 Introduction

Portfolio selection aims at selecting a set of securities to optimize a performance measure under some specific constraints, particularly a budget constraint. Investors often want to obtain a most profitable return and avoid risk at the same time. However, how to construct a satisfying portfolio has become a challenging problem owing to the uncertainty associated with the returns of securities.

In real financial markets, there exist many non-random factors that affect portfolio decision-making such as economic, social, political, people’s psychological factors and so on. Especially, experts’ knowledge and experiences play important roles in portfolio decision-making. So, in many cases, the market data of risky assets are usually imprecise and ambiguous. Fuzzy set theory in Zadeh (1965) has been regarded as a powerful tool for describing an uncertain environment with vagueness, ambiguity or some other type of fuzziness, which appears in many aspects of financial markets, such as portfolio managers’ unpredictable behaviors. With the wide use of fuzzy set theory, more and more people have realized that they could use it to investigate the fuzzy uncertainty associated with financial markets. To model the fuzzy uncertainty associated with financial markets, the return of a risky asset is characterized by a fuzzy variable with possibility distribution. A possibility distribution is identified to reflect the returns of risky assets associated with possibility grades offered by portfolio experts. Tanaka et al. (2000) pointed out that fuzzy portfolio models integrated the historical data and human factors to catch variations of stock markets better. By using fuzzy mathematical approaches, quantitative analysis, qualitative analysis, the experts’ knowledge and the investors’ subjective opinions can be better incorporated into a portfolio selection model as mentioned by Wang and Zhu (2002).

Fuzzy portfolio selection problem has been researched from 1990s. Various researchers have investigated fuzzy portfolio selection problem by using different approaches and proposed kinds of portfolio optimization models. For example, Watada (1997) proposed a fuzzy portfolio selection model by using fuzzy numbers to represent decision makers’ aspiration levels for the expected return rate and a certain degree of risk. Inuiguchi and Tanino (2000) introduced a novel possibilistic programming approach to investigate portfolio selection problem based on the minimax regret criterion and yielded a distributive investment solution. By using fuzzy goal programming approach, Parra et al. (2001) presented a multiple criteria model for portfolio selection. León et al. (2002) discussed some fuzzy optimization schemes for managing portfolio selection problems under the framework of risk–return tradeoff. Bilbao-Terol et al. (2006) formulated a fuzzy compromise programming model for portfolio selection. Gupta et al. (2008) applied fuzzy mathematical programming to develop comprehensive models for asset portfolio optimization. Tiryaki and Ahlatcioglu (2009) used fuzzy analytic hierarchy process to investigate fuzzy portfolio selection problem. Calvo et al. (2014) proposed a fuzzy multi-criteria model for portfolio selection with non-financial goals. In Vercher et al. (2007), Zhang et al. (2007) and Zhang et al. (2009), possibility theory was applied to handle uncertainty and solve portfolio optimization problems. Huang (2008), Zhang et al. (2011), Li et al. (2012) and Gupta et al. (2013) used credibility theory to study portfolio selection problems in a fuzzy economic environment.

All the literature mentioned above are formulated under the framework of fuzzy set theory. They assume that investors can accurately predict the possibility distributions of the uncertain parameters on risky assets by historical data. However, in emerging markets or financial incidents, it is impossible for investors to predict the possibility distributions of the uncertain parameters on risky assets due to lack of historical information. In these cases, the investors have no choice but to predict the approximate ranges of the uncertain parameters on risky assets based on experts’ experiences and their own market sentiment. Nowadays, some researchers have used interval programming approaches to study fuzzy portfolio selection problems. Lai et al. (2002) developed an interval absolute semideviation model for portfolio selection. Wang and Zhu (2002) employed interval programming for portfolio selection. Ida (2003, (2004) discussed multi-objective portfolio selection problems with interval coefficients. Giove et al. (2006) formulated a minimax regret portfolio selection problem in which the prices of the securities were considered as interval variables. Based on fuzzy decision theory, Fang et al. (2006) proposed a linear interval programming model for the portfolio rebalancing problem with transaction costs. Li and Xu (2007) dealt with a possibilistic portfolio selection model with interval center values. Bhattacharyya et al. (2011) utilized the concept of interval numbers in fuzzy set theory to extend the classical mean–variance model into a mean–variance–skewness model with transaction cost for fuzzy portfolio selection. Liu (2011) discussed the uncertain portfolio selection problem where the returns of risky assets were represented by interval numbers. Wu et al. (2013) studied an interval portfolio selection model in which both the returns and the risks of risky assets were defined as interval numbers.

Though, there are some researches about portfolio selection problems in emerging markets, most of them are mainly focused on single period portfolio selection. In the real world, an investor’s behavior is usually multi-period. Thus, it is necessary to investigate multi-period portfolio selection problems in emerging markets. Recently, Liu et al. (2013) studied a multi-period portfolio selection problem in emerging markets by using interval programming approach. However, they still neglected many real constraints such as floor and ceiling constraints and cardinality constraints. As far as investors are concerned, cardinality constraints enable them to limit the complexity of a portfolio and control transaction costs. Bound constraints enable them to control the amount invested on each asset. Based on the facts mentioned above, we investigate multi-period portfolio selection problem with some real features in emerging markets by using interval programming approach. Our contributions can be summarized as the following three aspects: (1) We propose four interval portfolio optimization models with cardinality constraints for multi-period portfolio selection in emerging markets. To achieve greater flexibility in portfolio selection, we incorporate some decision criteria including return, transaction costs, risk, liquidity, diversification degree, bound constraints and cardinality constraints into the proposed models. (2) We first use the concepts of possibility degree of interval inequalities to express investors’ different risk attitudes and transform the proposed interval programming portfolio selection models into deterministic forms. (3) Considering the complexity of the proposed models, we design a new dynamic differential evolution algorithm with a self-adapting control parameter to solve them.

The remainder of this paper is structured as follows. In Sect. 2, we summarize some basic concepts about interval numbers. In Sect. 3, we propose four interval portfolio optimization models with cardinality constraints for multi-period portfolio selection in emerging markets. In Sect. 4, based on the definition of possibility degree for interval inequality, we transform the proposed interval programming models into the corresponding deterministic forms. Then, we design a new dynamic differential evolution algorithm with self-adapting control parameter to solve the deterministic models. In Sect. 5, we give a numerical example to demonstrate the applications of our models and the effectiveness of the designed algorithm. Finally, we present conclusions of our study in Sect. 6.

2 Basic conceptions

Denote the set of all real numbers by \({\mathbb {R}}\). An ordered pair in a bracket defines an interval number

$$\begin{aligned} {\widetilde{a}}=[{\underline{a}},{\overline{a}}]=\{x:{\underline{a}}\le x\le {\overline{a}}, x\in {\mathbb {R}}\}, \end{aligned}$$
(1)

where \({\underline{a}}\) and \({\overline{a}}\) are the lower and upper bounds of interval number \({\widetilde{a}}\). In the special case, when \({\underline{a}}={\overline{a}}\), \({\widetilde{a}}\) is reduced to a real number. The center and width of interval number \({\widetilde{a}}=[{\underline{a}},{\overline{a}}]\) are defined as

$$\begin{aligned} m({\widetilde{a}})=\tfrac{1}{2}({\underline{a}}+{\overline{a}})\quad \text {and} \quad w({\widetilde{a}})=\tfrac{1}{2}({\overline{a}}-{\underline{a}}). \end{aligned}$$
(2)

Using \(m({\widetilde{a}})\) and \(\omega ({\widetilde{a}})\), the interval number \({\widetilde{a}}\) can be rewritten as

$$\begin{aligned} {\widetilde{a}}&=\langle m({\widetilde{a}}),\omega ({\widetilde{a}})\rangle \\&=\{x:m({\widetilde{a}})-\omega ({\widetilde{a}})\le x\le m({\widetilde{a}})+\omega ({\widetilde{a}}), x\in {\mathbb {R}}\}. \end{aligned}$$

Notice that \(m({\widetilde{a}})\) is the crisp value of \({\widetilde{a}}\), which behaves like the expected value of a uniform distribution. \(\omega ({\widetilde{a}})\) is similar to the spread of a symmetrical fuzzy number. It represents the uncertainty associated with interval number \({\widetilde{a}}\) and behaves likes the fluctuation range of a random variable, which is determined by historical data and human’s subjective judgement in decision making.

Definition 1

(Alefeld and Herzberger (1983)). Let \(\circ \in \{+,-,\times ,\div \}\) be a binary operation on \({\mathbb {R}}\). For any given two interval numbers \({\widetilde{a}}\) and \({\widetilde{b}}\), the binary operation on them is defined by

$$\begin{aligned} {\widetilde{a}} \circ {\widetilde{b}}=\{x \circ y: x\in {\widetilde{a}}, y\in {\widetilde{b}}\}, \end{aligned}$$

where 0 is not in \({\widetilde{b}}\) for the case of division.

Let \({\widetilde{a}}=[{\underline{a}},{\overline{a}}]\) and \({\widetilde{b}}=[{\underline{b}},{\overline{b}}]\) be two interval numbers, and let \(\lambda \in {\mathbb {R}}\) be a real number. The binary operations on interval numbers used in this paper are given as follows (see Moore (1966))

$$\begin{aligned}&{\widetilde{a}}+{\widetilde{b}}=[{\underline{a}}+{\underline{b}},{\overline{a}}+{\overline{b}}],\end{aligned}$$
(3)
$$\begin{aligned}&\lambda {\widetilde{a}}= \left\{ \begin{array}{cc} {[\lambda {\underline{a}},\lambda {\overline{a}}]}, &{} \quad \text {if}\; \lambda \ge 0,\\ {[\lambda {\underline{a}},\lambda {\overline{a}}]}, &{} \quad \text {if}\; \lambda < 0, \end{array}\right. \end{aligned}$$
(4)
$$\begin{aligned}&{\widetilde{a}}\pm \lambda =[{\underline{a}}\pm \lambda ,{\overline{a}}\pm \lambda ],\end{aligned}$$
(5)
$$\begin{aligned}&{\widetilde{a}}\times {\widetilde{b}}=[\min \{{\underline{a}}{\underline{b}},{\underline{a}}{\overline{b}},{\overline{a}}{\underline{b}},{\overline{a}}{\overline{b}}\},\max \{{\underline{a}}{\underline{b}},{\underline{a}}{\overline{b}},{\overline{a}}{\underline{b}},{\overline{a}}{\overline{b}}\}]. \end{aligned}$$
(6)

Let \({\widetilde{a}}_{1}=[{\underline{a}}_{1},{\overline{a}}_{1}],{\widetilde{a}}_{2}=[{\underline{a}}_{2},{\overline{a}}_{2}],\ldots ,{\widetilde{a}}_{n}=[{\underline{a}}_{n},{\overline{a}}_{n}]\) be n interval numbers, and let \(\lambda _{1},\lambda _{2},\ldots ,\lambda _{n}\) be n nonnegative real numbers. Then, the linear combination of the n interval numbers can be expressed by

$$\begin{aligned} \lambda _{1}{\widetilde{a}}_{1}+\lambda _{2}{\widetilde{a}}_{2}+\ldots +\lambda _{n}{\widetilde{a}}_{n}= \left[ \sum \limits _{i = 1}^n \lambda _{i}{\underline{a}}_{i},\sum \limits _{i = 1}^n \lambda _{i}{\overline{a}}_{i}\right] . \end{aligned}$$
(7)

It follows from Eqs. (2) and (7) that

$$\begin{aligned} \lambda _{1}m({\widetilde{a}}_{1})+\lambda _{2}m({\widetilde{a}}_{2})+\ldots +\lambda _{n}m({\widetilde{a}}_{n})&=\sum \limits _{i = 1}^n\lambda _{i}m({\widetilde{a}}_{i}),\\ \lambda _{1}\omega ({\widetilde{a}}_{1})+\lambda _{2}\omega ({\widetilde{a}}_{2})+\ldots +\lambda _{n}\omega ({\widetilde{a}}_{n})&=\sum \limits _{i = 1}^n\lambda _{i}\omega ({\widetilde{a}}_{i}). \end{aligned}$$

Definition 2

(Ishibuchi and Tanaka (1990)). Let \({\widetilde{a}}\) and \({\widetilde{b}}\) be two interval numbers. Then, the interval inequality \({\widetilde{a}}\succeq {\widetilde{b}}\) is defined as follows

$$\begin{aligned} {\widetilde{a}}\succeq {\widetilde{b}}\; \text {if and only if}\; {\underline{a}}\ge {\underline{b}}\; \text {and}\; {\overline{a}}\ge {\overline{b}}. \end{aligned}$$

For describing the interval inequality above in detail, Jiang et al. (2008) gave the concept of possibility degree as follows.

Definition 3

For any given two interval numbers \({\widetilde{a}}=[{\underline{a}},{\overline{a}}]\) and \({\widetilde{b}}=[{\underline{b}},{\overline{b}}]\), the possibility degree of \({\widetilde{a}}\succeq {\widetilde{b}}\) (denoted by \(P({\widetilde{a}}\succeq {\widetilde{b}})\)) is defined as

$$\begin{aligned} P({\widetilde{a}}\succeq {\widetilde{b}})= \left\{ \begin{array}{ll} 1, &{} \quad \text {if}\; {\underline{a}}\ge {\overline{b}},\\ \frac{{\overline{a}}-{\overline{b}}}{{\overline{a}}-{\underline{a}}}+\frac{{\overline{b}}-{\underline{a}}}{{\overline{a}}-{\underline{a}}}\cdot \frac{{\underline{a}}-{\underline{b}}}{{\overline{b}}-{\underline{b}}}+\tfrac{1}{2}\cdot \frac{{\overline{b}}-{\underline{a}}}{{\overline{a}}-{\underline{a}}}\cdot \frac{{\overline{b}}-{\underline{a}}}{{\overline{b}}-{\underline{b}}}, &{}\quad \text {if}\; {\underline{b}}\le {\underline{a}}<{\overline{b}}\le {\overline{a}},\\ \frac{{\overline{a}}-{\overline{b}}}{{\overline{a}}-{\underline{a}}}+\tfrac{1}{2}\cdot \frac{{\overline{b}}-{\underline{b}}}{{\overline{a}}-{\underline{a}}}, &{}\quad \text {if}\;{\underline{a}}< {\underline{b}}<{\overline{b}}\le {\overline{a}},\\ \tfrac{1}{2}\cdot \frac{{\overline{a}}-{\underline{b}}}{{\overline{a}}-{\underline{a}}}\cdot \frac{{\overline{a}}-{\underline{b}}}{{\overline{b}}-{\underline{b}}}, &{} \quad \text {if}\;{\underline{a}}< {\underline{b}}\le {\overline{a}}<{\overline{b}},\\ \frac{{\underline{a}}-{\underline{b}}}{{\overline{b}}-{\underline{b}}}+\tfrac{1}{2}\cdot \frac{{\overline{a}}-{\underline{a}}}{{\overline{b}}-{\underline{b}}}, &{} \quad \text {if}\;{\underline{b}}\le {\underline{a}}<{\overline{a}}<{\overline{b}},\\ 0, &{} \quad \text {if}\; {\overline{a}}<{\underline{b}}. \end{array} \right. \end{aligned}$$

Especially, in Definition 3, if \({\widetilde{b}}\) is reduced into a real number b, then

$$\begin{aligned} P({\widetilde{a}}\succeq b)= \left\{ \begin{array}{ll} 0, &{} \quad \text {if}\; b>{\overline{a}},\\ \frac{{\overline{a}}-b}{{\overline{a}}-{\underline{a}}}, &{} \quad \text {if}\; {\underline{a}}<b\le {\overline{a}}, \\ 1, &{} \quad \text {if}\; b\le {\underline{a}}. \end{array} \right. \end{aligned}$$
(8)

If \({\widetilde{a}}\) is reduced into a real number a in Definition 3, then

$$\begin{aligned} P(a\succeq {\widetilde{b}})= \left\{ \begin{array}{ll} 0, &{} \quad \text {if}\; a<{\underline{b}},\\ \frac{a-{\underline{b}}}{{\overline{b}}-{\underline{b}}}, &{} \quad \text {if}\; {\underline{b}}<a\le {\overline{b}}, \\ 1, &{} \quad \text {if}\; a\ge {\overline{b}}. \end{array} \right. \end{aligned}$$
(9)

Thus, the possibility degree in the concept of an interval inequality represents the degree to which one interval number is larger or smaller than another.

Notice that an interval number is similar to a uniform distribution in form. However, they have obvious difference as follows: The interval number can be viewed as a special fuzzy number whose membership function takes the value 1 over the interval and 0 anywhere else. Correspondingly, a uniform distribution variable is special case of random variable which takes a constant value in an interval, and 0 anywhere else. In financial decision-making, an interval number can be regarded as an alternative for a uniform distribution, in which the un-quantifiable factors such as experts’ knowledge and investors’ subjective opinions can be easily reflected. What’s more, in contrast with the uniform distribution, we need less information to determine the interval number of a parameter than to determine the uniform distribution. It only requires us to determine the lower and upper bounds of the interval by using both historical data and human subjective judgement. Unlike the uniform distribution, it does not require us to determine any distribution function defined over the interval with equal probability.

3 Interval programming models for multi-period cardinality constrained portfolio selection

In this section, we discuss a multi-period portfolio selection problem in emerging markets, in which the expected proceeds, the variances of the return rates and the turnover rates on securities are characterized by interval numbers. We propose four multi-period cardinality constrained portfolio selection models with interval coefficients. In the proposed models, we use the expected proceeds of a portfolio as the investment return, the variance of the return rate on a portfolio as the investment risk and the turnover rate of a portfolio as the portfolio liquidity. Before formulating the proposed models, we first introduce an interval estimation approach to predict the aforementioned three factors of interval values.

3.1 Estimation of the interval-valued expected proceeds, variance and turnover rate

Since the future states of securities cannot be accurately predicted in emerging markets, in this paper, we use the interval estimation approach in Fang et al. (2008) to predict the interval values of the investment return, investment risk and turnover rate of a security. Here, we use the estimation of the investment return interval of the security as an example to introduce the estimation approach.

Traditionally, the arithmetic mean of the historical proceeds of a security is used as the approximate value of its expected proceeds. So the expected proceeds of the security is a crisp value in this way. However, using the traditional estimation approach, we need to solve the following two main problems. Firstly, if the observation period of the historical data of the security is long, it may lead to the result that the influence of both the earlier historical data and the recent data are the same. However, in the real world, the later historical data of the security most often indicate that the performance of a corporation is more important than that of the earlier historical data. Secondly, if the historical data of the security are not enough, one cannot accurately estimate the statistical parameters due to data scarcity.

To take the two problems above into consideration, it is a good idea to characterize the expected proceeds of a security as an interval number rather than a crisp value based on the arithmetic mean of historical data. Investors may make use of a corporation’s financial reports and the historical data of the security to determine its expected proceeds interval. To determine the expected proceeds interval of a security, let us consider the following three factors:

(1) Arithmetic mean proceeds factor Although the arithmetic mean of the proceeds of a security should not be represented as its expected proceeds directly, it is a good approximation. Generally, the arithmetic mean proceeds factor of security i at period t is calculated by its historical data as follows

$$\begin{aligned} r_{t,i}^{a}=\tfrac{1}{t-t_{0}}\sum \limits _{s = t_{0}}^{t-1}r_{s,i},\quad t-1\ge t_{0}, \end{aligned}$$

where \(t_{0}\) is the starting period and t the current period and \(r_{s,i}\) is the real proceeds of unit capital invested on security i at period s.

(2) Historical proceeds tendency factor The historical proceeds tendency is also an important index for predicting the future proceeds of a security. In the real world, if the recent proceeds of the security have been increasing, we believe that its expected proceeds will be greater than the arithmetic mean proceeds based on historical data. Conversely, if the recent proceeds of the security have been declining, we believe that its expected proceeds will be smaller than the arithmetic mean proceeds based on historical data. Denote the historical proceeds tendency factor of security i at period t as \(r_{t,i}^{h}\), which reflects the tendency of the proceeds on security i. The value of \(r_{t,i}^{h}\) can be regarded as the arithmetic mean proceeds of the recent historical data on security i. The observation period of recent historical data is subjectively determined by experts. In many cases, the value of \(r_{t,i}^{h}\) can be directly given by experts.

(3) Forecast proceeds factor: The third factor influencing the expected proceeds of a security is its estimated future proceeds. According to the financial reports of a corporation, if the proceeds on the corporation’s stocks will increase, then we believe that the expected proceeds of security i at period t is larger than its arithmetic mean proceeds factor \(r_{t,i}^{a}\). Contrarily, if the proceeds of the corporation’s stocks will decrease in future, then we believe that the expected proceeds of security i at period t is smaller than its arithmetic mean proceeds factor \(r_{t,i}^{a}\). Denote the forecast proceeds factor of security i at period t by \(r_{t,i}^{f}\). The value of \(r_{t,i}^{f}\) can be determined by some forecasts based on the financial reports and experts’ experiences.

Using the three factors above, we can determine lower and upper limits of the expected proceeds of security i at period t. We set the minimum and maximum values of the three factors as the lower and upper limits of its expected proceeds, denoted them by \({\underline{r}}_{t,i}=\min \{r_{t,i}^{a},r_{t,i}^{h},r_{t,i}^{f}\}\) and \({\overline{r}}_{t,i}=\max \{r_{t,i}^{a},r_{t,i}^{h},r_{t,i}^{f}\}\). Then, the expected proceeds of security i at period t can be estimated by \({\widetilde{r}}_{t,i}=[{\underline{r}}_{t,i},{\overline{r}}_{t,i}]\). Similarly, the turnover rate of security i at period t can be estimated by \({\widetilde{l}}_{t,i}=[\min \{l_{t,i}^{a},l_{t,i}^{h},l_{t,i}^{f}\},\max \{l_{t,i}^{a},l_{t,i}^{h},l_{t,i}^{f}\}]\) and the covariance of the return rates of securities i and j can be estimated by \({\widetilde{\delta }}_{t,ij}=[\min \{{\delta }_{t,ij}^{a},{\delta }_{t,ij}^{h},{\delta }_{t,ij}^{f}\},\max \{{\delta }_{t,ij}^{a},{\delta }_{t,ij}^{h},{\delta }_{t,ij}^{f}\}]\). Here, \(l_{t,i}^{a},l_{t,i}^{h}\) and \(l_{t,i}^{f}\) denote the arithmetic mean, historical tendency and future forecast factors of the turnover rate of security i at period t, respectively; \({\delta }_{t,ij}^{a},{\delta }_{t,ij}^{h}\) and \({\delta }_{t,ij}^{f}\) denote the arithmetic mean, historical tendency and future forecast covariance factors of securities i and j at period t, respectively. For convenience, we denote \({\widetilde{l}}_{t,i}=[{\underline{l}}_{t,i},{\overline{l}}_{t,i}]\) and \({\widetilde{\delta }}_{t,ij}=[{\underline{\delta }}_{t,ij},{\overline{\delta }}_{t,ij}]\) in the following sections.

3.2 Assumptions and notations

Suppose that an investor with initial wealth \(W_{0}\) selects n securities from emerging markets for constructing T consecutive time periods investment. At the beginning of each of the following \(T-1\) periods, the investor can reallocate his wealth. Owing to lack of historical data, the proceeds, the turnover rates and the variances of the return rates on the n securities in the T investment periods are assumed to be interval numbers. To aid the description, we introduce the following notations:

\({\widetilde{r}}_{t,i}\) :

the expected proceeds of per unit capital invested on security i at period t, where \({\widetilde{r}}_{t,i}=[{\underline{r}}_{t,i},{\overline{r}}_{t,i}]\);

\({\widetilde{\delta }}_{t,ij}\) :

the covariance of the return rates on securities i and j at period t, where \({\widetilde{\delta }}_{t,ij}=[{\underline{\delta }}_{t,ij},{\overline{\delta }}_{t,ij}]\);

\(c_{t,i}\) :

the transaction cost rate of security i at period t, where \(c_{t,i}\) is a real number;

\(x_{t,i}\) :

the investment proportion of security i at period t;

\(x_{t}\) :

the portfolio at period t, where \(x_{t} =(x_{t,1},x_{t,2},\ldots ,x_{t,n})\);

\({\widetilde{\delta }}_{t}\) :

the variance of the return rate on the portfolio at period t;

\({\widetilde{l}}_{t,i}\) :

the turnover rate of security \(i\,(i=1,2,\ldots ,n)\), where \({\widetilde{l}}_{t,i}=[{\underline{l}}_{t,i},{\overline{l}}_{t,i}]\);

\(W_{t}\) :

the available wealth at the end of period t for all \(t = 1,2,\ldots ,T\).

3.3 Decision objective and investment constraints

With the same consideration as Markowitz (1987), we assume that the transaction costs at period t is a V-shaped function of the differences between the given portfolio at period \(t-1\) and the new portfolio at period t. Hence, the transaction costs of the portfolio \(x_{t} =(x_{t,1},x_{t,2},\ldots ,x_{t,n})\) at period t can be expressed as

$$\begin{aligned} C_{t}=\sum \limits _{i = 1}^n c_{t,i} \left| {{x_{t,i}} - {x_{t - 1,i}}} \right| ,\quad t = 1,2,\ldots ,T. \end{aligned}$$
(10)

Using the interval estimation approach in Sect. 3.1, the expected proceeds of per unit capital invested on security i at period t can be given by

$$\begin{aligned} {\widetilde{r}}_{t,i}=[{\underline{r}}_{t,i},{\overline{r}}_{t,i}]=\left[ \min \left\{ r_{t,i}^{a},r_{t,i}^{h},r_{t,i}^{f}\right\} ,\max \left\{ r_{t,i}^{a},r_{t,i}^{h},r_{t,i}^{f}\right\} \right] . \end{aligned}$$
(11)

Considering the fact that short selling may increase investment risk and incur transaction costs, as is usual in most of the existing literature, we assume that short selling is not allowed over the whole investment horizon, that is, \(x_{t,i}\ge 0\;(i = 1,2,\ldots ,n; t = 1,2,\ldots ,T)\). From Eq. (7), the expected proceeds of per unit capital invested on the portfolio at period t can be represented as the following interval number

$$\begin{aligned} {\widetilde{R}}_{t}&=\sum \limits _{i = 1}^n x_{t,i}{\widetilde{r}}_{t,i}=\left[ \sum \limits _{i = 1}^n x_{t,i}{\underline{r}}_{t,i},\sum \limits _{i = 1}^n x_{t,i}{\overline{r}}_{t,i}\right] . \end{aligned}$$
(12)

Derived from Eqs. (5), (10) and (12), the expected net proceeds of per unit capital invested on the portfolio at period t is

$$\begin{aligned} {\widetilde{R}}_{N,t}&=\sum \limits _{i = 1}^n x_{t,i}{\widetilde{r}}_{t,i}-C_{t}=\left[ \sum \limits _{i = 1}^n x_{t,i}{\underline{r}}_{t,i}-C_{t},\sum \limits _{i = 1}^n x_{t,i}{\overline{r}}_{t,i}-C_{t}\right] . \end{aligned}$$
(13)

According to Eqs. (6) and (13), the wealth obtained at the end of period t is

$$\begin{aligned} W_{t}=W_{t-1} {\widetilde{R}}_{N,t}=W_{t-1}\left[ \left( \sum \limits _{i = 1}^n x_{t,i}{\underline{r}}_{t,i}-C_{t}\right) , \left( \sum \limits _{i = 1}^n x_{t,i}{\overline{r}}_{t,i}-C_{t}\right) \right] . \end{aligned}$$
(14)

It follows from Eqs. (6) and (14) that the terminal wealth obtained at the end of period T is

$$\begin{aligned} W_{T}=W_{0}\prod \limits _{t = 1}^T {\widetilde{R}}_{N,t}=W_{0}\left[ \prod \limits _{t = 1}^T \left( \sum \limits _{i = 1}^n x_{t,i}{\underline{r}}_{t,i}-C_{t}\right) ,\prod \limits _{t = 1}^T\left( \sum \limits _{i = 1}^n x_{t,i}{\overline{r}}_{t,i}-C_{t}\right) \right] . \end{aligned}$$
(15)

Then, by Eqs. (2) and (15), the crisp value of the terminal wealth \(W_{T}\) is

$$\begin{aligned} m(W_{T})=\frac{1}{2}W_{0}\left[ \prod \limits _{t = 1}^T\left( \sum \limits _{i = 1}^n x_{t,i}{\underline{r}}_{t,i}-C_{t}\right) +\prod \limits _{t = 1}^T\left( \sum \limits _{i = 1}^n x_{t,i}{\overline{r}}_{t,i}-C_{t}\right) \right] . \end{aligned}$$
(16)

According to the assumption above, the covariance of the return rates on securities i and j (\(i,j=1,2,\ldots ,n\)) at each period are characterized by interval numbers. Similar to the estimation of the expected proceeds above, the arithmetic mean covariance factor of securities i and j at period t can be computed by \({\delta }_{t,ij}^{a}=\tfrac{1}{t-t_{0}}\sum \nolimits _{s = t_{0}}^{t-1}\delta _{s,ij}\). Assume that the historical covariance tendency factor \({\delta }_{t,ij}^{h}\) and the forecast covariance factor \({\delta }_{t,ij}^{f}\) are directly given by experts. Then, the covariance of the return rates of securities i and j can be expressed by

$$\begin{aligned} {\widetilde{\delta }}_{t,ij}=[{\underline{\delta }}_{t,ij},{\overline{\delta }}_{t,ij}]=\left[ \min \left\{ {\delta }_{t,ij}^{a},{\delta }_{t,ij}^{h},{\delta }_{t,ij}^{f}\right\} ,\max \left\{ {\delta }_{t,ij}^{a},{\delta }_{t,ij}^{h},{\delta }_{t,ij}^{f}\right\} \right] . \end{aligned}$$

By Eqs. (3) and (4), the variance of the return rate on the portfolio at period t is

$$\begin{aligned} {\widetilde{\delta }}_{t}=\sum \limits _{i = 1}^n\sum \limits _{j = 1}^n x_{t,i} x_{t,j}{\widetilde{\delta }}_{t,ij}=\left[ \sum \limits _{i = 1}^n\sum \limits _{j = 1}^n x_{t,i} x_{t,j}{\underline{\delta }}_{t,ij},\sum \limits _{i = 1}^n\sum \limits _{j = 1}^n x_{t,i} x_{t,j}{\overline{\delta }}_{t,ij}\right] . \end{aligned}$$
(17)

Liquidity is another one of the main concerns for portfolio managers. It measures the degree of probability of being able to convert an investment into cash without any significant loss in value. The liquidity of a security is commonly measured by its turnover rate, which is the proportion of turnover volume to the tradable volume. Generally, investors prefer greater liquidity. Especially, in a bull market, the returns of securities with high liquidity tend to increase with time. According to the assumption above, the turnover rates of securities at each period are characterized by interval numbers. By using the estimation approach of Sect. 3.1, the arithmetic mean turnover rate factor of security i at period t can be estimated by \(l_{t,i}^{a}=\tfrac{1}{t-t_{0}}\sum \nolimits _{s = t_{0}}^{t-1}l_{s,i}\). Assume that the historical turnover rate tendency factor \(l_{t,i}^{h}\) and the forecast turnover rate factor \(l_{t,i}^{f}\) are directly given by experts. Then, the turnover rate of security i at period t can be given as an interval number with the following form

$$\begin{aligned} {\widetilde{l}}_{t,i}=[{\underline{r}}_{t,i},{\overline{r}}_{t,i}]=\left[ \min \left\{ r_{t,i}^{a},r_{t,i}^{h},r_{t,i}^{f}\right\} ,\max \left\{ r_{t,i}^{a},r_{t,i}^{h},r_{t,i}^{f}\right\} \right] . \end{aligned}$$

Then, derived from Eq. (7), the turnover rate of the portfolio at period t can be represented by

$$\begin{aligned} {\widetilde{l}}_{t}&=\sum \limits _{i = 1}^n x_{t,i}{\widetilde{l}}_{t,i}=\left[ \sum \limits _{i = 1}^n x_{t,i}{\underline{l}}_{t,i},\sum \limits _{i = 1}^n x_{t,i}{\overline{l}}_{t,i}\right] . \end{aligned}$$
(18)

To construct a diversified portfolio, investors usually wish to control the maximum and minimum fractions of the capital invested on each security. Mathematically, the lower and upper bound constraints on \(x_{t,i}\) can be expressed as

$$\begin{aligned} m_{t,i}z_{t,i}\le x_{t,i} \le M_{t,i}z_{t,i},\quad i = 1,2,\ldots ,n; t = 1, 2, \ldots ,T, \end{aligned}$$

where \(m_{t,i}\) and \(M_{t,i}\) are the maximum and minimum fractions of the capital invested on security i at period t; \(z_{t,i}\) is a binary variable which will be 1 if security i at period t is held and 0 otherwise.

There is a widely accepted policy for reducing the investment risk of a portfolio by not allocating the whole investment in just a few securities. This idea is reflected in a well-known saying that “don’t put all your eggs in one basket”. However, how to construct a well diversified portfolio has become a problem to researchers. So far, some researchers have investigated the problem and used proportion entropy to measure the diversification degree of a portfolio, see for example Fang et al. (1997). Similarly, we also use proportion entropy to measure the diversification degree of the portfolio at each period. Mathematically, the diversification degree of the portfolio \(x_{t} =(x_{t,1},x_{t,2},\ldots ,x_{t,n})\) at period t can be represented by

$$\begin{aligned} h(x_{t})=-\sum \limits _{i = 1}^n x_{t,i}\text {ln} ( x_{t,i}),\quad t = 1, 2, \ldots ,T. \end{aligned}$$
(19)

In the real world, to control transaction costs and reduce the complexity of a portfolio, investor may wish to restrict the maximum number of securities in the portfolio at each period. In this paper, we assume that the investor intends to hold not more than K securities in the portfolio at period t. Then, the cardinality constraints about the portfolio at period t can be expressed by

$$\begin{aligned} \sum \limits _{i = 1}^n z_{t,i}\le K,\quad t = 1,2, \ldots ,T, \end{aligned}$$

where \(z_{t,i}\in \{0,1\}\) is a binary variable that controls whether security i at period t should be selected in the portfolio or not.

3.4 Model formulation

Based on the discussion in the previous section, we investigate a multi-period portfolio problem in emerging markets by taking several criteria into consideration including transaction costs, expected proceeds, variance, turnover rate, diversification degree, bound constraints and cardinality constraints. To demonstrate the effects of these decision criteria on portfolio selection and provide investors with more choices, we propose four multi-period cardinality constrained portfolio optimization models with interval coefficients, viz., the return–risk (RR) model, the return–risk–liquidity (RRL) model, the return–risk–entropy (RRE) model and the return–risk–liquidity–entropy (RRLE) model.

(1) In the RR model, we assume that an investor intends to seek an investment strategy with the objective of maximizing terminal wealth. Meanwhile, as shown in Constraints (20)–(24), the investor requires that the expected proceeds of the portfolio on unit capital investment at period t must achieve or exceed the preset minimum proceeds level \(R_{t}\), the variance of the return rate on the portfolio at period t must not exceed the preset maximum tolerable variance level \(\text {Var}_{t}\), the maximum number of securities in the portfolio at period t must be not more than K, the investment proportion of at period t must sum to one and the investment proportion on each selected security at period t must satisfy the lower and upper bound constraints. Then, the RR model for the multi-period portfolio selection in emerging markets can be formulated as the following interval programming problem \((P_{1})\):

$$\begin{aligned} {(P_{1})} {\left\{ \begin{array}{ll} \max \quad W_{T}=\left[ W_{0}\prod \limits _{t = 1}^T\left( \sum \limits _{i = 1}^n x_{t,i}{\underline{r}}_{t,i}-C_{t}\right) ,W_{0}\prod \limits _{t = 1}^T\left( \sum \limits _{i = 1}^n x_{t,i}{\overline{r}}_{t,i}-C_{t}\right) \right] \\ \text {s.t.} \quad \; \left[ \sum \limits _{i = 1}^n x_{t,i}{\underline{r}}_{t,i}-C_{t},\sum \limits _{i = 1}^n x_{t,i}{\overline{r}}_{t,i}-C_{t}\right] \succeq R_{t}, &{}(20) \\ \quad \qquad \left[ \sum \limits _{i = 1}^n\sum \limits _{k = 1}^n x_{t,i} x_{t,k}{\underline{\delta }}_{i,k}^{t},\sum \limits _{i = 1}^n\sum \limits _{k = 1}^n x_{t,i} x_{t,k}{\overline{\delta }}_{i,k}^{t}\right] \preceq \text {Var}_{t}, &{}(21) \\ \quad \qquad \sum \limits _{i = 1}^n z_{t,i}\le K,\;z_{t,i}\in \{0,1\},&{}(22) \\ \quad \qquad \sum \limits _{i = 1}^n {{x_{t,i}} = 1}, \; x_{t,i}\ge 0,&{}(23) \\ \quad \qquad m_{t,i}z_{t,i}\le x_{t,i} \le M_{t,i}z_{t,i}, \,i = 1, 2,\ldots ,n; t = 1, 2,\ldots ,T. &{}(24) \end{array}\right. } \end{aligned}$$

From Constraint (23), we have \(x_{t,i}\ge 0\) for all \(i = 1, 2,\ldots ,n\) and \(t = 1, 2,\ldots ,T\). It indicates that short selling is not allowed over the whole investment horizon.

(2) In the RRL model, we assume that the investor not only considers the criteria in the model \((P_{1})\) but also takes the effect of the turnover rate of the portfolio at each period into consideration. As shown in Constraint (25), he demands that the turnover rate of the portfolio at period t must be larger than or equal to the given turnover rate level \(l_{t}\). Then, on the basis of \((P_{1})\), the RRL model for the multi-period portfolio selection in emerging markets can be formulated as the following interval programming problem \((P_{2})\):

$$\begin{aligned} {(P_{2})} {\left\{ \begin{array}{ll} \max &{} W_{T}=\left[ W_{0}\prod \limits _{t = 1}^T\left( \sum \limits _{i = 1}^n x_{t,i}{\underline{r}}_{t,i}-C_{t}\right) ,W_{0}\prod \limits _{t = 1}^T \left( \sum \limits _{i = 1}^n x_{t,i}{\overline{r}}_{t,i}-C_{t}\right) \right] \\ \text {s.t.} &{} \text {Constraints (20)--(24)},\\ &{} \left[ \sum \limits _{i = 1}^n x_{t,i}{\underline{l}}_{t,i},\sum \limits _{i = 1}^n x_{t,i}{\overline{l}}_{t,i}\right] \succeq l_{t}. \end{array}\right. } \end{aligned}$$
(25)

(3) In the RRE model, we assume that the investor considers all the criteria in the RR model \((P_{1})\). Meanwhile, he requires that the diversification degree of the portfolio at period t must be no less than the preset minimum diversification level \( e_t\) as shown in Constraint (26). Then, on the basis of \((P_{1})\), the RRE model can be formulated as the following interval programming problem \((P_{3})\):

$$\begin{aligned} {(P_{3})} {\left\{ \begin{array}{ll} \max &{} _{T}= \left[ W_{0}\prod \limits _{t = 1}^T \left( \sum \limits _{i = 1}^n x_{t,i}{\underline{r}}_{t,i}-C_{t}\right) , W_{0}\prod \limits _{t = 1}^T\left( \sum \limits _{i = 1}^n x_{t,i}{\overline{r}}_{t,i}-C_{t}\right) \right] \\ \text {s.t.} &{}\text {Constraints (20)--(24)},\\ &{} -\sum \limits _{i = 1}^n x_{t,i}\text {ln} ( x_{t,i})\ge e_t. \end{array}\right. } \end{aligned}$$
(26)

(4) In the RRLE model, we assume that the investor takes all the decision criteria in the aforementioned three models into consideration. Then, the RRLE model can be formulated as the following interval programming problem \((P_{4})\):

$$\begin{aligned} {(P_{4})} {\left\{ \begin{array}{ll} \max &{} W_{T}=\left[ W_{0}\prod \limits _{t = 1}^T \left( \sum \limits _{i = 1}^n x_{t,i}{\underline{r}}_{t,i}-C_{t}\right) , W_{0}\prod \limits _{t = 1}^T\left( \sum \limits _{i = 1}^n x_{t,i}{\overline{r}}_{t,i}-C_{t}\right) \right] \\ \text {s.t.} &{}\text {Constraints (20)--(26)}. \end{array}\right. } \end{aligned}$$

3.5 Deterministic multi-period portfolio optimization models

Notice that the proposed four models in previous section are interval programming problems with interval coefficients in both objectives and inequality constraints. To solve the proposed models, it is necessary to transform them into deterministic programming models. In this section, we first transform the proposed models into parameter programming problems with interval coefficients in inequality constraints. Then, based on the definitions of the possibility degree for the interval inequalities, we can convert the interval inequality constraints in the transformed models into deterministic inequality constraints.

3.5.1 Treatment of the uncertain objective functions

To handle the uncertain objective functions in the proposed models, we apply a commonly-used variable transformation method to transform them into the corresponding parameter programming problems with interval coefficients in constraints. Here, we take the RR model \((P_{1})\) as an example to introduce the treatment of its uncertain objective function. By using the variable transformation method, the RR model \((P_{1})\) can be equivalently converted into the following parameter programming model \((P_{1}^{'})\):

$$\begin{aligned} {(P_{1}^{'})} {\left\{ \begin{array}{ll} \max &{} f\\ \text {s.t.} &{}\text {Constraints (20)--(24)},\\ &{}\left[ W_{0}\prod \limits _{t = 1}^T \left( \sum \limits _{i = 1}^n x_{t,i}{\underline{r}}_{t,i}-C_{t}\right) , W_{0}\prod \limits _{t = 1}^T \left( \sum \limits _{i = 1}^n x_{t,i}{\overline{r}}_{t,i}-C_{t}\right) \right] \succeq f. \end{array}\right. } \end{aligned}$$
(27)

Similarly, models \((P_{2})\), \((P_{3})\) and \((P_{4})\) can be converted into the corresponding parameter programming models. Denote them by \((P_{2}^{'})\), \((P_{3}^{'})\) and \((P_{4}^{'})\), respectively.

3.5.2 Treatment of the uncertain inequality constraints

In stochastic optimization problems (see Liu et al. (2003)), we often make an uncertain inequality constraint satisfied with a confidence level and transform it into deterministic inequality constraint. Analogously, in fuzzy optimization problems, we can make an interval inequality constraint satisfied with a possibility degree level and convert it into deterministic inequality constraint.

In this paper, for the uncertainty inequality constraints in Constraints (20), (21), (25) and (27), we use the definitions of the possibility degree for interval inequalities in Eqs. (8) and (9) to transform them into the corresponding deterministic forms. Let \(\theta _{{\widetilde{R}}_{t}}\), \(\theta _{{\widetilde{l}}_{t}}\) and \(\theta _{W_{T}}\) be the possibility degree levels of Constraints (20), (25) and (27), respectively. Then, by Eq. (8), the three interval inequality constraints can be, respectively, converted into deterministic inequality constraints as follows

$$\begin{aligned} \sum \limits _{i = 1}^n x_{t,i}{\overline{r}}_{t,i}-C_{t}-R_{t}&\ge \theta _{{\widetilde{R}}_{t}}\sum \limits _{i = 1}^n x_{t,i}({\overline{r}}_{t,i}-{\underline{r}}_{t,i}),\end{aligned}$$
(28)
$$\begin{aligned} \sum \limits _{i = 1}^n x_{t,i}{\overline{l}}_{t,i}-l_{t}&\ge \theta _{{\widetilde{l}}_{t}}\sum \limits _{i = 1}^n( x_{t,i}{\overline{l}}_{t,i}-{\underline{l}}_{t,i}),\end{aligned}$$
(29)
$$\begin{aligned} W_{0}\prod \limits _{t = 1}^T\left( \sum \limits _{i = 1}^n x_{t,i}{\overline{r}}_{t,i}{-}C_{t}\right) {-}f&\ge \theta _{W_{T}}W_{0} \left[ \prod \limits _{t = 1}^T \left( \sum \limits _{i = 1}^n x_{t,i}{\overline{r}}_{t,i}{-}C_{t}\right) {-}\prod \limits _{t = 1}^T\left( \sum \limits _{i = 1}^n x_{t,i}{\underline{r}}_{t,i}{-}C_{t}\right) \right] . \end{aligned}$$
(30)

For the uncertainty inequality constraint in Constraint (21), we denote its possibility degree level by \(\theta _{{\widetilde{\delta }}_{t}}\). Then, by Eq. (9), it can be transformed into the following deterministic inequality constraint

$$\begin{aligned} \text {Var}_{t}-\sum \limits _{i = 1}^n\sum \limits _{k = 1}^n x_{t,i} x_{t,k}{\underline{\delta }}_{i,k}^{t}\ge \theta _{{\widetilde{\delta }}_{t}}\sum \limits _{i = 1}^n\sum \limits _{k = 1}^n x_{t,i} x_{t,k}({\overline{\delta }}_{i,k}^{t}-{\underline{\delta }}_{i,k}^{t}). \end{aligned}$$
(31)

Notice that, with the increasing of \(\theta _{{\widetilde{R}}_{t}}\), \(\theta _{{\widetilde{l}}_{t}}\), \(\theta _{W_{T}}\) and \(\theta _{{\widetilde{\delta }}_{t}}\), the investor will become more and more conservative to demand that Constraints (28)–(31) should be held. Thus, we can freely express investor’s different risk attitudes by varying the values of the possibility degree levels on the aforementioned uncertain inequality constraints.

Then, by Eqs. (28), (30) and (31), the model \((P_{1}^{'})\) can be transformed into the following deterministic parameter programming problem \((P_{1}^{''})\):

$$\begin{aligned} {(P_{1}^{''})} {\left\{ \begin{array}{ll} \max &{} f\\ s.t. &{}W_{0}\prod \limits _{t = 1}^T\left( \sum \limits _{i = 1}^n x_{t,i}{\overline{r}}_{t,i}{-}C_{t}\right) {-}f\ge \theta _{W_{T}}W_{0}\left[ \prod \limits _{t = 1}^T \left( \sum \limits _{i = 1}^n x_{t,i}{\overline{r}}_{t,i}{-}C_{t}\right) {-}\prod \limits _{t = 1}^T\left( \sum \limits _{i = 1}^n x_{t,i}{\underline{r}}_{t,i}{-}C_{t}\right) \right] ,\\ &{}\sum \limits _{i = 1}^n x_{t,i}{\overline{r}}_{t,i}{-}C_{t}-R_{t}\ge \theta _{{\widetilde{R}}_{t}}\sum \limits _{i = 1}^n x_{t,i}({\overline{r}}_{t,i}-{\underline{r}}_{t,i}),\\ &{}\text {Var}_{t}-\sum \limits _{i = 1}^n\sum \limits _{k = 1}^n x_{t,i} x_{t,k}{\underline{\delta }}_{i,k}^{t}\ge \theta _{{\widetilde{\delta }}_{t}}\sum \limits _{i = 1}^n\sum \limits _{k = 1}^n x_{t,i} x_{t,k}({\overline{\delta }}_{i,k}^{t}-{\underline{\delta }}_{i,k}^{t}),\\ &{} \sum \limits _{i = 1}^n {{x_{t,i}} = 1}, \; x_{t,i}\ge 0,\\ &{} \sum \limits _{i = 1}^n z_{t,i}\le K,\;\; z_{t,i}\in \{0,1\},\\ &{} m_{t,i}z_{t,i}\le x_{t,i} \le M_{t,i}z_{t,i},\;i = 1,2, \ldots ,n,t = 1,2,\ldots ,T. \end{array}\right. } \end{aligned}$$

By substituting Eqs. (28)–(31) into the models \((P_{2}^{'})\), \((P_{3}^{'})\) and \((P_{4}^{'})\), the corresponding deterministic parameter programming models can also be obtained. Denote them by \((P_{2}^{''})\), \((P_{3}^{''})\) and \((P_{4}^{''})\), respectively.

It is well known that portfolio selection is affected by many realistic investment constraints including cardinality constraints, bound constraints and so on. To highlight the effects of both the cardinality constraints and the bound constraints on portfolio selection, on the basis of the model \((P_{1}^{''})\), we formulate a contrast model without considering Constraints (22) and (24) as follows

$$\begin{aligned} {(P_{5})}{\left\{ \begin{array}{ll} \max &{} f\\ s.t. &{}W_{0}\prod \limits _{t = 1}^T\left( \sum \limits _{i = 1}^n x_{t,i}{\overline{r}}_{t,i}{-}C_{t}\right) {-}f\ge \theta _{W_{T}}W_{0}\left[ \prod \limits _{t = 1}^T \left( \sum \limits _{i = 1}^n x_{t,i}{\overline{r}}_{t,i}{-}C_{t}\right) {-}\prod \limits _{t = 1}^T\left( \sum \limits _{i = 1}^n x_{t,i}{\underline{r}}_{t,i}{-}C_{t}\right) \right] ,\\ &{}\sum \limits _{i = 1}^n x_{t,i}{\overline{r}}_{t,i}-C_{t}-R_{t}\ge \theta _{{\widetilde{R}}_{t}}\sum \limits _{i = 1}^n x_{t,i}({\overline{r}}_{t,i}-{\underline{r}}_{t,i}),\\ &{}\text {Var}_{t}-\sum \limits _{i = 1}^n\sum \limits _{k = 1}^n x_{t,i} x_{t,k}{\underline{\delta }}_{i,k}^{t}\ge \theta _{{\widetilde{\delta }}_{t}}\sum \limits _{i = 1}^n\sum \limits _{k = 1}^n x_{t,i} x_{t,k}({\overline{\delta }}_{i,k}^{t}-{\underline{\delta }}_{i,k}^{t}),\\ &{} \sum \limits _{i = 1}^n {{x_{t,i}} = 1},\; x_{t,i}\ge 0,\;t = 1,2,\ldots ,T. \end{array}\right. } \end{aligned}$$

4 Solution algorithm

Because Constraint (22) is a 0–1 integer constraint and the first derivatives of Constraints (28) and (30) are discontinuous, traditional optimization approaches usually fail to find the optimal solutions of the models mentioned in previous section. In this section, we design a new dynamic differential evolution algorithm with self-adapting control parameter to solve them.

Differential evolution (DE) is a stochastic population-based search method, which was originally proposed by Storn and Price (1995). Like other evolutionary algorithms (EAs), it encodes each decision variable by a real number and utilizes the three important operators (i.e., mutation, crossover and selection) to evolve from randomly generated initial population to final individual solution. The key idea of DE is a scheme for generating trial vectors. Mutation and crossover operators are used to generate trial vectors, and the selection operator determines which of the vectors will survive into the next generation (see Brest et al. (2006)).

DE is of simple concept, few control parameters, easy implementation and high convergence characteristics (see Storn and Price (1997)), and has attracted much attention and wide applications for solving the unconstrained optimization problems. But, DE lacks an explicit mechanism to guide the search process towards the feasible region, which limits its application. As we know, many real-world optimization problems are with constraints. The main challenge in solving constrained optimization is how to balance the search between feasible and infeasible regions effectively. Namely, how to design an efficient constraint-handling mechanism to locate the global optimum in the feasible region. Numerous researchers have concentrated on constraint-handling mechanisms for DEs and presented a series of approaches for handling constraints (see e.g., Mezura-Montes et al. (2010) and Wang and Li (2010)). To solve constrained optimization problems with mixed-type variables, Mohamed and Sabry (2012) designed a novel dynamic differential evolution algorithm. On the basis of Alia and Törn (2004) and Mohamed and Sabry (2012), we design a new dynamic differential evolution with self-adapting control parameter to solve the proposed models. Here, we introduce its fitness function, representation and coding, constraint-handling, mutation, crossover, selection and stopping criterion.

4.1 Fitness function

The fitness function is a factor for measuring the quality of a solution. In an optimization problem, fitness is often determined by the value of its objective function. Thus, for the maximization problems, the individual with higher fitness will have more chance to generate offspring. For example, in the maximization problem \((P_{1}^{''})\), we regard its objective function as the fitness function of the designed algorithm.

4.2 Representation and coding

In this algorithm, we use the hybrid representation to define a portfolio, in which two vectors are expressed as the following forms

$$\begin{aligned}&\triangle =\{z_{1,1},z_{1,2},\ldots ,z_{1,n};\ldots ;z_{T,1},z_{T,2},\ldots ,z_{T,n}\}, \end{aligned}$$
(32)
$$\begin{aligned}&X=\{x_{1,1},x_{1,2},\ldots ,x_{1,n};\ldots ;x_{T,1},x_{T,2},\ldots ,x_{T,n}\}. \end{aligned}$$
(33)

To simplify the description, we denote \(\triangle \) and X by \(X=(x_{1},x_{2},\ldots ,x_{T})\) with \(x_{t}=(x_{t,1},x_{t,2},\ldots ,x_{t,n})\) and \(\triangle =\{z_{1},z_{2},\ldots ,z_{T}\}\) with \(z_{t}=(z_{t,1},z_{t,2},\ldots ,z_{t,n})\) for all \(t=1,2,\ldots ,T\). Here, \(z_{t}\) is a binary vector that specifies whether a particular security participates in the portfolio at period t, where \(z_{t,i}\in \{0,1\}\,(i=1,2,\ldots ,n; t=1,2,\ldots ,T)\). \(x_{t}\) is a real-valued vector used to compute the investment proportions of the budget invested on the portfolio at period t, where \(x_{t,i}\in [0,1]\,(i=1,2,\ldots ,n; t=1,2,\ldots ,T)\).

4.3 Constraint-handling

Similar to Chang et al. (2000) and Mishra et al. (2014), we perform the following repair mechanism to find the portfolio at period t associated with \(z_{t}\) and \(x_{t}\). First, if the number of securities in the portfolio at period t (i.e., the number of 1’s in \(z_{t}\)) exceeds the maximum allowable number K, then we delete (by changing its value from 1 to 0 in \(z_{t}\)) those securities with the \((n-K)\)–smallest weights in \(x_{t}\). In this way, we can keep the portfolio at each period satisfy the cardinality constraints (i.e., Constraint (22)).

To meet the budget constraint (i.e., Constraint (23)), we perform the following normalization operation

$$\begin{aligned} x_{t,i}^{\prime }=\tfrac{ x_{t,i}z_{t,i}}{\sum \nolimits _{j = 1}^n x_{t,j}z_{t,j}},\quad i=1,2,\ldots ,n;t=1,2,\ldots ,T. \end{aligned}$$
(34)

Since the normalized investment proportion in Eq. (34) may not satisfy the bound constraints, we need to discuss the following three different cases.

Case 1: If both the lower and upper bound constraints are presented, then the adjusted investment proportions are computed by

$$\begin{aligned} x_{t,i}^{\prime }=y_{t,i}+\tfrac{x_{t,i}z_{t,i}}{\sum \nolimits _{j= 1}^n x_{t,j}z_{t,j}}\left( 1-\sum \limits _{j=1}^n y_{t,j}\right) ,\;\;\;\; t=1,2,\ldots ,T, \end{aligned}$$
(35)

where \(y_{t,j}=m_{t,j}z_{t,j}+M_{t,j}(1-z_{t,j})\) means that the investment proportion \(x_{t,i}\) cannot escape either the lower or upper bound.

Case 2: If the investment proportion has to be adjusted only for the lower bound constraint, and there is no restriction on the upper bound constraint, then the adjusted investment proportions are calculated by

$$\begin{aligned} x_{t,i}^{\prime }=m_{t,i}z_{t,i}+\tfrac{x_{t,i}z_{t,i}}{\sum \nolimits _{j= 1}^n x_{t,j}z_{t,j}}\left( 1-\sum \limits _{j=1}^n m_{t,j}z_{t,j}\right) ,\;\;\;\; t=1,2,\ldots ,T. \end{aligned}$$
(36)

Case 3: If the investment proportion has to be adjusted for the upper bound constraint, and there is no restriction on the lower bound constraint, then the adjusted investment proportions are computed by

$$\begin{aligned} x_{t,i}^{\prime }=M_{t,i}z_{t,i}-\tfrac{ x_{t,i}z_{t,i}}{\sum \nolimits _{j = 1}^n x_{t,j}z_{t,j}}\left( \sum \limits _{i = 1}^n M_{t,i}z_{t,i}-1\right) , t=1, 2, \ldots , T. \end{aligned}$$
(37)

For the remaining inequality constraints in the maximization problem, we rewrite them into the common forms \(g_{j}(x)\le 0\,(j=1, 2, \ldots ,q)\). Then, the violation value of the solution X is defined as

$$\begin{aligned} \text {Viol}(X)=\sum \limits _{j=1}^q \frac{g_{j}(x)}{ g_{\max ,j}(x)}, \end{aligned}$$
(38)

where \(g_{j}(x)\) is the jth constraint of the problem and \( g_{\max ,j}(x)\) is the largest violation of the constraint \(g_{j}(x)\) found so far.

According to the feasibility-based rule in Deb (2000), we compare each trial vector with its corresponding target vector in current population by taking both fitness values and violation values into consideration. We replace the target vector by the trial vector as an individual of next generation by the following criteria:

(i):

The trial vector is feasible and the target vector is infeasible;

(ii):

Both the trial vector and the target vector are feasible, but the fitness value of the trial vector is no less than the corresponding target vector;

(iii):

Both the trial vector and the target vector are infeasible, but the violation value of the trial vector is smaller the corresponding target vector.

After repeating this operation N times, we can obtain a population with N initialized individuals.

4.4 Mutation operation

Since the basic mutation strategy DE/rand/1/bin slows down the convergence of DE algorithms, we design a new mutation strategy with self-adapting control parameter to perform mutation operation as follows: For each target vector \(X_{i}^{G}\), a mutant vector V is generated as

$$\begin{aligned}&\text {if}\; (\text {rand} [0,1]\le 0.5)\end{aligned}$$
(39)
$$\begin{aligned}&\text {then}\; V_{i}^{G+1}=X_{r}^{G}+F(X_{b}^{G}-X_{w}^{G})\end{aligned}$$
(40)
$$\begin{aligned}&\text {else}\; V_{i}^{G+1}=X_{r_{1}}^{G}+F(X_{r_{2}}^{G}-X_{r_{3}}^{G}) \end{aligned}$$
(41)

where G is the current generation number; \(X_{r}^{G}\) is a randomly chosen vector at generation G; \(X_{b}^{G}\) and \(X_{w}^{G}\) are the best and the worst vectors in the entire population at current generation, respectively; \(\text {rand} [0,1]\) is a random number in [0,1]; \(r_{1}, r_{2}, r_{3}\in \{1,2,\ldots ,N\}\) are randomly chosen indices; F is a self-adapting control parameter with the following form (see Alia and Törn (2004))

$$\begin{aligned} F= \left\{ \begin{array}{ll} \max \left( F_{\mathrm {min}}, 1-\left| \frac{f_{\mathrm {max}}}{f_{\mathrm {min}}}\right| \right) , &{}\quad \text {if}\; |\frac{f_{\mathrm {max}}}{f_{\mathrm {min}}}|<1,\\ \max \left( F_{\mathrm {min}}, 1-\left| \frac{f_{\mathrm {min}}}{f_{\mathrm {max}}}\right| \right) , &{} \quad \text {otherwise}, \end{array} \right. \end{aligned}$$

where \(f_{\mathrm {min}}\) and \(f_{\mathrm {max}}\) are the minimum and maximum objective values of vector X at current generation, \(F_{\mathrm {min}}\) is the minimum value of F. In Alia and Törn (2004), the value of \(F_{\mathrm {min}}\) is set as 0.4.

4.5 Crossover operation

To balance between global exploration ability and local exploitation tendency in constrained optimization, a dynamic non-linear increased crossover probability scheme is performed as follows

$$\begin{aligned} u_{i,j}^{G+1}= \left\{ \begin{array}{ll} x_{i,j}^{G+1}, &{}\quad \text {if}\; \text {rand}(j)\le \text {CR} \,\; \text {or}\; j=\text {randn}(i),\\ v_{i,j}^{G}, &{} \quad \text {if}\; \text {rand}(j)> \text {CR}\, \text {and}\; j\ne \text {randn}(i), \end{array} \right. \end{aligned}$$
(42)

where \(j=1,2,\ldots ,nT\); \(\text {rand}(j)\) is the jth evaluation of a uniform random generator number; \(\text {randn}(i)\in \{1,2,\ldots ,nT\}\) is a randomly chosen index and it ensures that \(u_{i}^{G+1}\) gets at least one element from \(v_{i}^{G+1}\); the crossover probability \(\text {CR}\) is defined as

$$\begin{aligned} \text {CR}=\text {CR}_{\mathrm {max}}+(\text {CR}_{\mathrm {min}}-\text {CR}_{\mathrm {max}}) \left( 1-\tfrac{G}{G_{\mathrm {max}}}\right) ^{p}. \end{aligned}$$

Here, \(G_{\mathrm {max}}\) is the maximum generation number; \(\text {CR}_{\mathrm {min}}\) and \(\text {CR}_{\mathrm {max}}\) are the minimum and maximum values of \(\text {CR}\), respectively; p is a positive number. In this paper, the values of \(\text {CR}_{\mathrm {min}}\), \(\text {CR}_{\mathrm {max}}\) and p are set as 0.5, 0.95 and 4, respectively.

4.6 Selection operation

In this algorithm, the selection operation is based on a greedy selection strategy. Namely, if the trial vector \(u_{i}^{G+1}\) yields a better fitness function value than \(X_{i}^{G}\), then we set \(u_{i}^{G+1}\) as \(X_{i}^{G}\). Otherwise, we retain \(X_{i}^{G}\). Thus, the selection operation can be expressed as

$$\begin{aligned} X_{i}^{G+1}= \left\{ \begin{array}{ll} u_{i}^{G+1}, &{}\quad \text {if}\; f(u_{i}^{G+1}) >f(X_{i}^{G+1}),\\ X_{i}^{G}, &{} \quad \text {if}\; f(u_{i}^{G+1})\le f(X_{i}^{G+1}). \end{array} \right. \end{aligned}$$
(43)

4.7 Stopping criterion

We terminate the algorithm when the maximum number of generations \(G_{\mathrm {max}}\) is reached or there is no improvement after a certain number of generations.

The concrete procedures of the designed algorithm are summarized as follows:

Step 1: Input population size N, crossover rate parameters (\(\text {CR}_{\mathrm {min}}\), \(\text {CR}_{\mathrm {max}}\) and p) and maximum generation number \(G_{\mathrm {max}}\);

Step 2: Randomly generate N initial individuals and convert them into corresponding feasible ones by using the constraint-handing mechanisms;

Step 3: Calculate the fitness value of each individual;

Step 4: Perform mutation operation by Eqs. (39)–(41) and crossover operation by Eq. (42);

Step 5: Compare the fitness values of the offspring \(u_{i}^{G+1}\) and its parent \(X_{i}^{G}\), and then select out the better one as an individual for next generation by Eq. (43);

Step 6: Check the stopping criterion. If the stopping criterion is satisfied, then quit the iteration operation and report the best individual as the optimal solution. Otherwise, go back to Step 3.

5 Numerical example

In this section, we give a numerical example to illustrate the applications of the proposed models and demonstrate the validity of the designed algorithm.

Assume that an investor chooses 8 stocks from the Shanghai Stock Exchange for his investment. He intends to make three consecutive periods investment among the 8 stocks with initial wealth 10,000 RMB. To simulate the real transaction, we collect the historical data about weekly return rates, variances and turnover rates of the 8 stocks from January 2010 to January 2013. We set each year as an investment period to handle these historical data. In this example, the values of \(r_{t,i}^{a}\), \(\delta _{t,i}^{a}\) and \(l_{t,i}^{a}\) are calculated by their arithmetic means of the historical data in the recent a year for \(i=1, 2, \ldots , 8\) and \(t=1,2,3\). Namely, the value of \(t_{0}\) is set as 1. The values of \(r_{t,i}^{h}\), \(\delta _{t,i}^{h}\) and \(l_{t,i}^{h}\) are subjectively given by experts. Here, we assume that they are calculated by their arithmetic means of the historical data in the first half of the recent a year. The values of \(r_{t,i}^{f}\), \(\delta _{t,i}^{f}\) and \(l_{t,i}^{f}\) are roughly computed by their arithmetic means of the historical data in the second half of the recent a year. According to the interval estimation approach in Fang et al. (2008), the proceeds obtained by per unit capital investment, the turnover rates and the covariances of the return rates on the 8 stocks at each period are characterized by interval numbers as shown in Tables 1 and 2.

Table 1 The interval-valued proceeds and turnover rates of the 8 stocks at each period
Table 2 The interval-valued covariances of the return rates on the 8 stocks at each period

In this example, the maximum holding number of stocks in the portfolio at each period is set as 6, that is, \(K=6\). The transaction costs of per unit capital adjustment are set as 0.003 for all stocks over the whole investment horizon. The minimum expected return level of the portfolio on unit capital investment at each period is set as 1.0475. The maximum risk tolerance levels for the portfolios in the three investment periods are set as 0.012, 0.015 and 0.018, respectively. The minimum expected turnover rate level of the portfolio at each period is set as 0.21. The diversification degree of the portfolio at each period is set as 1.6. The lower and upper bounds of the investment proportion of each selected stock over the whole investment horizon are set as 0.005 and 0.3, respectively. The parameters of the designed algorithm are set as follows: The population size is 200 and the maximum generation number is 1000. The values of \(\text {CR}_{\mathrm {min}}\) and \(\text {CR}_{\mathrm {max}}\) are set as 0.5 and 0.95. To demonstrate the performances of the proposed models, we assume that the possibility degree level for each interval inequality constraint is 0.8. After running the designed algorithm 1000 iterations on each model, we record the corresponding investment strategies as shown in Table 3.

Table 3 Comparative results about the investment strategies obtained by different portfolio selection models

Table 3, representing the investment strategies obtained by different models, shows that different portfolio selection models reflect investors’ different investment intentions. From Table 3, we can find that most of the investor’s wealth is allocated among Stocks 1, 5, 6, 7 and 8. If the investor uses the model \((P_{1})\) to make his portfolio decision, he should follow the investment strategies listed in lines 2, 3 and 4 of Table 3 to adjust his wealth at the beginning of each period. Namely, at the beginning of period 1, the investor needs to assign his initial wealth among Stocks 1, 4, 5, 6 and 7 by the investment proportions of 0.2747, 0.0053, 0.2993, 0.2975 and 0.1231, respectively. At the beginning of period 2, the investor needs to adjust his wealth again. After adjustment, he holds Stocks 1, 4, 5, 6, 7 and 8 by the investment proportions of 0.2998, 0.0052, 0.2999, 0.0935, 0.0050 and 0.2965, respectively. Subsequently, at the beginning of period 3, the investor needs to adjust his wealth again. In this investment period, he constructs a portfolio among Stocks 1, 2, 5, 6, 7 and 8 by the investment proportions of 0.2988, 0.0092, 0.1830, 0.2109, 0.0064 and 0.2917, respectively. In this case, the crisp value of terminal wealth is 11,945.76 RMB. If the investor uses the model \((P_{2})\) to make his portfolio decision. He should allocate his wealth by the investment strategies listed in lines 5, 6 and 7 of Table 3 at the beginning of each period. The corresponding crisp value of terminal wealth is 11,879.92 RMB. If the investor uses the model \((P_{3})\) to make his portfolio decision, he should follow the investment strategies listed in lines 8, 9 and 10 of Table 3 to allocate his wealth at the beginning of each period. In this case, the crisp value of terminal wealth is 11,909.30 RMB. If the investor selects the model \((P_{4})\) to make his portfolio decision, he should follow the investment strategies listed in lines 11, 12 and 13 of Table 3. In this case, the crisp value of terminal wealth is 11,875.29 RMB. If the investor selects all stocks in equal investment proportions and hold them over the whole investment horizon, the corresponding crisp value of terminal wealth obtained at the end of period 3 is 11,616.94 RMB. After comparison, we can find out that the crisp value of the terminal wealth obtained by any one of the proposed models is larger than the one obtained by the equal-weighted investment. It follows that the portfolios generated by the proposed models perform better than the equal-weighted one. To highlight the effects of both the cardinality constraints and the bound constraints on portfolio selection, we also use the designed algorithm to solve the model \((P_{5})\) and record the corresponding numerical results in lines 14, 15 and 16 of Table 3. It can be observed that, in the sense of terminal wealth, the result of the model \((P_{1})\) is preferable to the result of the contrast model \((P_{5})\). Thus, we can conclude that both the cardinality constraints and the bound constraints do affect the optimal portfolio composition. In addition, we also calculate the expected proceeds obtained by per unit capital, the turnover rate and the variance of the return rate on the portfolio at each period obtained by different models as shown in Table 4.

Table 4 The expected proceeds \({\widetilde{R}}_{t}\), variance \({\widetilde{\delta }}_{t}\) and turnover rate \({\widetilde{l}}_{t}\) of the portfolio at period t obtained by different models
Table 5 The investment strategies obtained by the model \((P_{1})\) under different possibility degree levels (PDL)
Table 6 Comparative computational results obtained by different algorithms under different generations

To demonstrate the fact that the proposed models can express the investor’s different investment preference freely, we take the model \((P_{1})\) as an example and vary the possibility degree level on each interval inequality constraint from 0.60 to 0.80. Under each risk attitude case, a run of the designed algorithm with 1000 generations shows that the investor should assign his wealth according to the investment strategies listed in Table 5. From Table 5, we can find that different possibility degree levels on interval inequality constraints lead to different investment strategies. With the increasing of the possibility degree level, the investor becomes more conservative and the obtained terminal wealth becomes smaller, which is consistent with the real world investment case. If the investor is not satisfied with any one of these obtained portfolios, he can obtain more portfolios by varying the possibility degree levels on these interval inequality constraints. Thus, we can conclude that investors’ investment intentions can be freely expressed by the proposed models.

To illustrate the effectiveness of the designed algorithm, we also use the dynamic differential evolution (DDE) algorithm in Mohamed and Sabry (2012) to solve the proposed four models. The parameters of the DDE algorithm are set the same as the designed algorithm. We vary the number of generations from 1000 to 5000 to illustrate the stability of the designed algorithm and record the comparative computational results as shown in Table 6. From Table 6, we can find that the objective value of each model obtained by our algorithm is larger than the one obtained by the DDE algorithm. In addition, we also calculate the maximum deviation of objective for each model with respect to different generations. The maximum deviation generated by the designed algorithm for each model is smaller than the one generated by the DDE algorithm. Thus, we can conclude that the designed algorithm is more suitable for solving the proposed models.

6 Conclusions

Due to serious lack of historical information in emerging markets, the returns, risks and turnover rates of securities usually cannot be predicted accurately. In this paper, we assume that the investor can estimate their approximate ranges. Namely, they are characterized by interval numbers. We propose four multi-period cardinality constrained portfolio selection models with interval coefficients. We use the definitions of the possibility degree for interval inequalities to handle the uncertainty inequality constraints in the proposed models and express investors’ different risk attitudes. Then, the proposed models are transformed into the corresponding deterministic forms. After that, we design a new dynamic differential evolution algorithm with self-adapting control parameter to solve them. Finally, a numerical example with real data from the Shanghai Stock Exchange is given to illustrate the ideas of our models and the validity of the designed algorithm. Comparative analysis is also given to demonstrate the effects of real constraints on portfolio decision. Computational results indicate that the proposed models can freely express investors’ different investment preferences by adjusting the possibility degree levels on the uncertainty constraints and the designed algorithm is suitable for solving our models.