1 Introduction

As we know, stock returns exhibit nonlinear long memory behaviors, which correspond to the multifractal nature (Muzy et al., 2001; Calvet & Fisher, 2002; Jiang & Zhou, 2011). Two stylized facts in financial returns, including fat-tailed distribution and long range dependence, are considered as sources of multifractality (Zhou, 2009; Jiang & Zhou, 2008b; Zhou, 2012; Grahovac & Leonenko, 2014). Multifractal nature in returns makes the price dynamics deviate from the Brownian process, triggering the studies of applying multifractality on uncovering market efficiency (Wang & Wu, 2013; Liu et al., 2010), designing trading strategies (Dewandaru et al., 2015), constructing measures for predicting volatility (Wei & Wang, 2008; Chen & Wu, 2011; Wei et al., 2013; Chen et al., 2014), and to list a few. New theoretical models including multifractal random walk (MRW) (Bacry et al., 2001), the multifractal model of asset returns (MMAR) (Calvet & Fisher, 2002), and Markov-switching multifractal model (MSM) (Calvet & Fisher, 2001) have been proposed to replicate the multi-scaling price behaviors. In comparison of other competitive econometric models, these models can improve the performance of forecasting volatility (Calvet & Fisher, 2001, 2004; Duchon & Robert, 2012; Chuang et al., 2013; Lux et al., 2014; Nasr et al., 2016; Segnon et al., 2017; Wang et al., 2016), predicting financial duration (Chen et al., 2013; Žikeš et al., 2017), and estimating VaR (Batten et al., 2014; Lux & Kaizoji, 2007; Chuang et al., 2013; Lee et al., 2016; Lux et al., 2016; Herrera et al., 2017). The forecasting power of multifractality on volatility indicates that the multifractal characteristics in price dynamic could have a strong connection to the market risk. As high market risk is accompanied with high return, we could infer that strong multifractality in price dynamic will lead to high return. However, such inference is still lack of empirical evidence.

Return predictability has been received considerable research interests, because it can highlight the understanding of asset pricing in academy and improve the performance of stock investments in industry. However, the predictability of stock returns is still under controversy. On the one hand, Welch and Goyal (2008) perform both in-sample and out-of-sample tests on return predictability with the factors from earlier academic researches and find that those factors are unstable or even spurious. On the other hand, recent researches reveal that the factors, including unexpected changes of oil prices (Casassus & Higuera, 2012), cash flow volatility (Narayan & Westerlund, 2014), aligned technical indicator (Lin, 2018), curvature of the oil futures curve (Chiang & Hughen, 2017), price-to-fundamental ratios (Lawrenz & Zorn, 2017), and daily internet search volume index (SVI) (Chronopoulos et al., 2018), can significantly and economically predict the excess return. However, the predictors lost the forecasting ability when they are published (Mclean & Pontiff, 2016), supporting that the predictors usually exhibit time-varying predictability (Devpura et al., 2018).

Multifractality captures the nonlinear scaling behavior for all moments of stock returns and many empirical studies uncover that higher order moments of returns, like skewness and kurtosis (namely ex ante volatility), contain the information of future returns (Conrad et al., 2013). Chang et al. (2013) reveal that the market skewness risk premium is statistically and economically significant and cannot be explained by the common risk factors, such as market excess return, the size, book-to-market, momentum, market volatility factors, and firm characteristics. Furthermore, other forms of higher order moments, such as the higher order moments of conditional return distribution and the asymmetry of the cross-sectional distribution, are also good predictors of future returns (Perez-Quiros & Timmermann, 2001; Garcia et al., 2014). In emerging markets, the skewness is mostly positive and idiosyncratic, which can be applied to build an optimal portfolio to have sizeable certainty-equivalent gains (Ghysels et al., 2016). Hwang & Satchell (2001) found that the emerging markets can be better explained by the high-moment CAPMs, which consider the additional risks of skewness and kurtosis.

With the aim of testing whether multifractal characteristics can be used to predict future returns, this paper is organized as follows. Data and methods are given in Sect. 2. Section 3 presents the results of empirical multifractality. The in-sample and out-of-sample tests on return predictability based on multifractality are elaborated in Sect. 4. The robustness tests are given in Sect. 5. Section 6 explains the return predictability of multifractal characteristics. And Sect. 7 concludes.

2 Data and methods

2.1 Data sets

Our data, including the Shanghai Stock Exchange Composite 180 Index (SH180) and the Shenzhen Stock Exchange Composite Index (SZCI) in the Chinese stock markets, are retrieved from the finance database of Resset (http://www.resset.cn). Both indexes cover a period from February 14, 2003 to May 23, 2022 including 4583 trading days in total. By removing the days having recording errors, we left 4569 days for SH180 and 4579 days for SZCI, respectively. There are four trading hours (240 min) on each trading day. For each index, we have the price \(p_m\) at each minute on each trading day and thus we define the minutely return \(r_m(t)\) as,

$$\begin{aligned} r_m(t) = \ln p_m(t) - \ln p_m(t-1). \end{aligned}$$
(1)

We regard the last price on each trading day as the closing price \(p_c(d)\) on that day and the daily return \(r_d\) is defined in the following,

$$\begin{aligned} r_d(d) = \ln p_c(d) - \ln p_c(d-1). \end{aligned}$$
(2)

2.2 Multifractal detrended fluctuation analysis (MFDFA)

For a given window of minutely returns \(r_m(i), i = 1, \cdots , N\), we can define y(i) as follows,

$$\begin{aligned} y(i) = \sum _{u=1}^{i} r_m(i),~i = 1, 2, \cdots , N. \end{aligned}$$
(3)

The series y is covered by \(N_s\) disjoint boxes and each box has the same size s. For our convenience, we label the sub-series in each box as,

$$\begin{aligned} Y_k(i) = \{y(i) \mid (k-1)s+1 \le i \le ks\}, \end{aligned}$$
(4)

In some cases, the whole series y cannot be exactly covered by \(N_s\) boxes, which means that we have to neglect some data points at the end of the series. In order to avoid this situation, we can utilize \(2N_s\) boxes to cover the series, where \(N_s\) boxes cover from the beginning and \(N_s\) boxes cover from the end. In each box, the sub-series \(Y_k\) is regressed by a polynomial \(g_l(\cdot )\) of order l (in our work \(l = 1\)). The overall detrended fluctuation \(F_q(s)\) of the sub-series \(Y_k\) is defined via the sample variance of the fitting residuals as follows,

$$\begin{aligned} F_q(s) = \left\{ \frac{1}{2N_s}\sum _{k=1}^{2N_s}[F_k(s)]^q \right\} ^{1/q}. \end{aligned}$$
(5)

where q can take any real value except for \(q = 0\). While \(q = 0\), we have

$$\begin{aligned} F_0(s) = \exp \left\{ \frac{1}{2N_s}\sum _{k=1}^{2N_s}\ln [F_k(s)] \right\} . \end{aligned}$$
(6)

according to the L’Hôpital’s rule. By varying the value of s in the range from \(s_{\min } = 20\) to \(s_{\max } = N/4\), one can expect that the detrended fluctuation function \(F_q(s)\) scales with the size s, which reads

$$\begin{aligned} F_q(s) \sim s^{h(q)}, \end{aligned}$$
(7)

where h(q) is the generalized Hurst index. Note that while \(q = 2\), h(2) is nothing but Hurst index H. The scaling exponents \(\tau (q)\), which is used to reveal the multifractality in the standard multifractal formalism based on the partition function, can be obtained from the following traditional function for each q,

$$\begin{aligned} \tau (q) = qh(q) - D_f, \end{aligned}$$
(8)

where \(D_f\) is the fractal dimension of the geometric support of the multifractal measure (in our case \(D_f = 1\)). The local singularity exponent \(\alpha \) of the measure \(\mu \) and its spectrum \(f(\alpha )\) are related to \(\tau (q)\) through the Legendre transformation (Halsey et al., 1986),

$$\begin{aligned} \left\{ \begin{aligned}&\alpha = \mathrm{{d}}\tau (q)/\mathrm{{d}}q \\&f(\alpha )=q \alpha -\tau (q) \end{aligned} \right. ~. \end{aligned}$$
(9)

Taking into account the statistical significance of the estimation of overall fluctuation functions, we focus on \(q \in [-4, 8]\).

We further employ three parameters (\(\Delta {\alpha }\), \(\Delta f\), and B) to capture the characteristics of the multifractal spectrum. Parameter \(\Delta \alpha \) stands for the width of multifractal spectrum, defined as \(\Delta {\alpha } = \alpha _{\max } - \alpha _{\min }\). \(\Delta {\alpha }\) quantitatively describes the dispersion of singularity exponents \(\alpha \), and measures the degree of heterogeneity for the probability measure of subsets on the overall fractal structure (Zhou, 2007). In practice, \(\Delta {\alpha }\) is widely used to gauge the degree of multifractality (Jiang & Zhou, 2008a, b; Zhou, 2009). The larger the value of \(\Delta {\alpha }\) is, the stronger the multifractal nature is. Parameter \(\Delta f\) is estimated via \(\Delta f = f(\alpha _{\min }) - f(\alpha _{\max })\), depicting the difference between the proportion of the subset with the minimum probability measure and that with the maximum probability measure. Thus, more measures at the peak leads to \(\Delta f < 0\) and more measures at the trough gives \(\Delta f > 0\). Parameter B is obtained by fitting the \(f(\alpha ) \sim \alpha \) curve to the following quadratic function \(f(\alpha ) = A(\alpha - \alpha _0)^2 + B(\alpha - \alpha _0) + C\) (Shimizu et al., 2002; Munõz-Diosdado & Río-Correa , 2006). B captures the asymmetry of the spectrum. The spectrum curve are symmetric when \(B = 0\). \(B < 0\) means that the spectrum curve is right-hooked, indicating that the data set is dominated by the subsets with large probability measures. In contrast, \(B > 0\) means that the spectrum exhibits a left-hooked pattern, suggesting that the subsets with small probability measures take the leading role in the data set.

The shape of the multifractal spectrum and the definition of \(\Delta f\) and B underlie a positive correlation between \(\Delta f\) and B. When \(B > 0\) (respectively, \(B < 0\)), the multifractal spectrum is left-hooked (respectively, right-hooked), which means \(f(\alpha _{\min }) > f(\alpha _{\max })\) (respectively, \(f(\alpha _{\min }) < f(\alpha _{\max })\)), and then we have \(\Delta f > 0\) (respectively, \(\Delta f < 0\)). When B approaches 0, the multifractal spectrum gradually becomes symmetric, which leads to \(f(\alpha _{\min }) \approx f(\alpha _{\max })\) and we will obtain \(\Delta f \approx 0\). However, \(\Delta f\) and B have very different physical meaning. \(\Delta f\) measures the proportion difference between the minimum measure and the maximum measure, while B indicates which type of measures plays a leading role in the data sets, large measures or small measures.

3 Empirical multifractal characteristics

Using a moving window with a size of 5 days, we perform the multifractal analysis on the returns in each window by means of the MFDFA method. To have an impression that the multifractal spectrum is able to quantitatively capture the market dynamics, we present the results of multifractal analysis on three typical price trajectories, corresponding to the declining, rising, and sideways trend, in three windows for SH 180. Window 1 covers a period from 7 April, 2004 to 13 April, 2004. As shown in Fig. 1 a, b, and c, in window 1, we observe that the price exhibits an decreasing pattern and contains many small returns, which leads to a left-hooked multifractal spectrum. Window 2 spans a period from 16 January, 2004 to 3 February, 2004 and in that window the price has an increasing pattern and the return is dominated by large values, which results in a right-hooked multifractal spectrum, as illustrated in Fig. 1 d, e, and f. Window 3 is from 6 December, 2004 to 10 December, 2004 and the corresponding results are plotted in Fig. 1 g, h, and i. Due to the fact that the price is in a sideways trend and the fraction of large and small returns are almost equal, one can observe a symmetric multifractal spectrum.

Fig. 1
figure 1

Results of multifractal analysis on SH180 in three windows, in which the prices exhibit a declining, rising, and sideways trend, respectively. ac Results of window 1, spanning from 7 April 2004 to 13 April 2004. df Results of window 2, spanning from 16 January 2004 to 3 February 2004. gi Results of window 3, spanning from 6 December 2004 to 10 December 2004. a, d, g Plots of the price trajectories. b, e, h Plots of returns. c, f, i Plots of the multifractal spectrum

We also estimate the three characteristic parameters \((\Delta {\alpha }, \Delta f, B)\) of multifractal spectrum in the three windows and obtain \(\Delta {\alpha } = 0.33\), \(\Delta f = 0.21\), and \(B = 0.20\) for window 1, \(\Delta {\alpha } = 0.20\), \(\Delta f = -0.13\), and \(B = -0.18\) for window 2, and \(\Delta {\alpha } = 0.11\), \(\Delta f = 0.01\), and \(B = -0.003\) for window 3, respectively. For the width of multifractal spectra, we have \(\Delta {\alpha }^\mathrm{{Window1}}> \Delta {\alpha }^\mathrm{{Window2}} > \Delta {\alpha }^\mathrm{{Window3}}\), which is in consistent with the observation that the price in window 1 is the most volatile and the price in window 3 is the least volatile. This indicates that the spectral width \(\Delta {\alpha }\) can be used to quantitatively capture the market volatility. Wei and Wang (2008) have proposed a volatility measure based on the width of multifractal spectrum and achieve better VaR measures comparing with the GARCH-type models (Wei et al., 2013). We also find that the values of \(\Delta f\) and B in these three windows are in consistent with the geometric features of their multifractal spectra.

By performing the multifractal analysis on the returns in each moving window, we will accumulate three series of multifractal characteristics (\(\Delta {\alpha }\), \(\Delta f\), and B). Table 1 lists the basic information of cumulative return (\(\sum r_m\)) and multifractal characteristics (\(\Delta {\alpha }\), \(\Delta f\), and B) for SH180 and SZCI. In Panel A, one can see that the skewness is positive and the kurtosis is much greater than 3 for \(\sum r_m\), \(\Delta {\alpha }\), \(\Delta f\), and B, indicating that the four variables exhibit a right-skewed and fat-tailed distribution. It is also observed that for \(\Delta {\alpha }\) and \(\Delta f\) the gap between the mean and the median are very small and for B the mean is less than the median, indicating that the extreme values in \(\Delta {\alpha }\), \(\Delta f\), and B have negligible effects on their mean values.

In Panel B of Table 1, one can find that there is no correlation between the daily returns \(r_d\) on the day exactly after the moving windows and three multifractal parameters (\(\Delta {\alpha }\), \(\Delta f\), and B), as their correlation coefficients are very close to 0 and none of them is significant. The cumulative returns in moving windows \(\sum r_m\) are found to be negatively and weakly correlated with \(\Delta {\alpha }\) for SH180 and significantly correlated with \(\Delta f\) for both indexes. The correlation of \(\Delta {\alpha }\) and \(\Delta f\) is weak, positive, and significant for both indexes, because both coefficients are around 0.05. We also see that \(\Delta f\) and B are significantly positive correlated and the correlation coefficients are 0.14 and 0.13 for SH180 and SZCI, agreeing with the underlying correlation between \(\Delta f\) and B inferring from their definitions and the geometric features of multifractal spectrum.

In Panel C of Table 1, we find that \(\Delta {\alpha }\) and \(\Delta f\) exhibit very strong autocorrelating behaviors, since their autocorrelation coefficients of lags 1 and 5 are positive, large, and significant. The autocorrelation coefficient of cumulative returns \(\sum r_m\) in moving windows is around 0.78 at lag 1 for both indexes and quickly fall to 0 with the increasing of lag, as the autocorrelation coefficient of lag 5 approaches 0. The autocorrelation in B is weak, because only the coefficient at lag 1 is significantly positive and the values are less than 0.1 for both indexes. In addition, the Ljung and Box Q statistics of lags 30 and 50 are statistically significant for the four variables, showing that the null hypothesis of no autocorrelation is rejected at the 1% level. Such results imply the existence of autocorrelation in \(\sum r_m\), \(\Delta {\alpha }\), \(\Delta f\), and B, which can be attributed to the fact that the four variables are estimated from overlapping moving windows.

We report the results of augmented Dickey-Fuller (ADF) unit root tests and ARCH tests in Panels D and E of Table 1. For the ADF unit root test, the optimal lag length is determined according to the Schwarz information criterion. For both indexes, the ADF statistics suggest that the null hypothesis of having a unit root in the series of \(\sum r_m\), \(\Delta {\alpha }\), \(\Delta f\), and B is rejected at the 1% level, indicating the stationarity of the four series. Panel F lists the F-statistics of the ARCH tests at lags 1, 5, 10 and 15, suggesting that the null hypothesis of no ARCH effects is rejected at any level of significance. The results clearly speak for the presence of heteroscedasticity in the series of \(\sum r_m\), \(\Delta {\alpha }\), \(\Delta f\), and B for both indexes.

Table 1 Basic information of the cumulative return (\(\sum r_m\)) and multifractal characteristics (\(\Delta {\alpha }\), \(\Delta f\), and B) in moving windows for SH180 and SZCI.

4 Predictive power of multifractal characteristics

Practitioners and researchers have put great efforts on predicting future returns. One natural thought is to use the information in the option market, since it reflects the expectation of the market participants, to predict the future price movements. Bu et al. (2019) employ the VIX volatility curve to forecast future stock returns in developed markets. And the variance risk premium is also regarded as a potential predictor for returns (Pyun, 2019). Factors such as time-varying consumption-aggregate wealth ratio (Chang et al., 2019), (time-varying) tail risk dynamics (Chevapatrakul et al., 2019) are found to exhibit certain predictive ability on the excess returns. In emerging markets, Chang et al. (2009) found that foreign capital flows in the option market can be used to predict the underlying asset returns in Taiwan. It is found that simple technical trading rules can yield positive excess returns even when the trading costs is considered, showing the evidence of predictability (Chang et al., 2004; Gunasekarage & Power, 2001).

4.1 In-sample tests

The daily excess return \(r^*\) on day d is defined as the difference between the index return \(r_d\) and the market risk-free return \(r_f\),

$$\begin{aligned} r^*(d) = r_d(d) - r_f(d). \end{aligned}$$
(10)

And the corresponding multifractal characteristics on day d, denoted as \(\Delta \alpha _{d-4:d}\), \(\Delta f_{d-4:d}\), and \(B_{d-4:d}\), are estimated from day \((d-4)\) to day d. We first separate the multifractal characteristic \(\Delta \alpha _{d-4:d}\) into six groups based on the following bins \((-\infty , 0.05)\), [0.05, 0.1], [0.1, 0.15), [0.15, 0.2), [0.2, 0.25), and \([0.25, \infty )\). In each group, we calculate the average value of \(\Delta \alpha _{d-4:d}\) and the average value of excess returns \(r^*_{d+1}\) on day \(d+1\). In Fig. 2 a, we illustrate the errorbar plots of the average excess return \(\langle r^*_{d+1} \rangle \) with respect to the average spectral width \(\Delta \alpha _{d-4:d}\) on the left y-axis and the fraction of the spectral width \(\Delta \alpha _{d-4:d}\) in each bin on the right y-axis. The top panel is the results of SH180 and the bottom panel is the results of SZCI. One can see that there is a slightly increasing trend for \(\langle r^*_{d+1} \rangle \) with the increasing of \(\langle \Delta \alpha _{d-4:d} \rangle \) for both indexes, indicating the predicting ability of \(\Delta {\alpha }\) on future excess returns. For \(\Delta f\) and B, we perform the same analysis. The corresponding results are shown in Fig. 2 b, c. One can observe a slightly decreasing trend for \(\langle r^*_{d+1} \rangle \) with the increasing of \(\langle \Delta f_{d-4:d} \rangle \) for both indexes, implying that there exists to be a negative correlation between \(\Delta f\) and future excess returns. However, we cannot find any dependence between \(\langle B \rangle \) and \(\langle r^*_{d+1} \rangle \), suggesting no information of future excess returns in B.

Fig. 2
figure 2

Illustration of the predictive power of multifractal characteristics (\(\Delta {\alpha }\), \(\Delta f\), and B) on the future excess returns for SH180 (top panels) and SZCI (bottom panels). a Plots of the average excess returns with respect to the average \(\Delta {\alpha }\) on the left y-axis and the fraction of \(\Delta {\alpha }\) in each bin on the right y-axis. b The same as a, but for \(\Delta f\). c The same as a, but for B

We further conduct the Granger causality tests between the excess returns \(r^*\) and the multifractal characteristics \(\Delta {\alpha }\), \(d_{f}\), and B. The corresponding null hypothesis of \(x \nrightarrow y\) is that x does not Granger cause y. The results are listed in Table 2. One can see that the excess return \(r^*\) is the Granger causality of \(\Delta {\alpha }\) and B for SH180 at the level of 5% and of B for SZCI at the level of 1%. The null hypothesis that \(\Delta {\alpha }\) does not Granger cause \(r^*\) is rejected for SH180 (respectively, SZCI) at the significant level of 0.1% (5%), indicating that the lagged \(\Delta {\alpha }\) may contain the information of future excess returns \(r^*\). For \(r^*\) and \(\Delta f\), the Granger tests cannot reject the null hypothesis, indicating no Granger causality between them.

Table 2 Results of Granger causality tests between excess returns \(r^*\) and multifractal characteristics

A standard univariate predictive regression framework is employed to test the predictive power of mulitfractal characteristics \(M_{d-4:d}\) from day \({d-4}\) to day d on the excess return on day \(d+1\),

$$\begin{aligned} r^*_{d+1} = \alpha _r + \beta _{r, i} M_{i, d-4:d} + {\varepsilon _d}, d = 5, \cdots , T \end{aligned}$$
(11)

where \(M_1 = \Delta {\alpha }\), \(M_2 = \Delta f\), \(M_3 = B\), \(M_4 = \Delta {\alpha } \Delta f\), \(M_5 = \Delta {\alpha } B\), and \(M_6 = \Delta f B\). The corresponding results of in-sample tests are shown in Table 3 for both indexes, in which the regression slopes, intercepts, and the adjusted \(R^2\) statistics are reported. The p-values, which are obtained from the NW t-tests (Newey & West, 1987), are also listed in the parentheses under the regression parameters. The null hypothesis of the NW t-tests is that the regressing slopes (\(\beta _{r,i}\)) are vanishing. If the null hypothesis is rejected for \(\beta _{r,i}\), we can conclude that multifractal characteristic \(M_i\) contains information of predicting future excess return \(r^*_{d+1}\). And accepting null hypothesis means that multifractal characteristic \(M_i\) is not a return predictor.

In panel A of Table 3, we list the results of in-sample tests for SH180. One can find that except \(\Delta {\alpha }, \Delta f\), and \(\Delta fB\) the other multifractal characteristics give insignificant regression coefficients. The regression coefficient of \(\Delta {\alpha }\) is significant at the level of 1% and the adjusted \(R^2\) statistics is 0.19%, meaning that \(\Delta {\alpha }\) can explain 0.19% of the excess returns. In Panel B, we report the regressing results of in-sample tests for SZCI. One can observe that the regressing coefficients of \(\Delta {\alpha }\), \(\Delta f\), \(\Delta {\alpha }\) \(\Delta f\), and \(\Delta fB\) are statistically significant. It is also observed that the regression coefficients of other multifractal characteristics are not statistically significant and the adjusted \(R^2\) are vanishing. Usually, \(\Delta {\alpha }\) can be regarded as a measurement of volatility. The predicting ability of \(\Delta {\alpha }\) can be linked to the market risk. Higher market risk, associating with high market volatility, results in higher risk premium (Merton, 1980; French et al., 1987).

Table 3 In-sample tests of predicting the excess return \(r^*_{d+1}\) on day \(d+1\) with multifractal characteristics \(M_{d-4:d}\) from day \(d-4\) to day d

We further run the in-sample regression by incorporating the market volatility, such that

$$\begin{aligned} r^*_{d+1} = \alpha _r + \beta _{r, i} M_{i, d-4:d} + \psi _{r} v_{d-4:d} + {\varepsilon _d}, d = 5, \cdots , T \end{aligned}$$
(12)

where \(v_{d-4:d}\) is the realized market volatility from day \(d-4\) to d, given by summing the square of minutely returns. The regression results of Eq. (12) are listed in Table 4. It is observed that the results in Table 4 are in accordance with those in Table 3. We find that the coefficients of \(\Delta {\alpha }\) are statistically significant for both indexes when adding the market volatility into the regression, revealing that the multifractal characteristic \(\Delta {\alpha }\) do have additional information beyond the market volatility.

Table 4 Comparison of the predictive ability between multifractal characteristic and market volatility

4.2 Out-of-sample tests

Out-of-sample tests are usually encouraged to evaluate the return predictability by excluding the using of future information and over-fitting in-sample tests. Two statistics, the \(R_{OS}^2\) statistic (Campbell & Thompson, 2008) and adjusted MSFE statistic (Clark & West, 2007), are employed to assess the out-of-sample forecasting performance. The \(R_{OS}^2\) statistic is defined as follows,

$$\begin{aligned} R_{OS}^2 = 1 - \frac{\sum _{d=n}^{T-1} \left( r_{d+1}^* - {\hat{r}}^*_{d+1)} \right) ^2}{ \sum _{d=n}^{T-1} \left( r_{d+1}^* - {\bar{r}}_{d+1}^* \right) ^2 } \end{aligned}$$
(13)

where \(r_{d+1}^*\) is the actual excess return, \({\hat{r}}_{d+1}^*\) is the predicting excess return, and \({\bar{r}}_{d+1}^*\) is the benchmark of historical average returns. \(R_{OS}^2\) measures the percent reduction in mean square forecast error for the predictive regression forecast relative to the historical average benchmark forecast. According to the definition, we can infer that \(R_{OS}^{2}\) locates in the range of \((\infty , 1]\). The predictive regression forecast \({\hat{r}}_{d+1}^*\) is better than the historical average \({\bar{r}}_{d+1}^*\) from the perspective of mean squared forecasting errors (MSFE) when \(R_{OS}^2 > 0\). The adjusted MSFE statistic, proposed by Clark and West (2007), is used to indicate the statistical significance of \(R_{OS}^2 > 0\) (Neely et al., 2014; Phan et al., 2015; Guo & Tao , 2017; Chen et al., 2017). The corresponding null hypothesis is \(R_{OS}^2 \le 0\), meaning that the MSFE of the historical average benchmark is less than or equal to that of predictive regression forecast.

Our out-of-sample tests are conducted in both moving windows and expanding windows. Here is the detailed procedure. The predictive regression (Eq. (11)) is estimated in each window and the obtained parameters are then used to generate the out-of-sample prediction for the day following that window. These steps are repeated by expanding or sliding the window till one reaches the end of sample period. The initial window includes the first 600 data points, about 2.5 years of data. The results are reported in Table 5. Panel A lists the results of the out-of-sample tests for SH180, one can find that \(\Delta {\alpha }, \Delta f\!, B\) and \(\Delta \alpha B\) have positive \(R_{\mathrm{{OS}}}^2\) in both moving and expanding windows, indicating that the predictive regression forecast is superior to the historical average, and the other predictors have \(R_{\mathrm{{OS}}}^2 < 0\). According to the adjusted MSFE statistics, we find that the \(R_{OS}^2\) not greater than 0 is rejected for \(\Delta {\alpha }\) and \(\Delta f\) at the significant level of 0.1% and 5% in the moving window and the expanding window. The results of the out-of-sample tests for SZCI are reported in Panel B of Table 5. One can see that \(R_{OS}^2\) corresponding to the predictors B and \(\Delta {\alpha } B\) are negative and the other predictors all have positive \(R_{OS}^2\) in the moving window and expanding window. However, only \(\Delta \alpha \) and \(\Delta f\) are statistically significant according to the adjusted MSFE statistics both in the moving window tests and \(\Delta f\) in the expanding window tests. We also find that including more data points into the predictive regression forecast will shrink the predictive ability of predictors, evidenced by lower \(R_{OS}^2\) in expanding window tests comparing with that in moving window tests. The out-of-sample tests reveal that the multifractal characteristics \(\Delta {\alpha }\) and \(\Delta f\) are good return predictors.

Table 5 Out-of-sample tests of predicting the excess index return \(r^*_{d+1}\) on day \(d+1\) with multifractal characteristics \(M_{d-4:d}\) from day \(d-4\) to day d

Following Campbell and Thompson (2008); Rapach & Zhou (2013); Wang et al. (2018), we employ the mean-variance utility function to test the economic significance of the return predictability based on multifractality. Such utility-based method treats the predictions as inputs of a specific trading rule or an asset allocation decision derived from maximizing the expected utility. Supposing that a mean-variance investor with relative risk aversion \(\gamma \) allocates his portfolio between a risky asset and a risk-free asset with the purpose of maximizing the following utility function,

$$\begin{aligned} U = \max \left[ \mathrm{{E}}_t \left( r_{t+1}^p \right) - \frac{\gamma }{2} \mathrm{{Var}}_t \left( r_{t+1}^p \right) \right] \end{aligned}$$
(14)

where \(r_{t+1}^p\) represents the return of the portfolio and \(\gamma \) measures the risk aversion. The portfolio returns \(r_{t+1}^p\) can be estimated as follows,

$$\begin{aligned} r_{t+1}^p = r_{t+1}^f + \omega _t r_{t+1}, \end{aligned}$$
(15)

where \(\omega _t\) corresponding to the optimal portfolio weight for the risky asset by solving Eq. (14),

$$\begin{aligned} \omega _{t} = \frac{1}{\gamma } \left( \frac{{\hat{r}}_{t+1} }{{\hat{\sigma }}^2_{r,t+1} }\right) \end{aligned}$$
(16)

where \({\hat{r}}_{t+1}\) is the expected excess returns, giving by the forecasting model and \({\hat{\sigma }}^2_{t+1} \) is the variance of excess returns. Once we have the portfolio \(r_t^p\) at different time t, we are able to calculate the utility U through Eq. (14).

By setting \(\gamma =5\) (for \(\gamma = 3\) very similar results are obtained), we estimate the utility \({\hat{U}}\) of the portfolio constructed based on the excess returns given by our return forecasting model, Eq. (11). For comparison, we also calculate the utility of the portfolio \({\bar{U}}\) obtained from a benchmark model, in which the excess returns are given by the historical mean returns. The utility gains \(\Delta U\) is defined as the difference between the utility of two portfolios, that is \(\Delta U = {\hat{U}}-{\bar{U}}\). This utility gain can be interpreted as the portfolio management fee that an investor would be willing to pay to have access to the information in the predictive regression forecast relative to the information in the historical average forecast alone (Rapach & Zhou, 2013). Because of the trading constraint, the optimal \(\omega _t\) is limited in the range of [0,  1.5]. The utility gain \(\Delta U\) is reported in Table 5 for different multifractal characteristics. We must note that significant \(R^2_{OS}\) doesn’t ensure positive utility gains, according to the results of the south Asia stock market(Rahman et al., 2019). Comparing with the expanding window, the utility gains in the moving windows are much more volatile, which is in line with the intuition that the latest information are more valuable for investments. And the positive utility gains of the multifractal characteristic \(\Delta \alpha \) and \(\Delta f\) for SH180 and SZCI in the moving window tests and the expanding window tests indicate that a mean-variance investor is willing to pay a portfolio management fee to follow the forecasting returns giving by our multifractal predictive models, as opposed to the historical means.

5 Robustness tests

To check the predicting ability of multifractal characteristics on the financial returns in different markets, we perform the same analysis on the S &P 500 index, the exchange rate between Bitcoin and US dollar, and the Chinese commodity futures of bean pulp, corn, copper, and natural rubber. The S &P 500 index covers from January 2, 1989 and the Bitcoin covers from February 10, 2014 to January 18, 2018. The prices of four commodity futures are recorded from March 10, 2005 to May 23, 2022 for bean pulp, from April 11, 2005 to May 23, 2022 for corn, from April 25, 2006 to May 23, 2022 for cooper, and from February 28, 2008 to May 23, 2022 for natural rubber. The corresponding results are reported in Table 6. One can find that all \(R^2_{OS}\) are significantly positive and all \(\Delta U\) are not less than zero in the moving window and expanding window analysis, supporting that the multifractal characteristic is a robust excess return predictor in different markets.

Table 6 Robustness tests of out-of-sample predicting excess returns from different markets with multifractal characteristics

We further separate the sample period of SH180 and SZCI into different sub-periods, including the pre-COVID period (4 June, 2003–9 January, 2020), the COVID period (10 January, 2020 - 23 May, 2022), the period before the market crash in 2015 (4 June, 2003–15 June, 2015), the period after the market crash in 2015 (16 June, 2015–23 May, 2022), the period of bull market (30 June, 2014–June 15, 2015), and the period of bear market (June 16, 2015–June 30, 2016). The predicting performance of multifractal characteristics is investigated in these sub-periods. The statistics of performance are listed in Table 7. Again, one can see that the regressing results in sub-periods are very similar to those in the entire periods. \(\Delta \alpha \) and \(\Delta f\) exhibit significant predictability as \(R^2_{OS}\) is significantly positive and \(\Delta U\) is not less than zero. This result indicates that the return predicting power of multifractal characteristic is robust in different sub-periods. Furthermore, it is also observed that multifractal characteristics exhibit higher return predictability in the period after the market crash in 2015 and the period of bear market. The possible reason is that volatile market increases the strength of multifractality, and in turn improves the return predictability, which motivates us to test the economic explanation on the return predictability of multifractal characteristics in the following section.

Table 7 Robustness tests of out-of-sample predicting excess returns in different periods with multifractal characteristics

6 Economic explanations

In this section, we try to explore the economic explanations on why the multifractal characteristics can predict the future returns. As the multifractal characteristics are used to capture the market volatility, one possible explanation is that the multifractal measurements reflect the market volatility risks. As the multifractality is originated from the fat-tailed distribution and more extreme events can lead to stronger multifractality, thus another possible explanation is that multifractal characteristics reflect the the market tail risk. We thus employ the realized volatility (RV) (respectively, the value at risk (VaR) and expected shortfall (ES)) to measure the market risk (respectively, the tail risk) and run the following regression,

$$\begin{aligned} R F_{i, d+1}=\alpha _v+\sum _{i} \beta _{i} M_{i, d-4 \cdot d}+\epsilon _{d+1} \end{aligned}$$
(17)

where \(RF_{1}=RV\), \(RF_{2}=VaR\), \(RF_{3}=ES\), \(M_{1}=\Delta \alpha \), \(M_{2}=\Delta f\), \(M_{3}=B\).

Table 8 reports the regressing results. For both indexes, the coefficients of \(\Delta \alpha \) are significant at the level of 0.1\(\%\), indicating that the spectral width \(\Delta \alpha \) is able to explain the three risk measures and agrees with the idiosyncratic volatility puzzle and the idiosyncratic tail risk puzzle in the Chinese stock markets (Nartea et al., 2013; Gu et al., 2018; Wan, 2018; Long et al., 2018). The predicator \(\Delta f\) is also statistically significant for the RV, VaR and ES of both indexes, indicating that the spectral height \(\Delta f\) has explanatory power for the volatility risk and the downside tail risk. Therefore, both \(\Delta \alpha \) and \(\Delta f\) can be regarded as a risk measure including the information of market volatility and downside tail risk.

Table 8 Linking multifractal characteristics \(M_{d-4: d}\) from day \(d-4\) to day d with risk measures on day \(d+1\)

7 Conclusion

In this paper, we apply MFDFA to detect the multifractal characteristics in high-frequency data of SH180 and SZCI. We find that both indexes exhibit strong multifractality. We extract three measures of multifractal spectra (\(\Delta {\alpha }\), \(\Delta f\), and B) from returns in moving windows with a size of five days and found that there is a positive correlation between \(\Delta {\alpha }\) and future excess returns, and a negative correlation between \(\Delta f\) and future excess returns. With the factor model, the return predictability is observed for the multifractal spectral width \(\Delta {\alpha }\) in both in-sample and out-of-sample tests, which is further corroborated by the robustness tests on the S &P 500 index, the exchange rate between Bitcoin and US dollar, four Chinese commodity futures, and the SH180 and SZCI in different sub-periods.

One possible explanation of such predicting ability is that multifractal characteristics can be linked to market volatility and the return predictability may be economically explained by the theory of risk premium. Another possible explanation is that the multifractality in stock returns originates from the fat-tailed distribution, which may contain the information of downside tail risks. Our work suggests that the multifractal characteristics \(\Delta \alpha \) and \(\Delta f\) jointly include the information of market volatility and downside tail risk, which indicates that multifractality may have potential applications on the risk management. On the one hand, one can design better risk measures based on multifractal characteristics. On the other hand, one can develop hybrid models by combining multifractality and machine learning techniques to improve the accuracy of volatility prediction (Pradeepkumar & Ravi, 2016, 2017, 2017b, 2020; Ravi et al., 2017). Furthermore, the future work can also be focused on testing the return predictability of multifractal characteristics on various industry sectors and regions, and developing trading strategies based on multifractal characteristics.