Introduction

Generally, green energy includes solar, hydraulic, wind, marine, and geothermal energy (Bardi et al. 2013). Renewable energy is advantageous in that it induces a low level of pollution (Bartłomiej et al. 2015); however, it is constrained by natural conditions (e.g., requiring resources such as hydraulic, wind, and solar energy) (Deborah et al. 2013). Furthermore, generating electricity through renewable energy is expensive because investment and maintenance expenses involved are substantial, yet production efficiency is relatively low (Farah et al. 2013).

Numerous scientists are seeking new technologies and methods for improving renewable energy (Akay et al. 2013). Several studies have indicated that renewable energy is certain to play an increasingly critical role as earth’s resources are depleted (Zhang et al. 2014).

McLellan et al. (2012) and Prakash et al. (2013) have reported that renewable energy offers the following advantages over conventional energy:

  1. (a)

    Renewability Conventional forms of primary energy, such as coal, petroleum, and natural gas, are formed through long geological processes. Earth has limited reserves of these forms of energy (Federica and Idiano 2015). Rahim and Victoria (2015) have reported that based on the current mining capacity, these forms of energy will ultimately become exhausted. People must seek and use sustainable energy sources that cannot be exhausted to replace these conventional forms of energy before their depletion. New forms of renewable energy, such as wind energy, solar energy, and biomass energy, are ideal alternatives.

  2. (b)

    Cleanliness The use of conventional energy leads to deadly pollution and environmental destruction (Farhad et al. 2015).

China surpassed the United States as the number one consumer of primary energy in 2010 and retained this position in both 2011 and 2012, truly becoming a major energy consumer. In 2012, the annual growth in Chinese petroleum consumption was 5.3 %, which was once again the highest increase in petroleum consumption globally. Chinese coal consumption also constituted 50.2 % of global coal consumption in 2012; this marked the first time China exceeded 50 % of global coal consumption. China’s primary energy consumption structure remains focused on coal. In 2012, coal constituted 67.8 % of the Chinese primary energy consumption, followed by petroleum at 18.2 % and natural gas at 4.8 %. Renewable energy constituted 9.2 % of the Chinese energy consumption (China Energy Statistical Year book 2013).

Chinese economy is currently the second largest economy in the world. It also consumes the most energy and emits the most greenhouse gases. China’s energy efficiency remains poor. With global climate change, environmental degradation, and shortages of conventional energy, a general consensus worldwide has been developed regarding the necessity of renewable energy. Following trends in environmental protection and the depletion of oil reserves, renewable energy has also become the core focus of China’s energy policy. In 2012, the 12 Five-Year Plan for National Economic and Social Development of the People’s Republic of China was presented. This plan included concrete indicators for renewable energy, projecting renewable energy consumption to reach approximately 10 % of the total energy consumption by 2015 and approximately 15 % of the total energy consumption by 2020 (Zhao et al. 2014).

Forecasting trends in renewable energy in China is extremely crucial. If trends in Chinese renewable energy consumption can be predicted accurately, then the development of renewable energy in China and even the entire world can be grasped. In addition, forecasting trends in renewable energy is a major part of green production.

This study forecasted growth trends in renewable energy consumption in China. Because historical data on renewable energy were limited in sample size and because the data were not normally distributed, forecasting methods used for analyzing large amounts of data (such as conventional regression analysis, neural networks, and genetic algorithms) were unsuitable for this study.

Grey system theory is applicable to system models with incomplete information, unclear behavior, and unclear operating mechanisms (Deng 2003). It can be used to perform a comprehensive analysis, observe developments and changes in systems, and conduct long-term forecasts (Deng 2004). The most outstanding feature of grey system theory is that it can be used to establish models with a minimum of only four data samples. Furthermore, the distribution of sample populations does not require making numerous stringent assumptions. Numerous studies have indicated that the original grey model (1, 1) [GM(1, 1)] has an extremely high predictive accuracy when data from small samples were used (Chen and Chen 2011).

Numerous studies have argued that GM(1, 1) is highly accurate only when the experimental sample data exhibit steady growth trends. Li et al. (2009) and Lee et al. (2014) have reported that when sample data demonstrated large fluctuations, GM(1, 1) required revision to increase its predictive accuracy. Such revised models include the nonlinear Bernoulli modification model and Markov modification model.

To improve the weak points of previous studies and to increase the predictive accuracy, this study used the GM(1, 1) model along with the revised nonlinear grey Bernoulli model (NGBM) to perform theoretical derivation and scientific verification. This study compared the results of the grey models with those of a regression analysis model to confirm the predictive accuracy and fitness of these three methods. This study determined the optimal prediction model among the three models. Finally, according to the derived results, we present a conclusion and suggestions for future research.

Literature review

Grey system theory and grey prediction

Herbig et al. (1993) stated that prediction is used to estimate future events or situations that organizations cannot control and to provide managers with a basis for planning. Therefore, prediction is vital in decision-making processes.

Deng (1982) proposed grey system theory, which can be used to perform relational analysis and model construction in ambiguous system models with incomplete information. Using grey system theory entails employing prediction and decision-making methods to investigate and understand system conditions. In grey system theory, black is used to represent a lack of information, white is used to represent complete information, and grey is used to represent insufficient or incomplete information. Kwonpongsagoon et al. (2007) indicated that Deng originally applied grey system theory to control areas, but subsequent scholars have developed and applied it to other fields, such as management decisions, socioeconomic research, and meteorological and water conservation forecast.

Grey system theory can be used to effectively process uncertainty, multivariate inputs, discrete data, and incomplete data (Chang et al. 2013). Grey theory is applicable to small, uncertain samples. The applications of grey theory differ from those of probability (statistics), which is used to analyze large, uncertain samples, and fuzzy theory, which is used to analyze cognitive uncertainty (Sadeghi et al. 2013).

According to the classifications by Xiong et al. (2010), Chang (2012), and Lee et al. (2014), studies on grey system theory can be divided into the following classes: (1) grey generating techniques, (2) grey relational analysis, (3) grey model construction, (4) grey prediction (5), grey decision making, and (6) grey control.

In recent years, grey forecasting has been applied to various fields, including management decisions, socioeconomic research, meteorological and water conservation forecasting, and disaster prevention. For example, Hu (2004) used GM(1, 1) to establish a forecast model to help consumers make optimal choices when purchasing new cars. Consumers must enter only their preferred brand, price, safety, functionality, and fuel consumption data to make optimal choices. Wang (2010) used an improved GM(1, 1) to forecast the number of Hong Kong, U.S., and German tourists who visited Taiwan between 1989 and 2000. The conclusion of this study was that the predictive accuracy of the improved GM(1, 1) was extremely high. Furthermore, because large samples were not necessary, the improved model greatly reduced the cost and time required for data collection. Kayacan et al. (2010) used the NGBM and grey Markov model to forecast the U.S. dollar and euro exchange rate between 2005 and 2007. The results of this study indicated that the models had high accuracy when the experimental sample data showed steady growth trends. Akay et al. (2013) used various models to forecast electricity demand in Turkey and observed that grey theory demonstrated the most satisfactory predictions. Lee et al. (2014) used grey system models and fuzzy time series to forecast trends in electrochromic electronic materials. The results indicated that the revised GM(1, 1) was the optimal forecasting model for small samples.

The aforementioned studies have reflected the following advantages of grey prediction: (1) Grey prediction does not require substantial amounts of historical data. The amount of data can be selected according to actual conditions and requirements. In general, forecasting models can be established with no less than four data pieces. (2) Grey prediction does not require the use of numerous associated factors. Data are easy to obtain, considerably reducing the time and cost of collecting data. (3) Grey prediction is extremely accurate.

Researchers in previous related studies have focused primarily on using one or two grey prediction methods to improve the accuracy of their predictions. However, the same methods may not be appropriate for sample data obtained from different industries. Therefore, to improve the weak points of previous studies and to increase the predictive accuracy, this study used the GM(1, 1) model and the revised NGBM(1, 1) model for theoretical derivation and verification. In addition, this study performed a comparison using regression analysis to assess the advantages and disadvantages of grey prediction with small samples and to confirm the predictive accuracy and goodness of fit of the three models. Finally, this study determined the optimal prediction model among the three models.

Methodology

GM(1, 1)

GM(1, 1) is a model used in grey theory for performing forecasts. It expresses a first-order differential and has a single input variable. (Chang 2012). Grey prediction is a method of performing forecasts with existing data based on GM(1, 1). It is used to investigate the future dynamic conditions of numerous elements within a series (Avinash et al. 2012).

The primary advantage of grey prediction is that it does not entail using much data and its mathematical foundation is simple. Because the GM(1, 1) requires as little as four data pieces, rolling checks in series with at least four data pieces can be used to test the reliability of the prediction model. In addition, the residuals of the predicted and actual values reflect the reliability of the prediction model. Lower residuals reflect greater reliability.

In this study, series prediction, which is the direct establishment of grey prediction models according to given information, was addressed. The steps for establishing GM(1, 1) for grey prediction are outlined as follows:

(a) First, define the given data as the original series:

$$X^{(0)} = \left\{ {x^{(0)} (1),\,x^{(0)} (2), \ldots ,\,x^{(0)} (n)} \right\}.$$
(1)

(b) Use a single AGO to sum the established original series. The following generating sequence is obtained:

$$X^{(1)} = \left\{ {x^{(1)} (1),\,x^{(1)} (2), \ldots ,\,x^{(1)} (n)} \right\},$$
(2)

where

$$x^{\left( 1\right)} \left( k \right) = \sum\limits_{i = 1}^{k} {x^{(0)} \left( i \right), \quad k = 1, 2, 3 ,\ldots,\,n}.$$
(3)

(c) Establish the GM(1, 1) differential equation as a differential equation with one order and one variable as follows. In the following equation, b is a constant term:

$$\frac{{{\text{d}}X^{(1)} }}{{{\text{d}}t}} + aX^{(1)} = b.$$
(4)

The derivative definition can be stated as follows:

$$\frac{{{\text{d}}X^{\left( 1 \right)} \left( t \right)}}{{{\text{d}}t}} \cong \frac{{X^{\left( 1 \right)} \left( {k + \Delta t} \right) - X^{\left( 1 \right)} }}{\Delta t}.$$
(5)

If ∆t = 1, then

$$\frac{{{\text{d}}X^{\left( 1 \right)} \left( t \right)}}{{{\text{d}}t}} \cong X^{\left( 1 \right)} \left( {k + 1} \right) - X^{(1)} (k) = X^{(0)} (k).$$
(6)

In addition, X (1) ≅ Z (1) and

$$Z^{\left( 1 \right)} (k) \cong \frac{{X^{\left( 1 \right)} (k) + X^{1} (k + 1)}}{2}, \quad k \, = \, 2,3,4, \ldots ,\,n.$$
(7)

The following expression is obtained using Eqs. (4), (5), (6) and (7):

$$X^{\left( 0 \right)} \left( k \right) + aZ^{(1)} = b.$$
(8)

(d) Use the least square method and differential and difference equations to obtain parameters a and b.

Thus,

$$\theta = \left[ {a,b} \right]^{\text{T}} = (B^{\text{T}} B)^{ - 1} B^{\text{T}} Y,\quad Y = B\theta$$
(9)
$$Y = \left[ {X^{(0)} (2), \; X^{(0)} (3), \ldots ,X^{(0)} (n)} \right]^{\text{T}}$$
(10)
$$B = \left[ \begin{aligned} \begin{array}{ll}- \frac{1}{2}[(X^{(1)} (1) + X^{(1)} (2)], & 1 \\ - \frac{1}{2}[(X_{{}}^{(1)} (2) + X_{{}}^{(1)} (3)], & 1 \\ \qquad \qquad \quad \vdots & {} \\- \frac{1}{2}[(X^{(1)} (n - 1) + X^{(1)} (n)], & 1 \end{array} \end{aligned}\right].$$
(11)

(e) Use the grey differential equation to obtain the grey AGO equation:

$$\hat{X}^{\left( 1 \right)} \left( {k + 1} \right) = \left[ {X^{\left( 0 \right)} \left( 1 \right) - \frac{b}{a}} \right]e^{ - ak} + \frac{b}{a},\quad k = 1,2,3, \ldots ,n.$$
(12)

Next, use the IAGO for reduction to obtain the required forecasting model:

$$\hat{X}^{\left( 0 \right)} \left( k \right) = \left[ {X^{\left( 0 \right)} \left( 1 \right) - \frac{b}{a}} \right](1 - e^{a} ) \, e^{ - a(k - 1)} ,\,k = 1,2,3, \ldots ,n.$$
(13)

NGBM(1, 1)

The NGBM is an original prediction model derived from GM(1, 1) combined with the Bernoulli equation, a basic differential equation (Li et al. 2010). This model retains the two advantages of GM(1, 1): it has a simple derivation process and requires only four data samples for modeling. In addition, in the NGBM, the prediction error of GM(1, 1) is reduced and the predictive accuracy of GM(1, 1) on nonlinear data types is improved (Feng et al. 2012).

GM(1, 1) is a special case of the NGBM. The procedures involved in deriving the NGBM equation are outlined as follows:

  1. (a)

    The obtained data are defined as an original series. New series can be obtained using the AGO for calculation. The first three equations for the NGBM are identical to Eqs. (1)–(3) for GM(1, 1).

  2. (b)

    At this point, the NGBM equations differ from those of GM(1, 1). Use the Bernoulli equation to establish the NGBM differential and difference models.

    The NGBM differential equation is as follows:

    $$\frac{{{\text{d}}X^{(1)} }}{{{\text{d}}t}} + aX^{(1)} = b\left[ {X^{(1)} } \right]^{r}.$$
    (14)

    The following NGBM differential equation can be obtained by substituting Eqs. (18) into Eqs. (4), (5), (6), and (7) of GM(1, 1):

    $$X^{\left( 0 \right)} \left( k \right) + aX^{\left( 1 \right)} \left( k \right) = b\left[ {Z^{\left( 1 \right)} (k)} \right]^{r} , \quad k \, = \, 2,3,4, \ldots n.$$
    (15)
  3. (c)

    Use the least squares method and NGBM differential and difference equations to obtain a and b.

    Thus,

    $$\theta = \left[ {a,b} \right]^{\text{T}} = (B^{\text{T}} B)^{ - 1} B^{\text{T}} Y,\,Y = B\theta$$
    (16)
    $$Y = \left[ {X^{(0)} (2),X^{(0)} (3), \ldots ,\,X^{(0)} (n)} \right]^{\text{T}}$$
    (17)
    $$B = \left[ {\begin{array}{*{20}l} { - \frac{1}{2 }\left[ {X^{\left( 1 \right)} \left( 1 \right) + X^{\left( 1 \right)} \left( 2 \right)} \right]} & {\left\{ { - \frac{1}{2 }\left[ {X^{\left( 1 \right)} \left( 1 \right) + X^{\left( 1 \right)} \left( 2 \right)} \right]} \right\}^{r} } \\ { - \frac{1}{2 }\left[ {X^{\left( 1 \right)} \left( 2 \right) + X^{\left( 1 \right)} \left( 3 \right)} \right]} & {\left\{ { - \frac{1}{2 }\left[ {X^{\left( 1 \right)} \left( 1 \right) + X^{\left( 1 \right)} \left( 2 \right)} \right]} \right\}^{r} } \\ \qquad \qquad \; \; \; \vdots & \qquad \qquad \quad \; \vdots \\ { - \frac{1}{2 }\left[ {X^{\left( 1 \right)} \left( {n - 1} \right) + X^{\left( 1 \right)} \left( n \right)} \right]} & {\left\{ { - \frac{1}{2 }\left[ {X^{\left( 1 \right)} \left( {n - 1} \right) + X^{\left( 1 \right)} n} \right]} \right\}^{r} } \\ \end{array} } \right].$$
    (18)
  4. (d)

    Use the grey differential equation to derive the grey AGO equation:

    $$\hat{X}^{\left( 1 \right)} \left( {k + 1} \right) = \left[ {\left[ {X^{(0)} (1)^{(1 - r)} - \frac{u}{a}} \right]^{{e^{{ - a\left( {1 - r} \right)k}} }} + \frac{b}{a}} \right]^{{\left( {\frac{1}{1 - r}} \right)}} ,\,k = 1,2,3, \ldots n.$$
    (19)
  5. (e)

    Reduce Eq. (19) using the IAGO to obtain the required forecasting model:

    $$\hat{X}^{\left( 0 \right)} \left( k \right) = \hat{X}^{\left( 1 \right)} \left( k \right) - \hat{X}^{\left( 1 \right)} \left( {k - 1} \right), \quad k = 1,2,3 \ldots ,n.$$
    (20)

Predictive accuracy measurement

The estimated difference between the actual values and the predicted values obtained using a prediction model is considered the prediction error (Li et al. 2011). As a judgment method, determining the prediction error indicates the success of a forecasting model. In this study, we adopted the three most indicative measurement models—the mean absolute error (MAE), mean squared error (MSE), and mean absolute percentage error (MAPE)—to measure the accuracy of the forecasting models (Chang et al. 2013).

The various error types are defined as follows:

  1. (1)

    Lower MAE indicates a more satisfactory predictive ability.

    $${\text{MAE}} = \frac{\mathop \sum \nolimits \left| e \right|}{n}$$
    (21)
  2. (2)

    Lower MSE indicates a more satisfactory predictive ability.

    $${\text{MSE}} = \frac{{\mathop \sum \nolimits e^{2} }}{n - 1}$$
    (22)
  3. (3)

    Lower MAPE indicates a more satisfactory predictive ability.

    $${\text{MAPE}} = \frac{{\mathop \sum \nolimits \left| \frac{e}{a} \right|}}{n} \times 100\;\% ,\quad a = {\text{ actual value}}$$
    (23)

Lower errors indicate higher accuracy. According to Lewis (1982), MAPE values lower than 10 % indicate predictions with high predictive accuracy (Table 1).

Table 1 MAPE forecasting accuracy reference criteria

Results and discussion

After numerous years of development, the ratio of renewable energy consumption to overall primary energy consumption in China has increased annually. In this study, we collected 22 data samples. In 1991, the percentage of renewable energy consumption within overall energy consumption was 4.8 %; furthermore, it was 7.5 % in 2001, 6.7 % in 2006, 8.6 % in 2010, and 9.2 % in 2012 (Table 2).

Table 2 Percentage of renewable energy consumption out of overall energy consumption in China

GM(1, 1) results

This study established GM(1, 1) using four data samples and employed MATLAB for the calculation. This study then used Eqs. (1)–(13) to derive the predicted values. The predicted value for the percentage of renewable energy consumption from the overall energy consumption in 1995 was 5.3 %. In 2001, 2006, 2010, and 2012, these values were predicted to be 7.1, 7.4, 8.4, and 9.1 %, respectively; Table 3 shows other details. The predicted values for 2013, 2014, and 2015 were 9.4, 9.6, and 10.1 %, respectively.

Table 3 Predicted values (%) from the three forecasting models

This study subsequently used Eqs. (21)–(23) in Excel to calculate the predictive accuracy indicators for GM(1, 1). The MAE was 0.394, MSE was 0.244, and MAPE was 5.855 %; Table 4 shows the detailed data.

Table 4 Forecast accuracy indicators for the three forecasting models

NGBM(1, 1) results

This study applied NGBM(1, 1) for prediction and used MATLAB for calculation. This study then used Eqs. (14)–(20) to derive the predicted values. The predicted value for the percentage of renewable energy consumption from the overall energy consumption in 1995 was 5.6 %. In 2001, 2006, 2010, and 2012, the predicted values were 7.3, 7.4, 8.5, and 9.1 %, respectively; Table 3 shows other details. The predicted values for 2013, 2014, and 2015 were 9.5, 9.8, and 10.3 %, respectively.

This study subsequently used Eqs. (21)–(23) in Excel to calculate the predictive accuracy indicators for NGBM(1, 1). The MAE was 0.333, MSE was 0.186, and MAPE was 4.893 %; Table 4 shows the detailed data.

Regression analysis results

To compare the predictive accuracy of these methods, this study performed forecasts using a regression analysis model and compared the results with those of the aforementioned models.

This study employed Minitab to establish a regression analysis model and scatter plot using the 1991–2012 data shown in Table 2. Table 3 and Fig. 1 show the detailed data. The R2 and adjusted R2 of this model were both 1.0. The predicted values for 2013, 2014, and 2015 were 8.5, 8.7, and 8.8 %, respectively.

Fig. 1
figure 1

Regression analysis and scatter diagram

The regression equation can be expressed as follows:

$$y = \, 0. 1 5 5 3x{\,-\,}{ 3}0 4. 1 2.$$

This study then used Eqs. (21)–(23) in Excel to calculate the predictive accuracy indicators for the regression analysis model. The MAE was 0.430, MSE was 0.265, and MAPE was 6.026 %; Table 4 shows the detailed data.

Discussion

This study used the MAE, MSE, and MAPE to compare the predictive accuracy of the three grey prediction models. NGBM(1, 1) exhibited the highest predictive accuracy, followed by GM(1, 1). The results indicated that the revised grey prediction models, NGBM(1, 1), registered more satisfactory forecasts than the GM(1, 1) did.

According to Lewis (1982), a MAPE lower than 10 % indicates a high predictive accuracy. The MAPEs of NGBM(1, 1), GM(1, 1), and regression analysis model were 4.893, 5.855, and 6.026 %, respectively. Therefore, NGNM(1, 1) returned the most accurate forecasts, followed by GM(1, 1) and regression analysis model. The results indicated that the modified NGBM(1, 1) exhibited superior predictive abilities among the compared models.

This study compared the predictive accuracy of the grey prediction models with that of the regression analysis model. According to the MAE, MSE, and MAPE values, the forecasts of the grey prediction models were more satisfactory than those yielded by the regression analysis model (Table 4).

Figure 2 illustrates the comparison of the predicted values from the three models with the actual values. NGBM(1, 1) demonstrated the highest predictive accuracy, followed by GM(1, 1) and finally the regression analysis model.

Fig. 2
figure 2

Comparison of these prediction models

The obtained results confirmed that grey prediction demonstrates the following advantages: (1) Grey prediction does not require substantial amounts of historical data. Data amounts are selected according to only actual circumstances and requirements. In general, a forecasting model can be established using a minimum of four data samples. (2) Grey prediction methods involve simple calculations. (3) Grey prediction does not require an excessive number of related factors. Data required for grey prediction are obtained easily, and data collection time and costs are considerably lower compared with those required in other methods. (4) Grey prediction is highly accurate.

The results obtained in this study indicated that in the revised grey prediction models, NGBM(1, 1) yielded a higher predictive accuracy than that of the GM(1, 1) when small data samples were used. The predictive capabilities of the grey prediction models were also superior to those of the conventional regression analysis model.

Conclusion

Forecasting involves estimating events or circumstances that organizations cannot control; it is performed to provide a basis for managers to formulate plans. Therefore, forecasting is critical during decision-making processes. In this study, we forecasted growth trends in renewable energy consumption in China. Renewable energy technologies are emergent. We used only 22 data samples in this study. Because historical data about renewable energy were limited in sample size and because the data were not normally distributed, forecasting methods used for analyzing large amounts of data were unsuitable for this study.

According to the empirical results, the study compared the predictive accuracy of the grey prediction models with that of the regression analysis model. The MAE, MSE, and MAPE results indicated that NGBM(1, 1) demonstrated the highest predictive accuracy, followed by GM(1, 1). The regression analysis model demonstrated the lowest predictive accuracy among the models. In addition, the study verified that when small data samples were used, in the modified grey prediction models, NGBM(1, 1) yielded more accurate predictions than the GM(1, 1) did.

Numerous previous industrial studies may have been unable to make accurate predictions because they could not obtain substantial amounts of historical data. This paper suggested that subsequent researchers build on the foundation of this study and select different industries to perform cumulative studies.