1 Introduction

Vital to the role of crop yield models as early warning and assessment tools for food security applications is the prediction of seasonal climate (Hansen et al. 2010; Robertson et al. 2007). Many studies have been conducted to explore this linkage, and several approaches have been proposed to reconcile the mismatch between the scale of climate data and the requirements of crop models (Hansen and Indeje 2004). Crop models require daily weather inputs, while climate forecasts come in seasonal formats not compatible with what the crop models need. One of the promising approaches is the use of GCM outputs to drive crop simulation models as they can predict climate in advance before the growing season and they can provide daily weather inputs. The GCM outputs have been used for water resources planning as well as addressing other socio-economic challenges (Sivakumar 2011). Several methods are used to downscale GCM outputs at the river basin scale which include multiple linear regression, robust regression, ridge regression, artificial neural networks, and Bayesian neural networks to identify an appropriate transfer function in statistical downscaling models (Mishra and Singh 2009; Sivakumar 2011; Jeong et al. 2012).

Using raw daily GCM outputs directly as inputs to the crop model, however, is not advisable due to the biases in the GCM data (Baron et al. 2005). Before applying the GCM outputs for required applications, the bias needs to be removed for better results (Ines and Hansen 2006; Wood et al. 2004; Maurer 2007; Mishra and Singh 2009; Kyoung et al. 2010). Ines and Hansen (2006) applied a simple bias correction method and successfully removed the biases in rainfall frequency and intensity of daily GCM rainfall. When linked with a crop model, the two-step bias correction procedure they used improved simulated yields, although the yields were still under-predicted, which they attributed to the inability of the bias correction method to correct the time structure in daily GCM rainfall. Baigorria et al. (2008) adapted the method in regional climate model (RCM) outputs, including daily temperature and solar radiation, and showed that the improvements made to the RCM outputs corrected biases in the predicted yields as compared to using raw RCM outputs directly. Recently, Ines et al. (2011) showed that by linking bias corrected GCM rainfall with a stochastic disaggregation method to redistribute the corrected wet days properly in the time series could somehow correct the time structure mismatch in the bias corrected GCM rainfall, and further improve the prediction of crop yields.

Lacking in the above studies is the in-depth analyses of where these improvements in yields are coming from. Although it is generally accepted that the improvements made in the biases of the GCM rainfall frequency, intensity and dry spell lengths resulted in the improvements of predicted yields, nothing has been done to quantify the causal relationships. Delineating this information is crucial for the development of better crop yield prediction models, as this will provide insights into which climate information from the GCM is best to produce a more robust and reliable crop yield prediction system. This paper aims to present a hierarchy of crop prediction models based on dynamic simulation of crop growth using a crop model and combinations of downscaling schemes of GCM rainfall, and to extract information content from downscaled GCM rainfall variables using information theory for identifying both the downscaling technique as well as suitable precipitation variables. Since it is not possible to present the entire crop yield simulation results, our objective is only focused on Maize yield. We conduct our study in Katumani, Machakos, Kenya during the short October–December rainy season, using maize (Zea mays)—a staple food in the region, as a case study.

2 Methodology

2.1 Bias correction (BC), stochastic disaggregation (DisAg) and combination (BC-DisAg)

We follow the procedure of Ines and Hansen (2006) and Ines et al. (2011) for the bias correction of daily GCM rainfall. Basically, the GCM bias correction is done by:

  1. (a)

    The number of wet days in the GCM for month I is corrected by fitting a GCM threshold (\( x_{I,GCM} \)) derived from the observed historical rainfall distribution:

    $$ x_{I,GCM} = F_{I,GCM}^{ - 1} \left[ {F_{I,obs} (x_{I,obs} )} \right] $$
    (1)

    where \( x_{I,obs} \) is the threshold for a wet day from observations, and \( F_{I,GCM}^{{}} \) and \( F_{I,obs}^{{}} \) are empirical cumulative distributions of GCM and observed rainfall, respectively.

  2. (b)

    Rainfall intensity of GCM is corrected by fitting the truncated rainfall distribution of the climate model into a two-parameter gamma (\( G_{I,\,GCM} \)) and then mapping it with the truncated, gamma-fitted observed rainfall distribution (\( G_{I,\,obs} \)). Correcting rainfall (x i ) for a day is done as

    $$ x_{i}^{\prime } = \left\{ {\begin{array}{*{20}c} {G_{I,obs}^{ - 1} \left[ {G_{I,GCM} (x_{i} )} \right]} \hfill & {x_{i} > x_{I,GCM} } \hfill \\ 0 \hfill & {} \hfill \\ \end{array} } \right. $$
    (2)

    where \( x^{\prime}_{i} \) is the corrected GCM rainfall for that day. Shape and scale parameters of GCM and observed rainfall gamma distributions are calculated using the maximum likelihood method.

The method outlined above is called BC1 in this paper.

The general notion of daily GCM rainfall is that it always over-predicts rainfall frequency, but this is not the case always. An update of BC1 called BC2 was designed to handle cases when GCM rainfall frequency is less than or more than the observed. When GCM rainfall frequency is less than the observed, we append a number of wet-day events with minimum rainfall amounts (i.e., \( x_{1,obs} \, + \,0.1\,\,{\text{mm}} \)) in the GCM rainfall CDF to match the observed rainfall frequency. The number of appended wet-days depends on the discrepancy between the number of wet-days in the GCM rainfall CDF above the calibrated threshold and observations. These wet days are added and distributed evenly (stochastically) in the GCM rainfall daily time series. BC2 also handles if the left-side of the empirical distribution of the GCM rainfall contains more values equivalent to \( x_{1,GCM} \) above the truncation point (Ines et al. 2011) by preserving those values instead of eliminating them ensuring that rainfall frequency is preserved.

Linking the bias corrected GCM information with stochastic disaggregation (DisAg), i.e., through a conditional stochastic weather generator (Hansen and Ines 2005), is done as follows:

We used monthly rainfall frequencies of the bias-corrected daily GCM rainfall to adjust the first- and second-order transition probabilities of a high-breed Markov chain rainfall occurrence model to correct the time distribution of wet and dry days. Formalism for adjusting the transition probabilities of a high-breed Markov chain rainfall occurrence model within the stochastic disaggregation can be found elsewhere (e.g., Katz and Parlange (1998) and Hansen and Ines (2005)) and is summarized below (see Ines et al. 2011).

The rainfall occurrence model of the stochastic weather generator used in the stochastic disaggregation is a two-state, hybrid second-order Markov chain able to simulate rainfall occurrence with a first-order chain if the previous day was wet, or a second-order chain if the previous day was dry (Hansen and Ines 2005). If the Markov model simulates the occurrence of rainfall on a given day, the rainfall amount is sampled from a mixture of two exponential distributions. Hansen and Mavromatis (2001) described details of the temperature and solar radiation sub-models.

The rainfall frequency of bias-corrected daily GCM rainfall was used to adjust the first- and second-order transition probabilities of the rainfall occurrence model to simulate time series of wet and dry days. Transition probabilities (i.e., first-order probabilities, e.g., wet day following a dry day (p 01), wet to wet (p 11), and second-order probabilities, p 101 and p 001) to match a target rainfall frequency (e.g., from bias-corrected GCM rainfall) are related directly to the unconditional rainfall occurrence probability, π (Eq. 3) and persistence of dry days, ρ 1 (Eq. 4) (Katz and Parlange 1998):

$$ \pi = \frac{{p_{01} }}{{1 + p_{01} - p_{11} }} $$
(3)
$$ \rho_{1} = p_{11} - p_{01} $$
(4)

With the assumption that persistence of dry days remains constant when a target rainfall occurrence probability changes (e.g., from bias corrected GCM rainfall), \( \pi^{\prime} \), the first-order adjusted transition probabilities (with apostrophes) are determined by solving Eqs. (3) and (4) simultaneously, thus,

$$ p^{\prime}_{01} = \pi^{\prime}\left( {1 - \rho_{1} } \right) $$
(5)
$$ p^{\prime}_{11} = \rho_{1} + p'_{01} $$
(6)

Equations (5) and (6) are used to determine a wet day if the previous day was wet. If the previous day was dry, the second-order Markov chain is used to determine if the current day will be wet or dry through the adjusted second-order transition probabilities. The transition probabilities for the hybrid second-order Markov chain rainfall occurrence model can be adjusted for a given rainfall frequency by keeping the first- and second-order persistence of dry days (ρ 1 (Eq. 4) and ρ 2 (Eq. 7)) constant:

$$ \rho_{2} = p_{101} - p_{001} $$
(7)

The adjusted transition probabilities are given as follows (Eqs. 810) (Hansen and Mavromatis 2001; Katz and Parlange 1998):

$$ p^{\prime}_{11} = \pi^{\prime}\left( {1 - \rho_{1} } \right) + \rho_{1} $$
(8)
$$ p^{\prime}_{001} = \pi^{\prime}\;\frac{{\left( {1 - \rho_{1} } \right)\left( {1 - \pi^{\prime}} \right) - \rho_{2} \left( {1 - p^{\prime}_{11} } \right)}}{{1 - \pi^{\prime}}} $$
(9)
$$ p^{\prime}_{101} = p^{\prime}_{001} + \rho_{2} $$
(10)

If rainfall frequency and total are used at the same time to condition the stochastic weather generator, (i) transition probabilities (first- and second-order) of the rainfall occurrence model are adjusted based on bias-corrected monthly GCM rainfall frequency, (ii) daily rainfall realizations are generated iteratively until generated monthly rainfall total matches 95 % of the target value (e.g., bias-corrected monthly GCM rainfall), and (iii) generated daily values are re-scaled by a ratio of monthly target (\( R_{m} \)) and generated rainfall totals (\( \bar{R}_{m} \)) (i.e., \( \frac{{R_{m} }}{{\bar{R}_{m} }} \)) such that the monthly rainfall total generated matches the target value. These steps are repeated for each calendar month in a year for all the considered years (Hansen and Ines 2005). Based on BC1, BC2, combination of BC and DisAg (BC-DisAg), different downscaling methods were used in the study using different sources of rainfall information (Table 1).

Table 1 Downscaling schemes conducted in the study

2.2 Maize simulations

We used CERES-Maize (Ritchie et al. 1998) to evaluate the performance of the bias-corrected, and bias-corrected—disaggregated GCM rainfall in predicting maize yields. Data on soil properties (sandy clay loam), crop cultivar and management practices were based on a previous study at the study site (Keating et al. 1992). Every year, the water balance was re-initialized on 17 October with soil water at 20 % of the soil capacity. The date of sowing was decided when the soil water content exceeded 40 % of the soil capacity over the top 15 cm depth, or forced on 1 November, otherwise. The plant density was set at 4.4 plants m−2 and 20 kg N ha−1 as ammonium nitrate was applied at planting (low inputs system) (Ines and Hansen 2006). First, the crop model was run with observed daily rainfall, then with the rainfall outputs from the cases enumerated in Table 1. In the crop modeling, daily minimum (T min ) and maximum temperature (T max ) and solar radiation (SRAD) were all generated from monthly mean values conditioned on the occurrence or non-occurrence of rainfall.

2.3 Mutual information using nonparametric method

The relationship between rainfall variables and crop yield do not share a linear relationship as studied in many previous articles due the fact that rainfall variables (i.e., amount, intensity and frequency) are highly stochastic in nature. Therefore it is important to measure the information content between precipitation variables and crop yield, and entropy seems to be quite useful for measuring the information between two variables. The mutual information (MI) using entropy is used as a measure of statistical dependence among random variables which captures the full dependence structure, both linear and nonlinear and several applications made in the last decades (Moon et al. 1995; Sharma 2000; Mishra and Coulibaly 2010).

A high value of MI score would indicate a strong dependence between two variables. The mutual information between two random variables S and Q is defined as (Fraser and Swinney 1986):

$$ MI(S,Q) = \int {P_{sq} (s,q)\log \left[ {{{P_{sq(s,q)} } \mathord{\left/ {\vphantom {{P_{sq(s,q)} } {P_{s} (s)P_{q} (q)}}} \right. \kern-0pt} {P_{s} (s)P_{q} (q)}}} \right]} \,dsdq $$
(11)

where \( P_{s} (s) \) and \( P_{q} (q) \) are the marginal probability density functions (pdf’s) of S and Q respectively, and \( P_{sq} (s,q) \) is the joint pdf of S and Q. In our study, S denotes maize yield, whereas Q represents individual precipitation variables (i.e., amount, intensity and frequency).

For any given bivariate sample, the MI score in Eq. (11) can be estimated as:

$$ MI(S,Q) = \frac{1}{n}\sum\limits_{i = 1}^{n} {\log_{e} \left[ {P_{sq} (s_{i} ,q_{i} )/P_{s} (s_{i} )P_{q} (q_{i} )} \right]} $$
(12)

where \( s_{i} \) and \( q_{i} \) are the ith bivariate sample data pair in a bivariate sample of size n.

The multivariate kernel density estimator (Scott 1992; Wand and Jones 1995) using a Gaussian kernel function for estimating joint and marginal pdfs can be defined as:

$$ \hat{f}_{X} (x) = \frac{1}{n}\sum\limits_{i = 1}^{n} {\frac{1}{{(2\pi )^{d/2} \lambda^{d} \det (Sc)^{1/2} }}\exp \left( { - \frac{{(x - x_{i} )^{T} Sc^{ - 1} (x - x_{i} )}}{{2\lambda^{2} }}} \right)} $$
(13)

where the mutivariate kernel density estimate is denoted by \( \hat{f}_{X} (x) \), the sample covariance of the variable set X is denoted bySc; and λ is known as the bandwidth of the kernel density estimate. In this study λ is chosen as the optimal Gaussian bandwidth for a normal kernel given as (Scott 1992; Silverman 1986):

$$ \lambda = \left( {\frac{4}{d + 2}} \right)^{1/(d + 4)} n^{( - 1/(d + 4))} $$
(14)

where n and d refer to the sample size and dimension of the multivariate variable set, respectively.

3 Data and study area

The analyses are based on data from the Katumani Dryland Research Center (1°35′S, 37°14′E, 1601 a.m.s.l) in the Machakos District of Eastern Kenya, a main maize growing region. Rainfall is bimodal in distribution and the climate is marginal for maize in both seasons. Because of strong food preferences, maize is the staple crop. The October–December short rainy season is an important maize growing season, and is fairly predictable at a seasonal lead-time using statistical (Indeje et al. 2000) and dynamic (Hansen and Indeje 2004) forecast models, making it interesting to test the utility of daily GCM rainfall from crop yield prediction.

For analysis, we used ECHAM4.5 daily rainfall outputs (Roeckner et al. 1996) forced by observed sea surface temperature (SST) (http://iridl.ldeo.columbia.edu). The climate model grid cell encompassing the Katumani Dryland Research Center in the Machakos district of eastern Kenya was selected. Daily rainfall outputs (1970–1995) from all 24 ensemble members in this grid cell were extracted. Figure 1 shows how skillful (R) is the uncorrected GCM rainfall in the study area. Daily rainfall observations from the research center were used for the GCM rainfall bias correction. Other input data needed for the quantitative evaluation (crop modeling) of the bias-corrected GCM daily rainfall were all collected from the research station (Hansen and Indeje 2004).

Fig. 1
figure 1

Correlation of uncorrected ECHAM4.5 rainfall for October–November–December (OND) season (1970–1995) against University of East Anglia’s gridded rainfall data. Star mark shows the location of Katumani Dryland Research Center

All 24 members were bias-corrected using the threshold, >0 mm, delineating a wet day. The bias corrections were conducted using the simple bias correction of Ines and Hansen (2006) (BC1) and that of Ines et al. (2011) (BC2) that accounts for both under and over predictions of rainfall frequency, and corrects the “nugget effect” in truncating empirical distributions. Monthly rainfall statistics for 26 years (1970–1995) were then extracted and used in the stochastic disaggregation (rainfall frequency, rainfall frequency + totals, totals).

4 Results and discussion

4.1 Precipitation downscaling

The advantages of bias correction and the linkage of bias corrected rainfall information with stochastic disaggregation to the improvements of downscaled GCM rainfall is shown in Table 2 based on different goodness-of-fit criteria: correlation coefficient, denoted by R (Pearson product-moment correlation coefficient); root mean square error denoted by RMSE; and mean bias error denoted by MBE. Considering all three goodness-of-fit criteria, it can be said that for the month of October the BC2-DisAG2-Freqn (see Table 1 for definitions of methods) methodology performed better for total monthly precipitation. When the downscaled precipitation was not considered for bias correction, higher RMSE, MBE and lower R were observed. Interestingly, when the probability of occurrence was observed, the BC1-DisAg2-Freqn-Total methodology performed better. However, the performance of rainfall intensity for the month of October was not properly captured in most of the methodology except for the BC2-DisAG2-Freqn method which showed an improvement based on R, even though the relative performance of RMSE and MBE did not improve.

Table 2 Seasonal rainfall statistics from different downscaling schemes

Based on the all three goodness-of-fit measures, the downscaled capability for the November total rainfall was found to be better for the BC2, BC2-DisAg2-Freqn-Total and BC2-DisAg2-Total methodologies. The performance of uncorrected downscaled rainfall could not be interpreted well using R, however based on RMSE and MBE its performance worsened. Therefore, it is important to consider all three performance measures in the selection of methodology to be considered. A similar observation was also made for uncorrected rainfall intensity for the November rainfall. For the month of December the total rainfall as well as the probability of occurrence, the overall BC2-DisAg2-Freqn methodology was found to be performing better for two statistical performance criteria, even though MBE was the lowest for other methodologies. However, the methodologies could not improve rainfall intensity in comparison to the total rainfall and probability of occurrence. Except for dry spell lengths, a majority of the biases in rainfall frequency, intensity and totals in the GCM rainfall were corrected by the deterministic transformation of the GCM rainfall. The corrections of under/over-prediction of rainfall frequency and the “nugget effect” of truncating empirical distributions (BC2) improved the corrections of rainfall frequency in the month of November, where rainfall occurrence was under-predicted by the GCM (not shown). Only when we linked the bias corrected GCM rainfall information (BC1 and BC2) with a stochastic weather generator that the majority of the dry spell length biases were reduced.

4.2 Mutual information (MI) between downscaled precipitation variables and crop yield

Crop yield depends on multiple precipitation variables which include amount, probability of occurrence, intensity and dry spells. These variables may be on a seasonal time scale as well as for individual months. Different downscaling methods performed differently for the evaluation of variables calculated using downscaled precipitation and exploring their relationship with crop yield prediction. Therefore, it is important to identify suitable downscaling techniques as well as important precipitation variables, which can be useful for seasonal crop yield prediction based on the GCM output.

The mutual information values were calculated between seasonal precipitation amounts along with constituent months obtained from different downscaling methods with respect to crop yield, as shown in Fig. 2a. The MI was calculated between maize yield and downscaled precipitation variables. A high value of MI score would indicate a strong dependence between two variables. It was observed that the downscaled seasonal precipitation amount obtained from BC2-DisAg2 frequency shared more information with the crop yield. Based on overall observation, seasonal precipitation amount shared more information, whereas the October precipitation amount shared the least information. However, the November and December month precipitation provided nearly similar type of information with crop yield. This is logical, because the month of October comprises most of the vegetative stage of the crop, and November and December are when flowering and milking stage occur, which are very sensitive to the water availability. Timing and severity of water stress during this period would drastically decrease crop yields. However, it can be said that increase in seasonal precipitation as well as the November and December month rainfall increases crop yields (Fig. 2). The next downscaling method which shared maximum information is BC1-DisAg2 frequency, which is again based on the correction of frequency but with the daily GCM rainfall corrected by the older BC (Table 1).

Fig. 2
figure 2

The mutual information content between seasonal and individual month’s a rainfall amount, b probability of occurrences, c rainfall intensity, and d dry spell for different models with the crop yield

The mutual information between precipitation variables and crop yield based on the probability of occurrence differed from the precipitation amount on the seasonal scale (Fig. 2b). However, the downscaled probability of occurrence using BC2-DisAg2 frequency for the month of December shared maximum information with crop yield. Overall December and November month’s probability of occurrence provided higher information in comparison to the seasonal and October time scales, suggesting that rainfall frequency information from seasonal time scale, as well as those from the less-sensitive period of crop growth, were not better predictors of crop yields in the study location as compared to the November and December rainfall frequencies. This is a critical insight, because in current climate prediction system, it is the seasonal values that are predicted. If a climate model is skillful to provide rainfall frequency information during the expected critical periods of crop growth, one may be able to predict better crop yields before the end of the growing season.

Similarly, the information content between precipitation intensity and crop yield differed from that of probability of occurrence as well as precipitation amount (Fig. 2c). Higher information content was observed between precipitation intensity of December downscaled based on the BC2-DisAg2-Total and interestingly it was least for December based on the BC2-DisAg2 frequency. Seasonal as well as the October precipitation intensity did not share much information with crop yield in comparison with months of December and November. The reason for rainfall intensities generated by stochastic disaggregation of rainfall frequency not giving better information could be due to the rainfall intensity model parameters not being constrained/adjusted on those months of interest. In other words, the rainfall intensities generated did not conform to the target rainfall intensities, but were likely representative of climatology (Hansen and Ines 2005). On the other hand, when we combined monthly rainfall total with rainfall frequency or using it alone in the stochastic disaggregation, we generated more appropriate rainfall intensities representing those months of interest, as shown by the increased MI in the months of November and December (Fig. 2c). It should be noted however that we did not use rainfall intensity as a predictor in the stochastic disaggregation. Usually, rainfall intensity was less (or not) predictable compared with other rainfall variables (Moron et al. 2007). Ines et al. (2011) found that the GCM model had lesser skill for predicting the October–November–December rainfall intensity (R = 0.37) than rainfall frequency and totals (R ≈ 0.70) in the study area.

Dry spell lengths during the anthesis stage (November 15–December 31), on the other hand, were found to give the highest information with respect to the predictability of crop yields (Fig. 2d). For all the downscaling schemes tested, mutual information was greater than 0.30, except for BC1 (see Table 1) when a threshold (continuous >3 days no rain) was imposed to define a dry spell. Even for the case of no bias correction, MI measured a strong relationship between yield variability and dry spell during this period. It should be noted however that the bias in the predicted yield was not measured by entropy. The yield bias diminished as we applied the hierarchy of downscaling schemes especially when we stochastically corrected the timing of wet/dry days within the time series.

5 Conclusions

The mutual information content between precipitation variables and crop yield plays an important role for seasonal crop prediction using the GCM output. This study derives a set of downscaling techniques to explore the mutual information between precipitation variables with crop yield. The following conclusions are drawn from this study:

  1. (i)

    The bias correction and stochastic disaggregation and their combination help improve the performance of downscaled GCM rainfall. It is worth noting that the performance of rainfall intensity could not be improved significantly; however the bias is greatly improved in comparison to the uncorrected precipitation. The performance of different methodologies varies among precipitation variables. Even though the performance measure based on the coefficient of correlation does not change much for the uncorrected rainfall, however, significant changes are observed in removing the bias.

  2. (ii)

    The total seasonal rainfall amount obtained from the BC2-DisAg2-Freqn methodology plays an important role for crop yield in comparison to the October rainfall.

  3. (iii)

    The probability of occurrence of rainfall during November and December have more importance than the October and seasonal time period in the study area. Overall the probability of occurrence obtained in December after stochastic disaggregation of BC2 corrected GCM rainfall frequency performs better.

  4. (iv)

    The rainfall intensity observed during the month of December has more contribution to the better crop yield prediction than other monthly and seasonal time periods. Based on all the methodology, the BC2-DisAg2-Total performs better for rainfall intensity of the December month.

  5. (v)

    The performance of mutual information based on the dry spell shows a larger variation among different models as well as based on threshold and no threshold conditions. Overall the BC 2 method performs better with no threshold conditions. When all rainfall variables are compared it can be observed that the performance of a dry spell is an important contributing factor for crop yield as higher information is shared based on the mutual information. Therefore, the challenge is to predict the dry spells on a seasonal lead time for better crop yield simulation.