Introduction

Grain yield goals have been used for decades to make important in-season agronomic decisions. How they are derived and defined has been highly debated. Dahnke et al. (1988) simply defined a yield goal as the “yield per acre you hope to grow.” Much of the debate is over the difference between a crop’s yield potential and maximum yield. Dahnke et al. (1988) stated that yield potential is the highest possible yield obtainable with ideal management, soil, and weather. Evans and Fischer (1999) defined yield potential as “the yield of a cultivar when grown in environments to which it is adapted, with nutrients and water non-limiting and with pests, diseases, weeds, lodging, and other stresses effectively controlled.” According to Raun et al. (2001), yield potential as defined by Dahnke et al. (1988) and Evans and Fischer (1999) would be defined as maximum yield because potential yield is associated with site-specific soil and climate conditions that can change annually. Other approaches for producers to determine yield goals have been to average the grain yield over the last 4–5 years and increase it by ten percent or rely on variety trials or county averages (Fanning 2012; Geisseler and Horwath 2013). Raun et al. (2001) noted that yield potential is known to change from one site-year to another because of the temporal and spatial variability. Refined definitions, adapted from Raun et al. (2001), for measured grain yield, potential grain yield, and maximum grain yield are described below.

Measured grain yield: The grain yield that is actually harvested in a given year for a given site.

Potential grain yield: The grain yield that is predicted for a given year and a given site, based upon the assumptions that the level of growth factors that are responsible for early crop development will be maintained.

Maximum grain yield: The grain yield that is achievable when all manageable growth factors are non-limiting, and the environment is ideal.

Numerous researchers have documented the significance soil moisture has on estimating final wheat grain yield. Black and Bauer (1988) stated winter wheat yield goal should be based upon the amount of plant available water stored in the soil profile in addition to the amount of potential precipitation throughout the growing season. Rehm and Schmitt (1989) reported that if soil moisture conditions at planting were favorable, they recommended adjusting the grain yield goal to 10–20 percent above the recent averages. Rehm and Schmitt (1989) also noted that if soil moisture is limiting, utilizing averages from previous crop years might not be best for estimating grain yield. Robinson et al. (1999) reviewed 30 plus years of climate and winter wheat grain yield data and concluded that nitrogen (N) fertilizer applications would be most profitable if soil moisture measurements indicated above average soil moisture at planting. Girma et al. (2007) evaluated soil moisture content from the previous growing season and its effect on final grain yield of the subsequent year. They observed that soil moisture content at 0.75 m was a good predictor of grain yield at one research location and soil moisture content at 0.60 m provided good prediction of grain yield at another site.

Historically, grain yield goals or a field’s yield potential have been utilized to make pre-plant N fertilizer recommendations. Recent advancements in sensor-based technologies along with crop and weather modeling have allowed for in-season N fertilizer adjustments based upon crop grain yield potential (Raun et al. 2001). Research conducted over the last three decades has advanced the capability of sensor-based methodologies to assist in making agronomic management decisions. Stone et al. (1996) and Solie et al. (1996) first observed that normalized difference vegetation index (NDVI) measurements of winter wheat at Feekes Physiological Growth Stages 4 and 5 (Large 1954) were able to reliably predict both N uptake and plant/crop biomass. Raun et al. (2001) reported that the sum of two post-dormancy NDVI measurements divided by the cumulative growing degree days (GDD) from the first to the second reading was an accurate predictor of final grain yield and could assist in adjusting in-season applications of N fertilizer. Using 30 site-years of winter wheat grain yield data, Raun et al. (2005) built upon their earlier work and determined that grain yield could then be predicted at any growth stage when the NDVI was divided by the cumulative number of days with a growing threshold temperature value of 4.4 °C. They reported that the non-linear relationship of in-season estimate of yield (INSEY) and grain yield was highly correlated and essentially provided an estimate of biomass produced per day.

Researchers have attempted to incorporate some form of soil moisture measurement into sensor based technologies to predict wheat grain yield. Girma et al. (2006) evaluated mid-season measurements in winter wheat of sensor derived NDVI and soil moisture, as well as leaf color, chlorophyll content, plant height, canopy temperature, tiller density, plant density, soil NH4–N, NO3–N, organic C, total N, pH, and N mineralization potential. They noted that mid-season NDVI, chlorophyll content, plant height, and total N uptake were good predictors of final winter wheat grain yield; however, they observed soil moisture to be associated with grain yield, but not a reliable predictor of final grain yield. Walsh et al. (2013) measured soil water content at sowing and utilized it along with mid-season NDVI values to predict grain yield in winter wheat. They evaluated soil moisture at three different depths (0.05, 0.25, 0.60 m). By combining the NDVI-based approach with 0.05 m soil moisture data at the time of planting and NDVI data collection at Feekes 5 growth stage they could more accurately predict wheat grain yield.

Crop growth, development, and subsequently grain yield of winter wheat are affected by temperature (Porter and Moot 1998). The concept of GDD, or accumulation of heat units, has been proven to more accurately describe and predict crop development and physiological stages of growth much better than the number of days since planting or time of year (McMaster and Wilhelm 1997). For wheat, extensive research has shown relatively small and consistent standard errors of the cardinal temperatures for many crop growth stages and processes (Porter and Gawith 1999). Other researchers have utilized the concept of biological days to document crop growth, in which there is not only optimal temperature for growth, but other factors such as photoperiod or lack of water stress and nutrient stress that are included (Hunt and Pararajasingham 1995; Soltani and Sinclair 2012).

Knowledge of water use or evapotranspiration (ET) by a winter wheat cropping system could assist in predicting final grain yield, in that a producer would know if the soil moisture content they have at a given time will be enough to carry the crop to maturity. Nix and Fitzpatrick (1969) evaluated how the ratio of plant available soil water to the potential evaporative demand of a growing wheat crop, referred to as a stress index (SI), affected final grain yield. They reported highly significant correlations between SI and final grain yield. Similar results of a strong relationship between a derived SI and grain yield were observed by Stephens et al. (1989), in that they concluded the lack of adequate water to maintain growth strongly impacts final grain yield.

The early February to June water usage by winter wheat grown in the Southern Great Plains of North America would be very important, as this is the time grain yield potentials would be utilized to make agronomic management decisions, such as in-season N fertilizer applications. Howell et al. (1997) measured ET of winter wheat and observed season-long ET values of 877 mm, with daily values of 3–4 mm per day, and values that rarely exceeded 10 mm per day, but did on days with high-sustained wind speeds. Liu et al. (2002) determined in wheat grown in northern China that after the over-wintering period daily ET values were about 1.2 mm per day and rapidly increased to approximately 4–6 mm per day during the jointing and booting stages. In the semi-arid portion of India, Singandhupe and Sethi (2005) reported daily ET values ranged from 2.6 to 9.6 mm per day between February and April.

Any advances in the ability of grain yield prediction models to estimate winter wheat grain yield in-season will benefit producers in making improved agronomic management decisions. Therefore, the objective of this study was to develop a model that incorporates climatic parameters, such as soil moisture, with NDVI measurements to increase the reliability of predicting wheat grain yield in-season. We hypothesize that the ability to predict grain yield in-season will improve when soil moisture measurements are incorporated into current methodologies for predicting grain yield in winter wheat.

Materials and methods

Site descriptions

Grain yield and other crop and soil parameters utilized to calibrate a model to determine grain yield potential were collected from three long-term continuous winter wheat soil fertility experiments at Stillwater (222), Lahoma (502), and Perkins (N&P), Oklahoma. These fertility trials were established in 1969, 1970, and 1996 for Stillwater (222), Lahoma (502), and Perkins (N&P), respectively. The soils at each experimental site are classified and represent soils utilized for wheat production in the South-central Great Plains region of Oklahoma, Kansas, and Texas and that encompass approximately 650,000 hectares of the geographic area (Soil Survey Staff 2012a, b). Grain yield and other crop and soil parameters employed to validate the developed model were collected from the three experiments described above as well as three additional sites: Hennessey (Reg), Lake Carl Blackwell (Reg), and Lake Carl Blackwell (Val). The additional sites were regionally based N fertilizer response trials, in which the same pertinent data was collected. A brief summary of the soils utilized for each experimental site is reported in Table 1.

Table 1 Soil information for experimental sites utilized for model development and validation utilized in this experiment

Plots that were analyzed were those that received no N fertilizer treatment throughout the year or received preplant N fertilizer treatments only with no mid-season N fertilizer applications. The N fertilizer treatments and rates for each trial location are listed in Table 2. The site-years and growth stages where grain yield potential parameters were collected to develop a grain yield potential estimate are listed in Table 3. Altogether, 22 site-years of data were collected to develop the calibration model for grain yield potential. Gaps in years for data collection were due to the crop not being taken to grain yield because of natural occurrences (drought, late freeze, hail, etc.). The site-years and growth stages where grain yield potential parameters were collected to validate the developed model are described in Table 4. Altogether, 11 site-years of data were collected to validate the developed model.

Table 2 Nitrogen fertilizer application treatments and plot dimensions for research plots utilized in the model calibration and validation of the study
Table 3 Years and Feekes growth stages that yield potential measurements were collected for three long-term trials utilized in developing a model for winter wheat yield potential
Table 4 Years and Feekes growth stages that yield potential measurements were collected for three long-term trials and three regional N fertilizer response trials utilized in validating a model for winter wheat yield potential

Current model for predicting yield

The current model utilized for predicting winter wheat grain yield potential was that described by Raun et al. (2005). The INSEY was calculated by dividing the NDVI by the cumulative number of GDD with a growing threshold value of 4.4 °C. A non-linear relationship was established between INSEY and final grain yield, and the equation from this relationship is thus used to predict grain yield.

Proposed model parameters

Normalized difference vegetation index (NDVI)

Spectral reflectance expressed as NDVI was measured using a Greenseeker™ (Trimble, Sunnyvale, CA, USA) ground-based, active, optical sensor. The NDVI was computed from red and near infrared reflectance values. The equation to calculate NDVI is listed below:

$$ {\text{NDVI }} = \, {\raise0.7ex\hbox{${\left( {{\text{NIR }} - {\text{ Red}}} \right)}$} \!\mathord{\left/ {\vphantom {{\left( {{\text{NIR }} - {\text{ Red}}} \right)} {\left( {{\text{NIR }} + {\text{ Red}}} \right)}}}\right.\kern-0pt} \!\lower0.7ex\hbox{${\left( {{\text{NIR }} + {\text{ Red}}} \right)}$}} $$

where: NIR and Red are the reflectance measurements in the near-infrared (780 nm) and red bands (660 nm), respectively. The NDVI values were collected at various times throughout the growing season at a height of 0.75–1.00 m over the center portion of the research plots with the ground-based optical sensor and the Feekes (Large 1954) growth stage was documented as well.

Days of potential growth (DPG)

This parameter was collected following the biological day concept described by Soltani and Sinclair (2012). We proposed that for considerable growth to occur in winter wheat there should be adequate temperature along with adequate soil water. Soil moisture data were downloaded for each experimental site from the nearest adjacent Oklahoma Mesonet climate-monitoring station (Oklahoma Mesonet 2015) for the pertinent time periods of data collection. Soil moisture measurements were recorded at depths of 0.05, 0.25, and 0.60 m below the soil surface. Soil moisture data were collected using a Campbell Scientific 229-L heat dissipation sensor (Campbell Scientific, Inc., Logan, UT, USA). The sensor measures a change in temperature after a pulse of heat is introduced to the system (Basara and Crawford 2000). The magnitude in change can then be calibrated to an estimate of soil moisture content. The sensor’s response can be normalized to a fractional water index (FWI), which is a unitless value that ranges from 0.00 for dry soils to 1.00 for wet/saturated soils (Illston et al. 2008). According to Illston et al. (2008), utilizing the FWI is more ideal when operating on a larger scale and may not be limited by varying soil texture across research sites. The DPG were counted as the number of days that not only met the criteria for a GDD described in the current method for predicting grain yield, but also had a weighted average FWI across the 0.80 m soil profile of 0.30 or more. The 0.30 FWI threshold is described by Soltani and Sinclair (2012) as the level at which growth could potentially be inhibited for winter wheat.

Stress index (SI)

Soil moisture data were downloaded from the adjacent Oklahoma Mesonet climate-monitoring station (Oklahoma Mesonet 2015) for each experimental site for the time periods of data collection. To determine the amount of plant available water (PAW) at the time of NDVI data collection the weighted 0.80 m FWI values were converted to volumetric water content. Gravimetric water content values and soil bulk density values were obtained from the USDA-NRCS SSURGO tabular dataset for each respective experimental site (Soil Survey Staff 2012b). It was assumed that a FWI of 0.00 was permanent wilting point and a FWI of 1.00 was close to saturated conditions. One-third bar water or field capacity was then determined by using the soil physical property data and Eqs. 4, 5, and 6 from Illston et al. (2008). Equations were created to predict PAW in the 0.80 m profile from the weighted FWI values. The SI was then determined similarly to Nix and Fitzpatrick (1969) by dividing the amount of PAW by the amount of water needed to maintain yield from the date of sensing to an assumed harvest date of June 10. Water usage by the crop was assumed to be 5 mm per day. This value was chosen because it is likely the highest average potential evapotraspiration (PET) loss in the Southern Great Plains from February 1 to June 10 according to the literature and Oklahoma Mesonet predicted PET losses for the three model calibration sites (Table 5). To keep from skewing the data for sensing times late in the growing season, SI values were not allowed to exceed 1.00.

Table 5 Average (avg), minimum (min), and max (max) potential evapotranspiration amounts (mm) from February 1 to June 10 for three long-term soil fertility locations

Statistical analysis

Weather data

Weather data were downloaded from the Oklahoma Mesonet and imported into Microsoft Access databases. Structured query language queries were developed to retrieve and summarize weather data to create desired model parameter variables.

Model development

Multiple linear regression techniques were utilized to develop a model for predicting grain yield from the three proposed parameters (NDVI, DPG, SI). Step-wise regression was employed to determine which main and interactive effects of the proposed parameters had a significant impact on final grain yield. The maximized adjusted R2 values were used to determine the appropriate regression equation parameters that best estimated final grain yield. All analyses were conducted using SAS statistical software (SAS Institute, Inc 2011).

To determine if surface soil texture was to have any effect on yield potential, three different models were developed. One model was created from all three calibration sites with no regard to surface texture. The other two models developed were the loamy surface textured model (Stillwater, OK and Lahoma, OK) and the coarse surface textured model (Perkins, OK).

Model validation

The statistical model developed was validated utilizing datasets from the 2011–2012 and 2012–2013 growing seasons. From this point forward, the growing season will be described by the year of grain harvest. Three of the experimental sites used were the long-term soil fertility trials used in the development of the model. Three additional experimental sites from regional N fertilizer response trials were also used. None of the data sets or site-years used in model validation were part of the model development.

The INSEY values from the current methodology for predicting grain yield and the proposed models described above were evaluated for their effectiveness by regressing the predicted values against the actual grain yield values. Coefficient of determination (R2) values and root mean square error (RMSE) values were then used to determine which methods performed best for predicting grain yield. All analyses were conducted using SAS statistical software (SAS Institute, Inc 2011).

Results

Step-wise regression techniques revealed that the main effect and all interactive effects of the proposed model parameters had significant effects on final grain yield when data from all three calibration sites were used (Table 6). The same was true when only the data for the two loamy sites were used. For the coarse site model development, DPG, SI, and the interaction of DPG and SI did not have a significant effect on grain yield, however including them in the model maximized the adjusted R2 (Table 6). The parameters described in Table 6 were then employed to evaluate the effectiveness of each model’s ability to predict yield for each of the validation sites included in this trial.

Table 6 Model parameter estimates for estimating winter wheat grain yield

In 2012, the Stillwater, OK (222) site did not display a significant difference in model performance between the current INSEY and the proposed INSEY models for the Feekes growth stages 4, 5, 6. The amount of variation accounted for improved in both proposed models later in the growing season when compared to the current INSEY (Table 7). When the ability to predict grain yield was evaluated across all growth stages in 2012, the proposed INSEY model that was developed regardless of soil type performed the best with an R2 of 0.31 compared to the R2 values of 0.20 and 0.20 for the current INSEY model and the model developed for loamy textured soils, respectively. In 2013, fall moisture at sowing was negligible and the first stand failed. The wheat was replanted in late November and marginal growth occurred until later in the following spring. One sensor reading was recorded at Feekes 7, however, the ability to predict grain yield was good for the three different models, but no model was significantly superior (Table 7). When the Stillwater, OK (222) plots were pooled across growth stages and growing season, the proposed INSEY model developed from all validation sites and that utilized soil moisture performed better than the current INSEY model and the model developed for loamy textured sites. The RMSE values were not significantly different from one another within each growth stage for the three models evaluated (Table 8). A trend of decreasing RMSE values was observed as the growth stage increased.

Table 7 Coefficient of determination values (R2) for current and proposed models for predicting final winter wheat grain yield by Feekes (FK) growth stage
Table 8 Root mean square error values for current and proposed models for predicting final winter wheat grain yield by Feekes (FK) growth stage

For Lahoma, OK (502) in 2012, the ability of the three models to predict grain yield tended to improve later into the growing season (Table 7). At Feekes 4 there was no difference between the three model’s ability to predict grain yield. However, as the growing season progressed the two proposed INSEY models that used soil moisture parameters outperformed the current INSEY model. This trend was also observed when the plots were analyzed across all growth stages for 2012. In 2013, fall moisture was negligible, so very little growth occurred until later in the spring when the area received significant rainfall. With low biomass accumulation at Feekes 4 and 5, the ability to predict grain yield was poor. After rain had occurred and vegetative growth resumed, yield prediction for the three models improved at the Feekes 7 growth stage. As in 2012, the two proposed models utilizing soil moisture data outperformed the current INSEY model in the 2013 growing season. When grain yield prediction was analyzed across growth stages in 2013, the poor prediction values at Feekes 4 and 5 seemed to dictate an overall poor performance of predicting grain yield for the 2013 growing season. When the ability to predict grain yield was analyzed across all growth stages and growing seasons, again the two proposed INSEY models that utilized soil moisture outperformed the current INSEY model. The RMSE values for the three models’ regression of the predicted versus actual grain yield decreased throughout the growing season (Table 8). Very little difference was observed in the values between the different models except at the Feekes 5, 7, and 10 growth stages in 2012, and there was a slight improvement for the two models utilizing soil moisture compared to the current INSEY model (Table 8).

The ability of the three models to predict grain yield improved as the growing season progressed in 2012 at the Hennessey, OK (Reg) site (Table 7). Negligible, but lower coefficient of determination values were observed between the current INSEY model and the two proposed INSEY models for all growth stages in 2012. This was also observed when the 2012 data were analyzed across all growth stages; however, the proposed INSEY model developed from all calibration sites had improved grain yield predictions as compared to the proposed INSEY model that was developed for loamy textured soils. In 2013, it was impossible to predict yield at Feekes 3 due to minimal available water for vegetative growth. After the site received significant rainfall, the trend of improving yield prediction as the growing season progressed was observed. At the Feekes 4, 5, and 6 growth stages, no differences were observed in the three models’ performances; however, at Feekes 7 and 10 the models utilizing soil moisture gave slight improvements in predicting yield (Table 7). When analyzed across all growth stages for the 2013 growing season, the same trend as 2012 for the proposed INSEY model developed from all sites outperformed the other two models. This was then again observed when the data were analyzed across all growth stages and both growing seasons. No differences were observed in the RMSE values for each model, but again the values decreased as the growing season progressed. Of all the site-years analyzed, the values were the highest for the 2012 growing season. This could likely be explained by significantly higher average grain yields harvested at this site for that growing season.

In 2012 at the Lake Carl Blackwell, OK (Reg) site, slight improvements in the ability to predict grain yield were observed for the proposed INSEY models that included soil moisture parameters (Table 7). When the data were analyzed across all growth stages in 2012, the proposed INSEY model that was developed from all sites outperformed the other two models. For the 2013 growing season, very little differences were observed in the three models’ abilities to predict grain yield until they were analyzed across all growth stages. Unlike the other previously described site-years, there was a decrease in model performance between Feekes 7 and 9. This may be because of a potentially damaging freeze that occurred between these two growth stages. When analyzed across all growth stages, the current INSEY model outperformed the other two proposed INSEY models. This same trend, though not as prominent, was observed when the data were analyzed across growth stages and both growing seasons. The RMSE values for the 2012 growing season decreased as the season progressed and there were not any differences between the values for each model (Table 8). In 2013, the RMSE values were not different amongst models and there was no observable trend (Table 8).

When all the validation sites classified as loamy were analyzed across site and growing season, no distinct patterns between model performance and growth stage were observed (Fig. 1). When just observing the Feekes 4 through 10 growth stages, the ability to predict grain yield did tend to increase throughout the growing season (Fig. 1) with the exception of the Feekes 9 growth stage which can likely be explained by the late season freeze at Lake Carl Blackwell, OK (Reg) in 2013. When all loamy sites were analyzed across all growth stages, the two proposed INSEY models outperformed the current INSEY model with the proposed INSEY model developed from all sites performing the best with a coefficient of determination value of 0.55 (Fig. 1).

Fig. 1
figure 1

Validation sites with a loamy surface soil texture coefficient of determination (R 2) values for the current model of determining winter wheat in-season estimation of yield (INSEY), and proposed new models that incorporate soil moisture data into yield prediction. Two proposed new models are displayed, one that predicts yield regardless of soil type and one that predicts yield for soils with a loamy textured surface. Predictions are grouped together by Feekes (FK) growth stage across the 2012 and 2013 growing seasons

Days with above average temperatures prior to and during grain fill at Perkins, OK (N&P) in 2012 likely led to the very poor performance of the three models at predicting grain yield, with all coefficient of determination values being <0.20 (Table 7). In 2013, grain yield prediction values decreased from Feekes 4 to 5 and 6, but increased for Feekes 7 and 10. The current INSEY model had slightly higher prediction capabilities at Feekes 3 and 4 compared to the two proposed models that utilize soil moisture parameters. The opposite trend was observed for Feekes 5 and 6, but there was no difference between the three models at Feekes 7 and 10. When the data were analyzed across all growth stages in 2013, the proposed INSEY model that was developed from all soil types outperformed the other two models. When the data were analyzed across all growth stages and both growing seasons, no discernable best model was revealed. This again is likely due to grain yields that were marred by climatic conditions post sensing in 2012. No observable differences were detected in the RMSE values between models and growth stages for the 2012 growing season. In 2013, the RMSE tended to decrease throughout the growing season, much like the previously described loamy textured sites (Table 8).

An additional N fertilizer response trial was added at Lake Carl Blackwell, OK (Val) on a coarse textured soil in 2013. The three models failed to predict yield early in the growing season when growth was limited due to lack of water. As precipitation increased, vegetative growth resumed and differences were observed at the remaining growth stages (Table 7). At Feekes 4, 5, 7, and 10 the proposed INSEY models that incorporated soil moisture parameters performed slightly better than the current INSEY model (Table 7). The ability to predict yield did increase from Feekes 4 to 7, but decreased from 7 to 10. This decrease is likely due to an unseasonably late freeze. When data were analyzed across all growth stages, extremely poor grain yield predictions were observed. Though the current INSEY performed the best of the three models, all the coefficient of determination values were less than 0.15. Like many of the previously described validation sites, the RMSE values decreased throughout the growing season from Feekes 3 to 7, but increased from 7 to 10 (Table 8).

No observable trends could be reported when the coarse textured validation sites were aggregated across site-years and analyzed (Fig. 2). This could be because of a lack of site-years and the 2012 data collected from the Perkins, OK (N&P) research site. One item to note would be when the data were analyzed across site-years and growth stages, the proposed INSEY model that was developed by all the calibration sites slightly outperformed the current INSEY model. The model that was developed for coarse textured soils performed poorly with a coefficient of determination value <0.10 (Fig. 2).

Fig. 2
figure 2

Validation sites with a coarse surface soil texture coefficient of determination (R 2) values for the current model of determining winter wheat in-season estimation of yield (INSEY), and proposed new models that incorporate soil moisture data into yield prediction. Two proposed new models are displayed, one that predicts yield regardless of soil type and one that predicts yield for soils with a coarse textured surface. Predictions are grouped together by Feekes (FK) growth stage across the 2012 and 2013 growing seasons

When the data were grouped across all site-years and analyzed at all growth stages, differences were observed between the current INSEY and proposed INSEY developed using soil moisture data. At growth stages Feekes 3 and 4, the current INSEY outperformed the proposed INSEY, but the proposed INSEY outperformed the current INSEY at Feekes 5, 6, 8, and 10. No significant difference was observed between the two models at Feekes 7 and 9. (Figure 3). When the data were aggregated across all site-years and growth stages, the proposed INSEY model outperformed the current INSEY model with a coefficient of determination value of 0.49 compared to 0.40 (Fig. 4). The RMSE values didn’t differ between the two models. It should also be noted that the linear regression equation for the proposed INSEY did not have a slope significantly different from one and an intercept different from zero. The regression equation for the current INSEY model had a slope significantly different than one, but the intercept was not significantly different from zero.

Fig. 3
figure 3

Coefficient of determination (R 2) values for the current model of determining winter wheat in-season estimation of yield (INSEY), and a proposed new model that incorporate soil moisture data into yield prediction. Predictions are grouped together by Feekes (FK) growth stage across all validation sites for the 2012 and 2013 growing seasons

Fig. 4
figure 4

Linear relationships between predicted winter wheat in-season estimations of yield based upon soil moisture parameters (A) or the current model (B) used to predict actual grain yield. Data presented is from all validation sites across all growth stages. Dashed line represents one standard deviation above the actual yield

Discussion

Proper validation of developed prediction models should be conducted to determine the legitimacy of a model to accurately predict grain yield. Little to no work similar to that described in this paper, with an emphasis being placed on remotely sensed NDVI, has been published that truly validates a developed grain yield prediction model that could be compared to actual grain yield. The current model for predicting INSEY (Raun et al. 2005), described the goodness of fit of the data used to develop the exponential relationship. The grain yield prediction model was only evaluated on its ability to improve N fertilizer recommendations and was not evaluated with an independent data set to predict grain yield. Other researchers have employed multi-parameter models that utilize remotely sensed NDVI from ground based optical sensors and climate parameters to try and predict grain yield. Both Girma et al. (2006) and Walsh et al. (2013) developed models and made suggestions of what type of parameters could be reliable predictors of grain yield, but did not validate their conclusions with an independent dataset.

The common trend of the ability to accurately predict grain yield increasing with the progression of the growing season seems logical. As one moves later in the growing season the probability of unexpected climatological events, such as freeze or extreme heat, between sensing and grain harvest decreases. The decrease in grain yield and thus a decrease in the ability to predict yield for the Perkins (N&P) validation site in 2012 is a prime example of what can happen even when there is just a short period of time in which temperatures exceed 31–35 °C during the grain filling process. Wheat grain yields can be significantly impacted even when as little as five percent of the grain filling process occurs under excess heat (Stone and Nicolas 1994; Wheeler et al. 1996; Ferris et al. 1998). The decrease in the models’ ability to predict yield for the Feekes 10 growth stages for the Lake Carl Blackwell (Reg) and Lake Carl Blackwell (Val) validation sites in 2013 may be attributed to a yield damaging late freeze. The amount of damage to grain yield would depend on the stage of the wheat crop at the time of the freeze (Thakur et al. 2010). Because dry matter accumulation and partitioning during grain fill can be strongly influenced by N fertilizer nutrition (Demotes-Mainard et al. 1999), the fact that the data utilized in this experiment came from N response trials would explain why plots had differing degrees of damage due to the freeze. This would also explain the increase in RMSE values later in the growing season for the 2013 Lake Carl Blackwell sites.

Based on the results, there is no benefit to having a different model based on soil type. Though no similar winter wheat research exists, Sharma and Franzen (2013) reported contrary results while investigating the ability to predict maize grain yield with optical sensors and other maize plant measurements. They developed two different grain yield prediction curves for clayey and medium textured soils. The differences in the curves were due to soils potentially being water-logged and having different responses to N fertilization. Excess water, typically is not a problem for winter wheat grown in the region of the calibration and validation trials. Likely, the fact that soil physical properties were incorporated into the SI model parameter for the proposed INSEY model would negate the need for different grain yield prediction models based on soil type.

Conclusions

The ability of grain yield prediction models to estimate winter wheat grain yield in-season will aid producers in making better agronomic management decisions. Regardless of the model, the ability and the accuracy at which grain yield could be predicted increased as the growing season progressed, unless rare unseasonable climatic events occurred after NDVI data collection. No added benefit was observed for creating two separate models based on soil type. One universal model developed with soil moisture data was sufficient for predicting grain yield. When comparing the ability to estimate grain yield of proposed INSEY models that utilize soil moisture data with the current INSEY model, the proposed INSEY models typically outperformed the current INSEY model at most validation sites. When data were combined over sites the current INSEY model did perform better at earlier growth stages (Feekes 3 and 4), but the opposite was observed for the mid to late growth stages (Feekes 5–10). Lastly, the proposed INSEY model outperformed the current INSEY model, with a coefficient of determination value of 0.49 compared to 0.42, when data were analyzed across all growth stages, thus providing a model in which producers would not be required to stage wheat growth in order to determine an appropriate model for grain yield prediction.