Abstract
Annual crop yield fluctuation due to natural and anthropogenic factors is a major concern of the Ethiopian Government. For an immediate response to drastically changing crop yields and resulting harvest failures and to enhance the country’s food security in general, extensive area crop growth monitoring and early prediction of production are needed. In this study, we developed an early maize (Zea mays) yield forecasting model using Sentinel-2 MSI (Multispectral Instrument); the study was carried out in the Abaya district of the Oromia Regional State in Ethiopia. The model consists of the following components: (1) Sentinel-2 image-based cropland identification for different crop development stages, (2) extraction of time series Sentinel-2 vegetation indices for crop growth monitoring, and (3) a simple linear stepwise forward regression approach for yield prediction. We tested different spectral indices regarding their performance in describing the crop development and eventually predicting the expected yield. The result showed that (1) the linear Red-Edge Enhanced Vegetation Index (Red-Edge EVI), (2) the combination of the Enhanced Vegetation Index (EVI) and the Green Vegetation Index (GVI), (3) the combination of the Red-Edge EVI and Soil Adjusted Vegetation Index (SAVI), and (4) the combination of the Normalized Difference Vegetation Index (NDVI), Red-Edge EVI and SAVI offer the best predictive model about 2 months before harvesting with the highest coefficients of determination (R2) of 0.73, 0.80, 0.84, and 0.88, respectively. The correlation for the GVI was generally lowest compared to established models, and no evidence of a peak correlation for NDVI was observed. Our approach showed a high accuracy of detecting maize fields, detecting crop phenology, and early predicting of grain yield for the study year 2018. Our simple model may generate early warning information, which may support in-time decision-making regarding food supply when critical yield fluctuations are to be expected.
Zusammenfassung
Jährliche Schwankungen der Ernteerträge aufgrund natürlicher und anthropogener Faktoren sind ein Hauptanliegen der äthiopischen Regierung. Für eine sofortige Reaktion auf sich drastisch verändernde Ernteerträge und daraus resultierende Ernteausfälle sowie zur Verbesserung der Ernährungssicherheit des Landes im Allgemeinen sind eine großflächige Überwachung des Pflanzenwachstums und eine frühzeitige Vorhersage der Produktion erforderlich. In dieser Studie haben wir ein Modell zur frühzeitigen Vorhersage des Maisertrags (Zea mays) unter Verwendung des Sentinel-2 MSI (Multispectral Instrument) entwickelt; die Studie wurde im Bezirk Abaya im Regionalstaat Oromia in Äthiopien durchgeführt. Das Modell besteht aus den folgenden Komponenten: (1) Sentinel-2-Bild-basierte Ackerland-Identifikation für verschiedene Pflanzenentwicklungsstadien, (2) Extraktion von Zeitreihen von Sentinel-2-Vegetationsindizes (VIs) für die Überwachung des Pflanzenwachstums und (3) ein einfacher linearer schrittweiser Vorwärtsregressionsansatz für die Ertragsvorhersage. Wir testeten verschiedene Spektralindizes hinsichtlich ihrer Leistung bei der Beschreibung der Pflanzenentwicklung und schließlich der Vorhersage des erwarteten Ertrags. Das Ergebnis zeigte, dass: (i) der lineare Red-Edge Enhanced Vegetation Index (Red-Edge EVI), (ii) die Kombination aus dem Enhanced Vegetation Index (EVI) und dem Green Vegetation Index (GVI), (iii) die Kombination aus dem Red-Edge EVI und dem Soil Adjusted Vegetation Index (SAVI), und (iv) die Kombination aus dem Normalized Difference Vegetation Index (NDVI), Red-Edge EVI und SAVI das beste Vorhersagemodell etwa zwei Monate vor der Ernte mit den höchsten Bestimmtheitsmaßen (R2) von 0,73, 0,80, 0,84 bzw. 0,88, bieten. Die Korrelation für den GVI war im Vergleich zu den etablierten Modellen generell am niedrigsten, und für den NDVI wurde kein Hinweis auf eine Spitzenkorrelation beobachtet. Unser Ansatz zeigte eine hohe Genauigkeit bei der Erkennung von Maisfeldern, der Erkennung der Pflanzenphänologie und der frühzeitigen Vorhersage des Kornertrags für das Studienjahr 2018. Unser einfaches Modell kann Frühwarninformationen generieren, die eine rechtzeitige Entscheidungsfindung bezüglich der Nahrungsmittelversorgung unterstützen können, wenn kritische Ertragsschwankungen zu erwarten sind.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
Maize (Zea mays) is a crucial staple crop in Ethiopia. Maize is mainly grown as a rain-fed agricultural system without irrigation by smallholder private farmers. The production has been fluctuating from year to year due to rainfall and temperature variability, variation in soil fertility, plant diseases, and changing types of crop management (MOA 2015). Currently, yield fluctuations remain one of the significant challenges for the country's food security strategy (Cochrane and Bekele 2018; FAO 2015).
The agriculture sector of Ethiopia contributes more than 80% of the Gross Domestic Product (GDP) of the country (MOA 2019). Cereal crops are the primary food source where maize plays the most critical role in food security. Timely yield estimation for this significant crop is essential for supporting efficient agricultural decision-making for timely planning of food import in case of shortage or, optionally, to export in case of surplus production. Traditionally, in Ethiopia, early crop yield estimation is done from the observed data, such as the farmers' request for agricultural inputs and the yearly rainfall and distribution pattern. This technique, however, turned out to be biased, costly, and associated with significant uncertainties leading to poor estimates of crop area and predictions of crop yield (Balaghi et al. 2008; Ramirez-Villegas and Challinor 2012). Such information is also released late or even after the end of the season, which is too late to take appropriate actions to avert hunger. Locally specific models for a reliable annual yield prediction would significantly improve well-informed and timely decision-making in food provision.
The development of earth observation technology suggests that remotely sensed data be straightforwardly employed for agricultural crop yield forecasting. Remote sensing data offers at modest cost spatially explicit, large area and timely monitoring of the earth’s surface, including crop fields and development (Liu and Kogan 2002). To identify plants and their cover, discriminate individual plants, vegetation indices (VIs) are used to measure vegetation greenness. VIs are computed as a combination of different spectral bands, usually including the red and near infrared. VIs were also found to have a solid link to plant physiology and crop productivity (Meng et al. 2014; Noureldin et al. 2013; Prabhakara et al. 2015; Sakamoto et al. 2014). The relationship of various satellite image derived VIs with biophysical parameters of plants has been investigated, such as in Ban et al. (2016), Bussay et al. (2015), Chipanshi et al. (2015), Darvishzadeh et al. (2009), Prabhakara et al. (2015), Sharma et al. (2015) and Smethurst et al. (2017). Several researches have also been conducted on the relationships between various crops including maize, and vegetation indices (Bolton and Friedl 2013; Chivasa et al. 2017; da Silva et al. 2020; Maresma et al. 2016; Peng and Gitelson 2011; Zhou et al. 2017). These studies have established a positive linear correlation between VIs and crop biophysical variables, such as leaf area index, leaf traits, biomass, and grain yields.
The spectral and temporal changes of satellite images and their response to the biophysical characteristics of crop plants play a significant role when establishing an effective pre-harvest estimation of crop productivity. Multispectral satellite imageries have been extensively used for the spectral reflectance intensity in different spectral bands of visible to mid-infrared regions of the electromagnetic spectrum (Kamal and Bhatia 2010). There are many classes of spectral indices which have been subjected in substantial researches; such as Normalized Difference Vegetation Index (NDVI) (Rouse et al. 1973), Soil Adjusted Vegetation Index (SAVI) (Baret et al. 1989), Enhanced Vegetation Index (EVI) (Huete et al. 2002), Green Vegetation Index (GVI) (Panda et al. 2010), and Normalized Difference Flood Index (NDFI) (Boschetti et al. 2014). New group of VIs has also been developed based on the shape and position of the spectral reflectance curve. It comprises the Red-Edge part of the electromagnetic reflectance spectrum, such as the Red-Edge Normalized Difference Vegetation Index (Red-Edge NDVI) and Red-Edge Enhanced Vegetation Index (Red-Edge EVI) (Mutanga and Skidmore 2004). These VIs have a steep slope between the lower parts of the visible region and the higher reflectance of the near-infrared region (0.67–0.79 µm). The red-edge inflection point depends on the chlorophyll amount detected by the sensor and is strongly correlated with plant leaves chlorophyll concentration, offering a sensitive indicator of vegetation stresses and biomass content (Rossini et al. 2007).
The main aim of this study was to improve decision-making regarding food provision from agriculture and thus contribute to better food security in Ethiopia. The technical goals are (1) to validate the potential of Sentinel-2 MSI to identify and map maize fields; (2) to develop a remote sensing-based technique to monitor maize phenology; and (3) to establish models to predict the expected maize yield.
2 Materials and Methods
2.1 Study Site
Our study area is in the district of Abaya Woreda, Oromia Regional State, Ethiopia. It is situated in the Borena Zone at the southern part of the capital city of Addis Ababa, between 6° 17′ 14″ and 6° 28′ 4″ N, and 38° 7′ 54″ and 38° 19′ 19″ E (Fig. 1). The agricultural fields in the district where our investigation piloted is about 30,000 ha and is mainly cultivated by smallholder farmers. The altitude ranges from 1100 to 1900 m a. s. l. The area has a subtropical climate with an average annual rainfall of 1500 mm (mainly between beginning of April and October), and the monthly mean temperature ranges from 18 to 25 °C. According to UNDP (2000), the land comprises 41% of arable land (28.7% is under annual crops), 35% pasture, 15% forest, and the remaining 9% is swampy and degraded land. Four major crops are cultivated: maize (Zea mays), teff (Eragrostis tef), barley (Hordeum vulgare L.), and haricot beans (Phaseolus vulgaris L.), and there is also some cultivation of Sorghum (Sorghum bicolor) (CSA 2018).
2.2 Field Data
The maize in our study area was sown at the beginning of April and harvested in October of 2018, which is the suitable cropping season of the study site. Principally, the local variety of ‘Pawuner’ was cultivated. In situ crop information of phenological or growth stages and transition dates, viz, sowing and establishment, vegetative growth, tasselling and silking, yield formation and maturity, harvesting and yield information, were noted regularly during the field observation. We used random sampling with a sample size n = 250 for training data collection.
The maize reference data required for phenological or growth stages monitoring and yield modelling were collected in a sample of n = 31 square field plots of 100 m2 (10*10 m) whose position was recorded by a global navigation satellite system (GNSS) with the accuracy of ± 3 m. This plot size represented one single Sentinel-2 image pixel.
2.3 Satellite Data
Time series of Sentinel-2A MSI for the 2018 maize growing season was used for classification and vegetation indices (VIs) extraction. The satellite image is part of ESA´s Sentinel mission for the monitoring of the Earth´s surface. It has a high revisit time of 10 days at the equator with one satellite resulting in 5 days from the two satellites constellation under cloud-free conditions in 2–3 days at mid-altitude (ESA 2016).
The images were originally published in the Copernicus Open Access Hub with Top of Atmosphere (ToA) reflectance, and we obtained it as LevelC. Images with a cloud cover surpassing 20% were discarded. Pre-processing, such as radiometric calibration, geometric and atmospheric corrections were completed in Google Earth Engine. Of 13 spectral bands of Sentinel-2 MSI ranging between 0.443 and 2.190 µm of wavelength and 10–60 m spatial resolution, we used seven spectral reflectance bands for maize field classification, phenological monitoring, and model prediction: blue (0.458–0.523 µm), green (0.543–0.578 µm), red (0. 650–0. 680 µm), Red-Edge (0.773–0.793 µm), near infrared (NIR) (0.785–0.900 µm), and short-wave infrared (SWIR) (2.10–2.28 µm). Blue, green, red, and NIR spectral bands have a 10 m spatial resolution, while Red-Edge and SWIR bands originate with a pixel size of 20 m and were later resampled to 10 m of pixel size following Park and Schowengerdt (1983) of nearest neighbour resampling approach.
2.4 Methods
The approach of this study comprises three main steps: (1) classification and mapping of maize crop fields; (2) development of a remote sensing-based crop phenological monitoring technique; and (3) building yield prediction models. Detailed activities implemented in each step are discussed in the following sections, and the overall procedural workflow of the study is illustrated in Fig. 2.
2.4.1 Processing of Vegetation Indices (VIs)
A composite of Sentinel-2 scene executed from each phenological stage was used to derive Vegetation Indices (VIs). Seven VIs were applied as remote sensing-based phenology and yield predictors: Normalized Difference Vegetation Index (NDVI), Enhanced Vegetation Index (EVI), Green Vegetation Index (GVI), Soil Adjusted Vegetation Index (SAVI), Normalized Difference Flood Index (NDFI), Red-Edge Normalized Difference Vegetation Index (Red-Edge NDVI) and Red-Edge Enhanced Vegetation Index (Red-Edge EVI). Calculations of spectral indices were executed according to the equations in Table 1.
2.4.2 Classification and Validation of Maize Fields
Composite scenes of phenologically adjusted images were used for classification. The composite was executed based on averaging the image pixel information (Running et al. 1995), acquired in different phenological stages. Due to the study site's high cloud cover effects in some months (especially in July and August), few scenes (about four images of composite per month) were considered only. We used a supervised random forest (RF) classifier (Breiman 2001) to discriminate maize fields (with package “ee.Classifier.smileRandomForest” in the Google Earth Engine). The RF classifier was chosen due to its lack of overfitting, its thrifty for user-defined features, low sensitivity to the number of input parameters, minimize correlation between classifiers, low computational demands, high processing speed and its ability to reduce noise (Belgiu and Drăguţ 2016; Gislason et al. 2006; Lambert et al. 2018). From a sample of n = 250 ground-truthing plots, a random subset of 70% of these field data was used for model building and the remaining 30% for validation. Randomly distributed reference training polygons were initially delineated manually in ArcMap for RF modeling based on the GPS location recorded during field observation. These reference polygons reflect the spectral properties of the maize crop and other land cover classes. The number of trees was set to 500 following the recommendation by Rodriguez-Galiano et al. (2012). A number of classifications were executed at different phenological stages to determine the best phenological transitional dates for accurate maize field identification. The classification accuracy was evaluated in terms of producer accuracy, user accuracy, and overall accuracy along with the standard techniques (Congalton 1991).
2.4.3 Monitoring of Maize Phenology Development
Crop phenology is the change in the growing phases of plants (Ruml and Vulic 2005). Maize phenologies and transition dates were regularly recorded in the field once a week (for n = 31 maize sample plots). Remote sensing-based spectral reflectance patterns of maize throughout the cropping season were then compared against field observed phenologies. We applied NDVI, EVI, GVI, SAVI, Red-Edge NDVI, and Red-Edge EVI for phenological information, while NDFI was used as an agronomic water level and flooding indicator. The VIs were obtained based on a total of 25 multi-temporal images distributed throughout the maize development stages.
2.4.4 Yield Predictive Model and Production Estimation
The phenologically adjusted values of multi-temporal VIs corresponding to the field sampling plots were weighted by an average fraction of each pixel. The values were subsequently used as predictor variables for yield models. For single VI-based predictive model tests, linear, exponential, power, logarithmic, and polynomial mathematical functions were analyzed. Predictive models of multiple regressions were executed through a stepwise forward regression approach using “stats” and “ggplot2” of statistical packages from RStudio (RStudio Team 2018). Accordingly, phenologically adjusted multi-models were established. The next step was to optimize the phenological periods to predict final grain yields in a timely and more accurate way. The model optimizations were evaluated based on the computed coefficient of determination (R2), root mean square error (RMSE), and bias (Eqs. 1–3). The R2 and RMSE values served as a model performance indicator. The VIs that provides peak accuracy when regressed with observed yield can then be selected as the best predictor of the final grain yield estimates.
where yi—observed value of maize yield; \(\hat{y}_{i}\)—predicted value of maize yield; n—number of observations.
3 Results and Discussion
3.1 Discriminating Maize Fields
The overall accuracy (OA) results of maize discrimination were varied with different multi-date classifications subsequent to phenological phases. At the beginning of the developmental stages (between April and July), the OA of classifications was relatively low (ranging from 55 to 67%). This confirms that it is challenging to discriminate maize crops at these growth stages due to the spectral mixes of several plants. For instance, high weed infestations, which were not yet entirely removed at these stages, contribute to high spectral confusion. A high weed infestation during the cropping season leads to substantial omission error, which is also revealed from other studies (Eddy et al. 2014; Lambert et al. 2018).
In our study, classification accuracy significantly improved during tasselling and silking phenologies, typically between 80 and 110 days after sowing. During these stages, the OA of maize classification reached about 85%. The maize fields were well-weeded during these phenological stages and enabled the maize to have a distinct spectral signature and low commission errors. In addition, these periods offer a complete stage of maize development, while other cereal crops, such as bean (Phaseolus vulgaris L.) and barley (Hordeum vulgare L.) and some weed remnants were already at dehydration and senescence stages. This allows us for clear spectral discrimination of maize from other vegetation. Kussul et al. (2016) also reported an improved crop mapping with an OA of 87% during early growth stages that increased to 94% at late development stages. Several studies also distinguish specific temporal ranges for more accurate classifications (Mazzia et al. 2020; Vuolo et al. 2018; Waldhoff et al. 2017). The final classified maize fields accounted for about 8% of all land covers in the study site (Fig. 3).
We also confirmed that using multi-date composite images in the optimal temporal casement offered better maize discrimination accuracy when compared to using a single-date dataset. Belgiu and Csillik (2018) and Palchowdhuri et al. (2018) also reported classification accuracy increases when using multi-temporal composite images. With the opportunities to highly accurate crop discrimination potential of the high-resolution Sentinel-2 MSI, our ultimate classification result was slightly lower than several other studies (Khaliq et al. 2018; Lebourgeois et al. 2017; Nasrallah et al. 2018; Sonobe et al. 2018; Zheng et al. 2017). This is because the study area is highly fragmented farmlands with several adjusting agroforestry plant species, which can lead to spectral confusion between different species. Various studies also recognized spectral confusions among different species (Forkuor et al. 2014; Hu et al. 2019). In addition, intercropping operations in our study site, principally soybean (Glycine max) within the same maize fields, can also contribute to spectral confusions.
3.2 Remotely Sensed Maize Phenology Monitoring Technique
The remotely sensed monitoring of phenology of maize development revealed a distinct temporal pattern as presented in Fig. 4. Plant spectral reflectance-based VIs (NDVI, EVI, GVI, SAVI, Red-Edge NDVI, Red-Edge EVI) unveiled almost identical and consistent temporal patterns. At the same time, NDFI showed irregular patterns with the variability of moisture. During the initial periods, VIs showed the lowest records. After few weeks (around mid-April), NDFI values increased, which denotes the start of rainfall and might also point to agronomic flooding. This period is when farmers started maize sowing. Likewise, it is also possible to recognize sowing dates in remote sensing from the smoothed VIs temporal profile. A few weeks later (from 30 to 70 days after sowing), VIs steadily increased, representing the emerging of the maize plants and rapid growth. The crop reached its period of peak growth between 80 and 110 days after sowing. During this period, the VIs rose to the highest values. The decrease of VIs around 120–150 days after sowing represents plant maturity and senescence. VIs dropped to their lowest values around 160–180 after sowing as the leaves dried out and died, which was the time of harvesting operations in the study area. In the typical environmental conditions of the crop growing season, various plant spectral VI profiles showed a similar reflectance characteristic, which confirms numerous previous studies (Liao et al. 2019; Lobell et al. 2003; Sakamoto et al. 2005; Tian et al. 2019). Only NDFI revealed distinct reflectance properties linked with the frequency and quantity of rainfall in the study site.
3.3 Grain Yield Forecasting Models and Production Estimation
We analyzed the capability of six spectral VIs to predict maize grain yield early before the harvesting period. The result showed VIs could significantly predict the grain yield in the middle and late phenological stages. More accurate predictive models were achieved during the peak maize spectral reflectance period (between 80 and 110 days after sowing). Predictive models of this phenological stage are presented in Table 2. The use of single spectral VI, from Model 1–9, performed relatively low. From a single VI perspective, the Red-Edge band and EVI performed better with all estimations below 700 kg/ha of RMSE. The most accurate model was achieved through a combination of spectral VIs. Mathematical models: Model 10 (EVI and GVI), Model 11 (Red-Edge EVI and SAVI), and Model 12 (NDVI, Red-Edge EVI, and SAVI) all offered R2 values ≥ 80% of yield estimations.
Model validation was carried out using regression analysis between actual field observations and predicted yield (Fig. 5). The scatterplots were established on one-hectare-based agricultural land production rates. In accordance with Table 2, Models 1–6 were performed according to empirical linear functions, Model 7 with a polynomial function, Models 8 and 9 with exponential functions, and Models 10–12 were based on multiple-variable regressions. According to these predictive models, there was a clear relationship between the observed and predicted grain yields. The mathematical Model 12 offered the most highly accurate yield estimations in our study with a RMSE of 449.66 kg/ha. The model showed a slight overestimation with an overall computed bias of 3.00 kg/ha. Mathematical Model 5, which bases on GVI, showed the lowest predictive power with a RMSE of 784.61 kg/ha.
In our study, remarkable yield regression models were obtained with evidence for practical use (Table 2). The accuracy of yield estimates peaked between 80 and 110 days after sowing for linear and multiple regression models, which was the tasselling and silking stages after green-up. A similar result was obtained using MODIS images, suggesting that grain yield prediction using spectral indices offers high accuracy (Bolton and Friedl 2013). Other studies found that the time for peak correlation with the best time for early yield predictive models is during the reproductive stage of maize (Mkhabela et al. 2011; Sacks and Kucharik 2011).
In particular, GVI and NDVI-based simple linear regressions resulted in relatively lower predictive grain estimation compared to other models, suggesting that yield was under and overestimated, respectively. The poor correlation is because the high greenness of the cultivated maize variety can make these VIs easily saturated. Thus, these VIs have the character of easy saturation with low biomass (Casanova et al. 1998; Xue and Su 2017). In crop fields with low moisture content and low weed infestation, NDVI can be a good yield predictive indicator (Noureldin et al. 2013). In fragmented small-scale farmlands, variable moisture, and high greenness crop fields of our study, using EVI, Red-Edge-based, and soil suppressing spectral indices improve predictive models' precision. Studies of Clevers and Gitelson (2013), Dong et al. (2015), and Forkuor et al. (2017) also point to the importance of Red-Edge to minimize saturation problems in crop analysis and crop mapping.
Models from multiple regression offered higher accuracy of all yield forecasts; models 10, 11, and 12 featured lower RMSE of 5.73, 5.09, and 4.50 kg/pixel of grain, respectively (Table 2). The use of more than one spectral index increased the accuracy of the models. This method allowed for a comprehensive assessment of different attributes, such as chlorophyll content, vegetation health, water level, and soil and canopy conditions through several spectral bands. In addition, our approach offers improved yield estimates compared to the predictive models developed using climate variables, such as rainfall and temperature (Ramirez-Villegas and Challinor 2012). If highly accurate predictive models are required, multiple VIs-based models can be preferred. But, if parsimonious yield estimation is needed, predictive models with a single spectral index, such as EVI, Red-Edge-based NDVI, and EVI can be selected.
Grain yield mapping in the study area can be done through summing of all pixel yield estimates (10*10 m) in each maize field, and then converting it to standard production rate (ha). We presented the spatial distribution maps by executing based on considering the best predictive models; potentially close to the 1:1 relationship line (Models 4, 10, 11, and 12) (Fig. 6). With the most refined predictive model (Model 12), about 3.4 t ha−1 of mean grain production was obtained. With a total area of 1917.74 ha of classified maize fields in the study area, about 6520.32 tonnes of maize grain was found to be harvested in the 2018 cropping season.
In our investigation, VIs can potentially provide production anomalies. The potential of VIs for crop growth monitoring, and accurate yield prediction about 2 months before the start of the harvesting season, supports the government’s readiness for various early warning alerts. Subsequently, the government can make reliable agricultural decisions based on the crop growth conditions and production status, such as during the time of crop growth abnormalities, and shortage or surplus grain products.
4 Conclusion
We established approaches for remotely sensed maize phenological monitoring and yield predictive models early before the real-time harvesting period for Abaya district of Oromia Regional State in Ethiopia using spectral indices derived from Sentinel-2 MSI data. We also verified the potential of Sentinel-2 high-resolution imagery to discriminate maize fields in our study site. In addition, we evaluated the suitability of different vegetation indices (VIs) for grain prediction. Overall, our finding shows that the best time to predict maize grain yields was between 80 and 110 days after establishment. The uses of phenological-adjusted spectral indices provide more precise grain yield estimates. Multiple regressions from various VIs also offer more precise yield estimates than single VI-based models.
It is important to note that using a small number of sampling plots and the size of Sentinel-2 MSI grids in our study can affect our predictive models. We relied on agricultural lands that landlords were cooperative with to provide us with reliable yield information. On the other hand, the predictive model established in our study using only annual data may not be used for another year. This is because there has been a lack of a production database for the past years. Likewise, the Sentinel-2 imagery mission has been available only from 2016 onwards. For regular use of Sentinel-2-based predictive models, it would be essential to refine these models with data from many years. Slight pixel geometrical mismatches with the georeferenced field measured plots were also observed. But, it is reasonable to assume that yield variation within a single farm field is relatively small, and thus its effect on the model development is assumed to be negligible. In general, the results from this study suggest that Sentinel-2 MSI-based crop phenological information has the potential to support crop monitoring, mapping, and grain yield predicting in fragmented small-scale farming lands, like our study site, for agricultural decision-making.
References
Balaghi R, Tychon B, Eerens H, Jlibene M (2008) Empirical regression models using NDVI, rainfall and temperature data for the early prediction of wheat grain yields in Morocco. Int J Appl Earth Obs Geoinf 10(4):438–452. https://doi.org/10.1016/j.jag.2006.12.001
Ban H, Kim KS, Park N, Lee B (2016) Using MODIS data to predict regional corn yields. Remote Sens 16(9):1–19. https://doi.org/10.3390/rs9010016
Baret F, Guyot G, Major DJ (1989) TSAVI: a vegetation index which minimizes soil brightness effects on LAI and APAR estimation. In: 12th Canadian symposium on remote sensing geoscience and remote sensing symposium 3. pp 1355–1358. https://doi.org/10.1109/IGARSS.1989.576128
Belgiu M, Csillik O (2018) Sentinel-2 cropland mapping using pixel-based and object-based time-weighted dynamic time warping analysis. Remote Sens Environ 204:509–523. https://doi.org/10.1016/j.rse.2017.10.005
Belgiu M, Drăguţ L (2016) Random forest in remote sensing: a review of applications and future directions. ISPRS J Photogramm Remote Sens 114:24–31. https://doi.org/10.1016/j.isprsjprs.2016.01.011
Bolton DK, Friedl MA (2013) Forecasting crop yield using remotely sensed vegetation indices and crop phenology metrics. Agric for Meteorol 173:74–84. https://doi.org/10.1016/j.agrformet.2013.01.007
Boschetti M, Nutini F, Manfron G, Brivio PA, Nelson A (2014) Comparative analysis of normalised difference spectral indices derived from MODIS for detecting surface water in flooded rice cropping systems. PLoS One 9(2):1–21. https://doi.org/10.1371/journal.pone.0088741
Breiman L (2001) Random forests. Mach Learn 45:5–32. https://doi.org/10.1023/A:1010933404324
Bussay A, Velde MVD, Fumagalli D, Seguini L (2015) Improving operational maize yield forecasting in Hungary. AGSY 141:94–106. https://doi.org/10.1016/j.agsy.2015.10.001
Casanova D, Epema GF, Goudriaan J (1998) Monitoring rice reflectance at field level for estimating biomass and LAI. Field Crop Res 55(1):83–92. https://doi.org/10.1016/S0378-4290(97)00064-6
Chipanshi A, Zhang Y, Kouadio L, Newlands N, Davidson A, Hill H, Warren R, Qian B, Daneshfar B, Bedard F, Reichert G (2015) Agricultural and forest meteorology evaluation of the integrated canadian crop yield forecaster ( ICCYF) model for in-season prediction of crop yield across the Canadian agricultural landscape. Agric for Meteorol 206:137–150. https://doi.org/10.1016/j.agrformet.2015.03.007
Chivasa W, Mutanga O, Biradar C (2017) Application of remote sensing in estimating maize grain yield in heterogeneous African agricultural landscapes: a review. Int J Remote Sens 38(23):6816–6845. https://doi.org/10.1080/01431161.2017.1365390
Cian F, Marconcini M, Ceccato P (2018) Normalized Difference Flood Index for rapid flood mapping: taking advantage of EO big data. Remote Sens Environ 209:712–730. https://doi.org/10.1016/j.rse.2018.03.006
Clevers JGP, Gitelson AA (2013) Remote estimation of crop and grass chlorophyll and nitrogen content using red-edge bands on Sentinel-2 and -3. Int J Appl Earth Obs Geoinf 23:344–351. https://doi.org/10.1016/j.jag.2012.10.008
Cochrane L, Bekele YW (2018) Average crop yield (2001–2017) in Ethiopia: trends at national, regional and zonal levels. Data Brief 16:1025–1033. https://doi.org/10.1016/j.dib.2017.12.039
Congalton RG (1991) A review of assessing the accuracy of classifications of remotely sensed data. Remote Sens Environ 37(1):35–46. https://doi.org/10.1016/0034-4257(91)90048-B
CSA (Central Statistical Agency of Ethiopia) (2018) Farm management statistics of Ethiopia. Available at https://knoema.com/EFMS2020/farm-management-statistics-of-ethiopia. Accessed 16 Sept 2021.
da Silva EE, Rojo Baio FH, Ribeiro TLP, da Silva Junior CA, Borges RS, Teodoro PE (2020) UAV-multispectral and vegetation indices in soybean grain yield prediction based on in situ observation. Remote Sens Appl Soc Environ 18:100318. https://doi.org/10.1016/j.rsase.2020.100318
Darvishzadeh R, Atzberger C, Skidmore AK, Abkar AA (2009) Leaf Area Index derivation from hyperspectral vegetation indicesand the red edge position. Int J Remote Sens 30(23):6199–6218. https://doi.org/10.1080/01431160902842342
Dong T, Meng J, Shang J, Liu J, Wu B (2015) Evaluation of chlorophyll-related vegetation indices using simulated Sentinel-2 data for estimation of crop fraction of absorbed photosynthetically active radiation. IEEE J Sel Top Appl Earth Obs Remote Sens 8(8):4049–4059. https://doi.org/10.1109/JSTARS.2015.2400134
Eddy PR, Smith AM, Hill BD, Peddle DR, Coburn CA, Blackshaw RE (2014) Weed and crop discrimination using hyperspectral image data and reduced bandsets. Can J Remote Sens 39(6):481–490. https://doi.org/10.5589/m14-001
ESA (European Space Agency) (2016) Copernicus Sentinel-2 mission. Available at https://www.esa.int/Applications/Observing_the_Earth/Copernicus/Sentinel-2. Accessed 01 Jan 2020
FAO (2015) Analysis of price incentives for maize in Ethiopia for the time period 2005–2012. Available at http://www.fao.org/in-action/mafap/resources/detail/en/c/394285/. Accessed 01 Sept 2021
Forkuor G, Conrad C, Thiel M, Ullmann T, Zoungrana E (2014) Integration of optical and synthetic aperture radar imagery for improving crop mapping in Northwestern Benin, West Africa. Remote Sens 6(7):6472–6499. https://doi.org/10.3390/rs6076472
Forkuor G, Dimobe K, Serme I, Tondoh JE (2017) Landsat-8 vs. Sentinel-2: examining the added value of sentinel-2 ’ s red-edge bands to land-use and land-cover mapping in Burkina Faso. Gisci Remote Sens 55(3):331–354. https://doi.org/10.1080/15481603.2017.1370169
Gislason PO, Benediktsson JA, Sveinsson JR (2006) Random forests for land cover classification. Pattern Recogn Lett 27(4):294–300. https://doi.org/10.1016/j.patrec.2005.08.011
Gitelson A, Merzlyak MN (1994) Spectral reflectance changes associated with autumn senescence of Aesculus hippocastanum L. and Acer platanoides L. leaves. Spectral features and relation to chlorophyll estimation. J Plant Physiol 143(3):286–292. https://doi.org/10.1016/S0176-1617(11)81633-0
González-Gómez L, Campos I, Calera A (2018) Use of different temporal scales to monitor phenology and its relationship with temporal evolution of normalized difference vegetation index in wheat. J Appl Remote Sens 12(2):1–18. https://doi.org/10.1117/1.JRS.12.026010
Guo BB, Qi SL, Heng YR, Duan JZ, Zhang HY, Wu YP, Feng W, Xie YX, Zhu YJ (2017) Remotely assessing leaf N uptake in winter wheat based on canopy hyperspectral red-edge absorption. Eur J Agron 82:113–124. https://doi.org/10.1016/j.eja.2016.10.009
Haerani H, Apan A, Basnet B (2018) Mapping of peanut crops in Queensland, Australia, using time-series PROBA-V 100-m normalized difference vegetation index imagery. J Appl Remote Sens 12(3):1–22. https://doi.org/10.1117/1.JRS.12.036005
Hu Q, Sulla-Menashe D, Xu B, Yin H, Tang H, Yang P, Wu W (2019) A phenology-based spectral and temporal feature selection method for crop mapping from satellite time series. Int J Appl Earth Obs Geoinf 80:218–229. https://doi.org/10.1016/j.jag.2019.04.014
Huete AR, Jackson RD (1987) Suitability of spectral indices for evaluating vegetation characteristics on arid rangelands. Remote Sens Environ 23(2):213–232. https://doi.org/10.1016/0034-4257(87)90038-1
Huete AR, Liu H, van Leeuwen WJD (1997) The use of vegetation indices in forested regions: issues of linearity and saturation. IEEE Int Geosci Remote Sens 1(1):1966–1968. https://doi.org/10.1109/IGARSS.1997.609169
Huete A, Didan K, Miura T, Rodriguez EP, Gao X, Ferreira LG (2002) Overview of the radiometric and biophysical performance of the MODIS vegetation indices. Remote Sens Environ 83(1):195–213. https://doi.org/10.1016/S0034-4257(02)00096-2
Kamal J, Bhatia KJ (2010) Review of rice crop identification and classification using hyper-spectral image processing system. Int J Comput Sci Commun 1(1): 253–258. http://www.csjournals.com/IJCSC/PDF1-1/54.pdf
Khaliq A, Peroni L, Chiaberge M (2018) Land cover and crop classification using multitemporal sentinel-2 images based on crops phenological cycle. 2018 IEEE Workshop on Environmental, Energy, and Structural Monitoring Systems (EESMS). https://doi.org/10.1109/EESMS.2018.8405830
Kussul N, Lavreniuk M, Shelestov A, Yailymov B (2016) Along the season crop classification in Ukraine based on time series of optical and SAR images using ensemble of neural network classifiers. 2016 IEEE International Geoscience and Remote Sensing Symposium (IGARSS). pp 7145–7148. https://doi.org/10.1109/IGARSS.2016.7730864
Lambert M, Sibiry PC, Blaes X, Baret P, Defourny P (2018) Estimating smallholder crops production at village level from Sentinel-2 time series in Mali’s cotton belt. Remote Sens Environ 216:647–657. https://doi.org/10.1016/j.rse.2018.06.036
Lebourgeois V, Dupuy S, Vintrou É, Ameline M, Butler S, Bégué A (2017) A combined random forest and OBIA classification scheme for mapping smallholder agriculture at different nomenclature levels using multisource data (Simulated Sentinel-2 Time Series VHRS and DEM. Remote Sens. https://doi.org/10.3390/rs9030259
Liao C, Wang J, Dong T, Shang J, Liu J, Song Y (2019) Using spatio-temporal fusion of Landsat-8 and MODIS data to derive phenology, biomass and yield estimates for corn and soybean. Sci Total Environ 650:1707–1721. https://doi.org/10.1016/j.scitotenv.2018.09.308
Liu WT, Kogan F (2002) Monitoring Brazilian soybean production using NOAA/AVHRR based vegetation condition indices. Int J Remote Sens 23(6):1161–1179. https://doi.org/10.1080/01431160110076126
Lobell DB, Asner GP, Ortiz-Monasterio JI, Benning TL (2003) Remote sensing of regional crop production in the Yaqui Valley, Mexico: estimates and uncertainties. Agric Ecosyst Environ 94(2):205–220. https://doi.org/10.1016/S0167-8809(02)00021-X
Maresma Á, Ariza M, Martínez E, Lloveras J, Martínez-Casasnovas JA (2016) Analysis of Vegetation Indices to Determine Nitrogen Application and Yield Prediction in Maize (Zea mays L.) from a Standard UAV Service. Remote Sens 8(12):1–15. https://doi.org/10.3390/rs8120973
Mazzia V, Khaliq A, Chiaberge M (2020) Improvement in land cover and crop classification based on temporal features learning from Sentinel-2 data using recurrent-convolutional neural network (R-CNN). Appl Sci 10(1):1–23. https://doi.org/10.3390/app10010238
Meng W, Fu-lu TAO, Wen-jiao SHI (2014) Corn yield forecasting in northeast china using remotely Se n sed spectral indices and crop phenology metrics. J Integr Agric 13:1538–1545. https://doi.org/10.1016/S2095-3119(14)60817-0
Mkhabela MS, Bullock P, Raj S, Wang S, Yang Y (2011) Crop yield forecasting on the Canadian Prairies using MODIS NDVI data. Agric for Meteorol 151(3):385–393. https://doi.org/10.1016/j.agrformet.2010.11.012
MOA (Ministry of Agriculture) (2015) Ethiopian agriculture: production and market development. http://www.moa.gov.et/agricultural-development-sector. Accessed 01 Nov 2018
MOA (Ministry of Agriculture) (2019) Ethiopian agriculture: annual production and factors affecting this sector. http://www.moa.gov.et/web/guest/about-the-ministry. Accessed 2019 Feb
Mutanga O, Skidmore AK (2004) Narrow band vegetation indices overcome the saturation problem in biomass estimation. Int J Remote Sens 25(19):3999–4014. https://doi.org/10.1080/01431160310001654923
Nasrallah A, Baghdadi N, Mhawej M, Faour G, Darwish T, Belhouchette H, Darwich S (2018) A novel approach for mapping wheat areas using high resolution Sentinel-2 images. Sensors. https://doi.org/10.3390/s18072089
Noureldin NA, Aboelghar MA, Saudy HS, Ali AM (2013) Rice yield forecasting models using satellite imagery in Egypt. Egypt J Remote Sens Space Sci 16(1):125–131. https://doi.org/10.1016/j.ejrs.2013.04.005
Palchowdhuri Y, Valcarce-Diñeiro R, King P, Sanabria-Soto M (2018) Classification of multi-temporal spectral indices for crop type mapping: a case study in Coalville, UK. J Agric Sci 156(1):24–36. https://doi.org/10.1017/S0021859617000879
Panda SS, Ames DP, Panigrahi S (2010) Application of vegetation indices for agricultural crop yield prediction using neural network techniques. Remote Sens 2(3):673–696. https://doi.org/10.3390/rs2030673
Park SK, Schowengerdt RA (1983) Image reconstruction by parametric cubic convolution. Comput vis Graph Image Process 23(3):258–272. https://doi.org/10.1016/0734-189X(83)90026-9
Peng Y, Gitelson AA (2011) Application of chlorophyll-related vegetation indices for remote estimation of maize productivity. Agric for Meteorol 151(9):1267–1276. https://doi.org/10.1016/j.agrformet.2011.05.005
Prabhakara K, Hively WD, Mccarty GW (2015) Evaluating the relationship between biomass, percent groundcover and remote sensing indices across six winter cover crop fields in Maryland, United States. Int J Appl Earth Obs Geoinf 39(3):88–102. https://doi.org/10.1016/j.jag.2015.03.002
Ramirez-Villegas J, Challinor A (2012) Assessing relevant climate data for agricultural applications. Agric for Meteorol 161:26–45. https://doi.org/10.1016/j.agrformet.2012.03.015
Rodriguez-Galiano VF, Ghimire B, Rogan J, Chica-Olmo M, Rigol-Sanchez JP (2012) An assessment of the effectiveness of a random forest classifier for land-cover classification. ISPRS J Photogramm Remote Sens 67:93–104. https://doi.org/10.1016/j.isprsjprs.2011.11.002
Rossini M, Panigada C, Meroni M, Busetto L, Castrovinci R, Colombo R (2007) Pedunculate oak forests (Quercus robur L.) survey in the Ticino Regional Park (Italy) by remote sensing. J Silvic for Ecol 4(2):194–203. https://doi.org/10.3832/efor0450-0040194
Rouse JW, Haas RH, Schell JA, Deering DW, Harlan JC (1973) Monitoring the vernal advancement of retrogradation of natural vegetation (vols. E73–10303). Available via NTRS - NASA. https://ntrs.nasa.gov/citations/19740008955. Accessed 01 June 2021
Rouse JW, Haas RH, Schell JA, Deering DW, Harlan JC (1974) Monitoring vegetation systems in the great plains with ERTS. Proceedings, Third Earth Resources Technology Satellite-1 Symposium, Greenbelt 1974. pp 3010–3017. Available via CiNii Research. http://ci.nii.ac.jp/naid/10025572118/en/. Accessed 05 Oct 2021
RStudio Team (2018) RStudio: Integrated Development for R. RStudio, PBC, Boston, MA. Available at http://www.rstudio.com/. Accessed 01 Aug 2021
Ruml M, Vulic T (2005) Importance of phenological observations and predictions in agriculture. J Agric Sci 50(2):217–225. https://doi.org/10.2298/jas0502217r
Running SW, Loveland TR, Pierce LL, Nemani RR, Hunt ER (1995) A remote sensing based vegetation classification logic for global land cover analysis. Remote Sens Environ 51(1):39–48. https://doi.org/10.1016/0034-4257(94)00063-S
Sacks WJ, Kucharik CJ (2011) Crop management and phenology trends in the US Corn Belt: impacts on yields, evapotranspiration and energy balance. Agric for Meteorol 151(7):882–894. https://doi.org/10.1016/j.agrformet.2011.02.010
Sakamoto T, Yokozawa M, Toritani H, Shibayama M, Ishitsuka N, Ohno H (2005) A crop phenology detection method using time-series MODIS data. Remote Sens Environ 96(3):366–374. https://doi.org/10.1016/j.rse.2005.03.008
Sakamoto T, Wardlow BD, Gitelson AA, Verma SB, Suyker AE, Arkebauer TJ (2014) Remote sensing of environment a two-step filtering approach for detecting maize and soybean phenology with time-series MODIS data. Remote Sens Environ 114(10):2146–2159. https://doi.org/10.1016/j.rse.2010.04.019
Sharma LK, Bu H, Denton A, Franzen DW (2015) Active-optical sensors using red NDVI compared to red edge NDVI for prediction of corn grain yield in North Dakota, USA. Remote Sens 15(11):27832–27853. https://doi.org/10.3390/s151127832
Smethurst PJ, Huth NI, Masikati P, Sileshi GW, Akinnifesi FK, Wilson J, Sinclair F (2017) Accurate crop yield predictions from modelling tree-crop interactions in gliricidia-maize agroforestry. Agric Syst 155:70–77. https://doi.org/10.1016/j.agsy.2017.04.008
Sonobe R, Yamaya Y, Tani H, Wang X, Kobayashi N, Mochizuki K (2018) Crop classification from Sentinel-2-derived vegetation indices using ensemble learning. J Appl Remote Sens 12(2):1–16. https://doi.org/10.1117/1.JRS.12.026019
Tian H, Huang N, Niu Z, Qin Y, Pei J, Wang J (2019) Mapping winter crops in china with multi-source satellite imagery and phenology-based algorithm. Remote Sens 11(7):1–23. https://doi.org/10.3390/rs11070820
UNDP (2000) The Agricultural Weredas of Borena Zone, Oromiya Region. Available at https://www.africa.upenn.edu/eue_web/borena0600.htm. Accessed 20 Dec 2018
Vuolo F, Neuwirth M, Immitzer M, Atzberger C, Ng WT (2018) How much does multi-temporal Sentinel-2 data improve crop type classification? Int J Appl Earth Obs Geoinf 72:122–130. https://doi.org/10.1016/j.jag.2018.06.007
Waldhoff G, Lussem U, Bareth G (2017) Multi-data approach for remote sensing-based regional crop rotation mapping: a case study for the Rur catchment, Germany. Int J Appl Earth Obs Geoinf 61:55–69. https://doi.org/10.1016/j.jag.2017.04.009
Xue J, Su B (2017) Significant remote sensing vegetation indices: a review of developments and applications. J Sens 2017:1353691. https://doi.org/10.1155/2017/1353691
Zheng H, Du P, Chen J, Xia J, Li E, Xu Z, Li X, Yokoya N (2017) Performance evaluation of downscaling Sentinel-2 imagery for land use and land cover classification by spectral-spatial features. Remote Sens. https://doi.org/10.3390/rs9121274
Zhou X, Zheng HB, Xu XQ, He JY, Ge XK, Yao X, Cheng T, Zhu Y, Cao WX, Tian YC (2017) Predicting grain yield in rice using multi-temporal vegetation indices from UAV-based multispectral and digital imagery. ISPRS J Photogramm Remote Sens 130:246–255. https://doi.org/10.1016/j.isprsjprs.2017.05.003
Acknowledgements
This work was supported by Dilla University, Ethiopia, under-funds for medium and large-scale research projects. Field maize yield data and phenological stages were collected at small-scale farmlands in collaboration with local farmers. We thank ESA for the availability of Sentinel-2 MSI at Copernicus Open Access Hub. We would also like to thank the anonymous reviewers for their insightful comments.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Bazezew, M.N., Belay, A.T., Guda, S.T. et al. Developing Maize Yield Predictive Models from Sentinel-2 MSI Derived Vegetation Indices: An Approach to an Early Warning System on Yield Fluctuation and Food Security. PFG 89, 535–548 (2021). https://doi.org/10.1007/s41064-021-00178-5
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s41064-021-00178-5