Introduction

Rice, Oryza sativa, is a globally important food for more than half the population and it has a notable influence on the global political and economic system. With the changes in global climate and increase in world population, it is utmost important and challenging to maintain the production and demand for rice, particularly in developing countries like India (Auffhammer et al., 2012; Guntukula, 2020; Mahajan et al., 2017). This may be addressed by effective monitoring of crop health status and accurate estimation of yield to maintain political and social stability and equity (Brisco et al., 1998). In contrast to conventional field-based approaches, remote sensing provides valuable inputs for this purpose by supplying coarse to high spatial and temporal resolution images at regional to global scales. The marginal 20–25 days temporal resolution with high spatial resolution datasets are generally acceptable and plentiful for agricultural crops monitoring because the average field size is very small in most of the developing countries including India (Sivasankar et al., 2018). Nonetheless, the spectral region used to acquire remotely sensed imagery has a significant impact on the potentials to retrieve crop characteristics including yield (Alebele et al., 2020; Ranjan & Parida, 2021).

Most early investigations were performed using multispectral optical remote sensing data to infer crop bio-geophysical parameters. For instance, Patel et al. (1991) proposed a semi-empirical based approach to estimate paddy yield from Landsat Multispectral Scanning System (MSS) and Indian Remote Sensing (IRS) Linear Imaging Self Scanning (LISS)-I datasets. In spite of potential capabilities and availability of multispectral remotely sensed imagery, operational applications of the data in this regard are limited due to its ability to acquire data only in clear weather conditions. This has made the radar remote sensing, operating in the microwave region of the electromagnetic spectrum, as a better alternative toward uninterrupted crop growth monitoring. Moreover, synthetic aperture radar (SAR) data is sensitive toward geometrical, structural and dielectric properties of various crop cover components which made the data as suitable to retrieve crop bio-geophysical parameters (Liao et al., 2018; Sivasankar, Kumar, et al., 2020; Sharifi and Hosseingholizadeh, 2020). However, it is necessary to choose proper SAR sensor parameters such as frequency, polarization and incidence angle of operation in enhancing the sensitivity of return signal toward crop cover (Sivasankar et al., 2018). Particularly, higher frequency like C-band and high incidence angle (> 30°) is identified as suitable for narrow leaf crops such as paddy and wheat (Ferrazzoli et al., 1997; Macelloni et al., 2001). Apart from SAR data sensitivity toward vegetation characteristics, it is also highly influenced by topographic characteristics like aspect and degree of slope in hilly terrain.

With the advancements in remote sensing, GIS, computer science and computation techniques collectively enhanced the accuracy and precision to retrieve crop cover distribution and characteristics. Several previous studies (Patel et al., 2006; Inoue et al., 2014a, 2014b; Setiyono et al., 2019; Mandal and Rao 2020) have proposed various approaches to effectively retrieve crop yield within season from SAR data. These techniques range from simple empirical based to more complex models like radiative transfer, artificial intelligence, machine learning and deep learning-based approaches. Patel et al. (2006), is one of the notable studies in this area, proposed Interaction Factor (IF) from volume, moisture, height for each of the components and density of plant to determine the influence of various crop components on SAR backscatter. Moreover, Chauhan et al. (2018) reported that the IF provides valuable information as a crop descriptor to simulate SAR backscatter using water cloud model. Since 1990, the utilization of artificial neural networks (ANNs), a machine learning approach, over conventional methods have been significantly increased for remote sensing analysis (Atkinson & Tatnall, 1997). This is due to its ability to integrate different types of data for understanding complex patterns by incorporating realistic physical constraints (Carpenter et al., 1997; Lek & Guégan, 1999; Mas & Flores, 2008). Thus, several studies have attempted to analyze the performance of ANNs for agricultural applications which includes classification (Kumar et al., 2016; Murthy et al., 2003), crop biophysical parameters estimation (Chauhan et al., 2018; Jiang et al., 2004; Prasad et al., 2012). However, limited studies like Jing et al. (2013) have assessed the applicability of ANNs for yield estimation from SAR data. The literature clearly demonstrates the importance and research scope of ANNs in developing a robust model to improve the accuracy and precision of crop yield prediction from remotely sensed imagery.

Most of the previous studies (Maki et al., 2017; Setiyono et al., 2017, 2019) have attempted to analyze the integrated use of remotely sensed imagery (optical or/and SAR) in crop growth models to predict yield. However, the necessary information about weather and soil properties may not be available accurately over developing countries. Moreover, few studies (Li et al., 2003; Patel et al., 2006) attempted to retrieve yield based on remote sensing imagery alone, mostly exploring the potentials using empirical/semiempirical approach. Therefore, the study aims to assess the potentials of dual-polarization (VV + VH) Sentinel-1 SAR data using ANN-based approach. Further details about the study area, ground-truth data collection and methodology are given in the following sections.

Study Area and Datasets Used

Study Area

The study is carried over Cuttack district located in the mid-eastern part of Odisha. It extends over an area of 3,932 sq. km, between 84.85° E to 86.28° E longitudes and 20.04° N to 20.70° N latitudes as shown in Fig. 1. This region is situated at an altitude of about 15 m above MSL (Mean Sea Level) and covered with deltaic sediments of the Mahanadi River (Antaryami & Jyotsnarani, 2013). In Cuttack district, the temperature typically varies between 15 °C and 39 °C with an average annual rainfall of about 1501.3 mm. The area is dominated by agricultural fields, majorly cultivating paddy crop in the kharif season extended between June and December.

Fig. 1
figure 1

Location map of the study area. Red dots on the map represent the locations selected for CCE experiments

Satellite Data

Remotely sensed imagery acquired from Sentinel-1 and Sentinel-2 missions was used for estimating crop yield and site selection process for ground-truth data collection, respectively. These two missions are a constellation of two satellites, developed by European Space Agency (ESA) on behalf of the joint ESA/European Commission initiative Copernicus. Sentinel-1 provides the C-band (center frequency of 5.405 GHz) SAR datasets acquired in single or dual polarization, i.e., VV (vertical transmitting, vertical receiving) or HH (horizontal transmitting, horizontal receiving) or VV + VH (vertical transmitting, horizontal receiving) or HH + HV (horizontal transmitting, vertical receiving) in four different modes such as stripmap, Interferometric Wide swath (IW), extra wide swath and wave modes. In IW mode, Sentinel-1 acquires SAR data at spatial resolution (range × azimuth) of 5 m × 20 m with a wide swath width of 250 km at a look angle ranging from 29.1° (near range) to 46.0° (far range), whereas Sentinel-2 provides optical data in 13 spectral bands with four, six and three bands at 10 m, 20 m and 60 m spatial resolution, respectively. In this study, multi-temporal C-band dual-polarization (VV + VH) Sentinel-1A Level-1 Ground Range Detected (GRD) SAR imagery acquired in IW mode between 13 August 2019 and 11 December 2019 as well as cloud-free Level-1C Sentinel-2 optical data acquired on 15 November 2019 are used for analysis. The detailed specifications of the datasets used are given in Table 1. These datasets are freely available for researchers to download from the ESA Copernicus open access hub.

Table 1 Specifications of satellite data used in the study

Ground-Truth Data

Two ground-truth data collection campaigns were carried over the study area during paddy crop growing season. The first campaign was done during the initial crop growth stage, i.e., between 20 and 23 September 2019 to collect necessary information for paddy crop mapping. During this campaign, field boundaries and tracks were recorded using handheld GPS along with the crop type and health status details.

The second campaign was carried out between 9 and 23 December 2019 during the harvesting phase of the paddy crop. Field level paddy yield data are collected from selected fields, well distributed in the study area, using the Crop Cutting Experiment (CCE) approach. The dimension of CCE size (5 m × 5 m) for the field level yield estimation was selected on the basis of government state rule.

These fields are identified using the smart sampling techniques, detailed procedure is given in the following section. This enables the study to be comprehensive by considering the wide range of variations in the paddy crop patterns which includes early, mid and late sown crops. During the ground-truth data collection, an Android-based mobile application developed by Amnex Infotechnologies Pvt. Ltd., was used for CCE sample collection. The information collected through CCE mobile app consists of geographic coordinates (latitude & longitude), district, taluk, gram panchayat/ village name, date of CCE, farmer name, farmer contact number, crop name, crop variety, CCE size detail, land details, biomass weight, grain wet weight, grain dry weight, moisture percentage, photographs, other remarks (stress, disease, pest), etc. The plot-wise measured paddy grain weight and moisture content are used to estimate actual yield (dry grain weight). In this study, a total 137 CCE sample points have been collected over the parts of Cuttack District.

Methodology

Initially, multi-temporal SAR datasets are preprocessed and co-registered with cloud-free optical data in order to minimize shifts between images. The preprocessed multi-temporal Sentinel-1 SAR data are integrated with ground-truth data collected during the first campaign for paddy crop mapping, based on the workflow proposed by Sivasankar, Sharma, et al. (2020). Further analyses are carried over paddy fields using cloud-free optical data for the selection of locations to carry out CCE experiments. The detailed process of the smart sampling method used in this study is given in the following subsection. Later, field boundaries, locations and tracks recorded during the ground-truth data collection are overlaid on SAR imagery to extract the backscattering coefficients (\(\sigma_{{{\text{VV}}}}^{0}\) and \(\sigma_{{{\text{VH}}}}^{0}\)) signature from sampled agricultural fields. These multi-temporal backscatter signatures are analyzed with a polynomial curve fitting approach to obtain the peak stage signature of paddy crop patterns which includes early, mid and late sown crops. This peak stage signature of backscattering coefficients is related to in situ paddy yield (collected during the second ground-truth data collection campaign) to train and validate the ANN as well as to test the same. The detailed flowchart of the methodology is given in Fig. 2. As the sampled fields are considered over flat terrain, the influence of topographic characteristics is neglected in the study.

Fig. 2
figure 2

Flowchart of the methodology

Sentinel-1 SAR Data Preprocessing

The C-band dual-polarization (VV + VH) Sentinel-1A Level-1 GRD SAR imagery are preprocessed using Sentinel Application Platform (SNAP), an open-source software developed by Brockmann Consult, SkyWatch, Sensar and C-S. At first, Sentinel-1 SAR data applied a precise orbit of acquisition and thermal noise removal. Then, radiometric calibration was carried to convert digital number (DN) values of raw data into sigma nought backscatter (\({\sigma }^{0}\)) values which are proportional to the intensity of backscattered signal from a unit area on ground. This is followed by removal of speckle (black and pepper) noise, which is inherently in SAR images due to the coherent nature of the acquisition, using a refined Lee filter with window kernel size of 5 × 5 pixels. Later, it was applied to Range Doppler terrain correction using the Shuttle Radar Topography Mission (SRTM) digital elevation model at a spatial resolution of 1 arc second (30 m) as reference. These datasets are converted to dB logarithmic units, co-registered and resampled to 20 m pixel spacing before utilizing them for crop classification and yield estimation purposes.

Remote Sensing-Based CCE Sampling

The sampling fields for in situ yield data collection are identified using the smart sampling technique based on stratification of normalized difference vegetation index (NDVI) and normalized difference water index (NDWI), proposed by Kriegler et al. (1969) and Gao (1996), respectively, generated from cloud-free Sentinel-2 optical data acquired on 15 November 2019 (see Fig. 3a, b). Among these indices, NDVI measured using near-infrared (NIR) and red bands indicates the photosynthetically active biomass, whereas NDWI measured using NIR and short-wave infrared (SWIR) bands represents the water content of leaves. Thus, these two indices provide a better indicator to assess the spatial distribution of crop health status across the study area. The paddy crop in the study area is initially categorized into five different classes based on the NDVI and NDWI values. Then, the sampling fields (as given in Fig. 3c) are identified using a stratified sampling approach based on the crop health status by keeping in mind the accessibility and possibility to measure in situ yield data. Therefore, this enables the study to be comprehensive by considering the wide range of variations in paddy crop patterns.

Fig. 3
figure 3

Analysis of vegetation indices for stratification of paddy crop: a NDVI, b NDWI and c CCE sampling points

ANN for Crop Yield Estimation

A two-layer feedforward neural network comprising one hidden layer with sigmoid neurons and one output layer with linear neurons, available in the MATLAB software is employed for the study analysis. The typical structure of this type of ANN is illustrated in Fig. 4. The neural network operates based on the weights and biases between layers (as given in Eq. 1) to minimize errors during the training phase of the model. In this study, the model was attempted twice with backscattering coefficients (first attempt with \(\sigma_{{{\text{VV}}}}^{0}\) whereas second attempt with \(\sigma_{{{\text{VH}}}}^{0}\)) as inputs and one hidden layer with 10 neurons to infer paddy yield. The total of 137 samples are randomly divided into three independent sets, i.e., 95 samples (≈ 70%) for training the model whereas two different sets of 21 samples (≈ 15%) for validation and testing purposes. Among these sets, training samples are basically used during training and further corrections in the network based on the error calculated from validation samples, whereas the testing samples provide an independent measure of network performance during and after training. This study considered the Levenberg–Marquardt algorithm for network training process and mean squared error (MSE) is the average squared difference between estimated and actual yield, for assessing the performance. The ANN model automatically adjusts weights and biases during each training iteration to provide an optimum trained model based on the performance.

$$y = f\left( {\mathop \sum \limits_{j} w_{ij} x_{ij} + b} \right)$$
(1)

where ‘y’ and ‘x’ represent the output and input of the model, respectively; f(.) indicates the complex neural network function; ‘w’ and ‘b’ indicate weights and bias, respectively; and ‘i’ and ‘j’ indicate the number of inputs and neurons in hidden layer, respectively.

Fig. 4
figure 4

Structure of a two-layer feedforward neural network

Results and Discussion

In this study, the maximum likelihood supervised classification was used for paddy crop classification using multi-temporal dual-polarized (VV + VH) Sentinel-1A SAR data. During this process, the ground-truth information collected over 65 locations during the first campaign is randomly divided into two sets in which 46 (≈ 70%) are used for training and 19 (≈ 30%) are used for validation purposes. It is well known that the pixel-based classification methods like maximum likelihood commonly observe ‘salt and pepper’ or speckled appearance which limits its accuracy (Lu & Weng, 2007). Thus, the classification output underwent post-classification processing in ArcGIS software which includes filtering, smoothening and generalization. The final paddy crop map (given in Fig. 5) has significantly improved the classification performance with overall accuracy of 91.25% and observed an area of about 97,000 ha.

Fig. 5
figure 5

Paddy classification map of the study area

As discussed in ‘Remote Sensing-Based CCE sampling’ section, ground-truth sampling fields are identified for paddy yield data collection. The study analysis observed that the study area is majorly covered by good and excellent crop health status with an area of about 12,270.49 ha (which contributes 12.56% of total paddy area). Table 2 provides the area (ha) under each category of crop health status and number of CCE samples taken in the corresponding category. The statistical analysis of in situ yield data observes mean and standard deviation of 5286.55 kg/ha and 1115.21 kg/ha, respectively. It is also observed that the kurtosis of 0.45 and 0.51 indicates the data is most closely representing the normal distribution (see Fig. 6). This clearly shows that the data is not having influential outliers or extreme values for further analysis. Thus, the ground-truth paddy yield data has been considered for prediction modeling using multi-temporal SAR data.

Table 2 Area of paddy crop health status categories and CCE samples taken for the study
Fig. 6
figure 6

Analysis of ground-truth data: a histogram of yield data with normal distribution curve and b normal probability plot

The time-series backscattering coefficients (\({\sigma }^{0}\)), as given in Fig. 7, show the various paddy crop growth patterns in the study area. It is observed that the increase in backscatter (\({\sigma }_{\mathrm{VV}}^{0}\) and \({\sigma }_{\mathrm{VH}}^{0}\)) during the crop growth from initial stage to mature stage whereas the backscatter is decreased between mature stage and harvesting. Several previous studies (Choudhury & Chakraborty, 2006; Patel & Srivastava, 2013; Yang et al., 2014) have also observed similar trends of SAR backscatter with paddy crop growth. During the peak crop growth stages, the microwave signal interacts more with crop cover than the underneath soil. This enables the remotely sensed data at peak crop growth stages as a crucial input to infer the crop bio-geophysical parameters including grain yield (Brandão et al., 2015; Liu et al., 2019). Thus, the study estimated the highest value of \({\sigma }_{\mathrm{VV}}^{0}\) and \({\sigma }_{\mathrm{VH}}^{0}\) during the crop growing season from each sampled plot for further analysis. The study observed that the peak mature stage of paddy crop with mean and standard deviation of -8.9 dB and 1.48 dB, respectively, for VV polarization whereas − 14.22 dB and 1.2 dB, respectively, for VH polarization.

Fig. 7
figure 7

Polynomial curve fit plot: ac VH-backscatter and d–f VV-backscatter

The peak crop growth backscatter signatures in VV and VH polarization are analyzed to retrieve the paddy yield using ANN algorithm. It is observed that cross-polarization (\({\sigma }_{\mathrm{VH}}^{0}\)) has better performance for paddy yield estimation than the like polarization (\({\sigma }_{\mathrm{VV}}^{0}\)) data (see Fig. 8) due to the multiple scattering within the crop canopy at the peak stage and less influence by the crop surface condition (Ranson, & Sun, 1994; Wang et al., 2019). The training results for the \({\sigma }_{\mathrm{VH}}^{0}\) backscatter show that the overall values are greater than 0.72 with the validation and test results also showed the good coefficient of determination (R2), 0.75 and 0.70, respectively, between estimated yield and observed yield. But in case of \({\sigma }_{VV}^{0}\) backscatter, the training result shows the poor ‘R2’ value of 0.26 with the validation and test result also showed poor ‘R2,’ 0.36 and 0.14 in comparison of VH polarization. Moreover, the best validation performance was observed for \({\sigma }_{\mathrm{VH}}^{0}\) at epoch 4 whereas the \({\sigma }_{\mathrm{VV}}^{0}\) at epoch 14. This clearly indicates that the cross-polarization (VH) backscattering coefficients are more suitable for paddy yield estimation than the like polarization (VV). The study results support the previous studies (Inoue et al., 2014a, 2014b; Parida & Singh, 2021; Patel & Srivastava, 2013) in observing the cross-polarization for retrieving crop bio-geophysical parameters due to the multiple scattering in crop cover.

Fig. 8
figure 8

Training, validation and test results for VV polarization and VH polarization between observed paddy yield and estimated yield a VV polarization and b VH polarization

The study further analyzed the error distribution during training, testing and validation process for each ANN model. Error histograms (difference between actual and predicted yield value) given in Fig. 9 represents the error value, error frequency and zero error for \({\sigma }_{\mathrm{VV}}^{0}\) and \({\sigma }_{\mathrm{VH}}^{0}\) backscattering coefficients. It is observed that the zero error is illustrated with a yellow line in the middle with 12 instances for \({\sigma }_{\mathrm{VH}}^{0}\) and 5 instances for \({\sigma }_{\mathrm{VV}}^{0}\) in the training set. It is also observed that the errors in training, test and validation sets are more inclined to zero meaning, indicating that these ANN models have a high degree of generalizability. The detailed study results about the performance of \({\sigma }_{\mathrm{VV}}^{0}\) and \({\sigma }_{\mathrm{VH}}^{0}\) for paddy yield forecasting are given in Table 3 and spatial distribution of paddy yield derived from \({\sigma }_{\mathrm{VH}}^{0}\) using ANN is given in Fig. 10.

Fig. 9
figure 9

Error histogram with 20 bins for the training, validation and test steps: a \(\sigma_{{{\text{VV}}}}^{0}\) and b \(\sigma_{{{\text{VH}}}}^{0}\)

Table 3 ANN model result for VV and VH polarization for paddy yield forecasting
Fig. 10
figure 10

Paddy yield map derived from VH-backscattering using ANN

The study was used the ANN model to predict paddy yield and build a spatial map of the study area. The spatial map (Fig. 10) depicts the distribution of minimum, mean and maximum paddy yields across the research area. The minimum, mean and maximum yield values were found to be 394 kg/ha, 4791 kg/ha and 8498 kg/ha, respectively. Also, it has been observed that distribution of low yields pixels ranging from 394 kg/ha to 1743 kg/ha, is very low in comparison with distribution of mean and maximum yield pixels. We have verified with optical data as well and found that total 2.99 ha of very low yield value pixel due to the very low density of paddy crop. The results also indicate the overestimation of grain yield pixels, ranging 6646 kg/ ha to 8498 kg/ha, near to the river and coastal area, as well as a low land area with high moisture availability regions with total area of 2203.657 ha of very high yield pixels.

The study results show comparable accuracy obtained from previous studies (Maki et al., 2017; Setiyono et al., 2017) where remote sensing data integrated with climate variables and crop growth models. It is also demonstrated that the proper choice of input data and effective use of ANN may provide reasonable accuracy and precision that is observed from integrated use of remotely sensed imagery and crop growth models.

Conclusions

The study focuses on analyzing the potentials of dual-polarization (VV + VH) C-band Sentinel-1 SAR data for paddy yield forecasting using ANN model. A completely remotely sensed imagery (optical + SAR) has been proposed in this work to properly select the locations for ground-truth data collection as well as paddy yield estimation. In this study, the artificial neural network model has been used for estimating the paddy yield from VV and VH polarization signatures during the peak stage of paddy crop growth. It is observed that the VH polarization is highly sensitive and provides better accuracy than the VV polarization. This is because of the cross-polarization sensitivity toward the multiple scattering of microwave signals in crop cover. The study results exhibit that the effective use of ANN model integrated with remotely sensed imagery may provide comparable accuracy that is observed from crop growth models which require a lot of information regarding crop, soil, weather, etc. However, the proposed methodology needs to be further analyzed over different study areas to understand its applicability for operational applications as well as influence of weather parameters and others.