Introduction

Remote sensing is an important technique in agricultural monitoring, and is mainly used to provide estimates of crops yields, identify new agricultural areas, and monitor crops throughout the development cycle (Rudorff et al. 2010). According to do Amaral and Molin (2014), canopy sensors are useful tools for detecting variability in sugarcane fields.

These techniques can be used to obtain information about crops rapidly and non-destructively (Shiratsuchi et al. 2014). Additionally, these tools have become very relevant for obtaining and processing field data and are used to perform important tasks such as weather forecasting, water need assessment in plants, and disease and pest detection. According to Molin et al. (2015), the use of sensors, both in terms of application and equipment, has demonstrated the greatest potential for development in precision agriculture (PA). All these applications are based on the spectral behavior of the crops.

The spectral response of agricultural crops is influenced by the physical characteristics of the canopy and a series of plant biochemical factors. The following plant biochemical factors are related to agronomic parameters: canopy architecture; leaf chemistry; and leaf pigments such as carotenes, anthocyanins, chlorophyll a and b, and xanthophyll (Abdel-Rahman and Ahmed 2008). These factors influence the physiological processes related to either plant development or absorption of electromagnetic radiation (Martins and de Galo 2015). One way to evaluate the canopy spectral response is through vegetation indices.

Vegetation indices are mathematical formulas based on various band combinations within the electromagnetic spectrum. An understanding of the spectral behavior of vegetation is fundamental to interpreting the indices. The methods employed to evaluate canopy characteristics using vegetation indices are increasingly becoming relevant as these processes can be carried out in a non-destructive manner (Richards 1993), making it possible to perform several analyses at different stages of crop development (Jones and Vaughan 2010).

There are several types of vegetation indices that can be used among them: NDRE and NDVI, which are widely used in agriculture, but NDVI is saturated when the crop shows high physiological potential and as verified by Grohs et al. (2009) NDVI saturation happens with the increase of biomass. Taskos et al. (2015), when comparing NDRE with NDVI results in presenting a red-edge band, showed that NDRE was less susceptible to saturation as compared to NDVI. There is still a lack in the literature of studies that present methods for identifying NDVI saturation. In particular there is a problem in detecting at what point in the growth stage of the crop this problem occurs. If it were clearwhen the moment of this saturation occurs, then other indices can be used to solve this problem.

A wide range of terrestrial sensors, either active or passive, are now being used to produce vegetation indices for monitoring of biophysical properties and photosynthetic activity (Thenkabail et al. 2000; Hansen and Schjoerring 2003). Among terrestrial sensors, canopy sensors have been used as a tool to detect spatial and temporal crop variability.

Canopy reflectance has been used to evaluate the status of agricultural crops, such as wheat and corn, and to assist in management practices during crop development. However, there is little research on the use of canopy reflectance in the study and production of soybean (Miller et al. 2018).

Canopy sensors can be used for evaluating plant characteristics using the principles of canopy and leaf reflectance. The near-infrared (NIR) region (700 to 1 300 nm) is affected by canopy structure and leaf density (Kumar and Silva 1973) while chlorophyll shows higher absorption at wavelengths in the red and blue regions (Lichtenthaler and Buschmann 2001). In addition, the red-edge region (680 to 750 nm) is considered to be the inflection point between the red and NIR wavelengths of the spectrum and is sensitive to changes in chlorophyll content (Gitelson et al. 1996), which is related to the gross primary yield of terrestrial plants (Gitelson et al. 2006).

Research using canopy sensors has been conducted mainly for variable rate nitrogen application in sugarcane (Amaral et al. 2018), suitable nitrogen application in wheat crop (Cao et al. 2017), wheat yield forecast (Guo et al. 2018), nitrogen fertilizer recommendation (McFadden et al. 2018) among other research. However, research involving the correlation of the IR of these sensors with field-measured biophysical variables and the assessment of the quality of vegetation indices in a timely manner has not been proposed.

A statistical technique used in industry and that has been applied in agriculture for process quality evaluation is the statistical quality control (SQC), from this technique it is possible. In agriculture, SQC has been used to monitor and identify causes of variability in agricultural processes, providing improved operations through the identification of faults/errors found during work enabling their correction and as can be seen in soybean mechanized harvesting work (Menezes et al. 2018), groundnut sowing as a function of soil texture (Zerbato et al. 2017), groundnut sowing and harvesting (dos Santos et al. 2018), and mechanized coffee harvesting (Tavares et al. 2018) and among others.

There are no reports of the use of Statistical Process Control (SPC) to understand the spatial and temporal variability of biophysical variables and vegetation indices, especially of proximal remote sensors. Assuming that NDVI saturation occurs when the vegetation index (VI) stabilizes and cannot detect the variability of vegetation it was hypothesized that with the use of control charts, it is possible to detect when this saturation occurs.

The objectives of this study were: (i) to compare the potential and performance of the three vegetation indexes in monitoring soybean variability using canopy sensors in relation to the biophysical characteristics of the crop, showing which ones can be used during crop development, which is resistant to saturation; (ii) to determine the optimal timings for readings during the crop developmental stages, that allows detection of the optimum time for the use of the sensor in the crop through this index allowing the producer time saving and greater knowledge of the spatial and temporal variability; and (iii) to facilitate saturation detection vegetation indices through the control charts, for the remote sensing area showed great potential for identification at the moment of NDVI saturation, being a very promising result, since this index is more recommended for its use in the early stages of plant development, which can be seen in this work.

Materials and methods

Description of the experimental area

Temporal and spatial analysis of terrestrial remote sensing was performed in a Brazilian agricultural area (Fig. 1). The study was conducted between October 2016 and February 2017, which is the soybean crop season in the São Paulo State University (Unesp), School of Agricultural and Veterinarian Sciences, Jaboticabal, located near 21° 15′ 19.6″ S and 48° 15′ 38.5″ W, in the State of São Paulo, Brazil, in order to investigate temporal variability in the vegetation indices. The soil of the experimental area is classified as a Red Latosol, according to Embrapa (2013). The regional climate is Aw (tropical with a dry winter), according to the Köppen climate classification (Alvares et al. 2013), with an average temperature of 22.2 °C.

Fig. 1
figure 1

Location of the experimental area in the municipality of Jaboticabal, State of São Paulo, Brazil

The rainy season usually occurs between October and March and the annual average rainfall is 1 424 mm (Silva 2013). Pluviometric data corresponding to the study period for the experimental area has been presented in Fig. 2.

Fig. 2
figure 2

Source: DAVIS meteorological station of the Instrumentation, Automation and Processing Laboratory in the Department of Rural Engineering, Unesp/Fcav, Jaboticabal, State of São Paulo, Brazil

Monthly mean rainfall from October 2016 to February 2017, in Jaboticabal, State of São Paulo, Brazil.

Experimental design

The experimental design was based on the Statistical Quality Control method (SQC; Montgomery 2009), comprising 65 sampling points in a 30 × 30 m grid. Figure 1 shows the location of the experimental area.

At 30, 45, 60, 75, and 90 days after sowing (DAS), the parameters considered as quality indicators were evaluated by monitoring the soybean biophysical characteristics, such as fresh (or wet) and dry biomass, canopy width, chlorophyll index, plant height, and yield. The yield parameters were only evaluated at the end of the cycle, at 90 DAS (stage R6). Three vegetation indices were used to collect data: the normalized difference vegetation index (NDVI), normalized difference red-edge index (NDRE), and inverse ratio vegetation index (IRVI). Canopy sensors were used to collect data in order to correlate them with the biophysical characteristics of soybean, and to compare the potential and performance of the three vegetation indexes in monitoring soybean variability using canopy sensors during the soybean development stages.

Each sampling point or plot constituted of two lines, each 5 m long and spaced 0.45 m apart, creating 4.5 m2 of useful area per point. All evaluations were performed at all points while monitoring the spatiotemporal variability of the soybean crop. Both morphological variables and vegetation indices were collected at all sampling points (total, 65 points; Fig. 1). These points were spaced evenly from the 30 × 30 m sample grid. The evaluated period was at 30, 45, 60, 75, and 90  days after sowing (DAS). The GreenSeeker and Optrx canopy sensors were used only at the sampling points; the sensors were carried to each plot and were used above the canopy of the soybean crop. The 65 sampling points were georeferenced using a Trimble 6 global navigation satellite system (GNSS) and receiver composed of a high precision antenna, an integrated GPS receiver, and real-time kinematic (RTK) positioning signal (Trimble 2013; Embratop 2017).

Equipment

A seeder (COP Suprema; TATU Marchesan, São Paulo, Brazil), coupled to a tractor (MF 7180; Massey Ferguson, Brazil) was used for sowing. Spacing between rows was maintained at 0.45 m, and the sowing density was 24 m−1 seeds (cultivar TMG 7060 IPRO). Fertilization was incorporated to the side of the crop row during sowing, using 300 kg ha−1 (N–P–K formulation, 04–20–20). On the day of sowing, soil moisture in the 0–20 cm layer was found to be around 15.38%.

Active canopy sensors

Figure 3 shows the evaluation periods of the experiment with respect to the soybean crop growth stages. The sensor readings were obtained for the plant canopy, according to Grohs et al. (2011).

Fig. 3
figure 3

Sources: Ciampitti et al. (2014), K-State Research and Extension, adapted from Carneiro (2018)

Growth stages of soybean crop corresponding to the evaluation dates.

The GreenSeeker® active optical canopy sensor is a Trimble 500 model (Trimble, Sunnyvale, CA) sensor that emits radiation at two wavelengths, the NIR (770 nm) and the red (660 nm) regions, with a bandwidth of about 25 nm (Povh et al. 2008; Amaral et al. 2015). The sensor captures light reflected by plants and calculates the NDVI (Motomiya et al. 2014). In this study, measurements were taken for both NDVI and IRVI, similar to Kapp Júnior et al. (2016), in order to evaluate the canopy sensor efficiency. The sensor readings were maintained at a working height between 0.6 and 0.7 m as, according to the manufacturer’s instructions (Trimble), the recommended working height for the GreenSeeker sensor is between 0.6 and 1.2 m from the target (plant).

The OptRx® active optical canopy sensor is an Ag Leader ACS430 model (Ag Leader, 2202 South Riverside Drive, Ames, IA 50010, USA). It is possible to use this sensor to obtain measurements for both NDRE and NDVI. However, it was only used to measure NDRE as NDVI was measured using GreenSeeker. The height of the sensor readings was the same as that of GreenSeeker (between 0.60 and 0.70 m above the canopy), with a reading range at 60% of the reading height. According to Ag Leader Technology (2011), this sensor emits its own light, which measures the reflectance of specific electromagnetic waves. This measured reflectance indicates the chlorophyll and biomass content, allowing the sensor to obtain the vegetation index.

The Laboratory of Machines and Agricultural Mechanization—LAMMA (São Paulo State University (Unesp), School of Agricultural and Veterinarian Sciences, Jaboticabal, State of São Paulo, Brazil) created a means of transportation the OptRX sensor (Fig. 4) for use in field experiments, to enable monitoring of the growing stages of the crop. This transportation support was based on a model developed by Professor Brenda V. Ortiz, University of Auburn (Alabama, USA). A bicycle was used as support, enabling transportation of the GreenSeeker and Crop Circle active optical sensors for field surveys (Carneiro et al. 2017).

Fig. 4
figure 4

The transport used for carrying the OptRX sensor for field data collection, made by the Laboratory of Machines and Agricultural Mechanization (LAMMA)

Vegetation indices

The vegetation indices evaluated in this study were NDVI and IRVI, obtained by GreenSeeker, and NDRE, obtained by OptRX. Table 1 shows the calculations for each index.

Table 1 Vegetation indices

The NDVI was calculated using the red (660 nm) and infrared (770 nm) bands (Povh et al. 2008); the IRVI was calculated using bands in the region of 650 and 770 nm (Kapp Júnior et al. 2016); and the NDRE was measured using the 790 and 720 nm bands (Fitzgerald et al. 2006).

Quality indicators

The following variables were evaluated.

  • Plant height and canopy width (Fig. 5a). The scale used was graded in meters, canopy width was measured from one end of the plant to the other and plant height was measured from ground level to the highest point of the plant. The measurements were carried out randomly within the plot, with three measurements per plot and line (Fig. 5b).

    Fig. 5
    figure 5

    The measurements were carried out randomly within the plot, with three measurements per line and plot

    A diagram was prepared for the evaluated variables. To facilitate understanding, the two large groups were separated, namely, the biophysical characteristics of the plant and the vegetation index (Fig. 6).

    Fig. 6
    figure 6

    Diagram of the variables analyzed

  • Fresh and dry biomass were collected only from the above-ground of the plant. The samples were obtained using a 0.25 m2 (0.5 × 0.5 m) frame (Fig. 6a). Sampling was conducted per row, for 1 m per point in two rows such that each of these rows was 5 m in length (Fig. 5b). After placing the frames in each of the two rows of the sampling point, the plant was cut at the ground level with the aid of a knife. The aerial part of the soybean was taken, and the roots were discarded. The collected plants were placed in paper bags and weighed to obtain the wet mass (fresh biomass, Fig. 7a), after which the samples were placed in a greenhouse with air circulating at 65 °C for 72 h (Gobbi et al. 2009; Grohs et al. 2009) in order to dry the samples (dry biomass, Fig. 7b). Finally, the dried samples were weighed in a semi-analytical balance (model BL 3200H) to obtain the dry mass.

    Fig. 7
    figure 7

    In each collection, wet (a) and dry (b) biomass per sample

  • Chlorophyll index This was obtained using a CCM-200 plus Chlorophyll Content Meter (Marconi, Piracicaba, São Paulo, Brazil) with an accuracy of ± 1 CCI (chlorophyll content index) unit. The readings were performed on three sheets per point and collected randomly; three readings were made per sheet in order to improve data precision.

  • Grain yield The frame used had an area of 0.45 m2, within which plants were cut close to the ground level, before being harvested using a plot combine, which was a Wintersteiger Seedmech Nursey Master Elite® model (Wintersteiger South America, 80240120 Curitiba, PR, Brasil), and then cleaned. Subsequently, the grains were weighed and the data transformed into kg ha−1 (Bertolin et al. 2010).

Statistical analyses

In this study, statistical quality control was managed using control charts, descriptive analysis, and correlation using the Pearson correlation coefficient. Statistical Quality Control, one of the most used tools in the Statistical Process Control (SPC) application, was used to analyze the quality indicators using individual control charts. This tool has been used in several studies, including studies by Chioderoli et al. (2012), Zerbato et al. (2013), and Voltarelli et al. (2015), among others, to enable the monitoring of variation in data over time.

Control charts of individual values were prepared using upper (LSC) and lower (LIC) control limits and the arithmetic mean. These limits were obtained by the mean and standard deviation of the parameter values, with LIC = the mean minus three times the standard deviation, and LSC = the mean plus three times the standard deviation (Toledo et al. 2008). In a normal distribution, 99.7% of the observations are within + or − 3 standard deviations from the mean (Minitab 2017).

Descriptive analyses are simple numerical procedures used to summarize, analyze, organize, and describe the important aspects of a feature set observed in the sample (Pérez-Vicente and Ruiz 2009). Correlation analysis was conducted to verify the behavior of the quality indicators (vegetation indices, fresh and dry biomass, chlorophyll index, canopy width, plant height, and yield). This coefficient is used to measure the intensity, strength, or degree of the linear relationship between two random variables (Stevenson 2001; Bunchaft and Kellner 2002; Barbetta et al. 2004; Kazmier 2007; Ferreira 2009).

The coefficient sign indicates the direction of the correlation and can determine the presence of a linear correlation (Cruz and Regazzi 1997; Cruz and Carneiro 2003; Hair et al. 2005). The values of the correlation coefficient can vary from − 1 to 1, and the higher the value is, the stronger the relationship between the variables is. When the value is close to 0, there is no linear relationship between the variables. However, a value of 1 indicates a perfect linear relationship (Minitab® Support, 2019).

Correlation is one of the statistical analyzes widely applied in the remote sensing area, since it presents the standardized data, is dimensionless and it is also possible to verify the linear relationship between two continuous variables (Minitab 18, 2019), which in this work were the characteristics biophysical studies with vegetation indices. Dancey and Reidy (2006); and Barbetta (2006) describe the classification for the Pearson correlation coefficient: r = 0.10 to 0.30 (weak); r =0.40 to 0.6 (moderate); r = 0.70 to 1 (strong). Thus in this work it was verified which of the biophysical characteristics of the crop that most relate to the vegetation indices.

Spatial and temporal variability maps

The spatial and temporal variability maps were made using the QGIS software, where it was possible to create the maps through the real database collected in the field. The deterministic interpolator Inverse of the Squared Distance (IDW) was used to create the geofields for each biophysical and remote sensing variables collected in the samples area of the crop in the field, allowing visualization and monitoring during soybean crop growth stages (Molin et al. 2015).

$$Z = \frac{{\varSigma_{i = 1}^{n} \frac{1}{{d_{i}^{2 Zi} }}}}{{\varSigma_{i = 1}^{n} \frac{1}{{d_{i}^{2} }}}}$$
(1)

where: Z—estimated value for a given point; n—number of neighboring sampling points used in the estimation; Zi—value observed at the sampling point; di—distance between the sampling point and the estimated point (Zi and Z).

Results and discussion

The results and discussion of this study are discussed in the same section. To facilitate the understanding of this section the themes under study are divided into four topics: (i) control charts of individual values, verifying the variability of the process and detecting the processes of reading the vegetation indexes with higher quality; (ii) the descriptive analysis complementing the control charts by dispersion and position values; (iii) correlation between the biophysical variables of the crop and the vegetation indices, observing the optimal time for its use in the field and identifying the indices most correlated with the morphological variables of the soybean; and (iv) maps of spatial and temporal variability for monitoring the growth stages of the soybean crop.

Control of individual values

Using the control charts (Fig. 8) and descriptive analyses, it was possible to monitor the temporal variability of the indices (NDVI, IRVI, and NDRE) as well as the performance of these indices through the canopy sensors. NDRE demonstrated a higher process quality due to its lower variability as compared to NDVI and IRVI, and the lowest values for the coefficients of variation and dispersion.

Fig. 8
figure 8

Control charts showing individual values for the vegetation indices (NDVI, NDRE and IRVI) at 30, 45, 60, 75, and 90 DAS. UCL upper control limit, LCL lower control limit, \(\overline{X}\), average, DAS days after sowing, NDVI normalized difference vegetation index, NDRE normalized difference red-edge index, IRVI inverse ratio vegetation index

In their evaluation of several stages of soybean crop development in 2015 and 2016, Miller et al. (2018) used a canopy sensor with NDRE in order to verify the usefulness of this index for the crop, rather than using NDVI, given the limitations of the latter. The authors concluded that NDRE can be used in conditions in which the vegetative canopy has a higher biomass, and that active canopy sensors can overcome the limitations of passive sensors, such as sensor reading time and interference from cloud presence. This was also observed in the control charts that were produced in this study, in which NDRE presented a higher resistance to saturation readings than did NDVI. In addition, the control charts demonstrated their use as an excellent tool for visualizing the period in which NDVI saturation occurred.

As shown in Fig. 8a, there was a saturation in the NDVI values at 75 and 90 DAS (R4 and R6 stages) as the increase in biomass (Fig. 9b) interfered with the accuracy of the readings (Fig. 7a). The control charts of NVDI showed that there is almost no variability in the process (Fig. 9a), though there was some variability for the wet biomass (Fig. 9b). This indicates the importance of the tool in identification of the saturation point of this index. In two separate studies, Cao et al. (2015, 2016) stated that one of the challenges when using the GreenSeeker sensor is that saturation becomes problematic for NDVI during high and medium biomass conditions. Hence, NDVI is not considered suitable for areas of high yield.

Fig. 9
figure 9

Control charts showing individual values for the NDVI wet biomass at 30, 45, 60, 75, and 90 DAS. UCL upper control limit, LCL lower control limit, \(\overline{X}\), average, DAS days after sowing, NDVI normalized difference vegetation index, NDRE normalized difference red-edge index, IRVI inverse ratio vegetation index

The problem with NDVI saturation was observed by Mutanga and Skidmore (2004), who estimated pasture biomass based on vegetation indices, such as NDVI, calculated using the NIR and red bands. However, they found that these indices had limitations in high density vegetation because of saturation. The same authors indicated that, when the canopy is of high density, pasture biomass can be estimated using wavelengths located in the red-edge region, as these are more accurate than standard NDVI.

The United States Geological Survey (USGS, 2015) proposed that the reason for the saturation problem associated with NDVI saturation is that NDVI is based on a scale of values ranging from − 1 to 1, with the highest values being between 0.6 and 0.9 for dense vegetation. However, Huete et al. (2002) reported that this index begins to lose sensitivity with increase in vegetation density.

The points outside the control limits can be explained by leaf characteristics that interfere with their reflectance, such as the spaces occupied by air and water, pigmentation, and cellular structures, all of which affect the wavelength of incident radiation (Gates et al. 1965). Gausman and Allen (1973) reported on the influence of other factors, such as leaf age, light conditions, maturation, water content, senescence, nodal position, and pubescence.

Bernardes (1987) classified the factors related to the absorption of solar radiation and interception as physiological (functional character, i.e., plant age, nutrients, and leaf water content) and morphological (related to the spatial organization of the leaves, that is, leaf insertion angle, vertical and horizontal leaf distribution, and plant cover density).

Descriptive analysis

Descriptive analyses were performed to verify the spatiotemporal variability of the variables. Table 2 shows the dispersion of the values, which varied significantly over time because the ability of leaves to perform photosynthesis increased from seedling emergence until physiological maturity, while the photosynthetic rate significantly reduced until complete maturation of the plant (Moreira 2012). The same author reported that photosynthesis is related to the amount of radiation absorbed in the visible spectrum (red and blue bands), and senescent leaves reflect more strongly in the visible spectrum than young leaves.

Table 2 Statistical analysis expressed as coefficients of variation (CV) and simple arithmetic means for NDVI, NDRE, IRVI, canopy width, chlorophyll content index, fresh and dry biomass, plant height, maturity, and yield over time at 30, 45, 60, 75, and 90 DAS (n = 63)

In addition, while increased photosynthesis causes plants to absorb more in the red region as the canopy develops, the reflected radiation also increases in the NIR region (Jensen 2011). This infrared energy is not utilized by photosynthesis, but is used by the plant for molecular vibration or heating (Moreira 2012). In the present study, it possible to observe this behavior of the energy derived from the visible and NIR bands due to the use of these bands in calculating the three indices (Table 1).

The spectral behavior of the vegetation can be influenced by the substrate type and canopy architecture (Novo 2008). This was observed by Luiz and Epiphanio (2001) in spectral behavior of soybean, which was affected by soil reflectance as full soil cover was not achieved, therefore causing a reduction in reflectance. Therefore, at 30 DAS in the initial stages of this study, the coefficient of variation and dispersion values were higher as compared to those of other periods due to the influence of the soil and canopy width.

There has been a considerable increase in the number of studies on the impact of various factors on vegetation spectral behavior, such as canopy variables, data acquisition geometry, canopy height and shape, and practices relating to agriculture and species (Kimes 1983; Guyot, 1984; Valeriano 1992; Gleriani 1994; Ferri 2002; Galvão et al., 2004a, b, 2005, 2006).

Pearson correlation coefficient

Pearson coefficient correlation analysis (Table 3) showed that the strongest correlations occurred at 45 and 60 DAS (V5 and V6 stages) due to the higher physiological development potential of the crop. Hence, the indices were better correlated with the agronomic parameters (fresh and dry biomass, canopy width, chlorophyll index, plant height, and yield).

Table 3 Pearson correlation coefficient (r) results for the three vegetation indices (NDVI, NDRE, and IRVI), relating to fresh and dry biomass, canopy width, yield, chlorophyll content index, and maturity, at 30, 45, 60, 75, and 90 days after sowing (n = 65)

do Amaral and Molin (2014) also concluded that the most satisfactory canopy sensor results can be acquired and measured when sufficient biomass has accumulated, such as at the end of crop development. The sensors usually show lower sensitivity in their evaluation due the excessive accumulation of biomass as compared to earlier developmental stages. This was also observed in the present study, at 75 and 90 DAS (stages R4 and R6).

The lowest values were observed at 30 DAS (stage V3), and could be due to the influence of the higher soil exposure and smaller canopy width, as compared to the later stages. Moreira (2012) stated that a canopy is considered incomplete when the crop is at the beginning of its vegetative development, because the percentage of vegetation cover on the soil is low and reflected radiation is therefore partly derived from the soil and partly from the vegetation. Hence, low correlation values were obtained in this study because of the influence of the soil on sensor readings.

The parameters that showed the best correlations were fresh and dry biomass, canopy width, the chlorophyll index, and plant height, mainly with respect to NDRE. Similar results were obtained by Portz et al. (2012), who used a canopy sensor at wavelengths in the red-edge and NIR regions and found an improvement in the sensor performance in predicting nitrogen and biomass absorption. do Amaral and Molin (2014) observed that the amount of sugarcane biomass had a higher influence on sensor readings as compared to leaf pigment.

Canopy sensors are an efficient tool for predicting yield (Raun et al. 2001; Teal et al. 2006; Lofton et al. 2012). However, in the present study the correlation values for yield were low. This can be explained by the findings of Raun et al. (2005), Kitchen et al. (2010), and do Amaral and Molin (2014) , who reported on the various uncontrollable factors that can affect yield, thereby influencing the relationship between yield and canopy sensors (such as pest attack, climatic stress, and crop management).

NDRE presented the best correlation values in comparison with NDVI and IRVI. Cao et al. (2015, 2016) and Amaral et al. (2015) observed that vegetation indices using wavelengths in the red-edge region are able to overcome the saturation problem experienced when using NDVI. Ma et al. (2001) found that growth conditions may affect the association of NDVI with soybean yield. For example, optimal conditions may result in improved germination and rapid soil cover for capturing solar radiation, but dry and cold planting can delay emergence and lead to a decreased yield.

Typically, most soils covered by plants exhibit peak absorption at wavelengths in the red region. Hence, NDVI loses its sensitivity to biomass change, which may reflect on yield results (Povh et al. 2008). Li et al. (2010a) and Cao et al. (2015), while evaluating the performance of canopy sensors for wheat crop, observed that the amount of biomass caused saturation problems when using NDVI.

The effect of saturation is related to the built-in normalization effect of calculating the vegetation index, and the type of band chosen (Van Niel and McVicar 2004; Gnyp et al. 2014). Saturation problems can be reduced by using wavelengths with a similar penetration in the plant canopy (Van Niel and McVicar 2004) or similar vegetation index rates (Gnyp et al. 2014).

IRVI has been used to predict biomass in areas with a high vegetative intensity as this index is less susceptible to saturation (Hatfield and Prueger 2010; Li et al. 2010b; Bolfe et al. 2012). NDVI uses the NIR region to estimate crop yield by analyzing biomass composition, and methods based on NDVI that obtain data using reflectance have been widely used in recent years. However, the red-edge band may provide a better fit for models that predict soybean yield (Mourtzinis et al. 2014).

Recent studies have demonstrated the superiority of NDRE models, which yield a 28% increase in the coefficient of determination (R2) when compared with models that use wavelengths in the red region (Eitel et al. 2010). The red-edge band penetrates deeply into the plant canopy and is sensitive to chlorophyll in the upper biomass of the canopy, overcoming the saturation problem inherent in NDVI, and is therefore the preferred vegetation index (Li et al. 2014). Eitel et al. (2010) observed that using the red-edge region increased the ability to estimate variations in chlorophyll content (r2 > 0.73) from when the red-edge was not used (r2 = 0.57).

In addition, it was observed in this study that NDVI and IRVI were inversely proportional to each other (Table 3). This has been explained in a study by Kapp Júnior et al. (2016), who showed that the chlorophyll concentration in leaf tissue influences wavelength reflection in the visible spectrum, and that the lower the application of nitrogen to the plant, the lower the chlorophyll level and solar radiation absorption in the visible range, leading to an increased IRVI and decreased NDVI. Motomiya et al. (2009) also reported that lower amounts of nitrogen input cause a decrease in the leaf expansion rate and chlorophyll and reduces crop development. Therefore, the optimal time to take canopy sensor readings, mainly based on NDRE, was at 45 and 60 DAS (V5 and V6 stages), because of the superior correlations between quality indicators (vegetation indices vs. fresh and dry biomass, canopy width, chlorophyll index, yield, and maturation) during the soybean development stages (Table 3).

Spatial and temporal variability maps

In addition to the above topic, this section contains the spatial–temporal variability maps of the variables. Figures 10 and 11 show that an increase in biomass is associated with the development of the crop, and the values corresponding to 90 DAS (Figs. 10e and 11e) were the highest values of wet and dry biomass. However, a low amount of biomass was observed at some places of the map, which can be explained by the presence of terraces, where there was the accumulation of water after precipitation, as a physical barrier made runoff difficult, affecting soybean growth.

Fig. 10
figure 10

Spatiotemporal distribution of fresh or wet biomass variable (kg ha−1) using IDW (Inverse of squared distance)

Fig. 11
figure 11

Spatiotemporal distribution of the dry biomass variable (kg ha−1) using IDW (Inverse of the squared distance)

The spatial and temporal variability of soybean during crop growth has been shown in Figs. 12, 13, and 14. Few areas had small canopy-width plants and lower plant heights, which can be seen at 30 DAS, as the plant was in the initial stages and had a lower leaf area index (Fig. 14a). As the sowing operation was carried out without the use of autopilot, issues such as machinery trampling plants during the field operations of the soybean, presence of short lines, absence of plants in certain locations due to the difficulty of carrying out the sowing operation without the use of autopilot, absence of systematization of the area, presence of faults and double-spacing, and water accumulation were observed in field when there were the terraces.

Fig. 12
figure 12

Spatiotemporal distribution of the variable canopy width (cm) using IDW (Inverse of the squared distance)

Fig. 13
figure 13

Spatiotemporal distribution of the plant height variable (cm) using IDW (Inverse of the squared distance)

Fig. 14
figure 14

Spatiotemporal distribution of the chlorophyll index variable (CCI) using IDW (Inverse of the squared distance)

The variability of NDVI, NDRE, and IRVI vegetation indices for soybean crop is shown in Figs. 14, 15 and 16. In the early stages of the plant, the index readings were affected, showing high spatial variability (Fig. 15). Moreira (2012) explain that this occur because the crop has a low percentage of vegetation cover of the soil, and a noise is captured by the sensor because part of the reflectance is from the plant and the other from soil. Similar trends were also found by Povh et al. (2008), where the initial stages of the culture were influenced by the soil as a consequence of the failure in the emergence of the plants, due to the low leaf area index. However, the same authors reported that it is possible to accurately determine variation in biomass with the greater development of the above ground biomass due to the better closure of the canopy. (Fig. 17)

Fig. 15
figure 15

Spatiotemporal distribution of the NDVI variable using IDW (Inverse of the squared distance)

Fig. 16
figure 16

Spatiotemporal distribution of the variable NDRE using IDW (Inverse of the squared distance)

Fig. 17
figure 17

Spatiotemporal distribution of the variable IRVI using IDW (Inverse of the squared distance)

In the spatiotemporal distribution maps, the index that showed the better performance was the NDRE, while the NDVI had a saturation problem, and the IRVI did not show good performances, as some map locations showed differences between the readings of NDRE and NDVI indices with IRVI. As for the saturation problem of NDVI, Zanzarini et al. (2013) observed that the NDVI had this limitation factor due to the increase in biomass with fast saturation, according to the growth stage of the crop, reducing the accuracy of the reading due the saturation problem this vegetation index.

Liu (2006) found, using satellite images, that the maximum value of saturation was 0.84. Figure 15c shows that some areas had values between 0.785 and 0.833. In Fig. 15d, there is no variance between the values, since it was seen that NDVI reached saturation at 75 and 90. Hence, NDVI is more suitable in the early stages of. However, the NDRE did not present this problem because this index is not susceptible to saturation. Taskos et al. (2015), when comparing reading efficiency between NDRE and NDVI for grape culture, found that NDRE was less susceptible to saturation than NDVI because of the wavelength in the red-edge region.

Conclusions

By using the control charts, it was found that the lower variability of NDRE resulted in a higher process quality as compared to those for NDVI and IRVI, and it also presented the lowest values for the coefficients of variation. The control charts are valuable tools to identify the moment when saturation occurs for the saturation indexes that are more susceptible to saturation, as was the case with NDVI. Control charts have shown great potential in the visual detection of saturation, which is a problem encountered by the researchers, especially when using red wavelengths such as NDVI. Through the charts it is possible to observe the temporal behavior of the indices knowing at what moment it can be used accurately and when there is its saturation. Given this, new statistical tools are being applied to improve the temporal analysis of remote sensing, since normally the data from the reflectance of objects has nonlinear behavior.

Maps of temporal and spatial variability, NDRE showed better performance, NDVI had a saturation problem, and IRVI did not show good performance possibly because it was obtained by the red and near infrared bands, demonstrating that in the wavelength in red it presented limitation in the final stages of cultivation, and in the red-egde band it was less susceptible to saturation than in the red area.

To capture the spatial variability of soybean crop using canopy sensors, readings should be taken after the plants have completely covered the soil. Overall, the results showed that the optimal time was at 45 and 60 DAS (V5 and V6 plant development stages), based mainly on NDRE, due to the better correlations between quality indicators compared to other stages of soybean development.

In future work, statistical process control may possibly be applied to monitor the quality and variability of other vegetation indices and biophysical characteristics. Meanwhile, it was observed in this article in both control charts and maps, they showed similar results, further emphasizing the adoption of remote sensing charts as a promising statistical tool as has already been found in other areas of agrarian sciences.