Introduction

Mangroves have several characteristic features that separate them from other woody plants. Mangroves are halophytes with unique breathing roots—pneumatophores. There are 110 species of mangroves present, of which 54 species are true mangroves, and the rest are associated mangroves. Mangroves help recycle nutrients; act as a link between terrestrial and marine habitats, thus supporting various life forms (Nagelkerken et al., 2008); serve as a sink for different minerals (Alongi et al., 2004) and sequester a considerable percentage of global carbon annually (Bouillon et al., 2008; Kristensen et al., 2008; Alongi & Mukhopadhyay, 2015). Mangroves are degrading fast due to various anthropogenic activities. UNEP (United Nations Environmental Programme) launched the "Blue Carbon Initiative" project to protect and restore mangroves and other vegetation.

Remote sensing offers ample scope for frequently monitoring these salt marshes without difficulty and with good precision. Remote sensing can help determine the biophysical parameters for a vast area in a short time. Multispectral remote sensing views a region in a few wide wavelength bands but can range from medium to very high spatial resolution. Hyperspectral remote sensing is a kind of remote sensing that uses a range of wavelengths from the electromagnetic spectra by dividing it into bands of very fine narrow wavelengths so that variation in spectral properties of different species can be studied and determined precisely for proper identification and discrimination of species as well as determination of their biophysical and biochemical parameters.

Biomass is the biophysical property indicative of the carbon sequestration capability of mangroves. It is the measurement of the content of the living tissue mass present per unit area (e.g., g/m2). Researches across the world prove that mangroves can sequester 5% of the carbon in the atmosphere even though they cover only 0.1% of the earth's surface area (Alongi, 2002). Earlier biomass calculation was possible only by destructive methods or by measuring the trees' biophysical parameters and using species-specific or standard allometric equations. With the advances in remote sensing, methods were adopted to estimate the biophysical properties of vegetation that involve in situ data collection of the biophysical variables and then model the biomass using features derived from remotely sensed satellite images (Hirata et al., 2014; Patil et al., 2014; Green et al., 1998; Kovacs et al., 2009; Fatoyinbo et al., 2008). Studies have been done in the past to link biomass values with values of Vegetation Indices (VIs). VIs are the mathematical transformation of the spectral bands that assess the spectral contribution of objects to different bands of the electromagnetic spectrum (Elvidge & Chen, 1995). VIs minimize external effects such as sun angle, sensor angle, shadow, soil background, leaf and canopy angle and terrain effect (Kasawani et al., 2010). Sarker & Nichol (2011) tested ten slope-based and eleven distance-based vegetation indices derived from the AVNIR-2 sensor. They compared them with other models (texture-based and band ratio-based) for biomass estimation. Eckert (2012) and Zhu et al. (2015) also used VIs derived from WorldView data for mangrove biomass estimation. VIs like NDVI that are computed using red and NIR bands and have been used to estimate vegetation biomass have a limitation of saturating at high biomass levels. Some narrow-band VIs, like TVI and MNDVI, have performed reasonably well in literature in estimating biomass. At high canopy density, narrow-band indices involving red edge bands have been more accurate in estimating AGB (Mutanga & Skidmore, 2004).

Apart from directly using VIs, other features derived from VIs can also be used to study the biophysical parameters. In remote sensing, the texture is a function of spatial variation of the brightness intensity of the pixels (Armi & Fekri, 2019). It is defined as the function of local variance of the spatial resolution in the image and also depends on the size of the objects in the scene (Haralick et al., 1973; Woodcock & Strahler, 1987). Texture of an image depends upon three characteristics: (1) repetition of some local order that is larger than the order’s size, (2) the order consists in the systematic arrangement of elementary parts and (3) the parts have approximately same dimensions in the textured region (Tuceryan & Jain, 1993). Texture analysis in remote sensing depends on the structure and statistics of the pixels (Haralick, 1979). Texture has often played a more critical role than reflectance measurements for high-resolution images (Boyd & Danson, 2005; Podest & Saatchi, 2002; Ulaby et al., 1986; Dell’Acqua & Gamba, 2006). The size and the spacing of the tree crowns determine the texture of an image of a forested area (Nichol & Sarker, 2010).

Several statistical models such as linear regression with or without log transformation (Steininger, 2000; Calvao & Palmeirim, 2004); multiple regression models (Eckert, 2012; Dobson et al., 1995; Hyde et al., 2007); nonlinear regression (Santos et al., 2003); artificial neural networks (Foody et al., 2001; Zhu et al., 2015) and semi-empirical models (Castel et al., 2002) have been used in earlier studies to model the AGB. The multiple regression model is considered the best among the other models as it has proved to reasonably establish some relationship between the field biomass and remote sensing-derived information (Hame et al., 1997; Kurvonen et al., 1999; Hyde et al., 2007; Nichol & Sarker, 2010; Sarker & Nichol, 2011).

The main objective of our study is to model the Above Ground Biomass of Bhitarkanika mangroves using features derived from remotely sensed data of three different sensor types of namely high resolution multispectral, coarse resolution multispectral and high spatial and spectral resolution hyperspectral data. It compares the results obtained by applying the multilinear regression machine-learning algorithm to different feature combinations of reflectance bands and their textures. The study investigates the importance of textural features in relation to high spatial resolution imagery in modeling biomass. This may help in the future to accurately estimate mangrove biomass remotely. The study also inspects the role played by the spatial and spectral resolutions in modeling biomass. Attempts are also made to combine features from all the datasets using a multi-sensor approach to assess improvement in biomass estimation. In this process, we used various feature selection methods and compared their feature selection potential in getting the best possible outcomes.

Study Area

This study explores the relationship between the ground truth biomass data and the satellite-derived features of the mangroves of the Bhitakanika National Park, a 145 km2 large national park in the Kendrapara district, Odisha, India. Our study area extends from 20° 38′ 19″ N to 20° 47′ 27″ N latitudes and 86° 49′ 26″ E–87° 05′ 48″ E longitudes. In 1998, this area got the designation of National Park, and in 2002, it obtained the status of a Ramsar site. Three major rivers of northeast India—Brahmani, Baitrani and Dhamra; combinedly make up its vast and fertile alluvial deltaic region. It is rich in biodiversity and houses rare and endangered species. It houses the Olive Ridley sea turtles and is also the breeding place of endangered saltwater crocodiles. It serves as the habitat of more than 200 species of birds and is a host of several migratory birds. Among the numerous reptiles, mammals and vertebrates, commonly found are the cobras, the pythons, otters, spotted dear, the endangered water monitor lizards, Fishing Cat, etc. (Ravishankar et al., 2004). Its proximity to the Bay of Bengal makes its soil saline, making the area rich in different species of mangroves. The sanctuary also has backwater, mud flats, tidal creeks and distributaries of many rivers and ramifying streams, making it a land of unique flora and fauna. It has 32 true species and 46 associated species of mangroves. The major species found here are Avicennia marina, Avicennia officinalis, Ceriops decandra, Exoecaria agallocha, Heritiera fomes, Kandelia candel, Sonneratia apetala, Sonneratia caseolaris, Xylocarpus granatum,, and Xylocarpus moluccensis. The only endemic species present in the sanctuary is Heritiera kanikensis (Majumdar & Banerjee). Semi-diurnal tides inundate the area (Fig. 1).

Fig. 1
figure 1

Study area map showing Sentinel 2 false color composite (NIR, Red and Green bands) of Bhitarkanika, Odisha with locations of field plots

Materials

Satellite/Sensor Images

Features for developing the biomass model were derived from three different satellite images (Table 1). First is a coarse resolution image from the multispectral Landsat 8 Operational Land Imager (OLI) hereafter named as LO acquired on 26th November 2015. It provides data in 10 multispectral bands with a spatial resolution of 15 m for the panchromatic band to 30 m and 100 m for the multispectral bands. The second image is from high spatial and spectral resolution—Airborne Visible Infra-Red Imaging Spectrometer—Next Generation (AVIRIS-NG), a hyperspectral optical sensor of Jet Propulsion Laboratory (JPL), NASA, that was on board an ISRO B200 aircraft and a part of the ISRO-NASA airborne campaign. These data, hereafter named as AN in this manuscript, provide high-resolution airborne imaging over broad spectra in very narrow wavelength channels, the spectral resolution being 5 nm and spatial resolution of about 4 m over 420 narrow spectral bands. The narrow bandwidth gives precise information about subtle changes in the reflectance and absorption features (Chaube et al., 2019). AN data for the study area was acquired on 26th and 28th December 2015. We also utilized a medium-resolution Copernicus Sentinel-2 Multispectral Instrument (MSI) image hereafter named as SM, of 1st January 2016 that was close to AN acquisition time, provides spatial resolutions varying from 10 to 60 m over broader multispectral channels. SM imagery collects data in thirteen multispectral bands, of which three are vegetation red edge bands, one is NIR (Near-InfraRed) band and three are SWIR (Short Wave InfraRed) bands (one is SWIR-cirrus) that help in the detailed monitoring of vegetation and parameter generation.

Table 1 Spectral specifications of LO, SM and AN dataset used

In situ Data

The field biomass variables from sample plots were collected during two time periods—December 2012 and April 2013. The in situ data were collected from 42 stratified sample plots selected based on the existing mangrove community map (SAC, 2012) distributed in the mangrove forest of Bhitarkanika. The sample plots were of size 10 m × 10 m. All sample plots were selected far from roads, creeks, or other interventions, well within the vegetated area. Biophysical parameters like DBH was measured using measuring tape and tree height was measured using the Leica Disto D8 LASER distometer. The number of trees in each species was recorded with the input of field experts from the Forest Department, State Government of Odisha, India.

Methodology

Preprocessing of the Images

AN is an imaging spectrometer, and it needs to be spectrally and radiometrically calibrated prior to data analysis. According to NASA—‘All AVIRIS-NG data is processed by the AVIRIS-NG instrument ground data system (IGDS) at JPL.’ This rectifies the errors due to the aircraft’s motion and frees the data from geometric and radiometric errors caused due to the atmospheric effects. The orthorectified and atmospherically corrected reflectance data by using ATREM with simultaneous three phase H2O removal from AN data product is used in this study (Bue et al., 2015). The study area is covered under five adjacent AN image scene. Hyperspectral data comes in several narrow contiguous bands; not all bands contain extractable information. The bad bands were manually checked and removed before working with the data. Further, considering the computational complexity of the hyperspectral aerial dataset, the image features are indented to be derived from the local neighborhood region of sample plots. Hence, a buffer region of 200 m around samples are considered for further analysis. Histogram matching was performed for all the five scenes prior to the mosaicking of the scenes. Google Earth Engine was used to get the preprocessed reflectance data of both Landsat-8 (LO) and Sentinel-2 (SM).

Feature Extraction

The images are processed, and the features are extracted from each of the three images for further processing and analysis (Fig. 2). The extracted features are a combination of (1) reflectance bands, (2) vegetation indices, (3) band ratios and (4) GLCM parameters of a set of above.

Reflectance Bands

The bands mentioned in Table 1 for SM and LO multispectral imagery have been considered for the data analysis. AN has 420 bands, and using all the bands in the study can make the feature size bulky and hard to process. Moreover, all the bands may not contribute to the estimation of mangrove biomass. A total of 12 reflectance bands were selected which are represented by vegetation indices and the highest slope change from the first derivative of the spectra (Prasad & Gnanappazham, 2014) that include wavelengths of 552, 682, 707, 717, 862, 972, 1083, 1193, 1263 and 1473 nm.

Vegetation Indices

In the present study, 12 Vegetation Indices (VIs) are calculated from the SM and 11 from LO multispectral images to model the biomass using multiple regression analysis. AN being hyperspectral data has a greater number of narrow bands. Thus, we have chosen VIs that are common to our other two datasets and additional VIs corresponding to the reflectance bands selected for our hyperspectral imagery. A total of 20 VIs are calculated for the AN image.

Textural Features

Even though VIs reduce the atmospheric effects and noise, they often suffer from the problem of saturation at high biomass levels. So, we try to explore the potential of textural parameters to improve the biomass estimation (Kuplich et al., 2005; Prasad, 2016).

Grey Level Co-occurrence Matrix (GLCM) extracts statistical measures from pixels of an image determined by the spatial relationship between neighborhood pixels with a specific intensity. The selection of moving window size is an important factor because the small window size exaggerates the local variance, whereas the large window size may not extract textural information because of the over-smoothening of the textural variation (Chen & Gong, 2004). This study used small moving window sizes of 3 × 3 to estimate the 8 GLCM parameters—Mean, Variance, Homogeneity, Contrast, Dissimilarity, Entropy, Angular Second Moment (ASM) and Correlation from the following (Supplementary material. 1).

  1. (i)

    LO—(a) 6 reflectance bands, (b) 15 simple band ratios, (c) 11 Vegetation Indices

  2. (ii)

    AN—(a) 12 reflectance bands and (b) 20 Vegetation Indices.

  3. (iii)

    SM—(a) 9 reflectance bands, (b) 28 band ratios and (c) 12 vegetation indices.

The 12 reflectance bands and the VIs with each of their 8 GLCM parameters were considered in our study, giving a total of 288 features for the AVIRIS-NG dataset. The total number of features extracted for LO was 288 and for SM was 441. The entire output of the three sensor data was analyzed individually and combined manner after feature selection methods (Fig. 2).

Fig. 2
figure 2

Flowchart showing the methodology for independent and multi-sensor approach for biomass modeling

Feature Selection Methods

With the available number of samples and more degrees of freedom, there are chances that the model learns noise in the data instead of the underlying patterns and relationships. To avoid overfitting and to improve the model, we adopted the following steps: (a) Feature reduction, (b) Use of a simpler model, (c) Regularize the model and (d) Cross-validation in each of the following methods.

The best feature selection algorithms are those that are stable to the addition or removal of training samples (Chandrashekar & Sahin, 2014; Haury et al., 2011; Dunne et al., 2002; Kalousis et al., 2007; Somol & Novovicova, 2010; Yang & Mao, 2011). We had limited samples, so we made statistical scores of the regression model as our judging criteria. We used the following Python-based feature selection techniques.

  1. (a)

    Filter Methods: In this method, the selection of each feature is evaluated individually. (1) Correlation Feature Selection (CFS): Correlation statistics gives positive scores for features. The greater the score, the greater the relationship with the dependent variable. (2) Mutual information (MIFS): It is the application of information gain to feature selection. It measures the reduction in uncertainty for one variable given a known value of the other variable.

  2. (b)

    Wrapper Methods: The search algorithm wraps the predictor to find the subset with the highest predictor performance (Chandrashekar & Sahin, 2014). Recursive feature elimination (RFE) technique is computationally easy and fast when compared to the sequential forward and backward selection and exhaustive elimination methods (Venkatesh & Anuradha, 2019). This algorithm runs recursively, considering the smaller and smaller set of features. Their importance is obtained using the feature importance attribute. The minimum required features are defined by the user.

  3. (c)

    Embedded Methods: These are the methods that are complete on their own, i.e., feature selection is a part of the algorithm. Variables are selected without splitting the data into training and testing subset (Chandrashekar & Sahin, 2014; Guyon & Elisseeff, 2003; Langley, 1994; Blum & Langley, 1997). We used a regularization method called the Least Absolute Shrinkage and Selection Operator (LASSO). It is a regression analysis method that adds a penalty to the least informative features to discard them thus avoiding overfitting.

  4. (d)

    Adaptive Heuristic Search Algorithm: They are a time-saving approach to classical algorithms. Decision is made at each branching step by ranking the alternatives. Genetic Algorithm (GA) is one of the most advanced methods belonging to the class of Evolutionary Algorithms (EA). It is a stochastic method for feature optimization based on natural selection and genetics. It is a powerful selector (Salcedo-Sanz et al., 2004; Huang et al., 2007). GA uses three operators: selection, crossover and mutation to improve the quality of solutions. The estimator used here is linear regression, and the selector is GeneticSelectionCV. The scoring is based on the negative of the mean squared error.

Based on the above set of algorithms, we adopted CFS, Select K Best CFS, Select K Best MIFS, LASSO Regularization, RFECV and Genetic Algorithm in selecting the features from the Individual dataset and combined dataset.

Biomass from In Situ Data

To model the biomass of mangroves using remote sensing data, field biomass values of 42 sample plots were estimated using non-destructive methods since Bhitarkanika, being a National Park and Saltwater Crocodile Sanctuary, is protected by law. The biophysical measurements taken were used to estimate the plot biomass by the non-destructive sampling method using four species-specific and a common allometric equation (Table 2).

Table 2 Species-specific allometric equations used for the calculation of plot biomass

Statistical Analysis

The features’ values of the three sensor type data were extracted for the co-located 3 × 3 kernel size of the field sample plots. Stepwise linear regression was performed to gauge the relationship between biomass and combined features for each dataset. We performed multiple regression analyses to model the AGB collected from in situ data against the features acquired from the 3 datasets namely (1) independent features extracted from individual data (2) Combination of features of each sensor (Combine feature set) (3) Combination of features from all 3 datasets, a multi-sensor approach (Combined Dataset). Stepwise regression was performed and a total of 48 features were obtained from all three sources combined after selecting the most informative features from all three datasets independently. All the regression analyses were carried out in python. The models for all three datasets were run for hundreds of times, dividing data into random training and test samples in the ratio of 80:20 to derive the regression coefficients for biomass estimation.

Results

AGB of mangrove forests of Bhitarkanika National Park was modeled using biomass estimated from the structural properties of the trees measured from 42 plots using allometric equations against varying spectral features derived from three sensor data namely SM, LO and AN having varied spectral and spatial resolution. In addition to the reflectance bands of each dataset, various other features such as band ratios, VIs and textural parameters were extracted to model the AGB based on the results of our earlier studies (Prasad & Gnanappazham, 2018).

Performance of the Independent Input Features from Each Satellite Data

When plot biomass was regressed with spectral reflectance, band ratios and VIs, band ratios of SM multispectral data performed better (R2: 0.69) than LO multispectral data (R2: 0.65) followed by VIs of AN. However, the LO Band Ratio has been modeled with the least RMSE of 321.80 t/ha. In general, band ratios reduce dispersion in simple reflectance bands caused by solar illumination, soil background and topographic influences while also boosting spectral responsiveness from foliage. However, when textural parameters are introduced, the texture of VIs from LO yielded a better model (R2: 0.44) followed by the texture of reflectance from SM (R2 = 0.41). Textural parameters of reflectance, Band Ratios and VIs of all three sensor data were not found to improve the estimation when analyzed independently. Further, very high resolution AN was not found to give better results compared to medium and coarse resolution multispectral data. One of the reasons of this result could be the influence of textural transformation of the diverse canopy structure of very high-resolution data to model the biomass (Table 3 ).

Table 3 Model fitting parameters derived from the results of biomass estimation using simple reflectance, band ratios, vegetation indices and textural parameters

Performance of the Combination of Input Features for Each Satellite Data

Since independent feature sets did not result satisfying outputs, we attempted modeling using a combination of all features (Combined feature set) like textural parameters of reflectance, band ratios, and VIs of each dataset LO, SM and AN with biomass. In this process about 20, 19 and 9 features were selected out of the combined feature set of AN, SM and LO, respectively (Table 4).

Table 4 Selected features from combined feature set

The results of step-wise linear regression analysis using a combination of all the six types of features found to improve the results for individual dataset in terms of RMSE rather than R2 (Table 5). Our dataset has varying characteristics like LO is coarse resolution (multispectral) data, AN is high spatial and high spectral resolution (hyperspectral) data and Sentinel is medium spatial—spectral resolution data. Out of them, the model using SM is found to give better results (R2: 0.85) with lesser RMSE (195.49 t/ha) than AN and LO showing similar results. It could be noted that the model has also retained more features from SM than AN and LO.

Table 5 Regression output for combined feature set

Performance of the Combined Feature Set from Combined Sensor Datasets

When the 48 selected features from all three datasets (LO, SM and AN) combined using different feature selection methods most of the methods selected a similar set of features from each data set (Table 6). Among them, K best MIFS selected a minimum of 4 features from AN and LO, while LASSO regression was not able to reduce the features (selected 40 features) from all three datasets. About 6 features were selected by RFECV from SM and LO ignoring AN.

Table 6 Identified features from regression analysis on Combined Datasets

While analyzing the model performance of the features selected by each of the methods, the least R2 value was yielded by K Best CFS (0.15) with RMSE: 515.8 t/ha and the best out of 0.7 using GA (RMSE: 244.86 t/ha). Even though the features selected by RFECV could achieve an R2 value of 0.48, the RMSE was found to be the lowest (195.778 t/ha) among all methods (Table 7).

Table 7 Regression output for selected features from Combined Datasets

The importance score of the Combined feature set for the analysis of the Combined dataset for all three data (Fig. 3) show that LO, A20 have the maximum score of 0.38 each and S16 with 0.28, respectively. Also, features from all sensors have comparatively similar scores as could be seen from the figure.

Fig. 3
figure 3

Importance score of the Combined Data Set of AN, SM and LO

Discussion

It is well proven that textural parameters of multispectral satellite data yield better biomass estimation with more regression coefficient (Eckert, 2012; Zhu et al., 2015, Fuchs et al., 2009; Prasad & Gnanappazham, 2018). In our study on comparing varying spatial and spectral resolution also resulted that the textural features of all the basic features have been selected as the most contributing features for biomass estimation (Table 4) other than a few reflectance and a VI. However, when Individual data sets of each sensor was studied, the medium resolution multispectral data of Sentinel 2 was able to model with a higher coefficient of determination and lesser error (than coarse resolution (Landsat 8 OLI) and fine resolution hyperspectral data (AVIRIS-NG) (Fig. 4). Another important observation made was that all independent data sets were over estimating the biomass, while combined dataset estimates were closer to original though they are also overestimating. Notably the higher biomass values are mostly underestimated again substantiating the limitation of optical data. Sample Biomass map estimated using SM data of the study area is given for reference in Fig. 5. The multi-sensor approach could confirm the results what we were achieving independently for both hyper-spectral and multispectral data in terms of R2 and RMSE in the estimation of biomass. All the feature selection methods had selected features from all three dataset except Selected K Best MIFS that yielded the least R2 value. GA, the best of the models show that features from all sensors have equally contributed which is also evident from Fig. 3.

Fig. 4
figure 4

Comparison of estimated biomass from Combined features set and actual biomass estimated using allometry methods for a AN, b LO, c SM and d Combined dataset

Fig. 5
figure 5

Above-ground biomass estimated in the forest cover using combined feature set of Sentinel 2 Multispectral data

There are major challenges in estimating the biomass of mangroves using remote sensing methods. First, destructive methods from the sampled trees cannot be used to estimate the biomass due to the Act of the conservation and protection of National Parks and even any mangrove vegetation. Second, the availability of global allometric equations for estimating the ecologically sensitive vegetation types; (modeling of mangrove biomass is still dependent on allometric equations of global average only; Komiyama et al., 2005). Additionally, the species-specific allometric equations are available only for a few species. The present study of AGB estimation was also carried out with the above-mentioned challenges and lack of sufficient sample data on the mangrove structural parameters.

Beyond the data constraints, modeling the AGB of the mangroves using remote sensing data is also subjected to the prevailing structural complexity of the tropical evergreen forests (Sarker & Nichol, 2011). Utilizing or integrating Synthetic Aperture Radar data would probably lead to improved biomass modeling (Vaghela et al., 2021). Multifrequency analysis of Radar data was also found to be successful, wherein the canopy parameters sensed by low frequency radiations while sub canopy parameters by longer wavelengths making it effective in understanding the structure of the vegetation and its biomass (Proisy et al., 2003). Hence, a pertinently integrated framework of multisensory data will be able to overcome the uncertainties in the estimation of AGB of mangroves. Nevertheless, the uncertainties arising out of the non-availability of local specific allometric equation, usage of allometric methods for the actual biomass estimation need to be addressed for the reliable estimation of AGB using remote sensing methods.

Conclusion

Our study was a preliminary study that shows the trend in the modeling of AGB using features from different types of sensors—multispectral and hyperspectral independently as well as in a multi-sensor approach. Site-specific results show that the multi-sensor approach has not improved the biomass estimation of the independent sensor. Use of Visible and Near InfraRed range of spectrum would have resulted in similar results, however, the features selection method of the Genetic Algorithm on medium resolution multispectral data has given the best result in our study. The research also indicated that the incorporation of textural features improved the R2 values. This can be as texture might link to the fractional vegetation canopy (FVC) of the tree, which is now known to be an estimator of biomass. The future studies may integrate data from more sensors like SAR and LiDAR to have a better understanding of the multi-sensor approach for the estimation of AGB of mangroves.