Introduction

Forests are crucial for ecological functions, by regulating the climate because it offers a significant potential to capture and hold carbon (C). Forest biomass accounts for the largest terrestrial carbon pool (Zhao and Zhou 2005; Tan, et al. 2007; Kumar et al. 2011; Borah et al. 2013). When forests are cleared or degraded by anthropogenic actions and natural disturbances a large amount of carbon is released into the atmosphere (Gibbs et al. 2007). The growing concentration of Carbon dioxide (CO2) and other greenhouse gases have raised concerns of global warming and climatic changes (Thakur and Swamy 2010). Forest covers nearly one-third of the total global land area and store a vast amount (289 Gt) of atmospheric carbon in their biomass alone (FAO 2010). Recently, the estimation of forest biomass has received considerable interest for both practical forestry issues and scientific purposes (Tan et al. 2007; Srinath 2008; Akhavan and Kia-Daliri 2010; Devagiri et al.2013). The Kyoto Protocol to the United Nations Framework Convention on Climate Change (UNFCCC) has recognized terrestrial vegetation as significant sinks of atmospheric carbon dioxide (USAID, 2009). The UNFCCC introduced “Reducing Emissions from Deforestation and Forest Degradation” (REDD 2009) mechanism which is used to create a financial opportunity for developing countries for the carbon stored in forests in the interest of reducing carbon emissions and sustainable forest management (Cerbu et al. 2011). All the greenhouse gas inventories and emissions reduction programs require reliable, accurate, cost-effective and scientifically robust methods for measurement and monitoring of forest carbon storage (ANSAB 2010; Mohanraj et al. 2011). Therefore, it is crucial to precisely estimate the aboveground forest biomass/carbon in quantifying carbon stocks and fluxes.

There are different methods available to measure AGB but forest inventories and remote sensing (RS) are the two principal data sources which are widely used to estimate AGB (Krankina et al. 2004). The estimations of above-ground biomass (AGB) of forested areas are essential to address various questions like estimations of forest productivity (Chirici et al. 2007; Palmer et al. 2009), to determine the global carbon balance (Hese et al. 2005; Devi and Yadava 2009;) or studying the impacts of forest fires or other disturbances (García-Martín et al. 2008). The traditional techniques based on field measurements are the most accurate for biomass estimation which consist in the implementation of forest inventories where a statistical sampling design is established to acquire data from stand level management inventories (Tiwari and Singh 1984; Srinath 2008; Devi and Yadava 2009; McRoberts et al. 2010), however is not a practical approach because it is extremely costly, destructive sampling of trees, time consuming and labour intensive (Brown 2002). Many previous studies (Foody et al. 2003; Lu 2005; Maynard et al. 2007; Dadhwal et al. 2009; Kale et al. 2009; Thakur and Swamy 2010; Mohanraj et al. 2011; Kumar et al. 2011; Devagiri et al. 2013) have shown that satellite spectral information (spectral band, band ratios, band transformations, etc.) have good correlation with forest biomass and when combined with field measurements, can be used for the estimations of above ground biomass (AGB).

In recent years, many studies have been conducted using geostatistical approaches to predict continuous forest variables (e.g. basal area, density, LAI, tree height, standing volume, above-ground biomass, productivity, etc.) using remote sensing data (Berterretche et al. 2005; Sales et al. 2007; Meng et al. 2009; Akhavan and Kia-Daliri 2010). Geostatistical techniques have become popular tools and are widely used in several studies for modeling spatial structure of ecological data (Burrough 1986; Nanos et al. 2004; Berterretche et al. 2005; Maselli and Chiesi 2006; Akhavan and Kia-Daliri 2010). Co-kriging (CoK) is used to improve spatial predictions of forest biomass by integrating GPS, ground inventory data, remote sensing, and GIS.

Study Area

Ratnagiri is a coastal district of Maharashtra state lying between 15°40′ and 18° 5′ north latitude, and 73° 5′ and 73° 55′ east longitude shown in Fig. 1. Ratnagiri is surrounded by Satara, Sangli and Kolhapur districts in the east, Raigad district in the north, the Arabian Sea in the west and Sindhudurg district in the south. It has a geographical area of 8,208 Km2. The annual average precipitation is 3,188 mm with few months of very high rainfall. The maximum temperature rarely goes beyond 38 °C in the coast to 40 °C in the interiors. The forests of this district fall into two distinct types tropical moist deciduous forests and sub-tropical ever-green forests. It has a forest area of 64 Sq.km. The chief tree species occurring in this region are Bibba (Semecarpus anacardium), kinjal (Terminalia paniculata), Kokam (Garcinia indica), Amba (Mangifera Indica), Ain (Terminalia tomentosa), Apta (Bauhinia racemosa), Umbar (Ficus racemosa), Kusumb (Schleichera oleosa), Kumbha (Careya arborea), Asana (Bridelia retusa), Anjani (Memecylon edule), Atak (Flacourtia Montana), Jambhul (Syzigium cumini), Nilgiri (Eucalyptus globules), Suru (Casuarina equisetifolia) and Supari (Areca catechu).

Fig 1
figure 1

Location map of study area with sampling points

Materials and Methods

Field Data

In the present study stratified random sampling were used for estimation of biomass in which 59 plots were laid down in different homogenous strata depending on the NDVI values provided by National Remote Sensing Center (NRSC). In the present study, it was envisaged to do sampling of 0.01 % of the area, keeping in view of the time, money and availability of other resources. This sampling strategy resulted into laying out of 59 plots randomly probability proportion to size (PPS) distributed in different homogenous strata depending on the accessibility of location. The size of quadrant was determined based on species area curve (Fig. 2) where the number of species became constant for sample plot size of 30 × 30 m. The plots geographical locations were recorded by using Global positioning systems (GPS) and Survey of India topo-sheets (47 F, 47 G and 47 H) and structured in a geographic information system (GIS) database. The sample plot size is set to 30 × 30 m for trees, a 5 × 5 m subplot is nested to gather information on shrubs and a 1 × 1 m subplot for herbaceous species as shown in Fig. 3. The diameter at breast height (DBH) of all trees of ≥ 10 cm DBH and height of all trees in the plot were measured with help of tapes and hypsometer respectively. The forest stand parameters such as Diameter at Breast Height (DBH) and height were used for volume estimation by applying volume equations. The volumetric equations developed by Forest Survey of India (FSI 1996) were used to transform the volume of trees into biomass by multiplying it with specific gravity (FRI 1996). The sample of shrubs and herbs collected from sample plot were oven dried and dry weight was estimated. The Above Ground Biomass (AGB) was calculated for different components e.g. trees, shrubs and herbs for each plot-wise. The sum of trees, shrubs and herbs layer biomass (t/ha) were taken as total AGB of the plot.

Fig 2
figure 2

Species area curve

Fig 3
figure 3

Field plot design

Remote Sensing Data

The remote sensing imagery Landsat Thematic Mapper (TM), October 2009 was used in the study. All images were processed in ERDAS Image Processing software 2011.

Vegetation Indices

Vegetation indices are the mathematical transformation of the original spectral reflectance which are most commonly used to produce estimates of biomass (Hurcom and Harrison 1998; Foody et al. 2003; Zheng et al. 2004; Schlerf et al. 2005). The six vegetation indices used in this study are shown below in the Table 1. Normalized difference vegetation index (NDVI), Renormalized Difference Vegetation Index (RDVI), Modified Simple Ratio (MSR) and Difference Vegetation Index (DVI) are slope based vegetation indices which are widely used. Modified soil adjusted vegetation index (MSAVI) and optimized soil adjusted vegetation index (OSAVI) are soil adjusted vegetation indices which are used for reducing the effect of soil background reflectance.

Table 1 Equations of the vegetation indices

Correlation Analysis

Correlation analysis measures the degree of association between two or more variables. In this research, correlation analysis was used to assess the correlation between different vegetation indices and above ground biomass (AGB) from sample plot. A comparative study of different R2 was done to obtain best fit amongst different vegetation indices with AGB from sample plots.

Geostatistical Approach

Geostatistical methods are based on statistical models which are useful in modeling spatial variability through prediction and simulation (Goovaerts 1997; Deutsch and Journel 1998). It involves of fitting a mathematical function to a specified number of points,or all points within a specified radius, to determine the output value. It is a multi- step process, it include exploratory statistical analysis of the data, variogram modeling and creating a surface from sample points. Semivariogram has been used for fitting of the data. Based on the semivariogram, the geostatistical process derives optimal linear unbiased spatial prediction methods by minimizing mean-squared prediction error.

The exponential model was used as the model type. This model is applied when spatial autocorrelation decreases exponentially with increasing distance. Here the autocorrelation disappears completely only at an infinite distance. The exponential model is also a commonly used model. The choice of which model to use is based on the spatial autocorrelation of the data and on prior knowledge of the phenomenon. The number of lags for the Variogram was 12.

In this study, co-kriging has been used for generating biomass layer. The co-kriging method is discussed in detail below. Co-kriging (CoK) is an extension of kriging. It uses multiple datasets and is very flexible, allowing you to investigate graphs of cross-correlation and autocorrelation. Co-kriging is a very versatile and rigorous statistical technique for spatial point estimation when both primary and auxiliary attributes are available. Co-Kriging uses statistical models that allow a variety of map outputs including predictions, prediction standard errors, probability, etc.

The best predictor along with geostatistical methods (Co-Kriging) were used to model biomass in the study area and to create continuous biomass maps. This systematic geostatistical approach is summarized in a flow chart (Fig. 4), which considers the associations between biomass and vegetation indices into the process of spatial prediction.

Fig 4
figure 4

Methodology Flowchart

Results & Discussion

Descriptive Statistics of Study Sites

The descriptive statistics of above-ground biomass, measured during field work are shown in Table 2. The study area has biomass per hectare ranging from 0.23 to 250.69 t/ha.

Table 2 Statistical characteristics of sampling plots

Correlation Analysis Between Above-ground Biomass and Remote Sensing Data

The results of correlation analysis between above ground biomass (AGB) and vegetation indices are presented in Table 3. The linear model function was used to obtain best fit correlation coefficients. The best fit correlation was seen in NDVI with a coefficient of determination (R2) value of 0.610 shown in Fig. 5. The result shows that there is good correlation between biomass and vegetation indices. The NDVI vegetation index showed the best regression with above ground biomass (AGB), hence it was subsequently used as predictor in geostatiscal prediction method.

Table 3 Correlation coefficient (R2) between vegetation indices and biomass
Fig 5
figure 5

Relationship between NDVI and above ground biomass

Predictive Modeling

In the current study based on the field inventory data, cokriging method was applied to estimate the biomass in which vegetation indices derived from satellite data were used as predictor variables for spatial prediction. The Geo-statistical analyst of Arc-GIS 9.3 software has been used for the formation of biomass layer using Ordinary Co-Kriging method. In Co-Kriging two datasets are used. The first dataset taken is that of biomass values and second dataset used was that of NDVI values. The best fit correlation was seen in NDVI with a coefficient of determination (R2) value of 0.610 shown in Fig. 5. The result shows that there is good correlation between biomass and vegetation indices. The NDVI vegetation index showed the best regression with above ground biomass (AGB), hence it was subsequently used as predictor in geostatiscal prediction method so NDVI was used for second dataset. Box- Cox transformation was applied to enable the dataset to exhibit a normal distribution. The second order polynomial was used for removing the trend in the case of both the datasets. The biomass value dataset was used as variance and the NDVI as Covariance. The exponential model was used as the model type. This model is applied when spatial autocorrelation decreases exponentially with increasing distance. Here the autocorrelation disappears completely only at an infinite distance. The exponential model is a commonly used model. The choice of which model to use is based on the spatial autocorrelation of the data and on prior knowledge of the phenomenon. The number of lags for the variogram was 12. The Fig. 6 and 7 represent the semi-variogram, the covariance and the predicted value graph for Co–Kriging. The regression function applied is shown : Regression Function: 0.295X + 15.281. Based on the regression equation, y = 0.295X + 15.281 the model was run with on the Landsat satellite image of Ratnagiri district study area for biomass mapping. The result of the biomass spatial prediction map is discussed below and presented in Fig. 8.

Fig 6
figure 6

Semi-variogram for biomass

Fig 7
figure 7

Graph of predicted values for biomass

Fig 8
figure 8

Above ground biomass (ton/ha) prediction map

Model Evaluation of the Spatial Prediction Methods for Above-ground Biomass

In this study, geostatistical model was developed and applied for biomass prediction. It is necessary to validate the model and to check the efficiency. We applied cross-validation to assess the performance of cokriging in estimating the spatial distribution of biomass. The basic statistics such as mean error (ME), average standard error and the root mean squared error (RMSE) shown in Eqs. 1,2 and 3 were applied to check the efficiency of model in spatial predictions of biomass in this region. These statistical methods measure the difference between the known data and the predicted data, thus assess the performance and accuracy of model predictions.

$$ \boldsymbol{ME}=\frac{1}{\boldsymbol{N}}{\displaystyle \sum_{\boldsymbol{i}=1}^{\boldsymbol{N}}}\left({\hat{\boldsymbol{e}}}_{\boldsymbol{i}}-{\boldsymbol{e}}_{\boldsymbol{i}}\right) $$
(1)
$$ \boldsymbol{RMSE}=\sqrt{\frac{1}{\boldsymbol{N}}{\displaystyle \sum_{\boldsymbol{i}=1}^{\boldsymbol{N}}}{\left({\hat{\boldsymbol{e}}}_{\boldsymbol{i}}-{\boldsymbol{e}}_{\boldsymbol{i}}\right)}^2} $$
(2)
$$ \boldsymbol{Average}\ \boldsymbol{standard}\ \boldsymbol{error}=\sqrt{\frac{1}{\boldsymbol{N}}{\displaystyle \sum_{\boldsymbol{i}=1}^{\boldsymbol{N}}}{\left({\hat{\boldsymbol{e}}}_{\boldsymbol{i}}-{\boldsymbol{e}}_{\boldsymbol{i}}\right)}^2} $$
(3)

where N is size of the sample in the dataset, \( \widehat{e_i} \) is the forecast estimated biomass value, e i is the biomass values measured on the validation plots and \( \overline{e} \) is the mean of biomass values of the sample. The summary statistics of biomass estimation from spatial prediction methods is shown in Table 2. The mean error should ideally be zero if the interpolation method is unbiased. The mean error for biomass estimations is −0.078 which suggest that prediction is slightly biased. The standard errors are accurate when root-mean-square standardized prediction error is close to 1. The root-mean-square standardized prediction error is 0.967 which is close to 1 indicating good accuracy. The root-mean-square error and average standard error should be as small as possible. There is slight difference between predicted and measure value which can be indicated by root-mean square error and average standard error shown in Table 4.

Table 4 Prediction errors for biomass estimation model

Conclusions

The presented work demonstrate systematic approach of geostatistical prediction and mapping by integrating Landsat data, ground inventory, and GPS data for generating estimates of spatial biomass distribution. The maximum value of vegetation biomass was estimated to the value of 250.69 t/ha in semi evergreen forest and minimum biomass value was found to be 0.23 ton/ha in scrub land. Cokriging (CoK) is an optimum method for spatial interpolation to estimate biomass. The result of aboveground biomass was validated using statistical error methods. The result shows that cokriging (CoK) method had smaller statistical error values. Other geostatistical methods such IDW and Spline are deterministic interpolation methods and hence doesn’t provide assessment of errors with predicted values and IDW can produce “bulls eyes” around data locations. Spline are useful for smooth & continuous surfaces such as elevation and water table rather than estimation of biomass distribution, hence cokriging is the most appropriate method among the different kriging methods for spatial interpolation of biomass in this research. The present study can be used as baseline information for further temporal studies of biomass status of the region and in planning of forest and natural resources management.