Introduction

Glaciers are the essential parts of the earth’s cryosphere system. Glaciers are well known as an indicator of climatic variations, as climate fluctuations occur, the glacier also gets affected. In both hemispheres, mountain glaciers and snow cover have deteriorated on average and this worldwide retreat of glaciers and icecaps is making a substantial contribution to sea-level rise (Solomon et al., 2007). Many small glaciers that survived a few decades ago have now disappeared, and many existing glaciers today are more likely to go within a few more years (Zemp et al., 2006). It is the need of the hour to study glaciers and their facies distribution as well. Accumulation and ablation are two major facies/zones of the glacier (Williams et al., 1991). These two prime zones are the concatenation of various small zones formed on the glacier surface by snow accumulation, melting, and freezing, distribution of snow. It is very difficult to carry out field observations at each zone of the glacier, due to heavy glaciated rugged terrain. On the contrary, glacier features and zones can be classified using several machine learning algorithms (supervised or unsupervised) using satellite data (Kaushik et al., 2019). Discrimination of different glacier features is directly or indirectly interlinked with other studies and can be used as a foundation that can be both global and environmental. Glacier facies and features are located at the different parts of the glacier.

Radar backscattering characteristics allow glacier classification (Singh et al., 2011, 2012, 2014; Singh & Venkataraman 2012). Different snow facies have distinct features, like backscattering signatures, density, grain size, and metamorphism (transformation of snow into ice) of snow influence these features (Benson, 1960; Cogley et al., 2011). The accurate mapping of different zones enables the accuracy of understanding climatic variability, weather forecasting, avalanche predictions, hydrological modeling, and mass-balance studies (Shukla & Ali, 2016). The rise in availability and accessibility of remote sensing data especially SAR images has shown the way to develop and employ newly automated and semi-automated techniques for glacier facies/zones classification. In past studies, researchers have used several algorithms for glacier facies identification using optical, infra-red, and radar images. But very limited studies related to glacier facies in the Himalayan Region using SAR datasets have been carried out (Huang et al., 2011; Sood, 2015; Kundu et al., 2015; Das et al., 2018, Pandey et al., 2020). Remotely sensed SAR images are more and more popular to be used for determining the glacier facies in the last decade. Considering the radar imagery over optical data has several advantages like penetration capability, good illumination conditions, and all-weather independence (Lee & Pottier, 2009; Rott & Matzler, 1987; Woodhouse, 2006).

Satellite datasets have been broadly used in the last few decades to delineate glacier features using automated or semi-automated classification methods and band ratios that report very high accuracy (Kaushik et al., 2019). Conventional statistical methods to more dominant machine learning algorithms provide superior versions of solutions to problem in the field of remote sensing for glaciated terrain classification. But the latest achievement of artificial intelligence brings new opportunities to the cryosphere field that is deep neural networks. Pandey et al. (2020) examined the effectiveness of maximum likelihood classifier over other classification methods adopted by many using multi-temporal SAR data for glacier facies mapping over the surge-type glacier in the Karakoram Range (Ahmad, 2012; Murtaza & Rhomsoo, 2014). Nijhawan et al. (2018) reveal the development of a hybrid deep learning framework by utilizing multi-phase deep learning and random forest by the ensemble of convolutional neural networks for delineation of supra-glacial debris lies in the parts of the Alaknanda basin in Uttarakhand. The Samudra Tapu glacier features were identified by utilizing integrated datasets from co-registered optical and thermal datasets into eight different classes using the maximum likelihood classifier, optical data helped in identifying supra-glacial cover types such as supra-glacial debris, ice, snow, mixed ice, and debris whereas thermal datasets used for differentiating non-supraglacial cover classes, like periglacial debris, valley rock, water, and shadow (Shukla et al., 2010). Kundu and Chakraborty (2015) examine Radarsat-2 polarimetric SAR data and time-series data of RISAT-1 MRS and hybrid data having RH/RV mode for the discrimination of different glacier radar facies of Gangotri and Samudra Tapu glacier. The major glacierized terrain features identified in these glaciers were wet or damp snow, debris-covered glacier ice, ice wall percolation area, ice walls, percolation area, and clean glacier ice.

The study reported in (Shukla & Yousuf, 2016) expresses the mapping of debris cover glaciers in Kashmir Himalaya using ASTER data and digital elevation model by the hierarchical knowledge-based classification method. The proposed HKBC approach used optical data having several input layers (such as NDSI, NDWI, and slope) and the overall accuracy of the study was found to be 89%. Bishop et al. (2008) marked the potential of artificial neural networks over the ISODATA clustering algorithm by estimating the different classes of supra-glacial debris loads of some Himalayan glaciers. They inspected the ANN approach for distinguishing spatial reflectance features in high-resolution SPOT panchromatic satellite imagery. In another study, Ban and Wu (2005) also examined in their comparative study for the classification of land use land cover features using ANN and MLC utilizing multi-temporal Radarsat images of the Greater Toronto Area, Canada. The authors claimed that ANN is more robust than MLC as observed in experimentation that the ANN approach provides improved classification accuracies of 4–6% compared to MLC. Shukla and Yousuf (2016) claim that the ANN-based information extraction method (OA-83.74%) performed superior to the maximum likelihood classifier (66.90%) for the mapping of the Kalhoi glacier using Landsat TM data and other ancillary layers. The feasibility of polarimetric SAR satellite imagery in the field of glaciology for glacier zones and features classification has been improved by the rise of machine learning algorithms. Deep learning is one of the most adaptable up-to-date techniques for classification and a component of the broader family of machine learning techniques based on artificial neural networks.

In addition to the typical machine learning algorithms that have been used for glacier features mapping, an attempt have been made thru deep learning; an evolved practice of machine learning primarily founded upon neural networks and architectures with more hidden layers. Deep learning is a rapidly developing area within the fields of remote sensing and geo-informatics (Li et al., 2018; Ma et al., 2019; Zhang et al., 2016). Deep learning, as well as other artificial intelligence-based approaches, attempt to interpret images in the similar manner as a human machinist would, relying not only on pixel values but frequently occurring patterns and textures (Ma et al., 2019). As such, deep learning can be used to extract info from complex conditions where normal classification methods are not satisfactory (Li et al., 2018). Deep Learning algorithms consist of such a diverse set of models in comparison to a single traditional machine learning algorithm. This is because of the flexibility and feasibility that the neural network provides. To the best of our knowledge, there is a literature gap related to combining both “deep learning” and “radar glacier facies mapping” thematic. This study evolves in the direction of deep learning applications for glacier features classification, particularly related to fully polarimetric SAR images. Therefore, this study aims to develop a PolSAR classification algorithm by adopting the deep neural networks and support vector machines in glaciology for glacier facies/features identification. The utility of deep neural networks for multi-class classification using polarimetric SAR imagery has not yet been explored for glacier facies/features mapping. In this study, the deep neural networks-based PolSAR classification was carried out for identifying different glacier facies lying over the terrain. The SVM is selected to compare the results of (glacier facies–deep neural networks) GF-DNN approach because it is one of the robust, efficient, and most commonly used method for glaciology applications and it performs well with smaller training dataset also. The proposed glacier classification algorithm was implemented on the part of Siachen and Bara Shigri glaciers for assessing the performance and results were compared with the outcome of the support vector machines-based algorithm.

Glacier facies and their microwave response

Glacier surface features exhibit a range of zones—dry and wet snow, percolation, superimposed, bare ice, debris cover, etc. To better understand the varying conditions of a glacier’s surface, the glacier surface area can be divided into a sequence of systematic, idealized zones, each of which is demarcated by a set of properties linking to the metamorphism of the snow or ice; zones range from dry snow at higher elevations to melting ice near the glacier snout/terminus (Williams et al., 1991). But not all glaciers will exhibit all zones. Though there are a wide collection of glacier facies, they can be pooled such that the glacier is divided into two larger areas: the accumulation zone and the ablation zone. In the percolation zone, the meltwater percolates a certain distance into the snowpack and refreezes to form inclusions of ice in the form of ice layers, ice-lenses, and pipe-like structures called ice glands. At microwave frequencies, the return of radar signals from ice lenses and ice pipes can be extremely high. However, the radar return is dependent on the topography, snow grain size, and dielectric properties of the terrain. In the ablation area, exposed/bare ice is followed by debris cover zone and dry snow zone generally has not been found in Himalayan glaciers. The different facies identified in the selected study area are given in Table 1 as follows:

Table 1 Glacier facies identified in the selected study area

The backscatter signal received back by the RADAR antenna is the sum of the surface scattering at the air/snow interface and the volume scattering within a snowpack. In the case of dry snow, the volume scattering is the dominant phenomenon that escalates with snow grain size and snow age. As the snow wetness increases the penetration capability of the SAR signal decreases and the volume scattering changes to surface scattering as a dominant scattering. The radar signal can penetrate more in the dry snow than the wet snow. There is a contrast in the backscatter signals between debris cover, bare ice, wet snow, and percolation that makes it possible to distinguish different features. The altitudinal difference and the backscatter characteristics form the source of the detection of various features formed on the glaciated surface. In general, the accumulation zone is comprised of percolation zones and wet snow zones in the case of the Himalayan glaciers where in general the dry snow zone is absent. The ablation zone of the Himalayan glaciers consists of bare ice and debris cover. The efficacy of glaciated terrain classification has associations for many related studies within glaciology. Variations in the extent of different zones are strictly associated with mass balance and, thus, they are of extreme concern to understand dynamics of the glacier and their association to climatic variations.

Study area and dataset used

Siachen glacier

It is located in the Karakoram Range of the North-Western Himalayas. It is the longest glacier in the Karakoram and the second-longest valley glacier in the world, outside the Polar Regions, extending in the northwest-southeast direction and spanning for a length of ~ 76 km. The terminus of the glacier is located at 3670 m.a.s.l., while the highest point in the accumulation zone is located at ~ 7200 m.a.s.l. The meltwater from this glacier feeds the Shyok River (a part of the Indus River system). The location and outline of the glacier are illustrated in Fig. 1a, b, respectively.

Fig. 1
figure 1

Illustration of the study site. a Outline of India imagery with geographic location. b Map of sub-region of Siachen glacier. c Map of Bara Shigri glacier. d, e Training and testing samples selection over the PolSAR images

Bara Shigri

The Bara Shigri glacier is a few hundred kilometers south of Siachen Glacier. This glacier is located in the Chandra River Basin of Lahaul-Spiti Valley. It is heavily debris-covered, with much of it’s in the ablation zone. The elevation ranges from 6250 m.a.s.l. at the highest point in the accumulation zone to 3975 m.a.s.l. at the snout, traversing the distance of 27.2 km between them. The Chandra River, which is one of the tributaries of the Indus River, receives meltwater from the Bara Shigri glacier. The location of the glacier and its outline are signified in Fig. 1a, c, respectively.

Datasets used

The Advanced Land Observing Satellite Using Phased-array L-band Synthetic Aperture Radar (ALOS PALSAR-1 and 2) is an earth observation satellite launched by JAXA (Japan Aerospace Exploration Agency). Both are active microwave remote sensing sensors having the capability to capture data day and night and in all weather conditions. The datasets detail utilized in this study is given in Table 2.

Table 2 Technical details of acquired datasets

Methodology

PolSAR data and its processing

In this paper, fully polarimetric ALOS datasets used to classify the different glacier zones and multi-class glacier features lie in the study area. The SAR data were processed using the SNAP software packages. When an electromagnetic wave interacts with the target surface, it experiences some response and is reradiated back to the sensor; these responses are measured in the form of 2 × 2 scattering matrix [S] which is given by

$$\left[S\right]= \left[\begin{array}{cc}{S}_{hh}& {S}_{hv}\\ {S}_{vh}& {S}_{vv}\end{array}\right]$$
(1)

where each matrix element denotes the backscatter response of the target surface in a particular polarization. The complex backscatter response received in various polarimetric combinations is associated with the electrical and geometrical properties of the target surface. This 2 × 2 complex scattering matrix is not suitable to study natural targets, like snow and glaciers because they demonstrate a variety of scattering responses. In such cases, the information obtained from the scattering matrix is inadequate to describe the physical properties of the target. Therefore, the second-order statistics of the scattering matrix—coherency matrix, utilized for this purpose. This coherency matrix [T] obtained from the scattering matrix by using its vectorized form derived from Pauli vector and can be given by (Lee & Pottier, 2009):

$$\langle \left[T\right]\rangle = \langle {k}_{p}{k}_{p}^{T}\rangle$$
(2)

where \(\langle \rangle\) represents averaging over the whole data, \({k}_{p}\) is Pauli vector given by

$${k}_{p}= \frac{1}{\sqrt{2}}\left[\begin{array}{c}{S}_{hh}+{S}_{vv}\\ {S}_{hh}-{S}_{vv}\\ {2S}_{hv}\end{array}\right]$$
(3)

The coherency matrix [T] obtained from the above is

$$\langle \left[T\right]\rangle = \left[\begin{array}{ccc}\langle {\left|{S}_{hh}+{S}_{vv}\right|}^{2}\rangle & \langle \left({S}_{hh}+{S}_{vv}\right){\left({S}_{hh}-{S}_{vv}\right)}^{*}\rangle & 2\langle \left({S}_{hh}+{S}_{vv}\right){S}_{hv}^{*}\rangle \\ \langle \left({S}_{hh}-{S}_{vv}\right){\left({S}_{hh}+{S}_{vv}\right)}^{*}\rangle & \langle {\left|{S}_{hh}-{S}_{vv}\right|}^{2}\rangle & 2\langle \left({S}_{hh}-{S}_{vv}\right){S}_{hv}^{*}\rangle \\ 2\langle {S}_{hv}{\left({S}_{hh}+{S}_{vv}\right)}^{*}\rangle & 2\langle {S}_{hv}{\left({S}_{hh}-{S}_{vv}\right)}^{*}\rangle & 4\langle {\left|{S}_{hv}\right|}^{2}\rangle \end{array}\right]$$
(4)

Since the SAR images have a salt and pepper appearance called speckle noise, the presence of speckle noise may reduce the classification accuracy. So, to suppress the speckle noise it is necessary to perform multi-look in range direction as well as in azimuth direction. Using the digital elevation model (DEM), the terrain correction was applied to remove the geometric distortions. Further, the classification approaches have been implemented. The detailed architecture of the methodology followed in the work is displayed in Fig. 2.

Fig. 2
figure 2

Pictorial representation for methodology

Proposed classification approach

Two supervised classification approaches SVM and GF-DNN have been used. The number of classes that will classify considered was four using the PolSAR data for the study sites. Since the approaches used are supervised this requires a certain amount of learning data, so it is needed to create a training dataset. Different training sample/area was defined on the image for the different classes and each training area consists of multiple individual regions. The training samples over the study site were selected using ENVI software. The dataset has been divided into training data and testing data. The model learns on training data and is assessed on testing data. The details of both approaches have been discussed in further sections.

Support vector machines

The support vector machines (SVM) is a commonly used supervised machine learning technique in remote sensing applications. Based on the data points/training samples, it creates a hyperplane that serves as a decision boundary. All training points are not used to construct the hyperplane but it only includes the point that lies nearer to the hyperplane and these points are called support vectors. A linear support vector machine may not be able to separate the classes in some circumstances; in these cases, we must map the classes onto a higher-dimensional space to improve class separation. Various kernel functions are employed to map the training samples into higher-dimensional space, allowing for easier fitting of a linear hyperplane (Tso & Mather, 1995; Cortes & Vapnik, 1995). SVM kernel selected is the radial basis function (RBF) utilized to map the classes in a higher-dimensional space to obtain an optimal separation plane in such non-linear classification cases. This function can be mathematically characterized as follows:

$$K\left({X}_{1},{X}_{2}\right)=exp\left(-\frac{{\Vert {X}_{1}-{X}_{2}\Vert }^{2}}{2{\sigma }^{2}}\right)$$
(5)

where σ is the variance and \(\Vert {X}_{1}-{X}_{2}\Vert\) is the Euclidean distance between two points.

For multiclass classification, ENVI's version of SVM employs the pair-wise classification technique. The decision values of each pixel for each class are used for probability estimations in the SVM classification output. ENVI classifies data by choosing the option with the highest probability. Pixels with all probability values less than the threshold can be reported as unclassified using an optional threshold. SVM provides a penalty option that allows for some misclassification, which is especially crucial when working with non-separable training sets. The penalty parameter regulates the trade-off between allowing for training errors and enforcing strict margins (SVM Tutorial, 2020). Repeated attempts with varying C (cost or penalty parameter) and gamma, γ (trial and error method) helps in defining these parameters based on the accuracy of the classified output. An ENVI software used to test the multiple values of variables C and γ, to fine tune the parameters and find optimal values to be used for classification. The definition of the parameters used for SVM approach are provided in Table 3 as follows:

Table 3 Definition of the parameters used for SVM

Deep neural networks

A multi-layered deep neural network (DNN) with error back-propagation is the sort of deep learning architecture used in this study. DNN is a type of neural network that connects various functions in a hierarchically structured network with hidden layers. Nowadays, the classification of features has also profited from the applications of SAR deep learning approaches. There has been a lot of research going on to explore the feasibility of deep learning for several applications such as identifying oil spills, the thickness of sea ice, sea ice classification, and many more. The basic architecture of deep neural networks for glaciated terrain features classification is shown in Fig. 3

Fig. 3
figure 3

Deep neural networks in cryosphere

The number of nodes in the input layer corresponds to the number of input variables chosen. A dense layer called the fully connected layer was used in this study to develop the GF-DNN model. The information travels from the input data to the activation function, the error is calculated and propagated back to the previous levels, and the output is fed at the end of the predefined iteration (Goodfellow et al. 2016). In this approach, until the output layer, each sequential hidden layer mixes values from the previous layer and then learns to build more abstract representations. For each hidden layer in the model, the rectified linear unit activation function has been chosen and softmax activation in the output layer. And to finish it off, the model was compiled with Adam optimizer and using cross-entropy for the loss function. The dense layer in deep learning can be defined as, \(a= activation (w*x+b)\) where ‘w’ is weight, ‘x’ is input, ‘b’ is bias, ‘a’ is output, and ‘*’ matrix multiplication. The definition of the parameters used for GF-DNN approach are provided in Table 4 as follows:

Table 4 Definition of the parameters used for DNN

Keras tuner used for getting the best parameters for the proposed model, i.e., GF-DNN for the highest accuracy. Keras tuning is a package that helps users identify the best hyper-parameters for machine learning and deep learning models. Kernel sizes, learning rates for optimization, and various hyper-parameters may all be found with the package. A python program, which iterates multiple times with different parameter values, to fine tune the parameters was used to find optimal values to be used for classification. A non-linear activation function needs to be introduced at each network layer since the relationship between input and output should be a highly non-linear mapping. Due to saturating non-linearity of other activation functions, ReLU has been involved in this study. The ReLU activation function is given by

$$f\left(x\right)=\mathrm{max}\left(0,a\right) where\; a=\left({w}^{*}x+b\right)$$
(6)

The non-linear activation function allows the backpropagation and also allows to stack number of layers of neurons to create a DNN. Moreover, deep neural networks using ReLU achieved the best performance and considerably takes less training time. The softmax non-linearity is applied to the output layer for multi-class classification. This function will output the posterior probabilities for each possible class. Mathematically, the softmax formula is as follows:

$$\mathrm{softmax}, \sigma {(\overrightarrow{z})}_{i }= \frac{{e}^{{z}_{i}}}{{\sum }_{j=1}^{K}{e}^{{z}_{j}}}$$
(7)

where \({e}^{{z}_{i}}\) is the exponential function applied to each element of the input vector and the \({\sum }_{j=1}^{K}{e}^{{z}_{j}}\) is the normalization form that confirms all the output values of the function sum to one, thus creating the valid probability distribution and, K represents a number of classes in the multi-class classifier. Cross entropy is the loss function for this multi-class classification problem and can be calculated by

$$P= -\sum {Y}_{i}^{^{\prime}}. \mathrm{log}\left({Y}^{i}\right)$$
(8)

where \({Y}_{i}^{^{\prime}}\) are actual probabilities and \({Y}^{i}\) are computed probabilities. Drop-out is an extremely effective and efficient regularization technique to reduce overfitting. The backpropagation is used to train the network and it computes the error term for every unit in the network. The training of BPN will have the following phases as shown in Fig. 4:

Fig. 4
figure 4

Backpropagation—learning

The output signal of the network ‘y’ is compared with the desired output value and the difference is called error signal d of the output layer neuron computed as

$$Loss\ function, \partial = z- y$$
(9)

It is commonly useful to reduce the learning rate during training. In this paper, initially, the learning rate is 0.001 but after certain epochs, it is reduced to 0.01 where epochs represent the number of times each sample has been used in training. As the network gets trained, then it is ready to make predictions on the test data and then the performance metrics of the model have been evaluated which gives the visualization of the performance of the approach.

Accuracy assessment of the classification results

The accuracy of the classifier is assessed to assure its credibility. Among the various available metrics, precision, recall, F1score, overall accuracy, and kappa coefficient is used for assessing classifiers. Overall, accuracy is the most widely used metric which helps in easy interpretation and is effective in accuracy estimation. It expresses the percentage of test data which has been correctly classified by the classifier. The relationship between true positives and the overall number of true positives and false positives is referred to as precision. The number of true positives to the total number of true positives and false negatives is known as recall. The F1 score is the harmonic mean of the precision and recall, and it seeks to characterize their balance. Following are the formulas used to evaluate the performance measures:

$$Precision = (Total\ positive)/(Total\ positive + False\ positive)$$
$$Recall = (Total\ positive)/(Total\ positive + False\ negative)$$
$$F1\ Score = (2*\ precision\ *\ recall)/(precision + recall)$$

Additionally, with an intention to further evaluate the class-level performance of a given classifier, confusion matrices are generated for used approaches. Another aspect of focus is to understand the performance of GF-DNN and SVM models when compared to each other. The relative comparison is performed using Z-score as described in Rossiter (2014). To evaluate the significance of the output of two classifiers, the z-score calculated by the formula:

$$z= \frac{\left|{p}_{1}-{p}_{2}\right|}{\sqrt{{s}_{1}^{2}+{s}_{2}^{2}}}$$

where p1 and p2 denote the proportions of the correctly classified test data of the two classifiers while s1 and s2 represent the standard deviation of their samples. Field collected points reflecting distinct radar facies on the glacier surface should ideally be used to assess classification quality. The accuracy of the classification can be estimated by comparing the information class collected in the field with that derived from the classed output image. Due to the lack of field data for this investigation, a test set is constructed in parallel using a technique similar to that used to generate a training set in ENVI.

Results and discussions

For the mountain cryosphere, machine learning and deep learning models can be applied to classification, feature identification, automatic mapping, and visual interpretation tasks as well as time series simulations. Deep learning methods for cryosphere applications are still scarce. Only 2% of studies using deep learning-based image segmentation and object detection have looked into cryosphere-related issues (Hoeser & Kuenzer, 2020). Machine learning applications on alpine or mountain glaciers emphasis on determining the glacier extent by either extracting the calving front (Baumhoer et al., 2019; Cheng et al., 2020) or detecting the glacier boundary of debris covered glaciers (Khan et al., 2020; Nijhawan et al., 2018; Xie et al., 2020). The potential of deep learning in glaciology in terms of automation, efficient processing, and the reduction of traditionally laborious manual tasks, is slowly being realized, although it still has untapped potential. This study reveals the unexploited potential of deep learning-based glacier facies mapping on the mountain cryosphere using quad-polarimetric SAR datasets.

The support vector machines and deep neural networks-based methods have been applied on fully polarimetric SAR data over the sub-part of Siachen and Bara Shigri glacier for the identification of distinct glacier facies/zones. In general, the radar glacier zones follow the elevational zonation like dry snow, percolation zone, wet snow zone, bare ice zone are distributed successively from higher to lower elevations of glaciers (Partington, 1998). The elevation map is given in Figs. 5b and 6b gives the idea about the appropriate altitudinal pattern for different glaciated features lying over the terrain. The results from radar data analysis show a general trend of facies distribution on the surface of the glacier. In the case of the Siachen sub-part, the facies follow an altitudinal pattern with a high percolation zone followed by mid and low percolation zone located at the higher reaches (dry snow in case it is present) and exposed ice followed by debris cover at the lowest reaches of the glacier. For the Bara Shigri glacier, the pattern is like a percolation zone followed by a wet snow zone located at the higher reaches (dry snow absent) and bare ice followed by debris cover at the lower elevation of the glacier. The slope values are also very high for the accumulation area because of the steep slope and low for the ablation zone for both the glaciers. In the classified output image for Siachen sub-part Fig. 5c, d, four different zones of the glacier (high percolation, mid and low percolation, exposed ice, debris cover) were identified following the elevation pattern.

Fig. 5
figure 5

Sub-part of Siachen glacier: a Slope map. b Elevation map. c Classified map using SVM, area inside the red and green circle shows the misclassification of debris cover class into percolation class and vice-versa. d Classified map using GF-DNN, red and green circles indicate correctly classified area

Fig. 6
figure 6

Bara Shigri glacier. a Slope map. b Elevation map. c Classified map using SVM, pixels inside the yellow boxes shows wrongly classified as debris cover and bare ice at higher reaches. d Classified map using GF-DNN, yellow boxes indicates the correctly classified pixels as debris cover and bare ice at higher reaches

Generally, upper percolation facies was only found over sub-part of Siachen glacier due to the intense freezing conditions that the glacier is exposed to throughout the year. The Bara Shigri glacier is at a lower elevation, it is subjected to milder weather conditions that are not conducive to the presence of upper percolation facies. In the above Fig. 5c, d, both the approaches provide very good assessments of different features, but SVM gives typical misclassification of numerous pixels. The area inside the green and red circle in Fig. 5c shows the several pixels wrongly classified as percolation pixels of debris cover are displayed whereas by DNN these pixels are correctly classified in Fig. 5d.

For Bara Shigri glacier, due to the early summer melting of compact dry snow, the upper percolation zone does not exist in this location. So, the middle percolation zone is the sole percolation facie K is found in the Bara Shigri Glacier. As the seasons change from winter to summer, the temperature rises significantly, causing substantial melting and the disappearance of the lower percolation zone. The accumulation zone that consists of percolation and wet snow facies for Bara Shigri glacier. The ablation zone of the glacier, consisting of debris cover and bare ice. From the classified maps, presented in Fig. 6d, it can be seen debris cover is extensive in case of Bara Shigri glacier. In Bara Shigri output Fig. 6c, the percolation and wet snow pixels inside the yellow box are misclassified as bare ice pixels and grey box are debris cover pixels misclassified as wet snow whereas in Fig. 6d it is improved and shows the correctly identified percolation and wet snow pixels (in yellow box Fig. 6d) and debris cover pixels (in grey box Fig. 6d). It is seen that using SVM, there is a lot of misclassification between the classified pixels but up to a certain extent this is overcome by the DNN approach. To ensure that the results are not biased by using the same ROIs used for classification, new ROIs are employed to compute the confusion matrix. The confusion matrix provides an independent accuracy review of the classification map in this way. In this study, an accuracy assessment has been carried out to test the accuracy level of classified maps generated by SVM and GF-DNN for the datasets used. We found that the overall accuracy of DNN is more as compared to SVM for both the glacier and below are the confusion matrices provided in Tables 5 and 6.

Table 5 Siachen glacier sub-part—confusion matrices: (a) SVM (b) GF-DNN
Table 6 Bara Shigri glacier—confusion matrices: (a) SVM (b) GF-DNN

Precision, recall, and F1-score performance measures have been used to evaluate the performance of the model for the study area. The kappa coefficient was utilized as an accuracy metric in this investigation. The standard values for the kappa coefficient follow: value 0.8 or higher indicates good agreement, 0.4–0.8 indicates moderate agreement, and less than 0.4 indicates poor agreement. Table 7 includes the evaluation metrics for study area images.

Table 7 Performance metrics: (a) sub-part of Siachen glacier (b) Bara Shigri glacier

A combination of high recall and high precision is required for a classifier to be rated highly accurate. Precision and recall are the terms used to describe the user and producer accuracy, respectively (Taubenbock et al., 2011). GF-DNN produced superior precision and recall scores which, as expected, generated a higher F1 score as well. It is to be observed that precision values obtained for different classes are well and good over the study area using DNN. The overall accuracy with GF-DNN is found to be 91.17% and the kappa coefficient 0.88 for sub-part of Siachen glacier whereas in the case of Bara Shigri glacier 89% and kappa coefficient 0.85 which shows a very good agreement between the predicted output and the actual output. Studies by Huang et al., 2013; Sood, 2015; Kundu & Charakborty, 2015; Pandey et al., 2020, show such results for maximum likelihood classifier, random forest, support vector machine and others but the implementation of deep neural networks using fully polarimetric radar data for glacier facies mapping has not been studied so far. In this study, we found that the deep neural network model shows very good convergence behavior for classification accuracy as compared to other machine learning approaches. By comparing the results from the SVM and the DNN, we demonstrates that the DNN yields more accuracy for selected study regions.

To affirm the significance of results and performance between classifiers considered in the study, Z-test is performed. Z-scores indicate the significantly higher performance of GF-DNN over the SVM classifier. GF-DNN performs better than SVM 86.8% for Bara Shigri and 95.9% for sub-part of Siachen glacier and the results of Z-test score between GF-DNN and SVM is 1.75 for Bara Shigri case while for Siachen sub-part the value is 1.12. Accuracies of glacier classified maps also depend on the classifiers. Upon comparison we observed that GF-DNN model performed relatively well for both datasets, while SVM showed a decrease in performance as shown in Fig. 7 below:

Fig. 7
figure 7

Graphical illustration of performance metrics. a Siachen glacier sub-part. b Bara Shigri glacier for individual classes with SVM and DNN

It is largely agreed that SAR images are one of the most appropriate tool in glacier radar facies/zones mapping (Akbari et al., 2013; Fu et al., 2020; Huang et al., 2013; Zhou & Zheng, 2017). Day-night acquisition and cloud free imageries make the SAR applications requisite for glacier studies. Over mountainous terrains, accuracies might be affected by geometric distortions caused by side-looking SAR acquisition geometry. The past and ongoing studies recommends increasing the implementation of SAR data in different glacier studies (radar zones, mass balance, dry or wet ice, surface dynamics, etc.) in the polar and non-polar regions. It can be concluded that geospatial techniques can make the glacier facies studies possible without field survey. However, outcomes from satellite imageries should be substantiated with field observed information.

Conclusion

In this study, we have presented a novel application of the GF-DNN model using quad-pol datasets for the identification of different glacier facies, with better accuracy of classified maps. Both the approaches GF-DNN and SVM were applied on two alpine glaciers (namely sub-part of Siachen glacier and Bara Shigri glacier) and produced a very promising output in both cases using GF-DNN. The major glacier radar facies identifiable and classified using such data and methods are high percolation, mid and low percolation, exposed ice, and debris-cover zone in sub-region of Siachen glacier and Bara Shigri facies are percolation, wet snow, bare ice and debris cover followed by altitudinal pattern. Such information can be used to find the glaciers accumulation and ablation area, which can further be used as input to glacier mass balance models and to understand the dynamics of a glacier.

The accuracy assessment of both methods shows that DNN maps were considered to have better accuracy than the SVM-based classified maps. Though the classification accuracies for every class with DNN are considerably fine still put up with some misclassification that needs further work to be done. However, the accuracy of the model could be enhanced much more with the parametrization of the network because it extremely depends on the structure of the neural network. Assessments over different areas of Himalayan Region ought to additionally be performed further.