Introduction

Due to its low cost and abundance in the nature when compared to other fuels, especially in the case of electricity generation, coal remains one of the most important energy sources (Smith 1997). However, coal mining has created a number of environmental challenges as it causes irreversible damage to the surrounding topography including the water regime. Coal mining causes subsidence, air pollution, and contamination of soil (WCA 2014). Land degradation due to land cover change in the vicinity of mining areas is one of the most important factors affecting ecological systems at the local scale. Monitoring these changes in land cover is essential in mapping the latest developments, especially in case of open-cast mines, where topographical changes are frequent. For effective planning and management of operations, mining areas require thorough monitoring of open-cast pits and degraded lands; remote sensing techniques can be highly instrumental in this regard (Demirel et al. 2011). Sustainable development prioritizing environmental accountability is the need of the hour.

Remote sensing and GIS tools have been used extensively in the mining industry for various purposes such as mineral exploration, modelling and monitoring, mine planning, and environmental impact assessment (Van der Meer et al. 2012; Karan and Samadder 2016). These tools provide a quick and cost-efficient means of mapping large geographic areas (Demirel et al. 2011). With the recent development in sensor and satellite technologies, the latest geospatial information has become more economical and readily available for users to exploit. Image classification is an elaborate procedure that may be affected by several factors, such as landscape, data type, image pre-processing and classification approaches (Lu and Weng 2007). Several image classification techniques are available, each with their own advantages and disadvantages. Support vector machine (SVM) is a recent non-parametric supervised statistical machine learning technique that aims to find an optimal hyperplane (Cortes and Vapnik 1995), which separates the multispectral feature data into discrete predefined clusters consistent with training datasets. SVM-based techniques are distinctly attractive in remote sensing due to their ability to produce higher classification accuracy even with limited training dataset (Mantero et al. 2005).

Demirel et al. (2011) employed SVM to identify, quantify, and analyse the spatial response of landscape change due to mining activities in Turkey. Their results indicated that SVM could be used efficiently in monitoring environmental impacts of mining with several constraints like remote locations in mountainous region and cloud cover. Otukei and Blaschke (2010) evaluated the performance of three different classification algorithms, decision trees, support vector machines and maximum likelihood in assessing the land cover change of Pallisa District, Eastern Uganda. It was reported in their study that although all the three techniques performed relatively well, SVM revealed an improvement of classification accuracy which was probably due to the simplification of vector space needed for development of hyperplanes. Hernandez et al. (2007) evaluated the use of SVM classification in mapping priority habitat. Their results revealed an improvement of about 28 % in the overall classification accuracy when compared to maximum likelihood classification (MLC). They also reported that only a small training dataset was required for the definition of optimal separating hyperplane for isolating a class of interest from other classes. Size of training data plays an important role in the accuracy of classification and it varies for different types of satellite images. Selection of the optimum number of training samples may increase the overall classification accuracy (Foody et al. 2006). Several other studies have reported the effectiveness of SVM classification over other pixel-based techniques for variety of purposes like biophysical tasks, land use and land cover tasks, and geomorphological tasks (Melgani 2006, Tang et al. 2008; Andermann and Gloaguen 2009; Cao et al. 2009a; Knorn et al. 2009; Knudby and LeDrew, 2010).

In the present study, we evaluated the effectiveness of support vector machine-based land use classification over a conventional classification mechanism. The present study area is densely populated around the mining sites; this presents a problem while classifying land cover accurately due to the presence of built-up area. The other objective was to study the change in land use pattern over a period of 10 years due to extensive open-cast coal mining. Accurate assessment of this land use change in mining areas is important for planning of operations and mine closure.

Materials and methods

Figure 1 depicts the overall methodology of the present study starting from data acquisition, followed by land use classification using maximum likelihood (ML) and SVM-based methods. After that, accuracy assessment was carried out to validate the classification results and to confirm which one of the two classification mechanisms was more accurate. After the validation part, change detection was performed using the results of classified images.

Fig. 1
figure 1

Methodology flow chart of the present study

Study area

A part of the Jharia Coalfield, Dhanbad, India, has been selected for the present study to investigate the impact of coal mining on land use and land cover. The study area (Fig. 2) is situated at about 250 km west of Kolkata, India, and lies between latitudes 23° 38′ N and 23° 50′ N and longitudes 86° 07′ E and 86° 30′ E. The Jharia Coalfield is regarded as the coal capital of India and is known as the exclusive storehouse of prime coking coal in the country. Due to extensive and unregulated coal mining over a century, the study area has witnessed unrestricted environmental degradation over the years (Prakash and Gupta 1998; Sarkar et al. 2007). According to the Central Pollution Control Board’s (CPCB) Comprehensive Environmental Pollution Index (CEPI), Dhanbad is the 24th most critically polluted city of India (CPCB 2009). The continuous mining in the past has changed the landscape with remnants of old abandoned quarries, spoil dumps, subsided depressions and soil patches baked due to mine fires.

Fig. 2
figure 2

Location of the study area

Data acquisition

In order to evaluate the change in land use/land cover over a period of 10 years, two different multispectral satellite images were procured. Landsat 5 Thematic Mapper (TM) surface reflectance data of 30 April 2006 and Landsat 8 Operational Land Imager (OLI)/Thermal Infrared Sensor (TIRS) surface reflectance data of 25 April 2016 were collected from USGS EarthExplorer (Table 1, Fig. 3a, b). Near anniversary images were collected to reduce the potential change detection errors due to variability arising from sun angle, atmospheric condition and phenology. Both images were standardized and projected to the same projection system, Universal Transverse Mercator (UTM) Zone 45 N Datum WGS 1984 projection using ArcGIS 10.2.2. The specification of the satellite images is presented in Table 1.

Table 1 Properties of satellite images
Fig. 3
figure 3

a Landsat 5 TM satellite image (year 2006). b Landsat 8 OLI/TIRS satellite image (year 2016)

Image enhancements

The primary objective of any image enhancement technique is to improve the visual interpretability of an image by increasing the probable distinction among the features in the scene (Lillesand et al. 2014). If two different features are of same colour, their isolation may become difficult, but if those features are sharply different in tone or brightness, their separation becomes easier. For the present study, some of the surface features had near similar reflectance on the mean signature plot (e.g. built-up and OB dump). Haze reduction and histogram equalization techniques were performed using ERDAS Imagine. These techniques helped in reducing the additive effect caused by atmospheric haze and thus improving the apparent distinction among the surface features. The enhanced images were used in visual analysis for the pseudo accuracy assessment study as no other reference data was available for the image of 2006.

Maximum likelihood classification

Maximum likelihood is one of the most commonly used classification algorithms and is dependent upon the probability distribution of the feature classes as per Bayes’ theorem (Liu and Mason 2013). In this algorithm, the normal probability distributions for each of the spectral classes are demarcated using a covariance matrix by selecting a sufficient number of pixels in each spectral class as training sample for the classification algorithm (Richards and Jia 2006). In MLC, the feature space distance between cluster ω k and the image pixel Y i are weighted by the covariance matrix Σk of ω k with an offset relating to ratio N k given in Eq. (2) (Liu and Mason 2013). The algorithm for maximum likelihood classification is given in Eq. (1). N is the total number of pixels in the image Y. For all i, element Y i (Y i ∈Y) to cluster ω k if

$$ \partial \left({Y}_i,{\omega}_k\right)= \ln \left|{\sum}_k\right|+{\left({Y}_i-{\mu}_k\right)}^T{\sum}_k^{-1}\left({Y}_i-{\mu}_k\right) $$
(1)
$$ - \ln \frac{N_k}{N}= \min \left\{\partial \left({Y}_i,{\omega}_r\right)\right\} $$
(2)

for r = 1, 2,…, m.

Both images were arranged in false colour composite (FCC) band order, being a general case of an RGB colour display, FCC images effectively highlight vegetation distinctively in red (Liu and Mason 2013). ArcGIS 10.2.2 was used to perform MLC by taking seven broad land cover classes based on their representativeness in the image for the present study area. Training sets were generated manually by taking an appropriate number of training samples in each representative class and then merging them in training sample manager. A representative signature file was created for training and classifying the images.

Support vector machine-based classification

Support vector machine (SVM) technique is based on statistical learning theory presented by Cortes and Vapnik (1995). SVM classifier is a modern, powerful supervised classification method that can handle multiple-band imagery having high resolution and large segmented satellite data with ease when compared to other classification techniques, where attribute table management becomes difficult. Furthermore, SVM is a non-parametric learning technique; hence, no assumptions are made on the underlying data distributions (Fauvel et al. 2009). SVM works on the principle of generation of a hyperplane that epitomizes a precise separation of linearly separable classes in a hypersurface (Szuster et al. 2011). One major advantage of SVM over other classification algorithms is that it is less gullible to noise, correlated bands and an unbalanced number or size of training data within each class (Cawley and Talbot 2010). The purpose of SVM is to create a model (based on training data) that predicts the destination value of testing data from the training data attributes (Hsu et al. 2010). From the training set of instance-label pairs (X i , Y i ), i = 1,..., l where X i R n and Y ∈ {1, −1}l, the SVM (Boser et al. 1992; Cortes and Vapnik 1995) require the solution for the optimization problem given in Eq. (3) and Eq. (4).

$$ { \min}_{w,b,\in}\frac{1}{2}{W}^TW+C{\displaystyle \sum_{i=1}^l{\in}_i} $$
(3)
$$ \begin{array}{l}{Y}_i\left({W}^T\varnothing \left({X}_i\right)+b\right)\ge 1-{\in}_i,\\ {}\mathrm{Subject}\ to,\\ {}{\in}_i\ge 0.\end{array} $$
(4)

In the above equation, the training vectors X i are charted into a higher dimensional space by the function∅. The SVM locates a linear separating hyperplane with the maximum margin in this higher dimensional space (Hsu et al. 2010). C > 0 is the error term penalty parameter.

ArcGIS 10.2.2 along with ArcPy tool (Wehmann 2013) using LIBSVM classification library (Chang and Lin 2011) with radial basis function (RBF) kernel given in Eq. (5) was used to perform SVM classification on both images. As SVM utilizes a maximum margin hyperplane for a decision boundary, only a part of the training data, known as support vectors are needed to describe its position. The RBF kernel non-linearly maps training samples to a higher dimensional space.

$$ K\left({x}_i,{x}_j\right)= \exp \left(-\gamma ||{x_i-{x}_j}^2\right),\gamma >0. $$
(5)

The tool uses a training raster to learn how to produce the classified image. In the training raster, a small number of pixels are assigned to each training class. This is done by coding the land cover classes with numbers, and then changing the value of a number of pixels in the training raster which overlay that land cover type in the satellite image to the code of that class. All other remaining pixels are set to 0, in order to establish that the classifier should predict their classes based on the information that is extracted. Training and testing data should be a single band raster with known pixels having a value in the label set {1,…,n} and unknown pixels are assigned a value of 0. These raster images should have the exact same extent and gridding as the input raster. This can be ensured by setting Extent, Snap Raster and Cell Size options appropriately in the environment settings during production. The sparse representation of training data along with non-linear mapping provides robust classification accuracy as compared to other classification techniques.

Accuracy assessment

After the land use classification process, it is important to assess the accuracy of the classified image, to peg and quantify mapping or classification errors. Several techniques are available for accuracy assessment study (Arnoff 1982; Piper 1983; Kalkhan et al. 1995; Rosenfield and Fitzpatrik-Lins 1986; Koukoulas and Blackburn 2001). The most commonly used technique is derived using a confusion or error matrix; for the present study, the same has been used. In this method, a simple cross tabulation of the mapped class label against that observed in the ground or reference data is employed for a sample of cases at specified locations (Canters 1997). ArcGIS 10.2.2 was used for accuracy assessment study. Pseudo accuracy assessment was performed for the classified image of 2006 as no other reference data was available for that year. In pseudo accuracy assessment, the same data which was used in the classification process was enhanced and used as a reference for the classified images of 2006. Google Earth™, LISS IV high-resolution satellite image along with pan-sharpened Landsat 8 data of 2016 was used as a reference for the classified image of 2016. Random accuracy points were generated using the Create Accuracy Assessment Point tool available in ArcGIS. The tool computes the user’s and producer’s accuracy for each of the classes and calculates an overall kappa index to accommodate the effects of chance agreement. The locational accuracy for the reference data of 2006 was 100 % as it employed the same dataset for the accuracy assessment study. The locational accuracy for the reference data of 2016 was limited to 5 m (spatial resolution of the LISS IV image). The error arising from the locational accuracy of the 2016 reference data was considered to be negligible as the reference data had higher resolution than the test dataset.

Change detection

Post classification change detection (CD) technique was used for assessing the change in land use pattern over the decade. This technique is often used as a benchmark for the subjective assessment of other CD techniques (Lunetta et al. 1999). During comparison, if the corresponding pixels lie in the same class label, the pixel has not been changed, or else the pixel has been changed. One major limitation of this technique is that it may also incorporate classification errors in the final CD map (Xu et al. 2009; Hussain et al. 2013). ERDAS Imagine was used in the present study to perform CD analysis. A model was created using ERDAS’s modeller to quantify the change. Pixel values of 0 represented no change; decrease and increase were represented by change in pixel values, either negative change or positive change. Also, a change matrix was generated to understand the change from one land cover category to another due to coal mining activities.

Results and discussions

Results of maximum likelihood classification

Seven different land use classes (i.e. surface water, OB dump, built-up, coal quarry, vegetation, open scrub, and barren land) were successfully delineated using this technique (Fig. 4a, c). The results revealed that for both the years, Vegetation dominated the land cover type with 47.84 % in 2006 and 35.17 % in 2016, respectively. Coal quarry was observed as 3.16 % in 2006 and 3.77 % in 2016. OB dump was found to be 11.88 % in 2006 and 34.10 % in 2016. Likewise, the distribution of all land use classes is presented in Table 2. From visual interpretation, it became obvious that this technique was producing classification errors, in the form of overestimating the surface water for the year 2006. This error was attributed to the near similar spectral reflectance of surface water and coal quarry and the limitation of the MLC classifier. The distribution of built-up area and OB dump were also erroneous due to the similar spectral reflectance of these two features.

Fig. 4
figure 4

a Maximum likelihood-based land use classification using the Landsat 5 TM image. b Support vector machine-based land use classification using the Landsat 5 TM image. c Maximum likelihood-based land use classification using the Landsat 8 OLI/TIRS image. d Support vector machine-based land use classification using the Landsat 8 OLI/TIRS image

Table 2 Land use change detection from 2006 to 2016 based on the SVM and MLC classification techniques (total area 6541.23 ha)

Results of support vector machine-based classification

The results of SVM classification revealed 112.96 % increase in the area coverage of OB dump (increased from 1155.42 ha in 2006 to 2460.8 ha in 2016). There was also a sharp increase in the area coverage of built-up area which increased from 8.51 % in 2006 to 17.43 % in 2016, showing an increase of 104 % (Table 2, Fig. 4b, d). Coal quarry was observed to be 6.18 % in 2006 and 3.33 % in 2016 respectively. Likewise, the distribution of other land use classes (barren land, surface water, open scrub, and vegetation) is presented in Table 2. Visual interpretation revealed higher classification accuracy with this method in comparison to MLC. The difference in classification accuracy may be explained by various factors like image quality, image resolutions, classification errors, software errors, and user errors. For the present study, SVM not only gave better results than MLC in classifying all the linearly separable classes but it also produced better results while training the classes which were having near similar spectral reflectance(s). As SVM is based on the principle of separating hyperplanes, high-classification accuracy is attributed to SVM’s ability to locate an optimal separating hyperplane.

Results of accuracy assessment

Initial attention was focussed on the accuracy of support vector machine-based classification over maximum likelihood classification technique. Confusion matrices revealed that SVM-based classifier gave better results than MLC technique by almost 6 % for the Landsat 5 TM image of 2006. Using MLC technique, the overall classification accuracy was found to be 84.98 % with kappa coefficient of 0.82 (Table 3) for Landsat 5 TM image of 2006, and using SVM, the overall classification accuracy was observed as 91.20 % with kappa coefficient of 0.89 (Table 4) for Landsat 5 TM image of 2006.

Table 3 Accuracy assessment using the confusion matrix generated from the test training samples for Landsat 5 TM image based on the MLC technique (overall classification accuracy 84.98 %)
Table 4 Accuracy assessment using the confusion matrix generated from the test training samples for Landsat 5 TM image based on the SVM classification technique (overall classification accuracy 91.20 %)

The built-up area does not appear to have any particular diagnostic spectral reflectance and its spectral signature resembles to that of any other highly reflective features in the study area such as overburden dump. Therefore, it is expected that algorithms would not be able to convincingly distinguish classes having analogous spectral reflectance(s). Similarly, surface water also resembles to coal quarry having similar peaks in the mean signature plot. For Landsat 5 TM image, the built-up area was delineated with an overall accuracy (including producer and user accuracy) of 63.52 % using MLC technique. SVM offered a better result by improving the overall accuracy to 75.1 %, an improvement of 11.5 % over MLC technique. For surface water, the accuracy difference was surprisingly even more substantial, increasing almost 24.6 from 57.95 % (using MLC) to 82.63 % (using SVM). SVM-based classifiers have the ability to generalize the unseen data and it is also known for its higher accuracy on limited amount of training patterns. This is why the SVM classifier has exceeded MLC, giving higher classification accuracy for every individual class.

In case of Landsat 8 OLI/TIRS image of 2016, confusion matrices revealed an increase of almost 3 % in the overall classification accuracy from 90.7 % with kappa coefficient of 0.88 for MLC (Table 5) to 93.37 % with kappa coefficient of 0.91 for SVM (Table 6). An improvement of about 7 % was observed in classification of built-up area. Moreover, Landsat 8 OLI/TIRS has finer spectral resolution (Table 1) than Landsat 5 TM; this improved the apparent reflectance of different surface features. This is the reason why higher classification accuracies were observed using both techniques for Landsat 8 image. The improvement in classification accuracy was although close to 3 %, but considering the quality of the raw satellite image, this improvement may be termed substantial.

Table 5 Accuracy assessment using the confusion matrix generated from the test training samples for Landsat 8 OLI/TIRS image based on the MLC technique (overall classification accuracy 90.7 %)
Table 6 Accuracy assessment using the confusion matrix generated from the test training samples for Landsat 8 OLI/TIRS image using the SVM classification technique (overall classification accuracy 93.37 %)

Results of change detection

Classification maps were generated for the years 2006 and 2016 using both SVM and MLC techniques (Fig. 4). SVM classified images reported higher accuracy for both 2006 and 2016, hence for post classification change detection, SVM classified images were used. A change detection (CD) map was created using ERDAS Imagine, the CD map was further sub-classified into three distinct classes (decreased, increased, constant) (Fig. 5). It was observed that due to continuous open-cast mining operations and high stripping ratio for coking coal, OB dump increased almost 112.96 %. Due to rapid coal extraction, large open scrub areas were converted into active mining land; as a result, open scrub area decreased nearly 36.41 % from 1884.87 ha in 2006 to 1198.3 ha in 2016 (Table 7). Surface water and vegetation both decreased by 18.82 and 58.75 %, respectively. Large tracts of Vegetation were converted into OB dump due to coal mining activities.

Fig. 5
figure 5

Land use change map from 2006 to 2016 based on pixel reflectance values

Table 7 Land use change matrix from 2006 to 2016 (all figures are in ha)

Statistical validation was not performed to evaluate the classification algorithms as individual performance of the algorithms cannot be established significantly (Dietterich 1998).

Conclusions

The main aim of this paper was to compare the accuracy of the two pixel-based classification techniques (support vector machine and maximum likelihood classification) in classifying satellite images of the Jharia Coalfield. Other objective was to study the change in land use pattern over a period of 10 years due to extensive opencast coal mining using Landsat 5 TM data of 2006 and Landsat 8 OLI/TIRS data of 2016. The ability of SVM to generate an optimal separating hyperplane resulted in a better performance of SVM over MLC in classifying high-resolution satellite imagery. Confusion matrix-based accuracy assessment revealed that the features with overlapping spectral reflectance values on the mean signature plot were classified more accurately using SVM with marginal errors, which was far better than MLC. SVM was superior to MLC in overall classification accuracy as well as individual classification accuracy for many classes. Post classification change detection was used to study the change in land use pattern over a period of 10 years from 2006 to 2016. A change detection map was prepared to analyse the change in land use pattern. It was observed during the study period that nearly 47 % of the total land area changed into other classes. The main drivers for this land use change for the present study area were coal mining activities. Large parts of land were converted into mining areas. The results also indicated that the rate of change of land cover is very high and the danger of severe land degradation is challenging the ecological resilience. This study will help in quantifying the land use change; in serving as a basis for further decision support system, studies aiding variety of purposes such as environmental management, landscape planning, and monitoring reclamation success. Modern classification techniques such as SVM can be highly instrumental in monitoring land use and land cover change.