Keywords

1 Introduction

Classification of multispectral imagery has formed itself as one of the most sought-after technique for information extraction. The process of image classification, in the context or remote sensing, has been broadly classified into hard and soft classification techniques. In hard classification, a pixel is assumed to be an indecomposable part of the image and belongs to just one of the defined land cover class. However, in the real world due to the presence of mixed pixels (mixels), which form the salient feature of heterogeneous study areas, hard classifiers are shown to produce poor results as the ground data is imprecise [1,2,3]. Soft or Fuzzy classification comes in handy under these eventualities and has been known for its capability to extract a lot of helpful data from heterogeneous RS data [2, 4]. A large number of studies have been conducted for classifying RS data using Fuzzy logic [2, 5,6,7,8]. Though a large number of researchers have conferred their study towards remotely sensed image classification using different classifiers, it has still remained a challenging task within the remote sensing fraternity [1, 9,10,11,12]. Hence, it has become necessary to research and produce new classification techniques for obtaining better classification results.

Zadeh’s concept of Fuzzy set theory has provided some very useful options for operating with heterogeneous datasets [13]. Using Fuzzy set concepts a pixel in an image is described with a membership function that links the pixel a real number from 0 to 1. This membership function is treated as the probability of the pixel belonging to each class [1, 13]. This principle is shown to provide an efficient solution for mixed pixel issue [3, 8].

The objective of this study is to perform LULC mapping of the considered study area using Fuzzy theory based MLC. The goal of classification is to obtain results with a confidence level of 95% with \(\pm 4\%\) error margin. The methodology followed is illustrated in Fig. 1.

Fig. 1
figure 1

General flow of the methodology

The rest of the paper is organized as follows. Section 2 discusses the data and study area used. Section 3 discusses the classification method employed. Section 4 presents the results obtained during the study. Section 5 presents the conclusions drawn from the results.

2 Materials and Methodology

Land Satellite (Landsat) 8 data was accessed by U.S. Geological Survey (USGS) website. Figure 2 indicates the study area considered and it envelops the North Canara District in Karnataka, India. The Western Ghats form the main geographic feature of the region and runs from North to South. The average rainfall on the coastal part is 3000 mm (120 in) and is as high as 5000 mm (200 in) in the west-facing slopes of Western Ghats. East facing ridge of the Western Ghats is the rain shadow region and receives, on an average, only 1000 mm (39 in) rainfall annually [14]. High rainfall in the region supports lavish forests that coat over 70% of the coverage area. The north part of the Western Ghats forms Moist Deciduous Forests ranging from 250 to 1000 m in elevation. Above 1000 m elevation are the Evergreen rain forests [15]. The study area also has chunks of degraded scrub jungles and savanna. The beach region is characterized by coconut plantations and screw pine. The study area is recognized as a coastal agro-climatic zone by the government of India. The overall spatial resolution of the image is 15 m. This data was acquired on May 18 2016, which is the mid-summer season and is free from clouds. Layer stacking of first seven bands produces a low-resolution multispectral (MS) imagery of 30 m. Resolution merge technique is used to merge the 30 m MS image with higher spatial resolution band 8 to obtain high resolution 15 m multispectral image. This image is then used for further processing.

Fig. 2
figure 2

15 m spatial resolution map of the study area [16]

2.1 Feature Extraction/Signature Collection

There are several methods of collecting the training data, including; in situ data collection, on-screen selection of polygonal data, and/or on-screen seeding of training data [1]. By employing an on-screen selection of polygonal data technique, seven land use land cover classes were identified over the study area; (i) Water Body, (ii) Kharif, (iii) Built Up, (iv) Scrub Land, (v) Double Crop/Horticultural Plantations, (vi) Moist Deciduous Forest, and (vii) Evergreen Forest. Spectral signatures were collected for each class with at least 300 sample pixels per class. The study considered both pure and mixed pixels for training the algorithm.

The considered study area exhibits the characteristics of a heterogeneous dataset with more than two classes severely overlapping one another. Class separability was measured in terms of Euclidean distance considering two classes at a time. The Euclidean distance was redefined to measure the severity of land cover overlapping and is used as the spectral similarity index (SSI). Highest class separability was measured between Water Body and Kharif classes and was considered as the reference for other classes. Classes with a SSI value of less than 0.4 were determined to be severely overlapping one another at certain locations. SSI measurements for all possible class pairs are shown in Table 1.

Table 1 Class separability in terms of spectral similarity index

3 Fuzzy Theory Based Maximum Likelihood Classifier

The Fuzzy theory based ML classifier discussed in this paper works in three stages; (i) Fuzzification of the data, (ii) Classification using Fuzzy theory based MLC, and (iii) Defuzzification.

3.1 Fuzzification

In the Fuzzification step, each pixel on the input image is converted into a pixel measurement vector, x, of membership grades. The Fuzzy membership function for any x must lie in the range 0–1, they should all add up to unity, and should be positive values. These characteristics are listed in (1), (2), and (3) [1].

$$0 \le f_{{F_{i} }} \left( x \right) \le 1$$
(1)
$$\mathop \sum \limits_{x \in X} f_{{F_{i} }} \left( x \right) > 0$$
(2)
$$\mathop \sum \limits_{i = 1}^{m} f_{{F_{i} }} \left( x \right) = 1,$$
(3)

where, Fi is one of the spectral classes, X represents all pixels in the dataset, x is a pixel measurement vector, m is the number of classes, and \(f_{{F_{i} }}\) is the membership function of the Fuzzy set Fi(1 ≤ i ≤ m). All the membership function values are recorded as a Fuzzy partition matrix

$$\left[ {\begin{array}{*{20}c} {f_{{F_{1} }} \left( {x_{1} } \right)} & {f_{{F_{1} }} \left( {x_{2} } \right) } & { \ldots} & {f_{{F_{1} }} \left( {x_{n} } \right)} \\ {f_{2} \left( {x_{1} } \right)} & {f_{{F_{2} }} \left( {x_{2} } \right)} & { \ldots} & {f_{{F_{2} }} \left( {x_{n} } \right)} \\ : & : & { \ldots} & : \\ {f_{{F_{m} }} \left( {x_{1} } \right)} & {f_{{F_{m} }} \left( {x_{2} } \right)} & { \ldots} & {f_{{F_{m} }} \left( {x_{n} } \right)} \\ \end{array} } \right],$$
(4)

where, n represents the total number of pixels, and xi is the ith pixel’s measurement vector [1].

3.2 Classification

Fuzzy based classification involves the use of Fuzzy mean and covariance matrices. For class c, Fuzzy mean is computed as

$$\mu_{c}^{ *} { = }\frac{{\mathop \sum \nolimits_{i = 1}^{n} f_{c} \left( {x_{i} } \right)x_{i} }}{{\mathop \sum \nolimits_{{i{ = 1}}}^{n} f_{c} \left( {x_{i} } \right)}},$$
(5)

where, xi is a sample pixel measurement vector (1 ≤ i ≤ n), fc is the membership function of class c, and n is the total number of sample pixel measurement vectors [1].

The Fuzzy covariance matrix \(V_{c}^{*}\) is computed as;

$$V_{c}^{*} = \frac{{\mathop \sum \nolimits_{i = 1}^{n} f_{c} \left( {x_{i} } \right)\left( {x_{i} - \mu_{c}^{*} } \right)\left( {x_{i} - \mu_{c}^{*} } \right)^{T} }}{{\mathop \sum \nolimits_{i = 1}^{n} f_{c} \left( {x_{i} } \right)}}$$
(6)

These mean and covariance values replace the conventional mean and covariance matrix in the classical MLC algorithm. This will convert a classical MLC algorithm into a Fuzzy based soft classification algorithm [1].

Fuzzy set theory solely provides membership functions for every pixel over the defined number of classes and requires a parametric rule for assigning those pixels to relevant classes. Parametric rules such as Maximum Likelihood, Mahalanobis Distance et al. may be used in the process. This study involves the utilization of Maximum Likelihood classifier as the parametric rule.

Fuzzy based Maximum Likelihood classifier uses Fuzzy mean and Covariance matrices replacing the conventional mean and covariance matrices. For an n-band multispectral image the likelihood function for a pixel belonging to class k is given by [1],

$$p^{*} \left( {x |w_{i} } \right) = \frac{1}{{\left( {2\pi } \right)^{n/2} \left| {V_{k}^{*} } \right|^{1/2} }}{ \exp }\left[ { - \frac{1}{2}\left( {x - \mu_{k}^{*} } \right)V_{k}^{* - 1} \left( {x - \mu_{k}^{*} } \right)^{\text{T}} } \right],$$
(7)

where, x is one of the brightness values on the x-axis, \(\mu_{k}^{*}\) is the Fuzzy mean as in (5), \(V_{k}^{*}\) is the Fuzzy covariance matrix as in (6), and \(\left( {x - \mu_{k}^{*} } \right)^{\text{T}}\) is the transpose of vector \(\left( {x - \mu_{k}^{*} } \right)\). Similarly, \(p^{*} \left( {x |w_{i} } \right)\) is calculated for each pixel for all classes. A membership function then enables the algorithm to decide to which class the corresponding pixel is to be assigned. For Maximum Likelihood classifier, the membership function can be defined as [1];

$$f_{c} \left( x \right) = \frac{{p^{*} (x|w_{k} )}}{{\mathop \sum \nolimits_{i = 1}^{m} p^{*} (x|w_{k} )}}$$
(8)

The membership grades of a pixel vector depend on x’s position in the vector space. \(f_{c} \left( x \right)\) increases exponentially with the decrease of \(\left( {x - \mu_{k}^{*} } \right)V_{k}^{* - 1} \left( {x - \mu_{k}^{*} } \right)^{\text{T}}\), i.e., the Mahalanobis distance between x and class k. The factor \(\sum\nolimits_{i = 1}^{m} {p^{*} (x|w_{k} )}\) is a normalization factor [1]. Applying this type of Fuzzy logic creates a membership grade matrix for each pixel.

3.3 Defuzzification Using Fuzzy Convolution

Fuzzy convolution technique is used to convert the n-layer output of classification into a map like structure. It creates the map by computing the total weighted inverse distance of all the classes in a window of pixels. The process first computes the total inverse distance summed over the entire set of Fuzzy classification layers for each class. It then assigns the center pixel to the class for which the value T[k] is largest. The total inverse distance, T[k], can be computed using [17]:

$$T\left[ k \right] = \mathop \sum \limits_{i = 0}^{s} \mathop \sum \limits_{j = 0}^{s} \mathop \sum \limits_{l = 0}^{n} \frac{{w_{ij} }}{{D_{ijl} \left[ k \right]}}$$
(9)

where, i is the row index of window, j is the column index of window, s is the size of window, l is the layer index of fuzzy layers used, n is the number of fuzzy layers used, W is the weight table for window, k is the class value, D[k] is the distance file value for class k, and T[k] is the total weighted distance of window for class k [17]. This study considers a 5 × 5 size window given by

$$\left[ {\begin{array}{*{20}l} {0.500} \hfill & {0.605} \hfill & {0.646} \hfill & {0.605} \hfill & {0.500} \hfill \\ {0.605} \hfill & {0.750} \hfill & {0.823} \hfill & {0.750} \hfill & {0.605} \hfill \\ {0.646 } \hfill & {0.823} \hfill & {1.000} \hfill & {0.823} \hfill & {0.646} \hfill \\ {0.605} \hfill & {0.750 } \hfill & {0.823} \hfill & {0.750} \hfill & {0.605} \hfill \\ {0.500} \hfill & {0.605} \hfill & {0.646} \hfill & {0.605} \hfill & {0.500} \hfill \\ \end{array} } \right]$$
(10)

4 Results and Discussion

Figure 3 illustrates the Fuzzy topology based maximum likelihood classification map of the study area. Table 2 shows the results obtained after accuracy assessment. User’s Accuracy and Kappa value are considered as the pivotal parameters in judging the classification process [18]. The classifier extracted dominant classes (Evergreen Forest and Deciduous Forest) very efficiently. Less dominant classes (Double Crop and Built Up) are extracted very poorly by the classifier. Fuzzy topology based maximum likelihood classifier has shown significant improvement in classification performance [12]. An overall Kappa value of 0.7870 indicates an excellent performance from the classifier [18].

Fig. 3
figure 3

Fuzzy topology based maximum likelihood classified map of the study area

Table 2 Results of Fuzzy topology based maximum likelihood classification

To illustrate the usefulness of using the inverse weighted distance of all classes from the weight windows for pixels for assigning pixels to class values, Table 3 indicates the inverse weighted distance, T[k], for 10 pixels randomly selected from the study area. A hard classified map is created by assigning a pixel to the class for which the distance measure, T[k], is maximum.

Table 3 Total inverse weighted distance of all classes for selected pixels

4.1 Placing Confidence Limits on Assessed Accuracy

A straightforward statistical approach may be used to express the interval within which the true map accuracy lies, say, with 95% certainty. It is possible to use the normal distribution to obtain this interval by the expression [2];

$$- Z_{\alpha /2} < \frac{x - nP}{{\sqrt {nP\left( {1 - P} \right)} }} < Z_{\alpha /2} ,$$
(11)

where, n is the number of testing pixels, x(= np) is the number of correctly labelled pixels, P is the thematic map accuracy in percentage, p(= x/n) is the proportion of pixels correctly classified, \(Z_{\alpha /2}\) is the value of the normal distribution beyond which on both tails \(\alpha\) of the population is excluded [2].

From Eq. (11), the estimate of the thematic map accuracy, P, estimated by the proportion of pixels that are correctly classified in the testing set, at the 95% confidence level, for large values of n and x, and for reasonable accuracies are [2],

$$\frac{{x \pm 1.960\sqrt {\frac{{x\left( {n - x} \right)}}{n} + 0.960} }}{n} = p \pm \frac{1.960}{n}\sqrt {\frac{{x\left( {n - x} \right)}}{n} + 0.960}$$
(12)

For 1000 testing pixels and a minimum of 80% of expected accuracy, it is expected to have, at least, 800 pixels to be correctly classified. From (12), the bounds on the estimated map accuracy are \(P = p\,\pm\,0.039\) or, in percentage terms, the map accuracy is approximated to be between 82.1 and 89.9%.

5 Conclusion

In this paper, a novel method for classification of large RS imagery through embedding Fuzzy theory into classical Maximum Likelihood Classifier (MLC) is discussed. A pixel is the basic building block of an image and is indecomposable in hard classification applications. The study proves that a pixel can be used as a decomposable unit in image classification. The results obtained indicate that Fuzzy theory based MLC permits obtaining accurate results for large heterogeneous study areas in the presence of mixed pixels and spectrally overlapping classes. The study conjointly confirms that one can use mixed pixels as training data and yet achieve good results. The objective of obtaining classification results with confidence level of 95% with \(\pm 4\%\) error margin is achieved. Hence, it can be concluded that the discussed Fuzzy topology based MLC handles mixed pixel issue more successfully. However, more investigation is needed on the classification performance of some classes, such as Built Up and Double Crop/Horticultural Plantations. Future scope of the work involves exploring the information richness provided by embedding Fuzzy theory into MLC for other sensor data.