Abstract
The success of a large number of real-world applications such as mapping, forestry, and change detection depends on the effectiveness with which land cover classes are extracted from Remotely Sensed (RS) imagery. Application of Fuzzy theory in remote sensing has been of great interest in the remote sensing fraternity particularly when the data are inherently Fuzzy. In this paper, a Fuzzy theory based Maximum Likelihood Classifier (MLC) is discussed. The study aims at amplifying the classification accuracy of large heterogeneous multispectral remote sensor data characterized by the overlapping of spectral classes and mixed pixels. Landsat 8 multispectral data of North Canara District was collected from USGS website and is considered for the research. Seven land use land cover classes were identified over the study area. The study also aims at achieving classification results with a confidence level of 95% with \(\pm 4\%\) error margin. The conducted research attains the predicted classification accuracy and proves to be a valuable technique for classification of large heterogeneous RS multispectral imagery.
Access provided by Autonomous University of Puebla. Download conference paper PDF
Similar content being viewed by others
Keywords
- Fuzzy topology
- Remote sensing
- Geographic information system
- Maximum likelihood classification
- Accuracy assessment
1 Introduction
Classification of multispectral imagery has formed itself as one of the most sought-after technique for information extraction. The process of image classification, in the context or remote sensing, has been broadly classified into hard and soft classification techniques. In hard classification, a pixel is assumed to be an indecomposable part of the image and belongs to just one of the defined land cover class. However, in the real world due to the presence of mixed pixels (mixels), which form the salient feature of heterogeneous study areas, hard classifiers are shown to produce poor results as the ground data is imprecise [1,2,3]. Soft or Fuzzy classification comes in handy under these eventualities and has been known for its capability to extract a lot of helpful data from heterogeneous RS data [2, 4]. A large number of studies have been conducted for classifying RS data using Fuzzy logic [2, 5,6,7,8]. Though a large number of researchers have conferred their study towards remotely sensed image classification using different classifiers, it has still remained a challenging task within the remote sensing fraternity [1, 9,10,11,12]. Hence, it has become necessary to research and produce new classification techniques for obtaining better classification results.
Zadeh’s concept of Fuzzy set theory has provided some very useful options for operating with heterogeneous datasets [13]. Using Fuzzy set concepts a pixel in an image is described with a membership function that links the pixel a real number from 0 to 1. This membership function is treated as the probability of the pixel belonging to each class [1, 13]. This principle is shown to provide an efficient solution for mixed pixel issue [3, 8].
The objective of this study is to perform LULC mapping of the considered study area using Fuzzy theory based MLC. The goal of classification is to obtain results with a confidence level of 95% with \(\pm 4\%\) error margin. The methodology followed is illustrated in Fig. 1.
The rest of the paper is organized as follows. Section 2 discusses the data and study area used. Section 3 discusses the classification method employed. Section 4 presents the results obtained during the study. Section 5 presents the conclusions drawn from the results.
2 Materials and Methodology
Land Satellite (Landsat) 8 data was accessed by U.S. Geological Survey (USGS) website. Figure 2 indicates the study area considered and it envelops the North Canara District in Karnataka, India. The Western Ghats form the main geographic feature of the region and runs from North to South. The average rainfall on the coastal part is 3000 mm (120 in) and is as high as 5000 mm (200 in) in the west-facing slopes of Western Ghats. East facing ridge of the Western Ghats is the rain shadow region and receives, on an average, only 1000 mm (39 in) rainfall annually [14]. High rainfall in the region supports lavish forests that coat over 70% of the coverage area. The north part of the Western Ghats forms Moist Deciduous Forests ranging from 250 to 1000 m in elevation. Above 1000 m elevation are the Evergreen rain forests [15]. The study area also has chunks of degraded scrub jungles and savanna. The beach region is characterized by coconut plantations and screw pine. The study area is recognized as a coastal agro-climatic zone by the government of India. The overall spatial resolution of the image is 15 m. This data was acquired on May 18 2016, which is the mid-summer season and is free from clouds. Layer stacking of first seven bands produces a low-resolution multispectral (MS) imagery of 30 m. Resolution merge technique is used to merge the 30 m MS image with higher spatial resolution band 8 to obtain high resolution 15 m multispectral image. This image is then used for further processing.
2.1 Feature Extraction/Signature Collection
There are several methods of collecting the training data, including; in situ data collection, on-screen selection of polygonal data, and/or on-screen seeding of training data [1]. By employing an on-screen selection of polygonal data technique, seven land use land cover classes were identified over the study area; (i) Water Body, (ii) Kharif, (iii) Built Up, (iv) Scrub Land, (v) Double Crop/Horticultural Plantations, (vi) Moist Deciduous Forest, and (vii) Evergreen Forest. Spectral signatures were collected for each class with at least 300 sample pixels per class. The study considered both pure and mixed pixels for training the algorithm.
The considered study area exhibits the characteristics of a heterogeneous dataset with more than two classes severely overlapping one another. Class separability was measured in terms of Euclidean distance considering two classes at a time. The Euclidean distance was redefined to measure the severity of land cover overlapping and is used as the spectral similarity index (SSI). Highest class separability was measured between Water Body and Kharif classes and was considered as the reference for other classes. Classes with a SSI value of less than 0.4 were determined to be severely overlapping one another at certain locations. SSI measurements for all possible class pairs are shown in Table 1.
3 Fuzzy Theory Based Maximum Likelihood Classifier
The Fuzzy theory based ML classifier discussed in this paper works in three stages; (i) Fuzzification of the data, (ii) Classification using Fuzzy theory based MLC, and (iii) Defuzzification.
3.1 Fuzzification
In the Fuzzification step, each pixel on the input image is converted into a pixel measurement vector, x, of membership grades. The Fuzzy membership function for any x must lie in the range 0–1, they should all add up to unity, and should be positive values. These characteristics are listed in (1), (2), and (3) [1].
where, Fi is one of the spectral classes, X represents all pixels in the dataset, x is a pixel measurement vector, m is the number of classes, and \(f_{{F_{i} }}\) is the membership function of the Fuzzy set Fi(1 ≤ i ≤ m). All the membership function values are recorded as a Fuzzy partition matrix
where, n represents the total number of pixels, and xi is the ith pixel’s measurement vector [1].
3.2 Classification
Fuzzy based classification involves the use of Fuzzy mean and covariance matrices. For class c, Fuzzy mean is computed as
where, xi is a sample pixel measurement vector (1 ≤ i ≤ n), fc is the membership function of class c, and n is the total number of sample pixel measurement vectors [1].
The Fuzzy covariance matrix \(V_{c}^{*}\) is computed as;
These mean and covariance values replace the conventional mean and covariance matrix in the classical MLC algorithm. This will convert a classical MLC algorithm into a Fuzzy based soft classification algorithm [1].
Fuzzy set theory solely provides membership functions for every pixel over the defined number of classes and requires a parametric rule for assigning those pixels to relevant classes. Parametric rules such as Maximum Likelihood, Mahalanobis Distance et al. may be used in the process. This study involves the utilization of Maximum Likelihood classifier as the parametric rule.
Fuzzy based Maximum Likelihood classifier uses Fuzzy mean and Covariance matrices replacing the conventional mean and covariance matrices. For an n-band multispectral image the likelihood function for a pixel belonging to class k is given by [1],
where, x is one of the brightness values on the x-axis, \(\mu_{k}^{*}\) is the Fuzzy mean as in (5), \(V_{k}^{*}\) is the Fuzzy covariance matrix as in (6), and \(\left( {x - \mu_{k}^{*} } \right)^{\text{T}}\) is the transpose of vector \(\left( {x - \mu_{k}^{*} } \right)\). Similarly, \(p^{*} \left( {x |w_{i} } \right)\) is calculated for each pixel for all classes. A membership function then enables the algorithm to decide to which class the corresponding pixel is to be assigned. For Maximum Likelihood classifier, the membership function can be defined as [1];
The membership grades of a pixel vector depend on x’s position in the vector space. \(f_{c} \left( x \right)\) increases exponentially with the decrease of \(\left( {x - \mu_{k}^{*} } \right)V_{k}^{* - 1} \left( {x - \mu_{k}^{*} } \right)^{\text{T}}\), i.e., the Mahalanobis distance between x and class k. The factor \(\sum\nolimits_{i = 1}^{m} {p^{*} (x|w_{k} )}\) is a normalization factor [1]. Applying this type of Fuzzy logic creates a membership grade matrix for each pixel.
3.3 Defuzzification Using Fuzzy Convolution
Fuzzy convolution technique is used to convert the n-layer output of classification into a map like structure. It creates the map by computing the total weighted inverse distance of all the classes in a window of pixels. The process first computes the total inverse distance summed over the entire set of Fuzzy classification layers for each class. It then assigns the center pixel to the class for which the value T[k] is largest. The total inverse distance, T[k], can be computed using [17]:
where, i is the row index of window, j is the column index of window, s is the size of window, l is the layer index of fuzzy layers used, n is the number of fuzzy layers used, W is the weight table for window, k is the class value, D[k] is the distance file value for class k, and T[k] is the total weighted distance of window for class k [17]. This study considers a 5 × 5 size window given by
4 Results and Discussion
Figure 3 illustrates the Fuzzy topology based maximum likelihood classification map of the study area. Table 2 shows the results obtained after accuracy assessment. User’s Accuracy and Kappa value are considered as the pivotal parameters in judging the classification process [18]. The classifier extracted dominant classes (Evergreen Forest and Deciduous Forest) very efficiently. Less dominant classes (Double Crop and Built Up) are extracted very poorly by the classifier. Fuzzy topology based maximum likelihood classifier has shown significant improvement in classification performance [12]. An overall Kappa value of 0.7870 indicates an excellent performance from the classifier [18].
To illustrate the usefulness of using the inverse weighted distance of all classes from the weight windows for pixels for assigning pixels to class values, Table 3 indicates the inverse weighted distance, T[k], for 10 pixels randomly selected from the study area. A hard classified map is created by assigning a pixel to the class for which the distance measure, T[k], is maximum.
4.1 Placing Confidence Limits on Assessed Accuracy
A straightforward statistical approach may be used to express the interval within which the true map accuracy lies, say, with 95% certainty. It is possible to use the normal distribution to obtain this interval by the expression [2];
where, n is the number of testing pixels, x(= np) is the number of correctly labelled pixels, P is the thematic map accuracy in percentage, p(= x/n) is the proportion of pixels correctly classified, \(Z_{\alpha /2}\) is the value of the normal distribution beyond which on both tails \(\alpha\) of the population is excluded [2].
From Eq. (11), the estimate of the thematic map accuracy, P, estimated by the proportion of pixels that are correctly classified in the testing set, at the 95% confidence level, for large values of n and x, and for reasonable accuracies are [2],
For 1000 testing pixels and a minimum of 80% of expected accuracy, it is expected to have, at least, 800 pixels to be correctly classified. From (12), the bounds on the estimated map accuracy are \(P = p\,\pm\,0.039\) or, in percentage terms, the map accuracy is approximated to be between 82.1 and 89.9%.
5 Conclusion
In this paper, a novel method for classification of large RS imagery through embedding Fuzzy theory into classical Maximum Likelihood Classifier (MLC) is discussed. A pixel is the basic building block of an image and is indecomposable in hard classification applications. The study proves that a pixel can be used as a decomposable unit in image classification. The results obtained indicate that Fuzzy theory based MLC permits obtaining accurate results for large heterogeneous study areas in the presence of mixed pixels and spectrally overlapping classes. The study conjointly confirms that one can use mixed pixels as training data and yet achieve good results. The objective of obtaining classification results with confidence level of 95% with \(\pm 4\%\) error margin is achieved. Hence, it can be concluded that the discussed Fuzzy topology based MLC handles mixed pixel issue more successfully. However, more investigation is needed on the classification performance of some classes, such as Built Up and Double Crop/Horticultural Plantations. Future scope of the work involves exploring the information richness provided by embedding Fuzzy theory into MLC for other sensor data.
References
Jensen JR (2000) Introductory digital image processing: a remote sensing perspective. Prentice-Hall Inc., New Jersey
Zhang J, Foody GM (1998) A fuzzy classification of sub-urban land cover from remotely sensed imagery. Int J Remote Sens 19:2721–2738
Wang F (1990) Fuzzy supervised classification of remote sensing images. IEEE Trans Geosci Remote Sens 28:194–201
Ji M, Jensen JR (1996) Fuzzy training in supervised digital image processing. Geogr Inf Sci 2:1–11
Wang Y, Jamshidi M (2004) Fuzzy logic applied in remote sensing image classification. Syst Man Cybern 2004(7):6378–6382
Melgani F, Al Hashemy BAR (2000) Taha SMR (2000) An explicit fuzzy supervised classification method for multispectral remote sensing images. IEEE Trans Geosci Remote Sens 38(1):287–295
Droj G (2007) The applicability of fuzzy theory in remote sensing image classification. Stud Univ Babes, Bolyai, Inform LII, 89–96
Wang F (1990) Improving remote-sensing image-analysis through fuzzy information representation. Photogramm Eng Remote Sens 56:1163–1169
Lu D, Weng Q (2007) A survey of image classification methods and techniques for improving classification performance. Int J Remote Sens 28:823–870
Mas JF, Flores JJ (2008) The application of artificial neural networks to the analysis of remotely sensed data. Int J Remote Sens 29:617–663
Maselli F, Conese C, De Filippis T, Romani M (1995) Integration of ancillary data into a maximum-likelihood classifier with nonparametric priors. ISPRS J Photogramm Remote Sens 50:2–11
Shivakumar BR, Rajashekararadhya SV (2017) Spectral similarity for evaluating classification performance of traditional classifiers. In: 2017 International conference on wireless communications signal processing and networking (WiSPNET). Chennai, pp 1999–2004
Zadeh LA (1965) Fuzzy sets. Inf Control 8:338–353
Directorate of Census Operations Karnataka: District census handbook Uttara Kannada (2014)
Pascal JP (1986) Explanatory booklet on forest map of South India: Belgaum-Dharwar-Panaji, Shimoga, Mercara-Mysore. In: Institut Francais de Pondichery, p 88. Travaux de la Section Scientifique et Technique. Hors Serie N 18
USGS: Earth explorer, https://earthexplorer.usgs.gov/
Pouncey R, Swanson K, Hart K (1999) ERDAS field guide
Richards JA (2013) Remote sensing digital image analysis. Springer
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Shivakumar, B.R., Rajashekararadhya, S.V. (2019). Investigation on Land Cover Mapping of Large RS Imagery Using Fuzzy Based Maximum Likelihood Classifier. In: Pandian, D., Fernando, X., Baig, Z., Shi, F. (eds) Proceedings of the International Conference on ISMAC in Computational Vision and Bio-Engineering 2018 (ISMAC-CVB). ISMAC 2018. Lecture Notes in Computational Vision and Biomechanics, vol 30. Springer, Cham. https://doi.org/10.1007/978-3-030-00665-5_63
Download citation
DOI: https://doi.org/10.1007/978-3-030-00665-5_63
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-00664-8
Online ISBN: 978-3-030-00665-5
eBook Packages: EngineeringEngineering (R0)