Abstract
Selection of useful bands plays a very important role in hyperspectral image classification. In the past decade, metaheuristic algorithms have been used as promising methods for solving this problem. However, many metaheuristic algorithms may provide unsatisfactory performance due to their slow or premature convergence. Therefore, how to develop algorithms well balancing the exploration and exploitation, and find the suitable bands precisely is still a challenge. In this paper, a new hybrid global optimization algorithm, which is based on the Wind Driven Optimization (WDO) and Cuckoo Search (CS) is proposed to solve hyperspectral band selection problems. Both WDO and CS have strong searching ability and require less control parameters, but easily suffer from premature convergence due to loss of diversity of population. The proposed approach uses the Chebyshev chaotic map to initialize the population at initial step. The population is divided into two subgroups and WDO and CS are adopted for these two subgroups independently. By division, these two subgroups can share suitable information and utilize each other’s pros, thus avoid premature convergence, and obtain best optimal solution. Furthermore, the Levy flight step size in CS algorithm is adaptively adjusted based on fitness value and current iteration number, which helps in boosting the convergence speed of algorithm. The experimental results on three standard benchmark datasets namely, Pavia University, Botswana and Indian Pines, prove the superiority of the proposed approach over standard WDO and CS approaches as well as the other traditional approaches in terms of classification accuracy with fewer bands.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Avoid common mistakes on your manuscript.
1 Introduction
Recent advances in imaging techniques allow hyperspectral sensors capable of gathering spectral information in hundreds of narrow and contiguous bands, covering a broad range of wavelength in the spectrum. The increasing availability of hyperspectral images has gained huge success in the field of remote sensing. Hyperspectral image is three-dimensional (3-D) image cube (the third dimension denotes spectral band) comprises huge amount of spectral as well as spatial information to recognize and discriminate spectrally unique materials [30]. The enormous spectral bands of hyperspectral images provide redundant information, further entail huge storage cost and heavy computational burden. Besides, spectral bands are strongly correlated, i.e., more number of bands may not always result in significant discriminating capability for classification purpose. Therefore, reducing the dimensionality is a vital process to discard the redundant bands and make it less time-consuming for classification [4, 29, 32].
Generally, reduction in the dimensionality can be achieved with either feature extraction or feature selection (also known as band selection). Feature extraction is the process of reducing the dimension by mapping the data from original higher dimensional space to new lower dimensional space. Approaches like, Principal component analysis (PCA), Independent component analysis (ICA), Discriminant analysis, and transform based approaches have been widely used [14, 24, 43, 44]. However, these approaches generally alter the physical meaning of original data since the data in the low-dimensional space correspond to their linear combinations but do not original individual bands. Band selection approaches to pick set of most suitable bands from pool of bands may be preferred when the physical meaning of bands needs to be preserved [28, 31, 36]. Recently, many band selection approaches have been proposed [15, 45]. There exists two most commonly used ways for selection of optimal band subset. The first strategy includes the individual band evaluation, and the second strategy includes the band subset evaluation. Individual evaluation consists of clustering based approaches and ranking based approaches. In the individual band evaluation, the score (which signifies the importance of the band) of an individual band is measured according to its degree of relevance. Band clustering approaches form cluster of similar bands and select representative band [25, 46], whereas, in ranking based band selection approaches, the bands are selected based on the ranking [8, 47]. A hierarchical clustering approach is most commonly used band clustering approach, which is based on Ward’s linkage [18]. This technique employs mutual information (MI) between the bands to pick most informative bands. Kamandar and Ghassemian [36] proposed a novel band selection technique that maximizes relevance of selected bands and minimizes redundancy between them (mRMR), which further leads to excellent classification performance. Jia et al. [12] proposed a new hyperspectral band selection approach based on Relief-F algorithm and discards less informative bands.
Intelligent algorithms try to evaluate a complete subset of bands rather than individually selecting the bands. Generating such subsets is known to be an NP-hard problem. Therefore, such problems are solved by employing nature-inspired algorithms. In band selection, popular nature-inspired algorithms include Firefly algorithm [38], Artificial Bee Colony(ABC) [48], Wind Driven Optimization (WDO) [27], Genetic Algorithms (GA) [16], Particle Swarm Optimization (PSO) [37], Cuckoo Search (CS) optimization algorithm [33, 34], Gray Wolf Optimization (GWO) [22], and so on. These approaches consider the band selection problem as a combinatorial optimization problem which is solved by formulating an appropriate fitness function or objective function. The objective function assesses the band subsets and provides the degree of their goodness. Objective function needs careful determination as it has an influence on the performance of the system. It can be dependant or independent on the on learning algorithm. Hence, the objective function can be modelled by dependant evaluation criteria (depends on a predefined learning algorithm) [20, 39]or independent evaluation criteria such as such as, information measures (divergence, entropy or mutual information) [7, 42], distance measures (Bhattacharya distance, Kullback–Leibler divergence, Jeffries–Matusita distance, Hausdorff distance, and Spectral Angle Mapper (SAM)) [13, 22], and dependency measures (correlation measures, similarity measures) [6, 53]. Selection of an effective search strategy is a crucial task in the band selection problem. To optimize the defined objective function, an appropriate optimization algorithm must be chosen to converge to the global optimal solution and escape out of the local optimum.
Tschannerl et al. [42] proposed a modified discrete gravitational search algorithm based a novel unsupervised band selection approach employing maximum information minimum redundancy (MIMR) criterion which outperformed original MIMR with clonal selection algorithm. ABC approach is utilized to address problem of band selection [48]. An improved subspace decomposition is suggested to obtain intuitive band subspaces and ABC is employed as a search strategy. Xu et al. proposed a hybrid band selection approach by combining differential evolution (DE) and PSO in order to effectively combine characteristics of PSO and DE [49]. Sawant et al. [27] used a novel metaheuristic algorithm known as WDO, which is inspired atmospheric motion. WDO has excellent exploitation ability, but loses population diversity as gravitational force used in WDO pulls particles back to the origin. To address this limitation, PSO is combined with WDO and a novel band selection approach based on WDO and PSO is presented in [27]. Medjahed [21] proposed a binary version of CS algorithm to ensure that the selected band subset had a significant classification capability. Medjahed et al. [22] utilized a highly efficient metaheuristic algorithm called as GWO to reduce the dimensionality of hyperspectral image. This procedure has found a better band combination by optimizing five different objective functions. However, these algorithms also have their own limitations. A random population initialization strategy used in above algorithms may lead to the sluggish optimization response and fail to obtain the global optimum solution.
Recently, developing hybrid meta-heuristic algorithms has been gaining increasing attentions for researchers [10, 41, 49]. By combining two or more meta-heuristics, hybrid optimization approach can expand the performance of individual algorithm, and they seem to be more effective in solving complex optimization problems [1, 5, 9]. WDO and CS are population based meta-heuristic algorithms with different search mechanisms. WDO inspires from atmospheric motion and utilizes actual physical equations to search the solution space, whereas CS uses the Levy flight random walk to generate new solutions. The internal search mechanisms results in different characteristics of the two algorithms, i.e., WDO favours global exploration while CS is good at local exploitation. Considering the ability of these two algorithms, a hybrid approach based on WDO and modified CS, called WDOMCS, is proposed to speed up the convergence rate and enhance the optimization performance of the hybrid algorithm, aimed for selecting suitable bands with better classification performance. The main contributions of this manuscript are summarized as follows:
-
1)
The proposed hybrid optimization algorithm utilizes the Chebyshev chaotic map for generation of initial population which helps circumventing the problem of premature convergence.
-
2)
By population division, WDO and CS employ information sharing mechanism which decreases the risk of falling into a local optimal solution.
-
3)
An adaptive step size is employed in Lévy flight random walk which helps enhancing the global searching ability of the standard CS algorithm.
-
4)
A band selection approach for hyperspectral image classification based on WDOMCS is proposed, and the classification accuracy is the optimal with fewer bands.
The remaining part of the manuscript is organized as follows, a brief description about wind driven optimization, cuckoo search algorithm and chaotic map is provided in Section 2. Section 3 discusses the proposed WDOMCS algorithm in detail. Experimental analysis of the standard datasets is discussed in Section 4 and lastly, Section 5presents the conclusions.
2 Technical background
In this section, the technical background of wind driven optimization, cuckoo search algorithm and chaotic map is discussed.
2.1 Wind driven optimization
WDO is population based nature inspired metaheuristic optimization algorithm proposed by Bayraktar et al. [2, 3]. It is easy to implement and highly effective in solving multi-modal and multidimensional problem. The inspiration for WDO comes from atmospheric motion where wind blows in order to balance the imbalances in air pressure. Air flows from high pressure point to low pressure point at a velocity which is proportional to pressure gradient. Based on the above theory, WDO algorithm is derived. It starts with Newton’s second law of motion which states that the acceleration of an air particle as produced by a total force (Ft) is directly proportional to the magnitude of the total force, in the same direction as the total force according to Eq. (1).
Where, ρ is the air density for an infinitesimal air particle and a is the acceleration of an air particle. The relationship of air pressure with the air particle’s density and temperature is given by the ideal gas law,
Where, P is the pressure, R is the universal gas constant and T is the temperature.
There are various forces that are responsible for movement of air particles mainly including pressure gradient force (FPG), friction force (FF), gravitational force (FG) and Coriolis force (FC).The physical equations of the above mentioned forces are expressed as,
Where, ∇P represents the pressure gradient, δV is finite volume of the air particle, urepresents the velocity vector of the wind, and αf is the friction coefficient, grepresents the gravitational acceleration, and Ω is rotation of the Earth.
By including all four forces FG, FPG, FC, and FFin the right hand side of Eq. (1) and Eq. (1) can be expressed as,
The acceleration a in Eq. (1) is expressed as, \( a=\frac{\Delta \boldsymbol{u}}{\Delta t} \). By assuming ∆t = 1 for simplicity and for dimensionless and weightless air particle, δV = 1, the Eq. (7) can be expressed as,
By substituting Eq. (2) in Eq. (8),
For simplicity, pressure P is replaced by pressure value of air particle at current location Pcurr, and then Eq. (9) can be represented as,
In WDO algorithm, velocity and position of air particle update their corresponding values at each iteration. Hence, change in the velocity ∆u can be written as, ∆u = unew − ucurr, where, ucurr is the velocity of air particle at current iteration and unew is the velocity of air particle at next iteration. Then the velocity update equation will be,
Consider, xcurr is current location of air particle, xnew is new location of air particle, g→ and ∇P are vectors, which can be broken down in direction and magnitude as g = |g|(0 − xcurr),− ∇ P = |Popt − Pcurr|(xnew − xcurr), Popt is optimum pressure, Pcurr is the pressure of current location. The influence of the Coriolis force (Ω × u) is replaced by the velocity influence from another dimension, \( {\boldsymbol{u}}_{\mathrm{curr}}^{\mathrm{other}\ \dim } \) . Then, all the coefficients are combined together, c = −2RT and the velocity update equation in Eq. (11) will be expressed as,
Then the air particle location update equation will be,
Where, i is rank among all air parcels based on their pressure values and xnew is the new updated location for next iteration. WDO is similar to other nature inspired optimization algorithms with few control variables that need to be adjusted. When standard WDO is applied to solve the hyperspectral band selection problem, the description of WDO parameters used in the algorithm is given in Table 1 and the hyperspectral band selection using standard WDO is given as Algorithm 1.
2.2 Cuckoo search algorithm
CS is population based bio-inspired metaheuristic algorithm proposed by Yang and Deb [51] as the solution for the global optimization problems. The inspiration for CS comes from the breeding behavior of some cuckoo species. Cuckoo bird put sits eggs in the nests of other bird. If a cuckoo egg is discovered by host bird, then host bird either throws the foreign eggs away or abandons the nest and constructs a new nest. In the standard CS algorithm, each egg of the host bird in a nest represents a solution, and a cuckoo egg denotes a new solution. If a new solution is better than the one in the nest, the worse one will be replaced with new solution. Therefore, the development of the CS algorithm is based the following three main rules: (1) Each cuckoo places one egg in an arbitrarily selected nest. (2) The best nest with good quality eggs is transferred to the next generation. (3) The number of available host’s nests is fixed. The host bird determines the cuckoo egg by a probability of pa ϵ [0, 1]. If the host bird finds a cuckoo egg, the host bird can abandon the nest or throw the egg away, and construct a completely new nest.
In the standard CS algorithm, initially m number of random nest locations are generated representing potential solution, Xi for a cuckoo i. A new solution Xi + 1 is generated by adopting Lévy flight random walk as follows:
Where, α > 0 is a step size scaling factor; the product of ⊕ means entry wise multiplication; levy (λ) is a random walk with the random step length which is drawn from a Lévy distribution. The Levy flight step size coefficient α, which controls the scale of the flight by multiplication and is related to the size of the solution space of the objective function. When a Lévy step is generated using a random number generator, it is first multiplied by α before it is used to generate a new egg. The cuckoo laying eggs corresponds to a new solution set and this is analogous to generating new solutions for a cuckoo i. The levy (λ) is calculated using Mantegna’s algorithm as follows:
Where, λ is fixed parameter with value 3/2. Furthermore, μ and ν are drawn from a normal distribution with zero means and an associated variance, as follows:
Where, Γ(∙) is the standard Gamma function.
As standard CS is applied to solve the hyperspectral band selection problem, each cuckoo represents a band of hyperspectral data and each nest represents a solution of the problem (set of selected bands). Discovery probability pa indicates each band has probability of pa to be successful. The hyperspectral band selection using standard CS is given as Algorithm 2:
2.3 Chaotic map
Chaos is the behavior of nonlinear systems which exhibits randomness and ergodic behavior. Chaotic map is an exceptional solution to the dynamic behavior of the systems that are highly sensitive to the random initial seeds [35]. Most of metaheuristic optimization algorithms involve random initialization of seeds and are highly dependent on initial conditions. So, if the initial solutions are not well selected, they lead to local minimum and poor results. The randomness at initial step leads the algorithm to premature convergence and easily gets trapped into a local optimal solution. Yang proposed a chaotic map based global optimization algorithm which employed chaotic variables instead of random variables, and improved the efficiency of the algorithm [50]. Ergodicity and non-repetition of chaos enables the chaotic map in maintaining the diversity in the population and tackles the problem of premature convergence.
A chaotic map is a mathematical function that shows some kind of chaotic behavior over time characterized by,
In the chaotic sequence given in equation (4), chaotic variable Xt + 1 at time t + 1 depends on chaotic variable Xt at time t. There are several chaotic maps [40], such as iterative chaotic map with infinite collapses (ICMIC), tent map, logistic map, Chebyshev map, gauss map. The Chebyshev chaotic map used in this study is based on preliminary trials. Chebyshev map is defined as:
Where, a is finite constant.
3 Proposed band selection approach
Consider the hyperspectral image dataset which is represented as, χϵRH × W × N, where, H and W are the height and width of hyperspectral image respectively and N is the total number of spectral bands or the feature dimensions. Assume that k classes in the hyperspectral data are denoted as, Ω = [Ω1, Ω2, Ω3, …..Ωk] and n be the selected number of bands denoted as, Xi = [b1, b2, b3, …..bn]. The following subsection discusses in detail the proposed WDOMCS band selection algorithm, and the objective function.
3.1 Objective function or fitness function
The goal of the proposed band selection approach is to explore a finite set of possible subset of bands that maximize the classification accuracy. In this study, the classification accuracy of the classifier is used as an objective function. Classification accuracy is defined as follows [17]:
Where, assess (pi) is the function used to classify pixels pk.
Classify (pk) is the function that gives the class of pk. For the pixel pk with true class, the function assess (pk) = 1, and 0 otherwise.
3.2 The proposed WDOMCS band selection algorithm
Although the standard WDO and CS have their own features and benefits, they also have some limitations. CS has relatively strong local exploration capabilities, but algorithm suffers from premature convergence and gets trapped into local optimal solution. The WDO algorithm has strong exploitation ability, thus combining the searching capabilities of both algorithms seem to be good approach which will certainly lead to the development of better performance algorithms.
The standard WDO and CS start with random population initialization which may cause repeated calculations and can easily fall into local optimal solution. For the optimization problem, the final solution is very sensitive to the initial population. By using the chaotic map, the property of the non-repetition and ergodicity accelerate the search speed by exploring all search space efficiently. Furthermore, a scaling factor α of Lévy flight distribution used in Eq. (14) is fixed in a standard CS algorithm. This parameter has a great influence on the global optimal solution. A large value of α allows cuckoo to explore search space rigorously at the risk of slower convergence. A smaller value of scaling factor causes an algorithm to get stuck into a local optimum solution, thereby leading to premature convergence. Therefore, the selection of the appropriate step size is indispensable to avoid local optimal solution and to enhance the speed of convergence. The scaling factor αnew can be formulated as,
Where,
t is the iteration number.
∝L and ∝U are predefined lower and upper limits of step size, respectively.
favg(t) is the average fitness value of all nests.
fi(t) is the fitness value of ith nest.
fw(t) is the worst fitness value.
The new nest location is computed from Lévy flight and Eq. (1) is re-written as,
It is evident from this adaptation strategy, the moment current solution reaches a near optimum solution, the cuckoos upgrade their exploration around the current best solution using a smaller step size. On the other hand, a large step size is used when the current solution is less satisfactory than the average fitness value in order to inspire the cuckoos to explore a search space more rigorously. The term t is involved to encourage more exploitation. Also, the position and fitness value of the new cuckoo egg remain unchanged if it goes out of the search space. In this manner, CS can make full use of the previous excellent host bird’s nests. Therefore, inclusion of adaptive Lévy flight in the standard CS can help the algorithm quickly converge to the global optimal solution.
In this manuscript, a hybrid metaheuristic algorithm called WDOMCS is proposed aiming at combining both the exploration of CS and exploitation of WDO. In general, hybrid optimization algorithms can utilize the characteristics of the original algorithm. The main goal of hybridization is to complement each other’s characteristics. The hybridization approach is as follows: In the beginning, the initial population is generated using Chebyshev chaotic map, and the whole population is divided into two equal subgroups, Group A and Group B. WDO and modified version of CS are adopted for these two subgroups independently. By division, these two subgroups can share suitable information and utilize each other’s pros, thus avoid premature convergence, and obtain best optimal solution. The steps involved in the WDOMCS are presented in Algorithm 3. The flowchart of WDOMCS process is shown in Fig.1.
4 Result and discussion
The proposed WDOMCS algorithm was tested on three different hyperspectral remote sensing datasets, Indian pines, Pavia University and Botswana [11] for assessment of the effectiveness of the proposed band selection approach. Details of the datasets and comprehensive results are discussed in following subsections.
4.1 Dataset description and experimental setup
Table 2 summarizes the details of the standard bench-mark hyperspectral datasets, Indian pines, Pavia University and Botswana.
All the experiments are conducted using MATLAB platform version R2018b on Intel Xeon processor 2.90 GHz CPU along with 128 GB RAM in Windows 10 (64 bit) environment. The performance of the proposed method is compared with those of the other five metaheuristic band selection approaches, i.e. GA, PSO, GWO, WDO, and CS.
In the beginning, 10% number of samples of each land cover class from the Indian pine dataset, Pavia University and Botswana datasets are randomly chosen as training samples for validation of the efficacy of the proposed technique with less amount of labelled data. The rest of the samples are considered as testing data. The experimentation is conducted ten times for the evaluation of the average of Overall Accuracy (OA), Average Accuracy (AA), class-wise accuracy, and kappa coefficient κ. To perform the classification task using selected bands, one-against-all Support Vector Machine (SVM) is employed as classifier [23]. The SVM classifier is selected due to its excellent performance when dealing with small-sized training data. To compare fairly, we have conducted the experiments on the final bands searched by their methods, but using the same classifier of SVM. In SVM, Radial Basis Function (RBF) is used as its kernel function. The parameters C and γ in SVM are determined using fivefold cross validation method. The parameters of each algorithm are summarized in Table 3.
4.2 Experimental analysis
This section discusses the classification results obtained by the proposed approach and commonly used metaheuristic approaches, comparison of the proposed band selection approach with commonly used band selection approaches, WDOMCS parameter analysis and convergence analysis.
4.2.1 Classification performance
The classification results obtained by the proposed band selection approach WDOMCS are compared with other metaheuristic approaches to assess the effectiveness of the WDOMCS. GA, PSO, GWO, WDO, CS, and WDOMCS are executed by selecting 20 bands on the Indian Pines, Pavia University and Botswana dataset. The OA, AA and κ are measured in every case alongside the individual class accuracies. Results are presented in Tables 4, 5 and 6 and the best value for each metric is recorded in bold. It is observed from Table 4 that, GA performs worst on Indian Pines dataset, whereas PSO and GWO show similar performances. It is observed from Table 5 that, GA and GWO perform worst on Pavia University dataset; however, PSO shows satisfactory performance. On the other hand, Table 6 shows that, performance of GA and PSO is inferior to that of GWO. Moreover, the performance of GWO is almost similar to that of CS.
According to the results in Tables 4, 5 and 6, the optimization abilities of WDO and CS are better than GA, PSO and GWO and the OA value exceeded 80% for Indian Pines dataset, 94% for Pavia University dataset and 92% for Botswana dataset. WDO and CS require a few control parameters; however, the exploration ability of both algorithms is restricted to the initial setting. In WDO, air particles simultaneously walk around the universe, and explore the entire search space without getting trapped at the boundary. In MCS, adaptive Lévy flight strategy helps to maintain balance between exploration and exploitation and enhances the searching ability to obtain global best solution. Although, the WDO and CS give better results, they have not obtained theoretical optimal value. Thus, WDOMCS obtains the highest fitness value for all datasets, which illustrates that the excellent optimization ability of WDOMCS in comparison with other metaheuristic approaches. In contrast to this excellent performance, the classification performance obtained by the standard WDO and CS algorithm with the random initialization is inferior to that of proposed WDOMCS approach. This substantial difference indicates that, the classification performance obtained by the standard WDO and CS algorithm is deteriorated due to the randomness introduced in the selection of initial population. It is evident from Tables 4, 5 and 6, the proposed WDOMCS algorithm has strong capability to obtain an optimal solutions. Among all metaheuristic approaches, the proposed WDOMCS algorithm has achieved excellent classification performance in terms of OA, AA and κ. The proposed WDOMCS algorithm has achieved the maximum OA of 81.67% for Indian Pines dataset, 94.17% for Pavia University dataset, and 92.64% for Botswana dataset. It can be inferred from Tables 4, 5 and 6 that, the use of the initialization strategy, adaptive Lévy flight strategy and hybridization of WDO and CS have greater influence on the optimization ability of an algorithm.
The classification maps produced by all the above mentioned band selection approaches on Indian Pines, Pavia University and Botswana dataset are shown in Figs. 2, 3 and 4, respectively. The proposed approach WDOMCS shows a smooth classification map than other competing band selection approaches. For instance, the region of class “Woods” as shown in Fig. 2f (denoted by brown arrow) is smoother than those of the other approaches. Similarly, the region of class “Meadows” as shown in Fig. 3f (denoted by red arrow) is smoother than other approaches.
4.2.2 Comparison of the proposed band selection approach with commonly used band selection approaches
To validate the effectiveness of the proposed WDOMCS approach, the classification performance of the WDOMCS is compared with commonly used band selection approaches such as Ward’s linkage strategy using divergence (WaLuDi) [18], minimum-redundancy maximum-relevance (mRMR) [19], and Relief algorithm [12]. Figure 5a, b and c present the OA, AA and κ for three hyperspectral datasets using above mentioned band selection approaches. It is observed that the classification accuracy by the proposed WDOMCS shows significant improvement that by using the full spectral bands, which is higher than 4% for Indian Pines, 13% for Pavia University and 1% for Botswana dataset. More importantly, the number selected of bands is greatly reduced in comparison with original spectral bands. Among all the band selection approaches, WDOMCS attains excellent classification performance by selecting most discriminative bands. Although the number of selected bands is the same, OA is lower than 80% for Indian Pines, less than 90% for Pavia University and Botswana datasets when using Relief, mRMR, and WaLuDi approaches. Similarly, AA is less than 62% for Indian Pines, lower than 89% for Pavia University and Botswana datasets when using Relief, mRMR, and WaLuDi approaches. Furthermore, kappa coefficient is lower than 76% for Indian Pines, less than 88% for Pavia University and Botswana datasets when using Relief, mRMR, and WaLuDi approaches. In short, we can say that WDOMCS has the excellent searching ability and optimization performance, which makes it better choice to solve the band selection problem.
4.2.3 WDOMCS parameter analysis
The population size Ps, number of iterations Tare the two main parameters influencing the performance of WDOMCS. Both parameters have been analysed in Fig. 6a and b for a fixed number of 20 bands on the Botswana dataset. As expected, classification accuracy increases with increase in Ps and T. They are chosen to be as minimal as possible to attain maximum optimisation capacity with minimal computational effort. Based on these results, Ps is set to 20 and T is set to 100.
4.2.4 Convergence analysis
In this section, optimization performance of the WDOMCS is discussed with respect to convergence analysis. The proposed WDOMCS includes hybridization of the standard WDO and CS algorithm. In addition, Chebyshev chaotic map based population initialization phase and one modification to the standard CS algorithm were adopted to speed up the convergence process. The ergodicity and stochastic properties exhibited by chaos helps the algorithm to reach its global best optimal solution. A modification to the standard CS algorithm includes the use of an adaptive Lévy flight strategy based on the fitness value of previous iteration, which will helps in increasing the convergence rate by exploring around the most likely solution space. In this section, the influence of each modification on the standard algorithms (M1: WDO algorithm with Chebyshev chaotic initialization, M2: CS algorithm with Chebyshev chaotic initialization, M3: CS algorithm with adaptive Lévy flight strategy) are studied. The convergence plots using the above three modifications and the standard WDO, CS and WDOMCS algorithm for Botswana dataset are shown in Fig. 7. It is evident from Fig. 7, the use of Chebyshev chaotic map based population initialization have reached a near attainable global solution without bringing any substantial enhancement to the convergence rate in comparison with the standard WDO. Similarly, CS algorithm Chebyshev chaotic map based population initialization attained maximum fitness value without getting trapped in the local optimum. Whereas, the convergence rate of M2 is slower than that of standard CS algorithm. The Chebyshev chaotic map based population initialization strategy can improve the searching ability because it generate non-repeated solution and improve the quality of the initial feasible solution. The use of adaptive Lévy flights strategy in M3provided more stable convergence rate. The adaptive step size encourages the cuckoos to explore more rigorously to find a new solution. However, in comparison with modification M2, it failed to attain the global optimal solution. Therefore, the inclusion of all three modifications to the standard WDO and CS algorithm lead to a more stable and reliable optimization performance as expected.
All the approaches except WDOMCS have obtained almost same solution at the end iteration; however convergence speed slightly varies. As can be clearly seen from Fig. 7, WDOMCS algorithm converges quickly and reaches global optimal solution. The Chebyshev chaotic map generates non-random population and significantly increases the convergence rate. Adaptive step size helps algorithm to come out of local optima, overcomes premature convergence and improves quality of solution significantly. Furthermore, hybridization of the WDO and CS leads to exchange of useful information across the search space. Therefore, with these modifications, WDOMCS obtains optimal solution quickly in comparison with other metaheuristic approaches. The fitness curve of WDOMCS proved that, the exploitation capability of the algorithm has been enhanced which succeed in reducing premature convergence problem as well as improving the convergence rate.
5 Conclusion
In this paper, a novel hybrid optimization approach is proposed based on WDO and MCS algorithm for solving the problem of band selection in hyperspectral image. In WDOMCS, the randomness of initial population is modified through use of the chaotic map. Furthermore, the searching ability of CS algorithm is improved by adaptively adjusting the Levy flight step size. WDOMCS can enhance the ability of exploitation and avoid getting trapped into local optimal solution by dividing the population, grouping update and combining the population to share the useful information between individuals. The experimental results have been compared with other band selection techniques optimized by GA, PSO, GWO, WDO and CS. Compared to GA, PSO, GWO, WDO and CS algorithm, the WDOMCS is more effective in obtaining better solutions and convergence speed of WDOMCS is higher. This hybridization presented a better optimization performance and robustness and significantly enhances the optimization capability of original WDO and CS algorithms.
Although in this paper the hybrid WDOMCS algorithm was implemented for solving the problem of hyperspectral band selection, there are still many aspects worthy of our study. Firstly, we will focus on various initialization strategies in order to further improve the performance of optimization. Secondly, multi-objective optimization model is needed to find an optimal trade-offs between conflicting issues such as, information preserving and redundancy reducing for choosing optimal band subsets. Also, we will further explore various strategies for combining spectral and spatial information to improve the classification performance. Lastly, we will further explore our proposed approach to various multimedia applications such as, image segmentation [52], image detection [26], and image dehaze [54].
References
Abed-alguni BH, Alkhateeb F (2018) Intelligent hybrid cuckoo search and b -hill climbing algorithm. J King Saud Univ - Comput Inf Sci. https://doi.org/10.1016/j.jksuci.2018.05.003
Bayraktar Z, Komurcu M, Werner DH (2010) Wind Driven Optimization (WDO): a novel nature-inspired optimization algorithm and its application to electromagnetics, 2010 IEEE Antennas Propag. Soc Int Symp:1–4. https://doi.org/10.1109/APS.2010.5562213
Bayraktar Z, Komurcu M, Bossard JA, Werner DH (2013) The wind driven optimization technique and its application in electromagnetics. IEEE Trans Antennas Propag 61:2745–2757. https://doi.org/10.1109/TAP.2013.2238654
Boggavarapu LNP, Manoharan P (2020) Classification of hyper spectral remote sensing imagery using intrinsic parameter estimation. Springer International Publishing, Berlin. https://doi.org/10.1007/978-3-030-16660-1
Bouyer A, Hatamlou A (2018) An efficient hybrid clustering method based on improved cuckoo optimization and modified particle swarm optimization algorithms. Appl Soft Comput J 67:172–182. https://doi.org/10.1016/j.asoc.2018.03.011
Chang CI, Wang S (2006) Constrained band selection for hyperspectral imagery. IEEE Trans Geosci Remote Sens 44:1575–1585. https://doi.org/10.1109/TGRS.2006.864389
Feng J, Jiao LC, Zhang X, Sun T (2014) Hyperspectral band selection based on trivariate mutual information and clonal selection. IEEE Trans Geosci Remote Sens 52:4092–4115. https://doi.org/10.1109/TGRS.2013.2279591
Feng S, Itoh Y, Parente M, Duarte MF (2017) Hyperspectral band selection from statistical wavelet models. IEEE Trans Geosci Remote Sens 55:2111–2123. https://doi.org/10.1109/TGRS.2016.2636850
Garg H (2019) A hybrid GSA-GA algorithm for constrained optimization problems a hybrid GSA-GA algorithm for constrained optimization problems. Inf Sci (Ny) 478:499–523. https://doi.org/10.1016/j.ins.2018.11.041
Ghamisi P, Benediktsson JA (2015) Feature selection based on hybridization of genetic algorithm and particle swarm optimization. IEEE Geosci Remote Sens Lett 12:309–313. https://doi.org/10.1109/LGRS.2014.2337320
http://www.ehu.eus/ccwintco/index.php/Hyperspectral_Remote_Sensing_Scenes (n.d.)
Jia J, Yang N, Zhang C, Yue A, Yang J, Zhu D (2013) Object-oriented feature selection of high spatial resolution images using an improved relief algorithm. Math Comput Model 58:619–626. https://doi.org/10.1016/j.mcm.2011.10.045
Kailath T (1967) The divergence and Bhattacharyya distance measures in signal selection. IEEE Trans Commun Technol 15:52–60. https://doi.org/10.1109/TCOM.1967.1089532
Kumar BLNP (2020) Hyperspectral image classification using fuzzy-embedded hyperbolic sigmoid nonlinear principal component and weighted least squares approach. J Appl Remote Sens 14(2):024501. https://doi.org/10.1117/1.JRS.14.024501
Kumar BLNP, Manoharan P (2020) A new framework for hyperspectral image classification using Gabor embedded patch-based convolution neural network. Infrared Phys Technol 110:103455. https://doi.org/10.1016/j.infrared.2020.103455
Li S, Wu H, Wan D, Zhu J (2011) Knowledge-based systems an effective feature selection method for hyperspectral image classification based on genetic algorithm and support vector machine. Knowledge-Based Syst 24:40–48. https://doi.org/10.1016/j.knosys.2010.07.003
Liu Y, Wang G, Chen H, Dong H, Zhu X, Wang S (2011) An improved particle swarm optimization for feature selection. J Bionic Eng 8:191–200. https://doi.org/10.1016/S1672-6529(11)60020-6
MartÍnez-UsÓMartinez-Uso A, Pla F, Sotoca JM, GarcÍa-Sevilla P (2007) Clustering-based Hyperspectral band selection using information measures. IEEE Trans Geosci Remote Sens 45:4158–4171. https://doi.org/10.1109/TGRS.2007.904951
Max-dependency C (2005) Feat Select Based Mutual Info 27:1226–1238
Medjahed SA, Ouali M (2018) Band selection based on optimization approach for hyperspectral image classification. Egypt J Remote Sens Sp Sci. https://doi.org/10.1016/j.ejrs.2018.01.003
Medjahed SA, Saadi TA, Benyettou A, Ouali M (2015) Binary cuckoo search algorithm for band selection in hyperspectral image classification. IAENG Int J Comput Sci 42:1–9
Medjahed SA, Ait Saadi T, Benyettou A, Ouali M (2016) Gray wolf optimizer for hyperspectral band selection. Appl Soft Comput J 40:178–186. https://doi.org/10.1016/j.asoc.2015.09.045
Melgani F, Bruzzone L (2004) Classification Hyperspect Remote Sens 42:1778–1790
Manoharan Prabukumar, Shrutika Sawant, Sathishkumar Samiappan, Loganathan Agilandeeswari(2018) Three-dimensional discrete cosine transform-based feature extraction for hyperspectral image classification, J. Appl. Remote Sens. 12(4):046010 (2018). https://doi.org/10.1117/1.JRS.12.046010
Prabukumar M, Sawant SS (2018) Band clustering using expectation – maximization algorithm and weighted average fusion-based feature extraction for hyperspectral image classification. J Appl Remote Sens 12. https://doi.org/10.1117/1.JRS.12.
Quan Q, He F, Li H (2020) A multi-phase blending method with incremental intensity for training detection networks. Vis Comput. https://doi.org/10.1007/s00371-020-01796-7
Sawant SS, Manoharan P (2019) New framework for hyperspectral band selection using modified wind-driven optimization algorithm. Int J Remote Sens 00:1–22. https://doi.org/10.1080/01431161.2019.1607609
Sawant SS, Manoharan P (2020) Unsupervised band selection based on weighted information entropy and 3D discrete cosine transform for hyperspectral image classification. Int J Remote Sens 41:3948–3969. https://doi.org/10.1080/01431161.2019.1711242
Sawant SS, Prabukumar M (2017) Semi-supervised techniques based hyper-spectral image classification: a survey, in: 2017 Innovations in Power and Advanced Computing Technologies (i-PACT), Vellore, 2017, pp 1-8. https://doi.org/10.1109/IPACT.2017.8244999
Sawant Shrutika, Manoharan Prabukumar (2020) A Review on Graph-Based Semi-Supervised Learning Methods for Hyperspectral Image Classification, The Egyptian Journal of Remote Sensing and Space Sciences, 23: 243-248. https://doi.org/10.1016/j.ejrs.2018.11.001
Sawant S, Prabukumar M (2020) A survey of band selection techniques for hyperspectral image classification. J Spectr Imaging 1:1–18. https://doi.org/10.1255/jsi.2020.a5
Sawant S, Manoharan P, Samiappan S (2018) Ranking and Grouping based Feature Selection for Hyperspectral Image Classification, Proceedings Asian Conference on Remote Sensing, pp. 2305–2313
Sawant S, Manoharan P, Samiappan S (2019) A band selection method for hyperspectral image classification based on cuckoo search algorithm with correlation based in- itialization,in: 10th workshop on Hyperspectral imaging and signal processing: Evolution in remote sensing (WHISPERS), Amsterdam, Netherlands, pp. 1–4.
Sawant S, Prabukumar M, Samiappan S (2020) A modified cuckoo search algorithm based optimal band subset selection approach for hyperspectral image classification. J Spectr Imaging 1:1–20. https://doi.org/10.1255/jsi.2020.a6
Science N, Phenomena C, Walton S, Hassan O, Morgan K, Brown MR (2011) Chaos , Solitons & Fractals Modified cuckoo search : A new gradient free optimisation algorithm, Chaos, Solitons Fractals Interdiscip. J Nonlinear Sci Nonequilibrium Complex Phenom 44:710–718. https://doi.org/10.1016/j.chaos.2011.06.004
Senthil V, Ganesan K, Vasuki S (2018) Maximin distance based band selection for endmember extraction in hyperspectral images using simplex growing algorithm, 7221–7237. https://doi.org/10.1007/s11042-017-4630-0
Su H, Du Q, Chen G, Du P (2014) Optimized hyperspectral band selection using particle swarm optimization. IEEE J Sel Top Appl Earth Obs Remote Sens 7:2659–2670. https://doi.org/10.1109/JSTARS.2014.2312539
Su H, Cai Y, Du Q (2017) Firefly-algorithm-inspired framework with band selection and extreme learning machine for Hyperspectral image classification. IEEE J Sel Top Appl Earth Obs Remote Sens 10:309–320. https://doi.org/10.1109/JSTARS.2016.2591004
Sui C, Tian Y, Xu Y, Xie Y (2015) Unsupervised band selection by integrating the overall accuracy and redundancy. IEEE Geosci Remote Sens Lett 12:185–189. https://doi.org/10.1109/LGRS.2014.2331674
Tavazoei MS, Haeri M (2007) Comparison of different one-dimensional maps as chaotic search pattern in chaos optimization algorithms, 187: 1076–1085. https://doi.org/10.1016/j.amc.2006.09.087
Taylor P (2013) Hybrid genetic algorithm for feature selection with hyperspectral data 37–41. https://doi.org/10.1080/2150704X.2013.777485.
Tschannerl J, Ren J, Yuen P, Sun G, Zhao H, Yang Z, Wang Z, Marshall S (2019) MIMR-DGSA : unsupervised hyperspectral band selection based on information theory and a modified discrete gravitational search algorithm. Inf Fusion 51:189–200. https://doi.org/10.1016/j.inffus.2019.02.005
Vaddi R, Manoharan P (2020) Probabilistic PCA based hyperspectral image Classification for remote sensing applications. Springer International Publishing, Berlin. https://doi.org/10.1007/978-3-030-16660-1
Vaddi R, Manoharan P (2020) Hyperspectral image classification using CNN with spectral and spatial features integration. Infrared Phys Technol 107:103296. https://doi.org/10.1016/j.infrared.2020.103296
Vaddi R, Manoharan P (2020) CNN based hyperspectral image classification using unsupervised band selection and structure-preserving spatial features, Infrared Phys Technol 10:103457. https://doi.org/10.1016/j.infrared.2020.103457
Veera Senthil Kumar G, Vasuki S (2017) Clustering based band selection for endmember extraction using simplex growing algorithm in hyperspectral images, 8355–8371. https://doi.org/10.1007/s11042-016-3420-4
Wang Q, S. Member, Lin J, S. Member, Yuan Y, S. Member (2016) Salient Band Selection for Hyperspectral Image Classification via Manifold Ranking 27: 1279–1289.
Xie F, Li F, Lei C, Yang J, Zhang Y (2019) Unsupervised band selection based on artificial bee colony algorithm for hyperspectral image classification. Appl Soft Comput J 75:428–440. https://doi.org/10.1016/j.asoc.2018.11.014
Xu M, Sun Q, He Z, Shi J (2016) Band selection for hyperspectral images based on particle swarm optimization and differential evolution algorithms with hybrid encoding, 16: 629–640. https://doi.org/10.3233/JCM-160645
Yang D, Li G, Cheng G (2007) On the efficiency of chaos optimization algorithms for global optimization, 34: 1366–1375. https://doi.org/10.1016/j.chaos.2006.04.057
Yang X, Deb S, A.C.B. Behaviour (2009) Cuckoo Search via Lévy Flights 210–214
H. Yu, F. He, Pan Y (2019) A scalable region-based level set method using adaptive bilateral filter for noisy image segmentation
Zhang W, Li X, Zhao L (2018) A fast Hyperspectral feature selection method based on band correlation analysis. IEEE Geosci Remote Sens Lett PP 15:1–5. https://doi.org/10.1109/LGRS.2018.2853805
Zhang J, He F, Chen Y (2020) A new haze removal approach for sky / river alike scenes based on external and internal clues, 2085–2107
Acknowledgments
The authors thank the Council of Scientific & Industrial Research (CSIR), New Delhi, India for the award of CSIR-SRF and VIT for providing a VIT seed grant for carrying out this research work.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Sawant, S., Manoharan, P. A hybrid optimization approach for hyperspectral band selection based on wind driven optimization and modified cuckoo search optimization. Multimed Tools Appl 80, 1725–1748 (2021). https://doi.org/10.1007/s11042-020-09705-9
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-020-09705-9