Introduction

Segmentation plays an important role in the field of image processing1. Segmentation is the process of separating an image into two or more homogeneous segments based on the characteristics of its pixels. It is utilized in various fields, such as industry and medicine2, agriculture3, and surveillance4. Thresholding is one of the most common image segmentation approaches. To define the thresholds, most methods use the histogram of the image5, which is vital for determining the probability distribution of pixel values in the image6. Thresholding extracts information from the histogram of an image and determines the best threshold (th) for categorizing the pixels into various groups. Image thresholding approaches can be categorized into two types: bi-level and multi-level thresholding. Bi-level thresholding techniques use one threshold to separate an image into two groups, whereas multi-level thresholding (MTH) uses two or more thresholds to separate an image into many groups1.

To obtain the best threshold values in MTH segmentation, thresholding techniques can be classified into two approaches: parametric and non-parametric. In parametric techniques, each group of the grayscale range is assumed to follow a Gaussian distribution, and the histogram is evaluated using mathematical operations. The Gaussian mixture model is the most widespread choice: it defines a set of functions that approximate the histogram, after which the best thresholds are selected. Non-parametric approaches employ distinct criteria to separate the pixels into homogeneous areas; the best threshold is then defined using statistical information, such as entropy or variance. The Kapur method7 and Otsu method8 are used in this study. The Otsu method selects the best thresholds by maximizing the variance among groups, whereas the Kapur method defines the threshold values by maximizing the entropy of the segmented classes. These methods are efficient for one or two threshold values. However, they have several restrictions; for example, they become computationally expensive as the number of thresholds increases. Non-parametric techniques nevertheless have several advantages; in particular, they are computationally faster than parametric methods, especially when embedded in optimization problems. Metaheuristic algorithms (MAs) can be used to drive the search process, and generally provide better results than classical thresholding techniques9,10.

Metaheuristic algorithms are used to solve challenging real-world problems. In the past several decades, researchers have extensively demonstrated the ability of MAs to solve several types of difficult optimization problems in various areas, such as optimization11, communications12, bioinformatics13, drug design14, image segmentation15,16, and feature selection17, mainly because these algorithms are general-purpose and easy to implement18. MAs are commonly inspired by nature and can be classified into four main categories: evolutionary-based, swarm-based, physics-based, and human-based algorithms. Evolutionary-based algorithms use mechanisms inspired by biological evolution, such as recombination, crossover, mutation, and the inheritance of features in offspring19. Candidate solutions to optimization problems are represented as individuals in a population, and the quality of the solutions is determined by the fitness function. Two main evolutionary-based algorithms are differential evolution (DE)20 and the genetic algorithm (GA)21. Swarm-based algorithms, in contrast, mimic the collective behavior of living creatures, which interact with each other in nature to achieve optimal group behavior22. A representative example is particle swarm optimization (PSO)23, which mimics the flocking and schooling behavior of birds and fish. Physics-based algorithms are generally inspired by physical laws that generate operators for searching the solution space24,25. Some of the most common algorithms in this branch are the gravitational search algorithm (GSA)26 and electromagnetism optimization (EMO)27. Human-based algorithms are inspired by human social behavior. Common and recently used algorithms in this category are teaching–learning-based optimization (TLBO)28 and the heap-based optimizer (HBO)29.

With respect to MTH in image processing, it is possible to use thresholding approaches such as the Otsu or Kapur method30 as the objective function. The problem is not only concerned with the increased number of thresholds but is also related to the image itself; each image constitutes an autonomous problem for a given number of threshold levels31. The optimal segmentation threshold values must be highly accurate in most applications. Therefore, the use of MAs has expanded in this field. The moth swarm algorithm was used in32 to obtain the best threshold values with the Kapur method. In addition, a modified firefly algorithm was proposed in33 for image processing, using the Kapur and Otsu methods as objective functions. In34, ant colony optimization was used for image segmentation based on a multi-threshold method with Kapur entropy and a non-local two-dimensional histogram. In35, the researchers used a novel concept called a hyper-heuristic for MTH image segmentation, in which each iteration determined the optimal execution sequence of MAs to find the best threshold values.

In10, the black widow optimization algorithm was proposed to determine the optimal multi-level thresholds using the Kapur or Otsu method as the objective function. In36, the crow search algorithm was utilized in conjunction with the Kapur approach to obtain the optimal threshold values. In37, the authors proposed the efficient krill herd algorithm to determine the best thresholds at various levels for color images, where the Tsallis entropy, Otsu method, and Kapur entropy were utilized as fitness functions. In38, Harris hawks optimization (HHO), a recent algorithm, was hybridized with another powerful algorithm, differential evolution (DE): the entire population was split into two equal subpopulations, which were assigned to the HHO and DE algorithms, respectively, and the Otsu and Kapur approaches were used as fitness functions. In39, the authors combined the classical Otsu method with an energy curve for multilevel thresholding segmentation of colored images. In40, the water cycle algorithm (WCA) was integrated with Masi entropy (Masi-WCA) and Tsallis entropy to segment color images; the experimental results proved the superiority of the WCA with Masi entropy for multilevel thresholding compared to other competitive algorithms. The authors in41 used a multi-verse optimizer (MVO) algorithm based on the Energy-MCE thresholding approach to search for accurate and near-optimal thresholds for segmentation.

In the same context, Elaziz et al.42 proposed DE as a technique to select the best MAs for determining the optimal threshold with the Otsu method. Opposition-based learning (OBL) is one of the most effective methods for improving the search efficiency of metaheuristic algorithms43. A hyper-heuristic method based on a genetic algorithm was presented in44; it evaluates various MAs to determine the optimal threshold for each image with a predetermined th value using the Otsu method. In45, a new, efficient version of the recent chimp optimization algorithm (ChOA), called the opposition-based Lévy flight chimp optimizer (IChOA), was proposed to overcome the weaknesses of the original ChOA; the IChOA was applied to solve the MTH problem using the Otsu and Kapur methods as objective functions. In this paper, several MAs, including SCA, MFO, SSA, and EMO, are likewise combined with the Otsu method for comparison. As mentioned, the utilization of MAs in MTH is growing rapidly, and a summary of various approaches can be found in46.

According to the No Free Lunch theorem, no single algorithm is ideal for every problem47. For this reason, any algorithm must be evaluated on a real problem to demonstrate its performance. MTH methods based on OBL are frequently used to solve a diversity of other optimization problems. Therefore, this paper seeks to further the research in the image segmentation field by utilizing the recent heap-based optimizer (HBO), introduced in29. This algorithm mimics the job responsibilities and descriptions of employees: the staff are coordinated in a hierarchy, and a nonlinear tree-shaped data structure is used to represent the heap. A benefit of this algorithm is that solutions with unsuitable fitness are removed from the hierarchy, leading to improved convergence speed. Based on the advantages of the HBO and the No Free Lunch theorem, this paper presents an alternative version of HBO, called the IHBO algorithm, to discover the optimal solutions of complex MTH problems and overcome the weaknesses of the original HBO.

The proposed method for MTH based on the HBO is called IHBO; it applies the Kapur and Otsu methods individually to obtain the optimal thresholds for benchmark images. IHBO explores the search space defined by the image histogram to provide the best threshold values using a set of operators inspired by the human career hierarchy. The performance of IHBO is evaluated through various tests on benchmark images with many levels of complexity. The segmentation results are assessed using various metrics, such as the structural similarity (SSIM) index48, the feature similarity (FSIM) index49, and the peak signal-to-noise ratio (PSNR)50. Furthermore, the IHBO algorithm was evaluated on the CEC’2020 test suite and compared against seven well-known metaheuristic algorithms, including the basic HBO29, SSA51, MFO52, GWO53, SCA54, HS55, and EMO27. The evaluations were performed using various non-parametric statistical techniques to determine whether the optimal solutions provided by the IHBO are superior.

The main contributions of this paper can be summarized as follows:

  • An efficient HBO variant based on OBL, called IHBO, is presented to overcome the weaknesses of the original HBO.

  • The effectiveness of IHBO is evaluated on the CEC’2020 test suite.

  • IHBO is proposed to reduce the high computational cost of MTH.

  • The ability of the IHBO to solve image segmentation problems is demonstrated using Kapur’s entropy and Otsu’s method as fitness functions.

  • Image quality is verified using the FSIM, PSNR, and SSIM metrics to confirm the quality of the obtained solutions.

  • The performance of the proposed method is evaluated at various segmentation levels to estimate the stability of the optimizer and the quality of the segmentation.

The remainder of this paper is organized as follows. “Preliminaries” section describes the materials and methods used in this study, while “The proposed IHBO algorithm” section presents the proposed algorithm. “Environmental and experimental requirements” section illustrates the environmental and experimental requirements, while “Experimental results and discussion” section presents the performance evaluation and experimental results. Finally, conclusions and proposals for future work are provided in “Conclusions and future works” section.

Preliminaries

This section introduces the materials required to implement the proposed segmentation method, as well as the approaches on which it is based.

Objective functions formulation

The entropy criterion of the Kapur7 approach and the between-class variance of the Otsu8 approach are widely utilized to determine the optimal threshold value th in image segmentation. Both were originally developed for bi-level thresholding but can be readily extended to solve MTH problems.

Otsu method for segmentation

The Otsu method is an automatic, non-parametric technique used to determine the optimal thresholds of an image8. It uses the maximum variance between classes as the criterion to segment the image. The L intensity levels are taken from a grayscale image, and the equation below gives the probability distribution of each intensity value:

$$\begin{aligned} Ph_i = \frac{n_i}{nk}, \quad Ph_i \ge 0, \quad \sum \limits _{i = 1}^{L} Ph_i = 1, \end{aligned}$$
(1)

where i is a specific intensity level in the range \(0 \le i \le L-1\), \(n_i\) is the number of pixels with gray level i in the image, nk is the total number of pixels in the image, and \(Ph_i\) is the probability distribution of the intensity levels. For the simplest (bi-level) segmentation, the two classes are represented as

$$\begin{aligned} C_1 = \frac{Ph_1}{\omega _0(th)},\ldots ,\frac{Ph_{th}}{\omega _0(th)} \text { and } C_2 = \frac{Ph_{th + 1}}{\omega _1(th)},\ldots ,\frac{Ph_L}{\omega _1(th)}. \end{aligned}$$
(2)

The probability distributions \(\omega _0(th)\) and \(\omega _1(th)\) of classes \(C_1\) and \(C_2\), respectively, are illustrated in (3).

$$\begin{aligned} \omega _0(th)=\sum _{i=1}^{th}Ph_i \text { and } \omega _1(th)=\sum _{i=th+1}^{L}Ph_i. \end{aligned}$$
(3)

It is necessary to calculate the mean levels \(\mu _0\) and \(\mu _1\) of the classes using (4). Once these values are calculated, the Otsu between-class variance \(\sigma _B^{2}\) is calculated using (5) as follows:

$$\begin{aligned} \mu _0= & {} \sum _{i=1}^{th} \dfrac{ iPh_i}{\omega _0(th)} \text { and } \mu _1=\sum _{i=th+1}^L \dfrac{iPh_i}{ \omega _1(th)}\end{aligned}$$
(4)
$$\begin{aligned} \sigma _B^{2}= & {} \sigma _1+\sigma _2 \end{aligned}$$
(5)

Moreover, \(\sigma _1\) and \(\sigma _2\) in (5) indicate the variance of regions \(C_1\) and \(C_2\), and are calculated as

$$\begin{aligned} \sigma _1=\omega _0 (\mu _0 -\mu _T)^2 \text { and } \sigma _2=\omega _1(\mu _1 -\mu _T)^2, \end{aligned}$$
(6)

where \(\mu _T=\omega _0\mu _0+\omega _1\mu _1\) and \(\omega _0+\omega _1=1\). Based on the values \(\sigma _1\) and \(\sigma _2\), (7) provides the fitness function, and the optimization problem reduces to determining the intensity level that maximizes (7):

$$\begin{aligned} F_{Otsu}(th)=Max(\sigma _B^{2}(th)) \text { where } 0 \le th \le L-1, \end{aligned}$$
(7)

where \(\sigma _B^{2}(th)\) is the Otsu between-class variance for a given th value. Optimization methods are then used to determine the intensity level th that maximizes the fitness function in (7). The fitness (objective) function \(F_{Otsu}(th)\) can be extended for MTH as follows:

$$\begin{aligned} F_{Otsu}(TH)=Max(\sigma _B^{2}(TH)), \quad 0 \le th_i \le L-1, \ i=1,2,\ldots ,n-1, \end{aligned}$$
(8)

where \(TH= [th_1,th_2,\ldots ,th_{n-1}]\) is a vector of multiple thresholds, and the variance is calculated as illustrated in (9).

$$\begin{aligned} \sigma _B^{2}=\sum _{i=1}^n \sigma _i = \sum _{i=1}^n \omega _i(\mu _i -\mu _T)^2. \end{aligned}$$
(9)

Here, i indexes a class, \(\omega _i\) is its occurrence probability, and \(\mu _i\) is its mean. For MTH, these values are obtained as

$$\begin{aligned} \omega _{n-1}(th)=\sum _{i=th_n +1}^L Ph_i \end{aligned}$$
(10)

and

$$\begin{aligned} \mu _{n-1}=\sum _{i=th_{n}+1}^L \dfrac{iPh_i}{\omega _{n-1}(th_n) }. \end{aligned}$$
(11)
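To make the objective concrete, the minimal NumPy sketch below evaluates the multilevel between-class variance of Eqs. (8)–(11) for a candidate threshold vector. It assumes a normalized 256-bin histogram as in Eq. (1); the function and variable names are illustrative, not taken from the paper’s implementation.

```python
import numpy as np

def otsu_objective(hist_prob, thresholds, L=256):
    """Multilevel Otsu between-class variance, Eq. (9), to be maximized (Eq. (8)).

    hist_prob : normalized histogram Ph_i of length L, Eq. (1)
    thresholds: candidate threshold values splitting [0, L-1] into classes
    """
    levels = np.arange(L)
    # Class index ranges: [0, th_1], [th_1+1, th_2], ..., [th_{n-1}+1, L-1]
    edges = [0] + [int(t) + 1 for t in sorted(thresholds)] + [L]
    mu_T = np.sum(levels * hist_prob)                    # global mean intensity
    sigma_B = 0.0
    for lo, hi in zip(edges[:-1], edges[1:]):
        omega = hist_prob[lo:hi].sum()                   # class probability, Eqs. (3)/(10)
        if omega > 0:                                    # empty classes contribute nothing
            mu = (levels[lo:hi] * hist_prob[lo:hi]).sum() / omega  # class mean, Eqs. (4)/(11)
            sigma_B += omega * (mu - mu_T) ** 2          # one term of Eq. (9)
    return sigma_B
```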

Kapur entropy

Another non-parametric method used to determine the best threshold value of an image was proposed by Kapur7. The approach determines the best th such that the overall entropy is maximized. For a bi-level scenario, the Kapur objective function can be written as

$$\begin{aligned} F_{kapur}(th)=h_1 +h_2, \end{aligned}$$
(12)

where the entropies \(h_1\) and \(h_2\) are computed as follows:

$$\begin{aligned} h_1=-\sum _{i=1}^{th} \dfrac{Ph_i}{\omega _0} \ln \left( \frac{Ph_i}{\omega _0}\right) \text { and } h_2=-\sum _{i=th+1}^{L} \dfrac{Ph_i}{\omega _1} \ln \left( \frac{Ph_i}{\omega _1}\right) . \end{aligned}$$
(13)

In (13), \(Ph_i\) is the probability distribution of the intensity levels, computed by (1); \(\omega _0(th)\) and \(\omega _1(th)\) are the probability distributions of classes \(C_1\) and \(C_2\), respectively; and \(\ln (\cdot )\) is the natural logarithm. Like the Otsu method, the entropy-based method can be extended to multiple th values. In this case, it is necessary to separate the image into n groups using the corresponding number of thresholds. The new objective function is defined by the equation below:

$$\begin{aligned} F_{kapur}(TH) =\sum _{i=1}^n h_i, \end{aligned}$$
(14)

where \(TH=[th_1,th_2,\ldots ,th_{n-1}]\) is the vector of multiple thresholds. Each entropy is computed separately with its respective th value; thus, (14) expands to n entropies, the last of which is

$$\begin{aligned} h_n=-\sum _{i=th_{n}+1}^L \dfrac{Ph_i}{\omega _{n-1}} \ln \left( \dfrac{Ph_i}{\omega _{n-1}}\right) . \end{aligned}$$
(15)

Therefore, the probabilities of occurrence \((\omega _0 , \omega _1,\ldots , \omega _{n-1})\) of the n classes can be determined using (10), together with the probability distribution \(Ph_i\) in (1).
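A companion sketch of the multilevel Kapur objective in Eqs. (14)–(15), under the same assumptions as the Otsu sketch above (normalized 256-bin histogram; illustrative names):

```python
import numpy as np

def kapur_objective(hist_prob, thresholds, L=256):
    """Total Kapur entropy over all classes, Eq. (14), to be maximized."""
    edges = [0] + [int(t) + 1 for t in sorted(thresholds)] + [L]
    total_entropy = 0.0
    for lo, hi in zip(edges[:-1], edges[1:]):
        p = hist_prob[lo:hi]
        omega = p.sum()                         # class probability of occurrence, Eq. (10)
        if omega > 0:
            q = p[p > 0] / omega                # skip zero bins to avoid log(0)
            total_entropy += -np.sum(q * np.log(q))   # class entropy, Eqs. (13)/(15)
    return total_entropy
```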

Heap-based optimizer (HBO)

The HBO mimics the job responsibilities and descriptions of the employees within a company29. Although job titles differ from one company and business to another, they are organized in a hierarchy, and many names are given to this structure, such as the corporate hierarchy, the organizational chart tree, or the corporate rank hierarchy (CRH). The collection of methods that outlines how particular activities are directed to realize the goals of an organization, and that defines how information flows among levels within the company56, is called an organizational structure. In this section, we explain the mathematical model of the heap-based optimizer.

Mathematical modeling of the interaction with immediate boss

In a centralized structure, the upper levels set the rules and laws for employees, and subordinates follow their immediate boss. Assuming that each immediate boss is the parent node of its subordinates, this behavior can be modeled by updating the position of each search agent \({{\vec {x}}_{i}}\) with reference to its parent node B using the equation below:

$$\begin{aligned} X_i^k(t + 1) = {B^k} + \gamma {\lambda ^k}|{B^k} - X_i^k(t)|\ \end{aligned}$$
(16)

where t is the current iteration and \(| \cdot |\) denotes the absolute value. \(\lambda ^k\) is the \(k^{th}\) component of vector \({\vec {\lambda }}\), which is generated randomly as follows:

$$\begin{aligned} \vec {\lambda } =2r-1 \end{aligned}$$
(17)

where r is a random number in the range \(\left[ 0,1 \right]\). In Eq. (16), the design parameter is \(\gamma\), which is computed by the following rule:

$$\begin{aligned} \gamma = \left| {2 - \frac{{\left( {t\bmod \frac{T}{C}} \right) }}{{\frac{T}{{4C}}}}} \right| \end{aligned}$$
(18)

Here, t is the current iteration, T is the maximum number of iterations, and C is a user-defined parameter. As the iterations proceed, \(\gamma\) decreases linearly from 2 to 0 and, upon reaching 0, increases back to 2, repeating this cycle over the iterations.

Modeling the interaction between colleagues mathematically

Employees holding the same position are considered colleagues, and each employee interacts with the others to achieve the goals of the organization. Assuming that the nodes at the same level of the heap are colleagues, each search agent \({{\vec {x}}_{i}}\) updates its position based on the position of a randomly selected colleague \({{\vec S}_r}\), as follows:

$$\begin{aligned} X_i^k(t + 1) = {\left\{ \begin{array}{ll} S_r^k + \gamma {\lambda ^k}|S_r^k - x_i^k(t)|, &{} f(\vec S_r) < f(\vec x_i(t)) \\ x_i^k + \gamma {\lambda ^k}|S_r^k - x_i^k(t)|, &{} f(\vec S_r) \ge f(\vec x_i(t)) \end{array}\right. } \end{aligned}$$
(19)

where f is the objective function that calculates the fitness of each search agent. Equation (19) lets a search agent explore around the colleague \(S_r^k\) if \(f({{\vec S}_r}) < f({{\vec x}_i}(t))\), and around its own position \(x_i^k\) otherwise.

Self-contribution of an employee

This stage models the concept of an employee’s self-contribution: the employee retains its prior position in the next iteration, as illustrated in the equation below:

$$\begin{aligned} x_i^k(t + 1) = x_i^k(t) \end{aligned}$$
(20)

In Eq. (20), the search agent \({{\vec x}_i}\) does not change the value of its kth design variable in the next iteration. This behavior regulates the rate of change of each search agent in the population.

Putting it all together

This phase combines the position-updating equations modeled in the previous subsections into a single rule. Three selection probabilities determine which equation is used to update the position of a search agent, thereby switching between the exploration and exploitation phases. These probabilities are divided into three proportions, \(p_1\), \(p_2\), and \(p_3\). A search agent updates its location using Eq. (20) according to the proportion \(p_1\), computed as follows:

$$\begin{aligned} {p_1} = 1 - \frac{t}{T} \end{aligned}$$
(21)

where t is the current iteration and T is the maximum number of iterations. The search agent updates its location using Eq. (16) when the selection falls within the proportion \(p_2\), computed as follows:

$$\begin{aligned} {p_2} = {p_1} + \frac{{1 - {p_1}}}{2} \end{aligned}$$
(22)

Finally, the search agent updates its location using Eq. (19) when the selection falls within the proportion \(p_3\), computed as follows:

$$\begin{aligned} {p_3} = {p_2} + \frac{{1 - {p_1}}}{2} = 1 \end{aligned}$$
(23)

A general position updating mechanism of HBO is computed as follows:

$$\begin{aligned} x_i^k(t + 1) = \left\{ {\begin{array}{ll} x_i^k(t),&{}\quad p \le {p_1}\\ {B^k} + \gamma {\lambda ^k}\left| {{B^k} - x_i^k(t)} \right| , &{}\quad p> {p_1}\,and\,p \le {p_2}\\ S_r^k + \gamma {\lambda ^k}\left| {S_r^k - x_i^k(t)} \right| , &{}\quad p> {p_2}\,and\,p \le {p_3}\,and\,f({{\vec S}_r}) < f({{\vec x}_i}(t))\\ x_i^k + \gamma {\lambda ^k}\left| {S_r^k - x_i^k(t)} \right| , &{}\quad p > {p_2}\,and\,p \le {p_3}\,and\,f({{\vec S}_r}) \ge f({{\vec x}_i}(t)) \end{array}} \right. \end{aligned}$$
(24)

where p is a random number in the range [0, 1]. Equation (20) supports the exploration phase, Eq. (16) improves the exploitation phase and convergence, and Eq. (19) allows a search agent to move from the exploration phase to the exploitation phase. Accordingly, \(p_1\) is high initially and decreases linearly over the iterations, which reduces exploration and strengthens exploitation as the search proceeds. After \(p_1\) is computed, the remainder of the probability span is split into two equal portions, making attraction toward a colleague and toward the boss equally probable.
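The combined rule of Eq. (24) can be summarized in a short NumPy sketch. This is an illustrative reading of Eqs. (16)–(24) assuming a minimization objective (as in the condition of Eq. (19)), not the authors’ code; all names are hypothetical.

```python
import numpy as np

def hbo_update(x_i, boss, colleague, f, t, T, C, rng):
    """One HBO position update for a single agent, following Eq. (24).

    x_i, boss, colleague : current agent, its heap parent, a random same-level node
    f                    : fitness function (smaller is better here, as in Eq. (19))
    """
    D = x_i.size
    p1 = 1 - t / T                        # Eq. (21): exploration weight, decays linearly
    p2 = p1 + (1 - p1) / 2                # Eq. (22); p3 = p2 + (1 - p1)/2 = 1, Eq. (23)
    gamma = abs(2 - (t % (T / C)) / (T / (4 * C)))   # Eq. (18): sweeps between 2 and 0
    lam = 2 * rng.random(D) - 1           # Eq. (17): components in [-1, 1]
    p = rng.random()
    if p <= p1:                           # keep previous position, Eq. (20)
        return x_i.copy()
    if p <= p2:                           # follow the immediate boss, Eq. (16)
        return boss + gamma * lam * np.abs(boss - x_i)
    if f(colleague) < f(x_i):             # better colleague: move toward it, Eq. (19)
        return colleague + gamma * lam * np.abs(colleague - x_i)
    return x_i + gamma * lam * np.abs(colleague - x_i)   # search around itself, Eq. (19)
```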

Steps of HBO

This section summarizes the HBO steps and clarifies details about their implementation-related calculations.

  • Parameters initialization and definition: First, all the search agents are randomly initialized in the potential solution space. The minimum and maximum boundaries of the search space are defined by the variables \((L_i,\ U_i)\), respectively. The population size is N and the maximum number of iterations is T. The specific parameter C can be calculated as \(C=\left\lfloor T/25 \right\rfloor\).

  • Population initialization: The random population P is generated from N search agents, each consisting of D dimensions. The population’s representation P is shown as follows:

    $$\begin{aligned} p = \left[ \begin{array}{c} \vec x_1^T\\ \vec x_2^T\\ \vdots \\ \vec x_N^T \end{array} \right] = \left[ \begin{array}{ccccc} x_1^1 &{} x_1^2 &{} x_1^3 &{} \cdots &{} x_1^D\\ x_2^1 &{} x_2^2 &{} x_2^3 &{} \cdots &{} x_2^D\\ \vdots &{} \vdots &{} \vdots &{} \ddots &{} \vdots \\ x_N^1 &{} x_N^2 &{} x_N^3 &{} \cdots &{} x_N^D \end{array} \right] \end{aligned}$$
  • Heap building: We utilize a 3-ary heap to implement the CRH. Although the heap is a tree-shaped data structure, it can be implemented using an array. The operations below are the \(d-ary\) heap operations required for the HBO implementation (see the index-arithmetic sketch after this list).

  1.

    parent (i): Assuming the heap is stored as an array, this function receives a node’s index and returns its parent’s index, calculated by the equation below:

    $$\begin{aligned} parent(i) = \left\lfloor {\frac{{i + 1}}{D}} \right\rfloor \end{aligned}$$
    (25)

    where \(\lfloor \rfloor\) indicates the floor function, which returns the largest integer less than or equal to its input.

  2.

    child (i, k): A node can have at most 3 children in a \(3-ary\) heap; that is, a manager may not manage more than 3 direct subordinates. This function returns the index of the kth child of node i, as formulated below:

    $$\begin{aligned} child(i,k) = D \times i - D + k + 1 \end{aligned}$$
    (26)

    For example, the index of the 3rd child of the 3rd node is calculated as:

    $$\begin{aligned} child(3,3) = 9 - 3 + 3 + 1 = 10 \end{aligned}$$
  3.

    depth (i): Assuming the root level has depth 0, the depth of any node i can be calculated in constant time through the formula below:

    $$\begin{aligned} depth(i) = \left\lceil {\log _D (D \times i - i + 1)} \right\rceil - 1 \end{aligned}$$
    (27)

    The ceil function is \(\lceil \rceil\), which returns the smallest integer greater than or equal to its input. For example, the depth of the node indexed 27 in the heap is calculated as: \(depth(27) = \left\lceil {\log _{3} (3 \times 27 - 27 + 1)} \right\rceil - 1 = \left\lceil {\log _{3} 55} \right\rceil - 1 = \left\lceil 3.6466 \right\rceil - 1 = 3\)

  4.

    colleague (i): The nodes at the same level as node i are considered its colleagues. This function returns the index of a randomly elected colleague of node i, generated as a random integer in the range \(\left[ \frac{D^{depth(i)}-1}{D-1} +1,\ \frac{D^{depth(i)+1}-1}{D-1} \right]\).

  5.

    Heapify_Up (i): searches upward in the heap and inserts node i at its correct place to preserve the heap property. Algorithm 1 shows the pseudo-code of this operation.


    Finally, the algorithm to build the heap is shown in Algorithm 2.

  6.

    Repeated application of the position-updating mechanism: the search agents’ positions are repeatedly updated according to the previously explained equations, converging toward the global optimum. The pseudo-code of HBO is shown in Algorithm 3.

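The index arithmetic behind these heap operations is compact enough to verify directly. The sketch below implements Eqs. (25)–(27) and the colleague index range for a 1-indexed array heap with branching factor D = 3; the names are illustrative, and the asserts check the worked examples above.

```python
import math

D = 3  # branching factor of the 3-ary heap used to model the CRH

def parent(i):
    """Parent index of node i, Eq. (25)."""
    return (i + 1) // D

def child(i, k):
    """Index of the k-th child of node i, Eq. (26)."""
    return D * i - D + k + 1

def depth(i):
    """Depth of node i, with the root at depth 0, Eq. (27)."""
    return math.ceil(math.log(D * i - i + 1, D)) - 1

def colleague_range(i):
    """Inclusive index range of the nodes at the same depth as node i."""
    d = depth(i)
    first = (D ** d - 1) // (D - 1) + 1
    last = (D ** (d + 1) - 1) // (D - 1)
    return first, last

# Quick checks against the worked examples in the text:
assert child(3, 3) == 10
assert depth(27) == 3
assert colleague_range(5) == (5, 13)   # all depth-2 nodes in a 3-ary heap
```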

Opposition-based learning (OBL)

Opposition-based learning (OBL) is a search strategy that helps candidate solutions avoid stagnation. It is inspired by the opposite relationship between entities57. The concept of opposition was first presented in 2005 and has attracted many research efforts over the last decade. Many metaheuristic algorithms use OBL to improve their performance, such as the harmony search algorithm58, grasshopper optimization59, ant colony optimization60, and the artificial bee colony61. OBL improves the exploitation phase of a search mechanism. In metaheuristic algorithms, convergence occurs quickly when the initial solutions are close to the optimal location; otherwise, late convergence is expected. OBL therefore produces new solutions by considering opposite regions of the search space, which may prove to be nearer to the best solution. OBL considers not only the candidate solutions obtained by a stochastic iteration scheme but also their ’opposite solutions’ located in opposite parts of the search space. OBL has been hybridized with many bio-inspired optimizers, giving shorter expected distances to the best solution than randomly sampled solution pairs62, including the cuckoo optimization algorithm63, the shuffled complex evolution algorithm64, particle swarm optimization65, harmony search66, the chaotic differential evolution algorithm67, and the shuffled frog algorithm68. In optimization problems, simultaneously examining a candidate and its opposite solution accelerates convergence toward the global best solution. Following previous related works, we utilize OBL only in the initialization phase to improve convergence and prevent the HBO from becoming stuck in local optima; the resulting IHBO is then used to solve the multilevel thresholding problem for image segmentation using the Kapur and Otsu objective functions.

The proposed IHBO algorithm

In this paper, the HBO algorithm is enhanced with OBL as a local search strategy, yielding IHBO, to avoid the drawbacks of a purely random population and improve the convergence rate of the algorithm by increasing the diversity of its solutions. IHBO uses the OBL strategy in the initialization phase to improve the search process as follows:

$$\begin{aligned} Q_i = LB_j+UB_j-X_i, \quad i\in \{1,2,\ldots ,n\} \end{aligned}$$
(28)

where \(Q_i\) is the opposite solution vector produced by OBL, and \(UB_j\) and \(LB_j\) are the upper and lower bounds of the \(j^{th}\) component of a vector X. The phases of the proposed image thresholding model are described in depth below.
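A two-line sketch of Eq. (28); reflecting each candidate across the center of the bounds is all the OBL initialization requires (illustrative code, not the authors’ implementation):

```python
import numpy as np

def opposite_population(X, lb, ub):
    """Opposition-based learning, Eq. (28): reflect solutions across [lb, ub]."""
    return lb + ub - X

# Illustrative usage for thresholding: 50 agents, 4 thresholds each in [0, 255]
rng = np.random.default_rng(0)
X = rng.uniform(0.0, 255.0, size=(50, 4))
Q = opposite_population(X, 0.0, 255.0)   # evaluate both X and Q, keep the fitter
```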

Initialization phase

In this phase, the algorithm starts by reading the image, converting it to grayscale, computing the histogram of the selected benchmark images, and then computing the probability distribution by (1). The algorithm initializes the IHBO parameters, namely the population size (N), the maximum number of iterations (T), the boundaries of the search space (\(L_{I}\), \(U_{I}\)), and the number of iterations per cycle (t). Thereafter, the OBL strategy is utilized to calculate the opposite solution vector \(Q_i\) by (28).

Updating phase

This phase provides the best threshold values by evaluating the fitness of the \(X_i\) and \(Q_i\) populations. To update the search agents’ positions (X), we use the fitness value of the optimal threshold from the Otsu method \(F_{Otsu}\) (8) or the Kapur method \(F_{kapur}\) (14) as the objective function, compare the fitness values of \(X_i\) and \(Q_i\), and save the global best solution with the highest fitness. The position of each agent is defined based on its fitness value. In addition, we determine the three selection probabilities \(P_{1}\), \(P_{2}\), and \(P_{3}\) using (21), (22), and (23), respectively, and then, based on these probabilities, calculate the position of each agent within the heap using (24). The agent’s position (X) is updated using the important \(D-ary\) heap operations, such as Heapify_Up(i), which searches for the superior node in the heap and inserts the node at its correct position to preserve the heap characteristics, as demonstrated in Algorithm 1. Each agent then repeatedly updates its location according to the best fitness value, seeking the global optimum, as depicted in Algorithm 3. The optimization scenario for implementing the proposed IHBO algorithm is illustrated in Figure 1.

Segmentation phase

In this phase, we generate the segmented image using the optimal threshold values, after setting \(x_{heap}[1].value\) as the threshold value of the image. The pseudo-code of the proposed IHBO algorithm is illustrated in Algorithm 4.
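As an illustration of this phase, the hypothetical helper below applies a sorted threshold vector to a grayscale image and replaces each class with its mean gray level, which is one common way to render the segmented result (a sketch, not the paper’s implementation):

```python
import numpy as np

def apply_thresholds(gray_img, thresholds):
    """Render a multilevel-thresholded image by mapping each pixel
    to the mean gray level of its class (illustrative visualization)."""
    th = np.sort(np.asarray(thresholds))
    labels = np.digitize(gray_img, th)        # class index of every pixel
    out = np.zeros_like(gray_img)
    for c in range(th.size + 1):
        mask = labels == c
        if mask.any():
            out[mask] = int(gray_img[mask].mean())   # class mean as display value
    return out
```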

Figure 1: Flowchart of the proposed algorithm.

Computational complexity of the IHBO

This section discusses the computational complexity of the IHBO algorithm. The complexity of the population initialization is \({\mathcal {O}}(N \times D)\), where D and N indicate the dimension of the problem and the size of the population, respectively. Additionally, evaluating the fitness of every search agent over all iterations costs \({\mathcal {O}}\left( N \times D \times T_{\max } \right)\), where \(T_{\max }\) is the maximum number of iterations, and the IHBO requires \({\mathcal {O}}(t)\) time for executing its t main operations. Therefore, the time complexity of the proposed IHBO is \({\mathcal {O}}\left( N \times D \times t \times T_{\max }\right)\). The space complexity, the total amount of memory occupied by the algorithm, is \({\mathcal {O}}(N \times D)\).

Performance evaluation of the proposed IHBO algorithm

Parameter settings

This section describes the evaluation of the proposed IHBO algorithm. Adjusting parameters certainly affects the performance of an algorithm. However, following the suggestion of Arcuri et al.69, when comparing algorithm performance, all algorithm parameters should be kept at the default values from their original papers to ensure they are in a relatively optimal state; this also reduces the risk of parametrization bias. Therefore, in this work, all algorithm parameters are kept at their default values.

Thus, the performance of the proposed IHBO algorithm is evaluated on the IEEE Congress on Evolutionary Computation (CEC’2020)70 test problems. The CEC’2020 suite contains 10 test functions, referred to as \(f_1\)–\(f_{10}\): function 1 is unimodal, functions 2–4 are multimodal, functions 5–7 are hybrid, and functions 8–10 are composition functions. Table 1 lists the parameter settings and mathematical formulations of the CEC’2020 benchmark functions; “Fi*” refers to the global best value. Figure 2 shows a 2D visualization of the CEC’2020 benchmark functions to illustrate the differences and the nature of each problem.

Table 1 Parameter settings of CEC’2020 benchmark test.
Figure 2: CEC’2020 benchmark functions in 2D view.

Statistical results analysis of CEC’2020 benchmark

This section illustrates how the CEC’2020 benchmark tests are utilized to estimate the performance of the proposed IHBO using qualitative and quantitative metrics. The standard deviation (STD) and mean of the optimal solutions acquired by the proposed algorithm and all other algorithms in the comparison are calculated. Furthermore, the qualitative metrics, consisting of the average fitness history, convergence curve, and search history, are used to evaluate the performance of the IHBO on the CEC’2020 test suite against seven well-known metaheuristic algorithms: the original HBO algorithm, SSA, MFO, GWO, SCA, HS, and EMO. Table 2 shows the STD and mean of the optimal values obtained by the proposed algorithm and the other competing algorithms for each CEC’2020 benchmark function with 20 dimensions; the best results are the minimum STD and mean values. The mean and STD results demonstrate the superiority of the proposed algorithm over the other competing algorithms in solving seven of the CEC’2020 benchmark functions.

Table 2 Mean and STD values of the optimal fitness obtained with competing algorithms on the CEC’2020 functions with \(Dim=20\).

Boxplot analysis

Boxplot analysis is a graphical technique used to display the distribution characteristics of data; it is designed for data that follow a normal distribution and have homogeneous variances. A boxplot describes a data distribution in terms of quartiles: the first quartile is the median of the first half of the data, the second quartile is the median, and the third quartile is the median of the second half of the data. The region between the first and third quartiles is called the interquartile range and gives an indication of the spread of the data. The ends of the rectangles mark the lower and upper quartiles, and a narrow boxplot indicates high agreement among the data. Figure 3 shows the boxplots of the proposed IHBO algorithm and the competing algorithms for the ten functions with Dim = 20. The results show that the proposed algorithm outperforms all the other competing algorithms on most of the test functions, although its performance is limited on F2 and F7.

Convergence curves analysis

This subsection examines the convergence plots of the proposed algorithm and the other competitor algorithms. Figure 4 illustrates the convergence plots of IHBO, HBO, SSA, GWO, MFO, HS, and SCA for the CEC’2020 benchmark functions. The proposed algorithm achieved optimal solutions and reached a stable point for most functions; thus, IHBO can address problems that require fast computation, such as online optimization problems. The proposed algorithm exhibited stable behavior, and its solutions converged easily on most of the problems it was tested on.

Qualitative metrics analysis

Although the earlier analyses confirm the high performance of the proposed IHBO algorithm, further experiments and analyses help draw clearer conclusions about its behavior in real problem solving. Figure 5 illustrates the qualitative analysis of the proposed IHBO algorithm. The first column shows a set of the CEC’2020 benchmark functions plotted in two-dimensional space. The second column shows the search history of the search agents from the first to the last iteration, displaying their exploitation behavior. The third column shows the average fitness history over 350 iterations, explaining the general behavior of the agents and the role they play in the search for the best solution; all the history curves decrease, which means that the population improves at each iteration. The fourth column shows the convergence curve: the decreasing optimization history reveals that the solutions are progressively refined over the iterations until the best solution is reached.

Figure 3: Boxplots obtained by all algorithms over the CEC’2020 benchmark functions with \(Dim=20\).

Figure 4: Convergence curves of the proposed algorithm and the competing algorithms over the CEC’2020 benchmark functions with \(Dim=20\).

Figure 5: Qualitative metrics on F2, F4, F8, F9, and F10, with 2D views of the functions.

Environmental and experimental requirements

This section presents the test images used for the experiments, then describes the empirical setup, and analyzes the results.

Benchmark images

To evaluate the proposed method, ten common benchmark images were used. These images were selected for their various levels of complexity: Baboon, Lena, Butterfly, Pirate, Cameraman, Peppers, Bridge, Living Room, Barbara, and Jetplane71,72. Most images had the same dimensions (512 \(\times\) 512 pixels); however, two images (Cameraman and Lena) were 256 \(\times\) 256 pixels. Table 3 displays the set of test images used.

Table 3 Set of test images.

Environmental setup

In this study, the proposed IHBO is compared with seven well-known metaheuristic algorithms, including the original HBO, SSA, MFO, GWO, SCA, HS, and EMO. All competitor algorithms were implemented and executed in MATLAB 2015 on a PC with 6 GB RAM running a Windows 8.1 64-bit environment with an Intel Core i5 processor. Each algorithm was executed 30 times per test image, the number of iterations was set to 350, and the population size was 50. The parameter settings of each algorithm were determined according to standard criteria and previous literature (default values). The numbers of thresholds used were \(th_2,th_3,th_4,\) and \(th_5\), following the related literature73. The parameter settings of the IHBO and their values are presented in Table 4.

Table 4 The parameters of IHBO and their values.

Evaluation metrics

Two types of metrics were utilized to estimate the performance of the IHBO algorithm: the first evaluates the quality of the segmented image, while the second compares its edges. These metrics are important for evaluating the performance of the IHBO approach with the Otsu and Kapur methods as objective functions. Statistical measures, such as the standard deviation (STD), the Wilcoxon rank test, and the average, were used to analyze the fitness of the proposed algorithm. We used the SSIM48, FSIM74, and PSNR75 to measure the quality and stability of the segmentation.

Structural similarity index (SSIM)

The SSIM48 index is a metric used to analyze the internal structures of a segmented image. A higher SSIM value indicates better segmentation of the original image, because the structures in the two images match. The equation below describes the SSIM:

$$\begin{aligned} SSIM(I,Seg)= \dfrac{(2\mu _I \mu _{Seg} + c_1)( 2\sigma _{I,Seg} +c_2 )}{(\mu _I^2 + \mu _{Seg}^2 + c_1)(\sigma _I^2 + \sigma _{Seg}^2 +c_2)} \end{aligned}$$
(29)

The means of the intensities of the original image I and the segmented image Seg are \(\mu _I\) and \(\mu _{Seg}\), respectively; \(\sigma _I\) and \(\sigma _{Seg}\) are the standard deviations of I and Seg; \(\sigma _{I,Seg}\) is the covariance of I and Seg; and \(c_1\) and \(c_2\) are two constants.
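In practice the SSIM need not be re-implemented; assuming scikit-image is available, its structural_similarity function computes the index of Eq. (29) directly (the toy images below are stand-ins, not the paper’s benchmark data):

```python
import numpy as np
from skimage.metrics import structural_similarity

rng = np.random.default_rng(0)
original = rng.integers(0, 256, size=(64, 64), dtype=np.uint8)  # stand-in image
segmented = (original // 64) * 64        # crude 4-level quantization of it
score = structural_similarity(original, segmented, data_range=255)
print(f"SSIM = {score:.4f}")
```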

Feature similarity index (FSIM)

The FSIM74 index is a metric used to compute the similarity between the segmented image and the original image based on their internal features. A higher FSIM value indicates better segmentation by the thresholding method. The FSIM is computed as follows:

$$\begin{aligned} FSIM= \frac{ \sum _{v\epsilon \Omega }S_L(v)PC_{m}(v)}{\sum _{v\epsilon \Omega }PC_{m}(v)} \end{aligned}$$
(30)

where \(\Omega\) denotes the entire domain of the image, and

$$\begin{aligned} S_L(v)= & {} S_{PC}(v)S_G(v) \end{aligned}$$
(31)
$$\begin{aligned} S_{PC}(v)= & {} \frac{2PC_1(v)PC_2(v)+T_1}{PC_1^2(v)+PC_2^2(v)+T_1} \end{aligned}$$
(32)
$$\begin{aligned} S_G(v)= & {} \frac{2G_1(v)G_2(v)+T_2}{G_1^2(v)+G_2^2(v)+T_2}, \end{aligned}$$
(33)

and G is the image’s gradient magnitude and can be computed as follows:

$$\begin{aligned} G= & {} \sqrt{G_x^2+G_y^2} \end{aligned}$$
(34)
$$\begin{aligned} PC(v)= & {} \frac{E(v)}{\left( \epsilon + \sum _{n}A_n(v) \right) } \end{aligned}$$
(35)

Here, E(v) is the magnitude of the response vector at position v on scale n, and \(A_n(v)\) is the local amplitude on scale n. \(\epsilon\) is a small positive number, and \(PC_m(v) = max(PC_1(v),PC_2(v))\).

Peak signal-to-noise ratio (PSNR)

The PSNR75 is another metric used to evaluate the quality of segmentation by determining the difference between the quality of the original image and that of the segmented image. The PSNR is used to compare the original and segmented image using the root mean square error (RMSE) of each pixel, as expressed in (37). The PSNR can be defined as follows:

$$\begin{aligned} PSNR=20\log _{10} \dfrac{255}{RMSE}, \end{aligned}$$
(36)

where

$$\begin{aligned} RMSE = \sqrt{\dfrac{\sum _{i=1}^M \sum _{j=1}^N ((I(i,j)-Seg(i,j))^2)}{M \times N}}. \end{aligned}$$
(37)

In (37), I and Seg are the original and segmented images, respectively, of size \(M \times N\). A higher PSNR value indicates higher similarity between the segmented and original images, which reflects a more effective segmentation process.
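Because Eqs. (36)–(37) are pure arithmetic, a few lines of NumPy suffice to compute the PSNR between two 8-bit images (an illustrative sketch):

```python
import numpy as np

def psnr(original, segmented):
    """PSNR in dB between two 8-bit images, Eqs. (36)-(37)."""
    diff = original.astype(np.float64) - segmented.astype(np.float64)
    rmse = np.sqrt(np.mean(diff ** 2))    # Eq. (37)
    if rmse == 0:
        return float("inf")               # identical images
    return 20 * np.log10(255.0 / rmse)    # Eq. (36)
```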

Experimental results and discussion

The experimental results are discussed in this section to evaluate the efficiency of the proposed algorithm.

Otsu results analysis

This subsection analyzes the outcomes of the IHBO based on the between-class variance proposed by Otsu as the fitness function. Table 7 illustrates the best threshold values obtained by applying the IHBO with the Otsu criterion as the objective function (8). Tables 5 and 6 present a graphical analysis of the thresholds, illustrating the resulting images of the IHBO with different numbers of thresholds. Table 8 shows the computational times of the compared algorithms obtained with Otsu’s method. The IHBO proved its superiority in computational time compared to the other competitive algorithms, taking first place in 23 of 40 experiments. GWO came in second place with 10 experiments, while HBO came in third place with nine experiments, followed by EMO with two experiments. Finally, MFO came in fifth place with only one experiment, and the remaining algorithms did not obtain the best computational time in any of the experiments. Table 9 illustrates the Otsu STD and average fitness results for the benchmark images. The IHBO demonstrated superiority in MTH by obtaining the optimal fitness values in 23 of 40 experiments. The HBO algorithm obtained the best fitness value in eight experiments, while the SCA obtained the optimal fitness value in five experiments; SSA came in fourth place with four experiments, followed by MFO with three experiments. Finally, HS obtained the optimal fitness value in only one experiment, and the remaining algorithms did not obtain the optimal fitness value in any of the experiments. Table 9 also lists the STD values calculated from the independent runs for each tested image with various thresholds; a lower STD value indicates that the algorithm is more stable.

Table 10 presents the STD and mean PSNR for the benchmark images using the eight MAs. The IHBO was in first place in terms of the mean PSNR in 22 experiments. The SSA was in second place with seven experiments, while HBO was in third place, superior in only six experiments. In fourth place was SCA with the best PSNR in five experiments, followed by MFO and HS with three experiments each. Finally, the worst results were obtained by EMO, which did not obtain the optimal PSNR values in any of the experiments. With respect to the STD, the IHBO was not the best alternative for lower dimensions (2 or 3 thresholds), because its STD values were higher, indicating greater instability. However, MFO was the most unstable algorithm in terms of the PSNR. For the remaining approaches, the STD values followed the same tendency: lower for small dimensions and higher for four thresholds. The SSA was the most stable algorithm, with the SCA in second place, HBO in third place, HS in fourth place, and the IHBO in fifth place. Furthermore, GWO was in sixth place, and EMO was in seventh place.

Table 11 illustrates the STD and mean of the FSIM obtained from the 40 experiments. The results indicate that the IHBO obtained the highest FSIM in 22 experiments and was in first place, while the HBO was in second place with ten experiments. The SCA was in third place with eight experiments, and SSA was in fourth place with two experiments, followed by HS and EMO in fifth place with one experiment each. Finally, GWO and MFO came in last, with no experiments. In terms of the STD, the SCA was the best approach, because its values were the lowest in most experiments. The SSA came in second place, followed by EMO in third place and the IHBO in fourth place. GWO was in fifth place, followed by HBO. Finally, the least stable approaches were MFO and HS, due to their high STD values in most cases.

Table 12 presents the STD and mean SSIM obtained in the 40 experiments. The IHBO came in first place in terms of the mean SSIM with the best values in 22 experiments, while the SCA, HBO, and SSA shared second place with six experiments each. EMO came in third place with two experiments, followed by HS and MFO in fourth place with one experiment each. Finally, GWO came in last place. Because it provided the largest number of minimum STD values of all the algorithms, SCA was the most stable method. In second place was the HBO, followed by EMO in third place. The IHBO was in fourth place, while GWO was in fifth. Finally, MFO, HS, and SSA obtained no minimum STD values in the experiments.

Table 7 illustrates the thresholds that were applied to the selected benchmark images. Tables 5 and 6 show the histograms with the respective threshold values and the segmented images obtained using 2, 3, 4, and 5 thresholds. These results indicate that, for some images, the contrast quality improved as the number of thresholds increased, particularly for Butterfly, Living Room, Jetplane, Lena, Pirate, Cameraman, Lake, and Bridge, which present more information with the largest number of thresholds than with only two. The most difficult histograms to segment were those of Tests 6, 9, and 10, corresponding to Bridge, Butterfly, and Barbara, respectively. The difficulty arises from the distribution of pixels in these images, which can produce several classes or even make it impossible to select the optimal thresholds.

Table 13 presents the p-values resulting from the Wilcoxon test for fitness using the Otsu objective function. It shows the differences between the proposed algorithm and the compared algorithms (HBO, SSA, MFO, GWO, SCA, HS, and EMO).

A difference between the SCA and MFO in comparison to the IHBO can be observed, which indicates that the proposed algorithm achieves a significant improvement. For the number of thresholds (nTh) = 5, the differences between the IHBO and most of the competing algorithms are clear when the comparison is performed over 30 runs per experiment. In the results, NaN indicates that the datasets being compared are identical; that is, the algorithms obtained the same solution, so the Wilcoxon test reveals no differences between the methods.

Table 5 Implementation results of IHBO-Otsu over the set of benchmark images.
Table 6 Implementation results of IHBO-Otsu over the set of benchmark images.
Table 7 The best thresholds values obtained by Otsu’s method.
Table 8 The computational time values of comparison algorithms obtained by Otsu’s method.
Table 9 Mean and STD values of the optimal fitness obtained by Otsu’s method.
Table 10 Mean and STD values of PSNR results obtained by Otsu’s method.
Table 11 Mean and STD values of FSIM results obtained by Otsu’s method.
Table 12 Mean and STD values of SSIM results obtained by Otsu’s method.
Table 13 Comparison of the p-values obtained through the Wilcoxon signed-rank test between the pairs of IHBO vs HBO, IHBO vs SSA, IHBO vs MFO, IHBO vs GWO, IHBO vs SCA, IHBO vs HS, and IHBO vs EMO for fitness results using Otsu’s method.

Kapur results analysis

The best results obtained by the IHBO using Kapur entropy (14) as the fitness function are illustrated in Table 16. Tables 14 and 15 present the histogram distributions of the benchmark images and the segmented images with different numbers of thresholds produced by the IHBO. The results illustrate that the proposed algorithm with the Kapur entropy method outperformed the other algorithms in terms of mean fitness (Table 18), PSNR (Table 19), FSIM (Table 20), and SSIM (Table 21).

The computational times of the compared algorithms obtained with Kapur’s method are presented in Table 17. The IHBO came in first place with 24 of 40 experiments, proving its superiority in computational time compared to the other competitive algorithms. HBO came in second place with 13 experiments, while GWO came in third place with ten experiments, followed by SSA with four experiments. Finally, the SCA came in fifth place with only one experiment, and the remaining algorithms did not obtain the best computational time in any of the experiments.

Table 18 presents the STD and average fitness results of the Kapur method on the benchmark images. The IHBO was in first place, obtaining the optimal fitness values in 24 of 40 experiments. The SCA was in second place with seven experiments, while the HBO was in third place with five experiments. SSA was in fourth place with three experiments, and HS was in fifth place with two experiments, followed by GWO in sixth place with one experiment. Finally, EMO and MFO did not produce optimal fitness values. Table 18 also presents the STD values, which demonstrate the stability of each algorithm across the repeated runs.

Table 19 illustrates the STD and mean PSNR. The IHBO came in first place in 23 experiments with optimal PSNR values, while the SCA came in second place with ten experiments. HBO came in third place with four experiments, while HS and GWO came in fourth place with two experiments each. Finally, SSA, MFO, and EMO came in last place with no experiments. According to the STD values, EMO came in first place with the maximum number of minimum STD cases, followed by the IHBO in second place, HS in third place, MFO and the HBO in fourth place, SSA in fifth place, and SCA in sixth place. GWO had no optimal STD values.

Table 20 provides the STD and mean FSIM. It can be observed that the IHBO was in first place with 17 experiments, while SCA was in second place with nine experiments. HS came in third place with six experiments, while MFO, HBO, and GWO came in fourth place with three experiments each, followed by SSA in fifth place with one experiment. Finally, EMO came in last place with no experiments. In terms of the STD, MFO came in first place with the maximum number of minimum STD cases, followed by SSA in second place, EMO and the IHBO in third place, HS in fourth place, HBO in fifth place, and SCA in sixth place. GWO had no optimal STD values.

The mean and STD of the SSIM are presented in Table 21. The results indicate that the IHBO was in first place in 19 experiments in terms of the SSIM, followed by SCA in second place with seven experiments. GWO and MFO were in third place with six experiments each, while HBO and HS were in fourth place with three experiments, followed by SSA in fifth place with only one experiment. Lastly, EMO had no optimal experiments in terms of the SSIM. According to the STD values, MFO came in first place with the maximum number of minimum STD cases. GWO, EMO, and SCA were in second place, while the IHBO was in third place. SSA and HBO were in fourth place, while HS was in fifth place.

Finally, Table 22 presents the p-values resulting from the Wilcoxon test for fitness using the Kapur fitness function. This table presents the difference between the proposed algorithm and the compared algorithms (HBO, SSA, MFO, GWO, SCA, HS, and EMO). The results in Table 22 indicate that the IHBO was different from the SCA and EMO but similar to the remaining algorithms. The exceptions occurred for nTH = 5, where in some cases the values exhibited differences as well as similarities (NaN values).

Table 14 Implementation results of IHBO-Kapur over the set of test images.
Table 15 Implementation results of IHBO-Kapur over the set of test images.
Table 16 The best thresholds values obtained by Kapur’s method.
Table 17 The computational time values of comparison algorithms obtained by Kapur’s method.
Table 18 Mean and STD values of the optimal fitness obtained by Kapur’s method.
Table 19 Mean and STD values of PSNR results obtained by Kapur’s method.
Table 20 Mean and STD values of FSIM results obtained by Kapur’s method.
Table 21 Mean and STD values of SSIM results obtained by Kapur’s method.
Table 22 Comparison of the p-values obtained through the Wilcoxon signed-rank test between the pairs of IHBO vs HBO, IHBO vs SSA, IHBO vs MFO, IHBO vs GWO, IHBO vs SCA, IHBO vs HS, and IHBO vs EMO for fitness results using Kapur’s method.

Human participants or animals

This article does not contain any studies with human participants or animals performed by any of the authors.

Conclusions and future works

Image segmentation is a pivotal phase of image analysis and understanding. To handle this growing challenge, different MTH methods, including feature-based, threshold-based, and region-based segmentation, have been implemented; threshold-based segmentation is the most common technique. This paper presented an improved variant of the heap-based optimizer (HBO) called IHBO. The effectiveness of the proposed IHBO was first estimated on the CEC’2020 benchmark functions, where the proposed algorithm showed superiority over the competing algorithms in various statistical metrics. In addition, IHBO was applied to image segmentation using the Otsu and Kapur methods as objective functions; its main target is to determine the best thresholds that maximize these objectives. The IHBO was implemented on a set of test images with different characteristics, and the results were compared against seven well-known metaheuristic algorithms, including the original HBO algorithm, SSA, MFO, GWO, SCA, HS, and EMO. The experimental results revealed that the IHBO algorithm outperformed all counterparts in terms of the FSIM, SSIM, and PSNR. It should be noted that the IHBO results using the Otsu method provided better between-class variance in most metrics, and when applying the Kapur method, the IHBO produced SSIM, FSIM, PSNR, and fitness values that were better than those of all counterparts. The IHBO produced promising results because it preserves an effective balance between exploration and exploitation and can avoid being trapped in local optima.

For future work, there are many research directions in this field, such as studying the performance of the IHBO algorithm on different datasets, and other real-world complex problems. In addition, future work can study the hybridization of the original HBO with other metaheuristic or machine learning algorithms to automate the search process for the optimal number of thresholds in a specific image.