Keywords

1 Introduction

Image segmentation plays a vital role in image processing and computer vision such as pattern recognition, image analysis and medical diagnosis. However, due to the complete noise, intensity inhomogeneity and lower gray contrast, these features cause it is extremely challenging in many fields. Particularly, the characteristics of intensity inhomogeneity is a huge problem in the segmentation fields. The idea of image segmentation is to split an image into some sub-regions that do not overlap each other. In particular, for each pixel x in the image domain Ω. We assume that the domain of the image contains N non-overlapping Sub-regions, which means that: \( \Omega = \cup \) \( \Omega _{i} \left( {i = 1,2,3, \ldots \ldots N} \right) \), \( \Omega _{i} \cap\Omega _{j} =\Phi \), \( \forall i \ne j \). Many of segmentation methods have been proposed, such as fuzzy clustering methods [1,2,3], deep learning based methods [4,5,6], and super pixel based methods [7, 8] and active contour methods (ACM) [9,10,11,12,13,14,15]. Fuzzy clustering methods work well on noise free images, however, these methods cannot accurately segment images corrupted by severe noise, low contrast and intensity inhomogeneity. Recently, deep learning based methods are currently in a stage of rapid development and have been approved in segmentation fields. However, this type of method requires a large number of training sample sets, which is not suitable for a small number of intensity inhomogeneous images, especially medical images. Super pixel segmentation methods generally require preprocessing and are prone to over-segmentation. ACM has been widely used to segmenting images, it is several desirable advantages for segmentation images with intensity inhomogeneity over the above mentioned segmentation methods.

In ACMs, they are divided into two major categories: one is edge-based model and the other is region-based model. Edge-based model uses the gradient of the image to get the segmentation result, which is effective to images with sharp contours. Recent years, region-based ACM and its variant models have been well addressed for image segmentation, which utilizes spatial statistical information to guide the curve evolution. The superiority is that it utilizes the local statistical information and the intensity of image itself in the local region-based models [16]. The Chan-Vese (CV) model, addressed by Chan and Vese in the Ref. [9], is one of the famous region-based model. Later, many region-based segmentation models are addressed to segment intensity inhomogeneous images. In related works, we will introduce some related models.

The rest of this paper is composed as follows: we review three typical region-based methods for image segmentation in Sect. 2. In Sect. 3, we detailed elaborate the proposed model with three subsections. Section 4 analyzes the comparison results with the representative segmentation models on artificial and real images. Finally, in Sect. 5, we end with conclusion and our future work.

2 Related Works

2.1 The CV Model

The CV model, a well-known region-based model, has been proposed by Chan and Vese in 2002. It is on the assumption that there is the same intensity value in two partitions, which refer the target to be segmented and background. Specifically, we use I(x) as the test image and \( \phi \) as its domain, the contour C is the curve, the energy functional is defined as follows:

$$ \begin{aligned} E^{CV} (\phi ) & = \mu \int\limits_{\Omega } {\delta (\phi )\left| {\nabla \phi } \right|} dxdy + v\int\limits_{\Omega } {H(\phi )} dxdy \\ & + \,\lambda_{1} \int\limits_{in(C)} {\left| {I(x,y) - c_{1} } \right|}^{2} dxdy + \lambda_{2} \int\limits_{out(C)} {\left| {I(x,y) - c_{2} } \right|}^{2} dxdy \\ \end{aligned} $$
(1)

Here, \( \upmu,\upnu,\uplambda_{1} ,\uplambda_{2} \) are non-negative constant weight parameters. I(x, y) represents the observed image, \( H\left( \phi \right) \) is a Heaviside function shown in Eq. (2), and \( \delta \left( \phi \right) \) (see Eq. (3)) is the derivative of \( H\left( \phi \right) \). in(C) and out(C) are the region inside and outside of the contour C, respectively.

$$ {\text{H}}\left( {\text{x}} \right) = \frac{1}{2}\left[ {1 + \frac{2}{\pi }{ \arctan }\left( {\frac{x}{\varepsilon }} \right)} \right] $$
(2)
$$ \updelta\left( {\text{x}} \right) = \frac{\partial H\left( x \right)}{\partial x} = \frac{\upvarepsilon}{{\pi \cdot \left( {\varepsilon^{2} + x^{2} } \right)}} $$
(3)

2.2 The Local Image Fitting Model

To accurately segment intensity inhomogeneous images, Zhang et al. constructed a model based on local information, which utilized a truncated Gaussian function as local constraint information. It improved the computational efficiency and reduced the chance of the curve evolution function falling into local optimum. The LIF energy functional is defined as follows:

$$ {\text{E}}^{LIF} \left( \phi \right) = \frac{1}{2}\int_{\phi } {\left| {I\left( x \right) - I^{LFI} \left( x \right)} \right|^{2} } dx,x \in\Omega $$
(4)

Where ILFI(x), named a local fitted image formulation, the mathematical formulation is defined as follows:

$$ {\text{I}}^{LFI} = m_{1} H_{\varepsilon } \left( \phi \right) + m_{2} (1 - H_{\varepsilon } \left( \phi \right)) $$
(5)

Here, m1 and m2 are respectively the average intensity of inside and outside the region. The definition see the Ref. [17].

2.3 The Local Statistical Active Contour Model (LSACM)

Though the LIF model was more computationally effective in segmenting intensity inhomogeneous images. When the image is disrupted by noise, lower resolution and more overlapping. This model will not perform well. Therefore, Zhang et.al have addressed a new model, named a locally statistical active contour model (LSACM) [18]. In this model, a clever mapping was created, which was mainly used to minimize overlapping. The energy function was defined:

$$ E_{\theta ,B}^{L} = \sum\nolimits_{i = 1}^{n} {\int_{\Omega } {\int_{{\Omega _{i} }} {K_{\rho } \left( {x,y} \right)} } } \left( {\log \left( {\sigma_{i} } \right) + \frac{{\left( {I\left( y \right) - B\left( x \right)c_{i} } \right)^{2} }}{{2\sigma_{i}^{2} }}} \right)dydx $$
(6)

In Eq. (6), \( \Omega = \sum\nolimits_{i = 1}^{n} {\Omega _{i} } \), with \( \Omega _{i} \cap\Omega _{j} = \emptyset, \) for all \( {\text{i}} \ne {\text{j}} \), \( K_{\rho } \left( {x,y} \right) \) is the indicator function of local region Ox. \( \rho \) controls the size of the local region. \( \uptheta_{i} = \left\{ {c_{i} ,\sigma_{i} } \right\},\theta = \left\{ {\theta_{i} ,i = 1, \ldots ,n} \right\} \).

Compared with CV model and LIF model, this LSACM segment well in intensity inhomogeneous images, especially medical images. However, the model has limitations which is the noise of the image are assumed to be Gaussian distribution. In addition, the model cannot consider the influence of spatial constraint information.

3 The Proposed Model

Base on the above analysis, we address a novel enhanced region-based model, in which edge properties can be well preserved and segmentation results have more accurate. The main idea is to construct a data-driven energy model, which is based on the enhanced range-based bilateral filter. Different from other models, the filter can preserve edge sharpness and reduce noise.

3.1 The Enhanced Region-Based Bilateral Filter

The basic bilateral filter has some merit over other filter technology. It is based on two aspects [19,20,21]: one is for considering the relationship among the pixels of the image; the other is for considering the image spatial information, which is vital to accurate segmentation.

In our model, we construct an efficient enhanced bilateral filter. For an image f(x), the mathematical expression description is defined as:

$$ h\left( x \right) = k_{r}^{ - 1} \int_{\Omega } {f\left( \xi \right)c\left( {\xi ,x} \right)s\left( {\xi ,x,\zeta } \right)d\xi } $$
(7)

With the normalization

$$ k\left( x \right) = \int_{\Omega } {c\left( {\xi ,x} \right)s\left( {\xi ,x,\zeta } \right)d\xi } $$
(8)

And the function of c is used to measure the geometric closeness in the local area; s is used for gauging the spatial information. The definitions are followed as:

$$ c\left( {\xi ,x} \right) = exp\left( {\frac{{\left| {\left| {\xi - x} \right|} \right|}}{{2\sigma_{1}^{2} }}^{2} } \right) $$
(9)
$$ s\left( {\xi ,x,\zeta } \right) = exp\left( {\frac{{\left| {\left| {f\left( \xi \right) - f\left( x \right) - \zeta \left( x \right)} \right|} \right|^{2} }}{{2\sigma_{2}^{2} }}} \right) $$
(10)

To boost the robustness of noise, we set a local region as an enhanced filter region. In Eq. (10), the function \( \zeta \left( x \right) \) controls the scale of the local region. The definition is followed as:

$$ \zeta \left( x \right) = \left\{ {\begin{array}{*{20}r} \hfill {\left| {f\left( x \right) - M\left( {\Omega _{y} } \right)} \right|,} & \hfill {\left| {x - y} \right| \le \rho } \\ \hfill { = 0,} & \hfill {otherwise} \\ \end{array} } \right. $$
(11)

Where \( \Omega _{y} \) means the neighborhood centered the image of pixel x; \( {\text{M}}( \bullet ) \) represents the mean intensity value within the neighborhood centered on the pixel x.

3.2 The Energy Model Based on Enhanced Region-Based Bilateral Filter

In this subsection, we propose and analyze the level set energy model in detail. In the proposed model, all notations is followed as:

  1. (1)

    \( I:\Omega \to {\mathcal{R}}^{d} \) represents the observed image I, \( \Omega \) means the domain of the image, d means the dimension of the image;

  2. (2)

    \( {\mathcal{C}} \) is the closed contour in the domain \( \Omega \) of I;

  3. (3)

    h(x) comes from the Eq. (7).

  4. (4)

    \( \uplambda_{i} \), (i = 1, 2) are non-negative constants.

  5. (5)

    \( f_{i} \), (i = 1, 2) are the average intensity value inside and outside of the closed \( {\mathcal{C}} \), respectively

Therefore, we construct the proposed energy function as follows:

$$ \upvarepsilon = \sum\nolimits_{i = 1}^{2} {\lambda_{i} } \int_{{\Omega _{i} }} {K\left( {x - y} \right)\left| {h\left( y \right)I\left( y \right) - f_{i} \left( x \right)} \right|^{2} dy} $$
(12)

Where \( K( \bullet ) \) is an indicator function, the value is 1 in the neighborhood of the pixel, otherwise is zero.

In the domain \( \Omega \), we get the energy function:

$$ E\left( {{\mathcal{C}},f_{1} ,f_{2} } \right) = \int_{\Omega }\upvarepsilon \left( {f_{1} ,f_{2} } \right) + \upsilon \left| {\mathcal{C}} \right| $$
(13)

Using the LSF \( \phi \) to replace the contour \( { \mathcal{C}} \), we redefine the equation as:

$$ E\left( {\phi ,f_{1} \left( {\text{x}} \right),f_{2} \left( {\text{x}} \right)} \right) = \sum\nolimits_{i = 1}^{2} {\uplambda_{i} } \int {\int {K_{\rho } \left| {h\left( y \right)I\left( y \right) - f_{i} } \right|^{2} } } M_{i} \left( \phi \right)_{ } dydx $$
(14)

Here, \( M_{1} \left( \phi \right) = {\text{H}}\left( \phi \right),M_{2} \left( \phi \right) = 1 - {\text{H}}\left( \phi \right) \). In order to keep the LSF’s stability during the curve evolution. In our model, we add a regularization term. Therefore, the final energy function can be rewritten as:

$$ \begin{array}{*{20}c} {{\mathcal{F}}\left( {\upphi,f_{1} ,f_{2} } \right) = \sum\nolimits_{i = 1}^{2} {\uplambda_{i} } \int {\int {K_{\rho } } } \left| {h\left( y \right)I\left( y \right) - f_{i} } \right|^{2} M_{i} \left( \phi \right)dydx} \\ { + \,\nu \int {\delta \left| {\nabla \phi } \right|dx + \frac{1}{2}\mu \int {\left| {\nabla \phi - 1} \right|^{2} dx}} } \\ \end{array} $$
(15)

Here \( \upmu\,{\text{and}}\,\upnu \) are nonnegative constants. Next, we minimize the Eq. (15) by using gradient flow method [22,23,24] described in Subsect. 3.3.

3.3 Energy Minimization

In Eq. (15), we first fix the implicit function \( \phi \), and minimize the energy functional about f1(x) and f2(x). Therefore, we get the following Euler-lagrange equations:

$$ \int {K\left( {x - y} \right)} M_{i} \left( \phi \right)\left( {h\left( y \right)I\left( y \right) - f_{i} \left( x \right)} \right)^{2} dy = 0, i = 1,2 $$
(16)

Accordingly, we get the expression about fi (i = 1, 2):

$$ f_{i} \left( x \right) = \frac{{K_{\rho } \left( x \right)*\left[ {M_{i} \left( \phi \right)h\left( x \right)I\left( x \right)} \right]}}{{K_{\rho } \left( x \right)*M_{i} \left( \phi \right)}} $$
(17)

In fact, the values of fi(x) (i = 1, 2) are mean value of the intensities within the neighborhood of the pixel x. And the size is related to the parameter \( \varvec{\rho} \).

Finally, we get the energy functional about \( \phi \), by solving the following the equation:

$$ \frac{ \partial \phi }{\partial t} = - \delta \left( \phi \right)\left( {\lambda_{1} e_{1} - \lambda_{2} e_{2} } \right) + \upsilon \delta \left( \phi \right)div\left( {\frac{\nabla \phi }{{\left| {\nabla \phi } \right|}}} \right) + \mu \left( {\nabla^{2} \phi - div\left( {\frac{\nabla \phi }{{\left| {\nabla \phi } \right|}}} \right)} \right) $$
(18)

And, e1 and e2 are followed as:

$$ {\text{e}}_{i} \left( x \right) = \int {K_{\rho } \left( {y - x} \right)} \left| {h\left( x \right)I\left( x \right) - f_{i} \left( y \right)} \right|^{2} dy, i = 1,2 $$
(19)

4 Experimental Results

4.1 Experimental on Artificial Images

In the first experiment, we verify the proposed model effectiveness. Figure 1 shows the segmentation results for artificial images and their level set function. The first image has no noise, the latter three images are corrupted by noise. It is obvious that our model can successfully extracts the object boundaries. Their level set functions can keep good stability.

Fig. 1.
figure 1

Segmentation results of our model for artificial image, its noisy image and its level set function.

In the second experiment, we also evaluated the effectiveness of the proposed model on an edge-blurred image and its noisy image. As shown in Fig. 2, the image boundaries are quite weak. It is clearly shown that our model accurately segments the object shapes.

Fig. 2.
figure 2

Segmentation results of our model for synthetic image, its noisy image and its level set function.

4.2 Experimental Results

In the third experiment, comparison results in segmenting the real images are executed between the proposed model and other classic models: the CV model [9], the LGD model [25] and the LSACM [18]. As shown in Fig. 3, the CV model has poor performance in segmenting the images. The reason lies in it is based on the global intensity information. In the LGD model, the local image intensities was construct by Gaussian distributions with different means and variances. If we give correct initialization information, then we get more accurate segmentation results as shown in the third image. The LSACM is also suitable for segmenting MRI medical images with intensity inhomogeneity as shown in the second image. By contrast, our model successfully extracts image boundaries for the images as shown in the last column of Fig. 3.

Fig. 3.
figure 3

Comparisons of our model with other classic segmentation models: (a) CV model, (b) LGD model, (c) LSACM, (d) our model.

To further verify the performance of the proposed model, in the fourth experiment, we test the proposed model for three images with severe intensity inhomogeneity, the results in Fig. 4 show that the RSF model can accurately segment two images. And our model successfully extracts all the boundaries of the images. For the LVC model [26], it cannot segments the first image, because the image has the feature of poor illumination. The main reason lies that the enhanced range-based bilateral filter can preserve edge sharpness and reduce noise.

Fig. 4.
figure 4

Comparisons of our model with other classic segmentation models: (a) the CV model, (b) the RSF model, (c) the LVC model, (d) our model.

We implemented the proposed model in MatlabR2013b. In the proposed model, the main computational time is closely related to the scale of the image. The complexity of the four models’ competitive algorithms is O(N), here N represents the number of iteration.

5 Quantity Analysis

In this section, we evaluate the efficiency of the proposed model using similarity measurement. In segmentation fields, Jaccard similarity (JI) coefficient is a well statistic used for measuring similarity between the ground truth and the real segmentation result. The JI can be computed by

$$ JI\left( {S_{1} ,S_{2} } \right) = \frac{{\left| {S_{1} \cap S_{2} } \right|}}{{\left| {S_{1} \cup S_{2} } \right|}} $$
(20)

In segmentation fields, S1 and S2 are the segmented object region and the ground truth, respectively.

Next, we use a synthetic image with different Gaussian noise to evaluate the efficiency of our proposed model as shown in Fig. 5, and the JI values are listed in Table 1. When the noise is small, the JI values are relatively similar. As image noise increases, the boundaries of the image become increasingly blurred. The improved bilateral filter can resist the noise and preserve the detail of the image, the CV model cannot resist the noise of the image. For the LCV and the RSF models, when the influence of noise is small, the method can effectively segment the image. However, when the noise is gradually increased, the method cannot resist the influence of noise, and thus cannot be correctly segmented. Obviously, comparison results shows that the proposed model is more efficient than other models shown in Fig. 5.

Fig. 5.
figure 5

Comparisons among the CV, the LVC, the RSF model and our model with different noise: (a) the CV model, (b) the LVC model, (c) the RSF model, (d) our model.

Table 1. Comparisons of Jaccard Index with different Gaussian noise.

6 Conclusions and Future Works

In this paper, we have proposed an enhanced region-based model for image segmentation. The proposed model efficiently utilizes a range-based adaptive bilateral filter to preserve the sharp edges and reduce the noise of the image. In addition, the model use a regularization term to ensure the regularity of the level set function and to avoid the process of the expensive re-initialization. Comparisons with the well-known models in the literature [9, 10, 18, 25] and [26], the proposed model is more efficient.

In future work, we mainly focus on three directions but not limited: (1) we will utilize intelligent algorithms and hardware resources to enhance the efficiency of the proposed model [27,28,29]. (2) We will absorb other related fields methods to improve the efficiency of the proposed model, such as image retrieval [30,31,32] and image super-resolution [33,34,35].