1 Introduction

Breast cancer is one of the most common diseases and the leading cause of death among women. Although it is hard to prevent the development of breast cancer, early detection of tumors can significantly improve clinical cure rates [39]. There exists several imaging techniques for breast exam, among which mammography (X-ray image) is the most common technique for radiologists to detect and diagnose breast cancer [16]. If the mammography images are noised during imaging or transmitting [5], they can be recovered by some denoising methods [7,8,9]. But more seriously, mammography images are usually limited by X-ray hardware systems and exhibit poor resolution and low contrast. Different from denoising, effective enhancement approaches are desired to solve the problems of poor resolution and low contrast. To this end, several enhancement algorithms have been proposed, such as the multiscale representation [14, 20, 23], histogram equalization (HE) [3, 4, 17, 21, 22], and unsharp masking (UM) [26, 27, 35, 36, 45].

1.1 Related works

Multiscale decomposition has been frequently employed in mammogram enhancement. The main idea is to perform a multiscale computation framework, such as discrete wavelet transform (DWT) [20], curvelet transform [14], or contourlet transform [23], on mammograms followed by the modification of the transform coefficients in each subband of the multiscale representation. Finally, performing the corresponding inverse transform on the modified coefficients to get the enhanced mammograms. Besides enhancement, multiscale decomposition is also widely used for medical images denoising. Manoj Diwakar et al. [6, 10, 11] studied this problem deeply and proposed a variety of multiscale decomposition based medical images denoising methods. However, it is argued that multiscale representation cannot efficiently show edges and textured region in images [24].

Histogram equalization, which is known for its simpleness and the ability of improving contrast, is a desirable technique for image enhancement. But conventional HE algorithms may cause over-enhancement and the intensity saturation effects. So many improved algorithms, such as the brightness-preservation HE algorithm [4, 22], histogram partitioning method [3], and texture enhanced HE approach [17] have been proposed. Although the brightness-preservation HE algorithm maintains a natural look and reduces saturation effect in the output image, it also generates annoying side effects depending on the variation in the gray-level distribution [17]. The histogram partitioning algorithm divides an image histogram into several subintervals and modifies each subinterval separately. However, pixels with the similar intensity may belong to different intervals, producing over-enhancement in some image regions [21]. The texture enhanced HE approach was proposed to avoid the occurrence of over-enhancement and the intensity saturation effects, but the degree of contrast enhancement is not significant.

Several algorithms have been developed for mammography image enhancement by using unsharp masking. The traditional UM method is sensitive to noise and may generate ringing artifacts. To rectify these limitations, various modified approaches have been suggested, such as the rational UM (RUM) [36], cubic UM (CUM) [35], adaptive UM (AUM) [26], HE based UM [45], and nonlinear UM (NUM) [27]. These algorithms attempt to enhance image details, but usually result in different degrees of over-enhancement. Besides, they are powerless in improving image contrast.

1.2 Contributions

Micro-calcifications and masses are the main abnormalities in mammography images, which are important basis for the diagnosis of breast cancer. Micro-calcifications are tiny calcium deposits in breast tissue. They appear in images as bright, 100-um dots or clusters of dots, which are difficult to detect. Breast masses are bright spots on the image. Most of the masses have blurred boundaries and low contrast, which are easy to be confused with the normal tissues of the breast. Because of the limitations of X-ray hardware systems, the hypothesis of this paper is that the visual quality of mammography images is low because of poor resolution and low contrast. In order to overcome the problems in the detection of micro-calcifications and masses, this paper proposes a bidirectional spiking cortical (Bi-SCM) method to improve contrast as well as enhance details of mammography images. The goal of this paper is to highlight the lesion area without losing image details and introducing background noise, so as to assist doctors in diagnosis and provide technical support for computer aided mammography detection system. The main contributions of this paper can be briefly summarized as follows.

  1. 1.

    A Bi-SCM framework is proposed from the perspective of neural information fusion to enhance the contrast of bright areas and dark areas adequately, as well as textural details.

  2. 2.

    Based on the proposed Bi-SCM framework, neural spiking model is utilized to enhance a mammography image and its inverse, and then a NSCT based fusion method is proposed to fuse the primarily enhanced results, ensuring both contrasts of bright areas and dark areas are adequately improved.

  3. 3.

    We made the first attempt to further enhance the textual details of the NSCT-fused mammography image with an adaptive unsharp masking method which consists of cubic filter and log-ratio operation.

  4. 4.

    Sufficient experiments on real mammography images are conducted to prove that the proposed method outperforms state-of-the-art methods on both enhancing contrast and details.

The novelties of the proposed method are as follows. First, Bi-SCM framework is proposed from the perspective of neural information fusion to enhance mammography images. By taking into account both intensity distribution and spatial structure information, Bi-SCM can enhance lesions while suppressing background noises. Secondly, an original fusion rule, named weighted region energy, is proposed to fuse the two SCM-enhanced results while avoiding information loss. By using the directive contrast as weighting factor, more details in mammograms can be preserved. The rest of this paper is organized as follows. Section 2 presents the framework and details of the proposed Bi-SCM method. Experimental results on real mammography images are reported and analyzed in Section 3, followed by Section 4 which gives a overall conclusion of the paper.

2 Methodology

2.1 Bidirectional spiking cortical model

In the late 1980s, Eckhorn et al. [13] studied the mechanism of cat visual cortex, and found that the binary images produced by the midbrain in an oscillating manner are distinctive representations of visual features. Based on this discovery, a a feedback neural network model entitled pulse coupled neural network (PCNN) [37], which consists of a feedback field, a modulation field and a pulse generation field was proposed. In our previous work [47], a modified version of pulse coupled neural network (PCNN) model entitled spiking cortical model (SCM), was proposed for invariant texture retrieval. The SCM is mathematically described by

$$ {U_{ij}}\left[ n \right] = f{U_{ij}}\left[ {n - 1} \right] + {S_{ij}}\sum\limits_{kl} {{W_{ijkl}}{Y_{kl}}\left[ {n - 1} \right] + {S_{ij}}}, $$
(1)
$$ {E_{ij}}\left[ n \right] = g{E_{ij}}\left[ {n - 1} \right] + h{Y_{ij}}\left[ {n - 1} \right], $$
(2)
$$ {Y_{ij}}\left[ n \right] = \left\{ {\begin{array}{*{20}{c}} {1,} \\ {0,} \end{array}} \right. \begin{array}{*{20}{c}} {{U_{ij}}\left[ n \right] > {E_{ij}}\left[ n \right]} \\ {{\text{otherwise}} } \end{array}, $$
(3)

where n denotes iteration times, and Nij denotes the neuron in the position \(\left ({i,j} \right )\). Neuron Nij has internal activity \({U_{ij}}\left [ n \right ]\) and dynamic threshold \({E_{ij}}\left [ n \right ]\), which retain the previous states by decay factors f and g, respectively. W is the synaptic weight, denoting influence of neighboring neurons, Sij denotes input stimulus, and Yij is the output of neuron Nij. Internal activity \({U_{ij}}\left [ n \right ]\) is compared with the dynamic threshold \({E_{ij}}\left [ {n} \right ]\) to judge whether neuron Nij fires (\({Y_{ij}}\left [ n \right ] = 1\)) or not (\({Y_{ij}}\left [ n \right ] = 0\) ). Subsequently, if neuron Nij fires, the dynamic threshold would increase by amplitude h suddenly, otherwise the dynamic threshold would decay by a factor g. In addition, \({Y_{kl}}\left [ {n - 1} \right ]\) denotes the output of neighboring neuron in the previous iteration.

Although SCM can enhance low contrast images, it processes contrast varying with intensities. When SCM is used to enhance images, bright areas would be processed more coarsely than dark areas [47]. It is therefore not suitable for mammogram enhancement, because both bright and dark areas provide important information in mammograms. In this paper, we proposes an improved the Bi-SCM model to overcome this problem. A graphical illustration of our approach is shown in Fig. 1. The original mammography image \(I\left ({i,j} \right )\) is firstly fed into a cubic filter to obtain the textual details \(z\left ({i,j} \right )\) and the corresponding smoothed version \(S\left ({i,j} \right )\). Then a bi-directional SCM model is proposed to enhance the smoothed image. The two SCM-enhanced results are further fused as a single out, which leads to anomalies being highlighted while the background textures and noises are suppressed. The final enhancement result is obtained by adding the detailed image to the fused image with adaptive gain control. The main motivation is to adequately enhance both of the bright areas and dark areas. For both original mammogram and the corresponding reverse mammogram, before a neuron pulses for the first time, it has no linking input, so its internal activity can be derived from (1) and described as

$$ \begin{array}{@{}rcl@{}} {U_{ij}}\left[ n \right]&=& f{U_{ij}}\left[ {n - 1} \right] + {S_{ij}}\\ &&{\text{= (}}{U_{ij}}\left[ 0 \right]{\text{ - }}\frac{{{S_{ij}}}}{{1 - f}}){f^{n}} + \frac{{{S_{ij}}}}{{1 - f}}. \end{array} $$
(4)

Its dynamic threshold Eij can be derived from (2) and expressed by

$$ {E_{ij}}\left[ n \right] = g{E_{ij}}\left[ {n - 1} \right] = {g^{n}}{E_{ij}}\left[ 0 \right]. $$
(5)

When \({U_{ij}}\left [ n \right ] = {E_{ij}}\left [ n \right ]\), the neuron Nij fires for the first time. Therefore, the first iteration times can be given by

$$ {n_{1ij}} = - \frac{1}{{\ln g}}\ln \frac{{{E_{ij}}\left[ 0 \right]}}{{{U_{ij}}\left[ {{n_{1ij}}} \right]}}. $$
(6)

Since time matrix is defined to record the first iteration times for all neurons, if the internal activity Uij is equal to its stimulus Sij, the time matrix of SCM can be approximated as

$$ {T_{ij}} = {n_{1ij}} = - \frac{1}{{\ln g}}\ln \frac{{{E_{ij}}\left[ 0 \right]}}{{{U_{ij}}\left[ {{n_{1ij}}} \right]}} \approx - \frac{1}{{\ln g}}\ln \frac{{{E_{ij}}\left[ 0 \right]}}{{{S_{ij}}}}. $$
(7)

and thus

$$ \ln S = T\ln g + \ln E\left[ 0 \right]. $$
(8)

On the other hand, Weber-Fechner law is described by [15]

$$ I = k\ln S + r, $$
(9)

where I is the human visual perception, S is the stimulus at the instant, k is a constant factor which is to be determined experimentally, and r is the constant of integration. Substituting (8) into (9), one achieve

$$ I = k(T\ln g + \ln E\left[ 0 \right]) + r = -k^{\prime}T + r^{\prime}, $$
(10)

where \({{k^{\prime }{\text { = }}k}/ {\ln g}}\), and \(r^{\prime } = k\ln E\left [ 0 \right ] + r\). Equation (10) describes the link between the subjective sense of intensity I and the time matrix T, where I can be obtained by computing the negative time matrix. Therefore, the negative time matrix of SCM conforms to Weber-Fechner law, and can be directly regarded as the enhanced result. The internal activity is initialized to 0, i.e., \({U_{ij}}\left [ 0 \right ] = 0\). Thus, equation (4) can be rewritten as

$$ U\left[ {{n_{1}}} \right] = - \frac{S}{{1 - f}}{f^{{n_{1}}}} + \frac{S}{{1 - f}} = \frac{{1 - {f^{{n_{1}}}}}}{{1 - f}}S. $$
(11)

Substituting (11) into (7), we achieve

$$ T = - \frac{1}{{\ln g}}\ln \left( {\frac{{1 - f}}{{1 - {f^{{n_{1}}}}}} \cdot \frac{{E\left[ 0 \right]}}{S}} \right). $$
(12)

In addition, according to the lateral inhibition principle of the human visual system (HVS), Laplacian of Gaussian (LoG) operator can be used for image edge sharpening, and the initial value of dynamic threshold can be obtained as [25]

$$ E\left[ 0 \right] = 1 + S \otimes {\text{LoG}}. $$
(13)

Therefore, as can be seen from (12) and (13), values of time matrix only depend on input stimulus and decay factors f and g, where 0 < f < g < 1. To insure changes in E and U occur slowly, the value of f is close to 0, and the value of g is just under 1.

Fig. 1
figure 1

Framework of the proposed Bi-SCM

To fuse the enhanced images of original mammogram and the corresponding reverse mammogram, we propose to use the multi-scale and multi-directional computation framework entitled NSCT, which consists of a nonsubsampled pyramid Laplacian filter and a nonsubsampled directional filter [2]. Comparing with wavelet transform and contourlet transform, NSCT shows better translational invariance and makes full use of the geometric regularity in images, so that it has better multidirectional selectivity and provides more detailed information. Thereby, NSCT is a desirable technique for this purpose. The NSCT based image fusion methods usually consist of three steps. Firstly, the source mammograms are transformed by NSCT to obtain low- and high-frequency components. Then two different fusion rules are used to fuse low- and high-frequency coefficients, respectively. Finally, the fused image is constructed by the inverse NSCT with all fused subimages. More detailed discussion can be found in [2]. In this subsection, we introduce a modified NSCT based fusion method to fuse the SCM enhanced mammogram and its enhanced reverse version. Weighted region energy and directive contrast are selected as the fusion rules of low- and high-frequency coefficients, respectively. The proposed fusion framework can preserve more details in source mammograms and further improve the quality of fused mammogram.

HVS is sensitive to intensity contrast, but insensitive to luminance value at independent positions. Therefore, it is necessary to quantify the intensity contrast. The local luminance contrast is defined as [42]

$$ R = \frac{{L - {L_{B}}}}{{{L_{B}}}}, $$
(14)

where L denotes the luminance at a certain location in the image plane, and LB represents the luminance of the local background, regarded as low frequency component. Hence, LLB can be taken as local high-frequency component. The local luminance contrast can be further extended as directive contrast in multiscale domain, which is given by [2]

$$ {D_{l,\theta }}\left( {i,j} \right) = \left\{ {\begin{array}{c} {{{\left( {\frac{1}{{{C_{\ell} }\left( {i,j} \right)}}} \right)}^{\partial} }\frac{{SM{L_{l,\theta }}\left( {i,j} \right)}}{{{C_{\ell} }\left( {i,j} \right)}},\quad if {C_{\ell} }\left( {i,j} \right) \ne 0}\\ {SM{L_{l,\theta }}\left( {i,j} \right), if {C_{\ell} }\left( {i,j} \right) = 0} \end{array}} \right., $$
(15)

where \({C_{\ell } }\left ({i,j} \right )\) is the low-frequency subband at the coarsest level , is a visual constant representing the slope of the best fitted lines through high-contrast data, and \(SM{L_{l,\theta }}\left ({i,j} \right )\) is the sum-modified-Laplacian of the NSCT high-frequency subband at scale l and orientation 𝜃, which is computed over a \(\left ({2P + 1} \right ) \times \left ({2Q + 1} \right )\) pixel block, given by

$$ SM{L_{l,\theta }}\left( {i,j} \right) = \sum\limits_{m = i - P}^{i + P} {\sum\limits_{n = j - Q}^{j + Q} {\nabla_{l,\theta }^{2}H(m,n)} }, $$
(16)

and

$$ \begin{array}{@{}rcl@{}} \nabla_{l,\theta }^{2}H(i,j)&=& \left| {2{H_{l,\theta }}(i,j) - {H_{l,\theta }}(i - 1,j) - {H_{l,\theta }}(i + 1,j)} \right| \\ &+& \left| {2{H_{l,\theta }}(i,j) - {H_{l,\theta }}(i,j - 1) - {H_{l,\theta }}(i,j + 1)} \right|, \end{array} $$
(17)

where \({H_{l,\theta }}\left ({i,j} \right )\) is the high-frequency subband at position (i,j), scale l, and orientation 𝜃. It has been proved that the directive contrast is less sensitive to noise and provides the most salient features [2]. Therefore, in order to select the most prominent texture and edge information, we use the directive contrast to fuse high-frequency coefficients.

Region energy is computed over a \(\left ({2M + 1} \right ) \times \left ({2N + 1} \right )\) (usually 3 × 3, 5 × 5, or 7 × 7) pixel block as [44]

$$ RE(i,j) = \sum\limits_{m = - M}^{M} {\sum\limits_{n = - N}^{N} {{{\left( {I\left( {i + m,j + n} \right) - I\left( {i,j} \right)} \right)}^{2}}} }, $$
(18)

where I(i,j) represents the gray value at position (i,j). In general, pixels with larger values of region energy represent the salient features of the image, such as edges and lines. So we can regard region energy as a quantitative metric of image clarity and use it to fuse low-frequency subimages. The fused low-frequency is calculated by

$$ C_{\ell}^{F}\left( {i,j} \right) = \left\{ {\begin{array}{c} {C_{\ell}^{A}\left( {i,j} \right),} \\ {C_{\ell}^{B}\left( {i,j} \right),} \end{array}} \right.\begin{array}{*{20}{c}} {R{E_{C_{\ell}^{A}}}\left( {i,j} \right) \ge R{E_{C_{\ell}^{B}}}\left( {i,j} \right)} \\ {R{E_{C_{\ell}^{A}}}\left( {i,j} \right) < R{E_{C_{\ell}^{B}}}\left( {i,j} \right)} \end{array}, $$
(19)

where A and B are two input images. \(C_{\ell }^{A}\) and \(C_{\ell }^{B}\) denote low-frequency subbands at the coarsest level of A and B, respectively. Region energy based fusion rule can extract more details, and is less sensitive to noises [44]. However, when the region energies of two corresponding pixels are close to each other, this method may lead to information loss. Therefore, we propose a novel fusion criterion based on weighted region energy. The weighted region energy of the low-frequency subband at the coarsest level is defined as

$$ W{E_{\ell} }(i,j) = {D_{\ell} }\left( {i,j} \right) \cdot RE\left( {i,j} \right), $$
(20)

where \({D_{\ell } }\left ({i,j} \right )\) denotes the directive contrast in NSCT domain. When the difference between the region energies of two corresponding pixels is great, the NSCT coefficients with larger region energy are selected as fused coefficients. On the other hand, when the region energies of two corresponding pixels are close to each other, the fused low-frequency is calculated by

$$ C_{\ell}^{F}\left( {i,j} \right) = \frac{{\left( {WE_{\ell}^{A}\left( {i,j} \right)C_{\ell}^{A}\left( {i,j} \right) + WE_{\ell}^{B}\left( {i,j} \right)C_{\ell}^{B}\left( {i,j} \right)} \right)}}{{\left( {WE_{\ell}^{A}\left( {i,j} \right) + WE_{\ell}^{B}\left( {i,j} \right)} \right)}}, $$
(21)

Since directive contrast Dl(i,j) is the extension of local luminance contrast in multi-resolution domain, from (20) and (21), we can find that more information will be extracted from pixels with larger local contrast. Thus, by classing the fusion task into two cases and use the directive contrast as weighting factor, the proposed fusion rule is able to avoid information loss while improve the contrast of fused mammograms.

The proposed image fusion scheme includes the following steps:

  • NSCT Based Decomposition

    Perform -level NSCT on the source images A and B to obtain their low-frequency subimages \(C_{\ell }^{A}\), \(C_{\ell }^{B}\) and a series of high-frequency subimages \(H_{l,\theta }^{A}\), \(H_{l,\theta }^{B}\) at each level \(l \in \left [ {1, \ell } \right ]\) and direction 𝜃, i.e., \({\mathrm {A}} : \left \{ {C_{\ell }^{A},H_{l,\theta }^{A}} \right \}\) and \({\mathrm {B}} : \left \{ {C_{\ell }^{B},H_{l,\theta }^{B}} \right \}\).

  • Fusion of High-frequency Subimages

    The high-frequency coefficients usually include most details and edge features of the source image. Since directive contrast can be thought as an quantitative expression of image detail and be less sensitive to noise, we select directive contrast as the fusion rule for high-frequency subimages. Firstly, the directive contrast for NSCT high-frequency subimages is computed using (15)-(17), denoted by \({D_{H_{l,\theta }^{A}}}\) and \({D_{H_{l,\theta }^{B}}}\) at each level l in direction 𝜃, respectively. Then the high-frequency coefficients are fused, and the fused high-frequency coefficients \(H_{l,\theta }^{F}\) are defined by

    $$ H_{l,\theta }^{F}\left( {i,j} \right) = \left\{ {\begin{array}{*{20}{c}} {H_{l,\theta }^{A}\left( {i,j} \right),} \\ {H_{l,\theta }^{B}\left( {i,j} \right),} \end{array}} \right.\begin{array}{*{20}{c}} {{D_{H_{l,\theta }^{A}}}\left( {i,j} \right) \ge {D_{H_{l,\theta }^{B}}}\left( {i,j} \right)} \\ {{D_{H_{l,\theta }^{A}}}\left( {i,j} \right) < {D_{H_{l,\theta }^{B}}}\left( {i,j} \right)} \end{array} $$
    (22)
  • Fusion of Low-frequency Subimages

    The low-frequency coefficients usually represent the approximation component of the source images. The simplest way to get the fused bands is conventional averaging [2]. However, it leads to information loss and low contrast of the fused image. Selecting low-frequency coefficients is still an open problem. In this subsection, we propose a novel fusion rule based on weighted region energy for low-frequency subimages. Firstly, the region energy and the weighted region energy for NSCT low-frequency subimages are computed with (18) and (20), denoted by \(R{E_{C_{\ell }^{A}}}\), \(R{E_{C_{\ell }^{B}}}\), \(W{E_{C_{\ell }^{A}}}\) and \(W{E_{C_{\ell }^{B}}}\), respectively. Then we choose fusion rule by estimating the difference between the region energies of two corresponding pixels. If the region energies of two corresponding pixels are close to each other, i.e., \(\left | {R{E_{C_{\ell }^{A}}} - R{E_{C_{\ell }^{B}}}} \right | > T\), where T is a threshold which is normally set as 0.1, the fused low-frequency is calculated by (19). If \(\left | {R{E_{C_{\ell }^{A}}} - R{E_{C_{\ell }^{B}}}} \right | \le T\), the fused low-frequency coefficients \(C_{\ell }^{F}\) is calculated by (21).

  • Inverse NSCT Based Reconstruction

    Perform -level inverse NSCT on the fused low-frequency \(\left ({C_{\ell }^{F}} \right )\) and high-frequency \(\left ({H_{l,\theta }^{F}} \right )\) subimages to get the fused image F.

2.2 Adaptive unsharp masking

Besides contrast that can be well enhanced by the proposed Bi-SCM, textual details are also very important for mammography images. In conventional enhancement models, unsharp masking [26] is usually adopted for enhancing textual details. In particular, every pixel y(i,j) in the enhanced image can be obtained from original image I through

$$ y(i,j) = I(i,j) + \lambda z(i,j), $$
(23)

where z(i,j) is a high-passed version of the input image, and λ is the enhancement factor. The traditional UM can enhance details in the original image effectively. However, using a universal enhancement gain for the whole image leads to over-enhancement and noise amplification. In this paper, to suppress the noise sensitivity, a cubic filter [35] is employed to acquire z(i,j), which is defined as

$$ \begin{array}{l} z\left( {i,j} \right) = {\left( {{I_{i - 1,j}} - {I_{i + 1,j}}} \right)^{2}}\left( {2{I_{i,j}} - {I_{i - 1,j}} - {I_{i + 1,j}}} \right) \\ {\kern32pt} + {\left( {{I_{i,j - 1}} - {I_{i,j + 1}}} \right)^{2}}\left( {2{I_{i,j}} - {I_{i,j - 1}} - {I_{i,j + 1}}} \right). \end{array} $$
(24)

On the other hand, in order to reduce the over-enhancement, we choose the log-ratio operations and adaptive gain control algorithm [12] instead of arithmetic operations and universal enhancement gain, respectively. The output of the adaptive UM scheme becomes

$$ y\left( {i,j} \right) = I \oplus \left[ {\lambda \left( z \right) \otimes z} \right], $$
(25)
$$ \lambda \left( z \right) = \partial + \beta \exp (- {\left| {2z - 1} \right|^{\gamma} }), $$
(26)

where γ is a decreasing factor, and γ > 0. The addition operation ⊕ and scalar multiplication operation ⊗ of the log-ratio operations are formally defined as

$$ \psi \left( x \right) = \frac{{1 - x}}{x}, $$
(27)
$$ {x_{1}}\oplus {x_{2}} = \frac{1}{{1 + \psi \left( {{x_{1}}} \right)\psi \left( {{x_{2}}} \right)}}, $$
(28)
$$ \lambda \otimes x = \frac{1}{{1 + \psi {{\left( x \right)}^{\lambda} }}}. $$
(29)

The gray scale set is closed under log-ratio operations. In order to avoid over-enhancement, λ is excepted to gradually decrease from \({\lambda _{{\max \limits } }}\) when the value of z is moderate to \({\lambda _{{\min \limits } }}\) when z is high or low. Thus, we can assume \({\lambda _{{\max \limits } }} = \lambda \left (1 \right )\) and \({\lambda _{{\min \limits } }} = \lambda \left (0.5 \right )\). Then the two parameters and β can be obtained by

$$ \beta = \left( {{\lambda_{\max }} - {\lambda_{\min }}} \right)/\left( {1 - {e^{- 1}}} \right) $$
(30)

and

$$ \partial = {\lambda_{\max }} - \beta, $$
(31)

respectively. The proposed adaptive UM algorithm can significantly enhance details while avoid over-enhancement. The values of parameters γ, \({\lambda _{{\max \limits } }}\), and \({\lambda _{{\min \limits } }}\) depend on the contrast of the input mammogram. We experimentally found that the choices of γ = 0.005, \({\lambda _{{\max \limits } }} = 3\), and \({\lambda _{{\min \limits } }} = 1\) are effective in providing good results to almost all tested mammograms.

3 Experimental results and discussions

This section provides the experimental results and a comprehensive discussion of the proposed approach. Mammograms for experiments are from the mini-mammographic image analysis society (MIAS) database of mammograms [40] and the digital database for screening mammography (DDSM) of the University of South Florida [19]. All test mammograms are cropped into images with smaller size which contain masses, micro-calcifications or abnormal regions.

3.1 Comparison of fusion methods

To evaluate the proposed fusion method, it was tested on 100 mammograms, half of which come from the MIAS database and others come from the DDSM database. Sufficient state-of-the-art approaches are taken into comparison, including PCNN [43], wavelet [31], contourlet [33], and NSCT (NSCT-1 [2] and NSCT-2 [41]) based methods. For wavelet, contourlet, and NSCT based methods, the level of decomposition is set to 3, and wavelet based method uses the `db4’ wavelet as basis. Typical fusion results are visualized in Fig. 2.

Fig. 2
figure 2

Comparison of mammogram fusion results with different fusion algorithms. (a) Enhanced mammogram from conventional SCM; (b) Enhanced reverse mammogram from conventional SCM;(c) Fused results by PCNN; (d) Contourlet; (e) Wavelet; (f) NSCT-1; (g) NSCT-2; (h) The proposed fusion method

Mutual information (MI) and edge based similarity measure QAB/F are commonly used objective criteria for evaluation of fusion algorithms [28]. Specifically, MI is utilized to measure the mutual dependence of the fused image F and two input images A and B. In other words, MI reflects how much information of the original image is obtained by the fused image, which is expressed as [32]

$$ MI = MI(A,F) + MI(B,F), $$
(32)

and

$$ MI(U,V) = \sum\limits_{u \in U} {\sum\limits_{v \in V} {p(u,v)} } {\log_{2}}\frac{{p(u,v)}}{{p(u)p(v)}}, $$
(33)

where p(u,v) is the joint probability distribution function of U and V, and p(u) and p(v), which are the marginal probability distribution functions of U and V, respectively, can be obtained by simple normalization of the joint and marginal histograms of both variables.

On the other hand, ESM computes the similarity of edges between the fused image and input images, which measures the ability of the fusion process to transfer important detail information as accurately as possible, and is given by [46]

$$ {Q^{AB/F}} = \frac{{\sum\limits_{i = 1}^{M} {\sum\limits_{j = 1}^{N} {\left[ {Q_{i,j}^{AF}w_{i,j}^{A} + Q_{i,j}^{BF}w_{i,j}^{B}} \right]} } }}{{\sum\limits_{i = 1}^{M} {\sum\limits_{j = 1}^{N} {\left[ {w_{i,j}^{A} + w_{i,j}^{B}} \right]} } }}, $$
(34)

where \(w_{i,j}^{A}\) and \(w_{i,j}^{B}\) are the weighting coefficients. \(Q_{i,j}^{AF}\) and \(Q_{i,j}^{BF}\) are defined as

$$ Q_{i,j}^{AF} = Q_{g}^{AF}\left( {i,j} \right)Q_{\alpha}^{AF}\left( {i,j} \right) $$
(35)

and

$$ Q_{i,j}^{BF} = Q_{g}^{BF}\left( {i,j} \right)Q_{\alpha}^{BF}\left( {i,j} \right), $$
(36)

where \(Q_{g}^{*F}\) and \(Q_{\alpha }^{*F}\) are the edge strength and orientation preservation values at location (i,j), respectively, and can be derived

$$ Q_{g}^{*F}\left( {i,j} \right) = \frac{{{{\Gamma}_{g}}}}{{1 + {e^{{\kappa_{g}}\left( {{G^{*F}}\left( {i,j} \right) - {\sigma_{g}}} \right)}}}} $$
(37)

and

$$ Q_{\alpha}^{*F}\left( {i,j} \right) = \frac{{{{\Gamma}_{\alpha} }}}{{1 + {e^{{\kappa_{\alpha} }\left( {{{\Delta}^{*F}}\left( {i,j} \right) - {\sigma_{\alpha} }} \right)}}}}, $$
(38)

where the constants Γ, κ, and σ determine the shape of the sigmoid functions used to form the edge strength and orientation preservation value. GF and ΔF, which represent the relative strength and orientation values between input image and fused image respectively, are given by

$$ {G^{*F}}\left( {i,j} \right) = \left\{ {\begin{array}{*{20}{c}} {\frac{{{g_{F}}\left( {i,j} \right)}}{{{g_{*}}\left( {i,j} \right)}}, {g_{*}}\left( {i,j} \right) > {g_{F}}\left( {i,j} \right)}\\ {\frac{{{g_{*}}\left( {i,j} \right)}}{{{g_{F}}\left( {i,j} \right)}}, {\text{otherwise} }} \end{array}} \right. $$
(39)

and

$$ {{\Delta}^{*F}}\left( {i,j} \right) = 1 - \frac{{\left| {{\alpha_{*}}\left( {i,j} \right) - {\alpha_{F}}\left( {i,j} \right)} \right|}}{{\pi /2}}. $$
(40)

The edge strength g and orientation α are got by a Sobel edge operator, i.e.

$$ {g_{*}}\left( {i,j} \right) = \sqrt {s_{*}^{x}{{\left( {i,j} \right)}^{2}} + s_{*}^{y}{{\left( {i,j} \right)}^{2}}} $$
(41)
$$ {\alpha_{*}}\left( {i,j} \right) = {\tan^{- 1}}\left( {\frac{{s_{*}^{x}\left( {i,j} \right)}}{{s_{*}^{y}\left( {i,j} \right)}}} \right), $$
(42)

where \(s_{*}^{x}\) and \(s_{*}^{y}\) are the convolved results with the horizontal and vertical Sobel templates.

Table 1 reports the average MI values and average QAB/F values of each database for all tested mammograms. All two fusion scores are normalized where values closer to 1 indicate a higher quality of the fused image. And the best records are marked in bold. From Fig. 2 and Table 1, we can see that PCNN cannot capture features at different levels and bandwidths, as shown in Fig. 2(c). This problem is overcome in multi-resolution based algorithms, especially NSCT based algorithms. This result is from the fact that NSCT can provide better shift invariance and multidirectional selectivity than other multiscale analysis tools. Among NSCT based algorithms, the proposed method performs best. The algorithm in [41] uses local entropy and region energy to fuse low- and high-frequency coefficients respectively, leading to low contrast of the fused image, as shown in Fig. 2(f). The performances of the algorithm in [2] and the proposed method are close to each other, giving better results compared to others algorithms, as shown in Fig. 2(g) and (h), respectively. However, looking carefully at the results, the proposed algorithm not only improves the visual effect of the fused image, but also preserves more details in the source image. This is due to the fact that our method can reduce the information loss and improve the contrast of the fused images. In conclusion, both subjective and objective evaluations prove the superiority of the proposed algorithm.

Table 1 Average evaluation indices for fused mammograms

3.2 Comparison of enhancement performance

The proposed enhancement method is tested on 50 mammograms obtained from the MIAS database, and compared with several well-known enhancement algorithms, including the RUM [36], nonseparable cubic UM (NSCUM) [35], generalized UM (GUM) [12], AUM [30], multi-scale retinex (MSR) [34], and brightness preserving dynamic fuzzy HE (BPDFHE) [38]. Typical mammograms and the corresponding enhanced results are shown in Figs. 345 and 6. The initial values of the parameters are listed in Table 2.

Fig. 3
figure 3

Comparison of mammogram enhancement with different algorithms. (a) Original mammogram. Enhanced results by (b) RUM; (c) NSCUM; (d) AUM; (e) GUM; (f) MSR; (g) BPDFHE; (h) The proposed Bi-SCM. Note that RUM, NSCUM, AUM, and GUM are UM based methods; MSR is MD based method; BPDFHE is HE based method

Fig. 4
figure 4

Comparison of mammogram enhancement with different algorithms. (a) Original mammogram. Enhanced results by (b) RUM; (c) NSCUM; (d) AUM; (e) GUM; (f) MSR; (g) BPDFHE; (h) The proposed Bi-SCM. Note that RUM, NSCUM, AUM, and GUM are UM based methods; MSR is MD based method; BPDFHE is HE based method

Fig. 5
figure 5

Comparison of mammogram enhancement with different algorithms. (a) Original mammogram. Enhanced results by (b) RUM; (c) NSCUM; (d) AUM; (e) GUM; (f) MSR; (g) BPDFHE; (h) The proposed Bi-SCM. Note that RUM, NSCUM, AUM, and GUM are UM based methods; MSR is MD based method; BPDFHE is HE based method

Fig. 6
figure 6

Comparison of mammogram enhancement with different algorithms. (a) Original mammogram. Enhanced results by (b) RUM; (c) NSCUM; (d) AUM; (e) GUM; (f) MSR; (g) BPDFHE; (h) The proposed Bi-SCM. Note that RUM, NSCUM, AUM, and GUM are UM based methods; MSR is MD based method; BPDFHE is HE based method

Table 2 Initialized parameters for enhancement experiments

The RUM, NSCUM and AUM are modified UM methods, which can enhance detail information. However, seeing Figs. 3(b)-6(b), Figs. 3(c)-6(c), and Figs. 3(d)-6(d), we can found that the contrast enhancement is almost indiscernible. Moreover, RUM introduces spot artifacts in enhanced images. GUM employs CLAHE [29] to enhance images, which produces over-enhancement in some portion of the image and enhances background noises, as shown in Figs. 3(e)-6(e). BPDFHE preserves the mean image-brightness, but introduces over-enhancement at the same time, leading to masses and micro-calcifications more undistinguishable than the original ones, as shown in Figs. 3(g)-6(g). In comparison with GUM and BPDFHE, the proposed Bi-SCM method can restrain noise amplification and over-enhancement to some extent. MSR simply takes reflectance as the final results, such that the light source directions of its enhanced images are confused, as shown in Figs. 3(f)-6(f). Comparing with MSR, the proposed Bi-SCM preserves the light source directions. In summary, one can see from the experimental results that the proposed Bi-SCM method not only improves the contrast but also enhances the details. In addition, it restrains noise amplification and over-enhancement. These observations indicate that the goal of the paper, which aims at enhancing visual quality of mammography images, is achieved by the proposed Bi-SCM.

To evaluate the quality of the enhanced mammography images quantitatively, the enhancement measures entitled measure of enhancement (EME) [1] and the increase of the number of visible edge (INVE) [18] are employed. EME is related to the ratio between the maximum intensity and the minimum intensity of a local region. Higher EME corresponds to more contrast improvement of the enhanced images. While INVE computes the ratio between the gradient of the visible edges before and after enhancement. Larger value of INVE indicates better enhancement performance of contrast and clearer visibility of details. Both of EME and INVE values greater than 2 indicate that contrast and detail visibility are well enhanced. The EME and INVE values of five mammograms are listed in Tables 3 and 4, respectively. And the best records are marked in bold. Examining Tables 3 and 4, we can conclude that the values of EME and INVE from GUM, MSR, and our UM scheme are higher than those of the rest algorithms, reflecting the state-of-the-art performance of the proposed Bi-SCM approach.

Table 3 Quantitative measurement results of EME
Table 4 Quantitative measurement results of INVE

In order to report the overall results on 50 test mammograms, we plot box plots in terms of EME and INVE values in Figs. 7 and 8, respectively. One can conclude from Figs. 7 and 8 that the proposed Bi-SCM achieves better performance than state-of-the-art enhancement methods, while the GUM method takes the second place.

Fig. 7
figure 7

Box-plots of EME for 50 test images

Fig. 8
figure 8

Box-plots of INVE for 50 test images

3.3 Discussions

The conventional SCM model makes full use of the intensity distribution and spatial information of image to improve contrast. When SCM is used to enhance images, bright areas would be processed more coarsely than dark areas, resulting in losing information of lesions. To overcome the drawbacks of SCM, we propose a Bi-SCM model to improve the contrast while avoiding information loss, making the lesions being highlighted from complex background. However, fusing the two SCM-enhanced results increases the computational complexity.

Specially, the proposed method is implemented by MATLAB R2010b on a PC with Intel i5 processer and 8 GB RAM. The average processing time for one test image with 256 × 256 pixels is 61.8774s. This is primary because the NSCT based fusion algorithm is a computationally intensive process. While the running time of SCM model is only 0.3293s. In the future, the efficiency of Bi-SCM can be improved from either implementation point-of-view or algorithm point-of-view. First, the Bi-SCM model is currently implemented by software for sequential processing on CPU, which heavily limits the processing speed of the algorithm. One important future research direction is to implement Bi-SCM by parallelly programming on multi-thread processors, such as GPU or FPGA. Second, a comprehensive study of different fusion algorithms will be conducted to reduce the computational burden of fusion process.

4 Conclusion

In this paper, a Bi-SCM model with unsharp masking is proposed for the purpose of enhancing both contrast and textural details of mammography images. The proposed Bi-SCM method not only explores the neural spiking ability in low-contrast image enhancement from a bidirectional way, but also is the first attempt which leverages neural spiking mechanism and unsharp masking to improve both both contrast and textural details. Sufficient experimental results on real breast mammography images show that the proposed method not only improves contrast and enhances details, but also reduces noise amplification and over-enhancement without the priori knowledge of image contents.

In addition, high-quality imaging is an important technique for early detection of tumors, which is further an important way to reduce the death rate caused by breast cancer among women. The proposed Bi-SCM is able to significantly improve the visual quality of mammography images that are usually limited by X-ray hardware systems. Therefore, the proposed method has high potential and shows clinical significance for radiologists to detect and diagnose breast cancer.