Introduction

Today globally, breast cancer is a commonly occurring cancer among women. Occurrence of breast mass is the main cause of developing breast cancer among women and early detection of breast mass may increase the survival rate. One common approach for detecting abnormalities in mammograms is isolation or segmentation of masses/lesions from the breast. Tumors/lesions and calcification are the common abnormalities for breast cancer. Segmentation plays an important role in deciding whether the mammogram image is benign or malignant. Blurry boundaries and low contrast of ultrasound images make automatic segmentation a difficult task. In order to extract any lesion/mass from the mammograms, image segmentation is done. There are many fuzzy and non-fuzzy methods in literature that segment breast tumor, lesion, or mass. But medical images are mostly non-uniformly illuminated and have vague regions/boundaries, so fuzzy methods normally perform better.

Clustering is an important method that segments tumors/lesions in the breast. It partitions the image into homogeneous regions in such a way that the pixels in a group are more similar than the pixels in different groups. Though there are many clustering algorithms in literature that clusters real-time images accurately, the accuracy of clustering algorithm is still a debatable topic. Fuzzy c means clustering (FCM), suggested by Bezdek [1], is the first and most widely used clustering algorithm. It allows each data to belong in ‘c’ number of clusters and each data has a membership degree. The membership degree of each data point in each cluster depends on the distance between the cluster centre and data point. Later, many researchers modified the FCM algorithm by incorporating spatial terms, using different types of kernel functions that may be Gaussian, radial basis, or hypertangent kernels [2,3,4,5,6]. But the accuracy in the segmentation is not better. Later with the introduction of intuitionistic fuzzy set theory by Atanassov [7], intuitionistic fuzzy c means clustering (IFCM) is introduced [8,9,10]. Intuitionistic fuzzy set takes into account a hesitation degree while defining the membership function. So, the non-membership degree is not the complement of the membership degree rather less than or equal to the complement of the membership degree.

With the introduction of IFCM algorithm, many modified intuitionistic fuzzy clustering algorithms are suggested. Verma et al. [11] suggested an intuitionistic fuzzy clustering algorithm with the introduction of hesitation degree. They used membership, non-membership, and hesitation degree while computing cluster centroids. The distance function, they used, is the Euclidean distance that considers all the three terms—membership, non-membership, and hesitation degrees. Xu and Wu [12] suggested an intuitionistic fuzzy clustering method, where they used an intuitionistic fuzzy distance measure and extended the algorithm to an interval-valued intuitionistic fuzzy sets. Kumar and Harish [13] used Hausdorff distance measure in place of Euclidean distance and used Bustince intuitionistic fuzzy generator to compute the non-membership degree. Dubey [14] suggested a rough set-based intuitionistic fuzzy clustering algorithm. They initialized cluster centroids from the location of peak points, obtained from intuitionistic fuzzy roughness measure. Roughness measure is computed by considering intuitionistic fuzzy histon as an upper approximation of rough set and fuzzy histogram as a lower approximation of rough set. A new fuzzy complement is used to compute the non-membership degree and then performed clustering.

The methods that use intuitionistic fuzzy set work better than fuzzy set as intuitionistic fuzzy set considers two uncertainties—membership and non-membership degrees. The reason is that medical images contain uncertainties that are present in the form of vague boundaries or non-uniform illumination. Fuzzy set considers only one uncertainty, which is the membership function whereas the intuitionistic fuzzy set considers two uncertainties, so better results may be expected.

In this paper, we propose a novel intuitionistic fuzzy clustering approach to segment breast mass/tumor. Initially, the image is converted to an intuitionistic fuzzy image using a novel intuitionistic fuzzy generator for computing the non-membership degree. Then, using hesitation degree, two levels are computed. Using Zadeh’s min t-norm, a new membership function is computed that is used to create an interval type 2 fuzzy image. In the clustering algorithm, four features—pixel energy, mean, standard deviation, gradient features—are used. Two types of distance functions—fuzzy divergence and an exponential distance measure—are used. Further, in each membership iteration, the membership matrix is updated using hesitation degree. The algorithm is compared with existing methods quantitatively and qualitatively.

The paper is organized as follows. “Intuitionistic Fuzzy Set” outlines the preliminaries of an intuitionistic fuzzy set. “Conversion of the Image to an Intuitionistic Fuzzy Image” presents the concept of converting the image to an intuitionistic fuzzy image. “Conversion of Intuitionistic Fuzzy Image to an Interval Type 2 Fuzzy Image” details the conversion of intuitionistic fuzzy image to an interval type 2 fuzzy image. “Clustering” describes the novel intuitionistic fuzzy clustering algorithm. Results and discussion are detailed in “Results and Discussion”. “Conclusion” concludes the paper.

Highlights of the Paper

  1. 1.

    The given image is initially converted to an intuitionistic fuzzy image using a novel intuitionistic fuzzy generator.

  2. 2.

    The intuitionistic fuzzy image is then converted to an interval type 2 fuzzy image, where the membership level lies in an interval range. Then, intuitionistic fuzzy c means clustering algorithm is applied, where the constant term in the hesitation degree in the intuitionistic fuzzy set is selected from a graphical approach. Two types of distance functions—intuitionistic fuzzy divergence and an exponential distance functions—are used in the clustering process. Four features are considered in the clustering algorithm—mean, pixel energy, gradient, standard deviation.

  3. 3.

    Lesion/tumor region is then isolated from other regions using histogram thresholding.

  4. 4.

    Morphological operators such as opening operation are applied to remove any unwanted spots to obtain a final segmented image.

Intuitionistic Fuzzy Set

Fuzzy set theory, proposed by Zadeh [16], was designed to mathematically represent uncertainty and vagueness in a set. This uncertainty is represented in terms of membership function. But in real-time images, fuzzy techniques sometimes face difficulty in case when there are more uncertainties. To overcome this drawback, Atanassov in 1985 proposed an intuitionistic fuzzy set (IFS) theory where two uncertainties—membership degree and non-membership degree—are considered. In fuzzy set, non-membership degree is the complement of the membership degree, but in IFS, non-membership degree is less than or equal to the complement of the membership degree. This ‘less than’ is due to the hesitation while defining the membership function.

For a finite set X, \({X={x}_{1},{x}_{2},\dots ,{x}_{\mathrm{n}}}\), an IFS \(A\) may be mathematically represented as:

\(A=\left\{\left.(x,{\mu }_{\mathrm{A}}\left(x\right),{\nu }_{\mathrm{A}}\left(x\right)\right|x\in X\right\}\), where \({\mu }_{\mathrm{A}}\left(x\right), {\nu }_{\mathrm{A}}\left(x\right)\) are the membership and non-membership degrees, respectively, with the condition

$$0\le {\mu }_{\mathrm{A}}(x)+{\nu }_{\mathrm{A}}(x)\le 1.$$
(1)

\({\mu }_{\mathrm{A}}\left(x\right)\) is the membership function that may be Gaussian, Gamma, triangular, S-membership function, or any other type of membership function. Non-membership function, \({\nu }_{\mathrm{A}}\left(x\right)\), is computed from intuitionistic fuzzy generator that may be Sugeno, Yager, or any other generator.

In Eq. (1), the condition, “\({\mu }_{\mathrm{A}}\left(x\right)+{\nu }_{\mathrm{A}}\left(x\right)\le 1"\), is due to the hesitation that arises while defining the membership function. Hesitation degree is denoted by \({\pi }_{\mathrm{A}}\left(x\right)\) with \({\pi }_{\mathrm{A}}\left(x\right)\le 1\). So, considering all the three degrees, we get

$${{\mu }_{\mathrm{A}}\left(x\right)+{\nu }_{\mathrm{A}}\left(x\right)+\pi }_{\mathrm{A}}\left(x\right)=1$$

Intuitionistic fuzzy generator is almost similar to fuzzy negation or complement. The difference between fuzzy complement and intuitionistic fuzzy generator is explained as follows:

A function is called intuitionistic fuzzy generator if \({\nu }_{\mathrm{A}}\left(x\right)\le 1-{\mu }_{\mathrm{A}}\left(x\right)\), ‘\(\le\)’ is due to the consideration of hesitation degree while defining the membership function. When there is no hesitation, it becomes a standard fuzzy set, i.e., \({\nu }_{\mathrm{A}}\left(x\right)=1-{\mu }_{\mathrm{A}}\left(x\right)\).

Sugeno class of fuzzy complement [17] is given as

\(n\left({\mu }_{\mathrm{A}}\left(x\right)\right)=\frac{1-{\mu }_{\mathrm{A}}\left(x\right)}{1+\lambda .{\mu }_{\mathrm{A}}\left(x\right)}\), where \(\lambda\) is the constant.

If the value of \(\lambda\) is taken as\(-1<\lambda <0\), then \(n\left({\mu }_{\mathrm{A}}\left(x\right)\right)>1-\mu (x)\) or \({\mu }_{\mathrm{A}}\left(x\right)+ n\left({\mu }_{\mathrm{A}}\left(x\right)\right)>1\) and \({\pi }_{\mathrm{A}}\left(x\right)<0\). Then, the set is not an intuitionistic fuzzy set. So, Sugeno’s intuitionistic fuzzy generator is written as \(n\left({\mu }_{\mathrm{A}}\left(x\right)\right)=\frac{1-{\mu }_{\mathrm{A}}\left(x\right)}{1+\lambda .{\mu }_{\mathrm{A}}\left(x\right)}\) with the condition \(\lambda \ge 0\).

Conversion of the Image to an Intuitionistic Fuzzy Image

In this paper, a new type of intuitionistic fuzzy generator is used. Many authors suggested different types of intuitionistic fuzzy generators [15, 17,18,19]. An intuitionistic fuzzy generator recently used by Chaira in [15], where the author used an increasing function, \(f\left(x\right)=\frac{1}{\uplambda }\mathrm{log}[1+x\left(1+\lambda \right)]\) and the fuzzy complement was obtained as \(n(x)=\frac{1-x}{1+(1+\lambda )x} \times \lambda >1\). But in this work, a different type of increasing function is used, where the constant term is different. If the increasing function in [15] is used, then the misclassification error will be more and better result will not be obtained. A sample result is shown in Fig. 1 below.

Fig. 1
figure 1

(a) Original image. (b) Segmentation result using increasing generator by [15]; (c) proposed segmented image—method 1; (d) proposed segmented image—method 2; (e) ground truth image

To create an intuitionistic fuzzy generator, characterization of fuzzy complement is required. Characterization of fuzzy complemental function is given as follows [17]:

If \(n\) is a function, \(n:\left[\mathrm{0,1}\right]\to [\mathrm{0,1}]\), then \(n\) is an involutive fuzzy complement if and only if there exists a continuous function \(f\) such that \(f\left(0\right)=0\) and \(f\) is an increasing function. It is represented as

$$n\left(\mu\left(x\right)\right){=f}^{-1}\left[\ f\left(1\right)-f\left(\mu\left(x\right)\right)\right]\;\mathrm{with}\;\mu\left(x\right)\in\left[0,1\right],$$
(2)

where \(f\left(x\right)\) is an increasing function.

Klir and Yuan [18] showed that involutive fuzzy complement function can be generated using a dual of Eq.  (2). It is written as:

$$n\left(\mu\left(x\right)\right){=g}^{-1}\left[ \ f\left(0\right)-f\left(\mu\left(x\right)\right)\right]\mathrm{with}\;\mu\;\left(x\right)\in\left[0,1\right],$$
(3)

where \(g\left(x\right)\) is a decreasing function. This is called dual generator.

In this work, let us consider an increasing function:

$$f\left(\mu (x)\right)=\frac{1}{{(\lambda -1)}^{2}}\mathrm{log}[1+\mu (x){(\lambda -1)}^{2}]$$
(4)

Considering \(\mu \left(x\right) \ \mathrm{as} \ x\) for simplicity, we may write

$$f\left(x\right)=\frac{1}{{(\lambda -1)}^{2}}\mathrm{log}\left[1+x{\left(\lambda -1\right)}^{2}\right],$$
(5)

with \(f\left(0\right)=\frac{1}{{(\lambda -1)}^{2}}\mathrm{log}\left[1\right]=0\), \(\lambda\) is a constant term.

$$f\left(1\right)=\frac{1}{{(\lambda -1)}^{2}}\mathrm{log}\left[1+{(\lambda -1)}^{2}\right]=\frac{1}{{(\lambda -1)}^{2}}\mathrm{log}({\lambda }^{2}-2\lambda +2)$$

Inverse function of Eq. (5) is written as \({g}^{-1}\left(x\right)=\frac{{e}^{{(\lambda -1)}^{2}x}-1}{{(\lambda -1)}^{2}}\)

So,\(n(x)={f}^{-1}[\frac{1}{{(\lambda -1)}^{2}}\mathrm{log}\left({\lambda }^{2}-2\lambda +2\right)-\frac{1}{{(\lambda -1)}^{2}}\mathrm{log}[1+x{(\lambda -1)}^{2}]\)

$$={f}^{-1}[\frac{1}{{(\lambda -1)}^{2}}\mathrm{log}\left(\frac{{\lambda }^{2}-2\lambda +2}{1+x.{(\lambda -1)}^{2}}\right)$$

After solving, we get

$$\begin{aligned}\left(x\right)=& \frac{1-x}{1+x. {\left(\lambda -1\right)}^{2}}. \text{for any value of } \lambda \\& \text{ , as the term } (\lambda - 1)^{2} \text{ is a squared term }\end{aligned}$$
(6)

In terms of membership function,\(\mu \left(x\right)\), the non-membership function is written as

$$n\left(\mu \left(x\right)\right)=\varphi \left(\mu \left(x\right)\right)=\frac{1-\mu (x)}{1+\mu (x){(\lambda -1)}^{2}},$$
(7)

where \(\uplambda\) in Eq. (7) is constant and it determines the non-membership function.

It is observed that with any value of \(\lambda\),\(n\left(\mu \left(x\right)\right)\le 1-\mu (x)\). This implies that the condition of intuitionistic fuzzy generator holds.

In the case of images, initially, the image is converted to a fuzzy image, and the membership function is computed using normalization, which is given as follows:

$$\mu_{mn}=\frac{g_{mn}-g_{min}}{g_{max}-g_{min}}, \text{where}$$
(8)

\({g}_{\mathrm{mn}}\) is the pixel gray level, \({g}_{\mathrm{min}}\) and \({g}_{\mathrm{max}}\) are the minimum and maximum gray levels of the image, respectively.

Intuitionistic fuzzy membership function of the image, with membership function\(,{\mu }_{\mathrm{mn}}\), at \({(m,n)}^{\mathrm{th}}\) position, is now written as

$${\mu }_{\text{mn}}^{^\prime}=1-\frac{1-{\mu }_{\text{mn}}}{1+{\mu }_{\text{mn}}{(\lambda -1)}^{2}}=\frac{[1+{(\lambda -1)}^{2}]{.\mu }_{\text{mn}}}{1+{.\mu }_{\text{mn}}{(\lambda -1)}^{2}},$$
(9)

\({\mu }_{\mathrm{mn}}\) is obtained from Eq. (3).

Non-membership function is computed using Eq. (2) as

$${\nu }_{\mathrm{mn}}^{^{\prime}}=\varphi \left({\mu }_{\mathrm{mn}}^{^{\prime}}\right)=\frac{1-{\mu }_{\mathrm{mn}}^{^{\prime}}}{1+{\left(\lambda -1\right)}^{2}.{\mu }_{\mathrm{mn}}^{^{\prime}}}\Rightarrow {\nu }_{\mathrm{mn}}^{^{\prime}}=\frac{1-\frac{\left[1+{\left(\lambda -1\right)}^{2}\right]{.\mu }_{\mathrm{mn}}}{1+{\left(\lambda -1\right)}^{2}.{\mu }_{\mathrm{mn}}}}{1+\frac{{\left(\lambda -1\right)}^{2}.\left[1+{\left(\lambda -1\right)}^{2}\right]{.\mu }_{\mathrm{mn}}}{1+{\left(\lambda -1\right)}^{2}.{\mu }_{\mathrm{mn}}}}=\frac{1+{\left(\lambda -1\right)}^{2}.{\mu }_{\mathrm{mn}}-{\mu }_{\mathrm{mn}}-{\left(\lambda -1\right)}^{2}.{\mu }_{\mathrm{mn}}}{1+{\left(\lambda -1\right)}^{2}.{\mu }_{\mathrm{mn}}+{\left(\lambda -1\right)}^{2}.{{\mu }_{\mathrm{mn}}+(\lambda -1)}^{4}.{\mu }_{\mathrm{mn}}}$$
$$=\frac{1-{\mu }_{\mathrm{mn}}}{1+{2.{\mu }_{\mathrm{mn}}.(\lambda -1)}^{2}+{{\mu }_{\mathrm{mn}}.(\lambda -1)}^{4}}$$
(10)

Hesitation degree is obtained as

$${\pi }_{\mathrm{mn}}^{^{\prime}}={1-\mu }_{\mathrm{mn}}^{^{\prime}}-{\nu }_{\mathrm{mn}}^{^{\prime}}.$$
(11)

As \(\lambda\) is not fixed for all the images, optimum value of \(\lambda\) is obtained using an intuitionistic fuzzy entropy.

Zadeh [20] first introduced the idea of fuzzy entropy in 1969. It is a measure of fuzziness in a fuzzy set or amount of uncertainty present in a set. From the concept of De Luca and Termini [21] for non-probabilistic entropy, fuzzy exponential entropy is also defined accordingly [22]. If the membership degree is 0.5, then there is maximum fuzziness or uncertainty, which implies maximum entropy. If the membership degree is 0 or 1, then the fuzziness is less, which implies that entropy is minimum. Just like conventional entropy, if the probability is 0.5, then the entropy, i.e., uncertainty, is maximum and likewise, if the probability is 0 or 1, the entropy is minimum. Intuitionistic fuzzy entropy is similar to fuzzy entropy where non-membership degree is also taken into account.

A real function \(IFE:IFE(X)\to {R}^{+}\) is called intuitionistic fuzzy entropy (IFE) on \(IFS(X)\) [23] if:

(1)\(IFE\left(A\right)=0,\) if A is non-fuzzy,

(2)\(IFE\left(A\right)=\) cardinal \(\left(X\right)=n,\) if \({\mu }_{\mathrm{A}}\left({x}_{\mathrm{i}}\right)= {\nu }_{\mathrm{A}}\left({x}_{\mathrm{i}}\right)=0, \forall {x}_{\mathrm{i}}\)

(3)\(IFE\left(A\right)\le IFE\left(B\right),\) if \(A\) is less fuzzy than \(B\), i.e.,

\({\mu }_{\mathrm{A}}(x)\le {\mu }_{\mathrm{B}}(x)\) and \({\nu }_{\mathrm{A}}(x)\ge {\nu }_{\mathrm{B}}(x)\) for \({\mu }_{\mathrm{B}}\left(x\right)\le {\nu }_{\mathrm{B}}(x)\) or

\({\mu }_{\mathrm{A}}(x)\ge {\mu }_{\mathrm{B}}(x)\) and \({\nu }_{\mathrm{A}}(x)\le {\nu }_{\mathrm{B}}(x)\) for \({\mu }_{\mathrm{B}}\left(x\right)\ge {\nu }_{\mathrm{B}}(x)\)

(4)\(IFE\left(A\right)=IFE({A}^{\mathrm{c}})\), where \(A=\left\{\left.(x,{\mu }_{\mathrm{A}}\left(x\right),{\nu }_{\mathrm{A}}\left(x\right)\right|x\in X\right\}\), \({A}^{\mathrm{c}}=\left\{\left.(x, {{\nu }_{\mathrm{A}}\left(x\right),\mu }_{\mathrm{A}}\left(x\right)\right|x\in X\right\}\) implies

\({\mu }_{{\mathrm{A}}^{\mathrm{c}}}\left({x}_{\mathrm{i}}\right)={\nu }_{\mathrm{A}}\left({x}_{\mathrm{i}}\right)\) and \({\nu }_{{\mathrm{A}}^{\mathrm{c}}}\left({x}_{\mathrm{i}}\right)={\mu }_{\mathrm{A}}\left({x}_{\mathrm{i}}\right)\)

In this paper, an intuitionistic fuzzy entropy is used, which is given as [9]

$$IFE\left(A, \lambda \right)=\frac{1}{M\times N}\sum_{n=1}^{N}\sum_{m=1}^{M}{\pi }_{\mathrm{mn}}{e}^{1-{\pi }_{\mathrm{mn}}},$$
(12)

\({\pi }_{\mathrm{mn}}\) is the hesitation degree at \({(m,n)}^{\mathrm{th}}\) position, which is computed using Eq. (11).

Optimum value of \(\lambda\) is computed by maximizing \(\lambda\) in \(IFE\left(A, \lambda \right)\), which is written as

$${\lambda }_{\mathrm{opt}}=\underset{\uplambda }{\mathrm{max}}IFE\left(A,\lambda \right).$$
(13)

For each value of \(\lambda\), intuitionistic fuzzy entropy (\(IFE\left(A, \lambda \right)\)) is computed and the maximum value of \(\lambda\) is chosen. Substituting the value of \({\lambda }_{\mathrm{opt}}\) in Eq. (9), an intuitionistic fuzzy image is created.

Conversion of Intuitionistic Fuzzy Image to an Interval Type 2 Fuzzy Image

A preliminary on type 2 fuzzy set is discussed.

(a) Type 2 fuzzy set

Type 2 fuzzy set is an extension of fuzzy set, where the membership function of the fuzzy set is considered to be fuzzy. It is also called as a blurred type 1 fuzzy set. If a type 1 fuzzy membership function (primary membership) is blurred, then all of its points shift either left or right of the membership function. The value of the shift or amplitudes at all points may not be equal. That means, corresponding to each primary membership, there is a secondary membership that also lies in the range [0, 1], which defines the possibilities for the primary membership. If the amplitude is the same and equal to 1, then it is called interval type 2 fuzzy set [23]. Thus, interval type 2 fuzzy set consists of two membership levels—lower membership level and upper membership level. A type 2 fuzzy set is written as

$$\stackrel{\sim }{A}=\left\{\left(x,u\right), {\mu }_{\stackrel{\sim }{\mathrm{A}}}\left(x,u\right)\right) \forall x\in X, u\in {J}_{\mathrm{x}}\subseteq [\mathrm{0,1}]$$

where \({\mu }_{\stackrel{\sim }{\mathrm{A}}}\left(x,u\right)\) is the secondary membership function and \({J}_{\mathrm{x}}\) is the primary membership of \(x\).

From the intuitionistic fuzzy image, two levels are constructed—upper and lower membership levels. The upper and lower membership functions are defined as:

$${\mu }^{\mathrm{upper}}={[\mu \left(x\right)]}^{1/\alpha } {\mu }^{\mathrm{lower}}={[\mu \left(x\right)]}^{\alpha },$$
(14)

where \(\alpha \in [1,\infty ]\).

Using hesitation degree from Eq. (11), two levels are modified and computed as follows [24]:

$${\mu }_{\text{lower}}={[abs({\mu }_{\text{mn}}^{^{\prime}}{-\pi }_{\text{mn}}^{^{\prime}})]}^{\alpha }, {\mu }_{\text{upper}}={({\mu }_{\text{mn}}^{^{\prime}}{+\pi }_{\text{mn}}^{^{\prime}}a)}^{\alpha },$$
(15)

where \(\alpha\) is the fuzzy hedge. Fuzzy hedge is an important feature in a fuzzy system that modifies the membership values. These are also called ‘modifiers’.

The two membership functions are then combined using Zadeh’s fuzzy t-conorm as follows [25]:

$$T\left({\mu }_{\mathrm{lower}}, {\mu }_{\mathrm{upper}}\right)=\mathrm{min}\left({\mu }_{lower}, {\mu }_{\mathrm{upper}}\right).$$
(16)

Though there are different types of t-conorm in literature, in this paper, the simplest form of fuzzy t-conorm, which is Zadeh’s t-conorm, is used.

In this method, after thorough working on images on trial basis, the value of \(\alpha =1.6\) is found suitable and is used in this work.

Clustering

The type 2 fuzzy image is then clustered to obtain a segmented image. In this paper, an intuitionistic fuzzy clustering algorithm [9] is used. A brief idea on fuzzy clustering is described.

Initially, a number of clusters are chosen. Then, membership values are randomly (initial guess) assigned to each pixel for being in the clusters. As we do not know the belongingness of each pixel in different clusters, initialization is required and this is done randomly [1]. Then, cluster centers are computed. To each pixel corresponding to each cluster center, distance between the cluster center and the pixel is computed and the membership matrix is updated. At each iteration, distance between the cluster center and the pixel is computed and membership matrix is updated and this is repeated until the difference between the updated membership matrix and previous matrix is less than \(\upvarepsilon\). Here, \(\upvarepsilon =0.004\) is taken. When the difference is \(\upvarepsilon\), then the last updated membership matrix is the final matrix and the cluster centers are the final cluster centers.

In this work, four fuzzy features are computed—pixel energy, mean, standard deviation, and gradient. These features are considered for obtaining clustered image almost accurately.

Pixel energy is computed as square of pixel intensity. It is also called uniformity. On squaring the pixel intensity, lesser pixel intensity values become less and the brighter pixel intensity values also become slightly less. In doing so, the darker region becomes uniformly darker, and brighter region also becomes uniformly brighter.

Gradient feature takes into account the Sobel operator [26]. It consists of two spatial masks that are convolved with the original image to calculate gradient magnitude.

Two Sobel convolution kernels in x and y directions are as follows:

$$K_{\mathrm x}=\begin{bmatrix}1&0&-1\\2&0&-2\\1&0&-1\end{bmatrix},\;K_y=\begin{bmatrix}1&2&1\\0&0&0\\-1&-2&-1\end{bmatrix}$$

Kernels are applied separately on the image and are combined together to compute the absolute magnitude of the gradient at each point. The gradient magnitude is given by

\(G=\sqrt{{{G}_{\mathrm{x}}}^{2}+{{G}_{\mathrm{y}}}^{2}}\), \({G}_{\mathrm{x}}\) and \({G}_{\mathrm{y}}\) are the gradients in x and y directions.

In our work, gradient feature at position \((i,j)\) is computed as

$${\mathrm{grad}}_{\mathrm{feat}} \left(i,j\right)=\mathrm{mean}\left(I\right)\times \mathrm{grad}\left(i,j\right)$$
(17)

where \(\mathrm{mean}\left(I\right)\) is the local mean value of the \(3\times 3\) neighborhood of the pixel at position \((i,j)\). \(\mathrm{grad}\left(i,j\right)\) is the image gradient value using the Sobel operator at the position \((i,j)\).

Final gradient feature is computed by normalizing Eq. (17) with the maximum value of \({\mathrm{grad}}_{\mathrm{feat}}\) which is written as

$${\mathrm{grad}}_{\mathrm{feat}\_\mathrm{final}} \left(i,j\right)=\frac{{\mathrm{grad}}_{\mathrm{feat}} \left(i,j\right)}{{\mathrm{grad}}_{\mathrm{feat}\_\mathrm{max}}},$$
(18)

where \({\mathrm{grad}}_{\mathrm{feat}\_\mathrm{max}}\) is the maximum value of \({\mathrm{grad}}_{\mathrm{feat}}\). Multiplying the gradient value with the mean value of the window makes the high gradient values slightly less that implies that the brightness is also less. Likewise, unwanted small gradient values also become less that implies that the brightness is also less. In doing so, the outline of the clustered region is clear and the clustered image looks better. If the edge brightness is more, then the image regions are not properly clustered.

Standard deviation is computed as \(\frac{1}{M\times N}{\sum }_{i=1}^{M}{\sum }_{j=1}^{N}{\left({x}_{\mathrm{ij}}-\mu \right)}^{2}\), where \(\mu\) is the mean of the image.

Intuitionistic fuzzy C means algorithm cluster the feature vectors by searching for local minima using the following objective function:

$$J=\sum_{i=1}^{c}\sum_{k=1}^{n}{u}_{\mathrm{ik}}^{^{\prime}\mathrm{m}}d{({x}_{\mathrm{k}}, {v}_{i})}^{2}+ \sum_{i=1}^{c}{\pi }_{\mathrm{i}}^{^{\prime}}{e}^{1-{\pi }_{\mathrm{i}}^{^{\prime}}},k\in \left[1,N\right]\text{ and } m=2$$
(19)

where \(d({x}_{\mathrm{k}}, {v}_{\mathrm{i}})\) is any distance measure between \({v}_{\mathrm{i}}\) (cluster center) of each region and \({x}_{\mathrm{k}}\) (points in the pattern), and \({u}_{\mathrm{ik}}\) is the membership value of \({k}^{\mathrm{th}}\) data (\({x}_{\mathrm{k}}\)) in \({i}^{\mathrm{th}}\) cluster. \(c\) is the number of clusters and \(n\) is the number of data points.

\(\sum_{i=1}^{c}{\pi }_{\mathrm{i}}^{^{\prime}}{e}^{1-{\pi }_{\mathrm{i}}^{^{\prime}}}\) is the intuitionistic fuzzy entropy, where \({\pi }_{\mathrm{i}}^{^{\prime}}=\frac{1}{N}\sum_{k=1}^{n}{\pi }_{\mathrm{ik}}\) \(k\in [1,N]\), \({\pi }_{\mathrm{ik}}\) is the hesitation degree of the \({k}^{\mathrm{th}}\) element in cluster ‘\(i\)’.

In this paper, we have used two distance functions separately instead of Euclidean distance—(i) a fuzzy exponential type distance function and (ii) an intuitionistic fuzzy divergence [27], in Eq. (14).

(i) Fuzzy exponential type distance function: We have used a fuzzy exponential distance function as

$$d\left({x}_{\mathrm{k}}, {v}_{\mathrm{i}}\right)=exp\left|{x}_{\mathrm{k}}-{v}_{\mathrm{i}}\right|-1.$$
(20)

If \(X\) is a universal set and \(P(X)\) is the class of all fuzzy sets in \(X\) and \(Q(X)\) is the class of all crisp set of \(X\), then the following criteria holds [28]:

(a) \(d\left(A,B\right)=d(B,A), \forall A,B\in P(X)\)

(b) \(d\left(A,A\right)=0\), \(\forall A\in P(X)\)

(c) \(d\left(D,{D}^{^{\prime}}\right)={max}_{\mathrm{A},\mathrm{B}\in \mathrm{F}}d\left(A,B\right)\), \(\forall D\in Q(X)\), \({D}^{^{\prime}}\) is the complement of \(D\)

(d) If \(A\subset B\subset C \forall A,B,C\in P(X), \mathrm{then} d\left(A,B\right)\le d\left(B,C\right), d(B,C)\le d(A,C)\)

(ii) Intuitionistic fuzzy divergence:

$$\begin{aligned}d\left({x}_{\mathrm{k}}, {v}_{\mathrm{i}}\right)= & 2 - ({1-x}_{\mathrm{k}}+{v}_{\mathrm{i}}).\mathrm{exp}\left({x}_{\mathrm{k}}-{v}_{\mathrm{i}}\right)\\&-\left(1-{v}_{\mathrm{i}}+{x}_{\mathrm{k}}\right).\mathrm{ exp}({v}_{\mathrm{i}}-{x}_{\mathrm{k}})\end{aligned}$$
(21)

Now, minimization of \(J\) is based on suitable selection of U (membership matrix) and \(v\) using an iterative process through the following equations:

$${u}_{\mathrm{ik}}=\frac{1}{\sum_{j=1}^{c}{\left[\frac{{d}_{\mathrm{ik}}^{2}}{{d}_{\mathrm{jk}}^{2}}\right]}^{\frac{1}{m-1}}}.$$
(22)

At each iteration, the membership function, \({u}_{\mathrm{ik}},\) is updated considering hesitation degree as follows:

$${u}_{\mathrm{ik}}^{{\prime}}= {u}_{\mathrm{ik}}+{\pi }_{\mathrm{ik}}$$
(23)

where

\({u}_{\mathrm{ik}}^{^{\prime}}\) denotes the intuitionistic fuzzy membership of the \({k}^{\mathrm{th}}\) data in \({i}^{\mathrm{th}}\) class.

The modified cluster center is written as

$${\nu }_{\mathrm{i}}^{^{\prime}}=\frac{\sum_{k=1}^{n}{u}_{\mathrm{ik}}^{^{\prime}}. {x}_{\mathrm{k}}}{\sum_{k=1}^{n}{u}_{\mathrm{ik}}^{^{\prime}}}, \forall i,k,i=\mathrm{1,2,3,}\dots ,c \text{ and }k=\mathrm{1,2,3,}\dots ,n$$
(24)

Using Eq. (24), the cluster center is updated, and simultaneously, the membership matrix is also updated.

It is to be noted that while updating the membership degree at each iteration, the hesitation degree is also considered.

Hes_degree = 1-membership degree − non-membership degree.

$$=1-\frac{[1+{\left(\lambda -1\right)}^{2}]{.\mu }_{\mathrm{mn}}}{1+{.\mu }_{\mathrm{mn}}{(\lambda -1)}^{2}}-\frac{1-{\mu }_{\mathrm{mn}}}{1+{2.{\mu }_{\mathrm{mn}}.(\lambda -1)}^{2}+{{\mu }_{\mathrm{mn}}.(\lambda -1)}^{4}}$$
(25)

In the membership matrix computation, the value of \(\lambda\) in Eq. (25) takes a different value rather than the value computed using Eq. (13) for obtaining an intuitionistic fuzzy image.

The choice of \(\lambda\) here is crucial. The plot of membership degree versus hesitation degree with different values of \(\lambda\) is shown in Fig. 2. It is observed from the figure that the plot is symmetric at \(\lambda \ge 1.1\), but at \(\lambda \ge 1.5\), the plot is not symmetric. In the algorithm, \(\lambda =1.3\) is selected. This is due to the fact that the hesitation degree or intuitionistic fuzzy index is very less, i.e., 0.0075 at \(\lambda =1.1\) whereas with \(\lambda =1.3\), the hesitation degree is 0.065.

Fig. 2
figure 2

Plot of membership function against hesitation degree: (a) λ = 1.1, (b) λ = 1.3, (c) λ = 1.5

At each iteration, the cluster center and the membership matrix are updated and the algorithm stops when the updated membership matrix and the previous matrix, i.e.,

\({\mathrm{max}}_{\mathrm{i}.\mathrm{k}}\left|{U}_{\mathrm{ik}}^{^{\prime}\mathrm{new}}-{U}_{\mathrm{ik}}^{^{\prime}\mathrm{prev}}\right|<\varepsilon\),\(\varepsilon\) is a user defined, and in this algorithm, \(\varepsilon\) is selected as \(\varepsilon =0.004\).

Clustered image, showing different types of regions including the region of interest, is obtained. Then, lesion/tumor/mass is segmented from the clustered image by removing the unwanted regions from the clustered image and a binary image is formed. As the ground truth image is a binary image, so we segmented the required region of interest and displayed it in binary form.

After segmentation, using morphological operator, i.e., an opening operation is done to remove unwanted small spots in the segmented image. The opening operation removes unwanted small objects (ones in binary images) in the foreground region.

It is to be noted that for comparing our proposed method with the existing methods, opening operation is also applied on all the existing methods to remove unwanted small spots in the segmented image to maintain uniformity (Fig. 3).

Fig. 3
figure 3

Model of the proposed method along with images

Algorithm:

  1. 1.

    Select an image.

  2. 2.

    Convert the image into an intuitionistic fuzzy image using a novel intuitionistic fuzzy generator.

  3. 3.

    Compute two levels using hesitation degree from Eq. (15).

  4. 4.

    Using Zadeh’s t-conorm, compute a modified membership function and create a new image using the modified membership function.

  5. 5.

    Compute four features of the new image.

  6. 6.

    Cluster the image using 4 fuzzy features and distance functions.

  7. 7.

    Modify the membership function in each iteration using hesitation degree, computed from the novel intuitionistic fuzzy generator.

  8. 8.

    Stop the clustering algorithm when the difference between the updated membership matrix and the previous matrix is less than \(\varepsilon\), \(\varepsilon =0.004.\)

  9. 9.

    Segment the lesion/tumor from the clustered image by removing the unwanted regions from the clusters.

  10. 10.

    Apply morphological opening operation to remove any unwanted small spots in the segmented image.

Results and Discussion

The test is performed on 25 images and these images are downloaded from the website. Results on 20 images are shown. The proposed method is compared with four methods—fuzzy clustering method (FCM) [1], intuitionistic fuzzy method IFCM-1 [11], a method by IFCM-2 [14], and Kernel fuzzy clustering by KFCM [2]. Image results on 4 images are shown.

It is observed that in Fig. 4, the proposed method shows the tumor region without any unwanted region and is almost similar to that of ground truth image. FCM can also segment the tumor without noise, but the segmented region is less when compared with the ground truth image.

Fig. 4
figure 4

(a) Original image, (b) proposed clustered image, (c) segmented image using IFCM-1, (d) segmented image using FCM, (e) segmented image using IFCM-2, (f) segmented image using KFCM, (g) proposed segmented image-exponential distance, (h) proposed segmented image-divergence, (i) ground truth image

Figure 5 shows a mammogram image with lesion. It is observed that as compared with the other existing methods, the tumor/lesion in the proposed methods is almost similar to that of ground truth image.

Fig. 5
figure 5

(a) Original image, (b) proposed clustered image, (c) segmented image using IFCM-1, (d) segmented image using FCM, (e) segmented image using IFCM-2, (f) segmented image using KFCM, (g) proposed segmented image-exponential distance, (h) proposed segmented image-divergence, (i) ground truth image

Figure 6 shows a breast tumor where the proposed two methods perform better than the other methods. FCM can also segment the tumor without noise and the segmented region is slightly less than the proposed method. The other methods contain unwanted regions.

Fig. 6
figure 6

(a) Original image, (b) proposed clustered image, (c) segmented image using IFCM-1, (d) segmented image using FCM, (e) segmented image using IFCM-2, (f) segmented image using KFCM, (g) proposed segmented image-exponential distance, (h) proposed segmented image-divergence, (i) ground truth image

Figure 7 shows that the proposed method using both the distance functions performs better than all the methods. IFCM-2 method does not contain any unwanted region but the segmented tumor region is less than that of the ground truth image. In other methods, there are many unwanted regions.

Fig. 7
figure 7

(a) Original image. (b) Proposed clustered image, (c) segmented image using IFCM-1, (d) segmented image using FCM, (e) segmented image using IFCM-2, (f) segmented image using KFCM, (g) proposed segmented image-exponential distance, (h) proposed segmented image-divergence, (i) ground truth image

Tables 1 and 2 show the misclassification error of the 20 images of the proposed method using two distance measures separately and the other methods.

Table 1 Percentage misclassification error of the proposed method using exponential distance function 1 with 4 existing methods
Table 2 Percentage misclassification error of the proposed method using divergence as distance function with 4 existing methods

Performance evaluation: In order to verify the performance of the segmented region of the proposed method and all other methods with respect to the ground-truth images, misclassification error is calculated. It is defined as [29]

$$\mathrm{Error}=\frac{|{B}_{ET}\cap {B}_{GT}|+|{F}_{ET}+{F}_{GT}|}{{B}_{GT}+{F}_{GT}}$$

where \({F}_{\mathrm{GT}}\) and \({B}_{\mathrm{GT}}\) denote the foreground and background area pixels of the ground truth image. \({F}_{\mathrm{ET}}\) and \({B}_{\mathrm{ET}}\) are the foreground and background area pixels of the segmented test image.

From Table 1, it is observed that in the proposed method using exponential distance function, the average misclassification error of 1.26%, i.e., percentage classification is 98.74%. Result of IFCM-1 shows an average misclassification error of 2.65%, i.e., percentage classification is 97.35%. FCM shows an average misclassification error of 2.77%, i.e., percentage classification is 97.33%. IFCM-2 shows an average misclassification error of 2.62%, i.e., percentage classification is 97.38%. KFCM shows an average misclassification error of 3.28%, i.e., percentage classification is 96.72%.

From Table 2, it is observed that the proposed method using divergence-based distance function also performs better but fails to perform better on one image, i.e., ‘mam-18’. It shows an average misclassification error of 1.58%, i.e., percentage classification is 98.42%. IFCM-1 and IFCM-2 show an average misclassification error of 2.65% and 2.62%, i.e., percentage classifications are 97.35% and 97.38%, respectively. So, the proposed method using divergence-based distance function also performs better than the 4 methods.

It is observed that on average, the proposed method using two types of distance function performs almost similar and better than the existing methods.

Conclusion

The paper suggests a modified intuitionistic fuzzy clustering approach for segmenting lesion/tumor in mammogram images. A novel intuitionistic fuzzy generator is used to compute the non-membership degree of the fuzzy image and an intuitionistic fuzzy image is created. From intuitionistic fuzzy membership function, two membership levels are computed using fuzzy hedge. These two membership levels are combined using Zadeh’s t norm and an interval type 2 fuzzy image is obtained. The image is then clustered in different regions where four fuzzy features are used. Two types of distance functions—fuzzy divergence and fuzzy exponential type distance functions—are used in the clustering algorithm. Further, in the membership matrix iteration, hesitation degree is taken into account that is computed from the novel intuitionistic fuzzy generator. The clustered image is then segmented to obtain lesion/tumor in the image. The proposed method is compared with the existing algorithms both quantitatively and qualitatively. It is observed that the proposed method can segment the tumor/lesions with better accuracy as compared with the existing methods.