Keywords

1 Introduction

Over 2000 years ago Hippocrates layered mud over a patient’s body. He noticed the mud dried quicker where the internal disease generated heat. Actually he understood the diagnostic significance of body temperature changes.

Infrared energy wavelength is extended from 700 nm to 1 mm which human eyes sensors are not able to detect the all range, so it is not visible. Infrared energy is a sector of the electromagnetic spectrum that one recognizes as heat. All objects with a temperature above absolute zero radiate infrared electromagnetic energy. Objects with higher temperature radiate larger infrared radiations. One infrared camera can capture the heat pattern emitted from the object.

Breast thermal imaging is a radiation-free updating screening procedure that captures heat pattern of the breast helping early detection of cancer. The infrared radiation from an individual as an outcome of surface temperature is related to the amount of blood circulation. As a tumor grows it develops a blood supply that emits more heat [1].

Breast thermal imaging is a harmless physiological examination that presents the chance of earlier detection of breast disease than has been attainable through breast self-examination, and other breast screening tests alone.

Breast cancer if detected early can be taken out and stopped. Once a woman who has cancer is too late to start early detection, but if identified early enough, it can be removed in initial phases before it metastasizes and spreads to the neighboring area.

Breast thermal imaging is a painless, noninvasive , fast, low cost, passive, updating clinical examination without any risk of radiation. All subjects with any age with any breast size can benefit of this technique. Also women with dense breast as well as nursing or pregnant women can use breast thermal imaging [26].

Lawson declared in 1956 that the surface temperature of cancerous breast was higher than the normal one. Therefore, Lawson claimed that the cancerous cells can be recognized as hot regions in thermograms [7]. In infrared pseudo-color images distinct colors show distinct rate of hotness. Hence, breast thermograms’ color segmentation can be beneficial in identifying the suspecting zones.

This study is arranged in this way: Pseudo-coloring of gray images is explained in Sect. 2. MS technique is discussed in Sect. 3. Numerical outcomes and discussions are given in Sect. 4 and finally the findings are achieved in Sect. 5.

2 Color Conversion of Gray-Scale Images

Pseudo-coloring or color conversion of gray-scale images is one of the possible means of upgrading digital image. The human eye is more sensitive to differences in color than to differences in shades of gray. It is a common task in image processing with the aim of upgrading visual quality, presenting a visual appeal and emphasizing specific features in the image.

Two limitations with gray-scale images which are related to specific characteristics of the individual discernable system lead to suggest the concept of pseudo-coloring. First, as the size of the feature composed of the alternate gray shade is reduced, a human’s ability to discern gray-shade differences in an image decreases. As the size of the feature decreases, it takes larger differences in gray-scale intensity for a person to be able to see the alternate shade of gray. Second, humans are only capable of discerning between 60 and 90 separate shades of gray, suggesting much of the information in a typical 256 gray-shade image is lost on the viewer. Furthermore, individuals are capable of discerning more than 500 shades of color [8, 9].

Although it is controversial that the viewer allocates greater knowledge by pseudo-coloring of gray images, thoroughly produced pseudo-color image indeed allows tumor identification that is as good as gray scale and enhances fulfillment of other jobs like recognition and interpretation of a tumor. In the interest of rehearsing a thoroughly produced pseudo-coloring which can keep the whole knowledge of grey-scale images and derives no deformity in images, uniform color space is suggested [10].

In a uniform color space, identical alterations in the component of color space correlate to identical noticeable alterations in color chroma and color hue angle. However, many elements cause some difficulties. Some of these elements are as follows: (1) the color of an area relies on neighborhood colors; (2) chromatic accommodation impacts are influenced by eye motions, because colors are a head or come after other colors in time; and (3) color discriminability evaluates on the angular stretch of the investigation area. The 1976 CIELAB color space is suggested by the International Commission on Illumination (CIE).

The space of CIELAB is shown in Fig. 1.

Fig. 1
figure 1

CIELAB color space [11]

It is an attempt to provide an intuitively uniform color space. In this color space, the amount of dissimilarities of the colors in terms of luminance, chroma, and hue can be nearly described by the two-point separation length. In addition, tristimulus values XYZ can obtain CIELAB coordinates (L*, a*, b*) as expressed by the formulas in Eq. (1) where the white point values are denoted by the subscript n [12, 13]:

$$\begin{aligned} L^{{^{*} }} & = 116g\left( {\frac{Y}{{Y_{n} }}} \right) - 16 \\ a^{{^{*} }} & = 500\left[ {g\left( {\frac{X}{{X_{n} }}} \right) - g\left( {\frac{Y}{{Y_{n} }}} \right)} \right] \\ b^{{^{*} }} & = 200\left[ {g\left( {\frac{Y}{{Y_{n} }}} \right) - g\left( {\frac{Z}{{Z_{n} }}} \right)} \right], \\ \end{aligned}$$
(1)

where

$$g(m) = m^{1/3} \quad m > \left( {\frac{6}{29}} \right)^{3}$$
(2)
$$g(m) = \frac{1}{3}\left( {\frac{29}{6}} \right)^{2} m + \frac{4}{29}\quad m \le \left( {\frac{6}{29}} \right)^{3} .$$
(3)

3 mean shift clustering (ms)

MS is a nonparametric technique that was introduced in 1975, [14]. This technique does not need knowing about the number of clustering and does not force any format of the clusters in advance [1518]. Supposing a set of feature vectors, y i (i = 1,…, n), be in a feature space of dimension d. The relation between the density of any point y in terms of the kernel density evaluator K(y) and the size of window h is obtained by Eq. (4):

$$f_{h,K} (y) = \frac{{c_{k,d} }}{{nh^{d} }}\sum\limits_{i = 1}^{n} {K\left( {\left\| {\frac{{y - y_{i} }}{h}} \right\|^{2} } \right)} .$$
(4)

The zero gradient of Eq. (4) produces Eq. (5)

$$= \frac{{2c_{k,d} }}{{nh^{(d + 2)} }}\sum\limits_{i = 1}^{n} {q\left( {\left\| {\frac{{y - y_{i} }}{h}} \right\|^{2} } \right)\left( {\frac{{\sum\nolimits_{i = 1}^{n} {y_{i} q\left( {\left\| {\frac{{y - y_{i} }}{h}} \right\|^{2} } \right)} }}{{\sum\nolimits_{i = 1}^{n} {q\left( {\left\| {\frac{{y - y_{i} }}{h}} \right\|^{2} } \right)} }} - y} \right)} ,$$
(5)

where \(q(s) = - K^{{\prime }} (s)\), supposing function q be the derivate of function K. Equation (5) is a product of two expressions. The first expression is proportional to the density estimate at y, and the MS vector which is the second expression is introduced by Eq. (6):

$$m_{h} (y) = \frac{{\sum\nolimits_{i = 1}^{n} {y_{i} q\left( {\left\| {\frac{{y - y_{i} }}{h}} \right\|^{2} } \right)} }}{{\sum\nolimits_{i = 1}^{n} {q\left( {\left\| {\frac{{y - y_{i} }}{h}} \right\|^{2} } \right)} }} - y.$$
(6)

It is noticed that the direction of the greatest amount of growth in the density is certainly pointed by the MS vector. The MS algorithm is demonstrated in Fig. 2. The red arrow displays the MS vector while the updated center of the cluster is indicated by the green circle. It is noteworthy to mention that the window size variable h is the only variable of the MS technique. It has remarkable impact on the acquired results.

Fig. 2
figure 2

Demonstration of mean shift algorithm ; the red arrow shows the MS vector; the green circle shows the updated center of the cluster

4 Experimental Results

In this study K-means [19] and FCM [20] were applied for color segmentation of breast thermograms [21, 22]. However, the three techniques K-means, MS, and FCM are applied and compared. Sixty breast thermal images accessible from five different breast thermography centers [2327] were investigated in this work.

These three techniques put all cases into practice . With K-means algorithm, almost empty clusters are appeared in some experiments. Besides, if K-means is redone some times, the outcomes for distinct experiments are non-identical in accordance with the drawbacks of the K-means technique [2831]. Figure 3b shows the experiment in which two almost empty clusters are obtained for a cancerous individual as shown in Fig. 3a. Nevertheless by performing the FCM technique (Fig. 3c), no empty cluster is obtained because fuzzy c-means technique measures the colors in a comparative way and categorize them in groups that are not with rigid borders. Consequently, data point is able to associate to more than one group although in hard clustering (K-means) data is collected to rigid groups suggesting each data point associates to particularly one group [3234]. Also performance of FCM for a normal individual (Fig. 4a) is demonstrated in Fig. 4b. MS technique is easily affected by parameter h, the window size. In this work, MS algorithm was performed for a normal case. Figure 4b shows the results for h = 12 with eleven empty clusters while Fig. 4c for h = 15 with four empty clusters.

Fig. 3
figure 3

a An inflammatory cancer breast thermogram. b Two almost empty clusters out of eight clusters obtained by K-means technique. c Eight non-empty clusters obtained by FCM clustering

Fig. 4
figure 4

a A normal breast thermogram. b Eleven almost empty clusters out of seventeen clusters obtained by MS with h = 12. c Four almost empty clusters out of eight clusters obtained by MS with h = 15

In this work, FCM technique has potential to identify the two first hottest areas by measuring their colors with respect to the provided palette spectrum colors. By color segmenting of breast thermograms , we are able to identify the two hottest clusters. Consequently providing some helpful data from the doubtful areas by measuring the color of the two hottest areas with the color of the provided palette spectrum is feasible.

5 Conclusion

Although it is controversial that with pseudo-colored gray images more data can be allowed to the observer, but precisely produced pseudo-color image indeed is capable of tumor identification that is equivalent to the gray scale and upgrades accomplishment of other jobs like perception and judgment of a tumor. In this research, three techniques for breast thermograms color segmentation were used. MS technique is easily affected by parameter h, the window size, and accordingly, in this study, we usually challenged empty clusters and selecting a suitable h was not straight forward. K-means technique reduces the sum of within cluster outcomes numerically and clusters are tight and fully divided and produce precise outcomes. Due to the K-means drawbacks, nevertheless, for many cases, K-means technique is not practicable where clusters are not separated and an element in an image may associate to various clusters. Choosing appropriate initial cluster centers is crucial for K-means clustering. Inadequate initial centers may produce empty clusters. In FCM technique, elements of the image are included in different clusters with disparate levels of association. By reason of the fuzzy nature of the breast thermograms, more faultless outcomes were resulted by FCM method to segment breast thermograms .

In this study, 60 breast thermograms were studied. By implementing FCM technique, we provided the two first hottest areas for each subject where convenient features are obtainable. There are some conveniences of breast thermograms color segmentation by FCM such as identification of irregular subjects by contrasting the analogous clusters from the contralateral breasts (over the symmetrical line), determination of level of malignity with identifying the two first hottest areas, and extraction of some useful features from them.