Keywords

1 Introduction

Retinal fundus images present different objects in the human retina. We sometimes face dangerous diseases such as hypertension, Diabetic Maculopathy (DM) and Diabetic retinopathy (DR), both of which come from the abnormal changes in the structure of these objects, especially in the retinal blood vessels. Evidence of this is some changes in vessel calibre caused hypertension [1, 2]. DR can be detected by the onset of neovascularization, being the early blindness symptom [3]. Arteriovenous nicking [4, 5] can cause a stroke.

These diseases are difficult to recognize in the early stage because of a lack of signs. These problems are presented clearly in the final stages. At that time, the treatment becomes more expensive and ineffective. Thus, retinal vessel change detection is essential to prevent the first step. As a result, this treatment becomes effective and cheap. The performance of medical diagnosis depends on accurate vessel segmentation because the accurate result consists of important clues contributing to the success of the following diagnosis processing.

Some different areas in the medical image are divided by the popular segmentation techniques based on colour, texture and intensity [6, 7]. The resulting image is helpful in identifying the objects. Other proper techniques are also used for segmenting medical images. Here they are threshold [8,9,10,11,12], combined both domain and threshold [13,14,15,16], deep learning [17,18,19,20,21], filter [22,23,24,25], clustering [26], etc. but the techniques have some drawbacks. Evidence of this, the techniques cannot work well in a few particular situations. For example, the retinal vessel image appears as a central light reflex or two vessels located nearby, or the vessels are very tiny, etc. To improve the segmented image quality, the limitations need to be handled.

Recently, researchers proposed some retinal blood vessel segmentation methods. However, their results have consisted of some limitations. For example, Staal [27] showed disconnected vessels. Soares [28] presented the vessels merging and the vessel splitting. Nguyen [29] proposed a method to segment the retinal blood vessel based on improving the salient region combined with the Sobel operator. The segmented images are low accuracy in the case of the vessel pixels with different saliency levels. Nguyen [30] introduced an effective method to segment the retinal blood vessels. However, the method also consists of some disadvantages. Both large vessels and tiny ones cannot be detected at the same time. Because the method only used one window size to classify a pixel belonging to the background or vessel. The window size contributes to the success of the line basic detection technique. Besides, with the tiny window size, a large vessel will divide into two tiny vessels. But, the two close tiny vessels will merge as larger vessels if the window size is set too large. Mustafa [31] introduced a retinal blood vessel segmentation technique better than the Kirsch templates' segmentation method. But, this method is not suitable for detecting tiny vessels.

Based on some of the mentioned limitations, an effective retinal blood vessel segmentation method is proposed to handle these drawbacks. This proposed method consists of two steps: applying STICT to enhance the source image quality and using MLD with different window sizes for vessel segmentation. Our proposed method is better than the current methods. The following sections consist of the background in Sect. 2, the proposed method in Sect. 3, the experiment results in Sect. 4 and the conclusion in Sect. 5.

2 Background

2.1 Translation Invariant Contourlet Transform

The contourlet transform includes two stages. The first stage is a Laplacian pyramid (LP) [32]. The second is a directional filter bank (DFB) [33]. In the first stage, an image is separated into some radial subbands plus an approximation image. In the second stage, the DFB is used for every detail subband where the highest number of directions are used for the finest subband, and to obtain the anisotropic scale rule of width a length, the number of directions levels is reduced at each other radial detail subband. Thus, the contourlet transform is realized by applying two stages of subsampled FBs to translate the invariant contourlet transform (TICT). The translation invariant (TI) schemes were developed to serve for translation invariant Laplacian pyramid (TILP) and translation invariant directional filter bank (TIDFB).

2.2 Line Detector

With a line detector [30], the image is divided into various channels: green, red and blue. The green channel is chosen to serve the line detector process. Because in the green channel, the vessels are brighter than the background. A window size W x W was applied to each pixel to calculate the grey level average at every pixel. The value is calculated as \(I_{avg}^W\). The angular resolution decides the number of lines that pass through the centre pixel. This value of each line is calculated by averaging the pixel value.

The winning line has the highest value, shown by \(I_{max}^W\). The line response of each pixel is calculated as Eq. (1):

$${R}_{w}={I}_{max}^{w}-{I}_{avg}^{w}$$
(1)

If a pixel has a high line response, the pixel will be the vessel pixel. The vessel consists of the pixels of the winning line. However, the pixel is considered a background pixel if its line response is low. The window size influences the performance of the method. This size of the window is double the calibre of the vessel.

This line detector technique is helpful for the light reflection situation. The value of the middle pixel is lower than the others in a large vessel. It makes the classification performance not high due to the tiny difference in the intensity values. But, the method found the pixels that belong to the vessel. A few central reflex pixels belong to the winning line. As a result, these pixels do not make the average value be dropped deeply. Therefore, the pixels are classified as vessel pixels thanks to the high response value. Moreover, the other pixels are considered background pixels thanks to applying the long-length lines. Although, the line detector technique is useful for retinal blood vessel segmentation. However, the technique cannot solve a few drawbacks, such as merging close vessels or dividing a large vessel into two smaller vessels.

3 Efficient Retinal Vessels Segmentation Based on EMLD

This part proposes an efficient retinal blood vessel segmentation method by applying EMLD. The generalized block diagram of the method is presented in Fig. 1.

Fig. 1.
figure 1

The diagram of the method.

Our method consists of two steps: denoising and segmenting the image. Each step will be presented in its subsection.

3.1 Denoising Image

In the first step, the original retinal image is converted into a grey image which is enhanced using STICT. This work should improve the input retinal image quality because this source retinal image usually consists of noise. This step can describe below:

  1. (i)

    This TILP is applied to partition the input retinal image into subbands. Each level of the TILP consists of four high-pass subbands.

  2. (ii)

    To decrease redundant creation by the redundant transform of the TILP. The critically sampled DFBs applied to the TILP in the same way it was used to TIDFBs to realize the TICT. The technique is semi translation invariant contourlet transform (STICT).

3.2 Segmenting Image

Recently, some researchers have proposed efficient medical image segmentation. Evidence of this, Nguyen [30] applied MLD to segment the vessels in the retinal image. According to the results, the technique is better than the primary line detection technique. However, this technique consists of some limitations. For example, the method cannot detect both large and small vessels. In particular, a few small vessels cannot be detected by the method because the method used only a window size during the detection process.

We, thus, proposed EMLD by applying the MLD with different window sizes. The basic line detector used to build the MLD. The line detector is computed by equation (2):

$${R}_{w}^{L}={I}_{max}^{L}-{I}_{avg}^{w}$$
(2)

where 1 ≤  L ≤  W, \({I}_{max}^{L}\) and \({I}_{avg}^{W}\) have the same meaning as the previous part. The values of L are controlled to implement the line detectors at various scales.

In our method, this maximum window size is applied to detect these large vessels, while the minimum window size is used to detect the small vessels. These window sizes are defined as Wmax and Wmin for the maximum window size and the minimum ones, respectively. Based on the vessel calibre, the window size will get a suitable value that approximates double vessel calibre.

It used Eq. (2) to detect both large and small vessels thanks to various suitable window sizes. Each window size has its benefits. With the Wmax, the method can avoid dividing a large vessel into two small vessels. Moreover, the Wmin applied to decrease some limitations when a pixel of the background is located at a particular position. In fact, the background noise will increase if the line length is too short. The line responses at various scales are combined to avoid the situation. It is evident there is a slight difference among the raw response values that come from the line detector at each scale. Consequently, it is difficult to distinguish between the background and the vessels because of tiny differences. To differentiate between the background and the vessels easier, the raw response image values were standardized to make the unit standard deviation distribution and zero mean as Eq. (3):

$${R}{^\prime}=\frac{R-{R}_{mean}}{{R}_{std}}$$
(3)

R′ plays a role as the standardized response value, R plays a role as the value of the raw response, Rmean plays a role as the mean and Rstd as the standard deviation of the raw response value. This standardization has an important role in the distribution of intensity values. This technique is necessary for the response images produced by line detectors at different scales because its result is used for the combination process.

In the combination process, each scale has the same weight. The segmentation result is made by combining the line responses with various scales. The response at each pixel is defined by Eq. (4):

$${R}_{combined}=\frac{1}{{n}_{L}+1}\left({\sum }_{L}{R}_{w}^{L}+{I}_{igc}\right)$$
(4)

where Iigc plays a role as the value of the inverted green channel at the corresponding pixel, nL plays a role as the scale number, and RwL plays a role as the line detector response at scale L. The green channel is integrated into the combination to distinguish the similarity between the blood vessels and other structures easier.

Lastly, the final segmented image combines the result images with different window sizes.

4 Experiment and Results

The DRIVE dataset [34] is used to evaluate the segmentation process. This dataset supplied some of the expert symptom assessment results that are considered standard to evaluate the results of the proposed method and the other methods.

The result of the segmentation process is made by classifying the pixel into a vessel or background. Therefore, in terms of the positive side, when a pixel is accurately classified into the vessel group, the event is considered as true positive (TP), and when a pixel is accurately classified into the background group, the event is considered true negative (TN). Regarding the negative side, when a pixel is incorrectly classified into the vessel group, the event is considered false positive (FP). When a pixel is incorrectly classified into the background group, the event is considered as false negative (FN).

The sensitivity and specificity are the statistical measures to evaluate the performance of methods [35]. The sensitivity is presented by Eq. (5), and the specificity is given by Eq. (6):

$$sensitivity=\frac{TP}{TP+FN}$$
(5)
$$specificity=\frac{TN}{TN+FP}$$
(6)

The accuracy is presented by Eq. (7):

$$accuracy=\frac{TP+TN}{P+N}$$
(7)

In the segmentation process, N is the sum of pixels that belong to the background, and P is the sum of pixels that belong to the vessel. The segmentation process's accuracy depends on the homogeneity level created by comparing the binary classification result and the ground truth. The segmentation method is considered effective when its accuracy is high.

In this study, the DRIVE dataset is used for the implementation; these ground truths vessels in the DRIVE dataset are used as a measurement standard to evaluate our method's and other methods’ performance [30, 31]. Our proposed method runs twenty images in DRIVE. Two typical cases will be presented below.

These results are shown in Figs. 2, 3, 4 and 5. Each figure will be presented below:

Fig. 2.
figure 2

The input image and enhanced image.

Fig. 3.
figure 3

The input image and enhanced image.

Figure 2a shows the input image, the result of the STICT technique shown in Fig. 2b. Moreover, Fig. 3a shows the input image, the result of the STICT technique shown in Fig. 3b. The technique should be done to enhance the quality of input image. It can make the input image reduce noise. This result is necessary for the next stage because the performance of EMLD will be low if the input image consists of noises. In other words, the line response of each pixel is calculated more accurately with the support of the STICT. Based on the value, a pixel will be considered a vessel pixel or background pixel.

These results showed in Figs. 4 and 5. Figure 4a shows the input image, our result showed in Fig. 4b, the result of Nguyen [30] showed in Fig. 4c, and the result of Mustafa [31] is shown in Fig. 4d. Furthermore, the Fig. 5a shows the input image, our result showed in Fig. 5b, the result of Nguyen [30] showed in Fig. 5c and the result of Mustafa [31] showed in Fig. 5d.

Fig. 4.
figure 4

The input image and segmented images.

Fig. 5.
figure 5

The input image and segmented images.

According to the Figs. 4 and 5. It is clear that the segmented images of our method are better than the segmented images of other methods because our proposed method detected more vessels, thanks to applying STICT to enhance the quality of input image and use EMLD to segment vessels.

Table 1 presents the performance measures, including sensitivity, specificity, and accuracy. These parameter values are shown in italics when their values get highest.

Table 1. The first retinal blood vessel image.
Table 2. The second retinal blood vessel image.

To take insight from Tables 1 and 2, the performance of our proposed method is the best in all parameters, including sensitivity, specificity and accuracy. There is a big difference in sensitivity and a small difference in specificity and accuracy.

Table 3. The average results of our proposed method and other segmentation methods.

All images in the DRIVE dataset serve to evaluate the average results that are made by applying our proposed method and recent segmentation methods. Based on the numbers in Table 3, we can easily recognize which method is the best in sensitivity, specificity, and accuracy. The average performance of our proposed method is the best in all measures.

5 Conclusions

With some dangerous diseases, the doctor needs to use segmented medical images that supply more clues related to medical problems. The paper proposes an efficient method to separate the blood vessels from retinal images. Our method includes two steps: apply the STICT to improve the quality of the source image and use MLD with different window sizes to detect vessels. This method can detect more vessels thanks to applying the two window sizes. This brings us to the conclusion that our retinal image segmentation method is more effective than recent ones according to computing these performance measures.