Introduction

Stroke is the second leading cause of disability, after dementia. Recent reports show that more than 15 million people globally suffer a stroke each year [1]. Stroke primarily occurs due to two reasons: (a) the blood supply to the brain is blocked due to stenosis in the lumen of the carotid artery or (b) the sudden rupture of a blood vessel within the brain [2]. The degree of luminal narrowing is considered as an indirect measure of the carotid atherosclerosis [3]. Accurate characterization of atherosclerotic luminal changes is therefore crucial in patient’s diagnosis. Figure 1 illustrates the plaque formation in the carotid artery.

Fig. 1
figure 1

Illustration of plaque formation leading to stenosis in the carotid artery (courtesy of AtheroPoint™, Roseville, CA, USA)

B-mode ultrasound imaging (US) is a non-invasive and inexpensive technique for the visualization and measurement of the carotid arterial morphology [47]. The lumen diameter (LD) and stenosis is calculated using manual methods where the doctor or sonographer can put caliper-based points at the near lumen-intima (LI) and the far LI corresponding to the proximal and distal ends of the carotid artery. The LD is then measured as the distance between these points [8]. Manual measurement of LD is tedious and has poor inter- and intra-observer agreement [9]. Development of faster hardware and innovative algorithms in the recent years has made it possible to automatically compute the LD [5, 10, 11]. Due to better image reconstruction tools in ultrasound imaging, such as compound and harmonic imaging, it is now possible to image the carotid artery in real time [8, 12]. However, clearly work needs to be done for accurate lumen segmentation. We believe that in clinical application, carotid LD should be equally more significant than carotid intima-media thickness (IMT) [11]. This is because the plaque develops on both the sides (proximal and distal) of the carotid walls and carotid LD truly reflects a more comprehensive indicator of coronary artery disease (CAD) risk compared to carotid IMT.

Automated image-based methods for carotid lumen segmentation from US images are challenging due to the variability in data sets, such as: shape and size of carotid artery, occlusions caused by multi-focal plaques, arterial curvature, orientation of scanning probe, gain control during the acquisition, type of the transducer (linear vs. phased array), scanning protocol, and frequency of operation [5, 8, 11, 13, 14]. Furthermore, the non-uniform plaque growth at the interface of lumen and walls creates a challenge in the segmentation of the carotid artery because the image contrast depends on the composition and grade of the plaque [13]. Speckle noise level is another factor which affects the segmentation accuracy [5].

Several techniques have been attempted for carotid lumen segmentation, lumen boundary estimation, and lumen diameter measurement. We have classified these techniques into three categories [10, 15, 16] as: region-based [17, 18], boundary-based [1922] or fusion-based [6].

Region-based techniques are not much explored in literature. In [17], the authors followed a global pixel classification strategy using the row-wise intensity distribution model from histograms to identify the lumen region from the carotid artery images. The main advantage of region-based strategy is that it is independent of gradient information which makes the segmentation robust to cases in which gaps exist in the boundaries of the carotid arteries. The major contribution of this paper is a region-based algorithm which uses the hypothesis that the blood has a constant density [23].

The boundary-based approaches or deformable models either follow parametric curves (traditional snakes) or geometric curves (level set) [2428]. Manual initialization is required for snake based methods in a local neighborhood of the borders of the carotid artery. Moreover, since snake-based approaches rely on intensity gradients, they are often susceptible to false edges, and are sensitive to speckle noise and to gaps within the lumen borders. Even though few authors succeeded in carotid lumen segmentation, they provided no discussion about the quantification of LD.

Golemati et al. [19] used Hough’s transform (HT) for the automated carotid lumen segmentation based on the assumption that straight lines (longitudinal images) and circles (transverse images) can be used for approximating the carotid lumen. However, this assumption may not be always valid, as the shape and orientation of the carotid arteries may vary depending on the acquisition procedure [6]. Further, Golemati et al. [19] suggests the use of active contours in combination with the Hough’s transform technique for diseased arteries, where the arterial boundary departs from a simple geometrical shape. The aforementioned facts motivated us to develop a novel algorithm for carotid artery segmentation and LD measurement by combining the Hough’s transform with the level set approach. We use this method for benchmarking our region-based strategy.

This paper presents two different algorithms for automated segmentation of the lumen borders in carotid ultrasound scans. Algorithm-1 uses the region-based strategy, which is the key contribution of this study, while the algorithm-2 (boundary-based strategy) is presented for benchmarking. Both techniques are modeled as a two stage process: (i) global shape segmentation (or stage-I) for extraction of region of interest, and (ii) local shape extraction (or stage-II) of lumen interfaces (near LI and far LI corresponding to the near (proximal) and far (distal) walls of the carotid scans).

The layout of the paper is as follows: Materials and Methods provide background information on the material used for this study and explain the automated lumen segmentation algorithms in detail. Results section presents the experimental results on two different databases including performance evaluation of the two techniques. Discussion section presents the discussion on the results and the paper concludes in Conclusion.

Materials and methods

Patient demographics and data acquisition

Japan database (JDB)

Two hundred and two patients left and right common carotid artery (CCA) (404 Images) B-mode ultrasound images were retrospectively analysed (ethics approval and IRB granted, Toho University, Japan). Informed consent was obtained from all individual participants included in the study. There were 155 males (76.7 %) and 47 females (23.3 %) with mean age 67 and 75 years, respectively. These patients had a mean HbA1c of 6.28 ± 1.1 (mg/dl), Glucose 108 ± 31 (mg/dL), LDL cholesterol 101.27 ± 31.6 (mg/dl), HDL cholesterol of 50.26 ± 14.8 (mg/dl) and total cholesterol of 175.04 ± 38 (mg/dl). Two trained observers (Neuro radiologists with 10 years of experience) manually traced the lumen diameter borders.

The ultrasound scanner (Aplio XV, Aplio XG, Xario, Toshiba, Inc., Tokyo, Japan) used to examine the carotid arteries, was equipped with a 7.5-MHz linear array transducer. Same experienced sonographer (16 years of experience) had performed all the scans. The data were acquired in a time period ranging from July 2009 to August 2010. In this database, the vertical resolution was 0.05 ± 0.01 mm/pixel.

Hong Kong database (HKDB)

There were 50 subjects with a total of 300 images (6 ultrasound scans per patient, 3 scans each side of the neck showing three different acquisition scans: anterior, anterolateral and posterolateral approaches with simultaneous ECG gating). These subjects included asymptomatic postmenopausal Chinese women whose age ranged from 54 to 67 years. Informed consent was obtained from all individual participants included in the study. The mean age was 60.2 years. Of these 50 females (mixed pool), 28 females were abnormal (diseased) having following distribution: one was diabetic, three were hypertensive, 15 had hypercholesterolemia, seven had both hypertension and hyper cholestolemia, and two had all three abnormalities. The 22 normal (controls) females had normal blood pressure, total cholesterol and glucose levels in fasting blood. The carotid arteries were examined using a 13.5 MHz linear transducer of Sonoline Antares (Siemens, USA) ultrasound scanner. Each scan was digitally captured for 10 s and the digital dynamic clips of the six segments were archived for off-line analysis with the software Syngo.

Two trained observers (neuroradiologists with 10 years of experience) manually traced the lumen diameter borders in both the datasets using ImgTracer™ (AtheroPoint™, USA), a user-friendly commercial software [29] The observers have to choose 15 to 25 edge points proximal to the bulb depending upon the length of the carotid artery and the output will be ordered set of traced (x,y) coordinates.

Automated lumen segmentation system

We have modeled the segmentation algorithm as a two stage process. The motivation of the stage-I strategy comes from the fundamental reconstruction of the ultrasound image, where the distal (far) wall of the carotid artery has the brightest region [30]. This taps the information to build the global shape using scale-space framework [31, 32] combined with spectral analysis [33]. We analyze each column of the image pixels from bottom to top and identify the peaks and then, select the pair of peaks with high intensity value separating a pool of very small intensity values. These pair of peaks corresponds to media-adventitia (MA) borders. The region between the two MA borders is the region-of-interest (ROI). The output of the global shape is the ROI in which the local proximal and distal walls will be searched for lumen delineation. Figure 2 shows the region of interest estimation of the global-shape extraction process.

Fig. 2
figure 2

Overall block diagram of the lumen region segmentation. Region based approach uses a classifier whereas in the boundary based approach level set is employed to extract the lumen region

The local-based processing is modelled into two sets of competing methods: (a) region-based and (b) boundary-based, where each technique when combined with the global-shape method yields its own independent method. Both regional and boundary-based models are shown in local stages (stage-II) in Fig. 2.

  1. a.

    Region Based (Classification) Technique for Lumen Segmentation

The region-based model captures lumen intensities under the constant blood density assumption [34]. This motivates the use of a statistical-based classifier such as K-means classifier [33, 35]. The ROI region is first captured using the combination of scale-space with spectral analysis (global shape method). This ROI has three regions: (a) low intensity lumen region, (b) high intensity or brightest adventitia region and (c) medium intensity plaque region. Thus we adapt a K-mean classifier with three pre-defined classes. On application of K-mean classifier, which yield three regions and lumen region is selected and enveloped as the largest region in binary form. The proximal and distal lumen morphologic borders are then delineated which constitute the borders of the lumen. Finally, the LD is estimated using the Polyline distance metric [36, 37].

The overall work flow of region-based approach is illustrated for a single image in Fig. 3. Figure 3(a) shows the cropped image using the automated cropping technique. Figure 3(b) shows the points captured using the spectral analysis. Figure 3(c) shows the spline fitted adventitial borders (both near and far). Figure 3(d) shows the ROI inside the rectangle (global shape extraction). Figure 3(e) shows the binary lumen from the K-means classifier. Figure 3(f) shows the final region-based LD borders after spline fitting (local shape extraction).

  1. b.

    Boundary Based Approach (Level Set Based) for Lumen Segmentation

Fig. 3
figure 3

Results of different algorithmic steps of region-based technique on a single image. a Cropped image, b bright adventitial border points detected using spectral analysis, c Spline fitted on these points to show the adventitial borders, d the resulting ROI is marked with a rectangle extracted using global-shape extraction system, e binary lumen obtained from the classifier in the ROI using local region-based system, (f) final lumen borders after smoothing and spline fitting

The fundamental assumption in the boundary-based approach is the ability to track the walls of the interfaces during the cardiac cycle. Though, we are not tracking walls of the interfaces in this scope of study, but the boundary-based approach originates from the idea that edge-model can be adapted for tracking such wall borders. These edge borders can be parametric or geometric when tracked. We model the interface detection using geometric models-based so-called level sets [17, 38, 39]. The elementary idea is to represent the boundaries of the carotid artery using a level set function, the evolution of which is controlled by some partial differential equations (PDEs) [39]. The key advantage of level set is the ability to follow the morphologic edges of the interfaces. However, the challenge in the level set is that where should be the starting interface before the push-pull can stretch the geometric curves to the stable position. Several techniques can be used to compute the initial edges before level set can be applied. We adapted a simple method keeping the assumption that lumen region has nearly constant density. We need something which can approximate the border of the interfaces which can then be acted as a bounding box when using level set. This approximation can be achieved using Hough’s transform. The Hough’s transform will give small line segments which will constitute the area within which the level set can explore. Hence, the level set evolution is limited by the borders obtained from the Hough’s transform.

The level set method used in this paper is based on the general DRLSE (Distance Regularized Level set Evolution) technique proposed by Li et al. [40]. In DRLSE, the regularity of the level set function is intrinsically maintained during the level set evolution which is derived as the gradient flow and minimizes certain energy functional. The advantage of using DRLSE method is that the distance regularization effect eliminates the need for re-initialization of the level set function and thereby avoids its induced numerical errors. The complete levels set formulation (LSF) is explained below. Let I be an image on a domain Ω. Then the edge indicator function g can be defined as [40]:

$$ g\triangleq \frac{1}{1+{\left|\nabla {G}_{\sigma }*I\right|}^2} $$
(1)

where, ∇G σ is a Gaussian kernel with a standard deviation σ. The convolution in Eq. (1) is used to smooth the image to reduce the noise. For an LSF :Ω → Ʀ, we define an energy functional ɛ( Φ) by [40]:

$$ \upvarepsilon \left(\varPhi \right)=\mu {R}_p\left(\varPhi \right)+\lambda {L}_g\left(\varPhi \right)+\alpha {A}_g\left(\varPhi \right) $$
(2)

where, μ is the coefficient of the distance regularization term R p (Φ). λ > 0 and α ∈ Ʀ are the coefficients of the energy functional L g (Φ) and A g (Φ), which are defined by [40]:

$$ {L}_g\left(\varPhi \right)\triangleq {\displaystyle \underset{\varOmega }{\int }}g\delta \left(\varPhi \right)\left|\nabla \varPhi \right|dx $$
(3)
$$ {A}_g\left(\varPhi \right)\triangleq {\displaystyle \underset{\varOmega }{\int }}gH\left(-\varPhi \right)dx $$
(4)

where, δ and H are the Dirac delta function and the Heaviside function, respectively.

The level set evolution in Eq. (2) can be implemented by using the finite difference method provided in [40]. The iteration process used in the numerical implementation of DRLSE can be expressed as [40]:

$$ {\varPhi}_{\mathrm{i},\mathrm{j}}^{\mathrm{k}+1}={\varPhi}_{\mathrm{i},\mathrm{j}}^{\mathrm{k}}+\varDelta tL\left({\varPhi}_{\mathrm{i},\mathrm{j}}^{\mathrm{k}}\right),\kern4em k=0,1,2\dots . $$
(5)

where the time dependent LSF Φ(x, y, t) is given in discretized form \( {\varPhi}_{\mathrm{i},\mathrm{j}}^{\mathrm{k}} \) with spatial index (i, j) and temporal index k. \( L\left({\varPhi}_{\mathrm{i},\mathrm{j}}^{\mathrm{k}}\right) \) is the approximation of the right hand side in the evolution equations and ∆t is the time step. The parameters used in the general DRLSE level set formulation [40] are given in the Table 1, choice of which was kept same for experiments on images in both the databases.

Table 1 Parameters used for the boundary-based (level set) segmentation

Figure 4 shows different algorithmic steps of boundary-based technique illustrated on a single image. Hough’s transform is used as an initialization step before applying level set. After obtaining the ROI, Hough’s transform is used to approximate the lumen borders using straight lines. The Hough’s transform will give small line segments which will constitute the boundary for the level set to explore (see Fig. 4c). This will further increases the accuracy of the level set technique by avoiding the bleeding through the patches in the lumen interfaces.

Fig. 4
figure 4

Results of different algorithmic steps of boundary-based technique on a single image. a ROI obtained using scale-space based global-shape extraction system; b Hough transform approximation of straight line borders; c lumen borders using the level set technique (local shape extraction); d final lumen borders after spline fitting

Performance metrics

To evaluate the performance of our automated system, we have used the following statistical measures. Mean error and standard deviation shows the error and variation between the automated lumen diameter and manually traced diameter. Precision of merit (PoM) will provide similarity of mean automated lumen diameter with respect to the mean manual lumen diameter. Coefficient of correlation (CC) will give an idea about the closeness of the automated measurements with the manual expert tracings. The Bland-Altman plots will help in accurately determining the data distribution. Dice similarity (DS) and Jaccard index (JI) provides the relationship between the areas of manual and automated lumen segmentation regions. Further, T-test, Mann-Whitney test, ANOVA test and Chi-square test are performed to prove the statistical significance of the results. Statistical analysis was performed with the MedCalc 15.0 software (MedCalc, Belgium).

Results

Japan database

Table 2 compares automated lumen diameter against manual readings. This shows the closeness of the automated method against manual readings.

Table 2 Automated and manual mean lumen diameters

Figure 5 visually compares the results of both the methods. Left side shows the result of region-based (proposed) method and right side shows the result of boundary-based method. It can be seen that the region-based method outperforms the boundary-based technique.

Fig. 5
figure 5

a and b shows the visual comparison of the results using region-based and boundary-based methods. Left side shows the result of region-based (proposed) method and right side shows the result of boundary-based method. Auto lumen borders are marked in solid white, while manual lumen borders are shown in dashed white

HK database

Table 3 compares automated lumen diameter against manual readings. This shows the closeness of the automated method against the manual readings.

Table 3 Automated and manual mean lumen diameters

Figure 6 visually compares the results of both the methods. Left side shows the result of region-based (proposed) method and right side shows the result of boundary-based method.

Fig. 6
figure 6

a and b shows the visual comparison of the results of both the methods. Left side shows the result of region-based method and right side shows the result of boundary-based method. Auto lumen borders are marked in solid white, while manual lumen borders are shown in dashed white

Performance evaluation

Japan database

Figure 7 shows the scatter plot of automated LD against manual LD. The coefficient of correlation (CC) between the Auto LD and the two manual tracings for the region-based techniques are 0.91 (p < 0.0001) and 0.92 (p < 0.0001), respectively. Similarly, the CC between the Auto LD and the two manual LD’s for the boundary-based techniques are 0.86 (p < 0.0001) and 0.85 (p < 0.0001), respectively. The Bland-Altman plots given in Fig. 8 give a much clear picture of the data including outliers. Cumulative frequencies of signed and unsigned Auto LD errors against manual tracings were computed. The unsigned cumulative frequency shows the total number of measurements with less than a particular error value irrespective of the sign (positive or negative). It was found that above 90 % of the LD measurements were within 1 mm error compared to the manual tracings for the region-based technique, whereas only around 80 % of the LD measurements fell within the 1 mm error range for the boundary-based technique. Further, the maximum error value did not exceed 2 mm in any of the cases. The cumulative frequency is an important performance statistic which refers to the error in the automated system.

Fig. 7
figure 7

a and b shows the scatter diagram for Auto LD against Manual-1 LD for the region-based and boundary-based techniques, respectively. Similarly, c and d shows the scatter diagram for Auto LD against Manual-2 LD for the region-based and boundary-based techniques, respectively

Fig. 8
figure 8

a and b shows the Bland-Altman plots for Auto LD against Manual-1 LD for the region-based and boundary-based techniques, respectively. Similarly, c and d shows the Bland-Altman plots for Auto LD against Manual-2 LD for the region-based and boundary-based techniques, respectively

Mann–Whitney test

The Mann–Whitney test is a non-parametric statistical test of the null hypothesis. This test is performed to determine whether or not two samples come from the same population. The analysis of the automated and manual LD variables was performed using Mann-Whitney test for independent samples. Tables 7 and 8 in appendix shows detailed Mann-Whitney test results for region-based and boundary-based techniques, respectively.

T- test

T-test was performed for independent samples to compare the means of two independent observations (automated and manual measurements). The null hypothesis is that the difference between the means of automated and manual measurements is equal to 0. Tables 9 and 10 shows the results of T test for both the methods. Since the calculated P-values are greater than the conventional 0.05, the null hypothesis cannot be rejected and the conclusion is that the two means do not differ significantly.

ANOVA test

ANOVA (one-way analysis of variance) is used to test the difference between the means of automated and manual measurements. The null hypothesis is that both the automated and the manual measurements have the same mean. We have performed the test at level of significance, 0.05. If, P > 0.05, we accept the null hypothesis or else, we reject the null hypothesis. The results of ANOVA test are given in Tables 11 and 12. Since the calculated P-value for LD is greater than 0.05, there is a statistically significant relationship between the auto (Auto LD) and the manual measurements (Manual-1 LD and Manual-2 LD).

Chi-Square test

Chi-Square test is performed to analyses the difference between the automated and the manual measurements. The detailed test results are given in Tables 13 and 14 in appendix section. Since, the test results are not significant at P < 0.05; the automated measurements (Auto LD) are not significantly different from the respective manual measurements.

Dice similarity and Jaccard index

Table 4 shows the relation of lumen area according to Dice similarity and Jaccard index for both the methods. If both regions are equal, dice similarity is 1 or 100 %. Here, the Dice similarity of both the readings is closer to 100. From the table, the lumen area calculated by automated method is close to manual segmentation area, and Manual-1 shows slightly higher similarity compared to Manual-2. Further, when considering Jaccard index, Manual-1 shows higher similarity than Manual-2.

Table 4 Dice similarity and Jaccard index

HK database

Figure 9 shows the scatter plot of automated lumen diameter against manual lumen diameters. The coefficient of correlation (CC) between the Auto LD and the two manual tracings for the region-based techniques are 0.94 (p < 0.0001) and 0.93 (p < 0.0001), respectively. Similarly, the CC between the Auto LD and the two manual tracings for the boundary-based techniques are 0.84 (p < 0.0001) and 0.83 (p < 0.0003), respectively. The Bland-Altman plots given in Fig. 10 will give a much clear picture of the data including outliers.

Fig. 9
figure 9

a and b shows the scatter diagram for Auto LD against Manual-1 LD for the region-based and boundary-based techniques, respectively. Similarly, c and d shows the scatter diagram for Auto LD against Manual-2 LD for the region-based and boundary-based techniques, respectively

Fig. 10
figure 10

a and b shows the Bland-Altman plots for Auto LD against Manual-1 LD for the region-based and boundary-based techniques, respectively. Similarly, c and d shows the Bland-Altman plots for Auto LD against Manual-2 LD for the region-based and boundary-based techniques, respectively

The cumulative frequency is an important performance statistics which refers the intensity of error in the automated system. Cumulative frequencies of signed and unsigned Auto LD errors against manual tracings were computed. It is found that above 90 % of the LD measurements were within 1 mm error compared to the manual tracings for the region-based technique, whereas only around 80 % of the LD measurements fall within the 1 mm error range for the boundary-based technique. Further, 100 % of the LD measurements fall within the 2 mm range for both the methods.

Mann-Whitney test

The analysis of the automated and manual LD variables was performed using Mann-Whitney test for independent sample. Tables 15 and 16 in appendix show Mann-Whitney test results for region-based and boundary-based techniques, respectively.

T- test

In order to check for differences between the manual and automated measurement, T-test was performed for independent samples. Tables 17 and 18 shows the results of T test for both the methods.

ANOVA test

We have performed the ANOVA test at level of significance, 0.05. If, P > 0.05, we accept the null hypothesis or else, we reject the null hypothesis. The results of ANOVA test are given in Tables 19 and 20. Since the calculated P-value for LD is greater than 0.05, there is a statistically significant relationship between the auto (Auto LD) and the manual measurements (Manual-1 LD and Manual-2 LD).

Chi-Square test

The detailed Chi-Square test results are given in Tables 21 and 22. Since, the test results are not significant at P < 0.05; the automated measurements (Auto LD) are not significantly different from the respective manual measurements.

Dice similarity and Jaccard index

Table 5 shows the relation of lumen area according to Dice similarity and Jaccard index for both the methods. From the table, the lumen area calculated by automated method is close to manual segmentation area, and Manual-1 shows slightly higher similarity compared to Manual-2. Further, when considering Jaccard index, Manual-1 shows higher similarity than Manual-2.

Table 5 Dice similarity and Jaccard index

Discussion

Automated system

The objective of this research was to design and develop two different algorithms for automated segmentation of the lumen borders from B-mode carotid ultrasound images. Performance of both algorithms was compared against the manual expert tracings. The system-based approach is novel and provides reliable, accurate, and validated clinical tool for lumen segmentation. It has a sound theoretical foundation and is driven by the requirements of the physicians and vascular radiologists which is an integral part of the clinical tool design. The main requirement is that the system must be fully automated from start to end and this is one of the reasons, we had to validate our system extensively with variability in data set and against the ground truth traced by the physicians.

We derived our concepts from the physics of ultrasound which yields the role of image reconstruction. Since the physics of ultrasound dictates that the far wall intensity to be highest with current ultrasound gain, frequency of attenuation, compound and harmonic imaging; we thus adapted a signal-based approach for detecting the location of the far wall in the B-mode ultrasound image [41]. The second aspect is that the anatomic structures which need to be segmented do not change on an average basis over the entire population. The scale-space concept to capture variable wall thickness during lumen segmentation is adapted for the first time. Further, the concept of capturing the global shape using the physics of ultrasound and spectral analysis prevents erroneous capture of walls such as jugular vein and other muscles which are floating around in the ultrasound image. Our database had variability on the wall thickness since our database consisted of women’s data from Hong Kong and diabetic patients from Japan. By adjusting the σ value, we were able to capture the far wall adventitia region accurately. Figure 11 illustrates the variation of carotid artery images in the Japan database across different patients. Auto LD tracings using region-based technique on these images are given along with manual expert tracings.

Fig. 11
figure 11

Illustrating the variation of carotid artery images across different patients in Japan database. a Curved lumen diameter (LD) borders; b High plaque deposit and narrowing of carotid artery; c Low contrast image with jugular vein interference. d Image having poor contrast difference at near wall. The solid white indicates the automated LD using region-based technique and the dashed white indicates the manual expert tracings

One of the features which our system is based on is the hemodynamics of the fluid flow in the artery that is the blood. Sousa et al. [42] studied the link between hemodynamic behavior and stenosis pathophysiology based on clinical US examinations, and provided estimates of flow changes and forces at the carotid artery walls. In the current study, the concept of blood flow class constancy is used where blood flow is assumed to have fixed class intensity with least variation [43]. Using this property, we picked the flow pixels using pixel-based regional information under the statistical classifier framework. The interesting part is that we have carefully and intelligently applied these paradigms to avoid picking shadows due to calcium present in the near wall or poor image resolution in the lumen region.

We have adapted a mimicking approach which can mimic the calliper points along the lumen-intima interface (for near and far walls), and this will automatically give the LD. This mimicking approach can be best captured when the edges of the lumen-intima are captured. Since the lumen-interfaces have strong gradient edges in the presence of darker blood flow in ultrasound images, we need a model which can clamp the edges and follow the morphology of the plaque build-up. Thus we need a model which can capture both the morphology of the atherosclerotic disease and clamp the edges where it is supposed to be. The pioneering work of level set which culminated to capture the edges and morphology of the disease and we had extended such a model (so-called boundary based model) with improved characteristics both in terms of speed and complexity. This gives another novel framework for lumen segmentation and was utilized under benchmark paradigm. Note that, two objectives were met during this transition: (a) benchmarking process to compare regional-based approach against boundary-based approach and (b) provide a solid foundation to establish another system-based paradigm.

The experimental results on two different datasets illustrated the superior performance of the region-based strategy over the boundary-based strategy. This is due to the fact that, unlike region-based methods, boundary-based algorithms rely much on edges or image gradients. One explanation is that boundary-based methods are more sensitive to changes in gradient information at the edges of the wall during the cardiac cycle [44]. It is during this cardiac cycle, the image is frozen and processed for LD estimation. This sensitivity can be compensated for by taking neighbouring frames into consideration which is beyond the scope of the study. The second explanation for higher error is that atherosclerotic disease is multifocal and the plaque variations can be non-uniform along the carotid artery. Boundary-based techniques are sensitive to sudden changes in the variations of the grayscale intensity distribution along the common carotid artery, and will therefore yield less optimal results. This can be compensated for by taking regional statistics embedded with boundary-based information [15, 16].

One of the key applications of LD measurement is to understand the correlation between LD measurement and plaque score. Recently Saba et al. [45] has performed a comparative study of correlation between the carotid inter-adventitial diameter (IAD)/LD and the plaque score to predict the risk of stroke. Even though the plaque score has been correlated to IMT before [46], the novelty in this paper is the correlation of LD with the plaque score. The study by Saba et al. [45] used the scale-space based automated lumen diameter measurement system to accurately measure the carotid LD/IAD. However, the objective of the present study is completely different as we are comparing the performance of two different automated LD measurement systems.

Brief survey and benchmarking

Accurate knowledge and understanding of the geometry of the carotid arteries is important in their segmentation. Several automated algorithms have been proposed for the segmentation of the CCA, but majority of them are focused on the segmentation and measurement of the IMT. However, we have found that some authors pointed out the need for automated measurement of carotid lumen diameter [4749]. The lumen segmentation algorithms have been classified into regional-based vs. boundary-based: region-based [17, 18], boundary-based [1922]. Here, we describe each of these techniques and their limitations and discuss how they validated the measured parameters. Table 6 shows a comparative analysis of the carotid lumen segmentation techniques.

Table 6 Comparative analysis of the carotid lumen segmentation techniques

Golemati et al. [19] used a Hough transform for the automated lumen segmentation based on the assumption that the B-mode ultrasound images of longitudinal and transverse sections can be approximated using straight lines and circles, respectively. Though, they achieved a systolic LD accuracy of 98 % and diastolic LD accuracy of 97 %, this assumption may not be always valid, as the shape and orientation of the carotid arteries in the ultrasound scan may vary depending on the acquisition procedure. The authors tested their algorithm on just 10 subjects and the major weakness is the assumption that the vessels are straight which is not always true.

Molinari et al. [41] developed a completely automated IMT measurement system called carotid measurement using dual snakes (CMUDS): a class of AtheroEdge system (Global Biomedical Technologies, Inc., Roseville, CA). CMUDS was modeled as a dual parametric system corresponding to LI and MA borders with initialization from the far adventitia layer. The novelty of CMUDS was the first-order absolute moment–based external energy, which provided stable deformation. This system was evaluated on a multi-ethnic database of 665 longitudinal images and had achieved a success rate of 99.2 %. In another approach proposed by Molinari et al. [7], they proposed an automated method based on the assumption that the highest pixel intensities are in the far wall of the CCA. This methodology is patented which consisted of generating the composite image by registration. They used level-set-based algorithm followed by morphological image processing for lumen segmentation and the technique was evaluated on 200 CCA images. Since the above two methods were proposed for IMT measurement, we cannot compare their performance with the proposed technique.

The segmentation of carotid lumen region has been carried out by Santos et al. [17] based on binary image morphology, which utilized hypoechogenic characteristics of the lumen. This work also included a denoising step using an anisotropic diffusion filter to remove the speckle noise. The ROI was estimated using the method mentioned in the Golemati et al. [19]. They compared the area overlap between the auto and manual segmentation on 11 longitudinal US images and showed a mean area overlap of 96.78 %. Threshold-based region detection is always susceptible to false region estimation if the images are noisy due to low resolution or hyperechoic characteristics or shadows due to calcium in near wall. Further, the morphological processing operations are sensitive to noise and hence are not very stable.

The study by Loizou et al. [20] was concentrated on bifurcation region segmentation and diameter estimation through speckle reduction filtering and snake-based segmentation. This algorithm faced the challenge of initialization of snakes and did not give a high value of coefficient of correlation between their method and manual reading (CC = 0.63). Further, this algorithm was evaluated on just 20 images. Yang et al. [21] used an active shape-based model to segment both media-adventitia (MA) boundary and lumen-intima (LI) boundary on transverse views slices from three-dimensional ultrasound (3D US) images. This was a semi-automated method and was tested on a small database having 68 images form 17 patients. The authors have used the point distribution model to represent an object as a set of labeled points. However, no ROI was computed. The performance was evaluated against manual tracings. The method yielded a Dice similarity coefficient (DSC) of 94.4 % ± 3.2 % and 92.8 % ± 3.3 % for the media-adventitia-boundary (MAB) and lumen-intima-boundary (LIB), respectively. Yang et al. [8] method did not use longitudinal images in their study.

Lumen axis detection for carotid artery was discussed by Rocha et al. [22] using dynamic programming. They assumed that the CCA is the largest blood vessel nearest to the bottom of the image and the jugular vein will appear above the CCA. Even though the algorithm by Rocha et al. tested on 199 images and has got a success rate of 99.5 %, they did not attempt to measure the carotid LD. Their objective was limited to the detection of carotid lumen axis and not the measurement of any parameters. One of the main limitations of their study is that the near wall was not considered in their approach, since its visibility in 2D B-modes images is usually much worse than the far wall visibility. However, the measurement of lumen diameter requires the segmentation of near wall borders.

A user-independent, real-time algorithm is introduced by Sifakis et al. [18] for carotid artery localization in longitudinal B-mode ultrasound images. Their technique used basic image statistics along with arterial anatomical knowledge. They built their model with the assumption of three regions (brightest far wall region, darkest lumen, and then the second brightest near wall) in an ultrasound image. The method’s evaluation and parameter value optimization were performed on a threefold cross validation basis. Even though the method proposed by Sifakis et al. [18] showed carotid artery segmentation, there was no discussion about the quantification of lumen diameter and benchmarking against gold standard. Recently, Carvalho et al. [50] proposed a method using simultaneously acquired B-mode ultrasound and contrast enhanced ultrasound image sequences. Their approach included motion estimation from image sequences and image registration. This work seemed to be quite different from the proposed approach. However, they used a graph based approach for the lumen segmentation.

Sharma et al. [51] presented AtheroEdge™ 2.0, a two stage process for automated carotid lumen diameter measurement. This method combines spectral analysis with pixel classification paradigm for LD measurement. The outputs of this clinical system were LD along with stenosis severity index (SSI). The algorithm was tested on 202 patients and achieved a precision-of-merit of 98.05 % and 99.03 % w.r.t to two observers. However, this system lacks a model-based approach for lumen segmentation.

The proposed method combines the scale-space paradigm with pixel classification as global and local models. This paper also benchmarks the current paradigm against a boundary-based approach where the global shape is extracted first followed by edge capturing using a level set paradigm. Further, we have tested our algorithm on a large database consisting of more than 700 images from 252 subjects. We got the POM values 99.7 % and 97.9 % and CC values 0.94 and 0.93 compared to manual tracings by experts which are quite good for an automated system. Hence we believe that our automated system has the potential to use in a clinical environment.

Strengths, weakness and extensions

Though our system is able to show promising results and has been well validated against manual tracings, we believe that there is a scope for improving the automated system. We have a plan to take out the curved vessels from the database and develop an improved system considering the curvature. Other possibilities include the application of any de-speckling technique before processing and multi-resolution analysis. Even though the proposed system showed high performance, as a future work, it can be tested on multi-ethnic databases from different countries. This will also help in improving the system by tackling more irregularities in the walls of the carotid artery.

Conclusions

We have presented two different algorithms for the automated lumen region segmentation and diameter measurement from B-mode ultrasound images. Both techniques are modelled as a two stage process. Stage one consisted of global-based model using scale-space framework combined with spectral analysis for extraction of the region of interest which is common for both algorithms. Stage two is modelled as a local-based strategy that extracts the lumen interfaces. The algorithm-1 is modelled as a region-based strategy using a multiclass framework, whereas the algorithm-2 is modelled as a boundary-based approach that uses the level set framework. Comprehensive statistical data analysis was performed to ensure the superior performance of the region-based technique against the two manual tracings. The region-based system showed high precision of merit and reliability. This image-based system for the automated lumen diameter measurement holds significant potential as a tool for stroke and cardiovascular risk stratification.