1 Introduction

Breast cancer is one of the leading causes of deaths from cancer for the female population in both developed and developing countries. Earlier treatment requires early diagnosis, and early diagnosis requires an accurate and reliable diagnostic procedure that allows physicians to differentiate benign breast tumors from malignant ones [1]. The most frequently adopted medical imaging studies for early detection and diagnosis of breast cancers include mammography and ultrasonography. Ultrasound examination, which is non-invasive and non-radiative, is a more convenient and suitable tool for palpable tumors in daily clinical practice. However, ultrasound examination is very operator dependent. The image is non-specific for the diagnosis of benign or malignant lesions according to the echogenecity. The examination described by Stavros et al. [2] is much more extensive than the usual examinations performed at most breast imaging centers. However, the above diagnostic results are achieved by experienced radiologists. Many invasive diagnostic procedures are still required in most cases. Most of these procedures could be avoided if a more specific diagnostic test was available because the rate of positive findings in a biopsy for cancer is low [3]. Thus, a computer-aided diagnosis (CAD) system would be expected to be helpful in diagnosing breast cancer because of the difficulty of such diagnoses.

Chen et al. [47] applied textural features in breast ultrasound images to differentiate between benign and malignant tumors with neural network classifiers. Textural variation in the ultrasound image has been deemed a useful characteristic for distinguishing benign and malignant tumors [8]. The CAD utilizes a multilayer perception (MLP) neural network to perform a good diagnostic result. However, the training process is prolonged and diagnostic performance normally relies on the initial parameter setting [9], i.e., number of neurons, learning rate and moment value are hard to decide. The selections of initial parameters will affect the results drastically. Whereas, the support vector machine (SVM) reveals the feasibility and superiority to extract higher-order statistics. The SVM has become extremely popular in terms of classification and prediction. This study employs the SVM model as a classifier instead of MLP for identifying benign and malignant lesions in the ultrasound image. The proposed diagnosis system can classify the ultrasound images of a breast more accurately and efficiently. The SVM is a reliable choice for the new proposed system because it is fast and excellent in ultrasound image classification.

2 Materials and methods

Normally, a physician can readily pinpoint a tumor in an ultrasound image by the tumor shape and the contrast of internal echoes. Automatic tumor segmentation on an ultrasound image is hard. No satisfactory approaches appear to exist so far, to our knowledge. Thus, the physician manually extracted the rectangular sub-image of the region of interest (ROI) in this study. The rectangular ROI included around 0–5 mm extension from the tumor border. The proposed system employed intensity variation and textural information from the ROI sub-images as features with which to diagnose breast tumors.

2.1 Data acquisition

The ultrasound image database comprises 250 images of pathologically proven benign breast tumors from 215 patients and carcinomas from 35 patients. The ultrasound images were captured at the largest diameter of the tumor. The breast ultrasound image databases contain only histological confirmed cases (either by fine needle aspiration, core needle biopsy or open biopsy). Ultrasound images were performed using an ALOKA SSD 1200 (Tokyo, Japan) scanner and with freeze-frame capability and 7.5 MHz linear transducer. No acoustic standoff pad was used in any of the cases. The databases were supplied by the coauthor, an experienced physician, Dr. Chen. Most of the cases were pathologically proven and some were followed up at least for 2 years.

The monochrome ultrasound images in databases are quantized into 256 gray levels, i.e., eight bits per pixel. The ROI is manually selected by Dr. Chen using the ProImage package. That is, the hardware-bundled software package was used to capture the full image of the ultrasonic scanner and to select the ROI manually. The ROI was then saved as a digital form for later analysis by the proposed SVM system. Figure 1a demonstrates a real-time digitized monochrome ultrasound image. Figure 1b presents an exacted ROI for the tumor.

Fig. 1
figure 1

a A 736×556 full breast ultrasound image and b the region of interest sub-image is approximately 1.79 cm×1.62 cm in size, captured with a resolution of 104×94 pixels (a 1 cm×1 cm rectangle contains 58×58 pixels)

2.2 Ultrasonic textural analysis

An ultrasound image consists of many points with different values of gray level intensity. Different tissues have significantly different textures. The textural variation between benign and malignant is an effective feature for classifying breast tumors. The proposed CAD system exploits the correlation between adjacent pixels within images as features to classify breast tumor. We utilized textural features, i.e., block difference of inverse probabilities (BDIP), block variation of local correlation coefficients (BVLC) [10] and auto-covariance matrix, to identify breast tumor.

Block difference of inverse probabilities, which is one of the proposed texture features, is defined as the difference between the number of pixels in a block. That is

$$ {\text{BDIP}} = P^{2} - \frac{{{\sum\nolimits_{(x,y) \in B} {f(x,y)} }}} {{\max _{{(x,y) \in B}} f(x,y)}}, $$
(1)

where B denotes a block of size P × P. The larger the variation of intensities there is in a block, the higher the value of BDIP. BVLC, the second texture feature, is known to measure texture smoothness well. The value of BVLC is defined as follows:

$$ {\text{BVLC}} = {\mathop {\max }\limits_{(k,l) \in O_{4} } }[\rho (k,l)] - {\mathop {\min }\limits_{(k,l) \in O_{4} } }[\rho (k,l)],\quad O_{4} = \{ (0,1),(1,0),(1,1),(1, - 1)\} , $$
(2)

and

$$ \rho (k,l) = \frac{{(1/P^{2} ){\sum\nolimits_{(x,y) \in B} {f(x,y)f(x + k,y + l)\mu _{{0,0}} ,\mu _{{k,l}} } }}} {{\sigma _{{0,0}} ,\sigma _{{k,l}} }}, $$
(3)

where μ 0,0 and σ 0,0 represent the local mean value and standard deviation of the block with size P × P. The (k, l) term denotes four orientations (−90°, 0°, −45°, 45°). As a result, μ k,l and σ k,l represent the mean value and standard deviation of the shifted block, respectively. The larger BVLC value indicated that the ingredients in the block are rough. In this study, P is chosen to be 2. The third texture feature we propose, the auto-covariance matrix, can reflect the inter-pixel correlation within an image. The modified auto-covariance coefficients between pixel (i, j) and pixel (i + Δm, j + Δn) in an image with size M × N is defined as

$$ \gamma ({\text{ $ \Delta $ }}m,{\text{ $ \Delta $ }}n) = 1 - \frac{{A({\text{ $ \Delta $ }}m,{\text{ $ \Delta $ }}n)}} {{A(0,0)}}, $$
(4)

where

$$ A({\text{ $ \Delta $ }}m,{\text{ $ \Delta $ }}n) = \frac{1} {{(M - {\text{ $ \Delta $ }}m)(N - {\text{ $ \Delta $ }}n)}}{\sum\limits_{x = 0}^{M - 1 - {\text{ $ \Delta $ }}m} {{\sum\limits_{y = 0}^{N - 1 - {\text{ $ \Delta $ }}n} {{\left| {(f(x,y) - \mu )(f(x + {\text{ $ \Delta $ }}m,y + {\text{ $ \Delta $ }}n) - \mu )} \right|}} }} }, $$
(5)

where μ is the mean value of f(x, y). The size of the auto-covariance matrix was Δm × Δn. These texture features were performed as feature vector for each tumor ROI sub-image to diagnose breast cancer.

2.3 Support vector machine classification

The aim of SVM is to devise a computationally efficient way of learning separating hyperplanes in a high-dimensional feature space [11]. The SVMs have been shown to be an efficient method for many real-world problems because of its high generalization performance without the need to add a priori knowledge. Thus, SVMs have much attention as a successful tool for classification [12, 13], image recognition [14, 15] and bioinformatics [16, 17]. The SVM model can map the input vectors into a high-dimensional feature space through some non-linear mapping, chosen a priori. In this space, an optimal separating hyperplane is constructed. SVM is the implementation of the structural risk minimization principle whose object is to minimize the upper bound on the generalization error. Given a set of training vectors (l in total) belonging to separate classes, (x 1, y 1), (x 2, y 2), (x 3, y 3), ..., (x l, y l), where \( x_{i} \in R^{n} \) denotes the ith input vector and \( y_{i} \in \{ + 1, - 1\} \) is the corresponding desired output. The maximal margin classifier aims to find a hyperplane w: wx + b = 0 to separate the training data. In the possible hyperplanes, only one maximizes the margin and the nearest data point of each class. Figure 2 shows the optimal separating hyperplane with the largest margin. The support vectors denote the points lying on the margin border. The solution to the classification is given by the decision function

$$ f(x) = {\text{sign}}{\left( {{\sum\limits_{i = 1}^{N_{{{\text{SV}}}} } {\alpha _{i} y_{i} k(s_{i} ,x) + b} }} \right)}, $$
(6)

where α i is the positive Lagrange multiplier, s i is the support vector (N SV in total) and k(s i , x) is the function for convolution of the kernel of the decision function. The radial kernels perform best in our experimental comparison, and, hence, are chosen in the proposed diagnosis system. The radial kernels are defined as

$$ k(x,y) = \exp {\left( { - \gamma (x - y)^{2} } \right)}, $$
(7)

where \( \gamma \in R \) is a non-zero parameter.

Fig. 2
figure 2

Optimal hyperplane for support vector machine

In this study, the modified normalized auto-covariance matrix and a set of the first and second moments of BDIP and BVLC are used as the feature vectors for the input of the SVM classifier. In this study, both Δm and Δn are 5. Because the value of γ(0, 0) is always 1, excluding the element γ(0, 0), other texture features are formed as a 28-D image feature vector. The 28-D textural feature vector is used as the input signal of the SVM classifier. Take note that the output value of the SVM is either −1 or 1. When the output value of an ROI sub-image of a suspicious tumor region is near enough to 1, the CAD system will classify the tumor in the ultrasound image as malignant. Conversely, when the output value is close to −1, the tumor will be diagnosed as benign.

3 Simulations and results

The most common means of measuring diagnostic accuracy for reconstructed images is based on receiver operating characteristic (ROC) analysis. The other measure was the A Z value, which was calculated by the ROC curves (software package LABROC1 by Prof. C.E. Metz, University of Chicago, IL, USA). The area A Z under the ROC curve is an index of the quantitative measure of the overall performance of a diagnosis [18], as shown in Fig. 3. A Z value could, therefore, compare performances using different methods to clearly distinguish positive and negative finding of breast tumors. Usually, the overall performance of a diagnostic system can be evaluated by examining the ROC area index, A Z , over the testing output values. The k-fold cross-validation method [19] is used to estimate the performance of the proposed SVM system and the MLP system designed by Chen et al. [7] (denoted by Chen’s CAD). Once trained, the CAD systems are then tested on the group that was set aside. The second group is then removed, and the remaining (k−1) groups are trained and the CAD systems are tested on the excluded group. This process is repeated until all the k groups have been used in turn as the group that is set aside and used for testing. In the simulations, k is 10 and each group has 25 ultrasound images. Because the radial kernels perform best in the experimental results, the kernels are chosen in the proposed SVM diagnosis system. Figure 4 shows the diagnosis performance for the SVM system with different γ values. With γ ranging from 0.01 to 0.02, the proposed CAD system obtains a stable and high accuracy.

Fig. 3
figure 3

The graph of receiver operating characteristic (ROC) curves (A Z , the area under the ROC curve, is an index of the quantitative measure of the overall performance of a diagnosis)

Fig. 4
figure 4

Performance of the proposed support vector machine system with parameter γ (gamma) values

Figures 5 and 6 illustrate the diagram of the ROC curve for Chen’s CAD and the proposed SVM system in the classification of malignant and benign tumors. The proposed SVM system and Chen’s CAD achieve A Z =0.9561 and 0.9496, respectively. Table 1 lists the number of misdiagnosed cases of the Chen’s CAD (threshold=0.2) and the proposed SVM system for each test set. The accuracy, sensitivity, specificity, PPV and NPV of Chen’s CAD and the proposed SVM system are illustrated in Table 2. We also compare the computation time for the CAD systems. The simulations were made on a single CPU Intel Pentium-4® 2.4 GHz personal computer with Microsoft Windows XP® operating system. Table 3 shows the training for the ultrasound image database and the average diagnosis time for each breast tumor. The training and average diagnosis time of the Chen’s CAD is 778 and 1,994 times longer than that of the proposed CAD, respectively. In all the simulations, the results from the proposed CAD system obtain better classification performance and speedy computation than those obtained with Chen’s CAD.

Fig. 5
figure 5

Diagram of the receiver operating characteristic (ROC) curve using Chen’s computer-aided diagnosis in the classification of malignant and benign tumors. The A Z value for the ROC curve is 0.9496±0.020

Fig. 6
figure 6

Diagram of the receiver operating characteristic (ROC) curve using the proposed computer-aided diagnosis in the classification of malignant and benign tumors. The A Z value for the ROC curve is 0.9561±0.018

Table 1 The number of misdiagnosed cases of Chen’s computer-aided diagnosis (CAD) and the proposed CAD for each test set
Table 2 The performance of Chen’s computer-aided diagnosis (CAD) and the proposed CAD with γ=0.01
Table 3 The computation time using Chen’s CAD and the proposed CAD. The training time is evaluated by using the ultrasound image database that contains 250 images. The diagnosis time is the average computation time for an ultrasound image

4 Conclusions

Ultrasound has become one of the major imaging modalities for the diagnosis of breast lesions. Improved imaging techniques permit the management of detected breast lesions to become less invasive. This study proposes an efficient CAD system using the SVM model to differentiate between benign and malignant tumors. CAD systems have been developed for diagnosis of many human diseases on medical imaging. An ingenious CAD system is able to assist radiologists and physicians in detecting lesions and in differentiating benign from malignant lesions on the basis of medical images. The results produced by CAD can be used as a “second opinion” to assist radiologists in their interpretations and to improve diagnostic accuracy. To avoid needless biopsy and enhance the diagnostic accuracy, a CAD system can provide a second beneficial support reference. The proposed CAD system diagnoses breast tumors using texture features within the ultrasound image. The MLP classifier in Chen’s CAD is replaced by the SVM model to obtain a better result for distinguishing between benign and malignant cases. The proposed CAD performs differential diagnosis very well. From the highly satisfactory specificity and sensitivity of results, the proposed CAD is expected to be a helpful tool for classifying benign and malignant tumors in ultrasound images. The MLP system suffered from the time consuming and initial condition dependent problems. The proposed SVM approach identifies solid breast nodules with a comparatively high accuracy. With the expansion of the database, new cases can easily be trained and used as references. Experimental results demonstrate the feasibility and excellent performance of the proposed CAD system in ultrasound image classification.