Introduction

Lung cancer accounts for the highest number of cancer-related deaths compared to any other cancer [1]. In the USA, an estimated number of new lung cancer cases is 2, 21, and 200 in 2015, accounting for about 13 % of all cancer’s diagnosed [1]. The estimated number of deaths due to lung cancer in the USA is 1, 58, and 040 in 2015, accounting for about 27 % of all cancer deaths [1]. Lung cancer is frequent for smokers as well as for non-smokers, and its causes remain unknown. At present, there is no effective way to prevent lung cancer except campaigning against smoking. Early diagnosis of lung cancer could improve 5-year survival rate from 15 to 80 % [9]. Several studies proved that screening program can substantially reduce the mortality rate due to lung cancer [9, 29]. The pulmonary nodules are potential manifestation of lung cancer [4]. The pulmonary nodules are blob-like structures with a diameter of 3 to 30 mm. Small nodules (diameter <1 cm) are difficult to access by standard needle biopsy. The biopsy procedure is more difficult for small nodules, located away from the chest wall. A classification scheme with minimal user intervention is required for the diagnosis of lung cancer.

Several features such as 2D shape-based [14], 3D shape-based [10], 3D texture-based [12], and combinations of 2D shape-based and texture-based [2, 11] features have been used for the classification of pulmonary nodules. No classification work is reported in the literature using the combination of 3D shape-based and 3D texture-based features. The aim of the present work is to improve the performance of classification of nodules using a combination of 2D shape-based, 3D shape-based, 3D margin-based, 2D texture-based, and 3D texture-based features. A semi-automated technique is used for segmentation of pulmonary nodules [7]. The proposed method provides the best classification accuracy using a set of 49 features. Support vector machine (SVM) is used for classification of benign and malignant nodules. The proposed method is compared with the most recent classification work of Han et al. [12]. The proposed classification scheme is also compared with the automated Han et al., where the pulmonary nodules are segmented using the method of Dhara et al. [7].

The paper is arranged as follows: “Prior Works on Classification of Pulmonary Nodules” section reviews the reported works in the literature, “Classification of Benign and Malignant Pulmonary Nodules” section describes the proposed classification framework, “Feature Selection” section describes the method of feature selection, and “Database” section describes the database. In “Results and Discussion” section, results of classification are stated.

Prior Works on Classification of Pulmonary Nodules

Several classifiers such as linear discriminant analysis (LDA) [2, 15, 20], artificial neural network (ANN) [19, 28], and SVM [12] have been used for the classification of benign and malignant pulmonary nodules. The reported classification works based on LDA, ANN, and SVM are reviewed in subsequent paragraphs.

McNitt-Gray et al. [20] used shape and texture features of a nodule slice and reported the classification accuracy of 90.3 % on a data set of 31 nodules using LDA. Armato et al. [2] also used several shape and texture features and classified the nodules using LDA. The method was validated on a data set of 470 nodules and reported A z is 0.79. Way et al. [31] developed a computer-aided diagnosis (CAD) system using morphological and gray-level features. A data set of 96 nodules was used and the reported A z is 0.83±0.04.

Matsuki et al. [19] considered seven clinical parameters and 16 radiological findings to categorize the pulmonary nodules into benign and malignant. The method was validated on a data set of 155 nodules using a three-layer, feed-forward ANN with a back-propagation algorithm. The performance of the ANN classifier was evaluated using the round-robin method of training and testing. The reported A z is 0.95. Suzuki et al. [28] applied massive training artificial neural network for the classification of benign and malignant nodules. Three gray-level-based features, two edge-based features, a morphological feature, and the clinical information were used to represent the pulmonary nodules. The data set consists of 76 malignant nodules and 413 benign nodules. They also evaluated the classifier performance using the round-robin method of training and testing, and the reported the A z of 0.88.

Han et al. [12] studied the role of three well-known texture features (Haralick, Gabor, and local binary patterns) in the classification of pulmonary nodules. They established the efficacy of 3D Haralick features as compared to the Gabor and local binary pattern-based features of a representative of the nodule. The method was validated on a data set of 1012 nodules of LIDC/IDRI. The highest classification accuracy was obtained by considering the nodules with composite rank of malignancy “1” and “2” as benign and “4” and “5” as malignant. They used SVM classifier and reported an A z of 0.94. This technique depends on radiologists for the segmentation of nodules.

Classification of Benign and Malignant Pulmonary Nodules

A cubic volume of interest (VOI) of size (40×40×40 mm) is selected considering the seed point as the centroid of the VOI. Pulmonary nodules are segmented using an established segmentation technique [7]. Several shape-based, margin-based, and texture-based features are computed from the segmented nodule. The selected features are used for the classification of nodules. The block diagram of the proposed classification scheme is provided in Fig. 1.

Fig. 1
figure 1

Block diagram of the proposed classification scheme

Segmentation of Pulmonary Nodules

Several works on segmentation based on morphological filtering [17, 18, 21] and active contour model [25] are reported in the literature. In this paper, pulmonary nodules are segmented using the method of Dhara et al. [7]. The segmentation technique is applicable for different types of pulmonary nodules based on their internal texture (viz, solid, part-solid, and non-solid) and external attachment (viz, juxta-pleural and juxta-vascular). In this segmentation technique, pulmonary nodules are classified into solid/part-solid or non-solid based on the analysis of intensity distribution in the core of nodule. Depending on the class level of nodules (solid/partsolid or non-solid), a particular algorithm is set for nodule segmentation.

The performance of the segmentation technique of Dhara et al. [7] was compared with the techniques of Kuhnigh et al. [17], Moltz et al. [21], and Kubota et al. [18] in terms of four contour-based metrics (mean distance, Pratt function, Hausdorff distance, and modified Hausdorff distance) and six region-based metrics (accuracy, overlap, sensitivity, specificity, similarity angle, and similarity region) [26] by mean of comparing the binary mask of the segmented nodule with the reference ground truth mask. The experimental results prove that the segmentation technique of Dhara et al. [7] is reliable for the segmentation of various types of pulmonary nodules with an improved accuracy.

Representation of Pulmonary Nodules

Malignant nodules have uneven surface due to the presence of spiculation and lobulation, whereas benign nodules have smooth surface (Fig. 2). The margin-sharpness of nodules and the gray scale distribution of nodules play an important role in discrimination of benign and malignant nodules. The imaging characteristics of nodules are represented in terms of several shape-based, margin-based, and texture-based features. The procedures of computing these features are described in subsequent sections.

Fig. 2
figure 2

Example of nodules: a malignant (composite rank of malignancy:5) and b benign (composite rank of malignancy:1)

3D Shape-Based Features

Several 3D shape-based features are computed from the binary mask of the segmented nodules as described below.

  • Sphericity: The irregularity of the shape of a nodule is represented by sphericity [27]

    $$ \mathit{Sphericity} = 6 \sqrt{\pi}V{A}^{-3/2}, $$
    (1)

    where V is the volume of the nodule and A is its surface area. The volume of a nodule is computed by multiplying the total number of voxel with voxel resolution. The surface area of a nodule is computed by summing the area of all triangular faces of the nodule mesh. In this article, the nodule mesh is generated using marching cubes algorithm [16] whenever needed. The range of sphericity is [0,1].

  • Spiculation: The spiculation is computed using the method of Dhara et al. [8]. The height and base area of each spicule are determined. The net spiculation of a nodule is computed in two different ways

    $$ \mathit{Spiculation}_{1} = {\Sigma}_{i = 1}^{N}{e^{- {\omega_{i}}}}{h_{\text{i}}}, $$
    (2)

    and,

    $$ \mathit{Spiculation}_{2} = \frac{\sum\limits_{i=1}^{N}{h_{\text{i}}\cos{\omega_{i}}}}{\sum\limits_{i=1}^{N}{h_{\text{i}}}}, $$
    (3)

    where ω i is the solid angle subtended at peak point of i th spicule, h i is the height of a spicule, and N is the total number of spicules of a nodule under consideration. The nonlinear weighting factor is introduced to emphasize the contribution of sharp spicules.

  • Lobulation: Lobulated surface is associated with the uneven growth rate of nodules. Dhara et al. [8] represented the lobulation in terms of the ratio of total concave surface area and total convex surface area of the nodule mesh. Region growing is applied to get the concave and convex part of the nodule mesh. The lobulation of a nodule is defined as

    $$ Lobulation = \frac{{{S_{\text{concave}}}}}{{{S_{\text{convex}}}}}, $$
    (4)

    where S convex is the total convex surface area and S concave is the total concave surface area of the nodule.

  • Volume: Volume of a nodule is computed by multiplying the total number of voxels with voxel resolution [8].

  • Surface area: Surface area of a nodule is computed by summing the area of all triangular faces of the nodule mesh.

  • Equivalent diameter 3D: It is defined as [27]

    $$ D = 2\sqrt[3]{{\frac{{3V}}{{4\pi }}}}, $$
    (5)

    where V is the volume of the nodule.

  • Major axis length 3D: The major axis length of the bounding ellipsoid of a nodule [21].

  • Minor axis length 3D: The minor axis length of the bounding ellipsoid of a nodule [21].

  • Convex surface area: It is computed by summing the area of all triangular faces in the equivalent triangular mesh (generated by marching cubes algorithm [16]) of the convex hull of a nodule.

2D Shape-Based Features

Several 2D shape-based features are computed from the biggest axial slice (S biggest) of each nodule [24]. The procedures of computing these features are described below.

  • Area: The number of pixel in S biggest multiplied by pixel resolution.

  • Perimeter: The perimeter of S biggest multiplied by pixel resolution.

  • Equivalent diameter 2D: It is the equivalent diameter of S biggest and represented as \(\frac {2}{\pi }\sqrt {Area}\)

  • Convex area: The area of convex hull of S biggest

  • Convex perimeter: The perimeter of the convex hull of S biggest

  • Compactness: It is defined as \(\frac {{Perimeter{^{2}}}}{{4\pi Area}}\)

  • Major axis length: The major axis length of the bounding ellipse of S biggest

  • Minor axis length: The minor axis length of the bounding ellipse of S biggest

  • Circularity: The circularity is defined as

    $$ Circularity = \frac{{4\pi A}}{{{q^{2}}}}, $$
    (6)

    where A is the area of S biggest and q is the convex perimeter of S biggest.

  • Standard deviation of radial distance 2D: The standard deviation of radial distances of all boundary pixels S biggest.

Margin-Based Features

Margin sharpness is a measure of the difference in the intensity of the boundary region of a nodule and its surroundings. Margin sharpness is represented using the combination of acutance and histogram spread of averaged gradient of the nodule. The averaged gradient for K th boundary normal at a nodule slice is represented as [23]

$$ {f_{\text{d}}}(K) = \frac{1}{N}\sum\limits_{k = 1}^{N} {\frac{{(f(k) - b(k))}}{{2k}}}.\ $$
(7)

where f(k) and b(k) are part of normal boundary, located in the foreground and background of the nodule. The acutance of the nodule is defined as

$$ \text{Acutance of nodule} = \frac{1}{{{d_{\max }}}}{\left[ {\frac{1}{K}\sum\limits_{k = 1}^{K} {{f_{d}^{2}}(k)} } \right]^{\frac{1}{2}}}, $$
(8)

where d max is the maximum value of averaged gradient computed at all boundary pixel of the nodule. The average gradient of all boundary pixels for all the slices containing the nodule is denoted by a vector named N_gradient. The dimension of N_gradient is (P×1), where P is the total number of boundary pixel in all image slices containing the nodule. Histogram spread of average gradient (HSAG) of nodule is defined as

$$ \text{HSAG} = \frac{(3^{rd} - 1^{st}) \text{ quartile of histogram of {N}}\_{\text{gradient}}}{\text{(maximum-minimum) of N}\_{\text{gradient}}}. $$
(9)

The histogram spread was introduced by Tripathi et al. [30]. The range of histogram spread is (0,1].

2D Texture-Based Features

Six Haralick features [13] (viz, entropy, energy, inverse difference moment, sum entropy, difference entropy, and contrast) are computed from the biggest representative slice of the segmented nodule. Each combinations of directions (0,45,90,135 degree) and distances (1,2,3,4 pixels) generate a separate gray-level co-occurrence matrix (GLCM). Eleven Haralick features are calculated from each GLCM [24]. The mean of each Haralick feature computed from several GLCMs is used in the proposed work. Histogram of oriented gradient (HOG) [5] features is computed from the biggest representative slice of the segmented nodule. The mean, variance, and standard deviation of HOG features are used in the proposed classification scheme.

3D Texture-Based Features

Han et al. [12] introduced 3D Haralick features. They reported that the Haralick features considering nine directions (Fig. 3) provide better classification performance than five directions. In the proposed method, GLCM was computed for each nodule considering nine directions as described Fig. 3. Fourteen Haralick features [13] were computed from each GLCM matrix. The maximum correlation coefficient is not considered in our experiment, as it is computationally expensive [12]. The mean and the range of each Haralick feature over the nine directions is computed as

$$ {H^{k}}_{\text{mean}} = \frac{1}{n}\sum\limits_{i = 1}^{9} {{H_{\text{ik}}}}, $$
(10)
$$ {H^{k}}_{\text{range}} = \underset{i = 1:9}{\max } \{ {H_{\text{ik}}}\} - \underset{i = 1:9}{\min} \{ {H_{\text{ik}}}\} , $$
(11)

where H ik is the k th Haralick feature considering the i th direction. Total number of 3D texture features is 26.

Fig. 3
figure 3

Nine directions for computation of GLCM matrix

Feature Selection

The performance of the proposed classification scheme can be improved by selecting a set of relevant features. The relevant features are selected by considering A z and p values. The A z is computed using ROCKIT [22], and p values is computed by performing 2-tailed Student’s t test. The list of selected features is provided in “Results of Feature Selection”.

Database

The proposed classification scheme and the competing technique are evaluated on a data set of 891 nodules of Lung Image Database Consortium and Image Database Resource Initiative (LIDC/IDRI) public database [3]. Each slice of lung CT images consists of a matrix of 512×512 pixels. The pixel resolution range from 0.5 to 0.8 mm. The mean ± standard deviation of volume of nodules is 855.48±1545.10 mm3. The meam ± standard deviation of number of slices comprise nodules is 6.44±4.84. In LIDC/IDRI, the nodules with texture index 1 are considered as non-solid, texture index 2 or 3 are considered as part-solid and texture index 4 or 5 are considered as solid. The result of biopsy is not provided for pulmonary nodules. A group of four radiologists have provided the rank of malignancy of each nodule on a scale of 1:5, where rank 1 indicates the least chance of malignancy and rank 5 indicates the highest chance for malignancy. The composite rank of malignancy is the mode of malignancy rank of four radiologists [6]. The composition of nodules based on the composite rank of malignancy is provided in Table 1. Han et al. [12] categorized the pulmonary nodules into benign and malignant, based on the composite rank of malignancy. They considered three different configurations (see Table 2) for the evaluation of their CAD system. The same configurations are used for the evaluation of proposed classification scheme.

Table 1 Composition of pulmonary nodules based on composite rank of malignancy
Table 2 Configurations used for the evaluation of the proposed classification scheme and the competing technique

Results and Discussion

Results of Feature Selection

Total number of features computed from the segmented nodule is 56. The A z and p values of these features are provided in Table 3. Out of 56 features, A z is ≥0.60 and p value is less than 0.05 for 49 features (rank 1 to rank 50 excluding feature of rank 8 in Table 3). These 49 features are used in the proposed classification scheme. The spiculation of nodule is an important feature as it provides A z of 0.78 using Eq. 2. However, the computation of spiculation using Eq. 2 is very sensitive to segmentation error. Hence, spiculation is computed using Eq. 3, as the value of spiculation is more reproducible using Eq. 3.

Table 3 List of computed features with A z and p value

Results of Classification

SVM is used for the classification of benign and malignant nodules. Separate training and test data sets are constructed using five-fold cross-validation approach. The training and testing are performed over 100 times for each configuration. The mean A z for a particular configuration is obtained from the set of 100 classification outcomes. The proposed method is compared with the method of Han et al., which depends on the manual segmentation of pulmonary nodules by a trained radiologist. The method of Han et al. is automated using the segmentation technique of Dhara et al. and named as automated Han et al. The average value of A z for the proposed method, the method of Han et al., and automated Han et al. are provided in Table 4 for different configurations. The ROC plots for different configurations are provided in Fig. 4. The results in Table 4 depict that the proposed method outperforms automated Han et al. as well as the method of Han et al.

Fig. 4
figure 4

a, b, and c show the ROC for configuration 1, configuration 2, and configuration 3, respectively

Table 4 Average values of A z for configuration 1, configuration 2, and configuration 3

The method of Han et al. makes use of the segmented nodule provided by the experienced radiologists. On the other hand, the proposed technique and automated Han et al. use an established segmentation technique and they are affected by the segmentation error. Hence, the method of Han et al. outperforms automated Han et al. though both of them use the same set of features. The proposed classification scheme considers several 2D shape-based, 3D shape-based, margin-based, and texture-based features of nodules, whereas the method of Han et al. uses only texture-based features. Due to the inclusion of more relevant features, the proposed classification scheme outperforms automated Han et al. and the method of Han et al.

The results in Table 4 depict that the classification accuracy is higher in configuration 1 as compared to configuration 2 and configuration 3 due to the absence of uncertain nodules in configuration 1. The classification accuracy in configuration 2 is higher than configuration 3, and the result is consistent with the work of Han et al. The data suggests that nodules with composite rank of malignancy “3” have much similarity towards benign category. The sensitivity and specificity at the best operating point (a point on the ROC curve nearest to the point [0,1]) are provided in Table 5. The results depict that the sensitivity and specificity of the proposed method are slightly better compared to automated Han et al. and the method of Han et al. for three configurations.

Table 5 Sensitivity and specificity at the best operating point for configuration 1, configuration 2, and configuration 3

Conclusion

We have developed a classification scheme for differentiation of benign and malignant pulmonary nodules. The classification scheme could be operated with minimal user intervention and could assists radiologists in the diagnosis of lung cancer. While the most recent technique depends on the user for nodule segmentation, the proposed system would be fast and easy to use as the radiologists need to provide only a seed point for segmentation of nodules. Several shape-based, margin-based, and texture-based features are analyzed to improve the accuracy of classification. An optimal feature set is determined for efficient representation of nodules in multidimensional feature space. The proposed classification scheme outperforms the competing techniques for all configurations. Further work is required to improve the performance of classification. To improve the performance, more research work is required on segmentation of pulmonary nodules and improvement of feature set for proper representation of nodules in the feature space.