Abstract
Lung cancer is one of the leading causes of mortality worldwide. The survival rate of lung cancer depends on its timely detection and diagnosis. For pulmonary cancer detection, numerous Computer-Assisted Diagnosis (CADx) systems have been developed that use the CT scan imaging modality. Recent advancement in deep learning techniques has enabled these CADx to automatically model high-level abstractions in CT-Scan images using a multi-layered Convolutional Neural Network (CNN). Our proposed CAD system comprises 3D residual U-Net for nodule detection. Initially, the 3D residual U-Net resulted in false positive results; therefore, a multi-Region Proposal Network (mRPN) was proposed for the improvement of nodule detection. The detected nodules are assigned a probability of malignancy. Furthermore, each detected nodule is classified into four classes based on its respective malignancy score. Extensive experimental results illustrate the effectiveness of our 3D residual U-Net model. These results demonstrate the exceptional detection performance achieved by our proposed model with a sensitivity of 97.65% and an average classification accuracy of 96.37%. Performance analysis demonstrates the potential of the proposed CAD system for the detection and classification of lung nodules with high efficiency and precision.
Access provided by Autonomous University of Puebla. Download conference paper PDF
Similar content being viewed by others
Keywords
1 Introduction
Lung cancer has the highest mortality rate in both males and females where the 3-year survival rate for patients with lung cancer is 25% [19]. There are no obvious symptoms at the beginning of lung cancer, and as a consequence, most patients seek treatment at the later stage, minimizing survival chances. Therefore, early detection and diagnosis of lung cancer is of the utmost importance [1]. The chest computed tomography (CT) imaging modality provides high-resolution images of nodules with lavish details; however, pulmonary nodules have inhomogeneous densities and lower contrast compared to blood vessel segments and other anatomical structures, increasing the complexity of nodule detection [2, 12]. To assist radiologists in automatically detecting nodules and replacing the time-consuming manual delineation of nodules, Computer-Aided Detection (CADe) and Diagnosis (CADx) systems are developed. The latest technologies use Artificial Intelligence (AI) to assist in the auxiliary diagnosis of the disease and improve the overall accuracy of the diagnosis while decreasing the detection time [10]. In recent literature, researchers have presented deep learning-based CAD systems with promising results. The convolutional neural network (CNN) framework has been used for the classification of nodules [7] and the reduction of false positive (FP) [20]. Shen et al. proposed a Multi-Crop CNN (MC-CNN) [18] and Setio et al. developed Multi-View CNN (MV-CNN) [16] to classify lung nodule. A 3D-CNN model based on Volumes of Interest (VOI) and a Fully Convolutional Network (FCN) was used to produce a score map for nodule classification [9]. Both CADe and CADx systems have been independently investigated [6], CADe are unable to provide lesion’s radiological characteristics, consequently missing crucial information, while CADx systems do not identify lesions and therefore do not possess high levels of automation. Therefore, a new and advanced CAD system is needed that incorporates the benefits of detection from CADe and diagnosis from CADx into a single system for better performance.
1.1 Contribution
Our contribution is as follows:
-
3D Residual U-Net Model A novel nodule detection method is proposed using 3D CT images for candidate nodule detection; compared to existing 2D U-Net models, our 3D residual model considers rich spatial features and therefore has more discriminative selection criteria.
-
Multi-Region Proposal Network (mRPN) We added four RPNs so that nodules with varying diameters can be detected with ease and efficiency. The RPN split-and-merge cascade network mitigates the problem of undetected small nodules.
-
Malignancy Score-Based Approach (MSBA) Malignancy score is calculated to classify each detected nodule into one of the four classes based on its aggregate malignancy score.
-
False-Positive Reduction Algorithm (FPRA) We proposed an algorithm for a false positive (FP) reduction rate. Comparative results are much better than the existing FP reduction algorithm. The performance of the proposed CAD system is evaluated with state-of-the-art CAD systems using various performance evaluation metrics. The experimental results showed that the proposed method can not only be used for detection but also performs well for the classification of pulmonary cancer nodules as malignant and benign.
1.2 Paper Organization
Section 2 describes our approach, Sect. 3 discusses the implementation details, Sect. 4 describes the experimental results, and Sect. 5 concludes this paper.
2 Our Approach
We improved three aspects of the lung cancer detection models. First, the datasets used by most authors do not consider the lung wall. We considered location as one of the most important features and therefore found that most nodules occur in close proximity to the lung wall. Therefore, our model avoided omitting the edge of the lung. Secondly, we used 3d U-Net to filter candidate nodules [22]. Finally, we applied CNNs for nodule classification.
2.1 Pre-processing
Our in-house dataset comprises 56 patients’ CT scans using a GE CT scanner (with contrast and 3mm slice thickness) in DICOM format. Each CT scan is composed of 80 to 200 distinct slices and the primary tumor was manually delineated using 3D-Slicer software. We resized each CT-scan data using average upsampling and average downsampling with bilinear interpolation by ImageJ software. Furthermore, we scaled the pixel value using the min-max scalar method and applied the CLAHE (Contrast Limited Adaptive Histogram Equalization) method. Data augmentation is necessary because deep learning-based models require large training datasets. The positive dataset to train our proposed model was insufficient, leading to the overfitting problem. We increased the positive dataset sample by using \(128\times 128\times 128\) window size. We used affine transformations (rotation [\(0^\circ \) to \(270^\circ \) around the center point], flipping, translation, and scaling) along with image enhancement by Gaussian High Pass filter with kernel size \(3\times 3\) to improve image quality and sharpening filter. For each image \(P_{n}(\textrm{z})\) shown in Eq. (1), and S in Eq. (2) where \(r_t\), \(z_t\) is the reset and update to apply the affine transformations, respectively. While \(\hat{h}_{t}\) is the final augmented state of the data. The affine transformations were standardized so that the average samples have variance=1 and mean=0.
2.2 RPN Split-Merge Cascade Network
For the detection of various nodules having different diameters, we used varying levels of \(RPN_{Lx}\) referring to different sizes of the nodule. We set the \(RPN_{L1}\) as small anchors to detect the diameter of the nodule \(\tau \) that ranges from 3 mm to 10 mm and has a volume \(\upsilon <\)= 80 mm\(^3\) while \(RPN_{L2}\), \(RPN_{L3}\) and \(RPN_{L4}\) have large anchors to detect nodules ranging from \(\tau \) = 10 mm–20 mm or \(\upsilon \) = 80–200 mm\(^3\), \(\tau \) = 20 mm–30 mm or \(\upsilon >\)= 200–300 mm\(^3\) and \(\tau >\)= 30 mm or \(\upsilon >\) 300 mm\(^3\), respectively. The motivation behind these RPNs is the four stages of lung cancer that are categorized by different diameters, while \(RPN_{L1}\) is for all the input nodules. The RPN split and merge cascade network starts with the \(RPN_{L1}\) which is further split to either \(RPN_{L2}\) or the rest and then in the next step it is input for the \(RPN_{L3}\) or \(RPN_{L4}\). Since each RPN level generates separate RoI sets, a merging layer is required that combines the RoI sets into one, the RPN levels merge layer \(RPN_m\) takes the input RoI sets from all the RPN levels (\(RPN_{L1}\), \(RPN_{L2}\), \(RPN_{L3}\), \(RPN_{L4}\)) and outputs an aggregate RoI set \(RoI_{agg}\). For the possibility of duplicate RoI or low objectiveness score RoI, we used the non-maxima suppression (Non-MS) when the intersection over union (IoU) overlap is above the threshold (threshold set at \(\rho _t = 0.5\)). After using the Non-MS, we selected the top hundred RoI with low objectiveness scores for further use. Nodule detection using different levels of RPN having various anchors improves the detection phase, as both diameter and volume are taken into consideration.
2.3 3D Residual U-Net Training Strategy and Architecture
Our proposed model relies heavily on exploiting the symmetries of the 3D space [16]. Therefore, the lung CT scan is converted into 3D fragments which are used as input for the 3D residual net. For 3D CT images containing lung nodules, the lung nodule regions were cropped to a size of \(128\times 128\times 64\). The 3D residual net detects the module malignancy based on the characteristics obtained from the input image, and the probability of cancer stage is estimated [5]. We used binary valued threshold (corrosion & expansion) and Laplacian of Gaussian to segment the lung nodules (including lung wall), morphologic closing was performed and connected component operations are labeled in order to remove background and noise. A hole-filling algorithm based on contour information was also used to reserve the nodules on the lung wall. We obtained a collection of the interval \([x_k,y_k]\) that contains all the intervals of \(C_n\). \(\textbf{M}\), so if n is large enough, \(\displaystyle \sum _{k=1}^n\left| y_k-x_k\right| <\eta \). But \(\displaystyle \sum _{k=1}^n\left| f_c(y_k)-f_c(x_k)\right| =1\). The segmentation issue is addressed by taking N partitions of the set of features represented by P of classes M, thus minimizing the cost term of the error function by assigning the pixel P in Eq. (3) and Eq. (4)
Another challenge for our proposed model was learning the complex inner spatial relationship between parameters using deeper CNN. We added multiple residual blocks in the middle of the 3D U-Net model, which is capable of producing higher-level packet information. Taking into account the complex anatomical structures surrounding the lung lesion, we needed an effective method to use contextual information at multiple levels [15]. An overview of the CAD system for the detection of lung nodules using the 3D residual U-Net and multi-Region Proposal Network (mRPN) is shown in Fig. 1.
2.4 Malignancy Score-Based Approach (MSBA)
For the classification of detected lung lesions in the nodule detection phase, the Malignancy Score-Based Approach (MSBA) is used to achieve the assessment of nodule malignancy of candidate lesions. For this phase, the regions of interest (RoIs) that are marked by the nodule detection phase are redefined from each marked location resulting from the last step. MSBA assigns the malignancy score to RoIs by considering the metastasis information provided in the data set to classify the candidate nodule into T0, T1, T2, and T3 stages. The details of MSBA are provided in Algorithm 1. The neighboring pixels in 3D surrounding the RoIs are taken into consideration in terms of intensity values and their eigenvalues (Hessian Matrix and the Gradient Matrix) to assign an aggregate score to each candidate-marked lesion. The result of this step is the allocation of the average score to all candidate nodules.
3 Implementation
In addition to our in-house lung cancer CT dataset, we used publicly available datasets namely the LIDC-IDRI [4], ANODE09 [8], and LUNA16 [17] for evaluation. The probability of nodules is calculated for the nodule candidates generated by the classification model. On the basis of this probability, we mark the nodules as benign and malignant in Fig. 2. To reduce false positive results, we have proposed an algorithm that considers the probability of the candidate nodule and further improves the classification of the nodules into different stages while omitting false positive results at each stage. The details of our proposed algorithm for false positive reduction are provided in Algorithm 2.
4 Experimental Results
Our results are obtained using the concept that if a detected nodule is very close to the annotated nodule, we gain a score and the score is related to the FROC curve on sensitivity at 1/8, 1/4, 1/2, 1, 2, 4, and 8 [21]. We obtained a score of 0.974 (MAX = 1) by randomly selecting data records as the test set (excluding training and validation dataset), and an accuracy of approximately 0.997 was recorded. The results are validated using the common performance metric of computer-aided detection and diagnosis systems, i.e., average accuracy, specificity, and sensitivity. For a detailed performance analysis of our CAD system, we applied the ROC curve, which presents the TPR (True Positive Rate) as the FPR function (False Positive Rate). We plotted the sensitivity (TPR) with their respective FPR to compare our CAD system with state-of-the-art CAD systems.
The effectiveness of our method 3D Residual U-Net is verified by comparing it with Convolutional Neural Networks (CNN), Massive training artificial neural networks (MTANNs), Fully Convolutional Networks (FCN), Region-based Fully Convolutional Networks (RFCN), and RNN (Recurrent Neural Networks), the performance results are depicted in Fig. 3. Table 2 summarizes the performance of our proposed CAD system in comparison to other CAD systems in terms of accuracy, sensitivity, specificity, False Positive (FP), and average FP (using FPRA). It is visible from the confusion matrix in Table 1 that the proposed model maintains a high sensitivity for the classification of types of lung cancer even in the region where FP per scan is low. Although the proposed model achieved a high sensitivity value for stage classification for T0, T2, and T3, the sensitivity value of T1 decreases as the FP per scan becomes small, which is impractical in the clinical environment [14]. A summary of our proposed CAD system in terms of stage classification is shown in Table 1. The experimental results demonstrate the superiority in the classification and class generalization of our proposed 3D Residual U-Net based CAD system. A comparison of nodule classification by CAD systems is shown in Fig. 4.
A comparison of our model with the existing state-of-the-art CADe systems on the LIDC-IDRI dataset with varying nodule sizes is shown in Table 3. We have compared the detection accuracy of our CADe system with the detection accuracy of five other existing systems, which are evaluated on the dataset from the LIDC-IDRI database. The high accuracy of our proposed model with varying sizes signifies the detection capability of our 3D Residual U-Net model. Our method has promising results in discriminating cancer nodule types without compromising detection accuracy (Fig. 4).
5 Conclusion
We proposed a novel CAD system for the automatic detection and classification of lung nodules in CT images. Our CAD system comprises two models, the 3D residual U-Net and multi-Region Proposal Network (mRPN), which have demonstrated effective nodule detection results even with small-sized lesions. For the classification of nodules, the aggregate malignancy score is calculated for each detected nodule. Based on this score, detected nodules are classified into four classes; T0, T1, T2, and T3. Experimental results illustrate the efficacy of our proposed CAD system in comparison to state-of-the-art CAD systems that use various performance evaluation metrics. Our proposed CAD system is generic and therefore could possibly be extended to the detection of other cancers.
References
Adams, S.J., Stone, E., Baldwin, D.R., Vliegenthart, R., Lee, P., Fintelmann, F.J.: Lung cancer screening. Lancet 401(10374), 390–408 (2023)
Alahmari, S.S., Cherezov, D., Goldgof, D.B., Hall, L.O., Gillies, R.J., Schabath, M.B.: Delta radiomics improves pulmonary nodule malignancy prediction in lung cancer screening. IEEE Access 6, 77796–77806 (2018)
Alves, J.H., Neto, P.M.M., Oliveira, L.F.: Extracting lungs from ct images using fully convolutional networks. In: 2018 International Joint Conference on Neural Networks (IJCNN), pp. 1–8. IEEE (2018)
Armato, S.G., III.: The lung image database consortium (LIDC) and image database resource initiative (IDRI): a completed reference database of lung nodules on CT scans. Med. Phys. 38(2), 915–931 (2011)
Dou, Q., Chen, H., Yu, L., Qin, J., Heng, P.A.: Multilevel contextual 3-D CNNs for false positive reduction in pulmonary nodule detection. IEEE Trans. Biomed. Eng. 64(7), 1558–1567 (2017)
Firmino, M., Angelo, G., Morais, H., Dantas, M.R., Valentim, R.: Computer-aided detection (CADe) and diagnosis (CADx) system for lung cancer with likelihood of malignancy. BioMed. Eng. OnLine 15(1), 2:1–2:17 (2016)
van Ginneken, B., Setio, A.A.A., Jacobs, C., Ciompi, F.: Off-the-shelf convolutional neural network features for pulmonary nodule detection in computed tomography scans. In: IEEE International Symposium on Biomedical Imaging, pp. 286–289 (2015)
van Ginneken, B., et al.: Comparing and combining algorithms for computer-aided detection of pulmonary nodules in computed tomography scans: the ANODE09 study. Med. Image Anal. 14(6), 707–722 (2010)
Hamidian, S., Sahiner, B., Petrick, N., Pezeshk, A.: 3D convolutional neural network for automatic detection of lung nodules in chest CT. In: Proceedings of SPIE, vol. 10134 (2017)
Huang, S., Yang, J., Shen, N., Xu, Q., Zhao, Q.: Artificial intelligence in lung cancer diagnosis and prognosis: current application and future perspective. In: Seminars in Cancer Biology. Elsevier (2023)
Hussein, S., Gillies, R., Cao, K., Song, Q., Bagci, U.: TumorNet: lung nodule characterization using multi-view convolutional neural network with Gaussian process. In: IEEE International Symposium on Biomedical Imaging, pp. 1007–1010 (2017)
Li, X., Deng, Z., Deng, Q., Zhang, L., Niu, T., Kuang, Y.: A novel deep learning framework for internal gross target volume definition from 4d computed tomography of lung cancer patients. IEEE Access 6, 37775–37783 (2018)
Masood, A., et al.: Computer-assisted decision support system in pulmonary cancer detection and stage classification on CT images. J. Biomed. Inf. 79, 117–128 (2018)
Masood, A., et al.: Automated decision support system for lung cancer detection and classification via enhanced rfcn with multilayer fusion rpn. IEEE Trans. Ind. Inf. 16(12), 7791–7801 (2020)
Masood, A., et al.: Cloud-based automated clinical decision support system for detection and diagnosis of lung cancer in chest CT. IEEE J. Transl. Eng. Health Med. 8, 1–13 (2019)
Setio, A.A.A., et al.: Pulmonary nodule detection in CT images: false positive reduction using multi-view convolutional networks. IEEE Trans. Med. Imaging 35(5), 1160–1169 (2016)
Setio, A.A.A., et al.: Validation, comparison, and combination of algorithms for automatic detection of pulmonary nodules in computed tomography images: The LUNA16 challenge. Med. Image Anal. 42, 1–13 (2017)
Shen, W., et al.: Multi-crop convolutional neural networks for lung nodule malignancy suspiciousness classification. Pattern Recogn. 61, 663–673 (2017)
Siegel, R.L., Miller, K.D., Wagle, N.S., Jemal, A.: Cancer statistics, 2023. Ca Canc. J. Clin. 73(1), 17–48 (2023)
Tan, M., Deklerck, R., Jansen, B., Bister, M., Cornelis, J.: A novel computer-aided lung nodule detection system for CT images. Med. Phys. 38(10), 5630–5645 (2011)
Teramoto, A., Fujita, H.: Fast lung nodule detection in chest CT images using cylindrical nodule-enhancement filter. Int. J. Comput. Assist. Radiol. Surg. 8(2), 193–205 (2013)
Xie, Y., Zhang, J., Xia, Y., Fulham, M., Zhang, Y.: Fusing texture, shape and deep model-learned information at decision level for automated classification of lung nodules on chest CT. Inf. Fusion 42, 102–110 (2018)
Yuan, J., Liu, X., Hou, F., Qin, H., Hao, A.: Hybrid-feature-guided lung nodule type classification on CT images. Comput. Graph. 70, 288–299 (2018)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Masood, A., Naseem, U., Nasim, M. (2023). Fully Automated CAD System for Lung Cancer Detection and Classification Using 3D Residual U-Net with multi-Region Proposal Network (mRPN) in CT Images. In: Ali, S., van der Sommen, F., van Eijnatten, M., Papież, B.W., Jin, Y., Kolenbrander, I. (eds) Cancer Prevention Through Early Detection. CaPTion 2023. Lecture Notes in Computer Science, vol 14295. Springer, Cham. https://doi.org/10.1007/978-3-031-45350-2_3
Download citation
DOI: https://doi.org/10.1007/978-3-031-45350-2_3
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-45349-6
Online ISBN: 978-3-031-45350-2
eBook Packages: Computer ScienceComputer Science (R0)