Abstract
COVID-19 is a highly communicable disease whose variants, arising from successive mutations, affect people differently. Early detection and diagnosis of the infection can therefore help control its spread. This paper proposes a weighted average ensemble deep learning model, built from ResNet50 and InceptionV3, for binary detection of COVID-19 from CT scan images. The model is created in Google Colab using the Keras module and trained on a graphics processing unit (GPU). The SARS-CoV-2-CT-Scan-Dataset is used to validate the performance of the proposed algorithm. The proposed model achieved an accuracy of 94.23%, higher than either individual model.
1 Introduction
Coronavirus disease 2019, also known as COVID-19, has been declared high risk by the World Health Organization (WHO) and has started the decade with a new strain of respiratory disease. People afflicted with it may have a cold, fever, and chest tightness as symptoms [1]. The recent omicron and deltacron variants are expected to have mild effects but high communicability among the population, resulting in a high spread rate.
Patients with mild symptoms or no symptoms are the easiest to cure during the pandemic [2]. The most important test for detecting the infection is RT-PCR, but testing kits are in short supply [3]. Detecting COVID-19 early lets high-risk patients start treatment without waiting for the RT-PCR result, which helps them recover sooner and lowers the mortality rate. Early detection can also help identify patients with high levels of COVID and test them without RT-PCR [4]. Based on extensive clinical criteria, the CT technician should decide carefully whether to perform a CT scan to verify an abnormal finding from a chest X-ray [5].
The authors of [6] suggested a detail-oriented capsule network architecture, combined with a data augmentation model, that recognizes fine-grained and discriminative image features for classifying patients with COVID-19. Their model obtained an accuracy of 87.6% and an F1-score of 0.871.
In [7], the authors suggested a model named EfficientCovidNet, which uses a voting-based method and a cross-dataset analysis. For identifying COVID-19 from CT scans, their model reaches an accuracy of 87.60% on their dataset. In [8], the authors built a CNN model for distinguishing COVID-19 CT scans from others. Their model, CNN-2, performs much better than the original SqueezeNet, with an accuracy of 85.03% and an F1-score of 0.862. The authors of [9] performed ensemble learning on a combined dataset using ResNet50, Inception V4, and EfficientNetB0, produced predictions by majority vote, and reached an accuracy of 95.36%. The convolutional neural network (CNN) technique in deep learning [10] has shown significant utility in image classification and is therefore the one most widely used by researchers today.
Deep learning techniques are popular for chest CT-scan analysis [11] because they can be applied with low-cost imaging techniques and a large amount of relevant data is available to train models. Manual evaluation of CT scan and X-ray images requires specialized knowledge, is time consuming, and can sometimes be inaccurate; deep learning and machine learning algorithms can extract the relevant information and perform the same COVID-19 evaluation more easily [12]. The two most effective pre-trained deep CNN models, InceptionV3 and ResNet50, have been explored for ensemble learning in earlier research, and their performance has been analysed on the basis of accuracy and other parameters [13]. The goal of the study in [14] is to provide a CNN-based transfer-learning strategy for identifying COVID-19 using multiple models, which can be more accurate. Ultrasound, dermoscopy, X-ray, magnetic resonance imaging (MRI), computerized axial tomography (CAT), and positron emission tomography (PET) are dynamic and developing domains for research, especially in image-processing techniques and algorithms [15]. The model developed in this research is created in Google Colab using the Keras module and trained on a graphics processing unit (GPU).
The primary contributions of this research are as follows. A hybrid model is built with the weighted average ensemble method so that it combines the qualities of both constituent models, InceptionV3 and ResNet50, which increases the detection accuracy. InceptionV3 reaches an accuracy of 90.23% and ResNet50 an accuracy of 89.65%, while the hybrid model reaches an accuracy of 94.23% with an F1-score of 91.56%. The SARS-CoV-2-CT-Scan-Dataset is used for training and testing in an 80:20 ratio.
The paper is organized as follows. Section 2 describes the methodology of the research and the models used. Section 3 presents the results of the proposed model along with a detailed discussion. Finally, Sect. 4 concludes the paper.
2 Methodology
This section details the ensemble deep learning methodology proposed for COVID detection.
2.1 ResNet50
ResNet50 [16] uses skip connections, which resolve the vanishing gradient problem. A skip connection links the input of a block to the output of its convolution layers, so the layers can either be trained or effectively skipped. The network mainly consists of two types of block: the identity block, shown in Fig. 2, and the convolution block, shown in Fig. 3. The identity block is used when the output size of the layers equals the input size, whereas the convolution block is used when the input and output sizes differ; it places a 1 × 1 convolution in the skip connection to make the sizes equal. ResNet50 is more precise with a large dataset than with a small one [17]. The architecture of ResNet50 is shown in Fig. 1.
In the following equation, f(x), which represents the difference between the output and the input of the given layers, is minimized. This keeps the data loss across the layers of the model to a minimum.
where, in Eq. (1), x denotes the input to the convolution layer.
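As an illustration of the skip connection, the following minimal Python sketch adds a block's input back to the output of its layers. It assumes the standard residual formulation, output = f(x) + x, which is consistent with the description of Eq. (1) but is not taken from the paper. The function names and the toy layers are hypothetical; `project_half` only stands in for the 1 × 1 convolution that makes the sizes equal.

```python
from typing import Callable, List

Vector = List[float]


def identity_block(x: Vector, f: Callable[[Vector], Vector]) -> Vector:
    """Identity shortcut: return f(x) + x; f must keep the size of x."""
    fx = f(x)
    if len(fx) != len(x):
        raise ValueError("identity shortcut needs equal input and output sizes")
    return [a + b for a, b in zip(fx, x)]


def projection_block(
    x: Vector,
    f: Callable[[Vector], Vector],
    project: Callable[[Vector], Vector],
) -> Vector:
    """Projection shortcut: return f(x) + project(x) when f changes the size."""
    fx = f(x)
    px = project(x)
    if len(fx) != len(px):
        raise ValueError("projection must match the output size of f")
    return [a + b for a, b in zip(fx, px)]


# Toy stand-ins for the stacked layers (assumptions for illustration only).
def zero_layers(x: Vector) -> Vector:
    """Layers that have learned f(x) = 0, so the block passes x through."""
    return [0.0 for _ in x]


def halve(x: Vector) -> Vector:
    """Layers that halve the size by summing neighbouring pairs."""
    return [x[i] + x[i + 1] for i in range(0, len(x) - 1, 2)]


def project_half(x: Vector) -> Vector:
    """Shortcut that halves the size by keeping every second value."""
    return [x[i] for i in range(0, len(x) - 1, 2)]


x = [1.0, 2.0, 3.0, 4.0]
same_size = identity_block(x, zero_layers)
reduced = projection_block(x, halve, project_half)
```

When the layers contribute nothing, the identity block returns its input unchanged, which is why the gradient can always flow through the shortcut.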
2.2 InceptionV3
The InceptionV3 [18] model is represented in Fig. 4. It learns from filters of different sizes applied in parallel at multiple stages. The model uses both small and large filters to capture all the vital information in the images, which is useful because the location of that information varies. Inception makes the network wider, using parallel filters of different sizes, rather than deeper. The number of parameters is reduced, lowering the computational cost, by factorizing convolutions into smaller convolutions and asymmetric convolutions, which also makes the algorithm more memory efficient. A small CNN is attached between the layers during training, and the loss it computes is added to the main network loss, thus reducing the vanishing gradient problem [19]. Figure 4 describes the architecture of InceptionV3 and the different blocks used to build this model.
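The effect of factorizing convolutions on the parameter count can be checked with a short calculation. The sketch below counts weights only (biases are ignored) and assumes, purely for illustration, 64 input and 64 output channels; `conv_weights` is a hypothetical helper, not a Keras function.

```python
def conv_weights(kernel_h: int, kernel_w: int, channels_in: int, channels_out: int) -> int:
    """Number of weights in one convolution layer, ignoring biases."""
    return kernel_h * kernel_w * channels_in * channels_out


C = 64  # assumed channel count, chosen only for illustration

# Factorization into smaller convolutions: one 5x5 versus two stacked 3x3.
five_by_five = conv_weights(5, 5, C, C)
two_three_by_three = 2 * conv_weights(3, 3, C, C)

# Asymmetric factorization: one 3x3 versus a 1x3 followed by a 3x1.
three_by_three = conv_weights(3, 3, C, C)
asymmetric_pair = conv_weights(1, 3, C, C) + conv_weights(3, 1, C, C)

saving_smaller = 1 - two_three_by_three / five_by_five   # 1 - 18/25
saving_asymmetric = 1 - asymmetric_pair / three_by_three  # 1 - 6/9

print(f"5x5 -> two 3x3: {saving_smaller:.0%} fewer weights")
print(f"3x3 -> 1x3 + 3x1: {saving_asymmetric:.0%} fewer weights")
```

The savings do not depend on the channel count as long as it is the same on both sides of the comparison.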
2.3 Hybrid Ensemble Model
Ensemble learning [20] provides better performance and reduces the spread, i.e. the variance, of the predictions relative to any single model. Ensemble learning has three main types: bagging, boosting, and stacking. This paper uses the bagging-type ensemble learning approach, in which different models are trained on the same dataset and the ensemble model predicts the final class by averaging their validated results.
A deep learning procedure is used that combines the predictions of different models, weighting each model's contribution in proportion to its effectiveness. This is commonly known as a weighted sum or weighted average ensemble. To achieve greater accuracy, the authors propose this hybrid weighted average approach to predict the result. The training set is used to fit the hybrid model in Fig. 5, and the validation set is used to evaluate it. The model weights are determined by each model's accuracy on the validation set.
Here, Accuracy1 is the accuracy of ResNet50 and Accuracy2 is the accuracy of InceptionV3. W1 in Eq. (2) and W2 in Eq. (3) are the calculated weights for ResNet50 and InceptionV3, respectively. Equation (4) describes the hybrid weighted approach used to predict the results.
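A minimal Python sketch of a weighted average ensemble follows. It assumes that each weight is the model's accuracy divided by the sum of both accuracies, and that the ensemble output is the weighted sum of the two models' predicted COVID probabilities. This normalization, the 0.5 decision threshold, and the example probabilities are assumptions for illustration and are not taken from the paper; the accuracies 89.65 and 90.23 are the figures reported for ResNet50 and InceptionV3.

```python
from typing import List, Tuple


def accuracy_weights(accuracy1: float, accuracy2: float) -> Tuple[float, float]:
    """Assumed weighting: each accuracy divided by the sum of both."""
    total = accuracy1 + accuracy2
    if total <= 0:
        raise ValueError("accuracies must sum to a positive value")
    return accuracy1 / total, accuracy2 / total


def weighted_average(p1: List[float], p2: List[float], w1: float, w2: float) -> List[float]:
    """Combine two models' per-image probabilities as w1 * p1 + w2 * p2."""
    if len(p1) != len(p2):
        raise ValueError("both models must score the same images")
    return [w1 * a + w2 * b for a, b in zip(p1, p2)]


def predict_class(probabilities: List[float], threshold: float = 0.5) -> List[int]:
    """Label an image 1 (COVID) when its probability reaches the threshold."""
    return [1 if p >= threshold else 0 for p in probabilities]


# Reported accuracies: ResNet50 (Accuracy1) and InceptionV3 (Accuracy2).
w1, w2 = accuracy_weights(89.65, 90.23)

# Hypothetical per-image COVID probabilities from each model.
p_resnet50 = [0.90, 0.40, 0.20]
p_inceptionv3 = [0.80, 0.70, 0.10]

ensemble_probabilities = weighted_average(p_resnet50, p_inceptionv3, w1, w2)
ensemble_labels = predict_class(ensemble_probabilities)
```

Because the two accuracies are close, the weights are nearly equal under this assumption, so the ensemble behaves almost like a plain average of the two models.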
3 Results and Discussions
3.1 Dataset
In the present paper, the proposed model is used to improve the accuracy of COVID and non-COVID detection from CT scan images. The SARS-CoV-2-CT-Scan-Dataset [17] from Kaggle is used for model training, validation, and testing. The experiment is performed on Google Colab using the Keras library. The dataset comprises 2481 CT scan images from 120 patients, including 1252 COVID images and 1229 non-COVID images. The dataset is divided into training and testing sets in an 80:20 ratio, and the training set is further split into training and validation sets, again in an 80:20 ratio.
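The nested 80:20 split can be sketched as follows. The paper does not state how fractional counts are rounded, whether the split is stratified by class, or which random seed is used, so the rounding rule, the unstratified shuffle, and the seed below are assumptions; `split_indices` is a hypothetical helper.

```python
import random
from typing import List, Tuple


def split_indices(
    n_images: int,
    test_fraction: float = 0.2,
    val_fraction: float = 0.2,
    seed: int = 0,
) -> Tuple[List[int], List[int], List[int]]:
    """Shuffle image indices, hold out a test set, then a validation set."""
    indices = list(range(n_images))
    random.Random(seed).shuffle(indices)

    n_test = round(n_images * test_fraction)
    test_idx = indices[:n_test]
    remaining = indices[n_test:]

    # The validation fraction applies to what is left after the test split.
    n_val = round(len(remaining) * val_fraction)
    val_idx = remaining[:n_val]
    train_idx = remaining[n_val:]
    return train_idx, val_idx, test_idx


train_idx, val_idx, test_idx = split_indices(2481)
print(len(train_idx), len(val_idx), len(test_idx))
```

Under these assumptions the 2481 images yield 1588 training, 397 validation, and 496 test images.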
3.2 Accuracy
The model is evaluated using accuracy, specificity, sensitivity, precision, and F1-score, defined below. The comparison of the proposed hybrid model with the individual models is summarized in Table 1. The following evaluation parameters are used for this comparison.
where TP, TN, FP, and FN denote the true positive, true negative, false positive, and false negative values, respectively.
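The five evaluation parameters can be computed from the confusion-matrix counts as in the sketch below, which uses their standard definitions. The counts in the example are hypothetical and are not results from the paper; the function assumes every denominator is non-zero.

```python
from typing import Dict


def classification_metrics(tp: int, tn: int, fp: int, fn: int) -> Dict[str, float]:
    """Standard binary metrics from TP, TN, FP, and FN counts."""
    accuracy = (tp + tn) / (tp + tn + fp + fn)
    sensitivity = tp / (tp + fn)   # also called recall
    specificity = tn / (tn + fp)
    precision = tp / (tp + fp)
    f1_score = 2 * precision * sensitivity / (precision + sensitivity)
    return {
        "accuracy": accuracy,
        "sensitivity": sensitivity,
        "specificity": specificity,
        "precision": precision,
        "f1_score": f1_score,
    }


# Hypothetical counts, chosen only to exercise the formulas.
example = classification_metrics(tp=45, tn=40, fp=10, fn=5)
for name, value in example.items():
    print(f"{name}: {value:.4f}")
```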
3.3 Discussion
Table 2 compares previous work in this field with the proposed model, which gives better and more accurate results. InceptionV3 and ResNet50 are deep learning algorithms that mitigate the vanishing gradient problem and help produce better accuracy than other CNN algorithms. The hybrid model combines the qualities of both algorithms using the weighted average ensemble method, which helps it achieve better accuracy and prediction than a single model. The Decaps and Peekaboo model [6] reported an accuracy of 87.6% on the dataset prepared by Zhao et al., consisting of 746 CT scan images, which is lower than the proposed methodology by 6.63 percentage points (7.035% in relative terms). Another method, DRE-Net [21], was run on a small database of only 88 CT scans and reached an accuracy of 93%. It showed an unexpected result, with all the parameters having the same value. The proposed model combines the strengths of ResNet50, InceptionV3, and ensemble learning.
The confusion matrix in Fig. 6 gives a visual idea of the accuracy of the proposed model.
4 Conclusion
In this paper, a hybrid ensemble model using InceptionV3 and ResNet50 is proposed to distinguish COVID from non-COVID patients' CT scan images. The experiment is executed on Google Colab with the Keras library on the Kaggle dataset named SARS-CoV-2-CT-Scan-Dataset. The proposed model uses bagging ensemble learning, which produces the final prediction as a weighted average of the individual models' outputs, weighted by their accuracy. It provides an accuracy of 94.23%, which is higher than that of the InceptionV3 and ResNet50 models, which have accuracies of 90.23% and 89.65%, respectively. This research can be extended to distinguish between different lung diseases.
References
1. Hussain A, Kaler J, Tabrez E, Tabrez S, Tabrez SSM (2020) Novel COVID-19: a comprehensive review of transmission, manifestation, and pathogenesis. Cureus. https://doi.org/10.7759/CUREUS.8184
2. Hashmi HAS, Asif HM (2020) Early detection and assessment of covid-19. Front Med 7:311. https://doi.org/10.3389/FMED.2020.00311
3. Wang W, Xu Y, Gao R, Lu R, Han K, Wu G, Tan W (2020) Detection of SARS-CoV-2 in different types of clinical specimens. JAMA 323(18):1843–1844. https://doi.org/10.1001/JAMA.2020.3786
4. Mangal A, Kalia S, Rajgopal H, Rangarajan K, Namboodiri V, Banerjee S, Arora C (2020) CovidAID: COVID-19 detection using chest X-ray. http://arxiv.org/abs/2004.09803
5. Tenda ED, Yulianti M, Asaf MM, Yunus RE, Septiyanti W, Wulani V, Pitoyo CW, Rumende CM, Setiati S, Tenda ED (2020) The importance of chest CT scan in COVID-19: a case series. Acta Med Indones-Indones J Intern Med 52
6. Mobiny A, Cicalese PA, Zare S, Yuan P, Abavisan MS, Wu CC, Ahuja J, de Groot PM, Nguyen HV (2020) Radiologist-level COVID-19 detection using CT scans with detail-oriented capsule networks
7. Silva P, Luz E, Silva G, Moreira G, Silva R, Lucio D, Menotti D (2020) COVID-19 detection in CT images with deep learning: a voting-based scheme and cross-datasets analysis. Inf Med Unlocked 20. https://doi.org/10.1016/j.imu.2020.100427
8. Polsinelli M, Cinque L, Placidi G (2020) A light CNN for detecting COVID-19 from CT scans of the chest. Pattern Recogn Lett 140:95–100. https://doi.org/10.1016/j.patrec.2020.10.001
9. Shrivastava P, Singh A, Agarwal S, Tekchandani H, Verma S (2021) Covid detection in CT and X-ray images using ensemble learning. In: Proceedings of the 5th international conference on computing methodologies and communication, ICCMC 2021, pp 1085–1090. https://doi.org/10.1109/ICCMC51019.2021.9418308
10. Modi S, Guhathakurta R, Praveen S, Tyagi S, Bansod SN (2021) Detail-oriented capsule network for classification of CT scan images performing the detection of COVID-19. Mater Today: Proc. https://doi.org/10.1016/j.matpr.2021.07.367
11. Nayak SR, Nayak DR, Sinha U, Arora V, Pachori RB (2021) Application of deep learning techniques for detection of COVID-19 cases using chest X-ray images: a comprehensive study. Biomed Signal Process Control 64. https://doi.org/10.1016/j.bspc.2020.102365
12. Hasan NI (2021) A hybrid method of covid-19 patient detection from modified CT-scan/chest-X-ray images combining deep convolutional neural network and two-dimensional empirical mode decomposition. Comput Methods Programs Biomed Update 1:100022. https://doi.org/10.1016/j.cmpbup.2021.100022
13. Tekchandani H, Verma S, Londhe ND (2018) Severity assessment of lymph nodes in CT images using deep learning paradigm. In: 2018 second international conference on computing methodologies and communication (ICCMC), pp 686–691
14. Thakur S, Kumar A (2021) X-ray and CT-scan-based automated detection and classification of covid-19 using convolutional neural networks (CNN). Biomed Signal Process Control 69. https://doi.org/10.1016/j.bspc.2021.102920
15. Aslan MF, Unlersen MF, Sabanci K, Durdu A (2021) CNN-based transfer learning–BiLSTM network: a novel approach for COVID-19 infection detection. Appl Soft Comput 98. https://doi.org/10.1016/j.asoc.2020.106912
16. Mascarenhas S, Agarwal M (2021) A comparison between VGG16, VGG19 and ResNet50 architecture frameworks for image classification. In: 2021 international conference on disruptive technologies for multi-disciplinary research and applications (CENTCON), pp 96–99. https://doi.org/10.1109/CENTCON52345.2021.9687944
17. SARS-COV-2 Ct-Scan Dataset | Kaggle (n.d.) Retrieved March 12, 2022, from https://www.kaggle.com/plameneduardo/sarscov2-ctscan-dataset
18. Szegedy C, Vanhoucke V, Ioffe S, Shlens J, Wojna Z (2016) Rethinking the inception architecture for computer vision. In: Proceedings of the IEEE computer society conference on computer vision and pattern recognition, pp 2818–2826. https://doi.org/10.1109/CVPR.2016.308
19. Paperspace Blog (n.d.) Retrieved April 24, 2022, from https://blog.paperspace.com/popular-deep-learning-architectures-resnet-inceptionv3-squeezenet
20. Gollapudi S (n.d.) Ensemble learning simplified. Retrieved March 27, 2022, from https://medium.datadriveninvestor.com/ensemble-learning-simplified-868a99b6d945
21. Song Y, Zheng S, Li L, Zhang X, Zhang X, Huang Z, Chen J, Wang R, Zhao H, Chong Y, Shen J, Zha Y, Yang Y (2021) Deep learning enables accurate diagnosis of novel coronavirus (COVID-19) with CT images. https://doi.org/10.1109/TCBB.2021.3065361
© 2023 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
Cite this paper
Aggarwal, V., Jat, D.K., Maheshwari, V., Kumar, M. (2023). A Hybrid Ensemble Deep Learning Model for COVID-19 Detection Using CT Scan. In: Rani, A., Kumar, B., Shrivastava, V., Bansal, R.C. (eds) Signals, Machines and Automation. SIGMA 2022. Lecture Notes in Electrical Engineering, vol 1023. Springer, Singapore. https://doi.org/10.1007/978-981-99-0969-8_45
DOI: https://doi.org/10.1007/978-981-99-0969-8_45
Publisher Name: Springer, Singapore
Print ISBN: 978-981-99-0968-1
Online ISBN: 978-981-99-0969-8
eBook Packages: Energy, Energy (R0)