1 Introduction

The outbreak of COVID-19 has increased the need for diagnostic methods that are more effective and faster than manual diagnosis by experts. The huge number of infected people and the insufficient number of medical staff and health facilities in some countries have increased the burden on health systems. At the same time, the widespread use of rapid diagnosis tools, which help in taking measures and suggesting appropriate treatment, is evidence both of how besieged health systems are by the pandemic and of the usefulness of such tools in mitigating the spread of the virus. In recent years, the reliance on machine learning techniques in the medical field has increased dramatically. Roy et al. (2022) discussed the prospects of supervised machine learning (SML) in the healthcare sector, the challenges it faces and how to solve them, and the opportunities that AI and SML offer healthcare in the near future. In general, these techniques have proven to be effective in diagnosing diseases with acceptable accuracy and high speed. Jaiswal et al. (2021) proposed an optimized technique for identifying blindness in retinal images using deep learning models. Ensembles of convolutional neural networks (CNN) have been shown to be an efficient tool for skin cancer detection (Al-Karawi 2022), while segmentation of skin diseases is also possible with CNN-based methods (Huang et al. 2020). Among the many other studies based on medical images, there are applications such as detection and diagnosis of gastric cancer (Cao et al. 2019), breast cancer (Wang et al. 2019), brain tumors (Salçin 2019), pneumonia (Avsar 2021), lung diseases (Kabiraj 2022) and lung cancer (Gunjan et al. 2022; Agarwal 2021). The use of machine learning in the medical field is not limited to diagnosing diseases; it also covers several other domains, such as segmenting medical images (Pal et al. 2022; Rajinikanth et al. 2022) and using the segmented images for specific purposes like predicting the type of the fetal brain (pathological or neurotypical) and the gestational age of the fetus (Gangopadhyay et al. 2022).

Symptoms of COVID-19 vary from person to person; however, the most frequently reported symptoms include fatigue, coughing and shortness of breath. The problem is that these symptoms may also be associated with similar illnesses such as pneumonia (Zayet et al. 2020). Reverse transcription polymerase chain reaction (RT-PCR) tests are currently among the most popular and reliable methods to determine the presence or absence of the virus. However, these tests have several drawbacks. The method is slow, sometimes taking 24 h to return a result. In addition, it puts medical staff at risk of catching the virus because of the physical contact with the patient. It is also expensive and thus inaccessible for poor countries. Therefore, a quick, reliable and cheap way to diagnose COVID-19 and pneumonia infections is necessary and would help in taking the appropriate actions. Chest radiography (chest X-ray) is also a commonly used method for diagnosing lung diseases and detecting COVID-19; however, this method has drawbacks as well. To diagnose diseases from X-ray images, experts are required to inspect them. In addition, it can produce false results because of the similarity between chest X-ray images of people infected with COVID-19 and those with different types of pneumonia.

CNN is a popular machine learning method used to classify images and detect objects. In this work, a sequential CNN architecture is proposed to detect X-ray images belonging to patients with COVID-19, Viral Pneumonia and Bacterial Pneumonia. For benchmarking purposes, the classification performance of the proposed architecture was compared with those obtained by widely used CNN models pretrained on the ImageNet dataset. These benchmark models are MobileNetV2, InceptionResNetV2, ResNetV2, EfficientNet B2, EfficientNet B0, NASNetMobile, InceptionV3, VGG16 and VGG19. These models differ in terms of design, number of parameters and depth, which allows a fair comparison between them and the proposed model. In terms of practical implications, being lightweight allows the proposed model to be used on devices with limited processing capability. In other words, it opens the way to designing and developing cheap auxiliary tools to detect lung diseases.

Many works have been conducted to diagnose COVID-19 and pneumonia; however, most of them merge viral and bacterial lung diseases into one category. This leads to a limited understanding of how CNNs perform in classifying these diseases separately and provides a limited diagnosis scheme. In addition, the number of parameters is not discussed in the models presented in these studies, so it is not clear how well they can work in environments with low resources. Therefore, within the scope of this study, answers to the following research questions are sought:

Q1: How successful is the proposed CNN model in detecting the lung diseases separately (i.e. COVID-19, Viral Pneumonia and Bacterial Pneumonia)?

Q2: Can a light model with a low number of parameters, and thus a low computational cost, perform well for this classification task?

As a result of the experiments performed, a CNN model is proposed to address the limitations of the existing studies. In particular, the contributions of this study are listed below.

  • The proposed model has fewer convolutional layers and parameters than the benchmark models. Therefore, it is a lightweight model that requires a relatively small amount of computation in the training and test phases.

  • It achieves better overall classification results than the benchmark models. In particular, the proposed model is capable of distinguishing COVID-19, viral and bacterial pneumonia cases with a high true detection rate.

  • As a result of being lightweight, the proposed model does not require expensive and powerful hardware, since it includes a relatively low number of parameters. This makes it applicable to devices with low computational power such as edge devices and single-board computers.

The remainder of this paper is organized as follows: In Sect. 2, the existing studies in the related literature are reviewed. In Sect. 3, the dataset, models and performance metrics are introduced. Sections 4 and 5 present the experimental results and the discussion, respectively. Finally, the paper is concluded in Sect. 6.

2 Literature review

Among the various approaches for COVID-19 detection, chest X-ray images are widely used, and hence many studies are available in this context. Thanks to their automated feature extraction capability, convolutional neural networks (CNN) are commonly used for classifying unstructured data such as images. Consequently, there are numerous studies in which chest X-ray images were used together with CNN models for the detection of COVID-19 infections. In some studies, the researchers aim to discriminate the X-ray images of positive COVID-19 cases from healthy X-ray images. However, COVID-19 cases are very likely to be confused with pneumonia infections, which can be bacterial or viral; hence, other studies treat the detection as a three-class or four-class problem in order to detect COVID-19 and pneumonia together.

The number of studies considering a binary problem to distinguish healthy and COVID-19 X-ray images is relatively high. For instance, Reynaldi et al. (2021) used CNN with the ResNet-101 model as an image recognition method to detect COVID-19. The authors used a dataset containing 2562 images categorized as COVID-19 positive (1281 images) and COVID-19 negative (1281 images). Contrast Limited Adaptive Histogram Equalization (CLAHE) was applied as a preprocessing step, and the results showed that the model trained on CLAHE data achieved a better accuracy of 99.61% compared with 99.22% on the raw data. In addition, Hemdan et al. (2003) used several deep convolutional neural network models (VGG19, DenseNet201, InceptionV3, ResNetV2, InceptionResNetV2, Xception and MobileNetV2) to classify X-ray images as COVID-19 positive or negative. The authors used a dataset of 50 chest X-ray images that includes 25 positive and 25 healthy cases. The results showed that VGG19 and DenseNet201 provided the highest classification performance with an accuracy of 90%. Narin et al. (2021) used five pre-trained convolutional neural network models, namely ResNet50, ResNet101, ResNet152, InceptionV3 and InceptionResNetV2, to detect COVID-19. The dataset they used contains 7396 chest X-ray images classified as 341 COVID-19, 2800 Normal, 2772 Bacterial Pneumonia and 1493 Viral Pneumonia images. The dataset was divided into three binary-class datasets: dataset-1 contains the COVID-19 and Normal classes, dataset-2 contains the COVID-19 and Viral Pneumonia classes, while dataset-3 contains the COVID-19 and Bacterial Pneumonia classes. The ResNet50 model achieved the best classification results with accuracies of 96.1%, 99.5% and 99.7% for dataset-1, dataset-2 and dataset-3, respectively. Ohata et al. (2020) used transfer learning models as feature extractors to detect COVID-19. The transfer learning models used in this work are VGG16, VGG19, InceptionV3, InceptionResNetV2, ResNet50, NASNetLarge, NASNetMobile, Xception, MobileNet, DenseNet121, DenseNet169 and DenseNet201. These models were combined with several classifiers such as k-Nearest Neighbor, Bayes, Random Forest, Multilayer Perceptron (MLP) and Support Vector Machine (SVM). The authors used two datasets that share the same images for the COVID-19 class but have different images for the healthy class. The datasets are balanced and consist of 194 images for each class. The results showed that the MobileNet model with the SVM classifier (linear kernel) achieved the best mean accuracy of 98.46% for one of the datasets, while the DenseNet201 model with the MLP classifier was the best for the other dataset with a mean accuracy of 95.64%.

In another work with binary-class images, Breve et al. (2011) performed a set of exhaustive classification experiments. For the COVID-19 detection problem, they used 21 different CNN models from the VGG, ResNet, DenseNet and EfficientNet families and their derivatives (e.g. DenseNet121, EfficientNetB1, ResNet152). In addition, ensembles of these CNN models were also employed. Their dataset contains 16,352 chest X-ray images, where 2358 images are COVID-19 positive and 13,994 are COVID-19 negative. The negative data includes images with non-COVID-19 pneumonia. The results showed that DenseNet169 achieved the best results with an accuracy and F1-score of 98.15% and 98.12%, respectively. The ensemble approach increased the accuracy and F1-score of DenseNet169 to 99.25% and 99.24%, respectively. Maheen et al. (2010) used different pre-trained CNN models to detect COVID-19 from chest X-ray images. The models are AlexNet, VGG-16, MobileNet-V2, SqueezeNet, ResNet-34, ResNet-50 and COVIDX-Net. The dataset contains 406 images distributed evenly between the COVID-19 and healthy classes. ResNet-34 achieved the best prediction accuracy of 98.33%. Shenoy et al. (2010) proposed a new CNN model to detect COVID-19. A dataset containing 4316 chest X-ray images (2158 COVID-19-negative scans and 2158 COVID-19-positive scans) was used, together with data augmentation. The model achieved an accuracy of 95.5%. Hasoon et al. (2021) proposed several methods that combine image processing and classifiers (i.e. K-Nearest Neighbor (KNN) and Support Vector Machine (SVM)) for classification and early detection of COVID-19. The dataset includes normal and COVID-19 pneumonia X-ray images. The method that combines Local Binary Pattern (LBP) and KNN achieved the best accuracy of 99%. Mohammed et al. (2022) proposed an integrated method for selecting the optimal deep learning model based on a novel crow swarm optimization algorithm for COVID-19 diagnosis. The ResNet50 model achieved the best accuracy of 91.46%.

Detection of pneumonia together with COVID-19 has also been considered by many researchers; in that case, it becomes a three-class problem. One of the methods was proposed by Montalbo (2021), where DenseNet121 was modified to classify the Normal, COVID-19 and Pneumonia (Bacterial and Viral) classes. The resulting model, which has fewer parameters and a lower depth than the original one, achieved an accuracy of 97.99%. It did not achieve a better accuracy than the base model but was shown to outperform some state-of-the-art deep convolutional neural network models. In another study, the same author (Montalbo 2022) applied a truncation method to various well-known deep convolutional neural networks to reduce their number of parameters and make them applicable with low computing resources. Chest X-ray images were used, and the results showed that the InceptionResNetV2 model achieved the best accuracy of 97.41% in three-class classification (Normal, COVID-19 and Pneumonia) after truncating it and reducing its parameters to 441 K. In addition, Shome et al. (2021) proposed a vision transformer-based deep learning pipeline for detecting COVID-19 using chest X-ray images. A three-class dataset (Normal, COVID-19 and Pneumonia) containing 30 K chest X-ray images (10 K per class) was used, and the proposed model achieved an accuracy of 98% for binary classification (Normal and COVID-19) and 92% for multi-class classification. Nagi et al. (2022) used a relatively large dataset to evaluate the performance of deep learning. The Xception model was the best in terms of accuracy, achieving 94.21%, while the Custom-Model (the model proposed in that study) achieved an accuracy of 92.38%.

Transfer learning is a widely utilized practical tool in this three-class problem as well. Makris et al. (2020) used several well-known CNN models with a dataset containing 336 chest X-ray images. According to the results, VGG16 and VGG19 achieved the best accuracy score of 95%. El Asnaoui et al. (2020) used well-known CNN architectures, namely DenseNet201, InceptionV3, InceptionResNetV2, ResNet50, MobileNetV2, VGG16 and VGG19, to classify COVID-19. The database used in this work contains 6087 X-ray and CT images (231 COVID-19, 1493 Viral Pneumonia, 2780 Bacterial Pneumonia and 1583 Normal images). The COVID-19 and Viral Pneumonia classes were considered as one class in the classification process. InceptionResNetV2 and DenseNet201 achieved the best results with accuracies of 92.18% and 88.09%, respectively. Alqudah et al. (2020) used pretrained and proposed models such as ShuffleNet, MobileNet and AOCTNet to extract automated features from the images, and then passed these features to Softmax, Support Vector Machine (SVM), Random Forest (RF) and K-Nearest Neighbor (KNN) classifiers. It was shown that the features extracted by MobileNet yielded the best accuracy.

In addition to modifications of available transfer learning models, there are other studies in which specific CNN architectures are proposed. For instance, Antonchuk et al. (2021) proposed a new CNN model for detecting COVID-19 and influenza cases. The model achieved an accuracy score of 93% on a dataset consisting of 4152 X-ray images per class. The CNN architecture proposed by Atitallah et al. (2023) was tested on two different datasets. The first dataset (COVIDx) contains 15,475 chest X-ray images (8851 Normal, 6053 Pneumonia and 571 COVID-19), while the other (Enhanced COVID-19) includes 1092 chest X-ray images (364 images per class). Data augmentation was applied to both datasets, and a class-weighting method was applied to the COVIDx dataset to re-balance it. The results showed that the proposed model achieved accuracies of 94% and 99% for the COVIDx and Enhanced COVID-19 datasets, respectively. Liu et al. (2022) proposed an approach comprising several stages: EfficientNetV2 was used as the backbone network, followed by ResNet101 (feature fusion), a Convolutional Block Attention Module and SVM classifiers, respectively. The dataset contains three classes (COVID-19, Normal and Pneumonia), and data augmentation was applied. The results showed that the system achieved an accuracy of 99.89%.

Different from the studies considering Viral and Bacterial Pneumonia as a single class, it is possible to take them as separate classes and eventually obtain a four-class problem. One example of such work was proposed by Zeiser et al. (2021). In their work, pretrained DenseNet121, InceptionResNetV2, InceptionV3, MobileNetV2, ResNet50 and VGG16 models were used for classification of the X-ray images, together with CLAHE as a preprocessing method. Their dataset contains 5181 images categorized into four classes: COVID-19, Normal, Viral Pneumonia and Bacterial Pneumonia. The results showed that VGG16 achieved the best classification performance with an accuracy of 85.11%, sensitivity of 85.25%, specificity of 85.16% and F1-score of 85.03%. Bolhassani (2021) used an unbalanced chest X-ray dataset together with ResNet50 and DenseNet121 models. To eliminate the effect of class imbalance, they applied data augmentation and achieved an accuracy score of 80.0%. Sait et al. (2021) proposed a model based on InceptionV3 and a multilayer perceptron. A dataset consisting of four classes (COVID-19, Normal, Bacterial and Viral Pneumonia) of chest X-ray images was used without data augmentation. The dataset was split into training and validation sets with a ratio of 80:20. It should be noted that the authors did not set aside part of the dataset as test data, which is important to check the robustness of the model's performance. The proposed model achieved a validation accuracy of 91.3% on the chest X-ray images. In a study focused on determining the seriousness of lung disease using chest X-ray images, Rajinikanth et al. (2022) implemented a pre-trained InceptionV3 scheme with chosen multi-class classifiers to detect pneumonia and check its severity level. The dataset contains four classes (Normal, Mild, Moderate and Severe Pneumonia). The best result in this work was achieved by the K-Nearest Neighbor (KNN) classifier with an accuracy of 85.18%.

Based on the explanations above, the existing studies on pneumonia and COVID-19 detection using X-ray images are summarized in Table 1. As can be seen, there are very different approaches to pneumonia and COVID-19 diagnosis in the literature; however, the majority of these studies either treat it as a binary-class problem or merge viral and bacterial pneumonia into one class. In other words, the analysis of the three mentioned lung diseases is very limited. In addition, most of the works that propose new models do not consider the computational load of the model. Typically, deeper models with more convolutional layers may achieve better feature extraction and eventually more successful classification. However, such models have major drawbacks, such as requiring a large number of images and expensive hardware with heavy computational capability. This problem is present especially in studies considering the four-class problem (healthy, COVID-19, viral pneumonia and bacterial pneumonia). Therefore, this situation is addressed to some extent in this study by proposing a model with a reduced number of convolutional layers and weights. Hence, it becomes more suitable for the detection task to be executed on a wider range of digital devices, including those with relatively low computational power.

Table 1 Summary of related works

3 Methods

3.1 Dataset

In this work, a publicly available dataset of chest X-ray images has been used (Sait et al. 2020). The dataset contains 9207 chest X-ray images categorized as follows: 3269 normal, 1281 COVID-19, 3001 bacterial pneumonia and 1656 viral pneumonia images. Figure 1 shows some sample images from the dataset.

Fig. 1 Samples of a normal, b COVID-19, c Bacterial Pneumonia and d Viral Pneumonia chest X-ray images

The dataset was divided into training, validation and test sets with ratios of 60%, 20% and 20%, respectively. After dividing the dataset, a data augmentation technique (Mikołajczyk 2018) was applied to the training set. Data augmentation is used to increase the number of images in a dataset; this increases its diversity and reduces the risk of overfitting. Horizontal flip and shifting operations were applied, with shift ratios of 10%, 30% and 50% used for both width and height shifts. Table 2 shows the number of images in the training set before and after applying data augmentation.

Table 2 The number of images in training set before and after applying data augmentation
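As an illustration, such an augmentation setup could be configured with Keras' ImageDataGenerator as in the sketch below; the shift ratios mirror the percentages stated above, while the directory path, target size and batch size are illustrative assumptions rather than the exact configuration used in this work.

```python
# Sketch of the described augmentation using Keras. The shift ratios mirror the
# text (10%, 30% and 50%); the directory path, target size and batch size are
# illustrative assumptions.
from tensorflow.keras.preprocessing.image import ImageDataGenerator

shift_ratios = [0.1, 0.3, 0.5]

augmenters = [
    ImageDataGenerator(
        horizontal_flip=True,       # horizontal flip, as stated above
        width_shift_range=ratio,    # shift by a fraction of the image width
        height_shift_range=ratio,   # shift by a fraction of the image height
    )
    for ratio in shift_ratios
]

# Example: stream augmented training batches from a (hypothetical) directory.
train_flow = augmenters[0].flow_from_directory(
    "data/train",
    target_size=(224, 224),
    batch_size=32,
    class_mode="categorical",
)
```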

3.2 The proposed CNN architecture

For detecting COVID-19, viral and bacterial pneumonia samples, a lightweight sequential CNN architecture with a small number of parameters is proposed. The successive convolutional and pooling layers in the model are followed by fully connected layers. Finally, the softmax function is used in the last layer of the classification part for the final prediction. Figure 2 shows a generic CNN architecture with convolutional, pooling and fully connected layers. In the feature extraction part of the proposed model, there are five convolutional and pooling layers, while the classification part involves three dense layers with dropout layers in between.

Fig. 2 A generic architecture of a sequential CNN

As Fig. 2 shows, the convolutional layers receive the input image and convolve it with filters of specific dimensions. This process produces an output known as a feature map. The feature map is then processed by a pooling layer and an activation function. The rectified linear unit (ReLU) was used as the nonlinear activation function due to its ability to accelerate the training process and mitigate the vanishing gradient problem. ReLU maps all negative inputs to zero, while positive inputs pass without any change, as Fig. 3 shows. The mathematical expression of ReLU is:

Fig. 3 ReLU function

$$f(x)=\begin{cases}0, & x<0\\ x, & x\ge 0\end{cases}$$

Pooling layers are responsible for reducing the size of the feature maps; the max-pooling operation was used in the proposed model. It is accomplished by a two-dimensional filter that passes over the feature map, and max-pooling selects the maximum value covered by the filter. This process leads to a lower number of parameters in the model and can speed up the computation. The max-pooling operation with a filter size of 2 × 2 and a stride of 2 is illustrated in Fig. 4.

Fig. 4 Max Pooling operation using filter of 2 × 2 and flattening process

After passing the input through several convolutional and max-pooling layers, the flatten layer converts the resulting two-dimensional, multichannel feature map into a one-dimensional vector. This operation is important because the fully connected layer expects a vector as input. The operation of the flatten layer is shown in Fig. 4.
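A small numerical example, assuming TensorFlow/Keras, illustrates the 2 × 2 max-pooling with stride 2 and the subsequent flattening on a toy 4 × 4 feature map:

```python
import numpy as np
import tensorflow as tf

# A toy 4x4 single-channel feature map (batch size 1) to illustrate Fig. 4.
feature_map = np.arange(16, dtype="float32").reshape(1, 4, 4, 1)

pooled = tf.keras.layers.MaxPooling2D(pool_size=(2, 2), strides=2)(feature_map)
flattened = tf.keras.layers.Flatten()(pooled)

print(pooled.shape)     # (1, 2, 2, 1): each 2x2 window is reduced to its maximum
print(flattened.shape)  # (1, 4): the pooled map unrolled into a vector for the FC layers
```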

Fully connected (FC) layers are responsible for the final classification. They consist of input, hidden and output layers, each containing many neurons. Softmax was chosen as the activation function of the output layer because it converts the output into a probability distribution. The mathematical expression of softmax is:

$$\sigma(z)_i = \frac{e^{z_i}}{\sum_{j=1}^{K} e^{z_j}}$$
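A brief numerical illustration of the softmax expression is given below; the logits are arbitrary example values for the four classes considered in this study.

```python
import numpy as np

def softmax(z):
    # Subtracting the maximum improves numerical stability without changing the result.
    e = np.exp(z - np.max(z))
    return e / e.sum()

logits = np.array([2.0, 1.0, 0.5, 0.1])   # arbitrary raw scores for the four classes
print(softmax(logits))                    # a probability distribution over the classes
print(softmax(logits).sum())              # sums to 1.0
```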

Dropout layers were added to prevent overfitting and provide better generalization of the model. Dropout layers randomly deactivate some neurons in the fully connected layers during the training process; the fraction of such neurons is determined by the user-defined dropout rate. Tables 3 and 4 list the hyperparameter values for each layer in the proposed model. These values, which affect the model performance, were determined empirically while respecting the constraint that the model should have a small number of convolutional layers and weights.

Table 3 The hyperparameters of the feature extraction part in the proposed model
Table 4 The hyperparameters of the classification part in the proposed model
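A structural sketch of the proposed sequential CNN is given below, assuming TensorFlow/Keras. The arrangement follows the description above (five convolution and max-pooling stages, a flatten layer, and three dense layers with dropout in between), but the filter counts, kernel sizes, dense units, dropout rates and the assumed three-channel 224 × 224 input are placeholders; the actual values are those listed in Tables 3 and 4.

```python
# Structural sketch of the proposed sequential CNN: five convolution +
# max-pooling blocks, then flatten, three dense layers with dropout in
# between, and a 4-unit softmax output. All numeric values are placeholders;
# the actual hyperparameters are those reported in Tables 3 and 4.
from tensorflow.keras import layers, models

def build_proposed_cnn(input_shape=(224, 224, 3), num_classes=4):
    model = models.Sequential()
    model.add(layers.Input(shape=input_shape))

    # Feature extraction part: five convolutional layers, each followed by max pooling.
    for filters in (32, 32, 64, 64, 128):      # placeholder filter counts
        model.add(layers.Conv2D(filters, (3, 3), activation="relu", padding="same"))
        model.add(layers.MaxPooling2D(pool_size=(2, 2)))

    # Classification part: flatten, then three dense layers with dropout in between.
    model.add(layers.Flatten())
    model.add(layers.Dense(128, activation="relu"))   # placeholder units
    model.add(layers.Dropout(0.5))                    # placeholder dropout rate
    model.add(layers.Dense(64, activation="relu"))
    model.add(layers.Dropout(0.5))
    model.add(layers.Dense(num_classes, activation="softmax"))
    return model

model = build_proposed_cnn()
model.summary()  # inspect the layer shapes and the resulting parameter count
```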

3.3 Transfer learning models

For comparison with the proposed model, transfer learning models were used. Transfer learning is a machine learning technique that uses the weights of pre-trained models as a starting point for training on a new task with a new dataset. Accordingly, the images in our dataset were fed into different pre-trained CNN models, which differ in input image size, number of layers and number of parameters. The models are EfficientNet B0 (Tan and Le 2019), EfficientNet B2 (Tan and Le 2019), InceptionV3 (Szegedy et al. 2016), InceptionResNetV2 (Szegedy et al. 2017), MobileNetV2 (Sandler et al. 2018), NASNetMobile (Zoph et al. 2018), ResNetV2_152 (He et al. 2016), VGG16 (Simonyan and Zisserman 2023) and VGG19 (Simonyan and Zisserman 2023). These models were trained on the ImageNet dataset (Russakovsky et al. 2015). The weights of all layers in these models were frozen except for the output layer, which was replaced with a four-unit layer using a softmax activation function. Table 5 shows the input image dimensions, the total number of parameters and the number of trainable parameters for the transfer learning models and the proposed model.

Table 5 Number of parameters and trainable parameters in transfer learning models and the proposed model
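This benchmark setup can be sketched as follows, using EfficientNet B0 as one representative model; the global-average-pooling head is an assumption, since the text only specifies that the pretrained layers were frozen and the output layer was replaced by four softmax units.

```python
# Transfer-learning sketch with EfficientNet B0 as a representative benchmark:
# ImageNet weights are frozen and a new 4-unit softmax output is attached.
# The global-average-pooling head is an assumption.
import tensorflow as tf
from tensorflow.keras import layers, models

base = tf.keras.applications.EfficientNetB0(
    weights="imagenet",        # ImageNet-pretrained weights
    include_top=False,         # drop the original 1000-class classifier
    input_shape=(224, 224, 3),
)
base.trainable = False         # freeze all pretrained layers

model = models.Sequential([
    base,
    layers.GlobalAveragePooling2D(),
    layers.Dense(4, activation="softmax"),  # Normal, COVID-19, Bacterial, Viral Pneumonia
])
```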

3.4 Evaluation metrics

Standard metrics such as accuracy, precision, recall and F1-score were considered for evaluating the pre-trained models and the proposed model. The components of the confusion matrix shown in Table 6 were used for calculating these metrics.

Table 6 The confusion matrix

The variable $P_{CC}$ refers to the number of predictions in which images labeled as COVID-19 were correctly classified as COVID-19; it represents the True Positives ($TP_{Covid19}$) of the COVID-19 class. On the other hand, $P_{NC}$, $P_{BC}$ and $P_{VC}$ represent the COVID-19 images incorrectly labeled as Normal, Bacterial Pneumonia and Viral Pneumonia, respectively.

The True Negatives (TN) for each class can be calculated by summing all entries of the confusion matrix except those in the row and column of the class under study. The following equation shows the True Negatives of the COVID-19 class:

$${TN}_{Covid19}= {P}_{NN}+ {P}_{BN}+ {P}_{VN}+ {P}_{NB}+ {P}_{BB}+ {P}_{VB}+ {P}_{NV}+ {P}_{BV}+ {P}_{VV}$$

The False Positives (FP) are the sum of all values in the column of the class under study except the true positive value. The equation for the false positives of the COVID-19 class is:

$${FP}_{Covid19}= {P}_{CN}+ {P}_{CB}+ {P}_{CV}$$

The False Negatives (FN) are the sum of all values in the row of the class under study except the true positive value. The equation for the false negatives of the COVID-19 class is:

$${FN}_{Covid19}= {P}_{NC}+ {P}_{BC}+ {P}_{VC}$$

The remaining components of the confusion matrix can be explained and calculated in the same manner. Using these values, the metrics are calculated as given below:

$$Precision= \frac{TP}{FP+TP}$$
$$Recall= \frac{TP}{TP+FN}$$
$$F1 Score=2\times \left(\frac{Precision \times Recall}{Precision + Recall}\right)$$

The accuracy was calculated on the basis of the class-specific values, where the total number of true positives is divided by the total number of samples in the test set. As a result, the same accuracy value is obtained for every class using the formula below:

$$Accuracy= \frac{P_{CC}+P_{NN}+P_{BB}+P_{VV}}{\#\text{ samples in the test set}}$$
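The per-class metrics and the overall accuracy defined above can be computed from a 4 × 4 confusion matrix as in the following sketch; the matrix entries are made-up values for illustration only, with rows taken as true classes and columns as predicted classes.

```python
import numpy as np

# Illustrative 4x4 confusion matrix (rows: true class, columns: predicted class)
# in the order COVID-19, Normal, Bacterial Pneumonia, Viral Pneumonia.
# The entries are made-up numbers, not results from this study.
cm = np.array([
    [250,   2,   3,   1],
    [  4, 640,   8,   2],
    [  3,   6, 520,  70],
    [  2,   4,  90, 235],
])

accuracy = np.trace(cm) / cm.sum()            # sum of diagonal over all test samples
for i, name in enumerate(["COVID-19", "Normal", "Bacterial", "Viral"]):
    tp = cm[i, i]
    fp = cm[:, i].sum() - tp                  # column sum minus the diagonal entry
    fn = cm[i, :].sum() - tp                  # row sum minus the diagonal entry
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    f1 = 2 * precision * recall / (precision + recall)
    print(f"{name}: precision={precision:.3f}, recall={recall:.3f}, F1={f1:.3f}")
print(f"overall accuracy={accuracy:.3f}")
```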

3.5 Hyperparameters tuning

The Adam optimizer (Kingma and Ba 2023) was used for training all the models mentioned in this work. The exponential decay rates (beta 1 and beta 2) for the first and second moment estimates were set to their default values of 0.9 and 0.999, respectively. The learning rate was chosen as 0.001. Batch sizes of 32 and 64 were used for all the experiments. The number of epochs was chosen as 50 for the proposed model, while it was 1, 3 and 5 for the pre-trained models; it was not increased further because no improvement in performance was observed. Table 7 summarizes these hyperparameter values.

Table 7 Hyperparameter configuration
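A sketch of the corresponding training configuration is shown below; the categorical cross-entropy loss and the array-based training data (`x_train`, `y_train`, `x_val`, `y_val`) are assumptions, while the optimizer settings, epochs and batch sizes follow Table 7.

```python
# Training configuration following Table 7. The categorical cross-entropy loss and
# the array-based data (x_train, y_train, x_val, y_val) are assumptions; `model` is
# one of the networks sketched above.
from tensorflow.keras.optimizers import Adam

model.compile(
    optimizer=Adam(learning_rate=0.001, beta_1=0.9, beta_2=0.999),  # Table 7 values
    loss="categorical_crossentropy",
    metrics=["accuracy"],
)

history = model.fit(
    x_train, y_train,
    validation_data=(x_val, y_val),
    epochs=50,        # 50 for the proposed model; 1, 3 and 5 were used for the pretrained models
    batch_size=32,    # batch sizes of 32 and 64 were both evaluated
)
```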

The code was written in Python (version 3.6.13) using the TensorFlow and Keras libraries. It was executed on a Radeon RX 580 GPU.

4 Experimental results

4.1 Transfer learning results

The models mentioned in Sect. 3.3 were trained, and a confusion matrix was generated for each experiment. Table 8 shows the best result achieved by each model, with the corresponding confusion matrix, prediction accuracy, precision, recall and F1-score. These transfer learning experiments on the widely used benchmark models allow identifying the best performing model together with the appropriate number of epochs and batch size.

Table 8 Prediction results of the benchmark models

As shown in Table 8, the EfficientNet B2 model achieved the best overall prediction accuracy. It was the best at predicting the images labeled as COVID-19, with a precision of 0.99 and a recall of 0.98, and can therefore be considered the most appropriate model for identifying COVID-19 images. The model also showed good performance in predicting the images labeled as Normal (healthy). Its performance declined when predicting Bacterial and Viral Pneumonia, where the number of misclassifications was high. ResNetV2_152 achieved the second-best prediction accuracy; however, it did not achieve satisfying results in COVID-19 prediction compared to the EfficientNet B2 model. On the other hand, ResNetV2_152 was the best among the other models at detecting Viral Pneumonia. With regard to Bacterial Pneumonia, InceptionResNetV2 achieved the best accuracy for this class; however, its accuracy in classifying Viral Pneumonia drops sharply.

On the other hand, the VGG models achieved the lowest overall accuracy. These two models (especially VGG16) were not able to predict COVID-19 images properly and gave very low sensitivity for the Viral Pneumonia class. The large number of parameters and very deep structure of these two models may be the reason for their poor performance. This suggests that models with a large number of parameters may not always be suitable for problems with a relatively small number of classes.

It is notable from Table 8 that most of the misclassifications belong to the Bacterial and Viral Pneumonia classes, which decreased the overall accuracy of the models. It is possible that combining these two classes and making the classification three-class (i.e. Pneumonia, COVID-19 and Normal) instead of four-class would give a higher accuracy, as many studies have shown. However, separating these two classes gives a better overview of the ability of these models to identify the lung diseases and certainly provides a more specific diagnosis.

4.2 The proposed model results

The number of parameters in the proposed model is significantly lower than in the transfer learning models (see Table 5). In addition, the model has fewer layers (lower depth). This helps to highlight the effect of depth and the number of parameters on the prediction results.

The proposed model was trained from scratch (unlike transfer learning) with batch sizes of 32 and 64. The number of epochs was chosen as 50 because no improvement in the training accuracy was observed beyond this number. The images were resized to 224 × 224 × 2. The weights of the epoch that gave the highest validation accuracy were used for testing the model. The corresponding detailed results are given in Tables 9 and 10.

Table 9 Prediction performance of the proposed model
Table 10 Performance metrics of the proposed model
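Keeping the weights of the epoch with the highest validation accuracy, as described above, could be implemented with a checkpoint callback such as the following sketch; the file path and the assumed data arrays are illustrative.

```python
# Keeping the best-validation-accuracy weights via a checkpoint callback; the file
# path is an illustrative assumption.
from tensorflow.keras.callbacks import ModelCheckpoint

checkpoint = ModelCheckpoint(
    "best_weights.h5",
    monitor="val_accuracy",    # track validation accuracy after every epoch
    save_best_only=True,       # overwrite only when validation accuracy improves
    save_weights_only=True,
)

model.fit(x_train, y_train,
          validation_data=(x_val, y_val),
          epochs=50, batch_size=64,
          callbacks=[checkpoint])

model.load_weights("best_weights.h5")        # restore the best epoch's weights
test_loss, test_acc = model.evaluate(x_test, y_test)
```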

Table 9 shows that the proposed model with a batch size of 64 achieved the best prediction accuracy of 89.89%. This result indicates that the proposed model was better at detecting the diseases than the transfer learning models listed in Table 5. In particular, the proposed model was better at detecting the Viral and Bacterial Pneumonia classes, for which its precision and recall values are higher than those of the transfer learning models.

On the other hand, when the performance values in Tables 8 and 10 are compared, it can be observed that the EfficientNet B2 and B0 models were slightly better at predicting the COVID-19 class than the proposed model. The precision and recall values for COVID-19 detection with the EfficientNet B2 model were 0.99 and 0.98, respectively, while they were 0.98 and 0.96 for the proposed model.

As mentioned before, the proposed model has a relatively low number of parameters and layers compared to the pre-trained models. Apparently, this relatively small capacity was sufficient for the model to extract good features and perform the classification task, and it therefore achieved a better overall accuracy. In contrast, a high number of layers, as in the benchmark models, may have a negative effect on a classification task with a small number of classes.

The average training and prediction times of the benchmark models and the proposed model for a single image were also calculated and compared (Table 11). Among the benchmark models, EfficientNet B0 and InceptionResNetV2 are the most and least time-consuming models in the training process, respectively. With regard to the prediction time, VGG16 and InceptionResNetV2 are the most and least time-consuming models, respectively. When the time consumption of the proposed model is compared with the benchmark models, it is notable that the proposed model is significantly faster in both the training and prediction phases.

Table 11 The training and prediction time for all the models

4.3 Ablation study

An ablation study was conducted on the proposed model to better understand the network's behavior and to justify its robustness. Ablation in machine learning means deleting part of the network and training the model again to check the function or effect of the deleted layer on the overall performance. For this purpose, one convolutional layer at a time was deleted from the network and the results were recorded. Table 12 shows the details of the best prediction accuracy after deleting each layer separately.

Table 12 Best prediction accuracy after applying the ablation
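The ablation procedure can be sketched as below, assuming the placeholder architecture of Sect. 3.2 and assuming that each convolutional layer is removed together with its pooling layer, which would also be consistent with the parameter growth reported in Table 13; the filter counts and dense units remain placeholders.

```python
# Hedged sketch of one ablation variant: the i-th convolution + pooling block is
# removed and the network is retrained. Removing a pooling stage leaves a larger
# feature map at the flatten layer, so the dense layers gain parameters, which is
# consistent with the growth reported in Table 13. Numeric values are placeholders.
from tensorflow.keras import layers, models

def build_ablated_cnn(skip_block, filters=(32, 32, 64, 64, 128),
                      input_shape=(224, 224, 3), num_classes=4):
    model = models.Sequential()
    model.add(layers.Input(shape=input_shape))
    for i, f in enumerate(filters):
        if i == skip_block:                      # drop this convolution + pooling block
            continue
        model.add(layers.Conv2D(f, (3, 3), activation="relu", padding="same"))
        model.add(layers.MaxPooling2D(pool_size=(2, 2)))
    model.add(layers.Flatten())
    model.add(layers.Dense(128, activation="relu"))
    model.add(layers.Dropout(0.5))
    model.add(layers.Dense(64, activation="relu"))
    model.add(layers.Dropout(0.5))
    model.add(layers.Dense(num_classes, activation="softmax"))
    return model

# Example: remove the third block and inspect how the parameter count changes.
ablated = build_ablated_cnn(skip_block=2)
ablated.summary()
```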

Comparing the results in Table 12 with the results of the proposed model in Table 9, it is notable that the prediction accuracy decreases when the ablated models are used. As for the confusion matrices, the entries corresponding to false predictions are generally higher. In addition, the gap between the training and validation accuracies increases during the ablation study, meaning that the model becomes prone to overfitting when some layers are removed. Moreover, the difference appears clearly in the number of parameters and in the training and prediction times, as Table 13 shows.

Table 13 The consumed time after the ablation process

As expected, comparing Tables 5, 11 and 13 shows that sequentially ablating the convolutional layers caused a significant increase in the number of parameters, and thus an increase in the time required for training and prediction. This increase in the number of parameters did not lead to an increase in the prediction accuracy, as Table 12 shows. This indicates that the low number of parameters in the original model is sufficient to achieve the prediction task.

4.4 Optimizer effect

Optimizers are algorithms used to update the weights of neural networks to reduce the overall loss and increase the performance. The effect of using different optimizers on the detection performance of the proposed model was also evaluated. For this purpose, optimizers such as Adaptive Gradient (AdaGrad) (Duchi et al. 2011) and Stochastic Gradient Descent (SGD) (Bottou 2012) were used. Table 14 shows the best prediction accuracy obtained by the proposed model with the SGD and AdaGrad optimizers.

Table 14 The results of using different types of optimizers
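The optimizer comparison can be sketched as follows; the learning rates for SGD and AdaGrad, the loss function and the data arrays are assumptions, and `build_proposed_cnn` refers to the placeholder builder of Sect. 3.2.

```python
# Sketch of the optimizer comparison: the same architecture is rebuilt (to reset
# the weights), compiled with a different optimizer and retrained. Learning rates
# and the loss are assumptions; `build_proposed_cnn` and the data arrays are the
# placeholders introduced earlier.
from tensorflow.keras.optimizers import SGD, Adagrad

for name, optimizer in [("SGD", SGD(learning_rate=0.001)),
                        ("AdaGrad", Adagrad(learning_rate=0.001))]:
    model = build_proposed_cnn()
    model.compile(optimizer=optimizer,
                  loss="categorical_crossentropy",
                  metrics=["accuracy"])
    model.fit(x_train, y_train, validation_data=(x_val, y_val),
              epochs=50, batch_size=64, verbose=0)
    loss, acc = model.evaluate(x_test, y_test, verbose=0)
    print(f"{name}: test accuracy = {acc:.4f}")
```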

The results in Table 14 show that the proposed model with a batch size of 64 and the AdaGrad optimizer achieved a prediction accuracy of 89.13%, which is slightly lower than the result obtained with the Adam optimizer. On the other hand, SGD failed to achieve a competitive prediction accuracy. The training accuracy with the SGD optimizer did not exceed 87.32% after 50 epochs of training, which indicates slow convergence.

In general, the AdaGrad optimizer adapts the learning rate at each iteration depending on how the parameters change during the training process. This feature may have led to a better accuracy than the SGD optimizer.

Comparing the confusion matrices obtained with the Adam optimizer (Table 9) and the AdaGrad optimizer (Table 14), it is notable that AdaGrad performs better at detecting Viral Pneumonia. However, the Adam optimizer led to better results in classifying the rest of the classes and gave a better overall accuracy.

5 Discussion

The results show that the proposed model is able to outperform the benchmark models in detecting the lung diseases. The proposed model achieved an overall accuracy of 89.89%, while EfficientNet B2, the best among the benchmark models, had an overall accuracy of 85.7%.

Figure 5 shows that the benchmark models EfficientNet B0 and EfficientNet B2 were slightly better at detecting COVID-19 than the proposed model. In addition, the MobileNetV2 model was slightly more accurate in detecting the Normal class. On the other hand, the proposed model was much better at detecting the Bacterial Pneumonia class than the benchmark models and slightly better at detecting the Viral Pneumonia class.

Fig. 5 Comparison of the number of true positives between the different models

Examining the confusion matrices of the proposed model and the benchmark models in Tables 8 and 9, it is notable that all these models show a significant decrease in accuracy when detecting the pneumonia classes (i.e. Viral and Bacterial Pneumonia) compared with the COVID-19 and Normal classes. The tables show that the accuracy of predicting Viral Pneumonia is much lower than that of predicting Bacterial Pneumonia. In addition, most of the misclassifications in these two classes are due to the models confusing Viral Pneumonia with Bacterial Pneumonia, and vice versa. The reason for this may be the small number of samples in the Viral Pneumonia data compared with the Bacterial Pneumonia data, as shown in Table 2.

The imbalance in the data for these two classes (i.e. Bacterial and Viral Pneumonia) possibly had a negative effect on the training process and made the models unable to differentiate between these two diseases. This limitation can be addressed in future work if new chest X-ray images are obtained and added to the dataset. Obviously, obtaining and accessing data, especially medical data, is one of the difficulties that hamper researchers because of privacy concerns.

The low number of parameters is an additional advantage that characterizes the proposed model in this study. The proposed model has around 1 million parameters, while the EfficientNet B2 model, which achieved the best prediction accuracy among the pre-trained models, has 7.7 million parameters, as Table 5 shows. The low number of parameters and layers in the proposed model leads to lower prediction and training times compared with the pre-trained models (Table 11). This means that the proposed model is faster, needs fewer resources and is better suited to operate in places that do not have high computing power.

Table 5 also shows that the proposed model has fewer parameters than the MobileNetV2 model, which was designed to work with fewer operations. The low depth of the proposed model may have a positive effect on the classification accuracy for chest X-ray images, whereas deeper models can degrade the extracted features and thus lower the accuracy in tasks with a relatively small number of classes.

In the study closest to this work, Sait et al. (2021) (mentioned in Table 1) used the same dataset with the same number and types of classes to check the ability of CNNs to classify lung diseases. However, in their work, the authors did not use data augmentation techniques to increase the diversity of the dataset and did not verify the efficiency of their proposed model on a test set. The results of both studies are summarized in Table 15.

Table 15 Comparison between this work and (Sait et al. 2021)

Table 15 shows that the model proposed in this study has a much lower number of parameters than the model (based on InceptionV3) proposed in Sait et al. (2021). This means that our model requires lighter computing resources and thus trains faster. Another notable property of the other study is that the dataset was split into training and validation sets only; in other words, only the validation set was utilized for the final evaluation of the proposed method, without a test set. It is a well-known convention in such machine learning problems that model performance is assessed on a separate set of samples that are not used during the training process. Using the validation data, which is often used to optimize hyperparameters, to check the performance of the model does not always provide reliable results.

Since the two works could not be compared using test accuracy, the validation accuracy of the model proposed in this work has been included in Table 15. The proposed model outperforms the other in terms of validation accuracy.

6 Conclusion

In this work, a lightweight diagnosis model based on a convolutional neural network was proposed to diagnose lung diseases, namely COVID-19, Bacterial and Viral Pneumonia. All experiments regarding the development and testing of the proposed model were carried out on a publicly available chest X-ray dataset. To validate and highlight the effectiveness of the proposed model, state-of-the-art pre-trained CNN models were used for the same prediction task and their performances were compared. Among these models, the pre-trained EfficientNet B2 achieved the highest classification accuracy of 85.7%. The proposed model outperformed the pre-trained benchmark models by achieving an overall prediction accuracy of 89.89% with a batch size of 64. A notably high accuracy in detecting COVID-19 samples was obtained by both the proposed model and the benchmark models; however, the pre-trained EfficientNet B2 model showed a slightly better result in predicting COVID-19, with a precision and recall of 0.99 and 0.98, respectively. In general, all the models used in this work showed a relatively poor precision for the Viral Pneumonia class and confused it with the Bacterial Pneumonia class, and vice versa, which decreased the overall accuracy.

The low number of samples in the Viral Pneumonia class may have hampered the models from extracting better representations from the image content, resulting in a relatively low prediction performance for this class. This can be considered the main limitation of the study. It is expected that supporting the Viral Pneumonia class with more samples will improve the performance of the models.

Besides the performance of the proposed model, this study contributes to the related literature by presenting a model with a significantly low number of parameters. This advantage makes the model applicable in medical facilities and areas that do not have devices with high computational resources. Furthermore, the system can easily be integrated with a user interface on a regular computer and can be used by medical staff with no technical computer skills.

Since such a deep learning-assisted diagnosis model is suitable for computers with limited computational power, it may be executed on edge devices or single-board computers. Hence, the proposed study has another practical application as a part of Internet of Things (IoT) systems. Provided that the decisions are made with the proposed model on an edge device, the application will have advantages such as saving bandwidth and rapid assessment of the input image to perform the diagnosis. Therefore, such applications may enable more convenient diagnosis practices while helping to prevent the spread of viruses.

As future work, image processing techniques such as Contrast Limited Adaptive Histogram Equalization (CLAHE) can be applied to enhance the quality of the chest X-ray images used in this work. Ensemble methods may also be utilized by considering the class-specific correct detection rates of different classifier models. In addition, new images can be added to the Viral Pneumonia class to balance the data and increase the capability of the models to extract better representations.