1 Introduction

Brain tumors are considered among the most common brain diseases. A brain tumor is an uncontrolled and abnormal growth of brain cells [1], and it represents one of the most lethal and life-threatening cancers. According to cancer statistics in the USA, about 23,000 patients were diagnosed with a brain tumor in 2015. Two years later, statistics ranked this sort of tumor as a leading cause of cancer mortality around the world, both in children and in adults [2]. To detect tumors, radiologists widely exploit medical imaging techniques [3]. Among the various available techniques, MRI is preferred for brain tumors owing to its harmless nature. In daily routine, radiologists identify brain tumors manually. The tumor classification process is extremely time-consuming and depends on the skills and experience of the radiologist. With the increase in patient numbers, the amount of data to be analyzed daily is large, which makes readings based on visual interpretation expensive and inaccurate. Furthermore, classifying brain tumors into various pathological types is more challenging than binary classification. The related challenges are attributed to factors such as the high variations in shape, size, and intensity within the same tumor type [4], as well as the similar appearances across different pathological types [5]. A wrong diagnosis of a brain tumor can lead to serious complications and decrease the patient's chance of survival. To overcome the drawbacks of manual diagnosis, there is a surge of interest in designing automated image processing systems [6,7,8]. Many researchers have suggested techniques to improve CAD systems for classifying tumors in brain MRI images. Traditional machine learning methods used in the classification process usually involve several steps: preprocessing, dimension reduction, feature extraction, feature selection, and classification. Feature extraction is the crucial phase of an effective CAD system [9]. It is a challenging task that requires prior knowledge about the problem domain, since the classification accuracy depends on the quality of the extracted features. Traditional feature extraction techniques can be sorted into three types: spatial domain features, wavelet and frequency features, and contextual and hybrid features. Newer CAD methods yield improved performance due to the use of deep learning (DL).

DL is a subset of machine learning that does not require handcrafted features [10, 11]. It has been proven to narrow the gap between human vision and computer vision in pattern recognition and can provide higher performance than traditional techniques [12]. It has surpassed state-of-the-art schemes in several fields such as text generation [13], face verification [14], image description [15], the game of Go [16], and grand challenges [17]. This high performance across several fields encouraged the exploitation of DL in medical imaging for classification, detection, and segmentation [18,19,20,21,22,23,24]. According to [25], in 2016 alone, about 220 works applying deep learning to medical images were reported, and this number is expected to increase in the coming years. Around 190 of them used CNNs. DL permits the exploitation of pre-trained CNN models, originally developed for other applications, for medical images and especially for brain tumor classification, such as AlexNet [26], GoogLeNet [4], and ResNet-34 [5].

CNNs have achieved high performance on huge labeled datasets such as ImageNet [17], which contains more than one million images. However, it is hard to exploit such deep CNNs in the medical field. First, medical datasets are generally small because they require expert radiologists to manually examine and label the images, which is time-consuming, laborious, and costly. Second, training a deep CNN on a small dataset is complicated because of over-fitting and convergence problems. Third, domain expertise is needed to repeatedly revise the model and adjust the learning parameters to obtain better performance. Therefore, training a deep CNN from scratch is a challenging task that is tedious and demands much diligence and patience.

A new model for brain tumor classification based on CNN is presented in this paper. It contains various layers such as convolution, Rectified Linear Unit (ReLU), and pooling. Unlike some previous methods, which require prior segmentation of tumors, our approach does not involve any segmentation in the preprocessing step. We validated our algorithm on three public datasets.

Our contribution in this work comprises the following key points:

  • A novel and robust model is presented for the automated classification of brain tumors, which is effective in extracting the important features from MRI datasets.

  • The suggested model exploits 3 × 3 kernels with a small stride for all convolutional layers, in order to learn the small-scale texture of tumors in brain images, unlike other models that use 11 × 11 or 9 × 9 kernels with larger strides.

  • The novel model achieves good accuracy in brain tumor classification with little preprocessing, compared to other techniques that require tumor segmentation before the classification step.

  • Our model provides an acceptable classification accuracy compared to recent methods, despite the limited number of training samples.

This paper is structured into nine sections. Some previous works are briefly reviewed in Sect. 2. The background is outlined in Sect. 3. The proposed model is discussed in Sect. 4. The performance evaluation metrics are presented in Sect. 5. Section 6 describes the datasets exploited. Section 7 summarizes the results. Section 8 presents the discussion, and Sect. 9 concludes the paper.

2 Related Work

Different methods have been suggested in past years for classification and segmentation. These techniques used traditional machine learning [27,28,29] and, more recently, deep learning models [30,31,32,33,34,35,36,37,38,39,40,41,42,43,44]. In this section, we review the works devoted to brain tumor classification.

Hemanth et al. [45] proposed a new technique for MRI brain tumor classification using a modified neural network. 540 MR brain images were exploited to test the suggested method. The dataset consists of four tumor classes, namely astrocytoma, meningioma, metastasis, and glioma. The images are of size 256 × 256. Normalization is performed as a preprocessing step. Eight features are acquired based on the first-order histogram and GLCM. The suggested method provides promising results, reaching 95% sensitivity, 98% specificity, and 98% accuracy.

Another work was proposed by Lin et al. [46] to classify meningioma tumors into different grades. Grade I contains non-cancerous, slow-growing tumors. Grade II contains cancerous and non-cancerous tumors. Grade III contains cancerous tumors, which can grow quickly. Different features are exploited, such as contextual and radiological features. No segmentation or preprocessing is performed. In the classification step, the authors used multiple logistic regression. The proposed scheme is tested using MRI images of 120 patients, where 90 were Grade I and 30 were Grade II or III. They exploited several sequences such as FLAIR, T1, and T2. A DWI transformation is utilized to extract features. The results are acceptable on the exploited dataset. However, this kind of method requires much larger datasets to ensure its validity.

Another work aimed at classifying multi-grade brain tumors is presented in [47]. The proposed method is based on a pre-trained CNN model and segmented images. The model is tested on three datasets. Various data augmentation techniques are exploited in order to enhance accuracy. The technique is experimentally evaluated on the original and the augmented datasets. The reported results are convincing compared to previous works.

To help radiologists in MRI classification, Sachdeva et al. [48] suggested a semi-automatic classification scheme comprising several steps. In the first step, to detect tumor areas, a content-based active contour system is applied that allows the radiologist to manually indicate the region of interest (ROI), which is saved as a segmented ROI (SROI). Then, 71 texture and intensity features are extracted from the SROI. Optimal feature selection is performed by applying a Genetic Algorithm (GA). The last phase classifies the chosen features using two classifiers: SVM and ANN. The suggested scheme is tested on two datasets. The first dataset contains 428 MR images and the second 260. The first set of images covers tumor categories including Glioblastoma Multiforme (GBM), Meningioma (MEN), Astrocytoma (AS), the childhood tumor Medulloblastoma (MED), and the secondary, metastatic tumor (MET). The second dataset contains only three tumor categories: AS, MEN, and Low-Grade Glioma. The suggested GA-SVM aims to find a preliminary probability for the tumor category, while GA-ANN aims to confirm the accuracy. The performance calculated on the first group of images shows that the GA-based approach enhanced SVM accuracy to 91.7% and ANN accuracy to 94.9%. On the second group of images, SVM accuracy rose to 89% and ANN accuracy to 94.1%. The results demonstrate that the GA-ANN classifier offered the highest results compared to GA-SVM; GA-SVM offers speed, while GA-ANN offers accuracy. According to these results, the suggested scheme has acceptable performance and can assist radiologists in making better decisions when classifying brain tumors.

Cheng et al. [27] were the first to exploit the well-known dataset [49]. Their system takes advantage of the manually delineated tumor border for feature extraction. They utilized the augmented tumor region as the region of interest (ROI), which was split into subregions using an adaptive spatial division method. Features were extracted in three manners: the gray-level co-occurrence matrix (GLCM), the intensity histogram, and the bag of words (BoW). SVM achieved the highest accuracy. The experiments followed a standard five-fold cross-validation procedure. Accuracy, sensitivity, and specificity are the performance measures calculated. The highest accuracy is about 91.28%.

Ismael et al. [28] used Gabor filters and the discrete wavelet transform (DWT) to extract statistical features for classification. Their algorithm takes the segmented tumor as input and uses a multi-layer perceptron (MLP) as classifier. The database images were randomly divided into 70% and 30% to form the training and validation sets, respectively. The accuracy achieved is about 91.9%.

Different preprocessing schemes were investigated by Tahir et al. [29] to improve the classification results. They grouped these techniques into three categories: noise removal, edge detection, and contrast enhancement. The possible combinations were applied to different image sets. The authors affirm that combining various preprocessing techniques is more beneficial than applying a single technique. An SVM classifier was exploited and reported 86% as the highest accuracy on the Figshare dataset.

According to these results, the available tumor detection systems do not provide satisfactory output. For this reason, there is a strong need for robust automated CAD systems. Conventional machine learning requires domain-specific expertise and experience, and it needs effort for the manual extraction of features, which can decrease the efficiency of the system. Deep learning-based techniques surpass these drawbacks thanks to automatic feature extraction through convolutional layers, yielding features that are robust for classification purposes.

To improve the classification accuracy on this dataset, Paul et al. [33] applied three different classifiers: a CNN, a fully connected neural network, and a random forest. The CNN provided the highest accuracy, which reached 90.26%. The proposed model contains various layers such as convolutional, max-pooling, and fully connected layers.

In this regard, Afshar et al. [34] proposed a modified CNN architecture called capsule network (CapsNet) for brain tumor classification. The proposed CapsNet exploits the spatial relationship between the tumor and its surrounding tissues. The highest accuracy is 86.56% on segmented tumors and 72.13% on raw brain images.

Another work used a deep belief network (DBN) to discriminate between healthy controls and patients with schizophrenia, represented by 83 and 143 subjects respectively, from the Radiopaedia dataset [50]. The proposed DBN provides 73.6% accuracy, compared to 68.1% for SVM.

Zhou et al. [35] proposed a holistic brain tumor classification technique. Features are extracted from the axial view using an auto-encoder and classified using a Long Short-Term Memory (LSTM) network. The proposed technique was tested on selected slices (989, axial only) and reported 92.13% as the best accuracy.

Similarly, Pashaei et al. [36] developed a new architecture for brain tumor classification. The proposed model contains five layers to extract features. A Kernel Extreme Learning Machine (KELM) is used to classify images based on these extracted features. The accuracy achieved is about 93.68%, which exceeds the Support Vector Machine, Radial Basis Function, and some other classifiers.

Abiwinanda et al. [37] investigated the application of CNNs to this dataset and designed seven different neural networks. The second model, which contains two convolutional layers and one fully connected layer, provided the highest performance. Without any prior segmentation, this simple model achieves 98.51% training accuracy and 84.19% test accuracy.

Another reported use of CNNs on this dataset is by Ghassemi et al. [38], who proposed a new model for multi-class brain tumor classification. The model is first pre-trained as the discriminator of a generative adversarial network (GAN) in order to extract important features. Then, the last fully connected layer is replaced with a SoftMax classifier to differentiate three tumor types. The proposed model contains six layers and was used with various data augmentation techniques. It achieved 93.01% and 95.6% accuracy on the introduced and random splits, respectively.

Recently, various new architectures have been suggested to generalize CNNs to the graph domain, especially for medical image classification [51].

Different authors chose the graph CNN (GCNN) as a solution for tumor classification [51,52,53]. In [52], Song et al. exploit a GCNN model to classify Alzheimer's disease (AD) into four categories. The proposed network contains eleven layers: nine convolutional and two FC. A ReLU activation is exploited after each layer, and a Softmax is employed as the final layer to compute the class probabilities. The proposed scheme is tested on the Alzheimer's Disease Neuroimaging Initiative (ADNI) dataset. The original dataset contains 12 images per class. Due to the small data volume, various data augmentations are applied, increasing the dataset from 12 to 132 images per class. To obtain a robust assessment, the authors exploit a 10-fold CV. The average accuracy is about 65% for the SVM classifier and 89% for the GCNN.

Another work on AD classification was proposed by Guo et al. [53]. The authors exploit a GCNN to classify AD into 2 and 3 classes, testing the proposed model on the ADNI dataset. For the 2-class classification, the proposed GCNN attains 93%, compared to the well-known ResNet architecture and the SVM classifier, which reached 95% and 69%, respectively. In the 3-class scenario, the proposed GCNN attains 77%, versus 65% and 57% for ResNet and SVM, respectively.

A modified GCNN architecture is used in [51] for the early detection of AD. Images of 160 patients from the ADNI dataset are used to test the suggested scheme, and a 5-fold CV is exploited to measure its performance. The accuracy results surpass 90%.

Despite the various schemes proposed for the classification of brain tumors, these techniques suffer from several limitations, which can be summarized as follows. The accuracy provided by state-of-the-art schemes is inadequate considering the importance of MRI classification in the medical area. Several methods used manually delineated tumor regions for classification, which prevented them from being fully automated. The algorithms utilizing CNNs and their variants could not provide a substantial improvement in performance; hence, performance evaluation based on various metrics other than accuracy becomes significant. Besides, CNN models generally provide poor performance on small datasets, which is typically the case for medical image databases.

A new scheme is suggested to overcome these drawbacks. The suggested system provides the highest classification performance, compared to previous works, on three open datasets. Despite the use of a smaller number of training samples, the proposed method provides acceptable results.

3 Background

In recent years, DL has shown promising performance in several domains. DL models have the ability to automatically learn multiple levels of representation from a large set of data. They have a huge advantage over traditional machine learning, which needs a lot of effort for feature engineering and expert knowledge. Several DL architectures have been proposed; the CNN is the most exploited in the image processing field due to its ability to recognize patterns in images [54].

A CNN model can contain several types of layers. The most frequently used are convolutional, pooling, and fully connected layers. The convolutional layer is the main layer of a CNN scheme; it is used for extracting features such as edges and colors of the image. The pooling layer is exploited to decrease the dimensionality of the extracted features, which reduces the complexity and the computational time. The fully connected layer represents the last step in a CNN model, combining the extracted features to produce the final classification.

An optimizer algorithm is exploited by each CNN model in the training phase to update the weights. The model takes the classification loss as input and back-propagates the error through the network to update the filters and weights. In the final step, a SoftMax activation function is exploited to normalize the outputs so that they sum to one. According to the literature, a deeper CNN model can solve more complex tasks and improve accuracy.
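For reference, the SoftMax function maps the raw network outputs (logits) \( z_{k} \) to class probabilities that sum to one; this is the standard formulation, with the notation added here for clarity:

$$ \sigma (z)_{k} = \frac{e^{z_{k}}}{\sum\nolimits_{c = 1}^{C} e^{z_{c}}},\quad k = 1, \ldots ,C $$

where C is the number of classes.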

In the medical imaging area, especially for brain tumor classification, several works have been suggested. In recent years, researchers have proposed multi-class brain tumor classification [33,34,35,36,37,38], since binary classification [55] is insufficient for the doctor to choose the suitable treatment for the patient.

Despite the different techniques proposed in the literature, brain tumor classification methods still have limitations that need to be considered. Binary classification leaves various ambiguities for doctors and is not enough to decide on a suitable treatment; for a clear understanding, multi-class classification is needed. Furthermore, the use of several different datasets represents an obstacle for researchers to achieve a precise comparison. To overcome these limitations, we suggest a new deep CNN scheme for multi-class brain tumor classification using three publicly available datasets.

The new scheme was proposed with the aim of generating an accurate classification system. The suggested approach is illustrated in Fig. 1 and comprises several steps. The input images are obtained from the dataset. Normalization and contrast enhancement are exploited to improve image quality. A new CNN scheme is suggested to extract the important features. In the last step, the test image is classified into one of the input classes based on the Softmax activation function.

Fig. 1 The suggested approach

4 Proposed Model

Recently, CNNs have been widely exploited in all types of medical image processing applications, particularly in MRI brain tumor classification and segmentation. In this work, a new CNN model is suggested for multi-class brain tumor classification.

The general architecture of the proposed sequential model is outlined in Fig. 2, and its details are described in Table 1. It consists of several layers, each having its own functionality. An image of size 256 × 256 is the model input. Ten convolutional layers are exploited to extract the important features, and a max-pooling layer is employed after every two convolutional layers to reduce the data size. Each convolutional layer uses 3 × 3 filters, while 2 × 2 windows are applied in the pooling layers. A non-linearity layer is added to improve the fitting ability of the CNN. Furthermore, batch normalization is used after each convolution layer to obtain well-optimized results and speed up network convergence. A fully connected layer with 64 neurons is employed, and the output layer exploits a softmax classifier. We briefly introduce these layers in this section; a minimal code sketch of the architecture follows Table 1.

Fig. 2 The proposed architecture

Table 1 CNN proposed structure
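As a minimal sketch (not the authors' released code), the architecture described above can be expressed in Keras as follows. The per-block filter counts are illustrative assumptions; the exact values are those listed in Table 1.

```python
# Sketch of the proposed architecture: ten 3x3 convolutions (two per
# block, each followed by batch normalization and ReLU), a 2x2
# max-pooling closing every block, one FC layer with 64 neurons, and a
# softmax output. block_filters values are assumptions, not Table 1.
from tensorflow.keras import layers, models

def build_model(input_shape=(256, 256, 1), num_classes=3,
                block_filters=(32, 32, 64, 64, 128)):
    model = models.Sequential()
    model.add(layers.Input(shape=input_shape))
    for f in block_filters:
        for _ in range(2):
            model.add(layers.Conv2D(f, (3, 3), strides=1, padding='same'))
            model.add(layers.BatchNormalization())
            model.add(layers.Activation('relu'))
        model.add(layers.MaxPooling2D(pool_size=(2, 2)))
    model.add(layers.Flatten())                      # features to 1-D
    model.add(layers.Dense(64, activation='relu'))   # FC with 64 neurons
    model.add(layers.Dense(num_classes, activation='softmax'))
    return model
```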

4.1 Convolution Layer

This layer is the most important part and the core component of the CNN model; it is also the origin of the name "Convolutional Neural Network". It is exploited as the first layer to extract different features from the input data. The first convolutional layer extracts low-level features, while more complex features are extracted by additional convolutional layers [56, 57]. The convolution layer is calculated as follows (Eq. 1):

$$ \hat{F}_{j} = \sum\limits_{i} {F_{i} \otimes K_{i,j} } + b_{j} $$
(1)

where \( F_{i} \) represents the i-th input feature map, \( \hat{F}_{j} \) is the j-th output feature map, \( K_{i,j} \) represents the convolutional kernel, \( b_{j} \) is the bias, and \( \otimes \) denotes the 2-D convolution operation. Both \( K_{i,j} \) and \( b_{j} \) are learnable parameters in CNNs.
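To make Eq. 1 concrete, a direct NumPy/SciPy rendering for a single output feature map is sketched below (assuming 'valid' boundary handling; CNN libraries compute the same sum far more efficiently):

```python
import numpy as np
from scipy.signal import convolve2d

def conv_feature_map(inputs, kernels, bias):
    """One output map of Eq. 1: F_hat_j = sum_i F_i (x) K_ij + b_j.

    inputs:  list of 2-D input feature maps F_i
    kernels: list of 2-D kernels K_ij, one per input map
    bias:    scalar bias b_j
    """
    acc = sum(convolve2d(F_i, K_ij, mode='valid')
              for F_i, K_ij in zip(inputs, kernels))
    return acc + bias

# Tiny usage example: one 4x4 input map, one 3x3 averaging kernel
F = [np.arange(16, dtype=float).reshape(4, 4)]
K = [np.ones((3, 3)) / 9.0]
print(conv_feature_map(F, K, bias=0.5))  # -> a 2x2 output map
```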

4.2 Non-linearity Layer

The second layer of the model is the non-linearity layer. A nonlinear factor is added to improve the fitting ability of the CNN. This is achieved by using activation functions such as Sigmoid, ReLU, leaky ReLU, ELU, etc. The ReLU function is the most used due to its simplicity and ease of application, since it does not require much calculation [58]. It is expressed as follows (Eq. 2), where x is the input value.

$$ \text{ReLU}(x) = \left\{ {\begin{array}{*{20}c} {x,} & {\text{if}\;x > 0,} \\ {0,} & {\text{otherwise}} \\ \end{array} } \right. $$
(2)

4.3 Batch Normalization Layer

The suggested model uses a batch normalization layer after each convolution layer. This layer is utilized to obtain well-optimized results, to speed up network convergence, and to train on the data effectively.

4.4 Pooling Layer

The features obtained from the convolutional layer are still very large. If used directly, the training phase will be prone to overfitting and very time-consuming. To deal with this problem, this layer adopts a downsampling scheme to compress the image and reduce the number of parameters. Several forms of subsampling are exploited in the literature, such as mean pooling and max-pooling. In the proposed model, the dimension of the feature maps is reduced by performing the max-pooling operation, since it is very easy to apply and achieves the highest results [59].
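As a simple numeric illustration (the example is ours), a 2 × 2 max-pooling with stride 2 halves each spatial dimension by keeping only the maximum of every 2 × 2 block:

$$ \left( {\begin{array}{*{20}c} 1 & 3 & 2 & 0 \\ 5 & 4 & 1 & 1 \\ 0 & 2 & 6 & 3 \\ 1 & 1 & 2 & 4 \\ \end{array} } \right)\; \to \;\left( {\begin{array}{*{20}c} 5 & 2 \\ 2 & 6 \\ \end{array} } \right) $$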

4.5 Fully Connected Layer (FC)

The FC layer is adopted at the end of the network. Since the features must be one-dimensional (1D) data before being fed to the classifier, the output of the previous layer is first flattened and then passed to the FC layer. When the FC layer is utilized as the last layer, its output size is fixed to the number of classes [60,61,62].

5 Performance Metrics

Following the performance metrics used in previous references, the efficiency of classifying images into three classes is measured based on Accuracy, Specificity, Sensitivity, Precision, and F1-score. The formulas for these performance measures are given in Eqs. 3–7, respectively:

$$ {\text {Accuracy}} = \, \frac{TP + TN}{{\left( {TP + TN + FP + FN} \right)}} $$
(3)
$$ {\text {Specificity}} = \, \frac{TN}{TN + FP} $$
(4)
$$ {\text {Sensitivity}}\,({\text{Recall}})\, = \frac{TP}{TP + FN} $$
(5)
$$ {\text{Precision}} = \, \frac{TP}{TP + FP} $$
(6)
$$ F1\;{\text{Score}} = 2 \times \frac{{\text{Recall}} \times {\text{Precision}}}{{\text{Recall}} + {\text{Precision}}} $$
(7)

where TP denotes True Positive, TN True Negative, FP False Positive, and FN False Negative. These parameters are estimated from the confusion matrix, which details the correct and incorrect classifications of images from all categories.
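For concreteness, the per-class TP, TN, FP, and FN can be derived directly from the confusion matrix; the small NumPy routine below (ours, not the authors' code) evaluates Eqs. 3–7 for every class at once:

```python
import numpy as np

def per_class_metrics(cm):
    """cm: square confusion matrix, rows = actual, columns = predicted."""
    cm = np.asarray(cm, dtype=float)
    TP = np.diag(cm)
    FP = cm.sum(axis=0) - TP   # predicted as the class, actually another
    FN = cm.sum(axis=1) - TP   # actually the class, predicted as another
    TN = cm.sum() - (TP + FP + FN)
    accuracy    = (TP + TN) / cm.sum()        # Eq. 3
    specificity = TN / (TN + FP)              # Eq. 4
    recall      = TP / (TP + FN)              # Eq. 5 (sensitivity)
    precision   = TP / (TP + FP)              # Eq. 6
    f1 = 2 * recall * precision / (recall + precision)  # Eq. 7
    return accuracy, specificity, recall, precision, f1
```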

6 Image Data

6.1 Figshare Dataset

We exploited the publicly available Figshare brain tumor dataset [49] to evaluate our proposed model. It was collected from Nanfang Hospital, Guangzhou, China, and General Hospital, Tianjin Medical University, China, from 2005 to 2010. The dataset covers 233 patients suffering from brain tumors of three classes, namely meningioma, pituitary, and glioma, which represent about 15%, 15%, and 45% of all brain tumors, respectively [63]. The images have a size of 512 × 512 pixels. The dataset comprises 3064 slices, including 708 meningiomas, 1426 gliomas, and 930 pituitary tumors [64]. Table 2 details the image data exploited in the experiments.

Table 2 Summary of the image dataset

For every patient, three experienced radiologists initially and independently examined the MRI images to determine the pathology type. This dataset is complicated, as some tumors are similar in color, shape, position, etc. Some examples of MRI images are given in Fig. 3. Figure 3a, b presents two patients having the same category of tumor but exhibiting different appearances. Conversely, visual similarity is shown in Fig. 3c, d, which contain different pathological categories. In this study, two images including the same tumor category are considered similar; otherwise, they are considered dissimilar.

Fig. 3 Some examples of brain tumors (indicated by the yellow arrows). a and b present gliomas in different subjects. c presents a meningioma and d a pituitary tumor, from different subjects [3]

6.2 Radiopaedia Dataset

Radiopaedia [65] is the second dataset exploited in this paper. It contains 121 MR images corresponding to four different grades, as shown in Table 3. This dataset suffers from a small number of images in each grade, whereas big data with various examples is a key for the effective deployment of deep learning models [66]; it thus lacks a satisfactory volume of data to train deep learning models and reach good accuracy. In order to attain higher performance, we augmented the original data using four augmentation methods, which are detailed in Table 4. For geometric transformations, we exploit rotation and flipping; Gaussian blur and sharpening are applied to handle noise. About 17 parameter settings across the four augmentation techniques are exploited, which generate 17 additional samples for each image of the dataset. The augmented dataset is detailed in Table 3, and the performances are evaluated based on a five-fold CV. A minimal sketch of this augmentation pipeline is given after Table 4.

Table 3 Radiopaedia dataset before and after using data augmentation
Table 4 Various data augmentation methods with the used parameters
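The sketch below uses Pillow; the rotation angles and filter strengths are illustrative placeholders, the actual 17 parameter settings being those listed in Table 4:

```python
from PIL import Image, ImageEnhance, ImageFilter

def augment(img):
    """Generate augmented variants of one MR slice (parameters illustrative)."""
    out = [img.rotate(angle) for angle in (90, 180, 270)]  # rotation
    out.append(img.transpose(Image.FLIP_LEFT_RIGHT))       # flipping
    out.append(img.transpose(Image.FLIP_TOP_BOTTOM))
    out.append(img.filter(ImageFilter.GaussianBlur(2.0)))  # Gaussian blur
    out.append(ImageEnhance.Sharpness(img).enhance(2.0))   # sharpening
    return out
```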

6.3 REMBRANDT Dataset

The last dataset used in this paper is the Repository of Molecular Brain Neoplasia Data (REMBRANDT), which was exploited in various previous works [67]. It contains 130 brain tumor patients with an average survival of 47 months. The dataset consists of various brain tumor types: astrocytoma (AST), oligodendroglioma (OLI), glioblastoma multiforme (GBM), and patients with unknown tumors. The AST type includes 47 patients divided into grade 2 and grade 3, containing 30 and 17 patients, respectively. The OLI type includes 21 patients divided into grade 2 and grade 3, containing 14 and 7 patients, respectively. The GBM type consists of 40 patients with grade 4. The dataset is summarized in Table 5. To augment the data, we exploit 4 data augmentation techniques: Flip LEFT_RIGHT, Flip TOP_BOTTOM, Gaussian Blur (2.0), and Sharpen (2.0).

Table 5 REMBRANDT details

This dataset covers various MRI protocols such as FLAIR, T1W, T2W, and diffusion-weighted imaging (DWI). The dataset slices were manually labeled by experts. In this paper, a slice is considered a negative sample if the tumor lesion is not visible; if the tumor lesion is visible, the slice is labeled as a positive sample. Based on this dataset, five sub-datasets are designed, which are detailed below.

6.3.1 Two-Class Data (C2)

The main objective of this dataset is to classify brain MRI into tumorous and normal classes. Each slice from the dataset is included in the tumor class if the tumor is visible; otherwise, it is included in the normal class. In this paper, we exploit 1041 and 1091 MRI slices as normal and tumorous samples, respectively. The original dataset includes 2132 MRIs, which are increased to 10,660 using the data augmentation techniques.

6.3.2 Three-Class Data (C3)

The main objective of this dataset is to discriminate brain MRI into three classes: normal, LGG, and HGG. As mentioned for the previous dataset, the normal class contains 1041 images. The LGG class includes AST grade 2 and OLI grade 2, while the HGG class includes grade 3 of AST and OLI as well as the GBM samples. In total, C3 includes 1041 samples in the normal class, 484 samples in the LGG class, and 631 samples in the HGG class. C3 details are shown in Table 6.

Table 6 C3 dataset detail

6.3.3 Four-Class Data (C4)

The main objective of this dataset is to classify brain MRI into four classes, namely normal, AST, OLI, and GBM. The AST class includes AST grade 2 and grade 3; likewise, the OLI class includes OLI grade 2 and grade 3. The normal, AST, OLI, and GBM classes contain 1041, 557, 219, and 339 samples, respectively. The C4 dataset is summarized in Table 7.

Table 7 C4 dataset detail

6.3.4 Five-Class Data (C5)

This dataset is designed to classify brain MRI into five classes: AST (grade 2), AST (grade 3), OLI (grade 2), OLI (grade 3), and GBM (grade 4). These classes contain 356, 201, 128, 91, and 339 samples, respectively. The dataset details are shown in Table 8.

Table 8 C5 dataset detail

6.3.5 Six-Class Data (C6)

The last proposed dataset aims to discriminate brain MRI into six classes: normal, AST (grade 2), AST (grade 3), OLI (grade 2), OLI (grade 3), and GBM (grade 4). It includes 1041, 356, 201, 128, 91, and 339 samples for these classes, respectively. The C6 dataset is summarized in Table 9.

Table 9 C6 dataset detail

7 Experimental Results

The suggested model was trained on a desktop computer equipped with an Intel Xeon E5-2620 v4 processor and 64 GB of RAM. It was implemented in Python 2.7, using the Keras library with TensorFlow. In this work, we have proposed a new model for brain tumor classification and tested it on three datasets.

Our model consists of 10 convolution layers to extract features from the MRI images, each followed by a batch normalization layer to speed up the learning. To add more nonlinearity to the network and reduce the size of the layers, five max-pooling layers are exploited. To make the convolution output suitable for the FC layers, the features are reshaped to one dimension. An FC layer with 64 neurons is exploited, and a SoftMax classifier layer is used for classification [68]; it has three or four output neurons, matching the number of classes. All convolution layers in this model employ filters of size 3 × 3. The ReLU activation function is used, as it is the standard activation function in image classification models. The size of the max-pooling kernel is 2 × 2.

7.1 Hyper-parameters

The Figshare dataset images have a size of 512 × 512. For computational cost reasons, this size is reduced to 256 × 256. To enhance image quality, we exploit intensity normalization and contrast enhancement as preprocessing steps. We use the five folds provided with the dataset in order to make the results valid and comparable. The method is evaluated using 70% of the training fold for training and 30% for validation. Each run is repeated ten times and the average is taken as the result, in order to improve the soundness of the outcome. Various tests are needed before the final model evaluation in order to justify the hyper-parameter choices. Based on Tables 10 and 11, the best optimizer is Adagrad with a 0.003 learning rate; the number of epochs is 20 with a batch size of 16. A minimal sketch of this training configuration is given after Table 11.

Table 10 Comparison between different optimizers and learning rates
Table 11 Comparison between various numbers of epochs
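The sketch below is ours; x_train, y_train, x_val, and y_val stand for the preprocessed 70/30 split of the current fold, and build_model is the architecture sketch of Sect. 4:

```python
from tensorflow.keras.optimizers import Adagrad

model = build_model(input_shape=(256, 256, 1), num_classes=3)
model.compile(optimizer=Adagrad(learning_rate=0.003),  # best per Table 10
              loss='categorical_crossentropy',         # one-hot labels
              metrics=['accuracy'])
history = model.fit(x_train, y_train,
                    validation_data=(x_val, y_val),
                    epochs=20, batch_size=16)          # best per Table 11
```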

The performance of our suggested model is evaluated based on five-fold cross-validation, and the results are presented in the following tables. Figure 4 shows the training-validation process of the network; the red and blue lines illustrate the training and validation processes, respectively.

Fig. 4 Accuracy and loss history of the proposed model based on five-fold cross-validation. a–e show the accuracy and loss history for each fold

The benefit of the suggested scheme is to speed up convergence and reduce overfitting, which is very clear from the accuracy and loss history in Fig. 4. As outlined in these plots, our suggested CNN model reaches its maximum performance very quickly, and there is consistency between the training and validation accuracy and loss.

7.2 Complexity Analysis

Computational complexity is one of the important criteria for measuring the quality of a CNN model. The time complexity of a CNN model depends on several parameters, such as the number of layers and the training procedure. In Table 1, we list the different layers used in the proposed model. We use 10 convolution layers, which is a medium size for a CNN model. The complexity of a CNN model also increases with the input size; therefore, size reduction can be used to lower the time complexity, and we start data preprocessing by reducing the input image size for all the data used. The pooling layer is another important technique for reducing the computational complexity of the model; in this regard, our model exploits 5 pooling layers. The main goal behind these operations is to reduce the computational complexity, which helps to generate a low-power, less complex model. According to the results presented in Table 12, and to balance accuracy against time per epoch, we use 256 × 256 as the input size and 3 × 3 as the filter size. The number of parameters used is about 3.3 M. Therefore, the complexity of our scheme is acceptable. A layer-wise parameter formula is given after Table 12.

Table 12 Comparison between various input and filter sizes
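As a rough check on the reported parameter count, the number of learnable parameters in a single convolutional layer follows directly from Eq. 1: with \( C_{in} \) input maps, \( C_{out} \) output maps, and k × k kernels,

$$ P_{conv} = \left( {k^{2} \cdot C_{in} + 1} \right) \cdot C_{out} $$

so that, for instance, a 3 × 3 layer mapping 64 maps to 128 maps (filter counts illustrative) holds (9 · 64 + 1) · 128 = 73,856 parameters; accumulating this quantity over the ten convolutional layers and the FC layers of Table 1 produces the reported total of about 3.3 M.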

7.3 Experimental Results for Figshare Dataset

A summary of the predictions provided by the proposed model can be presented as a confusion matrix, where each row illustrates the actual class and each column the predicted class [69]. Table 13 shows the confusion matrix. From this table, we note that the proposed technique classifies 2903 cases correctly and 161 cases incorrectly, yielding an acceptable overall accuracy of 94.74%. We also note that the meningioma tumor class has the highest misclassification rate. This misclassification is due to the smaller number of images provided for this class in the original dataset, and no augmentation technique was used.

Table 13 Confusion matrix for the suggested model

Generally, many previous works utilized only accuracy to evaluate model performance. In this study, we have exploited other measures such as sensitivity, specificity, precision, and F1-score in order to have a clear indicator of the generalization of the model. Table 14 summarizes the classification performance for each tumor type and the average of the best-performing proposed CNN model.

Table 14 Performance of the proposed method on the test dataset

From Table 14, we note that the proposed model provides 95.23%, 95.43%, and 98.43% accuracy for the meningioma, glioma, and pituitary tumors, respectively. Sensitivity is another indicator of classification performance, attaining 89.68%, 94.46%, and 99.03%, respectively. For all classes, the specificity values are high, which means that the model correctly recognizes the samples that do not carry a given disease.

The performance of our model is compared with recent classification methods that exploit the same dataset. As shown in Table 15, our model surpasses the newest methods in terms of classification accuracy. The table contains only accuracy as a performance metric, since it is the common metric exploited in all the related works. Previous works [27, 29] exploited traditional machine learning techniques, which need manual feature extraction, a tedious and time-consuming task. Other CNN-based models have exploited shallower networks, which limited their ability to provide high accuracy [33, 34, 37]. In this study, the choice of architecture and hyper-parameters for brain tumor classification has proven to process the MRI images effectively and attain higher accuracy.

Table 15 Comparison with previous works

The classification of brain tumor MRI images represents a challenging problem owing to the variety in intensities, sizes, orientations, and shapes, in addition to image contrast and noise perturbations. Furthermore, the medical datasets exploited are often limited in size and hard to access. Various inferences are made through the performance analysis. From the confusion matrix in Table 13, we noticed that most of the misclassifications pertain to the meningioma class. This is due to the smaller number of samples from this class in the dataset, and we did not use any class-specific data augmentation to balance the dataset. The handling of smaller amounts of training data is the other aspect our work concentrated on. According to Fig. 4, the training loss decreases while the validation loss increases during the first iterations. This indicates that overfitting took place, which leads to lower classification accuracy. This overfitting could be avoided by data augmentation, an aspect that offers scope for future work.

7.4 Experimental Results for Radiopaedia Dataset

7.4.1 Experimental Results Before Data Augmentation

The confusion matrix for the second dataset is given in Table 16. It can be seen that the accuracy attains 95.61%, 92.98%, 93.85%, and 98.24% for grades I, II, III, and IV, respectively. The overall accuracy (OA) reaches about 90.35%. The results obtained from the original dataset are not convincing enough to be trusted in a clinical environment, due to the high error rate. The poor OA indicates confusion between some grades, which is not acceptable.

Table 16 Confusion matrix for Radiopaedia dataset before using data augmentation

7.4.2 Experimental Results After Data Augmentation

Due to the poor results obtained on the original dataset, we use various data augmentations to enhance the performance provided by the proposed model. The confusion matrix for the new dataset is presented in Table 17.

Table 17 Confusion matrix for Radiopaedia dataset after using data augmentation

Based on the calculated performances, the OA increased from 90.35 to 93.71%, which is much better than the OA obtained without data augmentation. Similarly, the accuracies for grades I, II, III, and IV increased from 95.61 to 96.32%, from 92.98 to 95.31%, from 93.85 to 96.18%, and from 98.24 to 99.61%, respectively.

Various other performance measures can be calculated in order to have a clear indicator of the generalization of the model. Tables 16 and 17 summarize the classification performance for each tumor grade.

The suggested model provides 90.79%, 95.83%, 90.84%, and 98.22% sensitivity for Grades I, II, III, and IV, respectively. For all grades, the specificity values are high, which indicates that the proposed model correctly recognizes the samples outside a specific group. The F1-score is another indicator of classification performance, attaining 96.38%, 89.3%, 91.05%, and 99.1%, respectively.

According to the experimental results, it is evident that the use of data augmentation techniques enhances the classification performance.

7.4.3 Comparison with Previous Methods

A comparison with previous techniques is detailed in Table 18. Sachdeva et al. [48] used 428 MRI images to classify 5 classes; various intensity and texture features were extracted from each image, and the OA is about 94%. Pinaya et al. [50] classified MRI into healthy controls and schizophrenia based on a deep belief network and SVM; 231 images were exploited, and the OA attained is about 73.6%.

Table 18 Comparison with previous works

Recently, Sajjad et al. [47] proposed a new deep CNN to classify 4 grades of brain tumors. The original dataset contains 121 images, and the OA attains 87.38%. Due to this poor result and the small dataset available for a deep CNN model, the authors used data augmentation to provide more examples; the new dataset contains 3630 images and the OA increases to 90.67%.

In our work, two scenarios are realized: before and after data augmentation. The original data contain the same 121 images used by Sajjad et al. [47], covering 4 grades: Grade I (meningiomas), Grade II (gliomas), Grade III (gliomas), and Grade IV (glioblastomas), with 36, 32, 25, and 28 images, respectively. The OA attained is 90.35%. Due to this poor result, we use the data augmentation techniques detailed in Table 4; the dataset is augmented from 121 to 2178 images, and the OA attains 93.71%.

7.5 Experimental Results for REMBRANDT Dataset

7.5.1 Experimental Results Before Data Augmentation

The image size is reduced to 128 × 128 with the aim of reducing time complexity. The confusion matrices for the third dataset are given in Tables 19, 20, 21, 22 and 23. From Table 19, it can be seen that the proposed model attains an excellent accuracy of 100%, owing to the simplicity of this classification task and the adequate data volume exploited. According to Table 20, the normal class attains 100% accuracy, while LGG and HGG both attain 95%; the overall accuracy (OA) reaches about 95%. For C4, based on Table 21, the normal class attains 98.6% accuracy, while the accuracy reaches about 95.34% for AST (G2 + G3), and 99.06% and 95.81% for OLI (G2 + G3) and GBM, respectively; the OA is about 94.41%. Based on Table 22, the C5 accuracy attains 90.43% for AST (G2), 98.26% for AST (G3), 98.26% for OLI (G2), 96.52% for OLI (G3), and 88.69% for GBM; the OA reaches about 86.08%. According to Table 23, the last proposed dataset (C6) provides 97.67% accuracy for the normal class, 95.34% for AST (G2), 98.13% for AST (G3), 99.06% for OLI (G2), 99.06% for OLI (G3), and 94.88% for GBM, with an OA of 92.09%. The poor OA obtained indicates confusion between some classes, which is not acceptable in a clinical environment. We can surpass this issue using data augmentation techniques.

Table 19 Confusion matrix for C2
Table 20 Confusion matrix for C3
Table 21 Confusion matrix for C4
Table 22 Confusion matrix for C5
Table 23 Confusion matrix for C6

7.5.2 Experimental Results After Data Augmentation

Various data augmentation techniques are exploited to enhance the performance provided by the proposed model. The confusion matrices for the augmented datasets are presented in Tables 24, 25, 26, 27 and 28.

Table 24 Confusion matrix for C2
Table 25 Confusion matrix for C3
Table 26 Confusion matrix for C4
Table 27 Confusion matrix for C5
Table 28 Confusion matrix for C6

According to the calculated performances, all metrics increased, which indicates that the suggested model works well with the augmented data. The OA for the augmented C2 stays fixed at 100%. The OA for the augmented C3 increases from 95 to 97.22%. Similarly, the OAs for the augmented C4, C5, and C6 increase from 94.41 to 97.02%, from 86.08 to 88.86%, and from 92.09 to 95.72%, respectively. Several other performance measures, summarized in the previous tables, can be calculated in order to have a clear indicator of the generalization of the model. Based on the experimental results, data augmentation techniques significantly improve the classification performance.

7.5.3 Comparison with Previous Methods

A comparison with previous techniques using the same dataset is detailed in Table 29. Anaraki et al. [71] evolved two CNN architectures based on a GA. The first model exploits 7 layers to classify brain MRIs into three grades; the second model uses 8 layers to distinguish between 3 brain tumor classes. These two models achieve 90.9% and 94.2% accuracy, respectively. The proposed approach is computationally expensive due to the complexity of the GA-based parameter selection.

Table 29 Comparison with previous works

In a similar study, Yang et al. [4] exploit two well-known CNN models, AlexNet and GoogLeNet, with transfer learning (TL) for glioma classification. The TL technique outperformed training from scratch for both models. The highest accuracy provided is about 93%.

Recently, Tandel et al. [72] proposed a new technique based on the AlexNet model to classify 5 MRI brain tumor datasets, which include two, three, four, five, and six classes, respectively. The suggested technique outperforms several other machine learning techniques such as SVM, Decision Tree, and K-nearest neighbor. The accuracies provided are about 100%, 95.97%, 96.65%, 87.14%, and 93.74% for the 5 datasets, respectively.

In our work, we exploit two scenarios: before and after data augmentation. On the original dataset, the OA attained is about 100%, 95%, 94.41%, 86.08%, and 92.09% for the 5 proposed datasets, respectively. Due to these poor results, we use the data augmentation techniques mentioned above. The new OA achieved is about 100%, 97.22%, 97.02%, 88.86%, and 95.72%, respectively.

8 Discussion

In this paper, we have suggested a new deep CNN model for MRI brain tumor classification. Our model exploits various layers with different sizes and a Softmax classifier. The experimental study of the proposed technique is carried out on three public datasets, as discussed earlier.

The first brain tumor dataset is Figshare, which is freely available. It contains 3064 images corresponding to meningioma, glioma, and pituitary tumors. The second exploited dataset is Radiopaedia, which is used as extra validation to affirm that our suggested model can also classify tumor grades with acceptable results. The original data contain 121 MRI images corresponding to 4 grades. Due to the small data volume, we use several data augmentation techniques to increase the dataset size, which can enhance the model accuracy: rotation at different angles, flipping, Gaussian blur, and sharpening with several parameter settings. The new dataset contains 2178 images. We adapted the proposed model for this dataset by modifying the last FC layer to have 4 neurons, corresponding to the number of grades. The classification results are analyzed, and various evaluation metrics are calculated: accuracy, sensitivity, specificity, precision, and F1-score.

In general, only accuracy is exploited in the majority of previous works to evaluate the proposed schemes. However, using accuracy alone for comparison can be misleading, since it ignores sensitivity to imbalanced data; in such cases, some classes can perform better than others.

In this work, we use various measures to obtain a clear indicator of the generalization of the proposed model. As shown in Table 14, the suggested deep CNN provides good accuracy on the first dataset, reaching 95.23% for meningioma, 95.43% for glioma, and 98.43% for pituitary tumors, with an OA of 94.74%. Furthermore, the obtained results are compared with some recent works; as detailed in Table 15, our model achieved higher accuracy and outperformed the various previous works.

The second dataset is tested under two scenarios: before and after augmentation. From Table 16, the original dataset attains 95.61%, 92.98%, 93.85%, and 98.24% accuracy for grades I, II, III, and IV, respectively, with an OA of about 90.35%. From Table 17, the augmented dataset achieves 96.32%, 95.31%, 96.18%, and 99.61% accuracy for the 4 grades, respectively, with an OA of about 93.71%. A comparison with some previous works is detailed in Table 18; the proposed model achieved higher accuracy and outperformed the various previous works.

For the last dataset, we exploit two scenarios, before and after augmentation, for the 5 sub-datasets. From the experimental results, the OA obtained before data augmentation is about 100%, 95%, 94.41%, 86.08%, and 92.09%, respectively. The augmented datasets achieve 100%, 97.22%, 97.02%, 88.86%, and 95.72% as OA, respectively. A comparison with some previous works that exploit the same datasets is detailed in Table 29; our model attained the highest accuracy and outperformed all the previous works. Despite the variety of grades, classes, positions, and intensities in the used MRIs, the suggested model generates promising results.

Based on the obtained results, the proposed model attained the highest accuracy in MRI brain tumor classification on the three datasets and can generate promising results even for a small dataset. However, the classification of MRI brain images remains a challenging problem owing to the variety in intensities, sizes, orientations, and shapes, in addition to image contrast and noise perturbations. Furthermore, the medical datasets exploited are often limited in size and hard to access. Several previous works [27, 29, 48] used traditional machine learning techniques, which need manual feature extraction, a tedious and time-consuming task. Some other CNN-based works [33, 37] exploited shallow networks with little MRI data, which leads to lower accuracy. In this study, the choice of architecture and hyper-parameters for brain tumor classification has proven to process the MRI images effectively and attain higher accuracy.

In our opinion, the suggested scheme is effective for classifying MRI brain tumors by type or grade, which helps doctors make precise decisions in a short time. We believe that our model can also be exploited to classify other tumor types, such as breast, lung, and liver cancers, and other medical image modalities, such as ultrasound and X-ray.

9 Conclusion and Future Work

This paper presents an innovative model for multi-class brain tumor classification based on CNN. It is an automatic system that requires a minimum of preprocessing. The model was tested on three brain tumor datasets. Various performance metrics were studied to evaluate the model accuracy and ascertain the robustness of the system. The suggested model recorded the best classification accuracies compared to previous related works on the same datasets. The experimental results show the effectiveness of this model despite the smaller amount of training data. The suggested approach can be used for other MRI classification tasks, since it requires minimal preprocessing and does not use handcrafted features. As future work, we intend to exploit images from different modalities, such as T1, T2, and FLAIR, to augment the dataset size and add robustness to our scheme.