Improved convolutional neural network based histopathological image classification

Rachapudi, Venubabu; Lavanya Devi, G.

doi:10.1007/s12065-020-00367-y

Improved convolutional neural network based histopathological image classification

Special Issue
Published: 18 February 2020

Volume 14, pages 1337–1343, (2021)
Cite this article

Download PDF

Access provided by Autonomous University of Puebla

Evolutionary Intelligence Aims and scope Submit manuscript

Improved convolutional neural network based histopathological image classification

Download PDF

1043 Accesses
28 Citations
Explore all metrics

Abstract

Histopathological image classification is one of the important application areas of medical imaging. However, an accurate and efficient classification is still an open-ended research due to the complexity in histopathological images. For the same, this paper presents an efficient architecture of convolutional neural network for the classification of histopathological images. The proposed method consists of five subsequent blocks of layers, each having convolutional, drop-out, and max-pooling layers. The performance of the introduced classification system is validated on colorectal cancer histology image dataset which consists of RGB-colored images belonging to eight different classes. The experimental results confirm the higher performance of the proposed convolutional neural network against existing different machine learning models with the lowest error rate of 22.7%.

A Shallow Convolutional Neural Network Model for Breast Cancer Histopathology Image Classification

Classification of Breast Cancer Histopathological Images using Convolutional Neural Networks with Hierarchical Loss and Global Pooling

Breast Cancer Histology Image Classification Based on Deep Neural Networks

Discover the latest articles, news and stories from top researchers in related subjects.

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

Drug development and disease diagnosis are manifest through the microscopic examination of the surgical samples or biopsy. This analysis of biopsy is termed as histopathology and is generally performed manually by the pathologists. To perform diagnosis, pathologists study various properties of the biopsy like tissue structure, count of tissue cells, or disparity in the shape of the cells [1, 2]. However, this procedure has number of concerns such as time taken and costly procedure. Moreover, the knowledge of the pathologist guides the manual analysis, hence this approach is biased in nature [3]. Therefore, automatic analysis is utmost important for unbiased and fast disease diagnosis [4]. The digital transformation has digitized the biopsy in the form of images by capturing through microscopic mounted camera and termed as histopathological images. The analysis of such images through advanced computing technologies has resulted in better diagnosis. Therefore, histopathological image analysis is the prime area of medical research wherein accurate classification of histopathological images is the key step for meticulous diagnosis [5]. However, histopathological image classification is a challenging problem due to the complexity in the histopathological images [2, 6]. To illustrate the involved complexity, Fig. 1 illustrates the representative histopathological images of four types of cancers, taken from the publicly available colorectal cancer histology dataset [7].

In literature, machine learning models are widely preferred for the histopathological image analysis wherein a set of biopsy images are used to train a classifier which further infers the respective class of an unknown image. The general procedure of a traditional classification system consists of three phases, namely image pre-processing, feature extraction, and classification. The procedure of extracting features from training image and modeling the optimal decision boundary for the classification is still a quite successful in medical research. However, the success of such approaches is highly dependent on the extracted features [8]. Moreover, the extracted features are dependent on the method used for the same which is likely specified by humans. In literature, many types of such techniques exist like principal component analysis, clustering of image patches, dictionary approaches, and many more. A brief review of such techniques can be found in [9].

Zhang et al. [10] assembled two random classifiers, namely support vector machine (SVM) and multi-layer perceptron to classify the biopsy images. For validation of the method, a dataset of 361 images were used which includes 119 normal tissue images, 102 carcinoma in situ, and 140 lobular carcinoma images. The classification accuracy, given by the proposed system, was 99.25% which was a good accuracy for the considered dataset. Further, Kowal et al. [11] used three different classification techniques after the nuclei segmentation for the categorization of breast cancer images into benign and malignant classes. For the same, they first performed a nuclei segmentation using four clustering methods and then extracted the features to train the classifiers. The trained classifier gives the 96% accuracy for the dataset. Similarly, Filipczuk et al. [12] discriminated benign or malignant biopsies by using four traditional learning models, namely KNN (K-nearest neighbor), naive Bayes classifier, decision tree and SVM with an accuracy of 98.51%. Moreover, Asri et al. [13] performed a performance comparison among four machine learning methods, namely SVM, decision tree, naive bayes, and KNN on Wisconsin breast cancer image dataset, having total 699 images of benign and malignant classes. Out of these four machine learning models, SVM obtained the best accuracy of 97.13%. Although traditional machine learning models perform good in case of histopathological image classification, their accuracy is highly dependent on the extracted features [14] which are decided by human being and may be biased towards human knowledge and experience. Instead of human involvement, a better approach would be letting the machine learns the optimal features from the input data and performs the required analysis. This type of automated feature extraction is the main reason and success factor for deep learning models. Deep learning based models have been successfully applied in various applications like image classification, machine translation, speech recognition and many more.

Deep learning models are composed of large network of layers made up with neurons and perform classification by learning features internally. Deep learning models have reported outperforming results in histopathology image analysis, such as mitosis detection [15], tissue grading (classification) [8], and nuclei segmentation [16] from the high-resolution images. Generally, convolutional neural network (CNN) has been quite successful deep learning model for histopathological image analysis, especially for detection [17, 18] and classification [19,20,21]. The architecture of CNN transforms the input data to output by using a combination of different layers like convolution, pooling, and drop-out. Lo et al. [22] used the CNN for the first time on medical image. However, the first CNN that succeeded on a real-world application was LeNet [23] and solved the hand-written digit recognition. With the advancement in computing systems, there has been potential growth in the use of CNN based methods for automated classification of histopathology images, specifically after the introduction of AlexNet which won the ImageNet challenge with a large margin. Saha et al. [24] used handcrafted features, like intensity, morphological, and textual features, with deep learning model and achieved superior accuracy in the detection of mitoses from histopathological breast images. Further, a Han et al. [21] presented a new deep learning model for multi-class cancer classification from the histopathological breast images. Zheng et al. [25] introduced a new architecture based on CNN for the breast tumor classification. Litjens et al. [26] reviews various such models for the histopathological image analysis.

Although, CNN shows better performance for various image classification problems, it still lacks for histopathological image classification due to the lack of number of labeled histopathological images. As in CNN, large number of parameters are to be tuned which may lead to over-fitting problem in the model. To reduce the over-fitting problem, a large number of labeled histopathological images are required for training. However, to obtain the labeled images is a costly process due to the dependency on pathologists. Therefore, in case of limited histopathological image dataset, an efficient CNN model is required which should have fewer parameters to tune and can perform good on smaller dataset. Hence, in this paper an efficient light weighted CNN model is presented, especially for histopathological images classification with small dataset. The performance of the proposed model is validated against H&E stained histopathological cancer images taken from the colorectal cancer histology dataset and compared with different traditional machine learning methods.

The organization of rest of the paper is as follows: Sect. 2 briefs the standard layers of a convolutional neural network. The proposed convolutional neural network has been detailed in Sect. 3. The experimental results are discussed in Sect. 4. Finally, conclusion is drawn in Sect. 5.

2 Preliminaries

2.1 Convolutional neural network

A convolutional neural network (CNN) is a sequence of multiple layers, where each layer may belong to one of the five main layers, namely convolutional, non-linear activation, pooling, drop-out, and full-connected, CNN takes the input image and models the best representative features to attain high accuracy. Generally, it has been used for the image classification tasks, while its other applicability domains include the transfer learning, wherein a pretrained CNN is applied on new problem domain for either feature extraction or classification task. The architecture of a typical CNN is illustrated in Fig. 2. The detailed overview about each layer is discussed below.

2.2 Convolution layer

This layer corresponds to apply the convolutional operation on the input values. Specifically, the input to this layer is a matrix and convolved with ‘K’ learnable filters (or kernels) to generate ‘K’ new feature maps. A feature map is the summation of the dot product between the filter value and input value along with an added bias. Figure 3a represents the working of convolution operation.

2.3 Activation layer

In this layer, the generated feature map is mapped to a non-linear value by using non-linear activation functions. In CNN, rectified linear unit (ReLU) has been the most widely used activation function. It returns zero if the input value is less than zero else the input value is returned. Figure 3b depicts the function for the same. Other preferred activation functions are tanh and sigmoid. Usually, convolutional layer and activation layer are used in combination.

2.4 Pooling layer

In pooling layer, input values are down-sampled with focus on extracting relevant and important features. This layer benefits in reducing the computational complexity by performing the spatial dimensionality reduction of the given input values. Generally, there are two types of pooling layers, namely average pooling and max-pooling, out of which max-pooling is the most popular one. In max-pooling, maximum value from a region of input is filtered out by placing a kernel (usually of size 2 × 2) over the considered region. Figure 4 depicts the max-pooling operation.

2.5 Drop-out layer

In this layer, a set of neurons are randomly de-activated which results in generating zero output while training the CNN. The main reason of this layer is to avoid over-fitting and generalizing the model.

2.6 Fully connected layer

The neuron of this layer is connected to every neuron of the previous layer which is conventional to the hidden layer of a multi-layer neural network.

3 Proposed light weighted CNN

The paper proposes a new architecture of the convolutional neural network for the histopathological image classification as depicted in Fig. 5. The presented CNN model contains 01 input layer, 05 subsequent blocks of convolution layers, drop-out layer and max-pooling layer, and 01 fully connected layer. In complete CNN model, there are 16 convolutional layers, 05 dropout layers, 05 max-pooling layers, and 01 fully connected layer. As shown in Fig. 5, the first layer is the input layer, containing 150 × 150 × 3 neurons. The number of neurons in the input layer is generally equals to number of pixels in the input image. In this work, each input color image contains three channels, each of size 150 × 150. The input layer is followed by first block, containing four subsequent layers of convolution operation, 01 drop-out layer and 01 max-pooling layer. Each convolutional layer of first block consists of 16 filters of size (3 × 3) with activation function as ReLU and same padding. To overcome the problem of over-fitting, the sequence of convolution operations is followed by a drop-out layer with a significant probability (0.3). The drop-out layer is further connected to max-pooling layer with filter size of (3 × 3). The max-pooling layer is used to reduce the dimensions of the feature maps, generated by the convolution operations.

The output of first block is given to next block which also contains four convolutional layers with 32 filters of size (3 × 3), the drop-out layer with probability 0.2 and max-pooling layer. In the next block, similar four convolutional layers have been used with 64 filters of size (3 × 3), followed by the drop-out layer with probability of 0.1 and max-pooling layer. Then, the fourth block contains three convolutional layers with 128 filters of size (3 × 3), drop-out layer with probability of 0.05, and max-pooling layer. The output of this layer is used by the last block of single convolutional layer, carrying 256 filters of size (3 × 3), a drop-out layer with 0.05 probability, and a max-pooling layer. Lastly, a dense layer with activation function as softmax is used to perform the classification task. For illustration, Fig. 5 represents the architecture of the proposed convolutional neural network. In the proposed model, the drop-out probability is reduced from 0.3 to 0.05 as dependencies generally occur at the initial layers which cause the over-fitting problem. Furthermore, the number of filters are also varied from first block to last block to capture the significant feature map.

4 Experimental results

4.1 Considered dataset

This paper uses the colorectal cancer histology dataset which is made publicly available by Kather et al. [29]. The dataset consists of histopathological images of human patients with colorectal cancer and represents different texture patterns. The dataset consists of eight categories, namely stroma, debris, adipose, mucose, tumor, lympho, complex, and empty. This dataset is a collection of RGB colored images with 0.495 µm per pixel, captured at the magnification of 20 ×, and digitized with an Aperio ScanScope (Aperio/Leica biosystems) [29].

4.2 Results

To validate the performance of the proposed CNN model, a confusion matrix, generated by it, is shown in Fig. 6. In the confusion matrix, x-axis represents the predicted labels and y-axis depicts the true labels. As there are eight classes of images, 8 × 8 size confusion matrix is generated. From the confusion matrix, it can be visualized that for the classes mucosa, tumor, and debris, the classification accuracy is greater than 90%. From stroma, complex, and adipose classes, 84%, 77%, and 74% respectively images are correctly identified. However, for empty and lympho classes, the prediction is lower than 70% due to various variations available in the images. Moreover, to judge the efficiency of the proposed method, precision, recall, F1-score, and support measures are computed and presented in Table 1. From the table, it can be seen that the minimum precision is 0.55 for complex class while the highest is 0.97 for mucosa class. Similarly, other parameters values are good for all classes as maximum parameters values are greater than 70% which signify the efficiency of the proposed CNN.

Table 1 Performance of the proposed CNN with respect to precision, recall, F1-score, and support

Full size table

To compare the performance of the proposed method, four classifiers are considered, namely 1-nearest neighbor (1-NN), linear basis function support vector machine (linSVM), radial basis function support vector machine (rbfSVM), and ensemble of decision trees (ensTree). As stated, a machine learning model learns from a set of extracted features from the input dataset rather than the input directly. In this paper, different feature extraction methods are considered, namely higher-order histogram features (HOHF), local binary patterns (LBP), gray-level co-occurrence matrix (GLCM), Gabor filters (GF), and perception-like features (PF). Therefore, the comparison models are named accordingly i.e., 1-NN-HOHF, ensTree-HOHF, linSVM-HOHF, and rbfSVM-HOHF for higher-order histogram features (HOHF). Similarly, other names are presented in association with classifiers and respective feature extraction methods which give a total 20 methods for comparison. For performance analysis among the proposed and considered methods, the classification error has been computed on the same dataset. Table 2 tabulates the classification error of the proposed CNN against the considered models. Since, the comparison models are deterministic, the results are taken from [7]. It can be observed from the table that if the classifier is 1-NN, then for HOHF features, it shows the best performance with error rate 35.6%. For ensTree classifier, the features extracted from GLCM provide the best error rate of 40.9%. In case of linSVM, LBP features give 24.6% error rate which is least among other feature extraction methods. Similarly, for rbfSVM, LBP features show the minimum error rate of 23.8%. The worst performance of 52.4% error rate is given by 1-NN classifier with PF features. This signifies that no single feature extraction method can give the optimum features and different classifiers give variations in the classification performance. That’s why, deep learning methods are preferred to classify the histopathological images. From the table, it can be visualized that the proposed CNN achieves the lowest classification error i.e., 22.7% among all other methods. For more visual analysis, the error rates, generated by various methods are depicted in bar graphs as shown in Fig. 7. From the bar graphs also, it can be seen that the proposed CNN has the smallest bars as compared to all the four classifiers and respective feature extraction methods. Therefore, it can be stated that the proposed CNN may serve as an alternative solution for the histopathological image classification.

Table 2 Classification error of the proposed CNN and considered machine learning methods

Full size table

5 Conclusion

This paper presents a new architecture for the convolutional neural network for the classification of the histopathological images. The proposed convolutional neural network has been defined with multiple combination of convolutional layer, activation layer, max-pooling layer, drop-out layer, and dense layer. The experimental analysis of the proposed convolutional neural network has been conducted for the colorectal cancer histology dataset which is publicly available. The dataset contains RGB colored images, having eight classes. The performance has been analyzed in terms of precision, F1-score, recall, support, confusion matrix, and classification error. For fair analysis, the proposed method has been compared with 20 different methods. The comparative methods are created using different existing machine learning models which works on manually extracted features. From the experimental results, it can be visualized that the proposed convolutional neural network provides the lowest error rate of 22.7% as compared to other considered methods. In future, different layers and their combinations may be considered for the improvement.

References

Gurcan MN, Boucheron L, Can A, Madabhushi A, Rajpoot N, Yener B (2009) Histopathological image analysis: a review. IEEE Rev Biomed Eng 2:147
Article Google Scholar
Mittal H, Saraswat M (2019) Classification of histopathological images through bag-of-visual-words and gravitational search algorithm. In: Lecturer notes of soft computing for problem solving. Springer, pp 231–241
Mittal H, Saraswat M (2019) An automatic nuclei segmentation method using intelligent gravitational search algorithm based superpixel clustering. Swarm Evol Comput 45:15–32
Article Google Scholar
Pal R, Saraswat M (2019) Histopathological image classification using enhanced bag-of-feature with spiral biogeography-based optimization. Appl Intell 49:3406–3424
Article Google Scholar
Saraswat M, Arya K (2014) Automated microscopic image analysis for leukocytes identification: a survey. Micron 65:20–33
Article Google Scholar
Rachapudi V, Devi GL (2019) Feature selection for histopathological image classification using levy flight salp swarm optimizer. Recent Patents Comput Sci 12:329. https://doi.org/10.2174/2213275912666181210165129
Article Google Scholar
Kather JN, Weis C-A, Bianconi F, Melchers SM, Schad LR, Gaiser T, Marx A, Zöllner FG (2016) Multi-class texture analysis in colorectal cancer histology. Sci Rep 6:27988
Article Google Scholar
Pal R, Saraswat M (2018) Enhanced bag of features using alexnet and improved biogeography-based optimization for histopathological image analysis. In: 2018 eleventh international conference on contemporary computing (IC3). IEEE, pp 1–6
Bengio Yoshua VP, Aaron C (2013) Representation learning: a review and new perspectives. IEEE Trans Pattern Anal Mach Intell 35(8):1798–1828
Article Google Scholar
Zhang Y, Zhang B, Coenen F, Lu W (2013) Breast cancer diagnosis from biopsy images with highly reliable random subspace classifier ensembles. Mach Vis Appl 24:1405–1420
Article Google Scholar
Kowal M, Filipczuk P, Obuchowicz A, Korbicz J, Monczak R (2013) Computer-aided diagnosis of breast cancer based on fine needle biopsy microscopic images. Comput Biol Med 43:1563–1572
Article Google Scholar
Filipczuk P, Fevens T, Krzyzak A, Monczak R (2013) Computer aided breast cancer diagnosis based on the analysis of cytological images of fine needle biopsies. IEEE Trans Med Imaging 32:2169–2178
Article Google Scholar
Asri H, Mousannif H, Moatassime HA, Noel T (2016) Using machine learning algorithms for breast cancer risk prediction and diagnosis. Procedia Comput Sci 83:1064–1069
Article Google Scholar
Bengio PVY, Courville A (2013) Representation learning: a review and new perspectives. IEEE Trans Pattern Anal Mach Intell 35:1798–1828
Article Google Scholar
Cireşan DC, Giusti A, Gambardella LM, Schmidhuber J (2013) Mitosis detection in breast cancer histology images with deep neural networks. In: International conference on medical image computing and computer-assisted intervention. Springer, pp 411–418
Maqlin P, Thamburaj R, Mammen JJ, Manipadam MT (2015) Automated nuclear pleomorphism scoring in breast cancer histopathology images using deep neural networks. In: International conference on mining intelligence and knowledge exploration. Springer, pp 269–276
Yıldırım O, Pławiak P, Tan R-S, Acharya UR (2018) Arrhythmia detection using deep convolutional neural network with long duration ECG signals. Comput Biol Med 102:411–420
Article Google Scholar
Oh SL, Ng EY, San Tan R, Acharya UR (2019) Automated beat-wise arrhythmia diagnosis using modified u-net on extended electrocardiographic recordings with heterogeneous arrhythmia types. Comput Biol Med 105:92–101
Article Google Scholar
Baloglu UB, Talo M, Yildirim O, San Tan R, Acharya UR (2019) Classification of myocardial infarction with multi-lead ECG signals and deep CNN. Pattern Recognit Lett 122:23–30
Article Google Scholar
Talo M, Baloglu UB, Yıldırım O, Acharya UR (2019) Application of deep transfer learning for automated brain abnormality classification using MR images. Cognit Syst Res 54:176–188
Article Google Scholar
Han Z, Wei B, Zheng Y, Yin Y, Li K, Li S (2017) Breast cancer multi-classification from histopathological images with structured deep learning model. Sci Rep 7(1):4172
Article Google Scholar
Lo S-C, Lou S-L, Lin J-S, Freedman MT, Chien MV, Mun SK (1995) Artificial convolution neural network techniques and applications for lung nodule detection. IEEE Trans Med Imaging 14(4):711–718
Article Google Scholar
LeCun Y, Bottou L, Bengio Y, Haffner P et al (1998) Gradient-based learning applied to document recognition. Proc IEEE 86(11):2278–2324
Article Google Scholar
Saha M, Chakraborty C, Racoceanu D (2018) Efficient deep learning model for mitosis detection using breast histopathology images. Comput Med Imaging Graph 64:29–40
Article Google Scholar
Zheng Y, Jiang Z, Xie F, Zhang H, Ma Y, Shi H, Zhao Y (2017) Feature extraction from histopathological images based on nucleus-guided convolutional neural network for breast lesion classification. Pattern Recognit 71:14–25
Article Google Scholar
Litjens G, Kooi T, Bejnordi BE, Setio AAA, Ciompi F, Ghafoorian M, Van Der Laak JA, Van Ginneken B, Sánchez CI (2017) A survey on deep learning in medical image analysis. Med Image Anal 42:60–88
Article Google Scholar
A beginner’s guide to understanding convolutional neuralnetworks adit deshpande engineering at forward|-ucla cs’19. https://adeshpande3.github.io/A-Beginner%27s-Guide-To-Understanding-Convolutional-Neural-Networks/. Accessed 13 July 2019
An intuitive guide to convolutional neural networks. https://www.freecodecamp.org/news/an-intuitive-guide-to-convolutional-neural-networks-260c2de0a050/. Accessed 13 July 2019
Collection of textures in colorectal cancer histology | zenodo. https://zenodo.org/record/53169#.XShERZMzbq1. Accessed 13 July 2019

Download references

Author information

Authors and Affiliations

Department of Computer Science and Engineering, Koneru Lakshmaiah Education Foundation, Vaddeswaram, Guntur, A.P., India
Venubabu Rachapudi
Department of Computer Science and Systems Engineering, AUCE(A), Andhra University, Visakhapatnam, India
G. Lavanya Devi

Authors

Venubabu Rachapudi
View author publications
You can also search for this author in PubMed Google Scholar
G. Lavanya Devi
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Venubabu Rachapudi.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Rachapudi, V., Lavanya Devi, G. Improved convolutional neural network based histopathological image classification. Evol. Intel. 14, 1337–1343 (2021). https://doi.org/10.1007/s12065-020-00367-y

Download citation

Received: 21 August 2019
Revised: 02 December 2019
Accepted: 09 February 2020
Published: 18 February 2020
Issue Date: September 2021
DOI: https://doi.org/10.1007/s12065-020-00367-y

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Improved convolutional neural network based histopathological image classification

Abstract

Similar content being viewed by others

A Shallow Convolutional Neural Network Model for Breast Cancer Histopathology Image Classification

Classification of Breast Cancer Histopathological Images using Convolutional Neural Networks with Hierarchical Loss and Global Pooling

Breast Cancer Histology Image Classification Based on Deep Neural Networks

1 Introduction