Abstract
Glaucoma detection is an important research area in intelligent system and it plays an important role to medical field. Glaucoma can give rise to an irreversible blindness due to lack of proper diagnosis. Doctors need to perform many tests to diagnosis this threatening disease. It requires a lot of time and expense. Sometime affected people may not have any vision loss, at the early stage of glaucoma. For detecting glaucoma, we have built a model to lessen the time and cost. Our work introduces a CNN based Inception V3 model. We used total 6072 images. Among this image 2336 were glaucomatous and 3736 were normal fundus image. For training our model we took 5460 images and for testing we took 612 images. After that we obtained an accuracy of 0.8529 and a value of 0.9387 for AUC. For comparison, we used DenseNet121 and ResNet50 algorithm and got an accuracy of 0.8153 and 0.7761 respectively.
Access provided by Autonomous University of Puebla. Download conference paper PDF
Similar content being viewed by others
Keywords
1 Introduction
Glaucoma is a complicated disease that damages optical nerve and causes irreversible blindness due to lack of proper diagnosis. This “sneak thief of sight” can affect anyone at any age. Even newborn babies can be affected. According to doctors, it grows in such a manner that patient does not experience any complication. By 2040, the number of glaucoma affected people likely to increase 111.8 million [2]. Glaucoma causes vision loss and blindness due to damage of optic nerve. Our optic nerve provides visual sensation to our brain from both eyes. We know, eyes continuously make aqueous humor and it fills the front part of eyes. If the drainage channels of aqueous humor are blocked, the IOP (Intraocular pressure) increases and optic nerve may become permanently damaged. There are five major types of glaucoma: Open angle glaucoma, Angle Closure Glaucoma, Congenital Glaucoma, Normal Tension Glaucoma and Secondary Glaucoma. For glaucoma diagnosis, ophthalmologist need to perform a comprehensive examination of eye, including Tonometry, Gonioscopy, Ophthalmoscopy, Nerve fiber analysis and Perimetry. These diagnosis procedures are expensive and also time consuming. So, to cope with this great ocular problem, a CNN architecture will be approached for glaucoma detection.
Glaucoma is one of the most dangerous causes of blindness. Sometimes patient have no symptoms and the vision may remain 6/6 till late stage. Actually, there are no specific symptoms during early stage. Some patient may never have increased Intra-ocular pressure. Besides some patient with high Intra-ocular pressure may not diagnose glaucoma, which is called ocular hyper tension. Early detection of glaucoma associated with immediate treatment that has been shown to prevent major problems.
Vision loss caused by glaucoma is not reversible with treatment, even surgery cannot help to recover it. In USA, after cardiac attack and cancer, blindness is the third formidable health complication. Only the better awareness could prevent permanent visual disability. In this era of artificial intelligence, automated health care system has the capability to identify diseases within a short period. In order to serve the medical community, deep learning algorithm will help to detect glaucoma. Work flow for diagnosis glaucoma will be faster than the regular one. So affected people will get proper treatment during first stage of glaucoma. However, glaucoma is preventable if it is diagnosed early and effective treatment is provided. That is what motivated us to conduct this thesis.
2 Problem Statement
Eyes are important sensory organs that provides sight. Some parts of eyes are: cornea, sclera, choroid, iris, pupil, lens, ciliary muscle, suspensory ligament, conjunctiva, anterior chamber (between cornea and iris), posterior chamber (between iris and lens), macula, vitreous humour, aqueous humour, hyaloid canal, retina, optic nerve, optic disc, blood vessels, fovea.
Glaucoma is referred as an eye disease that damage optic nerve and cause vision loss. Optica nerve carries information that we can see through eye to brain. Optic nerve head is called optic disc, it connects retina and optic nerve. The center of optic disc is called optic cup. When the optic cup enlarges and occupy more area of optical disc then the cup to disc ratio (CDR) increases. When the cup to disc ratio is greater than normal range, the patient’s eye is suspected as glaucomatous eye. Doctors need to perform many tests such as: Ophthalmic Test, Tonometry, Ophthalmoscopy, Perimetry, Pachymetry, Gonioscopy. After getting results from different test, doctor have to decide whether it is a glaucomatous eye or not. Careful evolution is important to detect glaucoma and there is a high chance of not getting accurate result due to lack of skill. This work proposes an efficient method for detecting glaucoma which will lessen time and costs [25] at the same time in order to facilitate ophthalmologists and optometrists.
-
\(\Rightarrow \) An automated system for glaucoma diagnosis.
-
\(\Rightarrow \) Applied augmentation technique for getting varied images.
-
\(\Rightarrow \) Used large amount of image data.
-
\(\Rightarrow \) Collected images from different available data sets.
-
\(\Rightarrow \) Compared with other popular CNN methods.
3 Literature Review
O. J. Afolabi et al. [5] introduced a redesigned U-Net model named U-Net Lite and XGB (extreme gradient boost) algorithm. From RIM ONE V3 and DRISTI-GS the extreme gradient boost algorithm achieved an accuracy of 88.6 and an AUC-ROC value of 93.6. Chaudhary P. K, and Pachori R. B. [10] has proposed the order zero and order one 2D-FBSE-EWT (two dimensional Fourier-Bessel series expansion based empirical wavelet transform) methods at quarter, half and full frequency scales which are used for disintegrating fundus image into consequential sub-images. Then from obtained sub-images, proposed method 1: a conventional ML based method and proposed method 2: an ensemble ResNet50 based method, are studied for detection. In this paper [23], S Pathan et al. proposed image processing methods are used to define an automated framework for Computer Aided Diagnosis (CAD) of glaucoma. Here pre-processing algorithm includes the identification and exclusion of blood vessels for effective OD and OC segmentation. The use of a decision tree classifier and a circle finder approach helped in robust OD segmentation. The proposed OC segmentation method aims to enhance the OC region by creating a new channel due to reduced variability between the pixels of OD and OC. The obtained threshold value for the segmentation algorithms is not limited to a single dataset. Feature extraction requires domain knowledge of glaucoma, such as the CDR and NRR area, as well as statistical color and texture features. The classifiers used for classification are SVM, ANN, and AdaBoost classifier ensemble with dynamic selection methods for identifying fundus images whether it is affected or not. A ten-fold cross validation is also performed for the ensemble of AdaBoost classifiers with dynamic selection methods, SVM, and ANN. In this paper, Mufti Mahmud et al. [20] stated that overnight advances in hardware based technologies during the previous many years have opened up additional opportunities for life researchers to assemble multimodal data in different applications, for example, omics, clinical imaging, bioimaging and (cerebrum/body)- machine interfaces which have created novel freedoms for advancement of devoted information escalated AI strategies. Specifically, recent research in reinforcement learning, deep learning, and their combination promise to advance the future of AI. Mufti Mahmud et al.discussed about different CNN architectures [19] and also stated that diverse biological data from various application domains is multi-modal, multidimensional, and complex in nature. The author included that “Currently, a massive amount of such data is publicly available”. The availability of these data came with a significant challenge in analyzing and recognizing patterns in them, which necessitated the use of sophisticated machine learning [21] tools. In paper [28] Saxena et al. proposed an architecture that differentiates between the patterns for glaucoma and non-glaucoma using of the CNN. The total work was evaluated within six layers. Authors used ROI extraction, dropout, data-augmentation for preprocessing of data. For the experiments, authors used SCES and ORIGA data set. They got .822 and .882 values for the ORIGA and SCES data set respectively. In paper [22] Palakvangsa-Na-Ayudhya et al. proposed an automated system using Mask Regional - Convolutional Neural Network [32]. It is an advancement of Faster R-CNN by joining a branch for predicting segmented masks on each ROI along with the existing branch for classify an object and bounding box regression. This automatic screening system calculates CDR. They used four datasets: Drishti GS1 and RIMONE (r1, r2, r3). They used four datasets individually and also in a combined manner. They got values for 50 epochs of Individual dataset: (RIM One r3 0.66, Drishti-GS1 0.73, RIM One r1 0.74, RIM One r2 0.78) and for 100 epochs (RIM One r3 0.68, Drishti-GS1 0.75, RIM One r1 0.75, RIM One r2 0.85). With the computational time of 8 h, 4 h and 2 h they obtained 0.68(400 epoch), 0.71(200 epoch), 0.64(100 epoch) respectively for combined dataset. Then they set up the epochs to 200 with the 10-fold cross validation and achieved accuracy of 0.78. Pinos-Velez et al. [24] diagnosed glaucoma by the using of ISNT rule. In a normal eye CDR ratio is below 0.3. ISNT rule was used for measure the width of retinal rim. Juneja, M et al. proposed [17] an approach based on deep learning [18] which is disc cup segmentation glaucoma network (DC-GNet). This segmentation network extracts the CDR, DDLS and ISNT feature from fundus images. The input images to the CNN model were cropped to 512–512 pixels and resized to 256–256 pixels. This network has 28 layers: pooling layers, drop out layers, 2D convolutional layers and up sampling layers. An accuracy of 0.937 (Dristi dataset) and 0.996 (RIM One dataset) were achieved from segmentation of disc. And from cup segmentation technique they got an accuracy of 0.900 (Dristi dataset) and 0.978 (RIM One dataset). Debasree Sarkar and Soumen Das [27] proposed a method which used media filter for noise reduction. Thresholding is applied to extract to OD (optic disc) and OC (optic cup). By using RIM-ONE data set they got an accuracy of 97.58. A. Serener and S. Serte proposed a system [29] detects early and advanced glaucoma automatically. They applied ResNet50 and GoogLeNet algorithm and got an accuracy of 79 and 83 respectively.
4 Method
According to Fig. 1, After collecting fundus images, divided into two set: training and testing images. We have trained our model after applying augmentation techniques. During the training time we took 600 images for validating our model from training images. Then evaluated it using test images.
4.1 Data Collection
For this work, we have collected images from ACRIMA dataset [1], LAG dataset [3] and Glaucoma Data set and combined them.
4.2 Data Augmentation
Data augmentation is a process which helps to increase the diversity of data for training a model without gathering new data. It acts as a regularizer. It enhance the performance of the model [14]. It helps to avoid over-fitting problem. Neural network treats augmented images as distinct images. The deep learning neural network library of Keras provides the facility of data augmentation. We augmented our data using ImageDataGenerator class. We applied rotation, width shift, height shift, zooming, sheer, channel shift and horizontal shift. After applying the augmentation [30] technique more image data were generated. We used data augmentation technique [12] only for our training dataset. And for evaluating our model we used original images rather than augmented images. Shifting of an image means moving all pixels in one direction. Two types of shift can be done width shift and height shift. Shifting helps us to change the position of an object. Flipping of an image means reversing the columns or rows of pixels in case of horizontal or vertical flip respectively. It is similar to rotating an object left to right or up to down. Rotation is done by rotating an image clockwise or anticlockwise within 0 to 360\(^{\circ }\). In zooming technique, images are either zooms in or zooms out. Value less than 1 used to zoom in and greater than 1 zooms outs an image and value equals to 1 does not have any effect. Shearing of an image means shifting a specific part of the image like a parallelogram. In shear one axis remains fixed. In channel shift, RGB channel values are shifted randomly.
Figure 2 shows an original fundus image and images after applying flipping, shearing, rotating, shifting, zooming and channel shifting on the original image.
4.3 Inception V3
Inception V3 [31] is a CNN pre trained model [9]. It is computationally more efficient and focuses on using less computational power. It is a multi-level feature extractor. Inception V3 model is a collection of symmetric and asymmetric building blocks. It includes convolution, max pooling, average pooling, dropouts, concats and fully connected layers. By using Softmax, loss is computed. A schematic diagram is given below: [16]. We have collected images according to Table 1, then applied augmentation technique [7] according to Sect. 4.2 for getting varied fundus images. We trained Inception V3 model using augmented training dataset. Our model has total 312 layers: 1 input layer, 94 Cov2d layer, 94 batch normalization layer, 94 activation layer, 11 mixed layer, 8 average pooling layer, 4 max pooling layers, 2 concatenate layer, 3 global average pooling layer, 1 dense layer. We evaluated our model using test fundus images. Finally, our model will able to detect fundus images whether it is normal or glaucomatous.
5 Results
5.1 Evaluation Criteria
There are different performance [13] matrices for evaluating a model. In this work we utilize Confusion matrix, Accuracy, Precision, Recall, Specificity, F1 score to evaluate the performance. Confusion matrix gives a clear idea of values like True Positives, False Positives, True Negatives and False Negative.
-
True Negative (TN): When the actual value was negative and predicted negative.
-
True Positive (TP): When the actual value was positive and predicted positive.
-
False Negative (FN): When the actual value was positive but predicted negative.
-
False Positive (FP): When the actual value was negative but predicted positive.
5.2 Comparison of Different Types of CNN Model
Area under curve given in Fig. 4 measures the capability of a classifier to separate between classes. Higher the value of AUC better the classifier and its performance. AUC range value lies between 0 to 1. It is an important evaluation criterion. We can notice that Fig. 3(a) has the higher AUC value for Inception v3 which is 0.9387.
From Table 2 and Fig. 5, we can notice that due to uneven class distribution, precision value (normal class) and recall value (glaucoma class) of DenseNet121 has highest value than other two models. We know F1 score is called the weighted average of precision and recall. So, for coping with this uneven class we should consider F1 score rather than precision and recall value. According to Table 2, Fig. 4, Fig. 5 we can say Inception V3 model has highest test accuracy, AUC value and F1 score. So Inception V3 is the best classifier for this problem.
6 Conclusion
Glaucoma is complication that is associated with the damage of optic nerve and causes permanent blindness. This approach to medical image processing technology [26] will enlarge the application of detecting glaucoma. This thesis work will lead to the computer-generated result to improve the clinician’s judgment standard of glaucoma detection. This model can detect more normal fundus images than glaucomatous image due to higher number of normal fundus images in dataset. We got less accuracy than other previous work as we took a huge amount of data than other. Besides our dataset is a collection of different publicly available dataset.
Though optic disc is the brightest part of the fundus, we did not use multi-level segmentation [4] technique. Also, our target classes were not equally distributed with positive and negative samples.
In future we will train this model using good quality images and for coping with data imbalance issue we will introduce resampling technique to our model. We have a plan to build different integrated models to improve the detection of glaucoma. For the integration, we will use different algorithms and techniques along with Inception V3 like CNN, RNN, LSTM [11], deep learning [6], belief rule base [8, 15], etc. Besides we plan to extend our study of convolutional neural network to multiple ocular diseases detection like cataract, retinal detachment, diabetic retinopathy.
References
figshare. https://rb.gy/vkuly5. Accessed 21 Apr 2021
World Glaucoma Association. https://www.glaucomapatients.org/basic/statistics/. Accessed 21 Apr 2021
Kaggle. https://www.kaggle.com/sreeharims/glaucoma-dataset. Accessed 21 Apr 2021
Abedin, M.Z., Nath, A.C., Dhar, P., Deb, K., Hossain, M.S.: License plate recognition system based on contour properties and deep learning model. In: 2017 IEEE Region 10 Humanitarian Technology Conference (R10-HTC), pp. 590–593. IEEE (2017)
Afolabi, O.J., Mabuza-Hocquet, G.P., Nelwamondo, F.V., Paul, B.S.: The use of U-Net lite and extreme gradient boost (XGB) for glaucoma detection. IEEE Access 9, 47411–47424 (2021). https://doi.org/10.1109/ACCESS.2021.3068204
Ahmed, T.U., Hossain, M.S., Alam, M.J., Andersson, K.: An integrated CNN-RNN framework to assess road crack. In: 2019 22nd International Conference on Computer and Information Technology (ICCIT), pp. 1–6. IEEE (2019)
Ahmed, T.U., Hossain, S., Hossain, M.S., ul Islam, R., Andersson, K.: Facial expression recognition using convolutional neural network with data augmentation. In: 2019 Joint 8th International Conference on Informatics, Electronics & Vision (ICIEV) and 2019 3rd International Conference on Imaging, Vision & Pattern Recognition (icIVPR), pp. 336–341. IEEE (2019)
Ahmed, T.U., Jamil, M.N., Hossain, M.S., Andersson, K., Hossain, M.S.: An integrated real-time deep learning and belief rule base intelligent system to assess facial expression under uncertainty. In: 2020 Joint 9th International Conference on Informatics, Electronics & Vision (ICIEV) and 2020 4th International Conference on Imaging, Vision & Pattern Recognition (icIVPR), pp. 1–6. IEEE (2020)
Basnin, N., Nahar, L., Hossain, M.S.: An integrated CNN-LSTM model for micro hand gesture recognition. In: Vasant, P., Zelinka, I., Weber, G.-W. (eds.) ICO 2020. AISC, vol. 1324, pp. 379–392. Springer, Cham (2021). https://doi.org/10.1007/978-3-030-68154-8_35
Chaudhary, P.K., Pachori, R.B.: Automatic diagnosis of glaucoma using two-dimensional Fourier-Bessel series expansion based empirical wavelet transform. Biomed. Signal Process. Control 64, 102237 (2021)
Chowdhury, R.R., Hossain, M.S., Hossain, S., Andersson, K.: Analyzing sentiment of movie reviews in bangla by applying machine learning techniques. In: 2019 International Conference on Bangla Speech and Language Processing (ICBSLP), pp. 1–6. IEEE (2019)
Chowdhury, R.R., Hossain, M.S., ul Islam, R., Andersson, K., Hossain, S.: Bangla handwritten character recognition using convolutional neural network with data augmentation. In: 2019 Joint 8th International Conference on Informatics, Electronics & Vision (ICIEV) and 2019 3rd International Conference on Imaging, Vision & Pattern Recognition (icIVPR), pp. 318–323. IEEE (2019)
Hossain, E., Shariff, M.A.U., Hossain, M.S., Andersson, K.: A novel deep learning approach to predict air quality index. In: Kaiser, M.S., Bandyopadhyay, A., Mahmud, M., Ray, K. (eds.) Proceedings of International Conference on Trends in Computational and Cognitive Engineering. AISC, vol. 1309, pp. 367–381. Springer, Singapore (2021). https://doi.org/10.1007/978-981-33-4673-4_29
Islam, M.Z., Hossain, M.S., ul Islam, R., Andersson, K.: Static hand gesture recognition using convolutional neural network with data augmentation. In: 2019 Joint 8th International Conference on Informatics, Electronics & Vision (ICIEV) and 2019 3rd International Conference on Imaging, Vision & Pattern Recognition (icIVPR), pp. 324–329. IEEE (2019)
Islam, R.U., Hossain, M.S., Andersson, K.: A deep learning inspired belief rule-based expert system. IEEE Access 8, 190637–190651 (2020)
Jon Shlens, S.R.S.: https://ai.googleblog.com/2016/03/train-your-own-image-classifier-with.html. Accessed 21 Apr 2021
Juneja, M., Thakur, S., Wani, A., Uniyal, A., Thakur, N., Jindal, P.: DC-Gnet for detection of glaucoma in retinal fundus imaging. Mach. Vis. Appl. 31, 1–14 (2020). https://doi.org/10.1007/s00138-020-01085-2
Kabir, S., Islam, R.U., Hossain, M.S., Andersson, K.: An integrated approach of belief rule base and deep learning to predict air pollution. Sensors 20(7), 1956 (2020)
Mahmud, M., Kaiser, M.S., McGinnity, T.M., Hussain, A.: Deep learning in mining biological data. Cogn. Comput. 13(1), 1–33 (2020). https://doi.org/10.1007/s12559-020-09773-x
Mahmud, M., Kaiser, M.S., Hussain, A., Vassanelli, S.: Applications of deep learning and reinforcement learning to biological data. IEEE Trans. Neural Netw. Learn. Syst. 29(6), 2063–2079 (2018). https://doi.org/10.1109/TNNLS.2018.2790388
Nahar, N., Hossain, M.S., Andersson, K.: A machine learning based fall detection for elderly people with neurodegenerative disorders. In: Mahmud, M., Vassanelli, S., Kaiser, M.S., Zhong, N. (eds.) BI 2020. LNCS (LNAI), vol. 12241, pp. 194–203. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-59277-6_18
Palakvangsa-Na-Ayudhya, S., Sapthamrong, T., Sunthornwutthikrai, K., Sakiyalak, D.: Glaucoviz: assisting system for early glaucoma detection using mask R-CNN. In: 2020 17th International Conference on Electrical Engineering/Electronics, Computer, Telecommunications and Information Technology (ECTI-CON), pp. 364–367. IEEE (2020)
Pathan, S., Kumar, P., Pai, R.M., Bhandary, S.V.: Automated segmentation and classification of retinal features for glaucoma diagnosis. Biomed. Signal Process. Control 63, 102244 (2021)
Pinos-Velez, E., Flores-Rivera, M., Ipanque-Alama, W., Herrera-Alvarez, D., Chacon, C., Serpa-Andrade, L.: Implementation of support tools for the presumptive diagnosis of glaucoma through identification and processing of medical images of the human eye. In: 2018 IEEE International Systems Engineering Symposium (ISSE), pp. 1–5. IEEE (2018)
Rahaman, S., Hossain, M.S.: A belief rule based clinical decision support system to assess suspicion of heart failure from signs, symptoms and risk factors. In: 2013 International Conference on Informatics, Electronics and Vision (ICIEV), pp. 1–6. IEEE (2013)
Rezaoana, N., Hossain, M.S., Andersson, K.: Detection and classification of skin cancer by using a parallel CNN model. In: 2020 IEEE International Women in Engineering (WIE) Conference on Electrical and Computer Engineering (WIECON-ECE), pp. 380–386. IEEE (2020)
Sarkar, D., Das, S.: Automated glaucoma detection of medical image using biogeography based optimization. In: Bhattacharya, I., Chakrabarti, S., Reehal, H.S., Lakshminarayanan, V. (eds.) Advances in Optical Science and Engineering. SPP, vol. 194, pp. 381–388. Springer, Singapore (2017). https://doi.org/10.1007/978-981-10-3908-9_46
Saxena, A., Vyas, A., Parashar, L., Singh, U.: A glaucoma detection using convolutional neural network. In: 2020 International Conference on Electronics and Sustainable Communication Systems (ICESC), pp. 815–820. IEEE (2020)
Serener, A., Serte, S.: Transfer learning for early and advanced glaucoma detection with convolutional neural networks. In: 2019 Medical Technologies Congress (TIPTEKNO), pp. 1–4. IEEE (2019)
Shorten, C., Khoshgoftaar, T.M.: A survey on image data augmentation for deep learning. J. Big Data 6(1), 1–48 (2019)
Szegedy, C., Vanhoucke, V., Ioffe, S., Shlens, J., Wojna, Z.: Rethinking the inception architecture for computer vision. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2016)
Zisad, S.N., Hossain, M.S., Andersson, K.: Speech emotion recognition in neurological disorders using convolutional neural network. In: Mahmud, M., Vassanelli, S., Kaiser, M.S., Zhong, N. (eds.) BI 2020. LNCS (LNAI), vol. 12241, pp. 287–296. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-59277-6_26
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Switzerland AG
About this paper
Cite this paper
Afroze, T., Akther, S., Chowdhury, M.A., Hossain, E., Hossain, M.S., Andersson, K. (2021). Glaucoma Detection Using Inception Convolutional Neural Network V3. In: Mahmud, M., Kaiser, M.S., Kasabov, N., Iftekharuddin, K., Zhong, N. (eds) Applied Intelligence and Informatics. AII 2021. Communications in Computer and Information Science, vol 1435. Springer, Cham. https://doi.org/10.1007/978-3-030-82269-9_2
Download citation
DOI: https://doi.org/10.1007/978-3-030-82269-9_2
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-82268-2
Online ISBN: 978-3-030-82269-9
eBook Packages: Computer ScienceComputer Science (R0)