1 Introduction

Over the last ten years, there has been a significant increase in interest in biomedical research. The significant amount of clinical and healthcare data generated as a result of technological advancements in medical research may help to explain this. Healthcare professionals who use this data to improve patient care are immensely intrigued by its potential. This information essentially promotes improved illness diagnosis and, as a result, better healthcare services. The biomedical data come from many different sources, are widely accessible, and include a large spectrum of information. However, it is often impractical to handle vast volumes of data manually. Therefore, people use data mining techniques to enhance existing data analysis methods and extract more profound and valuable insights from the data. Real-world datasets have a variety of traits or properties. It is possible that not all of these characteristics will be necessary to extract useful data from the databases. When certain machine learning or data mining approaches process the data, the presence of non-informative properties in the data does not enhance the learning algorithm’s effectiveness. In reality, these characteristics may sometimes make the learning algorithm perform worse while also lengthening the training period. As a result, it is crucial to carefully choose the ideal collection of characteristics for a methodical approach. Through the use of fewer features and lower training costs, this endeavor seeks to improve and maintain the performance of the underlying learning algorithm. Furthermore, the reduced proportions of the features would necessitate a lower quantity of storage space. Feature selection (FS) is one of the data pre-processing methods often used in data mining and machine learning applications [1]. The sizeable amount of high-dimensional data produced by modern technologies has made FS a crucial step in the data pre-processing process. Duplicate, irrelevant, and noisy properties in high-dimensional data can severely impact the accuracy of classification. Scholars use it when datasets may contain duplicated and unimportant data. By removing superfluous features, FS reduces dimensionality and improves classification accuracy. We can describe it as an optimization issue, aiming to enhance or sustain classification performance by selecting the optimal feature collection. Basically, there are three different feature selection approaches [2]: filter model, wrapper model, and embedded model. When there are more features (characteristics), it becomes computationally expensive to use exhaustive subset search techniques to find the right feature subset. This can be attributed to the exponential increase in the number of potential feature subsets. For the dataset with N features, there are a total of 2N possible feature subset options. The FS procedure is a challenging combinatorial issue. We must use an FS approach to select the subset that exhibits the best performance. Research has demonstrated that metaheuristic algorithms excel in combinatorial tasks [3]. In order to quickly locate the almost ideal feature subset, the metaheuristic algorithms have the capacity to investigate 2N possible feature subsets.

The Branch and Bound method uses a monotonic evaluation function and has a low time complexity [4]; however, it cannot handle large amounts of data, and designing a suitable evaluation function is very difficult. Other methods that may be used in this situation include scatter search and greedy search [5, 6]. Many of these algorithms struggle to avoid local optima and incur significant computational burdens [7]. Because of their ability to find near-optimal solutions efficiently, FS methods based on evolutionary algorithms have gained popularity in recent years. Evolutionary algorithms use biological notions of evolution to provide solutions for optimization problems efficiently and reliably [8]. Each candidate solution is referred to as a "chromosome". A chromosome is composed of genes, and each gene determines the presence or absence of a particular feature: a gene takes the value 1 if the corresponding feature is selected and 0 if it is not. The complete set of candidate solutions forms the population, and each member of the population is called a candidate. A candidate's fitness value determines its quality: the higher the fitness, the better the candidate. Genetic operators such as crossover and mutation applied to a subset of candidates increase population diversity [9].
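For illustration, a generic wrapper-style fitness evaluation under this binary encoding might look like the sketch below. This is not the objective actually used in this work (the Rastrigin function, see Section 2.2.1); the k-NN classifier and the feature-count penalty weight are assumptions chosen only to make the idea concrete.

```python
import numpy as np
from sklearn.neighbors import KNeighborsClassifier
from sklearn.model_selection import cross_val_score

def fitness(chromosome, X, y, alpha=0.99):
    """Score a binary chromosome: 1 selects a feature, 0 drops it.

    The fitness rewards cross-validated accuracy and lightly penalizes
    the number of selected features (the weight 1 - alpha is an assumption).
    """
    mask = np.asarray(chromosome, dtype=bool)
    if not mask.any():                       # an empty subset is invalid
        return 0.0
    acc = cross_val_score(KNeighborsClassifier(n_neighbors=5),
                          X[:, mask], y, cv=5).mean()
    return alpha * acc + (1 - alpha) * (1 - mask.sum() / mask.size)

# Example usage: evaluate one random candidate on a dataset with N features.
# rng = np.random.default_rng(0)
# chromosome = rng.integers(0, 2, size=X.shape[1])
# print(fitness(chromosome, X, y))
```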

Consequently, researchers have applied evolutionary algorithms to diagnose various ailments, with the aim of improving patient care through efficient and timely prediction. Researchers [12] used the augmented shuffled frog leaping algorithm to predict illnesses such as lung cancer and colon tumors, among others. Similarly, [13, 14] covered the particle swarm optimization (PSO) technique for predicting lung cancer. Furthermore, [10] used the gravitational search algorithm (GSA) developed in [11] to forecast illnesses including breast cancer, heart disease, and dermatological problems. In recent years, the grasshopper optimization algorithm (GOA) [15] has become a very potent optimization tool. This technique replicates the natural foraging behaviour of a group of grasshoppers. The method is successful because it strikes a good balance between exploration and exploitation, thereby limiting trapping in local optima. The experimental results presented in [15] provide additional evidence that the GOA can increase or decrease the average fitness of a population of randomly generated search agents over the course of iterations, depending on whether the goal is to maximize or minimize fitness. Researchers also employ several widely recognized optimization algorithms, such as PSO [13], the Bat Algorithm [16], States of Matter Search [17], Cuckoo Search [18], the Flower Pollination Algorithm [19], the Firefly Algorithm [20], GSA [10], and the Genetic Algorithm [21, 22].

Meta-heuristic algorithms that draw their inspiration from nature have become well-known in recent years as effective answers to difficult real-world problems, showing remarkable performance and efficacy. These algorithms exploit the collective knowledge of a population to reach the best results. They include the Elephant Herd Optimizer [23], the Moth Search Algorithm [24], the Cuckoo Search Algorithm [25], Monarch Butterfly Optimization [26, 27], the Elephant Herding Optimization algorithm [28], Krill Herd [29] and the Teaching-Learning-Based algorithm [30]. Numerous optimization problems, including complex design problems, node localization in wireless sensor networks, fault diagnosis, economic load dispatch, high-performance computing, high-dimensional optimization problems, image matching, and the knapsack problem, are commonly solved using these algorithms [31,32,33,34,35,36,37,38,39]. These methods have proved dependable and efficient across a wide range of problems. A number of researchers have also applied stochastic approaches to feature selection, including genetic algorithms [21], simulated annealing [40], tabu search [41], the bacterial foraging optimization algorithm [42], and artificial bee colonies [43]. Researchers [44, 45] have used a fuzzy-logic-based record-to-record travel approach to address rough set attribute reduction problems. According to [46], attributes can be treated as graph nodes to build a graph model, and ant colony optimization can then be applied to choose the nodes and solve the feature selection problem. The authors of [47] proposed an FS method based on artificial bee colonies and differential evolution and verified it on fifteen datasets from the UCI collection. Previous research [48, 49] has used the Ant Lion Optimizer (ALO) as a feature selection model. The Flower Pollination Algorithm (FPA) [51], the Dragonfly Algorithm [52], and the Grey Wolf Optimizer (GWO) are modern algorithms that have effectively solved FS problems. Researchers have used a chaos-based salp swarm algorithm (SSA) to improve feature selection [50, 53], and have also employed a competent crossover strategy [55] to enhance the SSA's ability to handle the FS task. To find optimality, the work in [51] presented a hybrid GWO-ALO algorithm that combines the global search powers of GWO with the exploitation capabilities of ALO. Recent research [54] employed the whale optimization algorithm (WOA) as a feature selection technique, and scholars have combined simulated annealing (SA) with WOA [52] to investigate feature subsets and choose the best feature set. These findings add to the body of evidence showing that, when tested on benchmark image datasets, GWO, WOA, and their hybrids provide competitive results and are good at locating key characteristics. These factors have led to the use of these three techniques in this work for feature selection in the classification of glaucoma, a globally prevalent eye disease.

Glaucoma (Fig. 1) is a medical disorder that can damage the eye's optic nerve, and the condition worsens over time. Experts believe a large buildup of pressure inside the eye is its main cause: elevated intraocular pressure damages the optic nerve, which carries visual information to the brain. The increasing pressure can cause considerable damage leading to vision loss, including permanent impairment and, potentially, total blindness. Lost vision is difficult to regain, although lowering intraocular pressure can help prevent further loss. The condition frequently affects people in their 40s and older, with those aged 55 and above affected most commonly. Because glaucoma can take many different forms, it can be difficult to track how the disease is changing, and it may even advance undetected.

Fig. 1
figure 1

Figure depicting normal eye and glaucomatous eye

The eye contains a fluid known as aqueous humor, which in a healthy person drains out through a mesh-like duct. When this duct becomes blocked, the fluid the eye continuously produces accumulates over time. The cause of the channel obstruction is still under investigation, although it is believed to be hereditary and related to improper functioning of the drainage angle. The accumulation of fluid causes the pressure inside the eye, known as intraocular pressure (IOP), to rise, and this elevated pressure damages the optic nerve. Millions of extremely tiny nerve fibers make up the optic nerve; in many respects, it resembles an electric cable made up of countless tiny wires. As these nerve fibers deteriorate over time, a person becomes increasingly prone to developing blind spots that impair vision. These blind areas are rarely noticeable until a significant number of optic nerve fibers have been lost. Glaucoma can also arise from simple causes such as a minor wound, an infection, or other damage that obstructs blood vessels in the eye; in rare cases, it can even follow eye surgery performed to treat another problem.

Glaucoma, the second most common cause of blindness worldwide (after cataract), is a chronic disorder that poses a serious risk to ocular health [56]. The World Health Organization (WHO) estimates that glaucoma affects 65 million people globally [57]. It is sometimes called the "silent theft of sight" [58] because of its irreversible nature and the absence of symptoms in its early stages. Even though there is currently no known cure for glaucoma, early detection and appropriate care can greatly help patients avoid visual loss and lower their risk of becoming blind. Clinical settings commonly measure intraocular pressure (IOP) to detect glaucoma, since elevated IOP is a well-known glaucoma sign; the disorder can lead to optic nerve injury, visual field abnormalities, and eventually blindness [59]. Glaucoma evaluation therefore treats IOP as a crucial signal. The IOP of certain people with glaucoma, however, may remain within the normal range, making this technique ineffective [60]; relying solely on IOP measurement may therefore miss such cases. Another frequently used screening technique is the optic nerve head (ONH) examination, in which clinical ophthalmologists evaluate glaucoma using retinal images [61]. During screening, ophthalmologists routinely enhance the retinal image manually and make diagnoses based on their experience and domain-specific knowledge. The inefficiency and length of these diagnostic procedures make both approaches unsuitable for population screening. Developing an automated glaucoma screening system is therefore both highly advantageous and necessary for broad and early diagnosis. Advances in digital retinal image processing and artificial intelligence have made automated glaucoma screening feasible, and large-scale screening can benefit from this approach thanks to its reliable accuracy and efficiency. Pathological signs of glaucoma often include an enlarged optic cup and degradation of the neuroretinal rim [62, 63]; these abnormal signs originate at the ONH, so ONH evaluation is an important method for glaucoma screening. Automated glaucoma diagnostic techniques using fundus images fall into two main groups: clinical measurement analysis and image-based feature analysis. "Clinical measurement analysis" refers to the evaluation of geometric features related to glaucoma, such as the cup-to-disc ratio (CDR) [64], the diameter of the optic disc [65], and the area of the optic cup. The most important of these traits, as recognized by clinical ophthalmologists, is the CDR, which shows a strong link with glaucoma screening. An observer can quickly recognize the optic disc (OD) in a color retinal image because of its distinctive appearance as a bright yellow oval area; the optic cup (OC), located in the middle of the OD and also bright and oval or circular in shape, can be more difficult to see. The peripheral portion of the optic disc outside the optic cup is referred to as the neuroretinal rim. Based on clinical experience and domain knowledge, a larger CDR indicates a higher likelihood of glaucoma and a smaller CDR a lower likelihood, as shown in Fig. 1.
Researchers have developed numerous automated glaucoma diagnosis techniques, many of which rely on clinical traits such as the CDR. A few recent state-of-the-art studies and the most current methods developed for glaucoma prediction are listed in Table 1 below.

Table 1 Recent state-of-the-art approaches for glaucoma classification

Examining fundus images requires time-consuming and computationally complex processing of large amounts of data. Expert ophthalmologists analyze a subject's fundus images to confirm the disease; this observation takes time and is subject to error due to intra-observer variability. Moreover, there is always a scarcity of expert medical practitioners. Hence, an artificial-intelligence-based computer-aided diagnosis (CAD) system is needed today. To obtain better results from CAD systems, effective approaches require feature extraction, selection, and organization as critical components of comprehensive data analysis. Glaucoma is a complex illness, and many features have to be analyzed from the patient's retinal fundus image, which makes confirming this disease and distinguishing between healthy and glaucomatous eyes a very challenging task. The hybrid approach improves the accuracy and usefulness of detecting glaucoma in fundus images by combining Grey Wolf Optimization (GWO) and the Whale Optimization Algorithm (WOA), making it possible to create a reliable and highly efficient CAD system.

If this chronic illness is not found in its early stages, it can cause permanent blindness. Several manual screening techniques exist, but they are costly, time-consuming, and require the assistance of specialists. It is therefore critical to use image processing, feature selection, and machine-learning-based classification models to identify the illness sooner and prevent such detrimental effects. All of these procedures have been used in this work to classify images from the ORIGA benchmark dataset. Three nature-inspired metaheuristic algorithms—GWO, WOA, and their integration—are used in the fundamental step of selecting the most important features (the hybrid model is our original scientific contribution). The empirical study identifies the optimal and most effective features required for the diagnosis of glaucoma, a prevalent eye disease, and concludes by demonstrating exceptional performance and outcomes.

Using the most recent artificial intelligence techniques, researchers have attempted to develop highly effective diagnosis systems for eye-related disorders [104,105,106,107]. A recent article presents the Fundus-DeepNet system [104], in which the proposed approach uses deep learning to classify many eye illnesses by merging feature representations from two fundus images; the intricate Ophthalmic Image Analysis-Ocular Disease Intelligent Recognition (OIA-ODIR) dataset, which includes numerous fundus images displaying eight different eye illnesses, has undergone extensive testing. Ophthalmologists and other eye experts can diagnose and determine the appropriate course of treatment for a range of retinal disorders using the authors' suggested approach for segmenting and extracting clinically meaningful information from the retinal blood vessels [105]. Subsequent studies rigorously evaluate and categorize approaches for identifying veins and arteries in fundus images as either automatic or semi-automatic [106, 107].

1.1 Motivation and the novelty in the work

This study introduces a novel hybrid FS algorithm, termed GWOWOA, which combines the Grey Wolf Optimization Algorithm (GWO) with the Whale Optimization Algorithm (WOA). The study comprehensively investigates three FS algorithms—GWO, WOA, and GWOWOA—for glaucoma classification using fundus retinal images from the ORIGA benchmark dataset, on which multiple experiments were conducted. The objective is to enhance glaucoma classification accuracy, robustness, and scalability on a fundus image dataset. Initially, 65 features belonging to five vital categories were extracted using self-created, handcrafted coding scripts. Additionally, the study evaluates six classifiers: support vector machine, random forest, logistic regression, XGBoost, CatBoost, and an ensemble of all five. Evaluation metrics include accuracy, precision, sensitivity, specificity, F1-score, Kappa score, MCC, and ROC analysis. Our approach is compared against recent state-of-the-art FS approaches, showing improvements in the feature selection process, classification accuracy, and other performance measures. The suggested approach offers an exploration-exploitation balance that enhances the algorithms' ability to identify relevant features while optimizing computational efficiency. By leveraging optimization capabilities alongside feature importance estimation, the hybrid approach can yield robust and generalizable FS results. The objective of combining the algorithms for FS in glaucoma classification on a fundus image dataset is to identify the most relevant features associated with this disease. These algorithms aim to improve the accuracy of glaucoma classification by selecting a subset of informative features while reducing dimensionality and removing irrelevant or redundant features from the dataset. The goal is to enhance the performance of ML classifiers by focusing on the most discriminative features for accurate glaucoma diagnosis and prognosis. The novelty of this paper stems from the introduction of a three-phase hybrid classification model combining feature extraction, feature selection, and finally two-class classification using machine learning classifiers trained on the selected features. This hybrid approach offers a unique solution to the challenge of FS in glaucoma classification using a fundus image dataset; it addresses the limitations of existing methods and aims to improve the accuracy and efficiency of glaucoma classification models. The GWOWOA algorithm introduces a novel approach for determining feature importance in glaucoma classification; its use marks a departure from traditional methods and promises improved performance in identifying relevant features from the ORIGA dataset. These three algorithms, while not previously employed for glaucoma detection using fundus image datasets, present a promising avenue for optimizing FS, offering potential improvements in the efficiency and accuracy of the FS process. Additionally, the incorporation of these algorithms underscores the importance of exploring innovative techniques to enhance glaucoma classification models and advance research in the field. We have implemented 5-fold and 10-fold cross-validation in this work.
Convergence curves and the effect of population size are among the aspects of the in-depth analysis performed in this work, which are rarely examined in recent state-of-the-art articles. The novelty of this research stems from several key contributions aimed at advancing glaucoma classification from the fundus image dataset. The results generated by our approach are highly promising; our proposed clinical decision support system requires little human intervention, is robust and highly reliable, and responds quickly.

In a nutshell, the highlights of this work are as follows:

  • After a thorough analysis of the current literature, it is evident that there is vast potential for improving feature quality and reducing feature count in the field of glaucoma detection. This study presents a new feature selection technique, the hybrid GWOWOA, which combines the strengths of the GWO and the WOA. This technique can remove irrelevant features from the feature space, thereby improving classification accuracy and reducing computing costs. The proposed algorithm is a significant innovation and the authors' scientific contribution. To the best of our knowledge, we are the first to use these algorithms to detect glaucoma, addressing a major research gap in the field.

  • This work applies three feature selection algorithms to select the most influential features, accompanied by in-depth experimentation and analysis. We have conducted a comprehensive set of experiments using the ORIGA benchmark dataset, covering various tests, and provide a comprehensive analysis of various characteristics to demonstrate the effectiveness of the proposed methodology. By including the analysis of confusion matrices and ROC curves, we have thoroughly evaluated the performance of six machine learning classifiers. The assessment involved calculating eight efficiency metrics, which are important for judging the classifiers' performance. This study also reports the time required to develop the soft-computing approach and to train and test the machine learning models under 5- and 10-fold cross-validation.

  • Thorough experimentation and analysis accompany the selection process in this work, showcasing a highly professional approach. This study aims to show researchers the most informative features required for screening this disease, along with a highly reliable and effective clinical decision support system for professionals specializing in ophthalmology. Furthermore, our main objective is to offer a software-driven solution that can help address the decline in human visual acuity by enabling quick, effective, and accurate identification of ocular infections. The tool can be customized to connect with mobile and wearable medical devices, making it suitable for use in environments where skilled medical professionals may be in short supply.

The rest of the paper is organized as follows: Section 2 is dedicated to the proposed approach, describing the dataset and methodology along with the algorithms applied; Section 3 presents the computed results in tabular and graphical form in detail, together with a discussion of the results and a comparison of the suggested approach with recent state-of-the-art published studies. Finally, Section 4 concludes the work.

2 Proposed approach

2.1 Dataset and the details about features extracted

The dataset used in this empirical study is the ORIGA_ALL_FEATURES dataset. Table 2 below provides information about the dataset.

Table 2 Description of Dataset

The dataset, created by extracting 65 features (Table 2) from ORIGA images [103], contains 646 instances, i.e., 646 rows of data. The dataset contains 66 columns, of which 65 are features and the last is the classification column. The last column, headed "glaucoma", contains discrete data: the value is either 0 or 1, where 0 indicates that the person does not have glaucoma and 1 indicates that the person has glaucoma. Of the 646 instances, 166 are labelled 1 while the other 480 are labelled 0. Table 3 presents the list and Table 4 gives a short description of the extracted features.
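A minimal loading sketch is shown below; the CSV file name is an assumption, while the "glaucoma" label column and the row/column counts follow the description above.

```python
import pandas as pd

# Assumed file name; the dataset described above has 646 rows and 66 columns.
df = pd.read_csv("ORIGA_ALL_FEATURES.csv")

X = df.drop(columns=["glaucoma"]).values    # 65 feature columns
y = df["glaucoma"].values                   # 0 = healthy, 1 = glaucoma

print(X.shape)                       # expected (646, 65)
print(pd.Series(y).value_counts())   # expected roughly {0: 480, 1: 166}
```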

Table 3 Description of features extracted from dataset images
Table 4 List and short definition of the extracted features

2.1.1 Structural features (cup diameter, disk diameter, cup to disk ratio, RIM, DDLS)

Prior to determining the cup-to-disc ratio, it is necessary to segment the optic disc and optic cup. In this procedure, the region of interest (ROI) is extracted from the input fundus image and further split into its Red, Green, and Blue channels. The Red component is suitable for optic disc segmentation due to the abundance of disc-area information in the red channel, whereas the Green component is appropriate for optic cup segmentation because the relevant optic cup region is present in the green channel. The entire procedure is illustrated in Fig. 2, which is self-explanatory.

Fig. 2
figure 2

Sample extraction of the optic cup and optic disc from a retinal fundus image
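The ROI-and-channel-splitting step in Fig. 2 can be sketched as below, assuming OpenCV; the brightest-region heuristic for locating the ROI and the crop size are illustrative assumptions, not the authors' exact segmentation pipeline.

```python
import cv2

def split_roi_channels(fundus_path, roi_half=150):
    """Crop an ROI around the (assumed) brightest region and split the channels."""
    img = cv2.imread(fundus_path)                     # BGR image
    gray = cv2.cvtColor(img, cv2.COLOR_BGR2GRAY)
    blurred = cv2.GaussianBlur(gray, (51, 51), 0)     # suppress vessel/noise peaks
    _, _, _, (cx, cy) = cv2.minMaxLoc(blurred)        # brightest point ~ optic disc
    y0, y1 = max(cy - roi_half, 0), cy + roi_half
    x0, x1 = max(cx - roi_half, 0), cx + roi_half
    roi = img[y0:y1, x0:x1]
    blue, green, red = cv2.split(roi)
    # Red channel feeds the disc segmentation, green channel the cup segmentation.
    return red, green
```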

2.1.2 GLCM features

The gray-level co-occurrence matrix (GLCM), also known as the gray-level spatial dependency matrix, is a mathematical method for analyzing texture that captures the spatial association of pixels. The GLCM functions describe an image's texture by measuring how frequently pairs of pixels with particular values occur in a given spatial relationship, generating a GLCM, and extracting statistical measures from this matrix. The matrix, of size \(M_k \times M_k\), describes the second-order joint probability of a mask-bound image field, denoted \(x(l, n \mid \lambda, \theta)\).

The \((l, n)\)th element of the matrix represents the number of times grey levels \(l\) and \(n\) occur together in two pixels of the image separated by distance \(\lambda\) along angle \(\theta\); here \(x(l, n)\) is the co-occurrence matrix for \(\lambda = 1\) and angle \(\theta\). Let \(\varepsilon\) be a small positive number (used to avoid logarithms of zero in the entropy-type measures). \({x}_p(l)=\sum \limits_{n=1}^{M_k}x\left(l,n\right)\) is the marginal row probability and \({x}_q(n)=\sum \limits_{l=1}^{M_k}x\left(l,n\right)\) is the marginal column probability. \(\eta_p\) and \(\eta_q\) are the mean grey-level intensities of \(x_p\) and \(x_q\), and \(\sigma_p\) and \(\sigma_q\) are their standard deviations. The GLCM value is computed for each direction and the mean of these values is returned: \(\theta\) takes the value 0° horizontally, 45° diagonally, and 90° vertically. Finally, the resulting matrix is summarized with the functions described below. Table 5 presents the shortlisted GLCM features.

Table 5 Shortlisted and extracted GLCM features
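As a hedged sketch, the GLCM statistics listed in Table 5 can be reproduced with scikit-image (version 0.19 or later); the quantization to 8 grey levels and the three angles follow the description above, while the exact settings of the authors' handcrafted scripts remain an assumption.

```python
import numpy as np
from skimage.feature import graycomatrix, graycoprops

def glcm_features(gray_roi, levels=8):
    """Compute standard GLCM statistics averaged over the 0, 45 and 90 degree directions."""
    # Quantize the 8-bit image to `levels` grey levels, as required by graycomatrix.
    q = (gray_roi.astype(np.uint16) * levels // 256).astype(np.uint8)
    glcm = graycomatrix(q, distances=[1],
                        angles=[0, np.pi / 4, np.pi / 2],
                        levels=levels, symmetric=True, normed=True)
    props = ["contrast", "dissimilarity", "homogeneity", "energy", "correlation"]
    # graycoprops returns one value per (distance, angle) pair; average over angles.
    return {p: graycoprops(glcm, p).mean() for p in props}
```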

2.1.3 Gray level run length matrix (GLRM)

The GLRM is a matrix that provides details about textural features for analyzing an object's texture. A run is a set of consecutive pixels in a specific direction that share the same intensity value; the number of such pixels is the grey-level run length, and the number of times a run occurs is the run-length value. In the GLRM, the \((l, n)\)th element \(x(l, n \mid \theta)\) gives the number of runs with gray level \(l\) and length \(n\) in the image along direction \(\theta\). Let \(M_k\) be the number of discrete intensity values in the image, \(M_r\) the number of discrete run lengths, \(M_p\) the number of voxels in the image, and \(M_r(\theta)\) the number of runs along angle \(\theta\). Table 6 presents the shortlisted GLRM features.

Table 6 Shortlisted and extracted GLRM features
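As a rough illustration of the run-length idea, the horizontal (θ = 0°) run-length matrix and the short run emphasis (cf. Eq. (11) later in this section) can be computed as follows; the quantization to 8 grey levels is an assumption.

```python
import numpy as np

def glrlm_horizontal(gray_roi, levels=8):
    """Grey-level run-length matrix for runs along theta = 0 degrees."""
    q = (gray_roi.astype(np.uint16) * levels // 256).astype(np.uint8)
    max_run = q.shape[1]
    rlm = np.zeros((levels, max_run), dtype=np.int64)
    for row in q:
        run_val, run_len = row[0], 1
        for v in row[1:]:
            if v == run_val:
                run_len += 1
            else:
                rlm[run_val, run_len - 1] += 1
                run_val, run_len = v, 1
        rlm[run_val, run_len - 1] += 1          # close the last run in the row
    return rlm

def short_run_emphasis(rlm):
    """SRE: divide each run count by the squared run length, normalize by the run total."""
    m = np.arange(1, rlm.shape[1] + 1)          # run lengths 1..max_run
    return (rlm / m[np.newaxis, :] ** 2).sum() / rlm.sum()
```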

2.1.4 First order statistical (FoS) features

FoS features are often used to evaluate an image's texture by computing an image histogram that shows the likelihood of a pixel value appearing in the image. These characteristics rely only on individual pixel values and not on their relationship with neighbouring pixels. The first-order histogram estimate is given by \(p(c)=\frac{N(c)}{K}\), where \(c\) represents the grey level in the image, \(K\) is the total number of pixels in the neighbourhood window centred on the pixel of interest, and \(N(c)\) is the number of pixels of grey value \(c\) in the window, with \(0 \le c \le M - 1\).

The most widely used FoS characteristics are the mean, standard deviation, variance, kurtosis, skewness and entropy. Table 7 presents the shortlisted FoS features.

Table 7 Shortlisted and extracted FoS features
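A hedged sketch of the six first-order statistics named above, computed directly from pixel intensities with NumPy/SciPy, is shown below; the histogram-based entropy with 256 bins is an assumption.

```python
import numpy as np
from scipy.stats import skew, kurtosis, entropy

def first_order_features(gray_roi, bins=256):
    """Mean, standard deviation, variance, kurtosis, skewness and entropy."""
    pixels = gray_roi.ravel().astype(float)
    hist, _ = np.histogram(pixels, bins=bins, range=(0, 256))
    p = hist / hist.sum()                        # first-order histogram estimate p(c)
    return {
        "mean": pixels.mean(),
        "std": pixels.std(),
        "variance": pixels.var(),
        "kurtosis": kurtosis(pixels),
        "skewness": skew(pixels),
        "entropy": entropy(p, base=2),           # Shannon entropy of the histogram
    }
```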

2.1.5 Discrete wavelet transforms (DWT) features

The wavelet transform decomposes data, operators or functions into different frequency components and analyzes each component at a resolution matched to its scale. Here, the wavelet transform splits the picture into four distinct elements, i.e. approximation, horizontal, diagonal and vertical. The approximation variable is used for scaling and the other three for localization. For images, the separable wavelet transform with a one-dimensional filter bank is applied to the rows and columns of each channel. For \(p\) rows and \(q\) columns, the scaling function is \({\Im}_{m,p,q}\left(r,s\right)\) and the translation (wavelet) functions are \({\wp}_{m,p,q}^l\left(r,s\right)\), as given in Equations (1) and (2). The horizontal, vertical and diagonal high-pass channels are \(\zeta^H(r, s)\), \(\zeta^V(r, s)\) and \(\zeta^D(r, s)\), and all of these channels are derived from the scaling function \({\Im}_{m,p,q}\left(r,s\right)\).

$${\Im}_{m,p,q}\left(r,s\right)={2}^{m/2}\Im \left({2}^mr-p,{2}^ms-q\right)$$
(1)
$${\wp}_{m,p,q}^l\left(r,s\right)={2}^{m/2}\wp \left({2}^mr-p,{2}^ms-q\right),\quad l=\left\{H,V,D\right\}$$
(2)

Here the three main components of wavelets are diagonal (D), horizontal (H) and vertical (V).

For an image f(r, s) of size p × q, the discrete wavelet transform is as follows:

$${W}_{\Im}\left({j}_0,p,q\right)=\frac{1}{\sqrt{pq}}\sum \limits_{r=0}^{p-1}\sum \limits_{s=0}^{q-1}f\left(r,s\right){\Im}_{j_0,p,q}\left(r,s\right)$$
(3)
$${W}_{\wp}^l\left(m,p,q\right)=\frac{1}{\sqrt{pq}}\sum \limits_{r=0}^{p-1}\sum \limits_{s=0}^{q-1}f\left(r,s\right){\wp}_{m,p,q}^l\left(r,s\right),\quad l=\left\{H,V,D\right\}$$
(4)

\(j_0\) is an arbitrary starting scale, and the \({W}_{\Im}\left({j}_0,p,q\right)\) coefficients define an approximation of \(f(r,s)\) at scale \(j_0\). The \({W}_{\wp}^l\left(m,p,q\right)\) coefficients add horizontal, vertical and diagonal details for scales \(m \ge j_0\). We normally let \(j_0 = 0\) and select \(p = q = 2^m\) so that \(m = 0, 1, 2, \ldots, m-1\).
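A minimal sketch of a single-level 2-D DWT decomposition using PyWavelets is shown below; the Haar wavelet and the per-sub-band summary statistics are assumptions for illustration, not necessarily the authors' exact choices.

```python
import numpy as np
import pywt

def dwt_features(gray_roi, wavelet="haar"):
    """Single-level 2-D DWT: approximation plus horizontal/vertical/diagonal detail sub-bands."""
    cA, (cH, cV, cD) = pywt.dwt2(gray_roi.astype(float), wavelet)
    feats = {}
    # Summarize each sub-band by its mean absolute coefficient and its energy.
    for name, band in {"A": cA, "H": cH, "V": cV, "D": cD}.items():
        feats[f"dwt_{name}_mean"] = np.abs(band).mean()
        feats[f"dwt_{name}_energy"] = np.square(band).sum()
    return feats
```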

A short mathematical representation of the extracted features is given below.

  1.

    CDR (Cup Disc Ratio) – when the CDR is small, the eye is considered normal. The formula to calculate CDR is:

$$Cup\_ Disc\_ Ratio=\frac{Dia\_ of\_ Cup}{Dia\_ of\_ Disc}$$
(5)
  2.

    GLCM (Grey Level Co-occurrence Matrix) – derived from the grey-level co-occurrence matrix S(o, t):

$${S}_m(o)=\sum \limits_{t=1}^{P_k}S\left(o,t\right)$$
(6)
$${S}_n(t)=\sum \limits_{o=1}^{P_k}S\left(o,t\right)$$
(7)
$${S}_{m+n}(k)=\sum \limits_{o=1}^{P_k}\sum \limits_{t=1}^{P_k}S\left(o,t\right),\quad o+t=k$$
(8)
$${A}_{mn1}=-\sum \limits_o\sum \limits_tS\left(o,t\right)\log \left\{{S}_m(o){S}_n(t)\right\}$$
(9)
$${A}_{mn2}=-\sum \limits_o\sum \limits_t{S}_m(o){S}_n(t)\log \left\{{S}_m(o){S}_n(t)\right\}$$
(10)
  3.

    SRE (Short Run Emphasis) –

$$SRE=\frac{\sum \limits_{l=1}^{s_g}\sum \limits_{m=1}^{p_r}\frac{p\left(l,m|\theta \right)}{m^2}}{p_r\left(\theta \right)}$$
(11)

Here the (l, m)th element p(l, m | θ) gives the number of runs with grey level l and length m in the image.

LRE (Long Run Emphasis) –
$$LRE=\frac{\sum \limits_{l=1}^{s_g}\sum \limits_{m=1}^{p_r}p\left(l,m|\theta \right)\cdot {m}^2}{p_r\left(\theta \right)}$$
(12)
  4.

    GLU (Grey Level Uniformity) –

$$g\left(i,j\right)=255-\frac{g\left(i,j\right)-{g}_{\textrm{min}}}{g_{\textrm{max}}-{g}_{\textrm{min}}}$$
(13)
    where \(g_{\max }\) and \(g_{\min }\) correspond to the maximum and minimum gray levels respectively, and the whole range of gray levels is 0–255.

  5.

    DDLS (Disc Damage Likelihood Scale)-

$$Disc\_ Dam\_ Like\_ Scale=\frac{Rim\_ of\_ Width}{Diameter\_ of\_ Disc}$$
(14)
  6.

    Bicoherence-

$$c\left({w}_1,{w}_2\right)=\frac{E\left[M\left({w}_1\right)M\left({w}_2\right){M}^{\ast}\left({w}_1+{w}_2\right)\right]}{\sqrt{E\left[{\left|M\left({w}_1\right)M\left({w}_2\right)\right|}^2\right]\,E\left[{\left|M\left({w}_1+{w}_2\right)\right|}^2\right]}}=\left|c\left({w}_1,{w}_2\right)\right|{e}^{j\phi \left(c\left({w}_1,{w}_2\right)\right)}$$
(15)

\(\left|c\left({w}_1,{w}_2\right)\right|\) is a magnitude feature and \({e}^{j\phi \left(c\left({w}_1,{w}_2\right)\right)}\) is a phase feature.

  7.

    Energy-

$$energy=\sum \limits_{i=1}^{N_k}\sum \limits_{j=1}^{N_k}p{\left(i,j\right)}^2$$
(16)
  8.

    Homogeneity –

$$homogeneity=\sum \limits_{i=1}^{N_k}\sum \limits_{j=1}^{N_k}\frac{p\left(i,j\right)}{1+{\left|i-j\right|}^2}$$
(17)
  9.

    Correlation-

$$correlation=\sum \limits_{i=0}^{M-1}\sum \limits_{j=0}^{M-1}\frac{\left\{i\times j\right\}\times p\left(i,j\right)-\left\{{\mu}_x\times {\mu}_y\right\}}{\sigma_x\times {\sigma}_y}$$
(18)
  10.

    Contrast-

$$contr=\sum_{i=1}^{N_g}\sum_{j=1}^{N_g}{\left(i-j\right)}^2p\left(i,j\right)$$
(19)
  11.

    Dissimilarity (dissi)-

$$dissi=\sum_{i=1}^{N_g}\sum_{j=1}^{N_g}\mid i-j\mid p\left(i,j\right)$$
(20)
  12.

    Entropy-

$$ENTROPY=-\sum_{i=0}^{G-1}\sum_{j=0}^{G-1}P\left(i,j\right)\times \log \left(P\left(i,j\right)\right)$$
(21)

2.2 Methodology implemented in this work

Using the ORIGA_ALL_FEATURES dataset, the suggested approach determines whether an individual has glaucoma or is healthy. This dataset includes information from 646 cases extracted from snapshots of both glaucomatous and normal eyes, with 65 characteristics per case. The dataset has to be normalized, since its characteristics lie on very different scales: some have values less than 0 while others exceed 50,000. The StandardScaler pre-processing module from scikit-learn was used to normalize the dataset. Normalization is a requirement that machine learning estimators must often meet; without it, the estimators may perform poorly. After normalizing the dataset, we provide the specified optimizers with the dataset's dimensions, the population size, and the number of iterations. After carrying out their tasks, these optimizers return the fitness value at each iteration along with the list of features they have chosen from among all the features in the dataset. Machine learning analysis is then conducted on the chosen features using a variety of estimators, which learn from the reduced dataset and make predictions that are measured with a range of performance criteria. The outcomes are documented for further analysis.
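A minimal sketch of this normalize-select-split pipeline is given below, assuming scikit-learn; `select_features` is a placeholder standing in for GWO, WOA or the GWOWOA hybrid and is assumed to return the indices of the chosen feature columns.

```python
import numpy as np
from sklearn.preprocessing import StandardScaler
from sklearn.model_selection import train_test_split

def prepare_data(X, y, select_features, test_size=0.2, seed=42):
    """Normalize the features, run a feature-selection optimizer, and split the data.

    `select_features(X_norm, y)` is a hypothetical callable representing any of
    the three optimizers used in this work.
    """
    X_norm = StandardScaler().fit_transform(X)       # zero mean, unit variance per column
    selected = np.asarray(select_features(X_norm, y))
    return train_test_split(X_norm[:, selected], y, test_size=test_size,
                            stratify=y, random_state=seed)
```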

2.2.1 Objective function

The objective function used for the Grey Wolf Optimizer is the Rastrigin function.

Mathematical definition of the Rastrigin function:

$$f(x)={\sum}_{i=1}^n\left[{x}_i^2-10\cos \left(2\pi {x}_i\right)+10\right]$$
(22)

The Rastrigin function is mainly used as a performance test problem for optimization algorithms. It is a non-linear multimodal function, first proposed in 1974. Because the Rastrigin function has many local minima, optimization algorithms have to be run from multiple starting points; it has only one global minimum, and all other local minima have values greater than the global minimum.
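A direct implementation of Eq. (22) is given below for reference.

```python
import numpy as np

def rastrigin(x):
    """Rastrigin function: global minimum f(0, ..., 0) = 0."""
    x = np.asarray(x, dtype=float)
    return np.sum(x ** 2 - 10.0 * np.cos(2.0 * np.pi * x) + 10.0)

# Example: rastrigin(np.zeros(5)) -> 0.0; any other point yields a larger value.
```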

2.3 Recent advances in optimization

In the research of [93], a binary version of the Coyote Optimisation Algorithm (COA), called Binary COA (BCOA), was suggested. It was used to choose the best feature subset for classification by utilising a hyperbolic transfer function within a wrapper model, so that a classification algorithm's performance evaluation determined which features to include. The Lévy Arithmetic Algorithm is an improved metaheuristic optimisation method that uses the Lévy random step [94]. The Arithmetic Optimisation Algorithm solves many optimisation problems using arithmetic operators, but its linear search capability may prevent it from reaching ideal solutions, causing stagnation. That paper therefore introduces the Lévy Arithmetic Algorithm by merging the Arithmetic Optimisation Algorithm with the Lévy random step to improve search capability and reduce processing demands; it was evaluated on ten CEC2019 benchmark functions, four real-world engineering problems, and the Economic Load Dispatch of microgrids with renewable energy integration. The next study introduces an optimisation method for plate-fin heat exchangers (PFHEs) that aims to minimise the number of entropy generation units, using Adaptive Differential Evolution with Optional External Archive (JADE) and a modified version called Tsallis JADE (TJADE) [95]. Plate-fin heat exchangers possess significant attributes, including elevated thermal conductivity and efficiency and a substantial heat transfer surface area relative to their volume; these characteristics reduce space, energy, weight, and cost requirements. Additionally, the design of plate-fin heat exchangers can consider various geometric and operational parameters.

GPOA, which integrates GOA and POA, is used to segment lesions and to train a U-Net to identify different types of lesions [96]. The image- and vector-based features of fundus images are extracted after the images are flipped, rotated, sheared, cropped, and translated, and a Deep Q Network (DQN) trained with the Exponential Gannet Pelican Optimisation Algorithm detects diabetic retinopathy. The EGFOA is formed from the Exponentially Weighted Moving Average (EWMA), the Gannet Optimisation Algorithm (GOA), and the Firefly Optimisation Algorithm (FFA). Another study presents a new and enhanced firefly algorithm (IFA) that utilises a Gaussian distribution function for optimal chiller loading design [97]; it demonstrates the effectiveness of the proposed method through two case studies comparing the IFA-based model with the traditional firefly algorithm and other optimisation methods found in the literature. That study focuses on minimising energy consumption in multi-chiller systems: the objective function is energy consumption, and the optimised parameter is the partial loading ratio of each chiller. Chiller loading optimisation strategies have been presented recently [98]; in general, the optimisation problem minimises chiller energy use while meeting cooling demand. Continuous-parameter optimisation problems can be solved efficiently with the cuckoo search algorithm (CSA), which relies on the obligate brood-parasitic behaviour of cuckoo species and the Lévy flights of birds and fruit flies; early studies suggest it can outperform existing algorithms. This research presents a CSA technique with a differential operator (DCSA) for optimal chiller loading design [98]. The next study proposes a modified grasshopper optimisation technique for non-linear wireless channel equalisation [99]. Although the grasshopper optimisation algorithm (GOA) is efficient, it gets caught in local optima after some iterations due to loss of swarm diversity, and it includes no provision to preserve the elite grasshoppers found at each index, which weakens its exploitation and convergence rate. That research modifies GOA in three ways to overcome these limitations: inefficient search regions are detected through a threshold setting, Lévy Flight is combined with the basic GOA to increase swarm diversity, and a greedy selection operator preserves exceptional grasshoppers at every index. The modified grasshopper optimisation algorithm (MGOA) outperforms other metaheuristic methods. Another study introduces a cheetah-based optimisation algorithm (CBA) that incorporates the social behaviour of these animals [100]; the proposed CBA is tested against seven well-known optimizers on three diverse benchmark problems, and the study also provides insights into research issues and directions in CBA design. In [101], scholars proposed a modified social spider optimisation (MSSO) based on a beta distribution and natural-gradient local search to improve SSO performance; it is tested on Loney's solenoid benchmark and a brushless direct-current motor benchmark to compare the original SSO with the proposed MSSO. In the next study, the authors presented a meerkat optimisation algorithm (MOA) by mimicking the animals' natural behaviour [102]. MOA was inspired by meerkat survival techniques, whose sentinel system allows them to switch behaviour patterns; some mathematical aspects of the algorithm were proven, and classical optimisation test functions verify MOA's advantages. MOA also solves real-world constrained engineering problems, proving its efficacy and excellence.

2.4 Rationale behind selecting GWO and WOA

The Grey Wolf Optimization (GWO) algorithm is an optimization algorithm that draws inspiration from the social behaviour of grey wolves. It is a population-based approach that aims to find optimal solutions. A wide range of optimization problems, including feature selection in machine learning tasks, have utilized it. Researchers widely regard this method as the preferred choice for selecting features from a large set of retinal fundus images for glaucoma diagnosis. Its key properties include exploration and exploitation, efficiency, a population-based approach, adaptability, robustness to noise, and parallelizability. In summary, the GWO algorithm shows great potential for selecting features from retinal fundus images for glaucoma diagnosis. Its ability to balance exploration and exploitation, along with its efficiency, adaptability, robustness, and parallelizability, make it a highly professional choice. These dynamic characteristics (Leader-Follower Hierarchy, Exploration and Exploitation, Dynamic Search Intensity, Encouragement of variation and Local and Global Search) of the GWO allow for effective feature selection from a sizable set of original features in the context of diagnosing human diseases. In order to efficiently uncover pertinent features for illness detection while managing the complexity of high-dimensional feature spaces, GWO combines local and global search algorithms, encourages diversity, and strikes a balance between exploration and exploitation.

The Whale Optimization Algorithm (WOA) is an optimization algorithm that draws inspiration from the social behaviour of humpback whales. It has been utilized for a wide range of optimization problems, including feature selection. Practitioners widely recognize this method as the best option for selecting features from a large set of retinal fundus images, which is crucial for diagnosing human diseases like glaucoma. Its notable features include thorough exploration and exploitation, a population-based approach, a dynamic search strategy, high efficiency, and the ability to perform both global and local searches. In general, the WOA shows great potential for feature selection from a large feature set extracted from retinal fundus images for human disease, such as glaucoma diagnosis. Its exploration-exploitation balance, population-based approach, dynamic search strategy, efficiency, global and local search capabilities, and robustness to noise all contribute to its promise. The WOA’s various dynamic properties, such as its ability to balance exploration and exploitation, update encircling prey dynamically, establish a leader-follower hierarchy, adapt the search space dynamically, and maintain diversity, all contribute to its effectiveness in selecting features for human disease diagnosis from large sets of original features. WOA possesses several characteristics that allow it to efficiently traverse the complex feature space, detect significant features, and enable precise diagnosis of human diseases.

2.5 GWO-WOA hybrid

The hybrid algorithm comprising the two implemented soft-computing algorithms (GWO and WOA) is proposed with the intention that the properties of the two algorithms, working together, may enhance overall performance beyond either individual case [81, 82, 83, 84]. GWOWOA is expected to improve the WOA's performance. The method applies the GWO leadership hierarchy to the WOA's bubble-net attacking strategy. The proposed algorithm selects the three best candidate solutions from all search agents—alpha (α) at the first level, and beta (β) and delta (δ) at the second and third levels—and the other search agents adjust their positions according to the positions of these best agents to improve the performance of the WOA. Overall, for the purpose of screening human diseases, the hybrid algorithm's combination of WOA's and GWO's dynamic qualities allows effective feature selection from sizeable original feature sets. The hybrid algorithm dynamically adjusts search intensity, incorporates efficient convergence mechanisms, and uses a variety of exploration and exploitation strategies to navigate the high-dimensional feature space efficiently and identify relevant features with high accuracy and efficiency.

Table 8 illustrates the parameters and their corresponding values used in this hybridized algorithm. As previously indicated, humpback whales use two ways to swim around their prey. The proposed mathematical model for updating whale positions during optimization, using GWO's leadership hierarchy, is given below through various equations. In the bubble-net attack strategy, the hierarchy is used to update the positions of the whales.

Table 8 Parameter Settings of hybrid algorithm

GWO Equations:

$$M=\mid N\cdot \eta p(m)-\eta (m)\mid$$
(23)
$$\eta \left(m+1\right)=\eta p(m)-\kappa \cdot M$$
(24)
$$\kappa =2\cdot \psi (m)\cdot \mathit{\operatorname{ran}}1-\psi (m)$$
(25)
$$N=2\cdot \mathit{\operatorname{ran}}2$$
(26)
$${\displaystyle \begin{array}{c} M\alpha =\mid N1\cdot \eta \alpha -\eta (m)\mid \\ {} M\beta =\mid N2\cdot \eta \beta -\eta (m)\mid \\ {} M\delta =\mid N3\cdot \eta \delta -\eta (m)\mid \end{array}}$$
(27)
$${\displaystyle \begin{array}{c}\eta i1(m)=\eta \alpha (m)-\kappa i1\cdot M\alpha (m)\\ {}\eta i2(m)=\eta \beta (m)-\kappa i2\cdot M\beta (m)\\ {}\eta i3(m)=\eta \delta (m)-\kappa i3\cdot M\delta (m)\end{array}}$$
(28)
$$\eta \left(m+1\right)=\frac{\eta i1(m)+\eta i2(m)+\eta i3(m)}{3}$$
(29)

Whale Equations

$$\eta \left(m+1\right)={M}^{\prime}\cdot {e}^{bl}\cdot \mathit{\cos}\left(2\pi l\right)+\eta \ast (m)$$
(30)
$${M}^{\prime }=\mid \eta \ast (m)-\eta (m)\mid$$
(31)
$${\displaystyle \begin{array}{c}\eta \left(m+1\right)=\eta \ast (m)-\kappa \cdot M,\kern0.5em \textrm{if}\ p<0.5\\ {}\eta \left(m+1\right)={M}^{\prime}\cdot {e}^{bl}\cdot \cos \left(2\pi l\right)+\eta \ast (m),\kern0.5em \textrm{if}\ p\ge 0.5\end{array}}$$
(32)
$$M=\mid N\cdot {\eta}_{rand}-\eta \mid$$
(33)
$$\eta \left(m+1\right)={\eta}_{rand}-\kappa \times M$$
(34)

In the GWO leadership formulation, humpback whales use a shrinking encircling mechanism and update their position using Eq. (29).

Mathematical model and algorithm for optimization

Spiral updating position: To update the position of humpback whales on a spiral path, use the formula below.

$${\displaystyle \begin{array}{c}{M}_{\alpha}^{\prime }=\left|{\eta}_{\alpha }(m)-\eta (m)\right|\\ {}{M}_{\beta}^{\prime }=\left|{\eta}_{\beta }(m)-\eta (m)\right|\\ {}{M}_{\delta}^{\prime }=\mid {\eta}_{\delta }(m)-\eta (m)\mid \end{array}}$$
(35)
$${\displaystyle \begin{array}{c}{\eta}_1(m)={\eta}_{\alpha }(m)+{M}_{\alpha}^{\prime}\cdot {e}^{\chi \varphi}\cdot \mathit{\cos}\left(2\pi \varphi \right)\\ {}{\eta}_2(m)={\eta}_{\beta }(m)+{M}_{\beta}^{\prime}\cdot {e}^{\chi \varphi}\mathit{\cos}\left(2\pi \varphi \right)\\ {}{\eta}_3(m)={\eta}_{\delta }(m)+{M}_{\delta}^{\prime}\cdot {e}^{\chi \varphi}\cdot \mathit{\cos}\left(2\pi \varphi \right)\end{array}}$$
(36)

And now

$$\eta \left(m+1\right)=\frac{\eta 1+\eta 2+\eta 3}{3}$$
(37)

2.5.1 Pseudo code

The pseudo code for the proposed GWOWOA hybrid algorithm can be built in the following steps:

Step 1: Create the search agent’s initial population.

Step 2: Determine the value of the objective function for each search agent.

Step 3: ηα= The most suitable candidate.

Step 4: ηβ= The second possible solution.

Step 5: ηδ= The Third candidate solution.

Step 6: While (m < Max number of iterations).

Step 7: for i = 1 to number of each search agent.

Step 8: Update the control parameters (κ, N, ψ, l and q).

Step 9: if1 (q < 0.5).

Step 10: if2 (∣κ ∣  < 1).

Step 11: Update the search agent’s position by (24).

Step 12: else If2 (∣κ ∣  > 1).

Step 13: Pick a search agent at random (ηrand).

Step 14: Update the search agent’s position by (29).

Step 15: end if2.

Step 16: else if1 (q ≥ 0.5).

Step 17: Update the position of the search agent by (32).

Step 18: end if1.

Step 19: end for.

Step 20: Check to see whether any search agents have left the search space.

Step 21: Calculate the value of each search agent’s objective function.

Step 22: Update the positions of ηα, ηβ and ηδ.

Step 23: m = m + 1.

Step 24: end while

Step 25: return ηα.

Variables used in the algorithm

ηα: The best candidate solution.

ηβ: The second candidate solution.

ηδ: The Third candidate solution.

q: random number in [0,1].

φ: random number in [−1,1].

χ: constant defining the shape of the logarithmic spiral.

η: Humpback whale position vector.

m: iteration counter.
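The pseudo code above can be sketched in Python as follows. This is a hedged sketch of the continuous form of the hybrid update: the bounds, the linearly decreasing control parameter, the use of a single κ vector for all three leaders, and the objective are illustrative assumptions consistent with the equations and the parameter table rather than a definitive implementation.

```python
import numpy as np

def gwowoa(objective, dim, n_agents=10, max_iter=100, lb=-5.12, ub=5.12, chi=1.0):
    """Hedged sketch of the GWOWOA hybrid: GWO leaders guide the WOA updates."""
    rng = np.random.default_rng(0)
    pos = rng.uniform(lb, ub, size=(n_agents, dim))        # initial population
    fit = np.apply_along_axis(objective, 1, pos)

    for m in range(max_iter):
        order = np.argsort(fit)
        alpha = pos[order[0]].copy()                        # best three candidates
        beta, delta = pos[order[1]].copy(), pos[order[2]].copy()
        psi = 2.0 * (1 - m / max_iter)                      # decreasing control parameter

        for i in range(n_agents):
            q = rng.random()
            kappa = 2.0 * psi * rng.random(dim) - psi
            N = 2.0 * rng.random(dim)
            if q < 0.5:
                if np.all(np.abs(kappa) < 1):
                    # Encircling guided by the three leaders (cf. Eqs. 27-29).
                    cand = np.empty((3, dim))
                    for k, leader in enumerate((alpha, beta, delta)):
                        cand[k] = leader - kappa * np.abs(N * leader - pos[i])
                    pos[i] = cand.mean(axis=0)
                else:
                    # Exploration around a randomly chosen agent (cf. Eqs. 33-34).
                    r = pos[rng.integers(n_agents)]
                    pos[i] = r - kappa * np.abs(N * r - pos[i])
            else:
                # Spiral (bubble-net) update around the three leaders (cf. Eqs. 35-37).
                phi = rng.uniform(-1, 1, dim)
                cand = np.empty((3, dim))
                for k, leader in enumerate((alpha, beta, delta)):
                    Mp = np.abs(leader - pos[i])
                    cand[k] = leader + Mp * np.exp(chi * phi) * np.cos(2 * np.pi * phi)
                pos[i] = cand.mean(axis=0)

            pos[i] = np.clip(pos[i], lb, ub)                # keep agents in the search space
            fit[i] = objective(pos[i])

    best = int(np.argmin(fit))
    return pos[best], fit[best]

# Example: best_pos, best_val = gwowoa(rastrigin, dim=10)  # using the Rastrigin function above
```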

2.5.2 Flowchart

Figure 3 depicts the hybrid algorithm and Fig. 4 represents the diagrammatic representation of the whole suggested process for efficient glaucoma classification.

Fig. 3
figure 3

Flowchart of the GWOWOA hybrid algorithm

Fig. 4
figure 4

Diagrammatic representation of the proposed framework for efficient glaucoma classification

3 Implementation, results and comparison

3.1 Hardware and software

All experiments were performed on a machine with an Intel Core i5-8250U (8th Gen) CPU at 1.60–1.80 GHz, 8 GB RAM, and a 64-bit operating system. The language and platform used were Python (version 3.7) and Jupyter Notebook, with the NumPy (version 1.24.1), Pandas (version 2.0.3), Seaborn, Matplotlib, Time, Random, Sklearn, and Math libraries. In this work, normalization was performed (using the min-max approach) during the data-processing phase.

3.2 Performance measuring indicators and evaluation metrics

This subsection presents the results of the machine learning models trained on the features identified by the soft-computing feature selection methods; the objective of this training was to classify benchmark fundus images into two categories, healthy individuals and those with glaucoma. The confusion matrix, shown in Table 9 and Fig. 5, is a performance summary used in machine learning classification tasks. It displays the combinations of actual and predicted values, and for a two-class problem it is a two-by-two table; it is often used to demonstrate a specific classification model's performance. The confusion matrix comprises four values: true positive (TP), false positive (FP), true negative (TN), and false negative (FN), which together provide insight into the binary classification task of identifying positive cases of glaucoma. This empirical study reports a range of performance assessment metrics, including F1-score, specificity, accuracy, precision, sensitivity, Kappa, MCC, and AUC, all derived primarily from TP, TN, FP, and FN. A false positive (FP) occurs when the test indicates the presence of the condition in a person who does not actually have it; this false alarm is an undesirable outcome. Conversely, a true positive (TP) signifies that the test correctly identifies an individual who indeed has glaucoma, which is the desired result. A false negative (FN) represents a patient who actually has the illness but for whom the diagnostic test produces a negative result; since the patient in fact has glaucoma, this is a very serious error. Finally, a true negative (TN) refers to an individual who does not have the disease and correctly receives a negative test result. Figure 5 displays the confusion matrix for the proposed work, with "negative" (0) indicating normal and "positive" (1) indicating glaucoma.

Table 9 Confusion matrix
Fig. 5
figure 5

Representation of the Confusion Matrix used in proposed work

The main metrics for assessing diagnostic accuracy are sensitivity (recall) and specificity. Recall, the true positive rate, is obtained by dividing the number of true positives by the sum of true positives and false negatives; it assesses how accurately the machine learning model predicts the occurrence of glaucoma. Specificity is the proportion of correct rejections among all negative cases: it is calculated by dividing the number of true negatives by the sum of true negatives and false positives. In other words, specificity measures the ratio of correctly predicted negative cases, while sensitivity measures the ratio of correctly predicted positive cases; a highly specific model reliably produces negative reports for individuals without glaucoma. The F1-score combines precision and recall into a single number, their harmonic mean. Precision is calculated as the ratio of true positives to the sum of true positives and false positives and is the most crucial measure of a classifier's correctness on its positive predictions. Accuracy is the proportion of correctly classified images to the total number of images in the dataset [108]. Together, the sensitivity and specificity scores are reliable indicators of performance. The Matthews correlation coefficient (MCC) is a metric for evaluating binary classifications: it provides a fair assessment by considering TP, TN, FP, and FN, even when the classes have very different sizes. It plays a vital role in predicting human diseases using machine learning, offering a comprehensive evaluation measure that balances sensitivity and specificity; this facilitates model selection and optimisation while also ensuring clinical applicability. The kappa score, also known as Cohen's kappa coefficient, is another statistical measure for evaluating machine learning models, particularly in classification tasks with categorical outcomes. It quantifies the level of agreement between predicted and observed categories while accounting for the possibility of agreement by chance. The kappa score is therefore a valuable indicator for disease prediction with machine learning, enabling comparisons between models in terms of performance and clinical utility.

The ROC curve is a visual representation of how well a classification model performs as the classification threshold is varied. It plots the true positive rate (sensitivity) on the y-axis against the false positive rate (1 − specificity) on the x-axis at different thresholds, and ROC analysis is a reliable method for evaluating glaucoma classification. Lowering the threshold generally increases both the true positive rate and the false positive rate, so the curve summarises the trade-off between the two. The degree of separation between the classes is quantified by the area under the curve (AUC), a two-dimensional area bounded by the points (0,0) and (1,1); the larger the AUC, the better the model distinguishes between the groups [109]. An AUC of 1.0 signifies flawless separation, a value of 0.5 suggests that the model performs no better than random guessing, and values below 0.5 indicate predictions that are systematically inverted. The AUC is therefore an all-encompassing measure of performance across every possible threshold.
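As an illustration of how such a curve can be obtained in practice, the short sketch below uses scikit-learn's roc_curve and roc_auc_score on hypothetical labels and scores; none of the values correspond to the actual experiments.

```python
# Illustrative sketch (assumed names, not the authors' code): computing the ROC
# curve and AUC for a glaucoma classifier from predicted probabilities.
from sklearn.metrics import roc_curve, roc_auc_score

y_true  = [0, 0, 1, 1, 1, 0, 1, 0]                   # hypothetical 0/1 labels
y_score = [0.1, 0.6, 0.8, 0.9, 0.4, 0.2, 0.7, 0.3]   # hypothetical P(glaucoma)

fpr, tpr, thresholds = roc_curve(y_true, y_score)    # false/true positive rates
auc = roc_auc_score(y_true, y_score)                 # area under the ROC curve
print(f"AUC = {auc:.3f}")
```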

$$Sensitivity=\frac{\textrm{True}\ \textrm{Positives}}{\textrm{True}\ \textrm{Positives}+\textrm{False}\ \textrm{Negatives}}$$
(38)
$$Specificity=\frac{\textrm{True}\ \textrm{Negatives}}{\textrm{True}\ \textrm{Negatives}+\textrm{False}\ \textrm{Positives}}$$
(39)
$$Recall=\frac{\textrm{True}\ \textrm{Positives}}{\textrm{True}\ \textrm{Positives}+\textrm{False}\ \textrm{Negatives}}$$
(40)
$$Precision=\frac{\textrm{True}\ \textrm{Positives}}{\textrm{True}\ \textrm{Positives}+\textrm{False}\ \textrm{Positives}}$$
(41)
$$Accuracy=\frac{\textrm{True}\ \textrm{Positives}+\textrm{True}\ \textrm{Negatives}}{\textrm{True}\ \textrm{Positives}+\textrm{False}\ \textrm{Positives}+\textrm{False}\ \textrm{Negatives}+\textrm{True}\ \textrm{Negatives}}$$
(42)
$$F1\text{-}Score=2\ast \frac{\left(\textrm{Precision}\ast \textrm{Recall}\right)}{\left(\textrm{Precision}+\textrm{Recall}\right)}$$
(43)
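The sketch below implements Eqs. (38)–(43) directly from the confusion-matrix counts; the function name and the example counts are illustrative, and MCC and Cohen's kappa can equally be obtained from scikit-learn's matthews_corrcoef and cohen_kappa_score on the label vectors.

```python
# Illustrative implementation of Eqs. (38)-(43) from raw confusion-matrix counts.
def binary_metrics(tp: int, fp: int, tn: int, fn: int) -> dict:
    sensitivity = tp / (tp + fn)                     # Eq. (38); identical to recall, Eq. (40)
    specificity = tn / (tn + fp)                     # Eq. (39)
    precision   = tp / (tp + fp)                     # Eq. (41)
    accuracy    = (tp + tn) / (tp + fp + fn + tn)    # Eq. (42)
    f1 = 2 * (precision * sensitivity) / (precision + sensitivity)  # Eq. (43)
    return {"sensitivity": sensitivity, "specificity": specificity,
            "precision": precision, "accuracy": accuracy, "f1": f1}

# Example with hypothetical counts, not figures from the reported experiments:
print(binary_metrics(tp=45, fp=5, tn=40, fn=10))
```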

Parameter values for the implemented machine learning models

The following values were finalized for the parameters of the various ML classifiers shortlisted for this work; an illustrative instantiation sketch follows the listings.

Random Forest Classifier:

n_estimators = 100, max_depth = None, min_samples_split = 2, min_samples_leaf = 1, max_features = 'auto', max_leaf_nodes = None, max_samples = None.

Logistic Regression:

penalty = 'l2', C = 1.0, intercept_scaling = 1, class_weight = None, random_state = None, solver = 'lbfgs', max_iter = 100, multi_class = 'auto', verbose = 0, warm_start = False, n_jobs = None.

Decision Tree Classifier:

criterion = 'gini', splitter = 'best', max_depth = None, min_samples_leaf = 1, min_weight_fraction_leaf = 0.0, max_features = None, random_state = None, max_leaf_nodes = None, min_impurity_decrease = 0.0, class_weight = None.

KNN:

n_neighbors = 5, weights = 'uniform', algorithm = 'auto', leaf_size = 30, metric = 'minkowski', metric_params = None, n_jobs = None.

Support Vector Machine:

C = 1.0, kernel = 'rbf', degree = 3, gamma = 'scale', coef0 = 0.0, shrinking = True, probability = False, tol = 0.001, cache_size = 200, class_weight = None, verbose = False, max_iter = -1, decision_function_shape = 'ovr', break_ties = False, random_state = None.
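For completeness, the following sketch shows how the listed settings map onto scikit-learn constructors. It is an illustrative reconstruction rather than the authors' original script; parameters left at their library defaults are omitted, and note that in recent scikit-learn releases the Random Forest value max_features = 'auto' has been superseded by 'sqrt'.

```python
# Illustrative reconstruction of the classifier configurations listed above
# (not the authors' original script). Most values are the scikit-learn defaults.
from sklearn.ensemble import RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.tree import DecisionTreeClassifier
from sklearn.neighbors import KNeighborsClassifier
from sklearn.svm import SVC

models = {
    "random_forest": RandomForestClassifier(
        n_estimators=100, max_depth=None, min_samples_split=2,
        min_samples_leaf=1, max_features="sqrt",   # 'auto' in older releases
        max_leaf_nodes=None, max_samples=None),
    "logistic_regression": LogisticRegression(
        penalty="l2", C=1.0, intercept_scaling=1, class_weight=None,
        solver="lbfgs", max_iter=100),
    "decision_tree": DecisionTreeClassifier(
        criterion="gini", splitter="best", max_depth=None,
        min_samples_leaf=1, min_weight_fraction_leaf=0.0,
        max_features=None, min_impurity_decrease=0.0),
    "knn": KNeighborsClassifier(
        n_neighbors=5, weights="uniform", algorithm="auto",
        leaf_size=30, metric="minkowski"),
    "svm": SVC(
        C=1.0, kernel="rbf", degree=3, gamma="scale", coef0=0.0,
        shrinking=True, probability=False, tol=0.001, cache_size=200,
        max_iter=-1, decision_function_shape="ovr", break_ties=False),
}
# Each model can then be fitted on the selected features, e.g.:
# models["svm"].fit(X_train_selected, y_train)
```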

3.3 Results

Due to space constraints and the large number of individual results, we have compressed all of the GWO and WOA results into one comparative table (Table 10), which shows the overall best results of the entire experimental process. Readers interested in the detailed results may request them from the authors via email.

Table 10 Tabular comparison of the best results computed by GWO and WOA on different parameters

3.3.1 Results of hybrid GWOWOA algorithm

This subsection reports the results of the hybrid GWOWOA algorithm on the ORIGA_ALL_FEATURES dataset. The experiment was performed with population sizes of 5, 10, 15, and 20, and fitness values were recorded at 100, 200, 300, 400, and 500 iterations. Due to space constraints, convergence curves are shown for population sizes 10 and 20 only. Tables 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22 show the results generated from the features selected by the hybrid algorithm, and Table 23 compiles these tables to present the overall best results of the experimentation process. The confusion matrices and convergence curves related to these experiments are presented in Figs. 6, 7, 8, 9, 10, 11, 12, 13, 14, 15.
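As a rough, assumed sketch of the experimental grid described above (the function hybrid_gwowoa below is a placeholder stub, not the authors' implementation), the runs can be organised as follows.

```python
# Hedged sketch of the experimental grid: population sizes 5-20, iteration
# budgets 100-500, on a dataset with 65 candidate features. The optimizer is a
# placeholder stub; only the loop structure mirrors the setup described above.
import random

def hybrid_gwowoa(n_features: int, population: int, iterations: int):
    """Placeholder stand-in: returns a random fitness value and feature mask."""
    mask = [random.randint(0, 1) for _ in range(n_features)]
    return random.random(), mask

POPULATION_SIZES = [5, 10, 15, 20]
ITERATION_BUDGETS = [100, 200, 300, 400, 500]

results = {}
for pop_size in POPULATION_SIZES:
    for max_iter in ITERATION_BUDGETS:
        best_fitness, selected_mask = hybrid_gwowoa(
            n_features=65, population=pop_size, iterations=max_iter)
        results[(pop_size, max_iter)] = (best_fitness, sum(selected_mask))
```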

Table 11 The results for the hybrid algorithm when the experiment was performed on population size 5 for iterations (100, 200, 300, 400, and 500)
Table 12 Results along with the best confusion matrix and AUC curve observed with 5-fold cross validation and population size 5
Table 13 Results along with the best confusion matrix and AUC curve observed with 10-fold cross validation and population size 5
Table 14 The results for the hybrid algorithm when the experiment was performed on population size 10 for iterations (100, 200, 300, 400, and 500)
Table 15 Results along with the best confusion matrix and AUC curve observed with 5-fold cross validation and population size 10
Table 16 Results along with the best confusion matrix and AUC curve observed with 10-fold cross validation and population size 10
Table 17 The results for the hybrid algorithm when the experiment was performed on population size 15 for iterations (100, 200, 300, 400, and 500)
Table 18 Results along with the best confusion matrix and AUC curve observed with 5-fold cross validation and population size 15
Table 19 Results along with the best confusion matrix and AUC curve observed with 10-fold cross validation and population size 15
Table 20 The results for the hybrid algorithm when the experiment was performed on population size 20 for iterations (100, 200, 300, 400, and 500)
Table 21 Results along with the best confusion matrix and AUC curve observed with 5-fold cross validation and population size 20
Table 22 Results along with the best confusion matrix and AUC curve observed with 10-fold cross validation and population size 20
Table 23 Overall best results of the whole experiment process
Fig. 6

Confusion matrix of the best classifier and the combined ROC curves for the hybrid algorithm with population size 5 and 5-fold cross validation

Fig. 7

Confusion matrix of the best classifier and the combined ROC curves for the hybrid algorithm with population size 5 and 10-fold cross validation

Fig. 8

Convergence curves for the WOA algorithm with population size 15

Fig. 9

Confusion matrix of the best classifier and the combined ROC curves for the hybrid algorithm with population size 10 and 5-fold cross validation

Fig. 10

Confusion matrix of the best classifier and the combined ROC curves for the hybrid algorithm with population size 10 and 10-fold cross validation

Fig. 11

Confusion matrix of the best classifier and the combined ROC curves for the hybrid algorithm with population size 15 and 5-fold cross validation

Fig. 12

Confusion matrix of the best classifier and the combined ROC curves for the hybrid algorithm with population size 15 and 10-fold cross validation

Fig. 13

Convergence curves for the hybrid algorithm with population size 20

Fig. 14

Confusion matrix of the best classifier and the combined ROC curves for the hybrid algorithm with population size 20 and 5-fold cross validation

Fig. 15

Confusion matrix of the best classifier and the combined ROC curves for the hybrid algorithm with population size 20 and 10-fold cross validation

The column “features selected” shows the number of features selected from the total features in the dataset at each individual iteration, together with the fitness value calculated at that iteration.


Convergence evaluation for population 10

The convergence behaviour of the hybrid Grey Wolf Optimizer and Whale Optimization Algorithm is evaluated on the objective function (the Rastrigin function) with a population size of 10 for iterations (100, 200, 300, 400, and 500); the corresponding curves are given below.



Convergence evaluation for population 20

The convergence behaviour of the hybrid Grey Wolf Optimizer and Whale Optimization Algorithm is evaluated on the objective function (the Rastrigin function) with a population size of 20 for iterations (100, 200, 300, 400, and 500); the corresponding curves are given below.
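The paper does not restate the benchmark explicitly at this point; for reference, the standard n-dimensional Rastrigin function commonly used for such convergence studies is

$$f\left(\mathbf{x}\right)=10n+\sum_{i=1}^{n}\left[x_i^2-10\cos\left(2\pi {x}_i\right)\right],\kern1em {x}_i\in \left[-5.12,5.12\right]$$

with a global minimum of 0 at the origin, which is why descending, asymptotic convergence curves towards zero are expected on this benchmark.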

3.4 Discussion

The WOA’s convergence curves show stepwise fluctuations in the early iterations but only modest variability after a certain number of iterations. The curves converge swiftly: the algorithm quickly moves from exploring the search space to exploiting promising regions of it, and this switching continues until the search space has been thoroughly exploited. As the iterations approach the termination condition, the falling trend of these convergence curves reflects a population of whales collaboratively improving the result by updating their positions towards better ones. The WOA features a good balance between exploration and exploitation, which leads to improved convergence. Because the WOA is evaluated on a multimodal function, its convergence behaviour is similar across the different population sizes.

The GWO’s convergence curves show high fluctuations in the first iterations and very low variation after a certain number of iterations. They converge quickly: the algorithm first explores the whole search space and then exploits promising dimensions of it to find the best result, and this cycle continues until the search space yields the optimal feature set. The descending trend of these convergence curves depicts a population of wolves collaboratively improving the result by updating their positions towards better ones as the iterations approach the termination condition. The GWO exhibits good convergence, suggesting a sound equilibrium between exploration and exploitation; hence, its convergence behaviour is similar for the different population sizes.

The convergence curves of the hybrid of GWO and WOA are better than those of the Grey Wolf Optimizer and the Whale Optimization Algorithm individually: they do not converge as quickly as those of the Whale Optimization Algorithm, indicating good exploration capability, while the data are exploited using the Whale Optimization Algorithm’s bubble-net attacking method. The curves therefore represent the finest compromise between exploration and exploitation, and the curves of the hybridized algorithm are asymptotic with respect to the objective-function value. At each iteration, the algorithm updates the search agents’ positions, calculates their fitness values, and selects the best search agent, which produces the falling trend of these convergence curves. In the hybrid of GWO and WOA, the Grey Wolf Optimizer’s wolves, primarily the Alpha, Beta, and Delta, serve as search agents and provide strong exploration capability, while the whale’s bubble-net attacking style enhances exploitation of the prey.
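The hybrid's update equations are not restated in this section; under the assumption that it alternates between the standard GWO encircling update (guided by the alpha, beta, and delta wolves) and the standard WOA logarithmic-spiral bubble-net update, one position update for a single agent might be sketched as follows. The 50/50 switching probability and all variable names are illustrative assumptions rather than the authors' exact formulation.

```python
# Hedged sketch of one position update in a GWO-WOA style hybrid: GWO's
# alpha/beta/delta-guided encircling step for exploration, WOA's spiral
# (bubble-net) step for exploitation. Not the authors' exact formulation.
import math
import random

def gwo_step(x, alpha, beta, delta, a):
    """Standard GWO update: average of moves towards alpha, beta and delta."""
    new_x = []
    for j in range(len(x)):
        candidates = []
        for leader in (alpha, beta, delta):
            r1, r2 = random.random(), random.random()
            A = 2 * a * r1 - a              # controls exploration vs exploitation
            C = 2 * r2
            D = abs(C * leader[j] - x[j])   # distance to the leader
            candidates.append(leader[j] - A * D)
        new_x.append(sum(candidates) / 3.0)
    return new_x

def woa_spiral_step(x, best, b=1.0):
    """Standard WOA bubble-net update: logarithmic spiral around the best agent."""
    l = random.uniform(-1, 1)
    return [abs(best[j] - x[j]) * math.exp(b * l) * math.cos(2 * math.pi * l) + best[j]
            for j in range(len(x))]

def hybrid_update(x, alpha, beta, delta, a):
    """Illustrative 50/50 switch between the two behaviours (an assumption)."""
    if random.random() < 0.5:
        return gwo_step(x, alpha, beta, delta, a)
    return woa_spiral_step(x, alpha)        # alpha doubles as the best-so-far agent

# Tiny demo with made-up 3-dimensional positions:
a = 2.0  # in GWO, 'a' decreases linearly from 2 to 0 over the iterations
print(hybrid_update([0.5, 0.5, 0.5], [1, 1, 1], [0.9, 0.9, 0.9], [0.8, 0.8, 0.8], a))
```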

The proposed approach uses three different algorithms: GWO, WOA, and a hybrid approach that incorporates the traits of both. The three implemented soft-computing algorithms perform the optimization on the ORIGA_ALL_FEATURES dataset, selecting the most informative features from a total of 65. Their primary aim is to narrow down the initial feature set, which then serves as the input for the various machine learning classifiers, and the key goal is to correctly separate the two classes of subject fundus photographs. As part of a comprehensive investigation, multiple distinct experiments were conducted with all three methods, each sharing the same objective function. The experimental framework uses population sizes from 5 to 20 in steps of 5, and the performance of each population is evaluated by varying the number of iterations from 100 to 500 in steps of 100. The classifier then receives the attributes (features) of the run with the smallest objective-function value, without considering the remaining four runs. The tables above indicate that a minimum of 23 and a maximum of 40 features are selected. The degree of feature reduction is therefore highly variable: retaining 23 of 65 features corresponds to a reduction of roughly 65%, whereas retaining 40 of 65 features corresponds to a reduction of roughly 38%. Determining the execution time requires inspecting two different aspects: the time taken by the iterations of the soft-computing techniques, and the time required for developing and assessing the machine learning models. For the present work, six machine learning classifiers are selected and implemented; the first five are classical classifiers and the sixth is an ensemble of all five. The effectiveness of the classifiers is assessed using a variety of metrics, including accuracy, sensitivity, specificity, and precision, along with the F1-score, Kappa score, Matthews correlation coefficient (MCC), and area under the curve (AUC). In medical image analysis, each of these metrics is essential to the prognosis of human disorders, yet research investigations rarely examine all of them simultaneously.

We assess the model’s performance by the percentage of accurate predictions out of the total number of predictions made; that is, the accuracy rate is obtained by dividing the number of correct predictions by the total number of predictions. Accuracy is one of the most important performance indicators, and the findings in this respect deserve attention. The accuracy increases significantly when the hybrid algorithm is combined with the Ensemble classifier, rising to an auspicious 96.55%. The GWO case reaches an accuracy of 95.6% and the WOA case 95.2%, while the lowest accuracy observed with the hybrid case is 93.1%, which is still very satisfactory. The features selected by the hybrid algorithm achieved an accuracy above 90% in all situations, so the strategy adopted in this study exhibits a consistently high level of accuracy. Sensitivity is a key concept in diagnostic measurement, particularly regarding a test’s capacity to correctly identify people who actually have the disease. In clinical contexts, the “detection rate” is the percentage of individuals who receive a positive test result among those who actually have the condition; a diagnostic test with 100% sensitivity would label every patient with the illness as positive. According to the sensitivity findings, the SVM model achieves the highest score for GWO, 0.981; for WOA the highest sensitivity reaches 0.974; and the hybrid models achieve their highest sensitivity of 0.973 with an ML classifier.

Specificity concerns how well a diagnostic test can identify those who are in good health and do not exhibit the condition: it is the proportion of people without the disease who correctly test negative. A test with the highest level of specificity would correctly identify healthy people by returning a negative result, so a positive outcome from such a test strongly suggests that the disease is present, whereas a test with less than full specificity cannot rule in the condition as unambiguously. Because specificity is so important, the factors that drive it must be evaluated carefully. With the SVM model, WOA attains the highest specificity of 0.933, the best GWO value is 0.921, and the hybrid technique, using the same classifiers, reaches a maximum of 0.903. The F1-score, derived as the harmonic mean of precision and recall, offers a fair evaluation of the classifier; it is a well-known performance statistic for classification models because it accounts for both false positives and false negatives. The maximum F1-score for the GWO-CatBoost combination and for the Hybrid-Ensemble combination is 0.975, while the maximum F1-score for the WOA is 0.967. Precision quantifies the proportion of correctly identified positive cases among all instances classified as positive; several viable methods can accurately determine the prevalence of glaucoma in a population, and the degree of precision closely correlates with the quantity of pertinent data points. Precision also matters clinically, because starting medication for a patient who shows glaucoma-like findings but does not have the clinical condition calls for caution. The optimal precision value, 0.976, is obtained when GWO is combined with RF. Overall, the proposed hybrid algorithm achieves superior accuracy and F1-score compared to the other benchmark approaches, although GWO slightly outperforms the hybrid algorithm in terms of precision, sensitivity, F1-score, kappa score, and MCC.

For each diagnostic threshold there is a corresponding pair of sensitivity and specificity values. From these pairs we generated a Receiver Operating Characteristic (ROC) curve, a graphic depiction of the paired data points, with 1 − specificity (the false positive rate) on the x-axis and sensitivity on the y-axis. Both the ROC curve analysis and the AUC calculation assess a test’s capacity to discriminate between groups: as the curve approaches the top left corner and the region beneath it grows, the test shows increasing ability to differentiate affected from unaffected cases. The area under the curve ranges from 0 to 1 and is a valid indicator of the test’s effectiveness; an ideal diagnostic test has an AUC of 1, while an AUC of 0.50 indicates no discriminative ability. The AUC is a regularly used metric for assessing diagnostic accuracy, and we have calculated AUC values as indicators of success in this study setting; the closer the value is to 1.000, the better the output. The combination of the SVM classifier and GWO achieved the best AUC value of 0.943, which was also the best result for the WOA algorithm, while the hybrid strategy showed a highest AUC score of 0.938. By this metric, the SVM classifier performs admirably. The findings also include receiver operating characteristic (ROC) curves for each instance, confusion matrices, and Matthews correlation coefficient (MCC) and Kappa score estimates for each experiment, along with execution time and the other key performance measurement parameters.

3.5 Comparison with the current best practices

A comparison of the suggested method with the most cutting-edge glaucoma prediction methods is shown in Table 24. The table demonstrates how well the methods used in this study perform in detecting glaucoma compared to earlier research. With an accuracy of up to 96.5% in recognising glaucoma, the table provides strong evidence that the suggested approach is reliable and proficient in categorising fundus images, and the sensitivity and specificity of the performance, along with the other metrics, are likewise encouraging. Compared to the other 18 techniques listed in Table 24, all published in or after 2018, our performance is exceptional. Our approach, which relies on machine learning and soft-computing techniques, has proven more successful than deep learning approaches in several situations, although it should be kept in mind that in certain cases the datasets used to evaluate the other approaches differ from ours.

Table 24 Evaluating the effectiveness of the suggested approach in contrast to previous work

4 Conclusion

It is commonly acknowledged that feature selection (FS) is a crucial pre-processing step in modelling. The main goal of FS is to remove irrelevant features so that the learning model performs better while using an appropriate number of features. This is important in biomedical research, particularly for disease databases, where many patient characteristics are captured and a wrong diagnosis might have serious repercussions. To address this issue, we have proposed a pioneering FS method that employs three nature-inspired algorithms to identify the most significant traits. The suggested approach selects a small number of features while maintaining a high degree of classification accuracy. We have applied it to the publicly available ORIGA collection of glaucoma images. This investigation focuses on glaucoma, the second most common cause of vision loss in people; the only way to avoid its negative effects is timely identification, which reduces the risk of vision loss and allows treatment to begin early. Using both 5-fold and 10-fold cross-validation procedures, several machine learning classifiers were trained and tested on the features selected by the soft-computing algorithms. To explore the occurrence of glaucoma fully, we extract a mix of several classes of features, such as clinical measurement features and image-based features (e.g., statistical features); this strategy seeks to improve the glaucoma screening classifier’s accuracy by accumulating extensive evidence for glaucoma discrimination. According to the experimental results on the ORIGA dataset, our glaucoma screening approach surpasses other recent approaches in terms of accuracy, sensitivity, and specificity: the individual soft-computing algorithms achieved accuracy rates of about 95%, while the hybrid method exceeded 96%. In contrast to many other current approaches considered in this comparison, the acquired findings show greater performance, so the suggested methodology can be regarded as an appropriate and reliable technique for automated glaucoma screening in a variety of clinical scenarios. To illustrate our strategy’s resilience, we intend to apply it to high-dimensional datasets in the future, as well as to a variety of clinical and real-world situations, which will allow us to improve classification accuracy and obtain more exact results. This two-class classification problem can also be observed, formulated, and analyzed as a multiobjective problem, in which the two conflicting objectives are to maximize accuracy while minimizing the number of features; similarly, further objectives can be added to turn it into a many-objective problem. Multiple solutions can then be computed and suggested using Pareto fronts, enabling researchers and practitioners to select the ones that best suit their needs.
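As a sketch of the bi-objective formulation mentioned above (the notation is ours and is not taken from the paper), the search over binary feature masks can be written as

$$\underset{\mathbf{s}\in {\left\{0,1\right\}}^{65}}{\min}\ \left(1-\mathrm{Accuracy}\left(\mathbf{s}\right),\kern0.5em \frac{\sum_{i=1}^{65}{s}_i}{65}\right)$$

where the first objective minimizes the classification error of the model trained on the selected features and the second minimizes the fraction of features retained; the Pareto front of this problem then provides the set of non-dominated trade-offs from which a researcher can choose.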

The extremely favourable outcome shows that the proposed system can be deployed in hospitals in remote locations that face a shortage of expert medical practitioners. Additionally, it can serve as a second opinion, alleviating the burden on overworked, experienced, and senior ophthalmologists. The approach, which is highly efficient and affordable, benefits patients by successfully classifying fundus images as healthy or affected, thereby initiating the best treatment as soon as possible, slowing the progression of the disease, and preventing patients from losing their vision permanently. The suggested system can seamlessly integrate with the Internet of Things, providing Internet of Medical Things functionality. From patient images, the suggested method could also be adapted to predict the presence of other diseases such as skin cancer, breast cancer, and diabetic retinopathy.

One of the major limitations of this empirical study is that the proposed approach was validated on a single medium-sized dataset. Validation on multiple datasets, or on a combination of different medium- or small-sized datasets, would increase confidence in the suggested approach. A larger number of features could also have been extracted from this fundus image dataset, and extracting more features might affect the performance of the approach.