Keywords

1 Introduction

Alzheimer’s disease is a degenerative neurologic condition in which the brain shrinks (atrophy) and brain cells die. Alzheimer’s disease is the most frequent form of dementia, defined as a progressive loss of cognitive, behavioral, and social abilities that impair a person's capacity to operate independently [1, 2]. Proper treatment helps the patients to reduce the disease symptoms temporarily. Correct medicines can assist patient by preserving their independence and mental functionality. Alzheimer's disease is known as no cure disease it effects patient’s brain. In this illness the patient suffers loss of memory that can lead to mortality in the advanced stages of illness.

Alzheimer’s disease has its characterization by loss of memory. Early identifications of this disease include recalling recent events or discussions [3]. Memory functionality also get effected as the disease progressed. A person suffering from Alzheimer’s may notice difficulties in remembering things and managing thoughts. These changes in the brain are caused following problems:

  1. 1.

    Memory: This illness cause badly memory loss, making the infected person difficult to operate at work or home. An Alzheimer's patient may experience the following symptoms:

    • Replicate statements and queries as needed.

    • Remember about discussions, appointments, or events and remember about them.

    • Frequently misplaced belongings, keeping them in strange places.

    • Family members’ names and ordinary objects are eventually forgotten.

    • Facing problems in recognizing items, express thoughts, or in conversations.

  2. 2.

    Reasoning and thinking: The disease of Alzheimer’s impairs concentration and thinking, especially when abstract notions like numbers are involved [4]. Multitasking is particularly difficult, including managing finances, paying payments on time, and cheque books balancing can be a difficult task. This all is because a person who is suffering from Alzheimer's disease may eventually lose the ability to recognize and cope with the numbers.

  3. 3.

    Making decisions and judgments: Alzheimer’s disease impairs a person’s capacity, making sound assessments and decisions in everyday situations becomes very difficult. As a result, a person can make bad or unusual decisions that lead to social encounters or improper dressing that is inappropriate for the present weather. Even responding successfully to ordinary situations like a stove on fire or unexpected driving conditions seems very difficult to a person suffering from this disease.

  4. 4.

    Organizing and carrying out routine activities: Over time, the person with such a disease starts facing problems with daily routine that involve normal tasks, such as playing games, kitchen work driving etc. Extending this, in the advanced stage of Alzheimer's disease patient forget daily work like bathing, dressing, etc.

  5. 5.

    Personality and behaviour change: As Alzheimer’s disease causes changes in the brain functionality, this impacts the mood and behaviour of a person. Such a person shows the following issues.

Apathy, Depression, Social isolation, Mood Swings, Others’ mistrust, changes in sleeping patterns, Aggressiveness, irritability, Wandering, lost inhibitions, and Delusions.

The daily diagnosis of an Alzheimer's patient is difficult. But the patient needs daily monitoring, and the family members are also curious about the patient's status. Earlier detection of Alzheimer’s disease was difficult. In most cases, it is diagnosed after death. But with the help of artificial intelligence, it is possible to detect Alzheimer’s disease in its early stage. This chapter compares various methods to detect Alzheimer’s using Machine Learning (ML) techniques.

Brain images are not good for Alzheimer’s disease. It is mostly used to study strokes, trauma, or tumors to understand cognitive change. This chapter uses Magnetic resonance imaging (MRI) data from the Open Access Series of Imaging Studies (OASIS) [5,6,7]. MRI provides detailed image of brain. MRI take help of radio waves and a magnetic field to capture brain status.

This work helps in the early detection of Alzheimer’s patients. This is very useful for recovering the disease at its initial phase itself. The work in this paper explains various machine learning algorithms [8,9,10,11,12,13]. In addition, the work evaluates different machine learning algorithms on MRI data to identify Alzheimer patients, and each algorithm shows its accuracy in the identification process. As a result, parameter tuning has been carried out by considering the machine learning approach with the highest accuracy to be the best optimal feasible solution for the dataset.

2 Related Work

Bari et al. [14] described the problem of Alzheimer’s disease very well. We also used the same dataset (OASIS) used by the author. The Author used various ML techniques and compared the result. The data set Bari et al. [14] used s small but contains useful information that we reuse in our proposed work.

Moradi et al. [15] Experimented with Moderate Cognitive Impairment (MCI) to use MCI as a bridge between age-related cognitive decline and Alzheimer's disease. The Authors demonstrate the usage of a machine learning-based MRI biomarker. According to the paper, their aggregate biomarker attained a tenfold cross-validation AUC score of 0.9020 in differentiating between progressive MCI (pMCI) and stable MCI (sMCI). The Author used semi-supervised learning to implement sMCI/pMCI classification on data from AD patients and normal controls rather than MCI patients. Regularized logistic regression was used to choose features. To avoid possible confusion between changes owing to AD and those related to normal aging, they eliminated aging effects from MRI data before classifier training. Finally, they created an aggregate biomarker using a random forest classifier by first learning a distinct MRI biomarker and then merging age and cognitive data from MCI participants.

Zhang et al. [16] focused on Eigen brains and machine learning; this research suggests a new computer-aided diagnosis (CAD) approach for MRI brain imaging. They employ critical slices from the 3D volumetric data acquired by the MRI to generate Eigen brain pictures based on EEG data in their method. After that, they used kernel support vector machines with several kernels trained using particle swarm optimization. Their polynomial kernel (92.36%) was more accurate than their linear and radial basis function kernels (91.47% and 86.71%, respectively).

Magnin et al. [17] suggested a method based on support vector machine (SVM) classification of whole-brain anatomical MRI to distinguish patients with Alzheimer’s disease from old controls in this publication. The researchers parceled three-dimensional T1-weighted MRI data from 16 patients with Alzheimer’s disease and 22 elderly controls into regions of interest (ROIs). The grey matter properties of these ROIs were subsequently used to classify participants using an SVM algorithm. The classifier has a mean accuracy of 94.5% based on their findings. The fact that they should have accounted for age-related changes in the grey matter and worked with a tiny data set could be one of their technique's drawbacks.

Khan et al. [18] reviewed the four most common deep learning and machine learning methods for brain disease identification. Some significant insights into modern ML/DL approaches are revealed in his paper. The most difficult part of the analysis is feature extraction, identification, and classification techniques using ML/DL methods. He also discussed methods to increase classification precision. The author also focuses on the quantity of training data that must be increased. The author used hybrid algorithms and combined supervised learning with ML and DL. Khan et al. [18] also identified the shortcomings of current ML/DL-based methods in identifying different types of brain disorders. The article offers a debate centered on a collection of open research problems to create efficient AI-based medical systems Incorporation of XAI methods is the ultimate goal for applications. This will aid healthcare workers in developing self-assurance, and AI-based solutions will become a treatment for individuals with neurological diseases (Table 1).

Table 1 Related work

Saratxaga et al. [19] proposed a method using balance accuracy calculation. They achieved 88% of accuracy. The author deployed BrainNet 2d and BrainNet3D techniques for classification; both methods belong to deep learning. The author used OASIS-1 and OASIS-2 datasets for training and testing purposes. The authors used 2D and 3D architecture in this paper and applied a subject-level approach using transfer learning. But this approach needs more resources and high computing power. The result could be promising better. There is a scope for modification in the method used.

Sudershan et al. [20] discussed the problem of Alzheimer’s disease and the nonavailability of drugs. The authors also discussed the importance of early detection of disease and the problem with the available datasets. The authors experimented with mild cognitive impairment, structural magnetic resonance, import vector machine, and support vector machine. They achieve 75% of accuracy in validation. The experimented models are novel in this dataset but there is a scope for improvement because of poor accuracy.

Helaly et al. [21] discussed the importance of early detection of Alzheimer’s disease. The authors used a convolutional neural network (CNN) for their work. They classified Alzheimer’s disease into four broad categories. The authors used 2D and 3D brain image data for classification from the Alzheimer’s Disease Neuroimaging Initiative (ADNI) dataset. Once the classification is done, the authors applied transfer learning models, for example, VGG19. The authors achieved 93.61 and 95.17 for 2D and 3D multi-class classifications and 97% of accuracy in the VGG19 pre-trained model.

Shakila Basheer et al. [22] investigated the MRI images dataset and used a large set of features for processing. As per the research, they conclude that age is the most important feature of Alzheimer’s disease. They achieved 92.39% accuracy with the CNN model. There is a scope to improve the result by applying more validation to external data, and researchers also apply the same model to different datasets.

Martinez-Murcia et al. [23] experimented with a deep convolutional autoencoder that helps decompose a large dataset. The author uses conventional features and neuropsychologic variables to get the result. They achieve 84% accuracy, which is acceptable with the image data. But there is still scope for improvement, and researchers can improve accuracy.

Prajapati et al. [24] perform binary classification using a deep neural network. The author used an ADNI image dataset with three hidden layers and a k-fold combination with the fully connected network. They achieved 85% of accuracy with the scope for improvement.

3 Understanding of Data

3.1 Data

In this chapter, we used the Open Access Series of Imaging Studies (OASIS) data [25,26,27] from Kaggle. With the help of this MRI data, we will find the various categories of dementia. This dataset contains 60–96 years of patients with longitude values. This data is also balanced in nature. It has 72 nondemented and 64 demented patients’ information. Each patient was scanned exactly once. Fourteen patients were found nondemented earlier but later were found demented.

3.2 Initial Data Analysis (IDA)

Before starting our analysis, we try to understand the nature of the data with a few statistical analyses. We try to establish correlations between features that help us better understand MRI data and dementia [28].

Figure 1 uses the numeric features to represent their minimum, maximum, and average values.

Fig. 1
7 line graphs plot the minimum, maximum, and average values of education years, social and economic status, the status of mini-mental examination, ratings of clinical dementia, estimated total intracranial volume, normalize whole brain volume, and the atlas scaling factor.

Representation of minimum, average, and maximum values of every feature for graph implementation

While observing the value of the dataset, we can infer the following.

  1. 1.

    Men are more prone than women to be demented, as in Alzheimer’s disease.

  2. 2.

    Demented patients had fewer years of education than healthy people.

  3. 3.

    Compared to the Demented group, the Nondemented group had a larger brain volume.

  4. 4.

    Compared to Demented and non-demented patients, demented have a higher proportion of people in their 70 and 80 s.

3.3 Data Pre-Processing

Table 2 represents the sample dataset used in this chapter. This dataset had some values that needed to be added. We have two choices: leave the entire record of the missing values or fill in a value in the missing space. We used Imputation as a method to fill in missing values. After that, we compare our model with our benchmark parameters, such as accuracy, AUC, and recall. We divided the data set into two parts one for training purposes and the second for testing purposes. We divide the dataset into a 7.5:2.5 ratio, i.e., 75% of the data is for training, and 25% is used for testing.

Table 2 Sample dataset

4 Performance Evaluation

4.1 Evaluation Metric

Our focus is on the early detection of Alzheimer’s patients. We mainly focused on true positives and false positives. We wish to increase the true positive rate and decrease the false positive rate. So, we effectively detect Alzheimer’s patients. For this purpose, we used AUC, Precision, and recall. In the Machine Learning (ML) method, if we have a labeled dataset, then two possibilities occur [29]. The either experimented result matches with the original result, i.e., True event, or not matched i.e., false event. On this basis, we can divide these true or false positive events into the following categories.

Table 3 represents the confusion matrix. Our statistics are based on TP and FP mostly. We try to increase TP and reduce the FP. This confusion matrix helps us to determine precision, recall, and accuracy as follows:

$$Precision = \left( {\sum TP} \right) / \left( {\sum \left( {TP + FP} \right) } \right)$$
(1)
$$Recall = \left( {\sum TP} \right) / \left( {\sum \left( {TP + FN} \right) } \right)$$
(2)
$$Accuracy =\sum \left( {TP + TN} \right) / \left( {\sum \left( {TP + FP + FN + TN} \right) } \right)$$
(3)
Table 3 Confusion matrix (Ma et al. [30])

Equation 1 talks about a condition where we have a ratio of actual Alzheimer’s patients and positive patients found by our algorithm. This situation is known as precision. Equation 2 represents a ratio of total positive (patients) out of the total number of people tested correctly. Equation 3 informed us how many correct results we achieved from the total experiments.

4.2 Algorithms

We Applied the following machine learning algorithms.

  • Logistic Regression (LR): LR is a binary classification method [31]. It is used with supervised ML models. This model is best suited if we classify data in a yes/no format, such as whether the patient is suffering from Alzheimer’s or not. The LR algorithm uses the following mathematical function.

    $$Logistic\,function = 1/(1 + e^\wedge ( - x))$$
    (4)
  • This function is also known as the sigmoid function. This LR function uses the likelihood function (conditional probability) to calculate the loss. Figure 2 explains the behavior of the logistic regression function. The values are categorized into two segments. Values move toward positive infinity, represented as 1, and values move toward negative infinity, represents by 0. This model gave 75% of accuracy in a testing environment. We received the following results with LR algorithms.

    Fig. 2
    A line graph of logistic regression depicts the sigmoid function versus input. It passes through the points (1, 0), (3, 0), (5, 9), (7, 9), (10, 0), (11, 1), (13, 1), (15, 1), (19, 1), and (21, 1). The values are approximated.

    Logistic regression

  • The max accuracy at the time of validation is 75%

  • The value of the regularization parameter is 10

  • Test accuracy is 78%

  • Value of recall at the time of test is 75%

  • AUC shows 79% score

4.2.1 Support Vector Machine (SVM)

SVM is a supervised classification method [32]. This method uses the kernel function for calculating the distance. This method is best suited for binary classification and outlier detection. It uses subset distance (support vector) for classification, making it memory efficient. But, if the number of features exceeds the number of samples, this method must be more balanced. SVM works effectively due to its core mathematical function, i.e., quadratic programming problem solver, but its space and time complexity increases rapidly.

In our experiments, we found the following result with the SVM model.

  • We achieve the best accuracy with 77%

  • The value for parameter ‘C’ is 100%

  • The value for gamma we achieved 0.1

  • Correct prediction with AUC is 81%

  • Highest recall value is 70%

  • Test recall (with the best parameter) is 82%.

Figure 3 explains the working of the SVM model. The SVM uses a hyper-plane model for classification or regression purposes. The larger separation of two planes is possible due to this hyper-plan function.

Fig. 3
A scatter plot depicts the S V M model. The plotted points are (6, 8.5), (7, 10), (7.5, 8.5), (9, negative 4.2), and (9.5, negative 1). It indicates the points at (6, negative 6.5), (8, negative 7), and (7.2, negative 4.2). The values are approximated.

SVM Model

4.2.2 Decision Tree

A decision tree is another powerful algorithm in ML, as shown in Fig. 4. It is used for classification and regression. This method returns a true or false decision on an input. As the name suggests, it is a binary tree structure. It may grow internally in a binary tree manner. This method helps us in the question-answering pattern. For example, it takes features like patient age. If the age is greater than 50, then this method gives a positive answer, and the patient may be vulnerable to Alzheimer’s disease.

Fig. 4
A decision tree depicts the classification of the genetic disorder. If the genetic disorder is true, then vulnerable and if it is false, then the age is greater than 50. If the age greater than 50 is true, then vulnerable, and if it is false, then safe.

Decision tree

We receive the following result with decision tree algorithms.

  • The best accuracy (on validation set) computes as 77%

  • Best parameter (for the maximum depth) comes out as 2

  • We achieve the best accuracy with 81%

  • Highest recall value is 65%

  • Correct prediction with AUC is 82%.

4.2.3 Random Forest Classifier

Figure 5 shows the working of the random forest random. A random forest classifier is a collection of various independent trees. It is an ensemble machine-learning technique. As the name suggests, it is a forest of several decision trees.

Fig. 5
A tree diagram depicts the classification of data into genetic disorders. If genetic disorders are true, then vulnerable and if it is false, then age greater than 50. If the age is greater than 50 is true, then vulnerable, and if it is false, then safe.

Random forest

We received the following result with a random forest classifier.

  • Best accuracy (on validation set) computes as 80%

  • Best parameters (of M, d, m) comes out as 14 5 7

  • We achieve the best accuracy with 84%

  • Highest recall value is 80%

  • Correct prediction with AUC is 84%.

ML algorithms help us to analyze data and classification. We achieve good accuracy and result with SVM. If data is labeled, supervised learning is the best way to classify the data.

5 Result Analysis

To verify our results, we experimented with different ML methods like SVM, Decision tree, LR, Random forest, etc. The result we get is verified with a confusion matrix. The confusion matrix is the standard method to verify the experimented result. Our primary focus in this research is to avoid a false negative. Because if we start drug with a patient who has not suffered from this disease and we start medicins, it is too dangerous for the patient’s health. To avoid this scenario, the AUC is the best option for a performance metric. We also experimented with accuracy and recall to verify our results. We compared the results of all ML algorithms. Figure 6 shows all results. That Random forest is best for AUC. Best accuracy we achieved through SVM and Random forest. But recall is higher in random forest and Logical Regression methods. Since data is labeled and the random forest is an ensemble learning algorithm. So, it gives better results in comparison to all algorithms.

Fig. 6
A horizontal triple bar graph plots for A U C, recall, and accuracy of random forest, decision tree, S V M, and logistic regression. The recall of random forest, decision tree, S V M, and logistic regression is lower than A U C and accuracy.

Result generation and comparison of AI methods

6 Conclusion and Future Directions

In work mentioned, we applied the most common ML techniques and got better results than early researchers. Due to the data size, we are restricting ourselves to employing a more complex model. Since data is labeled then, supervised ML methods are best suitable for this dataset.

The future direction of this research is to collect more data with a variety such as images and other patterns. With the voluminous data, we can go for deep learning approaches.