1 Introduction

Nonlinear manifold learning is an efficient approach to dimension reduction. Various methods and algorithms have been proposed for analyzing the underlying structure of high-dimensional, large-scale data, and they have attracted considerable attention in the machine learning community [55]. The basic idea of manifold learning is to transform high-dimensional data into a low-dimensional space while retaining the most important information [17]. In recent years, the study of manifold learning for nonlinear dimension reduction has attracted increasing attention [51, 58].

In 2000, the Isometric Mapping (Isomap) algorithm became a hot research topic in nonlinear manifold learning and information science [54]. Isomap is a nonlinear manifold learning algorithm that is widely used for nonlinear dimension reduction [14]. It can be viewed as a variant of metric Multidimensional Scaling (MDS) that preserves the global intrinsic structure of the data points while mapping high-dimensional data into a low-dimensional space [45]. Choi and Choi [2, 6] proposed the kernel Isomap algorithm to address noise and outliers, which preserves topological stability in their presence; its limitation is that it alters the original Isomap algorithm. They later suggested the robust kernel Isomap method for the topological stability, noise, and outlier problems [8], reducing the effect of outliers on the topological structure with the help of network flows [7, 8]. Chang and Yeung [5] proposed the robust LLE method for noise and outliers, which improves the robustness of LLE by suppressing outliers and noise in the data points; its main drawback is that some outliers and noise remain, so robustness is still reduced for some data points. Kouropteva et al. [23, 24] and Shao et al. [42] proposed methods for selecting optimal parameter values for LLE and Isomap, and Saxena et al. [41] proposed an integrated approach combining Isomap and LLE.

In addition, Bo Li et al. [28] proposed an extended Isomap approach that builds on the robust LLE procedure to address the reduced robustness of the original Isomap. This method uses weighted Principal Component Analysis (PCA) [5] to measure noise and outliers in the data points, so every point in the data set is assigned a weight through local robust PCA. To detect weighted noise and outliers, the box-plot statistics of R. McGill et al. are used [30]. After de-noising, this method can easily retain the topological structure [28].

Our motivation is the noise problem of the nonlinear Isomap algorithm. Isomap is sensitive to noise and is therefore poorly suited to real-world datasets, which are rarely noise-free. We propose a novel approach, called the Noise Removal Isomap with Classification (NRIC) method, to overcome the Isomap noise problem. We use the Local Tangent Space Alignment (LTSA) algorithm together with classification techniques: LTSA, a nonlinear manifold learning technique, effectively eliminates the noise in the data points. We use several well-known classifiers: Support Vector Machine (SVM) [49, 50], K-Nearest Neighbor (KNN) [16, 29], Naïve Bayes (NB) [20], and Random Forest (RF) [25]. Because real-world datasets are very noisy, the plain Isomap algorithm cannot easily map such high-dimensional data to a low-dimensional space suitable for these classification techniques.

Our results show that the LTSA algorithm combined with classification techniques significantly improves on the original Isomap in a noisy environment. Our proposed method also produces accurate results for large, high-dimensional datasets while reducing noise in the data points. In Sect. 3, we explain the techniques and algorithms in detail.

Contributions In summary, the contributions of our proposed method are given below:

  1. We propose the NRIC method for the Isomap noise problem. Our method uses the LTSA algorithm together with well-known classification techniques. The proposed NRIC method can easily embed high-dimensional data into a low-dimensional space and optimize the neighborhood graph.

  2. We conduct extensive experiments to analyze our NRIC method empirically on five datasets. We calculate the accuracy, mean-precision, mean-recall, and Area Under the ROC (Receiver Operating Characteristic) Curve (AUC) for our proposed method and show that it removes noise effectively.

  3. We improve performance on the Isomap noise problem by using different neighborhood values of K. The experimental section shows the effectiveness of our proposed NRIC method across the K values.

The paper is organized as follows. In Sect. 2, we briefly review Isomap and machine learning classifiers. In Sect. 3, we detail the proposed method, the LTSA algorithm, and the classification techniques. Section 4 describes the experimental results on five large-scale datasets. Finally, the paper is concluded in Sect. 5.

2 Related Work

Classical Isomap can be viewed as a variant of metric Multidimensional Scaling (MDS) that models nonlinear data using geodesic distances. The primary purpose of Isomap is to preserve the geometry of the data by estimating the geodesic distance between all pairs of data points. Two cases are distinguished: neighboring data points and faraway data points. For neighboring points, the input-space Euclidean distances provide a good approximation to the geodesic distances. For faraway points, the geodesic distances are approximated by shortest paths through chains of neighboring points [28, 37, 38]. The three main steps of Isomap are given below:

Step-1: Build the neighborhood graph G First, build the K-nearest-neighbor (KNN) graph G based on the Euclidean distance between data points \(X_{i}\) and \(X_{j}\) in the input space, i.e., \(d(X_{i}, X_{j}) = ||X_{i}-X_{j}||\) [28].

Step-2: Calculate the shortest distances Once the neighborhood graph G is built, compute the geodesic distance matrix: the shortest-path distance between any two data points \(X_{i}\) and \(X_{j}\) in the graph G, using Dijkstra's or Floyd's algorithm [18].

Step-3: Build a d-dimensional embedding Isomap applies the MDS algorithm to the dense geodesic distance matrix to compute a low-dimensional (d-dimensional) embedding of the data points [43].
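To make the three steps concrete, the following Python sketch assembles them from scikit-learn and SciPy building blocks. It is a minimal illustration, not the authors' implementation: the neighborhood size, the 500-sample subset, and the use of SMACOF-based metric MDS in place of classical MDS are our assumptions, and the sketch assumes the KNN graph is connected.

```python
import numpy as np
from scipy.sparse.csgraph import shortest_path
from sklearn.datasets import load_digits
from sklearn.manifold import MDS
from sklearn.neighbors import kneighbors_graph

X = load_digits().data[:500]  # small subset to keep the example fast

# Step 1: K-nearest-neighbor graph G with Euclidean edge weights.
G = kneighbors_graph(X, n_neighbors=10, mode="distance")

# Step 2: geodesic distances approximated by shortest paths in G
# (method="D" selects Dijkstra; Floyd-Warshall is the alternative).
D_geo = shortest_path(G, method="D", directed=False)

# Step 3: d-dimensional embedding from the dense geodesic distance matrix
# (metric MDS here stands in for the classical MDS step of Isomap).
mds = MDS(n_components=2, dissimilarity="precomputed", random_state=0)
Y = mds.fit_transform(D_geo)
print(Y.shape)  # (500, 2)
```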

2.1 Machine Learning Classifier

Machine Learning (ML) is used in many research fields, including artificial intelligence, data classification, and statistics, and is concerned with the automatic acquisition of knowledge from datasets. ML techniques can improve their performance on a task with experience [34]. Data classification is one of the most studied areas of ML, and many algorithms are available for it, such as Support Vector Machine (SVM), K-Nearest Neighbor (KNN), Random Forest (RF), Naïve Bayes (NB), Artificial Neural Network (ANN), Classification and Regression Tree (CART), and Decision Tree [33]. The primary goal of classification is to predict the labels of data points in a given dataset; the labels are sometimes called classes, targets, or categories. Classification Predictive Modeling (CPM) maps input data points through a mapping function to predict the output labels. A detailed description of the classification algorithms is given in Sect. 3.

3 Noise Removal Isomap with Classification (NRIC)

In this section, we propose a novel approach, the Noise Removal Isomap with Classification (NRIC) method, for the Isomap noise problem. The main idea of NRIC is to eliminate the noise quickly, map the high-dimensional data into a low-dimensional space, and then easily optimize the neighborhood graph. We use the LTSA algorithm together with classification techniques; NRIC achieves higher accuracy than Isomap and reduces the noise in the data points. We use the SVM, KNN, NB, and RF classifiers with different K values. Algorithm 1 of our NRIC method is given below, followed by an illustrative sketch:

figure a
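Since Algorithm 1 is presented as a figure, the sketch below shows one plausible reading of the NRIC pipeline in Python with scikit-learn. The Gaussian noise injection, the helper name nric_scores, and the cross-validation protocol are our illustrative assumptions rather than the authors' exact procedure.

```python
import numpy as np
from sklearn.datasets import load_iris
from sklearn.ensemble import RandomForestClassifier
from sklearn.manifold import LocallyLinearEmbedding
from sklearn.model_selection import cross_val_score
from sklearn.naive_bayes import GaussianNB
from sklearn.neighbors import KNeighborsClassifier
from sklearn.svm import SVC

X, y = load_iris(return_X_y=True)
rng = np.random.default_rng(0)
X_noisy = X + rng.normal(scale=0.3, size=X.shape)  # assumed noise model

def nric_scores(X, y, k):
    """Embed the noisy data with LTSA (neighborhood size k), then
    cross-validate the four classifiers on the embedding."""
    ltsa = LocallyLinearEmbedding(n_neighbors=k, n_components=2,
                                  method="ltsa", eigen_solver="dense")
    Z = ltsa.fit_transform(X)
    clfs = {"SVM": SVC(kernel="linear"),
            "KNN": KNeighborsClassifier(),
            "NB": GaussianNB(),
            "RF": RandomForestClassifier(random_state=0)}
    return {name: cross_val_score(clf, Z, y, cv=5).mean()
            for name, clf in clfs.items()}

for k in (10, 15, 20, 25, 30, 40):  # the K values used in Sect. 4
    print(k, nric_scores(X_noisy, y, k))
```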

3.1 Local Tangent Space Alignment (LTSA) Algorithm

In 2004, Zhang and Zha introduced the nonlinear Local Tangent Space Alignment method for embedding [59]. This method can easily embed high-dimensional data into a low-dimensional space; it can also be used for noise problems and eliminates noise very efficiently. LTSA is conceptually a variant of LLE and assumes the same kind of geometric manifold, but it constructs the embedding differently. In LLE, every point of the dataset is locally linearly embedded into a linear patch of the manifold, and the low-dimensional dataset is then constructed so that the locally linear relationships of the original dataset are preserved. LTSA instead builds a locally linear patch by applying PCA to the neighbors of each point; the patch can be viewed as an approximation of the local tangent space at that point [52]. Algorithm 2 of LTSA is given below, followed by a sketch of its local step:

figure b
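Algorithm 2 is shown as a figure; the numpy sketch below illustrates only LTSA's local step described above, where PCA (via an SVD) on each point's neighborhood yields local tangent-space coordinates. The global alignment step that stitches the patches together is omitted, and the function name and parameter values are our own.

```python
import numpy as np
from sklearn.neighbors import NearestNeighbors

def local_tangent_coordinates(X, k=10, d=2):
    """For each point, return the d-dimensional coordinates of its k
    nearest neighbors in the estimated local tangent space."""
    _, idx = NearestNeighbors(n_neighbors=k).fit(X).kneighbors(X)
    patches = []
    for neighbors in idx:
        Xi = X[neighbors] - X[neighbors].mean(axis=0)  # centered patch
        # top-d right singular vectors span the local tangent space (PCA)
        _, _, Vt = np.linalg.svd(Xi, full_matrices=False)
        patches.append(Xi @ Vt[:d].T)  # local coordinates, shape (k, d)
    return patches  # LTSA then aligns these patches into one embedding
```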

3.2 Support Vector Machine (SVM) Classifier

SVM has attracted considerable attention and is used very actively in several applications, such as regression, classification, and ranking. SVM is based on the Structural Risk Minimization (SRM) principle of statistical learning theory: it identifies a decision surface, also called a hyperplane, that optimally separates the classes [4, 9, 13, 35]. SVM uses this separating hyperplane as its classification model. A central difficulty is that the classes often cannot be separated linearly in the input space; in that case, a kernel function implicitly maps the input space into a high-dimensional space where a separating hyperplane can be found [39]. In our experiments, we use the linear kernel for the SVM classifier. Algorithm 3 of SVM [47] is given below, followed by a minimal usage example:

figure c
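Algorithm 3 is shown as a figure; as a minimal, hedged usage example, the snippet below trains the linear-kernel SVM used in our experiments on one of the paper's datasets. The train/test split and the default C value are illustrative choices.

```python
from sklearn.datasets import load_breast_cancer
from sklearn.model_selection import train_test_split
from sklearn.svm import SVC

X, y = load_breast_cancer(return_X_y=True)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)

clf = SVC(kernel="linear", C=1.0)  # linear kernel, as in the experiments
clf.fit(X_tr, y_tr)
print(clf.score(X_te, y_te))  # accuracy on the held-out split
```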

3.3 K-Nearest Neighbors (KNN) Classifier

The KNN classifier is the simplest classification method used in machine learning and data mining. It is useful and easy to implement: it requires no fitted model, can classify many types of datasets, and often performs well on them [21]. The performance of KNN depends on the distance metric used to compare data points; Euclidean distance is the usual choice for measuring similarity [1]. The KNN algorithm classifies an object based on the nearest training samples in the feature space. KNN is a kind of lazy, instance-based learning because the function is only approximated locally and all computation is deferred until classification [36]; the classifier simply finds the closest samples in the training dataset. In the results section, the neighborhood parameter K is treated as a value to be tuned [56, 57]. Algorithm 4 of KNN [44] is given below, followed by a minimal usage example:

figure d
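Algorithm 4 is shown as a figure; the snippet below is a minimal illustration of the classifier described above, using the default Euclidean metric. The value n_neighbors=5 is an illustrative choice, not the tuned K of the results section.

```python
from sklearn.datasets import load_wine
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier

X, y = load_wine(return_X_y=True)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)

knn = KNeighborsClassifier(n_neighbors=5, metric="euclidean")
knn.fit(X_tr, y_tr)  # lazy learning: fit just stores the training samples
print(knn.score(X_te, y_te))  # accuracy on the held-out split
```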

3.4 Naïve Bayes (NB) Classifiers

NB classifiers are simple probabilistic classifiers based on Bayes' theorem [27] and are mostly used when the dimensionality of the input data is high. An NB classifier efficiently computes the output class probabilities from the input data and can incorporate new raw input data at runtime [35]. Different types of NB classifier arise from different assumptions on the distribution of the features; these are called the event models of the NB classifier [22] and include Bernoulli and multinomial distributions for discrete features and Gaussian distributions for continuous ones [32]. In our proposed method, we use the Gaussian event model to calculate accuracy; it is also slightly faster and more efficient than SVM [53]. Algorithm 5 of NB [40] is given below, followed by a minimal usage example:

figure e
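Algorithm 5 is shown as a figure; the snippet below is a minimal illustration of the Gaussian event model used in this paper. The data split is an illustrative choice.

```python
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.naive_bayes import GaussianNB

X, y = load_iris(return_X_y=True)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)

nb = GaussianNB().fit(X_tr, y_tr)  # Gaussian event model
print(nb.predict_proba(X_te[:3]))  # per-class posterior probabilities
print(nb.score(X_te, y_te))        # accuracy on the held-out split
```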

3.5 Random Forest (RF) Classifier

T. Kam Ho [15] introduced RF in 1995; it builds its trees in parallel. RF is an ensemble classification algorithm that combines multiple decision trees to classify an object, where each tree is trained on a randomly selected subset of the training dataset. RF is fast to train compared with techniques such as deep learning, although somewhat slower to predict once trained [3, 25]. Algorithm 6 of RF [46] is given below, followed by a minimal usage example:

figure f
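Algorithm 6 is shown as a figure; the snippet below is a minimal illustration of the ensemble described above, where each tree is grown on a bootstrap sample and predictions are combined by majority vote. The number of trees is an illustrative choice.

```python
from sklearn.datasets import load_digits
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split

X, y = load_digits(return_X_y=True)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)

rf = RandomForestClassifier(n_estimators=100, random_state=0)
rf.fit(X_tr, y_tr)           # trees are trained on bootstrap samples
print(rf.score(X_te, y_te))  # majority-vote accuracy on the held-out split
```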

4 Results and Discussion

This section analyzes the effectiveness and efficiency of NRIC through experiments on high-dimensional, large-scale datasets. We analyze the performance of the proposed algorithm, which combines the LTSA algorithm with the SVM, KNN, NB, and RF classifiers, and we also compare our proposed NRIC method across different neighborhood values K.

4.1 Datasets

The experiments were conducted on large-scale, high-dimensional datasets: Iris [11], Wine [12], Labeled Faces in the Wild (LFW) people [19], Breast Cancer [31], and Digits [26]. Detailed information on the datasets is listed in Table 1.

Table 1 Experimental datasets
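All five datasets are also distributed with scikit-learn, which is one convenient way to reproduce the setup; the sketch below assumes that source (the LFW loader downloads the images on first use, and its exact feature count depends on the resize factor).

```python
from sklearn.datasets import (fetch_lfw_people, load_breast_cancer,
                              load_digits, load_iris, load_wine)

datasets = {
    "Iris": load_iris(),
    "Wine": load_wine(),
    "Breast Cancer": load_breast_cancer(),
    "Digits": load_digits(),
    "LFW people": fetch_lfw_people(),  # downloaded on first call
}
for name, ds in datasets.items():
    print(name, ds.data.shape)  # (n_samples, n_features)
```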

4.2 Evaluation Methods

Various metrics are used to assess the effectiveness of the proposed approach. Classification offers multiple evaluation criteria, such as accuracy, precision, recall, and area under the ROC curve (AUC). Here, TP is the number of correct positive predictions, TN is the number of correct negative predictions, FP is the number of false-positive predictions, and FN is the number of false-negative predictions; the relative importance of these four quantities depends on the classification application. Accuracy is the fraction of correct predictions over all predictions. Precision is the fraction of correct positive predictions over all positive predictions. Recall is the fraction of correct positive predictions over all actual positives [25]. The area under the ROC curve is a well-known evaluation method in machine learning and data mining; the ROC curve plots the True Positive Rate (TPR) against the False Positive Rate (FPR) for a given classification algorithm [48]. ROC graphs are useful for visualizing classifier performance and present the trade-off between TPR and FPR. AUC summarizes the prediction quality of a model in a single value: the greater the AUC value, the better the classifier [10]. The formulas for accuracy, Eq. (1), mean-precision, Eq. (2), mean-recall, Eq. (3), and the quantities underlying the area under the ROC (Receiver Operating Characteristic) curve (AUC), Eqs. (4) and (5), are defined below:

$$\begin{aligned} \text {Accuracy}= & {} \frac{ TP+TN }{ TP+FP+FN+TN } \end{aligned}$$
(1)
$$\begin{aligned} \text {Mean-Precision}= & {} \frac{ TP }{ TP+FP } \end{aligned}$$
(2)
$$\begin{aligned} \text {Mean-Recall}= & {} \frac{ TP }{ TP+FN } \end{aligned}$$
(3)
$$\begin{aligned} TPR= & {} \frac{ TP }{ TP+FN } \end{aligned}$$
(4)
$$\begin{aligned} FPR= & {} \frac{ FP }{ FP+TN } \end{aligned}$$
(5)
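As a hedged illustration of Eqs. (1)-(5), the snippet below computes the four metrics with scikit-learn: macro averaging over the classes gives the mean-precision and mean-recall, and roc_auc_score summarizes the TPR/FPR trade-off. The toy labels and probability scores are invented for demonstration.

```python
from sklearn.metrics import (accuracy_score, precision_score,
                             recall_score, roc_auc_score)

y_true = [0, 0, 1, 1, 2, 2]            # toy ground-truth labels
y_pred = [0, 1, 1, 1, 2, 0]            # toy predicted labels
y_prob = [[.8, .1, .1], [.3, .6, .1],  # toy per-class scores, as
          [.2, .7, .1], [.1, .8, .1],  # returned by predict_proba
          [.1, .2, .7], [.5, .2, .3]]

print(accuracy_score(y_true, y_pred))                    # Eq. (1)
print(precision_score(y_true, y_pred, average="macro"))  # Eq. (2)
print(recall_score(y_true, y_pred, average="macro"))     # Eq. (3)
print(roc_auc_score(y_true, y_prob, multi_class="ovr"))  # from Eqs. (4)-(5)
```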

4.3 Accuracy

Tables 2, 3, 4, 5 and 6 show the accuracy, computed with Eq. (1), on the Iris, Wine, LFW, Breast Cancer, and Digits datasets. The accuracy on the five high-dimensional datasets is plotted in Figs. 1, 2, 3, 4 and 5; our proposed method consistently achieves high accuracy across the neighborhood values K. We used K values of 10, 15, 20, 25, 30, and 40 when computing the accuracy on the five datasets, and compared LTSA combined with the four classifiers SVM, KNN, NB, and RF. These classification techniques work very well with LTSA and provide efficient and effective results for the Isomap noise problem, whereas the Isomap method does not work as well with the classification techniques as our proposed method.

Table 2 Accuracy of Iris dataset with SVM, KNN, NB, and RF

For the Iris dataset, we achieved 100% accuracy for several values of K among 10, 15, 20, 25, 30, and 40. Combined with LTSA, all four classifiers reach 100% accuracy, as shown in Fig. 1. In Table 2, LTSA-SVM attains 100% accuracy for K = 10, 20, 25, and 40, while LTSA-KNN, LTSA-NB, and LTSA-RF attain 100% accuracy only at K = 10. In comparison, Isomap with the same classifiers performs worse: ISO-SVM achieves only 92% accuracy on the Iris dataset at K = 10, ISO-KNN 93%, ISO-NB 91%, and ISO-RF 94%. As Fig. 1 shows, LTSA performs better with SVM, KNN, NB, and RF than Isomap with the classification techniques.

Fig. 1
figure 1

Accuracy of Iris dataset with NRIC and Isomap methods

Table 3 Accuracy of Wine dataset with SVM, KNN, NB, and RF
Table 4 Accuracy of LFW people dataset with SVM, KNN, NB, and RF

For the Wine dataset, LTSA-SVM achieved 100% accuracy only at K = 15, 30, and 40. In Table 3, LTSA-KNN reaches 100% accuracy at K = 15 and 20, LTSA-NB reaches 100% at K = 25, and LTSA-RF reaches 98% at K = 15. In comparison, ISO-SVM achieves only 90% accuracy at K = 10 and 20, ISO-KNN achieves 92% at K = 15, and ISO-NB and ISO-RF achieve 93% and 90%, respectively, both at K = 10. Figure 2 compares the NRIC and Isomap methods with the classification techniques; LTSA works best with SVM, ahead of KNN, NB, and RF, and ahead of Isomap with any of the classifiers.

The LFW people dataset has 13233 data points and 5828 dimensions; we used 100 dimensions when calculating the accuracy of the proposed algorithm. Because the LFW dataset is very large, the accuracies in Table 4 drop considerably. We achieved 93% accuracy only at K = 20 for LTSA-SVM, 88% at K = 30 for LTSA-KNN, and 81% at K = 40 for LTSA-NB, while LTSA-RF reached 68% accuracy for all K values. In comparison, ISO-SVM achieves only 88% accuracy at K = 20 and 40, ISO-KNN 80% and ISO-NB 77%, both at K = 10, and ISO-RF 65%. Figure 3 compares the NRIC and Isomap methods with the classification techniques; LTSA performs well with SVM, KNN, and NB, ahead of RF and of Isomap.

Fig. 2
figure 2

Accuracy of Wine dataset with NRIC and Isomap methods

Fig. 3
figure 3

Accuracy of LFW people dataset with NRIC and Isomap methods

Table 5 Accuracy of Breast Cancer dataset with SVM, KNN, NB, and RF
Fig. 4
figure 4

Accuracy of Breast Cancer dataset with NRIC and Isomap methods

For the Breast Cancer dataset, we achieved 98% accuracy at K = 40 for LTSA-SVM, 94% for LTSA-KNN at K = 40, and 94% for LTSA-NB and LTSA-RF at K = 30, as shown in Table 5. In comparison, ISO-SVM reaches 94% accuracy at K = 20 and 40, ISO-KNN 90% and ISO-NB 88%, both at K = 10, and ISO-RF 90% at K = 30. Figure 4 compares NRIC and Isomap with the classification techniques; LTSA works better with SVM, KNN, NB, and RF than Isomap.

Table 6 Accuracy of Digits dataset with SVM, KNN, NB, and RF

The Digits dataset has 1797 data points and 64 dimensions; we used 50 dimensions when calculating the accuracy of the proposed method. As shown in Table 6, we achieved 97% accuracy for LTSA-SVM at K = 30, and 95% for LTSA-KNN, 94% for LTSA-NB, and 84% for LTSA-RF, all at K = 10. In comparison, ISO-SVM reaches 95% accuracy at K = 30, while ISO-KNN achieves 93%, ISO-NB 92%, and ISO-RF 80%, all at K = 10. Figure 5 compares NRIC and Isomap with the classification techniques; LTSA works better with SVM, KNN, NB, and RF than Isomap.

Fig. 5
figure 5

Accuracy of Digits dataset with NRIC and Isomap methods

Overall, the NRIC method consistently achieves high accuracy on the five high-dimensional datasets. Our NRIC method is much faster than Isomap, reduces the noise in the datasets very effectively, and then easily maps the high-dimensional data onto a low-dimensional manifold. Accuracy rises and falls with the value of K: when the K nearest neighbors lie close to one another they are found easily, whereas when they lie far apart they are hard to identify correctly. Moreover, across the neighborhood values K, some datasets show steadily good accuracy while others show a "V"-shaped accuracy curve; the "V" shape indicates that the optimal K is data-dependent.

4.4 Mean-Precision

Tables 7, 8, 9, 10 and 11 show the mean-precision, computed with Eq. (2), on the Iris, Wine, LFW, Breast Cancer, and Digits datasets. Figures 6, 7, 8, 9 and 10 plot the mean-precision of the five high-dimensional datasets for the NRIC and Isomap methods; our proposed NRIC method consistently achieves high mean-precision across the neighborhood values K. We compared NRIC and Isomap with the four classifiers SVM, KNN, NB, and RF on the five datasets. NRIC with the classification techniques achieves very good mean-precision and provides effective results for the Isomap noise problem.

Table 7 Mean-precision of Iris dataset with SVM, KNN, NB, and RF

For the Iris dataset, we attained 100% mean-precision for several values of K (10, 15, 20, 25, and 40). Table 7 shows that LTSA-SVM provides 100% mean-precision for K = 10, 15, 20, 25, and 40, while LTSA-KNN, LTSA-NB, and LTSA-RF provide 100% mean-precision at K = 10. In comparison, the mean-precision of ISO-SVM is 96%, of ISO-KNN 95%, of ISO-NB 94%, and of ISO-RF 93%, all at K = 10. As Fig. 6 shows, LTSA with SVM, KNN, NB, and RF performs better than Isomap.

Fig. 6
figure 6

Mean-precision of Iris dataset with NRIC and Isomap methods

Table 8 Mean-precision of Wine dataset with SVM, KNN, NB, and RF

For the Wine dataset, LTSA-SVM attained 100% mean-precision only at K = 15, 30, and 40. In Table 8, LTSA-KNN reaches 100% mean-precision at K = 15 and 20, LTSA-NB 100% at K = 25, and LTSA-RF 97% at K = 15. In comparison, ISO-SVM yields 90% mean-precision, ISO-KNN 87%, ISO-NB 92%, and ISO-RF 91%, all at K = 10. Figure 7 shows the mean-precision of the NRIC and Isomap methods with the classification techniques; the mean-precision of LTSA with SVM, KNN, NB, and RF is better than that of the Isomap method.

Fig. 7
figure 7

Mean-precision of Wine dataset with NRIC and Isomap methods

Table 9 Mean-precision of LFW people dataset with SVM, KNN, NB, and RF
Fig. 8
figure 8

Mean-precision of LFW people dataset with NRIC and Isomap methods

For Labeled Faces in the Wild (LFW), LTSA-SVM and LTSA-RF attained 100% mean-precision for all K values (10, 15, 20, 25, 30, 40), as shown in Table 9, while LTSA-KNN reaches 90% (K = 20, 25, 40) and LTSA-NB 83% (K = 40). In comparison, ISO-SVM reaches 89% (K = 20, 40), ISO-KNN 87% (K = 25), ISO-NB 80% (K = 30, 40), and ISO-RF 96% (K = 10). Figure 8 compares the mean-precision of the LTSA and Isomap methods with the classification techniques; LTSA performs better with all classifiers than Isomap.

Table 10 Mean-precision of Breast Cancer dataset with SVM, KNN, NB, and RF
Fig. 9
figure 9

Mean-precision of Breast Cancer dataset with NRIC and Isomap methods

For the Breast Cancer dataset, we reached 98% mean-precision for LTSA-SVM and 95% for LTSA-KNN, both at K = 40, and 95% for LTSA-NB and LTSA-RF at K = 30, as shown in Table 10. In comparison, the mean-precision of ISO-SVM is 95% and of ISO-KNN 91%, both at K = 40, while ISO-NB achieves 90% and ISO-RF 93%, both at K = 30 (Table 10). The mean-precision of NRIC is therefore higher than that of the Isomap method. Figure 9 compares the mean-precision of the LTSA and Isomap methods with the classification techniques; LTSA performs very well with SVM, KNN, NB, and RF, ahead of the Isomap method.

Table 11 Mean-precision of Digits dataset with SVM, KNN, NB, and RF
Fig. 10
figure 10

Mean-precision of Digits dataset with NRIC and Isomap methods

For the Digits dataset, LTSA-SVM reached a high mean-precision of 97% at K = 25 and 30, as shown in Table 11. LTSA-KNN attains 95% (K = 10, 20), and LTSA-NB 94% and LTSA-RF 87%, both at K = 10. In comparison, ISO-SVM achieves 94% at K = 30 and 40, while ISO-KNN reaches 93%, ISO-NB 90%, and ISO-RF 84%, all at K = 10. Figure 10 compares the mean-precision of the LTSA and Isomap methods with the classification techniques; LTSA performs better with SVM, KNN, NB, and RF than Isomap. Overall, the NRIC method consistently attains high mean-precision on the five high-dimensional datasets. Mean-precision fluctuates with the value of K and the classification technique: when the K nearest neighbors lie close together they are found easily, and when they lie far apart they are found only with difficulty.

4.5 Mean-Recall

Tables 12, 13, 14, 15 and 16 show the mean-recall, computed with Eq. (3), on the Iris, Wine, LFW, Breast Cancer, and Digits datasets. Figures 11, 12, 13, 14 and 15 plot the mean-recall of the five high-dimensional datasets; our proposed NRIC method consistently reaches high mean-recall on all five. We used the same neighborhood values of K as for accuracy and mean-precision. LTSA with the classification techniques achieves very good mean-recall and provides effective results for the proposed NRIC method.

For the Iris dataset, LTSA-SVM attained 100% mean-recall for K = 10, 15, 20, 25, and 40, while LTSA-KNN, LTSA-NB, and LTSA-RF provide 100% mean-recall at K = 10, as shown in Table 12. In comparison, ISO-KNN reaches 93% mean-recall at K = 10, 15, and 20, while ISO-SVM provides 93%, ISO-NB 92%, and ISO-RF 94%, all at K = 10. The performance of Isomap is below that of LTSA with the classification techniques: as Fig. 11 shows, the mean-recall of LTSA with SVM, KNN, NB, and RF is better than that of the Isomap method.

Table 12 Mean-recall of Iris dataset with SVM, KNN, NB, and RF
Fig. 11
figure 11

Mean-recall of Iris dataset with NRIC and Isomap methods

Table 13 Mean-recall of Wine dataset with SVM, KNN, NB, and RF
Fig. 12
figure 12

Mean-recall of Wine dataset with NRIC and Isomap methods

For the Wine dataset, LTSA-SVM attained 100% mean-recall at K = 15, 30, and 40. In Table 13, the mean-recall of LTSA-KNN is 100% (K = 15 and 20), of LTSA-NB 100% (K = 25), and of LTSA-RF 97% (K = 15). In comparison, the mean-recall of ISO-SVM is 90% (K = 10), of ISO-KNN 92% (K = 15), of ISO-NB 92% (K = 10), and of ISO-RF 90% (K = 10). Figure 12 shows the mean-recall of the LTSA algorithm with the classification techniques and of Isomap; the mean-recall of LTSA with NB, RF, KNN, and SVM is higher than that of the Isomap method.

Table 14 Mean-recall of LFW people dataset with SVM, KNN, NB, and RF
Fig. 13
figure 13

Mean-recall of LFW people dataset with NRIC and Isomap methods

For Labeled Faces in the Wild (LFW), LTSA-SVM attained 93% mean-recall at K = 20, LTSA-KNN 88% at K = 30, and LTSA-NB 81% at K = 40, while LTSA-RF has the same value of 68% for all K, as shown in Table 14. In comparison, the mean-recall of ISO-SVM is 89% at K = 40, of ISO-KNN 80% at K = 10, and of ISO-NB 80% at K = 40, while ISO-RF has the same value of 65% for all K (Table 14). Figure 13 compares the mean-recall of the LTSA algorithm with the classification techniques and Isomap; LTSA performs better with SVM, KNN, and NB than with RF, and better than Isomap.

Table 15 Mean-recall of Breast Cancer dataset with SVM, KNN, NB, and RF
Fig. 14
figure 14

Mean-recall of Breast Cancer dataset with NRIC and Isomap methods

For the Breast Cancer dataset, we reached 98% mean-recall at K = 40 for LTSA-SVM, 94% for LTSA-KNN at K = 10, and 94% for LTSA-NB and LTSA-RF at K = 30, as shown in Table 15. In comparison, the mean-recall of ISO-SVM is 98% (K = 20), of ISO-KNN 88% (K = 15), of ISO-NB 89% (K = 20, 30), and of ISO-RF 91% (K = 10, 30). Note that ISO-SVM outperforms LTSA-SVM at K = 20, and ISO-RF outperforms LTSA-RF at K = 10. Figure 14 compares the mean-recall of the LTSA algorithm with the classification techniques and Isomap.

For the Digits dataset, LTSA-SVM reached a high mean-recall of 97% at K = 30, as shown in Table 16. LTSA-KNN attains 95% (K = 10, 20), and LTSA-NB 94% and LTSA-RF 84%, both at K = 10. In comparison, ISO-SVM attains 92% at K = 10, 15, 20, and 25, ISO-KNN 93% at K = 10, ISO-NB 90% (K = 10, 15, 20, 25, and 40), and ISO-RF 80% (K = 10, 20). Figure 15 compares the mean-recall of the LTSA algorithm with the classification techniques and Isomap; LTSA performs better with KNN, NB, and RF than with SVM, and better than Isomap. Overall, the NRIC method consistently attains high mean-recall on the five high-dimensional datasets; mean-recall rises and falls with the neighborhood value K and the classification technique.

4.6 Area Under the ROC Curve (AUC)

Tables 17, 18, 19, 20 and 21 show the area under the ROC (Receiver Operating Characteristic) curve (AUC), computed from Eqs. (4) and (5), on the Iris, Wine, LFW, Breast Cancer, and Digits datasets. Figures 16, 17, 18, 19 and 20 plot the AUC values of the five high-dimensional datasets; our proposed NRIC method consistently reaches high AUC values on all five. We used the same neighborhood values of K as for accuracy, mean-precision, and mean-recall. The AUC of LTSA with the classification techniques is very good and provides effective results for the proposed NRIC method.

Table 16 Mean-recall of Digits dataset with SVM, KNN, NB, and RF
Fig. 15
figure 15

Mean-recall of Digits dataset with NRIC and Isomap methods

Table 17 AUC of Iris dataset with SVM, KNN, NB, and RF
Fig. 16
figure 16

ROC curve of Iris dataset with NRIC and Isomap methods

We computed AUC values for the Iris dataset for different K values (10, 15, 20, 25, and 40) in Table 17. The AUC values of LT-SVM-A, LT-KNN-A, LT-NB-A, and LT-RF-A are 0.734, 0.736, 0.776, and 0.779, respectively, attained at different values of K (Table 17). In comparison, the AUC values are 0.697 for IS-SVM-A, 0.693 for IS-KNN-A, 0.761 for IS-NB-A, and 0.682 for IS-RF-A, again at different values of K. According to Fig. 16, the ROC curves of LTSA with RF and NB are noticeably better than those with SVM and KNN, since RF and NB give the highest AUC; this shows that the RF and NB models improve the performance of our proposed NRIC method. The ROC curves of the Isomap method with the classification techniques do not match those of our proposed method.

Table 18 AUC of Wine dataset with SVM, KNN, NB, and RF
Fig. 17
figure 17

ROC curve of Wine dataset with NRIC and Isomap methods

For the Wine dataset, the AUC values of LT-SVM-A, LT-KNN-A, LT-NB-A, and LT-RF-A are 0.655, 0.584, 0.596, and 0.518, respectively, at different values of K (Table 18). According to Fig. 17, the ROC curve of LTSA with SVM is better than with KNN, NB, and RF, since SVM gives the highest AUC; this shows that the SVM model significantly improves the performance of our proposed NRIC method. In comparison, the AUC values are 0.540 for IS-SVM-A, 0.451 for IS-KNN-A, 0.507 for IS-NB-A, and 0.500 for IS-RF-A. The ROC curves of the Isomap method with the classification techniques are not better than those of our proposed NRIC method.

Table 19 AUC of LFW people dataset with SVM, KNN, NB, and RF

For the LFW people dataset, the AUC values are 0.574 for LT-SVM-A, 0.541 for LT-KNN-A, 0.576 for LT-NB-A, and 0.737 for LT-RF-A, at different values of K (Table 19). In Fig. 18, the ROC curve of LTSA with RF is better than with SVM, KNN, and NB; this shows that the RF model significantly improves the performance of our proposed NRIC method. In comparison, the AUC values are 0.554 for IS-SVM-A, 0.480 for IS-KNN-A, 0.507 for IS-NB-A, and 0.580 for IS-RF-A. The Isomap method likewise works best with the RF model; still, the ROC curves of the Isomap method with the classification techniques are not better than those of our proposed NRIC method.

Fig. 18
figure 18

ROC curve of LFW people dataset with NRIC and Isomap methods

For the Breast Cancer dataset, the AUC values are 0.744 for LT-SVM-A, 0.491 for LT-KNN-A, 0.621 for LT-NB-A, and 0.521 for LT-RF-A, at different values of K (Table 20). In Fig. 19, the ROC curve of LTSA with SVM is better than with KNN, NB, and RF; the SVM model improves the performance of our proposed NRIC method more than the other classification models. In comparison, the AUC values are 0.425 and 0.547 for IS-SVM-A, 0.471 for IS-KNN-A, 0.560 for IS-NB-A, and 0.626 for IS-RF-A. The Isomap method works best with the RF model, and the IS-SVM-A value is higher than LT-SVM-A at K = 15 in Table 20; nevertheless, the ROC curves of the Isomap method with the classification techniques are not better than those of our proposed NRIC method.

Table 20 AUC of Breast Cancer dataset with SVM, KNN, NB, and RF
Table 21 AUC of Digits dataset with SVM, KNN, NB, and RF
Fig. 19
figure 19

ROC curve of Breast Cancer dataset with NRIC and Isomap methods

Fig. 20
figure 20

ROC curve of Digits dataset with NRIC and Isomap methods

For the Digits dataset, the AUC values are 0.714 for LT-SVM-A (K = 40), 0.528 for LT-KNN-A (K = 20), 0.528 for LT-NB-A (K = 20), and 0.530 for LT-RF-A (K = 25 and 30), as shown in Table 21. In Fig. 20, the ROC curve of LTSA with SVM is better than with KNN, NB, and RF; the SVM model improves our proposed NRIC method's performance more than the other classification models. In comparison, the AUC values are 0.582 for IS-SVM-A (K = 30), 0.484 for IS-KNN-A (K = 40), 0.524 and 0.528 for IS-NB-A (K = 10 and 20), and 0.447 for IS-RF-A (K = 10). The Isomap method works best with the SVM model, and the IS-RF-A value is higher than LT-RF-A at K = 10 in Table 21; however, the overall ROC curves of the Isomap method with the classification techniques are not better than those of our proposed NRIC method.

5 Conclusion

This paper proposed the Noise Removal Isomap with Classification (NRIC) method for the Isomap noise problem. The paper focuses on the noisy behavior of nonlinear manifold learning methods such as Isomap, whose main problem is its sensitivity to noise and its inability to reproduce the data easily after de-noising. Our proposed method can easily map high-dimensional data into a low-dimensional space; it identifies the noise among the data points and eliminates it from the datasets. We combined four classification techniques, SVM, KNN, NB, and RF, with the LTSA algorithm to counter the noise sensitivity of Isomap and improve accuracy on the datasets, and we compared the four techniques on five different datasets to gain insight into which is best suited to the Isomap noise problem. We calculated accuracy, mean-precision, mean-recall, and area under the ROC curve (AUC) for the NRIC method. Our experimental results show that the proposed method is much more efficient than Isomap, provides highly accurate results on high-dimensional datasets, and can easily optimize the neighborhood graph. In future work, we will analyze the proposed method's performance on other datasets, calculate further classification metrics, and analyze the time complexity. We will also apply the same idea to other manifold learning techniques.