
1 Introduction

Dimensionality reduction is one of the most useful tools for data analysis in data mining. Many dimensionality reduction algorithms extract their best features only when their parameters are carefully tuned. However, tuning parameters during dimensionality reduction significantly increases the time cost. Cloud computing [6, 7], with its supercomputing power, makes it possible to extract features more efficiently while tuning parameters.

The most popular dimensionality reduction algorithms include locally linear embedding (LLE) [1], ISOMAP [2], and Laplacian eigenmap (LE) [3]. These algorithms only provide the embedding results for the training samples. Many extensions attempt to solve the out-of-sample problem, such as locality preserving projections (LPP) [4, 5]. These algorithms preserve local information by constructing an adjacency graph, but they do not work well for classification because they are unsupervised.

Many supervised algorithms have been proposed to overcome the aforementioned drawbacks. Linear discriminant analysis (LDA) was proposed in [8,9,10]; Yan et al. proposed marginal Fisher analysis (MFA) [11]; Zhang et al. proposed discriminant neighborhood embedding (DNE) [12]; and Ding et al. proposed similarity-balanced discriminant neighborhood embedding (SBDNE) [14] and double adjacency graph-based discriminant neighborhood embedding (DAG-DNE) [13]. However, these algorithms may not account for the different degrees of importance of intra-class and inter-class information, which matters when learning the projection matrix.

Inspired by recent progress, in this study, we propose a novel supervised discriminant subspace learning algorithm called locality balanced double adjacency graphs-based discriminant neighbor embedding (LBDAG-DNE). Following DAG-DNE, LBDAG-DNE constructs two adjacency graphs, which link every sample to its homogeneous and heterogeneous neighbors, to preserve the intra-class and inter-class information, respectively. In addition, LBDAG-DNE introduces a parameter that balances the intra-class and inter-class information according to the requirements of the situation. Thus, LBDAG-DNE maintains the balance between intra-class and inter-class information and finds an optimal projection matrix. Experimental results validate the effectiveness of LBDAG-DNE in comparison with several related state-of-the-art methods.

The rest of this paper is structured as follows. In Sect. 2, we provide a summary of the classic algorithms. Our LBDAG-DNE algorithm is introduced in Sect. 3. The experimental results are presented in Sect. 4. Finally, we provide the concluding remarks in Sect. 5.

2 Related Work

Over the past few years, dimensionality reduction techniques have received much attention, and correspondingly, many algorithms have been proposed [11,12,13, 15]. We will briefly introduce some of the classic algorithms in this section.

Yan et al. [11] proposed MFA in 2005. By constructing two adjacency graphs, MFA finds a projection matrix that simultaneously minimizes the intra-class scatter and maximizes the inter-class scatter. However, it cannot determine the optimal discriminant subspace.

Soon after this, Zhang et al. [12] proposed DNE. It maintains the local structure and distinguishes homogeneous and heterogeneous neighbors by constructing an adjacency graph, which can determine the optimal discriminant subspace. However, DNE does not construct a link between each point and its heterogeneous neighbors when constructing the adjacency graph.

Recently, Ding et al. [13] proposed DAG-DNE, which can effectively solve the problem of DNE and LDNE, with each sample respectively linked to its homogeneous and heterogeneous neighbors by constructing double adjacency graphs. However, DAG-DNE simply considers intra-class information and inter-class information to have the same degree of importance. In actuality, they play different roles in the classification task.

In summary, the above algorithms tend to treat intra-class and inter-class information as equally important, even though the two play different roles in the classification task. Thus, when the data are projected into a low-dimensional space, some of the more important discriminative information may be missed.

3 Our Proposed LBDAG-DNE

3.1 LBDAG-DNE

Let \( \{ ({\mathbf{x}}_{i} ,y_{i} )\}_{i = 1}^{N} \) be a set of training samples, where \( {\mathbf{x}}_{i} \in R^{d} \) and \( y_{i} \in \{ 1,2, \ldots ,C\} \). LBDAG-DNE aims to find a projection matrix \( {\mathbf{P}} \) that projects the data from the high-dimensional space into a low-dimensional space via \( {\mathbf{v}}_{i} = {\mathbf{P}}^{T} {\mathbf{x}}_{i} \), so that neighbors belonging to the same class become compact while neighbors belonging to different classes become separable.

Similar to DAG-DNE, LBDAG-DNE requires the construction of two adjacency graphs. Let \( {\mathbf{F}}^{w} \) and \( {\mathbf{F}}^{b} \) be the intra-class and inter-class adjacency matrices, respectively. For a sample \( {\mathbf{x}}_{i} \), \( NH_{k}^{w} ({\mathbf{x}}_{i} ) \) and \( NH_{k}^{b} ({\mathbf{x}}_{i} ) \) denote its \( K \) homogeneous and heterogeneous neighbors, respectively.

The intra-class adjacency matrix \( {\mathbf{F}}^{w} \) is defined as

$$ F_{ij}^{w} = \left\{ {\begin{array}{*{20}l} { + 1,} \hfill & {{\mathbf{x}}_{i} \in NH_{k}^{w} ({\mathbf{x}}_{j} )\;or\;{\mathbf{x}}_{j} \in NH_{k}^{w} ({\mathbf{x}}_{i} )} \hfill \\ {0,} \hfill & {otherwise} \hfill \\ \end{array} } \right. $$
(1)

and the inter-class adjacency matrix \( {\mathbf{F}}^{b} \) is

$$ F_{ij}^{b} = \left\{ {\begin{array}{*{20}l} { + 1,} \hfill & {{\mathbf{x}}_{i} \in NH_{k}^{b} ({\mathbf{x}}_{j} )\;or\;{\mathbf{x}}_{j} \in NH_{k}^{b} ({\mathbf{x}}_{i} )} \hfill \\ {0,} \hfill & {otherwise} \hfill \\ \end{array} } \right. $$
(2)
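
To make the graph construction concrete, the following minimal NumPy sketch (ours, not from the paper; all function and variable names are illustrative) builds the two symmetric adjacency matrices from a data matrix with one sample per column and a label vector, assuming Euclidean distances and a neighborhood size \( K \).

```python
import numpy as np

def build_adjacency_graphs(X, y, K):
    """Construct the intra-class (Fw) and inter-class (Fb) adjacency matrices.

    X : (d, N) array with one sample per column.
    y : (N,) array of class labels.
    K : number of homogeneous / heterogeneous nearest neighbors.
    """
    N = X.shape[1]
    # Pairwise squared Euclidean distances between samples.
    sq = np.sum(X ** 2, axis=0)
    dist = sq[:, None] + sq[None, :] - 2.0 * X.T @ X
    np.fill_diagonal(dist, np.inf)  # a sample is never its own neighbor

    Fw = np.zeros((N, N))
    Fb = np.zeros((N, N))
    for i in range(N):
        same = np.where(y == y[i])[0]
        same = same[same != i]
        diff = np.where(y != y[i])[0]
        hom = same[np.argsort(dist[i, same])[:K]]  # K nearest homogeneous neighbors
        het = diff[np.argsort(dist[i, diff])[:K]]  # K nearest heterogeneous neighbors
        Fw[i, hom] = 1.0
        Fw[hom, i] = 1.0  # symmetric, matching the "or" condition in Eq. (1)
        Fb[i, het] = 1.0
        Fb[het, i] = 1.0  # symmetric, matching the "or" condition in Eq. (2)
    return Fw, Fb
```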

The intra-class scatter is defined as follows:

$$ \begin{aligned} \varPhi ({\mathbf{P}}) = & \,\sum\limits_{i,j} {||{\mathbf{P}}^{T} {\mathbf{x}}_{i} - {\mathbf{P}}^{T} {\mathbf{x}}_{j} ||^{2} } F_{ij}^{w} \\ \, = & \,2tr\{ {\mathbf{P}}^{T} {\mathbf{X}}({\mathbf{D}}^{w} - {\mathbf{F}}^{w} ){\mathbf{X}}^{T} {\mathbf{P}}\} \\ \end{aligned} $$
(3)

where \( {\mathbf{D}}^{w} \) is a diagonal matrix, and its entries are the column sums of \( {\mathbf{F}}^{w} \).

The inter-class scatter is as follows:

$$ \begin{aligned} \varPsi ({\mathbf{P}}) = & \,\sum\limits_{i,j} {||{\mathbf{P}}^{T} {\mathbf{x}}_{i} - {\mathbf{P}}^{T} {\mathbf{x}}_{j} ||^{2} } F_{ij}^{b} \\ \, = & \,2tr\{ {\mathbf{P}}^{T} {\mathbf{X}}({\mathbf{D}}^{b} - {\mathbf{F}}^{b} ){\mathbf{X}}^{T} {\mathbf{P}}\} \\ \end{aligned} $$
(4)

where \( {\mathbf{D}}^{b} \) is a diagonal matrix, and its entries are the column sums of \( {\mathbf{F}}^{b} \).
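
The trace forms in Eqs. (3) and (4) follow from the standard graph Laplacian identity. As a brief sketch for the intra-class case (the inter-class case is identical with \( {\mathbf{F}}^{b} \) and \( {\mathbf{D}}^{b} \)),

$$ \begin{aligned} \sum\limits_{i,j} {||{\mathbf{P}}^{T} {\mathbf{x}}_{i} - {\mathbf{P}}^{T} {\mathbf{x}}_{j} ||^{2} } F_{ij}^{w} = & \,\sum\limits_{i,j} {F_{ij}^{w} tr\{ {\mathbf{P}}^{T} ({\mathbf{x}}_{i} - {\mathbf{x}}_{j} )({\mathbf{x}}_{i} - {\mathbf{x}}_{j} )^{T} {\mathbf{P}}\} } \\ = & \,2tr\{ {\mathbf{P}}^{T} ({\mathbf{XD}}^{w} {\mathbf{X}}^{T} - {\mathbf{XF}}^{w} {\mathbf{X}}^{T} ){\mathbf{P}}\} \\ = & \,2tr\{ {\mathbf{P}}^{T} {\mathbf{X}}({\mathbf{D}}^{w} - {\mathbf{F}}^{w} ){\mathbf{X}}^{T} {\mathbf{P}}\} \\ \end{aligned} $$

where the second step uses the symmetry of \( {\mathbf{F}}^{w} \) and \( D_{ii}^{w} = \sum\nolimits_{j} {F_{ij}^{w} } \).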

The goal is to allow neighbors belonging to the same class be compact, while neighbors belonging to different classes become separable in the subspace. We need to maximize the margin of total inter-class scatter and total intra-class scatter, i.e.,

$$ \Theta ({\mathbf{P}}) =\Psi ({\mathbf{P}}) - \beta\Phi ({\mathbf{P}}) $$
(5)

where \( \beta \in [0,10] \) is a tuning parameter that controls the tradeoff between intra-class information and inter-class information.

LBDAG-DNE seeks the projection matrix \( {\mathbf{P}} \) by solving the following objective function. The derivation and theoretical justification are similar to those of DAG-DNE, and the details can be found in [13].

$$ \left\{ {\begin{array}{*{20}l} {\mathop {\text{max}}\limits_{{\mathbf{P}}} } \hfill & {{\text{tr}}\left\{ {{\mathbf{P}}^{T} {\mathbf{XSX}}^{T} {\mathbf{P}}} \right\}} \hfill \\ {s.t.} \hfill & {{\mathbf{P}}^{T} {\mathbf{P}}\,{ = }\,{\mathbf{I}}} \hfill \\ \end{array} } \right. $$
(6)

where \( {\mathbf{S}} = ({\mathbf{D}}^{b} - {\mathbf{F}}^{b} ) - \beta ({\mathbf{D}}^{w} - {\mathbf{F}}^{w} ) \).

The projection matrix \( {\mathbf{P}} \) can be found by solving the following eigenvalue problem:

$$ {\mathbf{XSX}}^{T} {\mathbf{P}} = \lambda {\mathbf{P}} $$
(7)

Thus, \( {\mathbf{P}} \) is composed of the \( r \) eigenvectors corresponding to the \( r \) largest eigenvalues.

The details for LBDAG-DNE are given in Algorithm 1.

Algorithm 1. LBDAG-DNE
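
Since the original Algorithm 1 figure is not reproduced here, the following is a minimal NumPy sketch of the overall procedure (ours, under the notation of Sect. 3.1; it reuses the build_adjacency_graphs helper sketched after Eq. (2)).

```python
import numpy as np

def lbdag_dne(X, y, K, beta, r):
    """Minimal sketch of LBDAG-DNE following Eqs. (1)-(7).

    X    : (d, N) training matrix, one sample per column.
    y    : (N,) class labels.
    K    : neighborhood size for both adjacency graphs.
    beta : balance parameter between inter-class and intra-class scatter.
    r    : dimensionality of the discriminant subspace.
    Returns the (d, r) projection matrix P.
    """
    Fw, Fb = build_adjacency_graphs(X, y, K)      # adjacency graphs, Eqs. (1)-(2)
    Dw = np.diag(Fw.sum(axis=0))                  # column sums of Fw
    Db = np.diag(Fb.sum(axis=0))                  # column sums of Fb
    S = (Db - Fb) - beta * (Dw - Fw)              # matrix S in Eq. (6)
    M = X @ S @ X.T                               # d x d symmetric matrix
    eigvals, eigvecs = np.linalg.eigh(M)          # eigenvalues in ascending order
    top = np.argsort(eigvals)[::-1][:r]           # indices of the r largest eigenvalues
    return eigvecs[:, top]                        # P satisfies P^T P = I, Eq. (7)

# A new sample x is then embedded as v = P.T @ x.
```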

3.2 Connection Between LBDAG-DNE and DAG-DNE

By constructing two adjacency graphs, DAG-DNE can maintain the local intrinsic structure for the original data in the subspace, allowing it to effectively find optimal discriminant directions. However, DAG-DNE simply considers the intra-class information and inter-class information to have the same degree of importance. In actuality, they play different roles in the classification task. Thus, when projected into the low-dimensional space, some more important discriminative information may be missed. LBDAG-DNE regulates the different levels of the intra-class information and inter-class information by introducing a balance factor. As a result, LBDAG-DNE can adjust the balance factor according to the actual situation to achieve a good performance.

4 Experiments and Analysis

4.1 Data Sets

We conducted experiments on two publicly available data sets: MNIST and UMIST. Brief descriptions of these data sets are given below (see Table 1 for some important statistics):

Table 1. Data sets used in our experiments

MNIST is a data set of handwritten digits. Each image is represented as a 784-dimensional vector.

UMIST is a face image data set covering a range of race, sex, and appearance, which we downsampled to a size of \( 32 \times 32 \) for computational efficiency.

4.2 Experimental Setup

All of the algorithms were implemented in MATLAB 2012b and executed on an Intel(R) Core(TM) i5 CPU at 2.50 GHz with 4 GB of RAM. Our experiments require the nearest neighbor parameter \( K \) to construct the adjacency graphs. For simplicity, the nearest neighbor (NN) classifier was used to classify test images in the projected spaces.
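
As an illustration of this protocol (a sketch under our own naming, assuming the lbdag_dne helper from Sect. 3), the test images are projected with the learned matrix and classified with a plain 1-NN rule:

```python
import numpy as np

def nn_classify(V_train, y_train, V_test):
    """1-NN classification in the projected space (columns are samples)."""
    sq_tr = np.sum(V_train ** 2, axis=0)
    sq_te = np.sum(V_test ** 2, axis=0)
    dist = sq_te[:, None] + sq_tr[None, :] - 2.0 * V_test.T @ V_train
    return y_train[np.argmin(dist, axis=1)]  # label of the nearest training sample

# Illustrative workflow (parameter values are placeholders, not the paper's settings):
# P = lbdag_dne(X_train, y_train, K=3, beta=1.0, r=20)
# y_pred = nn_classify(P.T @ X_train, y_train, P.T @ X_test)
```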

4.3 Comparison Algorithms

To demonstrate the effectiveness and efficiency of our proposed LBDAG-DNE, we compared it with the following three state-of-the-art algorithms:

(1) DNE: discriminant neighborhood embedding proposed in [12].

(2) MFA: marginal Fisher analysis proposed in [11].

(3) DAG-DNE: double adjacency graphs-based discriminant neighborhood embedding proposed in [13].

4.4 Performance Metric

The classification result was evaluated by comparing the obtained label of each sample with the label provided by the data set. We used the accuracy [11, 12] to measure the classification performance. Given a data point \( {\mathbf{x}}_{i} \), let \( c({\mathbf{x}}_{i} ) \) and \( c'({\mathbf{x}}_{i} ) \) be the obtained classification label and the label provided by the data set, respectively. The accuracy is defined as follows:

$$ Accuracy = \frac{{\sum\nolimits_{i = 1}^{N} {\delta (c({\mathbf{x}}_{i} ),c'({\mathbf{x}}_{i} ))} }}{N} $$
(9)

where \( N \) is the total number of samples, and \( \delta (a,b) \) is the delta function that equals one if \( a = b \) and equals zero otherwise.
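
As a quick illustration, Eq. (9) reduces to the mean of the element-wise delta between the predicted and provided labels; a minimal NumPy version (ours) is:

```python
import numpy as np

def accuracy(y_pred, y_true):
    """Classification accuracy as in Eq. (9): fraction of matching labels."""
    y_pred = np.asarray(y_pred)
    y_true = np.asarray(y_true)
    return np.mean(y_pred == y_true)
```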

4.5 Experimental Results

To evaluate the effectiveness and correctness of the proposed algorithm, experiments were carried out on the MNIST and UMIST databases, and the results were compared with those of DNE, MFA, and DAG-DNE.

In the parameter selection step, we randomly selected 60% of the images in the training set for training and used the remaining 40% as a validation set; the validation results were then used to choose \( \beta \).
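
A sketch of this selection step is given below (ours; the candidate grid for \( \beta \) is an assumption within the range \( [0,10] \) stated in Sect. 3, and the helpers lbdag_dne, nn_classify, and accuracy are the sketches given earlier).

```python
import numpy as np

def select_beta(X, y, K, r, betas, seed=0):
    """Choose beta on a random 60/40 split of the training data."""
    rng = np.random.default_rng(seed)
    N = X.shape[1]
    idx = rng.permutation(N)
    n_tr = int(0.6 * N)
    tr, va = idx[:n_tr], idx[n_tr:]
    best_beta, best_acc = None, -1.0
    for beta in betas:
        P = lbdag_dne(X[:, tr], y[tr], K, beta, r)
        y_pred = nn_classify(P.T @ X[:, tr], y[tr], P.T @ X[:, va])
        acc = accuracy(y_pred, y[va])
        if acc > best_acc:
            best_beta, best_acc = beta, acc
    return best_beta

# e.g. beta = select_beta(X_train, y_train, K=3, r=20, betas=np.linspace(0, 10, 21))
```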

4.5.1 Results with Handwritten Dataset

For the MNIST data set, we considered five classes, including the digits 1, 3, 5, 7, and 9. For each class, we randomly selected 50 samples from the original training set as our training samples, and 50 samples from the original test set as our test samples. Figure 1 shows some image samples from the MNIST dataset. The performances of the four methods are reported in Fig. 2. We used \( K = 1 \), 3, 5, and 7 to construct the adjacency graphs for all the methods.

Fig. 1. Sample images from the MNIST database

Fig. 2. Classification accuracy on the MNIST database

Here, we mainly focus on the effect of the dimensionality of the discriminant subspace on the classification accuracy under different choices of the nearest neighbor parameter \( K \). Without prior knowledge, \( K \) was set to 1, 3, 5, and 7. PCA was utilized to reduce the dimensionality from 784 to 80. We repeated 30 trials and report the average results. Figure 2(a), (c), (e), and (g) shows the accuracy of the four methods for different dimensions and different values of \( K \): the classification accuracies of all four methods increase rapidly and then become almost stable. More importantly, LBDAG-DNE clearly performs better than DNE, MFA, and DAG-DNE across a wide range of dimensionalities on the MNIST dataset, and its accuracy increases the most rapidly.

From Fig. 2(b), (d), (f), and (h), we can observe that LBDAG-DNE obtains good performance with a relatively low-dimensional discriminant subspace, which reduces the computational complexity while improving the classification performance.

Thus, the experimental results on the MNIST dataset illustrate that LBDAG-DNE outperforms the other algorithms. In spite of the variation in \( K \), LBDAG-DNE has the highest recognition accuracy among these methods.

4.5.2 Results with UMIST Dataset

For the UMIST dataset, we randomly selected 20% of the images from the database as training samples, with the remaining 80% used as test samples. Figure 3 shows some image samples from the UMIST dataset. We repeated 20 runs and report the average results and corresponding parameters in Table 2.

Fig. 3. Sample face images from the UMIST database.

Table 2. Best average recognition rates of all methods on UMIST dataset.

First, we consider the parameter selection. The nearest neighbor parameter \( K \) is selected from the set \( \{ 1,3\} \). Figure 4 illustrates the relationship between the accuracy and the value of \( \beta \), the tuning parameter that balances the tradeoff between intra-class and inter-class information. From Fig. 4, we see that the accuracy is not the highest when \( \beta = 1 \), which confirms that intra-class and inter-class information play different roles in the classification task.

Fig. 4. Average recognition rates vs. \( \beta \)

Figure 5(a) and (c) shows the accuracies of the four methods vs. the dimensionality of the subspace for different values of \( K \). Figure 5(b) and (d) shows the subspace dimension at which each method attains its best accuracy. As seen in Fig. 5(a) and (c), the classification accuracies of all four algorithms increase rapidly, with LBDAG-DNE increasing the fastest. From Fig. 5(b) and (d), we can see that LBDAG-DNE achieves good performance with the lowest-dimensional discriminant subspace.

Fig. 5. Recognition rates for different parameters on the UMIST database

Furthermore, Table 2 reports the best average recognition rates on the test sets for all of the methods, along with the corresponding dimension of the reduced subspace under different values of \( K \). In spite of the variation in \( K \), LBDAG-DNE has the highest recognition rate among these algorithms.

Based on the results of the handwriting and face recognition experiments, LBDAG-DNE achieves the best classification performance compared to DNE, MFA, and DAG-DNE. This suggests that intra-class and inter-class information have different degrees of importance for classification; in other words, they play different roles in the classification task. Moreover, the superiority of LBDAG-DNE was demonstrated in all of the experiments. By using LBDAG-DNE to extract effective features, we can reduce the computational complexity and improve the classification performance.

5 Conclusion

The superior computing power of cloud computing makes it possible to tune parameters to select the best features. In this paper, we proposed a novel supervised discriminant subspace learning algorithm, called LBDAG-DNE, with the goal of learning a good embedded subspace from the original high-dimensional space for classification. LBDAG-DNE maintains the intra-class and inter-class structure by constructing two adjacency graphs and, by introducing a balance parameter, regulates the different degrees of importance of intra-class and inter-class information. Thus, LBDAG-DNE can find an optimal projection matrix. Experimental results show that LBDAG-DNE achieves the best classification performance in comparison with several related state-of-the-art methods.