1 Introduction

Among biometric recognition technologies, face recognition has become one of the most active areas of pattern recognition and computer vision, owing to its huge application potential in public safety, intelligent access control, criminal investigation and other fields, together with its strong adaptability, high security and non-contact interaction. A traditional face recognition pipeline generally includes four steps: face detection, image preprocessing, feature extraction and classification, among which feature extraction and classification are the core. Under ideal conditions, namely when the reference image is acquired under the same conditions as the image to be recognized, the subject is enrolled in the database and the database size is moderate, traditional face recognition methods can achieve satisfactory results. In practice, the situation is more complex: changes in illumination, appearance and facial expression, as well as aging and occlusion, all pose new demands and challenges for traditional methods. Initially, face recognition was studied as a general pattern recognition problem and was mainly implemented with methods based on geometric structural features of the face; these produced no very significant research results and were never applied in practice. Afterwards, many classic face recognition algorithms were proposed during a period of rapid progress [14, 21-23, 28], and the well-known Eigenface [22] and Fisherface [21] methods date from this period. The finding that template matching outperforms geometric feature methods, together with the success of Eigenface, ended research on face recognition based on structural features. It also greatly promoted the development of appearance-based linear subspace modeling [16, 24] and of face recognition based on statistical pattern recognition [13, 19]. Overall, the face recognition algorithms proposed in this period perform well under ideal conditions and have been applied in commercial face recognition systems.

With the development of face recognition technology, recognition systems can achieve satisfactory performance under controlled conditions. However, the mainstream techniques could not handle illumination and pose changes, expression changes, aging, occlusion, and low-quality images, which cause the recognition rate to drop significantly. Researchers therefore began to study face recognition in uncontrolled environments. Illumination and pose changes are the factors most often considered in the face recognition problem. Illumination change manifests mainly as changes in illumination intensity and angle. The gray-level distribution difference between two images of the same face under different lighting can exceed that between images of different faces under the same lighting, which lowers the recognition rate. Similarly, in multi-pose face recognition, when the face pose changes within or outside the image plane, the difference between images of the same person under different poses can be greater than that between images of different persons under the same pose, causing recognition errors.

To achieve robustness against illumination and pose changes, models have expanded from 2D to 3D. Existing methods for coping with illumination can be divided into two classes: illumination compensation based on transformation and illumination compensation based on illumination sample synthesis. Transformation-based illumination compensation is the earlier approach, mainly including histogram equalization [29], nonlinear transformation [5] and homomorphic filtering [7]; these methods cannot produce ideal face images under extreme illumination conditions. Methods based on illumination sample synthesis estimate the illumination condition of the face image to be recognized and simulate a standard illumination condition, thus normalizing the image to standard illumination before recognition. Existing illumination sample synthesis methods can be divided into four classes: methods based on invariant features, on illumination change models, on face image normalization, and on shape from shading (SFS) [25]. The first class is represented by the quotient image (QI) [20], the light cone method [27] and the spherical harmonics method [1]. The latter three classes all need to estimate 3D information of the face from 2D images.

Multi-pose face recognition methods based on 2D images mainly include methods based on multiple views [26], pose-invariant features [12, 16] and deformation models [3, 11, 15]. Methods based on multiple views need several images of each person from different perspectives; the images are divided into subsets according to pose, and a model is trained for each pose subset. At recognition time, the method first estimates which typical pose the face to be recognized is closest to and then performs recognition within the corresponding subset. Pose-invariant feature methods try to find a feature independent of pose to solve the multi-pose face recognition problem; such methods are computationally complex, and the range of pose change they can handle is small. Methods based on deformation models handle the multi-pose problem by synthesizing new virtual views, generating images of each pose from a single known face image by various means. 3D model methods generally establish a shape model to estimate shape and texture parameters, or build a 3D face model of the face to be recognized [2, 6, 8]; recognition is then carried out on the estimated parameters or on a generated frontal face image. Although 3D model methods can achieve satisfactory recognition rates, they incur a high computational cost.

Feature extraction is an important step in face recognition and directly affects the recognition result. After decades of development, feature extraction methods are no longer limited to the classic Eigenface and Fisherface, and many different feature extraction methods have been applied to face recognition. In recent years, feature extraction methods based on manifold learning have been proposed; they perform feature extraction by mining the nonlinear manifold structure within the data. However, the feature mappings obtained this way are defined only on the training data: a new test sample has no image in the embedded space. This is the so-called out-of-sample problem. To address it, several linear approximation methods were developed, among them locality preserving projection (LPP) [10]. Nonetheless, other problems in manifold learning methods, such as the choice of neighborhood size and of other parameters, still trouble researchers. Inspired by sparse representation, the sparse preserving projection (SPP) method was proposed [18]. It selects the number of neighbors automatically through sparse representation; the weights in the neighborhood graph are no longer computed by a fixed formula but are obtained from the sparse representation process. Neighbor selection and weight determination are thus carried out simultaneously, which solves the above problems well. Although the sparse representation process gives SPP a certain discriminative ability, SPP is still an unsupervised feature extraction method that does not use class information. As far as face recognition is concerned, discriminative information plays an important role in improving the recognition rate. This paper therefore improves SPP, inspired by LPP: discriminative information is introduced to obtain a face feature extraction method with stronger discrimination.
At the same time, statistical uncorrelation constraints are added to decrease the redundancy among eigenvectors, yielding the proposed UDSPP method. The rest of the paper is organized as follows. Section 2 introduces the related concepts of sparse preserving projection and sparse reconstruction. Section 3 proposes the UDSPP method. Section 4 reports recognition experiments with the proposed algorithm and analyzes its performance. Section 5 concludes the paper.

2 Sparse preserving projection and sparse reconstruction

2.1 Sparse representation

In traditional signal representation, a signal is decomposed on a set of complete orthogonal bases, and the decomposition is often accompanied by complex computation. Building on wavelet analysis, Mallat et al. proposed the idea of sparse representation, which exploits the sparsity of the signal for modeling: a small number of non-zero coefficients reflect the internal structure and essential properties of the signal, so that its expression is simple. Sparse representation thus opened up a new direction in signal representation research.

In general, a signal can be expressed as a linear combination of basis vectors \( \{t_i\}_{i=1}^M \): given a signal \( f\in {\mathbb{R}}^N \), it can be represented linearly by \( \{t_i\}_{i=1}^M \), \( t_i\in {\mathbb{R}}^N \):

$$ f=T\alpha =\left[\begin{array}{cccc}{t}_1 & {t}_2 & \cdots & {t}_N\end{array}\right]\left[\begin{array}{c}{\alpha}_1 \\ {}{\alpha}_2 \\ {}\vdots \\ {}{\alpha}_N\end{array}\right], $$
(1)

where α is the coefficient vector. In practice, commonly used complete orthogonal bases include the Fourier basis, wavelet bases and the cosine basis; in these cases M = N.

In sparse representation, the complete orthogonal basis is replaced by a redundant basis, namely M ≫ N. The vectors \( t_i, i=1,\cdots,M \) are then no longer linearly independent, and \( D=\{t_i\}_{i=1}^M \), M ≫ N, is called a dictionary. When the redundant dictionary is used to express the signal, the overcomplete representation of the signal f is:

$$ f=D\alpha . $$
(2)

Given f and D, the above formula is an underdetermined problem with infinitely many solutions α. It is this underdeterminedness that makes sparse representation of the signal possible.

The goal of sparse representation is to find the sparsest among all possible solutions of (2), namely the one containing the fewest non-zero elements. The number of non-zero elements of a vector α is denoted \( {\left\Vert \alpha \right\Vert}_0 \), so the sparse representation model can be expressed as follows:

$$ \arg \underset{\alpha }{ \min }{\left\Vert \alpha \right\Vert}_0\quad s.t.\quad f=D\alpha . $$
(3)

Since solving (3) is an NP-hard problem, minimization of the \( \ell_1 \) norm is usually used as an approximation, giving the following model:

$$ \arg \underset{\alpha }{ \min }{\left\Vert \alpha \right\Vert}_1\quad s.t.\quad f=D\alpha . $$
(4)
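As a concrete illustration, problem (4) can be cast as a linear program and solved with an off-the-shelf solver. The sketch below is our own illustrative choice (the `basis_pursuit` helper, the Gaussian dictionary and the sparsity level are not part of the original formulation): it splits α into its positive and negative parts so that the ℓ₁ norm becomes a linear objective.

```python
import numpy as np
from scipy.optimize import linprog

def basis_pursuit(D, f):
    """Solve min ||alpha||_1  s.t.  f = D alpha  (Eq. (4)) as a linear program.

    Writing alpha = u - v with u, v >= 0 gives ||alpha||_1 = 1^T (u + v),
    so the l1 problem becomes min 1^T [u; v]  s.t.  [D, -D][u; v] = f.
    """
    N, M = D.shape
    c = np.ones(2 * M)                     # objective: sum(u) + sum(v)
    A_eq = np.hstack([D, -D])              # D (u - v) = f
    res = linprog(c, A_eq=A_eq, b_eq=f, bounds=(0, None), method="highs")
    u, v = res.x[:M], res.x[M:]
    return u - v

# Redundant Gaussian dictionary (M >> N) and a 2-sparse signal
rng = np.random.default_rng(0)
D = rng.standard_normal((15, 50))
alpha_true = np.zeros(50)
alpha_true[[3, 17]] = [1.5, -2.0]
f = D @ alpha_true
alpha = basis_pursuit(D, f)
```

For such a mildly sparse signal the ℓ₁ solution typically coincides with the sparsest one; in any case it reconstructs f exactly while having an ℓ₁ norm no larger than that of any other feasible solution.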

After decades of development, compressed sensing theory, built on sparse representation theory, was put forward. It breaks through the limit of the Nyquist sampling theorem and has attracted the interest of many scholars [4]. Meanwhile, with the continual improvement of sparse representation theory and of various optimization algorithms, sparse representation has been applied to many aspects of image processing and has become one of its hot fields.

2.2 Sparse matrix reconstruction

Feature extraction uses a linear or nonlinear projection to map samples from the high-dimensional space in which they lie to a corresponding low-dimensional feature space. Besides reducing dimensionality to decrease the computational complexity of recognition, feature extraction also helps to reduce noise, so that the extracted features have better separability.

A general feature extraction method can be expressed as follows. Given a data sample set \( \{x_i\}_{i=1}^n \), where \( x_i\in {\mathbb{R}}^m \) is an m-dimensional column vector and n is the number of samples, find an appropriate linear transformation matrix \( W\in {\mathbb{R}}^{m\times d} \) (d ≪ m) to project the original sample points to a new low-dimensional feature space, namely \( y_i=W^Tx_i \); y_i is the obtained low-dimensional feature.

Sparse preserving projection is a feature extraction method suited to face recognition proposed in recent years. It finds the projection matrix by preserving the sparse reconstruction relationship before and after feature extraction. Compared with methods such as LPP that determine the neighborhood relationship by k-nearest neighbors, determining the neighborhood relationship through sparse representation is more reasonable. At the same time, neighborhood selection and weight matrix determination in SPP are carried out simultaneously.

Given a data sample set \( \{x_i\}_{i=1}^n \) with \( x_i\in {\mathbb{R}}^m \), let \( X=[x_1,x_2,\cdots,x_n]\in {\mathbb{R}}^{m\times n} \) be the matrix containing all samples as column vectors. The target of sparse representation is to express an unknown sample x with as few elements of X as possible. The model for solving the weight matrix in SPP can be expressed as follows:

$$ \begin{array}{r}\hfill { \min}_{s_i}{\left\Vert {s}_i\right\Vert}_1\\ {}\hfill s.t.{x}_i=X{s}_i\\ {}\hfill 1={1}^T{s}_i\end{array}, $$
(5)

where \( s_i=[s_{i,1},s_{i,2},\cdots,s_{i,i-1},0,s_{i,i+1},\cdots,s_{i,n}]^T \) is an n-dimensional vector whose i-th element is 0. In other words, x_i is represented linearly by the elements of X other than x_i, and the element \( s_{i,j}, j\ne i \) indicates the contribution of x_j to the reconstruction of x_i. After all \( s_i, i=1,\cdots,n \) have been obtained, the sparse reconstruction matrix is \( S=[s_1,\cdots,s_n]^T \).
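A small sketch of how the weight matrix S of (5) might be computed, again via linear programming. The toy data and the feasibility assumption n − 1 > m are ours; on raw face images, where m ≫ n, the equality constraint would first require dimensionality reduction or an error tolerance.

```python
import numpy as np
from scipy.optimize import linprog

def spp_weights(X):
    """Row i of the returned S solves Eq. (5): min ||s_i||_1 subject to
    x_i = X s_i and 1 = 1^T s_i, with the i-th entry fixed to zero."""
    m, n = X.shape
    S = np.zeros((n, n))
    for i in range(n):
        idx = [j for j in range(n) if j != i]      # exclude x_i itself
        Xi, k = X[:, idx], n - 1
        # variables [u; v] with s = u - v, u, v >= 0
        A_eq = np.vstack([np.hstack([Xi, -Xi]),
                          np.hstack([np.ones(k), -np.ones(k)])])
        b_eq = np.append(X[:, i], 1.0)             # x_i = Xi s and sum(s) = 1
        res = linprog(np.ones(2 * k), A_eq=A_eq, b_eq=b_eq,
                      bounds=(0, None), method="highs")
        S[i, idx] = res.x[:k] - res.x[k:]
    return S

rng = np.random.default_rng(1)
X = rng.standard_normal((5, 20))                   # 20 toy samples in R^5
S = spp_weights(X)
```

Each row of S both selects the "neighbors" (its non-zero entries) and supplies their weights, which is exactly the simultaneous selection-and-weighting property of SPP described above.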

The objective function of SPP is defined as [18]:

$$ { \min}_w{\displaystyle \sum_{i=1}^n{\left\Vert {w}^T{x}_i-{w}^TX{s}_i\right\Vert}^2}. $$
(6)

By algebraic manipulation, the above objective function can be converted into the following form:

$$ {\displaystyle \sum_{i=1}^n{\left\Vert {w}^T{x}_i-{w}^TX{s}_i\right\Vert}^2}={w}^TX\left(I-S-{S}^T+{S}^TS\right){X}^Tw. $$
(7)

Setting \( {S}_{\alpha }=I-S-{S}^T+{S}^TS \) and introducing the constraint \( {w}^TX{X}^Tw=1 \) to avoid degenerate solutions, the SPP model can be transformed into the following optimization problem:

$$ { \min}_w\frac{w^TX{S}_{\alpha }{X}^Tw}{w^TX{X}^Tw}. $$
(8)

Its solution w is the eigenvector corresponding to the minimum eigenvalue of the following generalized eigenvalue problem:

$$ X{S}_{\alpha }{X}^Tw=\lambda X{X}^Tw. $$
(9)

Let \( w_1,w_2,\cdots,w_d \) be the eigenvectors corresponding to the d smallest eigenvalues, namely \( \lambda_1\le \lambda_2\le \cdots \le \lambda_d \); the obtained projection matrix is \( W=[w_1,w_2,\cdots,w_d] \).
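In code, once S is available, solving (9) is a symmetric-definite generalized eigenproblem; a possible sketch follows (the small ridge term that keeps XX^T positive definite, and the toy data, are our own additions):

```python
import numpy as np
from scipy.linalg import eigh

def spp_projection(X, S, d):
    """Return W = [w_1, ..., w_d]: eigenvectors of
    X S_alpha X^T w = lambda X X^T w  (Eq. (9)) with the d smallest
    eigenvalues. eigh(A, B) returns eigenvalues in ascending order."""
    n = X.shape[1]
    S_alpha = np.eye(n) - S - S.T + S.T @ S    # Eq. (7)
    A = X @ S_alpha @ X.T                      # symmetric
    B = X @ X.T + 1e-8 * np.eye(X.shape[0])    # ridge keeps B positive definite
    _, vecs = eigh(A, B)
    return vecs[:, :d]

rng = np.random.default_rng(2)
X = rng.standard_normal((5, 20))
S = 0.1 * rng.standard_normal((20, 20))        # stand-in reconstruction matrix
np.fill_diagonal(S, 0.0)
W = spp_projection(X, S, d=3)
Y = W.T @ X                                    # low-dimensional features y_i = W^T x_i
```

Since `eigh` normalizes the eigenvectors to be B-orthonormal, the constraint \( w^TXX^Tw=1 \) is satisfied automatically.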

3 UDSPP method

Compared with LPP, SPP constructs the neighborhood graph through sparse representation and thus avoids choosing the neighborhood size k. Moreover, k-nearest-neighbor selection uses Euclidean distance as its measure, which is sensitive to noise in the data: any noise disturbs the neighborhood graph. Thanks to sparse representation, SPP avoids this problem effectively and is therefore more practical. For face recognition, a supervised feature extraction method can better mine the discriminative information in the data and thus improve recognition performance. To enhance the discrimination of SPP, this paper improves it by adding discriminative information, so that the resulting feature extraction method, UDSPP, is better suited to face recognition. The statistical uncorrelation constraint is introduced to reduce the redundancy among eigenvectors and improve the effectiveness of feature extraction.

3.1 Discriminative sparse preserving projection

First, the objective function of the discriminative sparse preserving projection is given as:

$$ \min \frac{{\displaystyle {\sum}_{c=1}^C{\displaystyle {\sum}_{i=1}^{n_c}{\left({y}_i^c-{\displaystyle {\sum}_{j=1}^{n_c}{s}_{ij}^c{y}_j^c}\right)}^2}}}{{\displaystyle {\sum}_{i,j=1}^C{\left({m}_i-{m}_j\right)}^2}}, $$
(10)

where C is the number of classes, n_c is the number of samples in the c-th class, and \( {y}_i^c \) is the projection of the i-th sample of the c-th class into the feature space. The vectors m_i and m_j are the means of the i-th and j-th class samples in the feature space respectively, namely \( {m}_i=\frac{1}{n_i}{\displaystyle {\sum}_{k=1}^{n_i}{y}_k^i} \) and \( {m}_j=\frac{1}{n_j}{\displaystyle {\sum}_{k=1}^{n_j}{y}_k^j} \). We want the optimal projection to maximize the distance between classes while preserving the intra-class sparse reconstruction relationship, so as to enhance discrimination.

With simple algebraic manipulation, the numerator of the objective function can be transformed into the following form:

$$ \begin{array}{l}{\displaystyle {\sum}_{c=1}^C{\displaystyle {\sum}_{i=1}^{n_c}{\left({y}_i^c-{\displaystyle {\sum}_{j=1}^{n_c}{s}_{ij}^c{y}_j^c}\right)}^2}}={\displaystyle {\sum}_{c=1}^C{\displaystyle {\sum}_{i=1}^{n_c}{\left({w}^T{x}_i^c-{\displaystyle {\sum}_{j=1}^{n_c}{s}_{ij}^c{w}^T{x}_j^c}\right)}^2}}\\ {}={\displaystyle {\sum}_{c=1}^C{\displaystyle {\sum}_{i=1}^{n_c}{\left({w}^T{x}_i^c-{w}^T{X}_c{s}_i^c\right)}^2}}={\displaystyle {\sum}_{c=1}^C{w}^T{X}_c\left({I}_c-{S}_c-{S}_c^T+{S}_c^T{S}_c\right){X}_c^Tw}\\ {}={w}^TX\left(I-{S}_{\beta }-{S}_{\beta}^T+{S}_{\beta}^T{S}_{\beta}\right){X}^Tw,\end{array} $$
(11)

where \( I=\left[\begin{array}{ccc}\hfill {I}_1\hfill & \hfill 0\hfill & \hfill 0\hfill \\ {}\hfill 0\hfill & \hfill \ddots \hfill & \hfill 0\hfill \\ {}\hfill 0\hfill & \hfill 0\hfill & \hfill {I}_C\hfill \end{array}\right] \), with I_c the n_c × n_c identity matrix, \( {S}_{\beta }=\left[\begin{array}{ccc}\hfill {S}_1\hfill & \hfill 0\hfill & \hfill 0\hfill \\ {}\hfill 0\hfill & \hfill \ddots \hfill & \hfill 0\hfill \\ {}\hfill 0\hfill & \hfill 0\hfill & \hfill {S}_C\hfill \end{array}\right] \), and S_c the sparse reconstruction coefficient matrix within the c-th class, c = 1, ⋯, C.
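When the columns of X are sorted by class, S_β is simply the block-diagonal stack of the per-class matrices, for example (the class sizes and random values below are purely illustrative):

```python
import numpy as np
from scipy.linalg import block_diag

# Hypothetical per-class reconstruction matrices S_c for C = 3 classes
# with n_c = 2, 3, 2 samples; values are illustrative only.
rng = np.random.default_rng(3)
S_c = [0.1 * rng.standard_normal((n_c, n_c)) for n_c in (2, 3, 2)]
for M_c in S_c:
    np.fill_diagonal(M_c, 0.0)       # a sample never reconstructs itself

S_beta = block_diag(*S_c)            # the block-diagonal S_beta of Eq. (11)
```

The zero off-diagonal blocks encode the fact that, in the intra-class formulation, a sample is reconstructed only from samples of its own class.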

The denominator of the objective function can be converted into:

$$ \begin{array}{l}{\displaystyle \sum_{i,j=1}^C{\left({m}_i-{m}_j\right)}^2}={\displaystyle \sum_{i,j=1}^C{\left(\frac{1}{n_i}{\displaystyle \sum_{k=1}^{n_i}{y}_k^i}-\frac{1}{n_j}{\displaystyle \sum_{k=1}^{n_j}{y}_k^j}\right)}^2}\\ {}={\displaystyle \sum_{i,j=1}^C\left[{w}^T\left({f}_i-{f}_j\right){\left({f}_i-{f}_j\right)}^Tw\right]}=w{}^TS_bw,\end{array} $$
(12)

where \( {f}_i=\frac{1}{n_i}{\displaystyle \sum_{k=1}^{n_i}{x}_k^i} \) is the mean vector of the i-th class samples in the original space and S_b is the between-class scatter matrix.

Setting \( {S}_{\gamma }=I-{S}_{\beta }-{S}_{\beta}^T+{S}_{\beta}^T{S}_{\beta} \), the objective function is converted into the following form:

$$ { \min}_w\frac{w^TX{S}_{\gamma }{X}^Tw}{w^T{S}_bw}. $$
(13)

It should be noted that when the number of samples per person in the training database decreases, an image can no longer be expressed linearly using only samples of the same person, which makes the sparse reconstruction relationship inaccurate: the reconstruction weight matrix cannot reflect the relationships among samples well. It is known that sparse representation itself carries a certain discriminative ability [8]. We therefore replace the intra-class sparse reconstruction relationship with the global reconstruction relationship when there are few training samples per person. In that case, the numerator of the discriminative sparse preserving projection is that of SPP, namely:

$$ { \min}_w\frac{w^TX{S}_{\alpha }{X}^Tw}{w^T{S}_bw}. $$
(14)

In this way, each sample can be expressed accurately, and samples of the same class still make the greater contribution, so the relationships among same-class samples are also well represented. The experimental results show that this replacement is reasonable.

3.2 UDSPP model

To make the extracted features statistically uncorrelated and thus decrease the redundancy among eigenvectors, a statistical uncorrelation constraint is added to the objective function, namely \( {w}^T{S}_tw=1 \), where S_t is the total scatter matrix. The proposed UDSPP model is then:

$$ \begin{array}{l}{ \min}_{w^T{S}_tw=1}\frac{w^TX{S}_{\gamma }{X}^Tw}{w^T{S}_bw}={ \min}_{w^T{S}_tw=1}\frac{w^TX{S}_{\gamma }{X}^Tw}{w^T\left({S}_t-{S}_w\right)w}\\ {}={ \min}_{w^T{S}_tw=1}\frac{w^TX{S}_{\gamma }{X}^Tw}{1-{w}^T{S}_ww},\end{array} $$
(15)

where S_w is the within-class scatter matrix. According to [6], S_t can be written as XGX^T and S_w as XMX^T, where G = I − (1/n)ee^T, e = (1, ⋯, 1)^T and M = I − E, with E_ij = 1/n_c if x_i and x_j belong to the c-th class and E_ij = 0 otherwise. The constrained optimization problem can then be transformed into the following form:

$$ { \min}_{w^T{S}_tw=1}{w}^TX\left({S}_{\gamma }+M\right){X}^Tw. $$
(16)

Its solution w is the eigenvector corresponding to the smallest eigenvalue of the following generalized eigenvalue problem:

$$ X\left({S}_{\gamma }+M\right){X}^Tw=\lambda XG{X}^Tw. $$
(17)

Let \( w_1,w_2,\cdots,w_d \) be the eigenvectors corresponding to the d smallest eigenvalues, namely \( \lambda_1\le \lambda_2\le \cdots \le \lambda_d \); the obtained projection matrix is \( W=[w_1,w_2,\cdots,w_d] \). This yields UDSPP with the statistically uncorrelated feature.
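Putting Sections 3.1 and 3.2 together, the whole UDSPP training step reduces to one generalized eigenproblem. The sketch below mirrors Eqs. (16)-(17); the toy sizes, the random per-class weights and the small ridge on XGX^T are our own assumptions, not part of the original method.

```python
import numpy as np
from scipy.linalg import eigh, block_diag

def udspp(X, labels, S_blocks, d):
    """Sketch of UDSPP: solve X (S_gamma + M) X^T w = lambda X G X^T w
    (Eq. (17)) and keep the eigenvectors of the d smallest eigenvalues.
    Columns of X must be sorted by class, matching S_blocks."""
    m, n = X.shape
    S_beta = block_diag(*S_blocks)
    I = np.eye(n)
    S_gamma = I - S_beta - S_beta.T + S_beta.T @ S_beta
    G = I - np.ones((n, n)) / n                 # S_t = X G X^T
    E = np.zeros((n, n))
    for c in np.unique(labels):
        idx = np.flatnonzero(labels == c)
        E[np.ix_(idx, idx)] = 1.0 / idx.size    # E_ij = 1/n_c within class c
    M = I - E                                   # S_w = X M X^T
    A = X @ (S_gamma + M) @ X.T
    B = X @ G @ X.T + 1e-8 * np.eye(m)          # ridge keeps B positive definite
    _, vecs = eigh(A, B)
    return vecs[:, :d]

rng = np.random.default_rng(4)
labels = np.repeat([0, 1, 2], 4)                # 3 classes, 4 samples each
X = rng.standard_normal((5, 12))
S_blocks = [0.1 * rng.standard_normal((4, 4)) for _ in range(3)]
for B_c in S_blocks:
    np.fill_diagonal(B_c, 0.0)
W = udspp(X, labels, S_blocks, d=3)
Y = W.T @ X                                     # statistically uncorrelated features
```

Because `eigh` makes the eigenvectors B-orthonormal, the constraint w^T S_t w = 1 holds by construction for each column of W.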

4 Recognition experiment and performance analysis

4.1 Face recognition database

Before the recognition experiments, we first introduce the face databases used; sample images are also provided. The Extended Yale B database [12, 17] contains 2414 frontal face images of 38 individuals, collected in a controlled laboratory environment under varying lighting conditions. After cropping and normalization, the image size is 192 × 168. Figure 1 shows some images from the database under moderate and brighter illumination.

Fig. 1: Part of the images from the Extended Yale B database

The images in the FERET database were collected in a semi-controlled environment [9]. The complete database contains 1564 sets and 14126 images. Figure 2 shows some images from the FERET database.

Fig. 2: Images from the FERET database

4.2 Experiment results on different face database

4.2.1 Experiment result on extended Yale B

First, experiments were carried out on the Extended Yale B database. The proposed UDSPP method was compared with SPP and LDA. We randomly selected 10 images per individual for training and another 10 for testing, with no overlap between training and test images. The experiment was repeated 20 times. Table 1 shows the highest average recognition rate and the corresponding dimension for the three feature extraction methods using the nearest neighbor classifier. Figure 3 shows the average recognition rate at each dimension.

Table 1 The highest average recognition rate and corresponding dimension (%)
Fig. 3: Average recognition rate comparison under different dimensions

As can be seen from Table 1 and Fig. 3, the recognition rate of UDSPP is significantly higher than that of SPP once discriminative information is added. Compared with the traditional LDA method, since the Extended Yale B database contains extreme illumination changes, UDSPP and SPP achieve higher recognition rates than LDA, because both capture the sparse reconstruction relationships among samples: UDSPP through its intra-class sparse preserving structure and SPP through its global sparse preserving structure.

4.2.2 Experiment result on FERET database

The comparison on the FERET database took the following form: 4 or 5 images per individual were selected for training each time, with the remaining images used for testing. Table 2 shows the highest average recognition rate and the corresponding dimension.

Table 2 The highest average recognition rate and corresponding dimension

It can be seen from Table 2 that, on a general frontal face database without significant illumination effects, the recognition rate of SPP is lower than that of LDA. This is because LDA has strong discriminative power; when distance computation is not adversely affected, this is more conducive to face recognition. UDSPP, however, combines the advantages of SPP and LDA, so its recognition rate is higher than both. The table also shows that the recognition rate of UDSPP decreases significantly with 4 training samples per individual. Our analysis is that with too few training samples, the intra-class reconstruction matrix cannot reflect the intra-class sample relationships well. In accordance with the earlier analysis, the intra-class sparse reconstruction relationship is then replaced with the global sparse reconstruction relationship. Figures 4 and 5 show the average recognition rate at each dimension for four methods, including G-UDSPP, the variant based on global sparse representation.

Fig. 4: Average recognition rate with 4 training samples

Fig. 5: Average recognition rate with 5 training samples

4.2.3 Performance comparison with DSPP

We also compare UDSPP with another existing discriminative sparse preserving projection (DSPP) [32]. DSPP directly combines sparse preserving projection with linear discriminant analysis to arrive at the following objective function:

$$ \begin{array}{l} \min {w}^T\left(X{S}_{\alpha }{X}^T-\gamma \left({S}_b-{S}_w\right)\right)w\\ {}s.t.{w}^TX{X}^Tw=1.\end{array} $$
(18)

The comparison results are shown in Figs. 6 and 7. For the Extended Yale B database, 10 samples per individual are again randomly selected for training and another 10 for testing. For the FERET database, 5 samples per individual are used for training to compute the average recognition rate.

Fig. 6: Average recognition rate at each dimension on Extended Yale B

Fig. 7: Average recognition rate at each dimension on FERET

As can be seen from Figs. 6 and 7, the proposed method achieves a recognition rate comparable to that of DSPP as a face feature extraction method, and UDSPP even shows a certain degree of superiority. Table 3 shows the specific recognition rates at the corresponding dimensions, where MRA denotes the maximum recognition rate.

Table 3 Average recognition rate comparison of DSPP and UDSPP (%)

It can be seen from Table 3 that the results of UDSPP differ from those of DSPP. Table 4 shows the recognition results for different training and test samples at 80 and 120 dimensions.

Table 4 Recognition rate comparison under different training test samples (%)

From Table 4 it can be seen that UDSPP behaves quite differently from DSPP on specific training/test instances: for the same training and test sets, the two methods reach quite different recognition results at the same dimension.

5 Conclusion

Based on sparse preserving projection and linear discriminant analysis, this paper proposed a new discriminative sparse preserving projection method. Meanwhile, a statistical uncorrelation constraint was introduced to decrease the redundancy among eigenvectors. The experimental results show that the recognition rate is significantly improved compared with SPP after adding discriminative information. At the same time, compared with Euclidean-distance-based LDA, the method also has certain advantages, especially on face databases with strong illumination changes. On a general face database, if there are too few samples per class, the recognition rate of UDSPP also declines, because the shortage of intra-class samples makes the intra-class sparse reconstruction relationship inaccurate. In such circumstances, the global sparse representation should be used instead of the intra-class one to compensate for the sample shortage. Overall, UDSPP is effective as a face feature extraction method; improving it and extending its scope of application will be the focus of our future research.