Keywords

1 Introduction

As a biometric technology, face has many distinct advantages compared with other biometric characteristics: it can be captured from a long distance which is friendly and convenience especially for the information security or access control application and it also has a wealthy structure and relatively larger area which is not easily to be occluded. Hence face recognition has becoming an indispensable biological authentication method and attracting many attentions.

During the last decades, many face recognition approaches have been proposed and can be roughly divided into two types: pixel-based approach and feature-based approach [1]. The Principle component analysis (PCA), Linear Discriminant Analysis (LDA) and Independent Component Analysis (ICA) methods are the most typical pixel-based methods and have been proved to be effective for recognition with large databases. The feature-based approach mainly include Local Binary Pattern (LBP), Gabor, SIFT and their modified approaches. Most the above methods can achieve satisfying recognition result upon frontal and high resolution face image. However the feature extraction methods usually rely on artificial selection. Besides, to extract more robust deep-level features in order to express face information more effectively is still difficult. Hence face recognition performances usually drops heavily due to pose variation and other factors under unconstrained environment, which is still a challenging task for researchers [2, 3].

Deep Learning [4] has become an important research area in computer vision. Deep Belief Network (DBN) is a typical Deep learning method with strong learning and expression ability. It can learn essential data feature from small samples and extract feature automatically without artificial selection. However, when pixel level image are import to DBN directly, the recognition performance normally decline since DBN ignores the local features of the images [5]. In order to make full use of the learning ability of DBN, proposed a novel approach combined with local feature and DBN is proposed. This paper is organized as follows: Sect. 2 describes the proposed technique, Experiment results are demonstrated in Sect. 3. Then we conclude the paper in Sect. 4.

2 Technical Approach

2.1 Center-Symmetric Local Binary Pattern (CS-LBP)

Local binary pattern (LBP) is an feature descriptor proposed by Heikkila which has been proven to be effective at texture feature description. And it can be seen as a standard approach for extract structural model of texture information. However texture features extracted by LBP is usually too nuanced to be robust for flat area of images [6]. To compensate the shortage, CS-LBP is proposed. It encode the change of the image from four different direction with center symmetric principle. The CS-LBP features can be described with Eqs. (1) and (2):

$$ {\text{CS}} - {\text{LBP}}_{{{\text{R}},{\text{N}},{\text{T}}}} \left( {{\text{X}},{\text{Y}}} \right) = \mathop \sum \limits_{{{\text{i}} = 0}}^{{\left( {\frac{\text{N}}{2}} \right) - 1}} {\text{S}}\left( {{\text{n}}_{\text{i}} - {\text{n}}_{{{\text{i}} + \left( {\frac{\text{N}}{2}} \right)}} } \right)2^{\text{i}} $$
(1)
$$ {\text{s}}\left( {\text{x}} \right) = \left\{ {\begin{array}{*{20}r} \hfill {1,} & \hfill {{\text{x}} > {\text{T}}} \\ \hfill {0,} & \hfill { {\text{otherwise}}} \\ \end{array} } \right. $$
(2)

where \( {\text{n}}_{\text{i}} \) and \( {\text{n}}_{{{\text{i}} + ({\text{N}}/2)}} \) correspond to the gray value of center-symmetric area pixels, N represents the pixel numbers on a circle of radius R. To enhance the robustness of CS-LBP feature on flat image regions, a threshold T is set for the change of image intensity. Compared with the traditional LBP, CS-LBP has lower dimension and lower computational complexity. Also it is more robust to noise interference. Hence CS-LBP is used for feature extraction to preserve more useful information of the image and reduce the impact of noises like pose variation.

2.2 Deep Belief Network

Since the deep learning architecture is proposed, it has drawn much interests. The deep learning architecture normally consist of feature detector units arranged in layers. Simple features are extracted by lower layers and put into higher layers to extract more complex features [7]. One of the most typical deep architectures DBN is proposed by Hinton in 2006 [8]. It is a multi-layer generative model composed of unsupervised Restricted Boltzmann Machine which consists of a visible layer as well as a hidden layer. It is build to detect more complex features which can reveal hidden information and higher-order correlations of the data.

To demonstrate its basic principle, a DBN model is shown in Fig. 1. As shown on the left part of Fig. 1, vi (i = 1, 2, …) represents the vector of visible layer; hi (i = 1, 2, …) represent the vector of the hidden layer. As shown on the right part of Fig. 2, a RBM [9] is composed of a visible layer and a hidden layer. The number of the visible units in the lower RBM equals to the number of the hidden units in next higher RBM. Pre-training the DBN model consists of learning the RBMs one by one, during which the learned features of one RBM are put into the next RBM as the input ‘data’.

Fig. 1
figure 1

DBN structure containing two RBMs

Fig. 2
figure 2

Process of the proposed algorithm

2.3 DBN Combined with CS-LBP Based Face Recognition

Applying DBN to realistic-sized images is challenging because pixie-level face images are high-dimensional which will cause very high computationally complexity to the training algorithm. Besides DBN usually ignore the local features of images which is important for recognition. Hence the DBN Combined with CS-LBP algorithm is proposed in this part.

As shown in Fig. 2, firstly the local features of the input images are extracted by CS-LBP. Secondly, the obtained features are feed into the DBN instead of original face images as the input of the visible layer. Thirdly, pre-training the DBN from the bottom layer to the top layer. The main process is: training the first layer’s network parameters and use its output as the second layer’s input data and so on. To obtain the best net parameters, the Back Propagation (BP) algorithm is applied to fine tune the pre-trained DBN. In this paper, the final DBN model contains 3 layers and the number of iterations for each layer is 20. Finally, the Euclidean distance classifier is applied for classifying the face images accurately.

3 Experiment and Analysis

We conducted a number of experiments on the ORL face database [10] to evaluate the proposed face recognition algorithm. As shown in Fig. 3 ORL face databases consisted of 40 persons. Each person contained 10 different images with pose and expression variation.

Fig. 3
figure 3

Images example of ORL database

3.1 Experiment 1

We selected the approximate frontal face image of each person to form the test dataset, and the remaining 9 images of the same person to form the training dataset. The training dataset consist of 360 images, and test dataset contains 40 images. The recognition results are shown in Fig. 4.

Fig. 4
figure 4

Recognition rate comparison. a Proposed method. b LBP + DBN. c LBP + PAC. d DBN

Figure 4 shows the comparison between the proposed method and other 3 methods including LBP combined with PCA, LBP combined with DBN and DBN. The experimental result shows that: the rank1 recognition rate of the proposed method is 97.5 %. It is at least 7.5 % higher than the rank1 recognition rate of the other 3 typical algorithm as shown in Fig. 4b–d. Hence, the proposed methods gains obvious performance improvement over the other three typical methods.

3.2 Experiment 2

To further demonstrate the effectiveness of the proposed method on face images with pose variation, several other experiments are conducted. Since each person has 10 images with pose variation of different degree, each image of the same person is used to be the test image in turn, which means 10 group of test datasets and training datasets is formed. Then the proposed algorithm is performed on these 10 datasets. The final recognition rate is the average of these ten experiments, which demonstrate the recognition rate of the proposed method on multi-pose face images. Table 1 shows the result.

Table 1 Comparison of different recognition method on multi-pose face image

As shown in Table 1, when employing the face images with pose variation as the test set, the performance of the four algorithm all degrade to a certain extent. However, the proposed method still achieve the highest rank1 recognition rate 92.78 % which is at least 5 % higher than the other three methods. This verifies the proposed algorithm combining the local features and the advantage of DBN gains obvious performance improvement over the multi-pose face image.

4 Conclusion

A face recognition approach combining with CS-LBP and DBN is proposed in this paper. The CS-LBP is applied to extract local texture features of face image, which are imported to DBN instead of original face images. Then the network model is confirmed through pre-training and fine tuning layer by layer. A large amount of experiments are conducted on the ORL face database. By comparing with the LBP combined with DBN, LBP combined with PCA, and typical DBN, the proposed method is proven to be a significant improvement on recognition performance as well as a feasible way to combat the influence of pose variation.