Face Recognition Based on Deep Belief Network Combined with Center-Symmetric Local Binary Pattern

Li, Chen; Wei, Wei; Wang, Jingzhong; Tang, Wanbing; Zhao, Shuai

doi:10.1007/978-981-10-1536-6_37

Chen Li⁵,
Wei Wei⁶,
Jingzhong Wang⁵,
Wanbing Tang⁵ &
…
Shuai Zhao⁵

Part of the book series: Lecture Notes in Electrical Engineering ((LNEE,volume 393))

1266 Accesses
11 Citations

Abstract

Human face recognition performances usually drops heavily due to pose variation and other factors. The representative deep learning method Deep Belief Network (DBN) has been proven to be an effective method to extract information-rich features of face image for recognition. However the DBN usually ignore the local features of image which are proven to be important for face recognition. Hence, this paper proposed a novel approach combined with local feature Center-Symmetric Local Binary Pattern (CS-LBP) and DBN. CS-LBP is applied to extract local texture features of face image. Then the extracted features are used as the input of Deep Belief Network instead of face image. The network structure and parameters are trained to obtain the final network model for recognition. A large amount of experiments are conducted on the ORL face database, and the experimental results show that compared with LBP, LBP combined with DBN and DBN, the proposed method has a significant improvement on recognition rates and can be a feasible way to combat with pose variation.

Access provided by Autonomous University of Puebla. Download conference paper PDF

Restricted Boltzmann Machine with Adaptive Local Hidden Units

A multimodal deep learning framework using local feature representations for face recognition

Article Open access 04 September 2017

Deep Belief Network Based on Double Weber Local Descriptor in Micro-expression Recognition

Keywords

1 Introduction

As a biometric technology, face has many distinct advantages compared with other biometric characteristics: it can be captured from a long distance which is friendly and convenience especially for the information security or access control application and it also has a wealthy structure and relatively larger area which is not easily to be occluded. Hence face recognition has becoming an indispensable biological authentication method and attracting many attentions.

During the last decades, many face recognition approaches have been proposed and can be roughly divided into two types: pixel-based approach and feature-based approach [1]. The Principle component analysis (PCA), Linear Discriminant Analysis (LDA) and Independent Component Analysis (ICA) methods are the most typical pixel-based methods and have been proved to be effective for recognition with large databases. The feature-based approach mainly include Local Binary Pattern (LBP), Gabor, SIFT and their modified approaches. Most the above methods can achieve satisfying recognition result upon frontal and high resolution face image. However the feature extraction methods usually rely on artificial selection. Besides, to extract more robust deep-level features in order to express face information more effectively is still difficult. Hence face recognition performances usually drops heavily due to pose variation and other factors under unconstrained environment, which is still a challenging task for researchers [2, 3].

Deep Learning [4] has become an important research area in computer vision. Deep Belief Network (DBN) is a typical Deep learning method with strong learning and expression ability. It can learn essential data feature from small samples and extract feature automatically without artificial selection. However, when pixel level image are import to DBN directly, the recognition performance normally decline since DBN ignores the local features of the images [5]. In order to make full use of the learning ability of DBN, proposed a novel approach combined with local feature and DBN is proposed. This paper is organized as follows: Sect. 2 describes the proposed technique, Experiment results are demonstrated in Sect. 3. Then we conclude the paper in Sect. 4.

2 Technical Approach

2.1 Center-Symmetric Local Binary Pattern (CS-LBP)

Local binary pattern (LBP) is an feature descriptor proposed by Heikkila which has been proven to be effective at texture feature description. And it can be seen as a standard approach for extract structural model of texture information. However texture features extracted by LBP is usually too nuanced to be robust for flat area of images [6]. To compensate the shortage, CS-LBP is proposed. It encode the change of the image from four different direction with center symmetric principle. The CS-LBP features can be described with Eqs. (1) and (2):

$$ {\text{CS}} - {\text{LBP}}_{{{\text{R}},{\text{N}},{\text{T}}}} \left( {{\text{X}},{\text{Y}}} \right) = \mathop \sum \limits_{{{\text{i}} = 0}}^{{\left( {\frac{\text{N}}{2}} \right) - 1}} {\text{S}}\left( {{\text{n}}_{\text{i}} - {\text{n}}_{{{\text{i}} + \left( {\frac{\text{N}}{2}} \right)}} } \right)2^{\text{i}} $$

(1)

$$ {\text{s}}\left( {\text{x}} \right) = \left\{ {\begin{array}{*{20}r} \hfill {1,} & \hfill {{\text{x}} > {\text{T}}} \\ \hfill {0,} & \hfill { {\text{otherwise}}} \\ \end{array} } \right. $$

(2)

where $ {\text{n}}_{\text{i}} $ and $ {\text{n}}_{{{\text{i}} + ({\text{N}}/2)}} $ correspond to the gray value of center-symmetric area pixels, N represents the pixel numbers on a circle of radius R. To enhance the robustness of CS-LBP feature on flat image regions, a threshold T is set for the change of image intensity. Compared with the traditional LBP, CS-LBP has lower dimension and lower computational complexity. Also it is more robust to noise interference. Hence CS-LBP is used for feature extraction to preserve more useful information of the image and reduce the impact of noises like pose variation.

2.2 Deep Belief Network

Since the deep learning architecture is proposed, it has drawn much interests. The deep learning architecture normally consist of feature detector units arranged in layers. Simple features are extracted by lower layers and put into higher layers to extract more complex features [7]. One of the most typical deep architectures DBN is proposed by Hinton in 2006 [8]. It is a multi-layer generative model composed of unsupervised Restricted Boltzmann Machine which consists of a visible layer as well as a hidden layer. It is build to detect more complex features which can reveal hidden information and higher-order correlations of the data.

To demonstrate its basic principle, a DBN model is shown in Fig. 1. As shown on the left part of Fig. 1, vi (i = 1, 2, …) represents the vector of visible layer; hi (i = 1, 2, …) represent the vector of the hidden layer. As shown on the right part of Fig. 2, a RBM [9] is composed of a visible layer and a hidden layer. The number of the visible units in the lower RBM equals to the number of the hidden units in next higher RBM. Pre-training the DBN model consists of learning the RBMs one by one, during which the learned features of one RBM are put into the next RBM as the input ‘data’.

2.3 DBN Combined with CS-LBP Based Face Recognition

Applying DBN to realistic-sized images is challenging because pixie-level face images are high-dimensional which will cause very high computationally complexity to the training algorithm. Besides DBN usually ignore the local features of images which is important for recognition. Hence the DBN Combined with CS-LBP algorithm is proposed in this part.

As shown in Fig. 2, firstly the local features of the input images are extracted by CS-LBP. Secondly, the obtained features are feed into the DBN instead of original face images as the input of the visible layer. Thirdly, pre-training the DBN from the bottom layer to the top layer. The main process is: training the first layer’s network parameters and use its output as the second layer’s input data and so on. To obtain the best net parameters, the Back Propagation (BP) algorithm is applied to fine tune the pre-trained DBN. In this paper, the final DBN model contains 3 layers and the number of iterations for each layer is 20. Finally, the Euclidean distance classifier is applied for classifying the face images accurately.

3 Experiment and Analysis

We conducted a number of experiments on the ORL face database [10] to evaluate the proposed face recognition algorithm. As shown in Fig. 3 ORL face databases consisted of 40 persons. Each person contained 10 different images with pose and expression variation.

3.1 Experiment 1

We selected the approximate frontal face image of each person to form the test dataset, and the remaining 9 images of the same person to form the training dataset. The training dataset consist of 360 images, and test dataset contains 40 images. The recognition results are shown in Fig. 4.

Figure 4 shows the comparison between the proposed method and other 3 methods including LBP combined with PCA, LBP combined with DBN and DBN. The experimental result shows that: the rank1 recognition rate of the proposed method is 97.5 %. It is at least 7.5 % higher than the rank1 recognition rate of the other 3 typical algorithm as shown in Fig. 4b–d. Hence, the proposed methods gains obvious performance improvement over the other three typical methods.

3.2 Experiment 2

To further demonstrate the effectiveness of the proposed method on face images with pose variation, several other experiments are conducted. Since each person has 10 images with pose variation of different degree, each image of the same person is used to be the test image in turn, which means 10 group of test datasets and training datasets is formed. Then the proposed algorithm is performed on these 10 datasets. The final recognition rate is the average of these ten experiments, which demonstrate the recognition rate of the proposed method on multi-pose face images. Table 1 shows the result.

Table 1 Comparison of different recognition method on multi-pose face image

Full size table

As shown in Table 1, when employing the face images with pose variation as the test set, the performance of the four algorithm all degrade to a certain extent. However, the proposed method still achieve the highest rank1 recognition rate 92.78 % which is at least 5 % higher than the other three methods. This verifies the proposed algorithm combining the local features and the advantage of DBN gains obvious performance improvement over the multi-pose face image.

4 Conclusion

A face recognition approach combining with CS-LBP and DBN is proposed in this paper. The CS-LBP is applied to extract local texture features of face image, which are imported to DBN instead of original face images. Then the network model is confirmed through pre-training and fine tuning layer by layer. A large amount of experiments are conducted on the ORL face database. By comparing with the LBP combined with DBN, LBP combined with PCA, and typical DBN, the proposed method is proven to be a significant improvement on recognition performance as well as a feasible way to combat the influence of pose variation.

References

Klare B, Jain AK (2010) Heterogeneous face recognition: matching NIR to visible light images. In: Pattern recognition international conference on IEEE, pp 1513–1516
Google Scholar
Shuicheng Y, Dong X, Benyu Z et al (2007) Graph embedding and extension: a general framework for dimensionality reduction. IEEE Trans Pattern Anal Mach Intell 29:40–51
Article Google Scholar
Jian Y, David Z, Frangi AF et al (2004) Two-dimensional PCA: a new approach to appearance-based face representation and recognition. IEEE Trans Pattern Anal Mach Intell 26:131–137
Article Google Scholar
Arel I, Rose DC, Karnowski TP (2010) Deep machine learning—a new frontier in artificial intelligence research [Research frontier]. IEEE Comput Intell Mag 5:13–18
Article Google Scholar
Liang SF, Liu YH, Li-Chen LI (2014) Face recognition under unconstrained based on LBP and deep learning. J Commun 35:154–160
Google Scholar
Coates A, Ng AY (2011) Selecting receptive fields in deep networks. Adv Neural Inf Process Syst, 2528–2536
Google Scholar
Lee H, Grosse R, Ranganath R et al (2009) Convolutional deep belief networks for scalable unsupervised learning of hierarchical representations. In: International conference on machine learning, pp 609–616
Google Scholar
Hinton GE, Salakhutdinov RR (2006) Reducing the dimensionality of data with neural networks. Science 313:504–507
Article MathSciNet MATH Google Scholar
Schölkopf B, Platt J, Hofmann T (2007) Greedy layer-wise training of deep networks. Adv Neural Inf Process Syst 19:153–160
Google Scholar
[Online] Available: http://www.cl.cam.ac.uk/research/dtg/attarchive/facedatabase.html

Download references

Acknowledgment

This research is supported by the National Natural Science Foundation of China (61503005), by the Scientific Research Starting Foundation of North China University of Technology, and the National Natural Science Foundation of China (61371142).

Author information

Authors and Affiliations

College of Computer Science, North China University of Technology, Beijing, China
Chen Li, Jingzhong Wang, Wanbing Tang & Shuai Zhao
College of Electronic and Information Engineering, North China University of Technology, Beijing, China
Wei Wei

Authors

Chen Li
View author publications
You can also search for this author in PubMed Google Scholar
Wei Wei
View author publications
You can also search for this author in PubMed Google Scholar
Jingzhong Wang
View author publications
You can also search for this author in PubMed Google Scholar
Wanbing Tang
View author publications
You can also search for this author in PubMed Google Scholar
Shuai Zhao
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Chen Li .

Editor information

Editors and Affiliations

Computer Science and Engg., Seoul National Uni. of Sci. & Techn. Computer Science and Engg., Seoul, Korea (Republic of)
James J. (Jong Hyuk) Park
School of Comp. Science and Technology, Huazhong Univ. of Science and Technology School of Comp. Science and Technology, Wuhan, China
Hai Jin
Dept. of Multimedia Eng., Dongguk University Dept. of Multimedia Eng., Seoul, Korea (Republic of)
Young-Sik Jeong
College of Computer & Information Sci, King Saud University College of Computer & Information Sci, Riyadh, Saudi Arabia
Muhammad Khurram Khan

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Li, C., Wei, W., Wang, J., Tang, W., Zhao, S. (2016). Face Recognition Based on Deep Belief Network Combined with Center-Symmetric Local Binary Pattern. In: Park, J., Jin, H., Jeong, YS., Khan, M. (eds) Advanced Multimedia and Ubiquitous Engineering. Lecture Notes in Electrical Engineering, vol 393. Springer, Singapore. https://doi.org/10.1007/978-981-10-1536-6_37

Download citation

DOI: https://doi.org/10.1007/978-981-10-1536-6_37
Published: 30 August 2016
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-1535-9
Online ISBN: 978-981-10-1536-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics