1 Introduction

Great progress has been made in automatic face recognition over the last decades, especially under well-controlled conditions. However, the performance of face recognition systems in the real world degrades dramatically when the quality of the input face images is poor, e.g., at low resolution. This is a particular concern in surveillance environments, where the target is far from the sensor, resulting in low-resolution face images.

To solve the low-resolution (LR) problem, a two-step framework has been proposed following the intuition of first recovering the lost detail of LR face images and then applying traditional face recognition algorithms to the recovered images. In fact, most two-step LR face recognition algorithms apply a super-resolution (SR) technique as the first step [15]. The super-resolved face images are then passed to a standard face recognition pipeline. Over the last decade, many SR algorithms have been developed to reconstruct high-resolution (HR) images from a single LR image [1] or multiple LR images [2]. In many real-world face recognition systems, the intuitive solution is interpolation, which is simple and fast, e.g., bilinear or cubic. Learning-based super-resolution (LSR) algorithms [1, 3–5] have recently drawn much attention owing to their promising performance. Freeman et al. [1] proposed a patch-wise Markov Random Field as the SR prediction model and recovered HR images by MAP estimation. Baker and Kanade [3] proposed to recover the HR face image from an input LR one via a “face hallucination” model based on face priors. Liu et al. [5] proposed to combine a holistic and a local model for SR reconstruction. Inspired by locally linear embedding (LLE) [7], Chang et al. [4] recovered the HR face image from the spatial neighbors of its LR counterpart. Yang et al. [8] proposed to incorporate sparse representation into the SR framework, which achieves outstanding performance. However, these algorithms aim more at visual enhancement than at the performance of the specific face recognition task.

Recently, some algorithms that avoid an explicit SR stage have been introduced into the face recognition flow. Gunturk et al. [9] proposed transferring the SR reconstruction from the pixel domain to the eigenface domain. Hennings-Yeomans et al. [10, 11] integrated the aims of SR and face recognition simultaneously through a joint objective function. Although these methods improve the recognition rate, they are slow, even in their sped-up versions, because an optimization procedure must be run for each test image. To avoid the super-resolution step, Coupled Mapping (CM) based methods have been proposed for LR face recognition. Li et al. [12] proposed Coupled Locality Preserving Mapping (CLPM) based on CM for LR face recognition. Inspired by locality preserving methods [13, 14] for dimensionality reduction, CLPM introduced a penalty weighting matrix into the objective function to preserve the local relationships of the original space. CLPM placed more emphasis on the objective of recognition than on mere reconstruction and thus yielded better performance. However, it ignored the label information of the training set, which is vital for face recognition. To take advantage of label information, some LDA-like algorithms were introduced into coupled mapping, such as Simultaneous Discriminant Analysis (SDA) [19] and Coupled Marginal Fisher Analysis (CMFA) [18]. In [17], Shi et al. first constructed a local optimization for each training sample according to the relationships of neighboring data points and then combined the local optimizations to build the global structure. However, these algorithms fail to consider the recognition and geometric information of the training set simultaneously; thus some valuable information is lost and performance is limited on challenging problems [17].

In this paper, we propose a novel algorithm called Large Margin Coupled Mapping (LMCM) for LR face recognition, which takes both the recognition information of the training data and the local geometric relationships of face image pairs into account to maximize the distance of between-class pairs and minimize the distance of within-class pairs in the common subspace. With appropriate constraints, the newly defined optimization problem can be solved in closed form, making it fast enough for real-time applications.

The remainder of this paper is organized as follows. Section 2 formulates the LR face recognition problem and the CM framework. Section 3 describes the details of our proposed LMCM algorithm. Section 4 presents experimental results on the FERET and SCface databases. Section 5 concludes the paper.

2 Low Resolution Face Recognition

In the scenario of LR face recognition, the task can be reduced to finding an appropriate distance measure between an LR face image \( l_{i} \) and an HR one \( h_{j} \), i.e., \( d_{ij} = dist\left( {l_{i} ,h_{j} } \right) \). Here, \( l_{i} \in {\mathbb{R}}^{m} ,\;i = 1,\,2,\, \ldots \,,\,N_{p} \) and \( h_{j} \in {\mathbb{R}}^{M} ,\;j = 1,\,2,\, \ldots \,,\,N_{g} \) (m < M) represent the m-dimensional feature vectors of the LR query images and the M-dimensional feature vectors of the HR images registered in the gallery set, respectively. Due to this dimension mismatch, common distances (e.g., the Euclidean distance) obviously cannot be applied directly. To deal with this problem, traditional two-step algorithms based on explicit SR attempt to find a mapping \( f_{SR} :{\mathbb{R}}^{m} \mapsto {\mathbb{R}}^{M} \) to project the LR image into the target HR space, and then directly compute the distance in the HR space:

$$ d_{ij} = dist\left( {f_{SR} \left( {l_{i} } \right),h_{j} } \right) $$
(1)

Different from the two-step algorithms, CM based methods establish two coupled mappings, \( f_{L} :{\mathbb{R}}^{m} \mapsto {\mathbb{R}}^{n} \) for LR face images and \( f_{H} :{\mathbb{R}}^{M} \mapsto {\mathbb{R}}^{n} \) for HR face images, to project both the LR and HR feature vectors into a common feature space. Here, n denotes the dimensionality of the new common feature space. The distance can then be measured by:

$$ d_{ij} = dist\left( {f_{L} \left( {l_{i} } \right),f_{H} \left( {h_{j} } \right)} \right) $$
(2)

Now the critical problem is to find an ideal common feature space. For low-resolution face recognition, the objective of the CM algorithm is that the projections of the LR and HR face images of the same subject should be as close as possible in the new common feature space. Let \( f_{L} \left( l \right) = P_{L}^{T} l \) and \( f_{H} \left( h \right) = P_{H}^{T} h \) be linear mappings, where \( P_{L} \) and \( P_{H} \) are projection matrices of size \( m \times n \) and \( M \times n \), respectively. This principle is formulated as the following objective function:

$$ J_{CM} \left( {P_{L} ,P_{H} } \right) = \sum\nolimits_{i = 1}^{{N_{t} }} {\left\| {P_{L}^{T} l_{i} - P_{H}^{T} h_{i} } \right\|^{2} } $$
(3)

where \( N_{t} \) is the number of training images.

We use \( L = \left[ {l_{1} ,\,l_{2} ,\, \ldots \,,\,l_{{N_{t} }} } \right] \) and \( H = \left[ {h_{1} ,\,h_{2} ,\, \ldots \,,\,h_{{N_{t} }} } \right] \) to denote the original LR and HR feature vectors in the training set, respectively. Equation (3) can be reformulated as

$$ J_{CM} \left( {P_{L} ,P_{H} } \right) = \left\| {P_{L}^{T} L - P_{H}^{T} H} \right\|_{F}^{2} = tr\left( {\left( {P_{L}^{T} L - P_{H}^{T} H} \right)\left( {P_{L}^{T} L - P_{H}^{T} H} \right)^{T} } \right) $$
(4)

where \( \left\| \cdot \right\|_{F} \) is the Frobenius norm and \( tr( \cdot ) \) is the matrix trace operator. Furthermore, using some linear algebra, Eq. (4) can be rewritten as

$$ J_{CM} \left( {P_{L} ,P_{H} } \right) = tr\left( {\left[ {\begin{array}{*{20}c} {P_{L} } \\ {P_{H} } \\ \end{array} } \right]^{T} \left[ {\begin{array}{*{20}c} L & 0 \\ 0 & H \\ \end{array} } \right]\left[ {\begin{array}{*{20}c} I & { - I} \\ { - I} & I \\ \end{array} } \right]\left[ {\begin{array}{*{20}c} L & 0 \\ 0 & H \\ \end{array} } \right]^{T} \left[ {\begin{array}{*{20}c} {P_{L} } \\ {P_{H} } \\ \end{array} } \right]} \right) $$
(5)

We can further let \( P = \left[ {\begin{array}{*{20}c} {P_{L} } \\ {P_{H} } \\ \end{array} } \right] \), \( Z = \left[ {\begin{array}{*{20}c} L & 0 \\ 0 & H \\ \end{array} } \right] \) and \( A = \left[ {\begin{array}{*{20}c} I & { - I} \\ { - I} & I \\ \end{array} } \right] \), where \( I \) is the identity matrix. Finally, we can get a compact form as

$$ J_{CM} \left( {P_{L} ,P_{H} } \right) = tr\left( {P^{T} ZAZ^{T} P} \right) $$
(6)

\( P_{L} \) and \( P_{H} \) can be obtained by minimizing Eq. (6) under appropriate constraints; the details of the optimization procedure can be found in [12].
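For illustration, the following NumPy sketch solves the CM objective of Eq. (6) under a simple orthonormality constraint on P (our assumption for the sketch; [12] derives its constraint differently): minimizing \( tr(P^{T} ZAZ^{T} P) \) subject to \( P^{T} P = I \) amounts to keeping the eigenvectors of \( ZAZ^{T} \) with the smallest eigenvalues. All names are illustrative.

```python
import numpy as np

def train_cm(L, H, n_dims):
    """Sketch of Coupled Mapping training (Eq. 6), assuming P^T P = I."""
    m, Nt = L.shape          # LR features: m x Nt
    M, _ = H.shape           # HR features: M x Nt
    # Block-diagonal data matrix Z and coupling matrix A from Eq. (5)
    Z = np.block([[L, np.zeros((m, Nt))],
                  [np.zeros((M, Nt)), H]])
    I = np.eye(Nt)
    A = np.block([[I, -I], [-I, I]])
    # Eigenvectors of Z A Z^T with the smallest eigenvalues minimize the trace
    S = Z @ A @ Z.T
    eigvals, eigvecs = np.linalg.eigh(S)   # ascending eigenvalues
    P = eigvecs[:, :n_dims]                # (m + M) x n
    P_L, P_H = P[:m, :], P[m:, :]          # split into the two coupled mappings
    return P_L, P_H
```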

3 Proposed LMCM

The CM algorithm described above obtains the projection matrices under the criterion that each LR face image and its corresponding HR one should be as close as possible. However, this exploits only part of the verification information of the training data, namely the face image pairs belonging to the same subject. In this paper, we draw inspiration from Maximum Margin Projection (MMP) [16] and propose the LMCM algorithm for LR face recognition, which seeks linear coupled mappings that force a margin between the between-class distances and the within-class distances in the common feature space, as shown in Fig. 1. To achieve this, we utilize the verification information along with the local geometry and the identification information of the training data.

Fig. 1. Overview of the proposed LMCM algorithm. Different shapes represent different subjects.

Verification Information with Local Geometry:

Under this scenario, the verification information lies in the distances between face image pairs: pairs from the same subject should have a small distance, while pairs from different subjects should have a large distance.

In order to capture both the discriminant and the geometrical structure of the face images, we construct two graphs: a within-class graph \( G_{w} \) and a between-class graph \( G_{b} \). In graph \( G_{w} \), face images sharing the same identity are connected, while in graph \( G_{b} \), face images belonging to different subjects are connected. Let \( W_{w} \) and \( W_{b} \) denote the weight matrices of \( G_{w} \) and \( G_{b} \), respectively. As the HR features are considered to carry more discriminant information, we build these weight matrices in the original HR image space. They are defined as follows:

$$ W_{w,ij} = \left\{ {\begin{array}{*{20}l} {e^{{ - \frac{{\left\| {h_{j} - h_{i} } \right\|_{2} }}{\sigma }}} ,} & {if\; h_{i} ,h_{j} \; connected\; in \;G_{w} } \\ {0,} & {otherwise} \\ \end{array} } \right. $$
(7)
$$ W_{b,ij} = \left\{ {\begin{array}{*{20}l} {e^{{ - \frac{{\left\| {h_{j} - h_{i} } \right\|_{2} }}{\sigma }}} ,} & {if\; h_{i} ,h_{j} \; connected\; in\; G_{b} } \\ {0,} & {otherwise} \\ \end{array} } \right. $$
(8)

where \( \sigma \) is the mean pairwise distance between face images in the training data.
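A minimal NumPy sketch of Eqs. (7)–(8), assuming the graphs are fully connected within each group (all same-label pairs in \( G_{w} \), all different-label pairs in \( G_{b} \)), which is one plain reading of the text; names are illustrative.

```python
import numpy as np

def build_weight_matrices(H, labels):
    """Heat-kernel weights on the within-/between-class graphs (Eqs. 7-8),
    built in the original HR space. H: M x Nt, labels: length-Nt int array."""
    Nt = H.shape[1]
    # Pairwise Euclidean distances between HR feature vectors
    diff = H[:, :, None] - H[:, None, :]
    dist = np.linalg.norm(diff, axis=0)        # Nt x Nt
    sigma = dist.sum() / (Nt * (Nt - 1))       # mean pairwise distance
    same = labels[:, None] == labels[None, :]  # within-class mask
    kernel = np.exp(-dist / sigma)
    W_w = np.where(same, kernel, 0.0)
    W_b = np.where(~same, kernel, 0.0)
    return W_w, W_b
```

Note that the diagonal of \( W_{w} \) is nonzero, which is harmless here: in Eq. (9) it contributes the term pulling each LR image toward its own HR counterpart.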

Now, consider the problem of mapping LR and HR face images into a common subspace so that the face images connected in \( G_{w} \) stay as close as possible, while the face images connected in \( G_{b} \) stay as far apart as possible. Let \( P_{L} \) and \( P_{H} \) denote the projection matrices. A reasonable criterion for learning them is to optimize the following objective functions:

$$ \mathop { \hbox{min} }\limits_{{_{{P_{L} ,P_{H} }} }} \sum\nolimits_{i,j} {\left\| {P_{L}^{T} l_{i} - P_{H}^{T} h_{j} } \right\|_{2}^{2} W_{w,ij} + \left\| {P_{L}^{T} l_{i} - P_{L}^{T} l_{j} } \right\|_{2}^{2} W_{w,ij} + \left\| {P_{H}^{T} h_{i} - P_{H}^{T} h_{j} } \right\|_{2}^{2} W_{w,ij} } $$
(9)
$$ \mathop {\hbox{max} }\limits_{{P_{L} ,P_{H} }} \sum\nolimits_{i,j} {\left\| {P_{L}^{T} l_{i} - P_{H}^{T} h_{j} } \right\|_{2}^{2} W_{b,ij} + \left\| {P_{L}^{T} l_{i} - P_{L}^{T} l_{j} } \right\|_{2}^{2} W_{b,ij} + \left\| {P_{H}^{T} h_{i} - P_{H}^{T} h_{j} } \right\|_{2}^{2} W_{b,ij} } $$
(10)

where \( W_{w} \) and \( W_{b} \) represent the weight matrices of \( G_{w} \) and \( G_{b} \), respectively. The objective function (9), constructed on the within-class graph \( G_{w} \), imposes a large penalty if neighboring face images of the same subject in the original space are mapped far apart. Similarly, the objective function (10), constructed on the between-class graph \( G_{b} \), imposes a large penalty if neighboring face images belonging to different subjects are mapped close together. Together, these objectives force a margin between the face feature vectors of different subjects.

Following some simple algebraic steps, the objective function (9) can be reduced to the following matrix form

$$ \begin{aligned} & \mathop { \hbox{min} }\limits_{{_{{P_{L} ,P_{H} }} }} Tr\left( {P_{L}^{T} L\left( {2D_{w}^{L} + D_{w}^{H} - W_{w} - W_{w}^{T} } \right)L^{T} P_{L} + P_{H}^{T} H\left( {D_{w}^{L} + 2D_{w}^{H} - W_{w} - W_{w}^{T} } \right)H^{T} P_{H} } \right) \\ & \quad - \,Tr\left( {P_{L}^{T} LW_{w} H^{T} P_{H} + P_{H}^{T} HW_{w}^{T} L^{T} P_{L} } \right) \\ \end{aligned} $$
(11)

where \( D_{w}^{L} \) and \( D_{w}^{H} \) are diagonal matrices with diagonal entries \( D_{w,ii}^{L} = \sum\nolimits_{j} {W_{w,ij} } \) and \( D_{w,jj}^{H} = \sum\nolimits_{i} {W_{w,ij} } \).

Similarly, the objective function (10) can be reduced to a similar matrix form

$$ \begin{aligned} & \mathop {\hbox{max} }\limits_{{P_{L} ,P_{H} }} Tr\left( {P_{L}^{T} L\left( {2D_{b}^{L} + D_{b}^{H} - W_{b} - W_{b}^{T} } \right)L^{T} P_{L} + P_{H}^{T} H\left( {D_{b}^{L} + 2D_{b}^{H} - W_{b} - W_{b}^{T} } \right)H^{T} P_{H} } \right) \\ & \quad - \,Tr(P_{L}^{T} LW_{b} H^{T} P_{H} + P_{H}^{T} HW_{b}^{T} L^{T} P_{L} ) \\ \end{aligned} $$
(12)

where \( D_{b}^{L} \) and \( D_{b}^{H} \) are diagonal matrices with diagonal entries \( D_{b,ii}^{L} = \sum\nolimits_{j} {W_{b,ij} } \) and \( D_{b,jj}^{H} = \sum\nolimits_{i} {W_{b,ij} } \).

By a deduction similar to that from (5) to (6), we can rewrite Eqs. (11) and (12) as follows

$$ \mathop { \hbox{min} }\limits_{{_{{P_{L} ,P_{H} }} }} Tr(P^{T} ZA_{w} Z^{T} P) $$
(13)
$$ \mathop {\hbox{max} }\limits_{{P_{L} ,P_{H} }} Tr(P^{T} ZA_{b} Z^{T} P) $$
(14)

where \( {\text{P}} = \left[ {\begin{array}{*{20}c} {P_{L} } \\ {P_{H} } \\ \end{array} } \right] \), \( {\text{Z}} = \left[ {\begin{array}{*{20}c} L & 0 \\ 0 & H \\ \end{array} } \right] \), \( {\text{A}}_{w} = \left[ {\begin{array}{*{20}c} {2D_{w}^{L} + D_{w}^{H} - W_{w} - W_{w}^{T} } & { - W_{w} } \\ { - W_{w}^{T} } & {D_{w}^{L} + 2D_{w}^{H} - W_{w} - W_{w}^{T} } \\ \end{array} } \right] \), \( {\text{A}}_{b} = \left[ {\begin{array}{*{20}c} {2D_{b}^{L} + D_{b}^{H} - W_{b} - W_{b}^{T} } & { - W_{b} } \\ { - W_{b}^{T} } & {D_{b}^{L} + 2D_{b}^{H} - W_{b} - W_{b}^{T} } \\ \end{array} } \right] \).
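The block matrices \( A_{w} \) and \( A_{b} \) can be assembled mechanically from the corresponding weight matrices; a short sketch:

```python
import numpy as np

def assemble_graph_matrix(W):
    """Build the block matrix A_w (or A_b) of Eqs. (13)-(14) from the
    weight matrix W of the corresponding graph."""
    D_L = np.diag(W.sum(axis=1))   # row-sum degree matrix, D^L
    D_H = np.diag(W.sum(axis=0))   # column-sum degree matrix, D^H
    S = W + W.T
    return np.block([[2 * D_L + D_H - S, -W],
                     [-W.T, D_L + 2 * D_H - S]])

# Usage: A_w = assemble_graph_matrix(W_w); A_b = assemble_graph_matrix(W_b)
```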

Identification Information as Regularization Term:

The identification information assigns each face image to one of the subjects, which encourages the algorithm to learn projection matrices that map each face image into its own cluster. In this paper, we exploit the identification information by minimizing the within-class scatter. In learning the projection matrices \( P_{L} \) and \( P_{H} \), we aim to solve the following optimization problem:

$$ \mathop {\hbox{min} }\limits_{{P_{L} ,P_{H} }} S_{W} $$
(15)

where \( S_{W} \) is the within-class scatter matrix. Assuming the overall mean of the training data is zero, the scatter matrix is defined as:

$$ S_{W} = \sum\nolimits_{i} {(x_{i} - \mu_{i,c} )(x_{i} - \mu_{i,c} )^{T} } $$
(16)

where \( x_{i} \) is the n-dimensional feature vector obtained by projecting an HR or LR face image into the new common space, and \( \mu_{i,c} \) is the mean of the projected features of class c, to which \( x_{i} \) belongs. With some linear algebra, Eq. (16) can be rewritten in the following matrix form:

$$ S_{W} = \left( {X - U} \right)\left( {X - U} \right)^{T} $$
(17)

where U is the \( n \times 2N_{t} \) mean matrix whose i-th column is \( \mu_{i,c} \), and X is the \( n \times 2N_{t} \) data matrix whose i-th column is \( x_{i} \). Let \( \varLambda \) be a \( C \times C \) diagonal matrix whose i-th diagonal element \( \varLambda_{i} \) is the number of training samples in class i. These matrices can be expressed in terms of \( P_{L} \) and \( P_{H} \) as:

$$ U = P^{T} ZD\Lambda ^{ - 1} D^{T} $$
(18)
$$ X = P^{T} Z $$
(19)

where \( P = \left[ {\begin{array}{*{20}c} {P_{L} } \\ {P_{H} } \\ \end{array} } \right] \), \( Z = \left[ {\begin{array}{*{20}c} L & 0 \\ 0 & H \\ \end{array} } \right] \) and \( D = \left\{ {d_{ij} } \right\}_{{2N_{t} \times C}} \) with

$$ d_{ij} = \left\{ \begin{aligned} & 1,\; if\; x_{i} \, \in \,class\;j \\ & 0,\;if \;x_{i} \, \notin \,class \;j \\ \end{aligned} \right. $$
(20)

With (18) and (19), Eq. (17) can be rewritten as:

$$ S_{W} = P^{T} Z(I - D\Lambda ^{ - 1} D^{T} )(I - D\Lambda ^{ - 1} D^{T} )^{T} Z^{T} P $$
(21)
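Since Eq. (21) depends on P only through the outer factors, the data-space matrix \( Z(I - D\Lambda ^{ - 1} D^{T} )(I - D\Lambda ^{ - 1} D^{T} )^{T} Z^{T} \) can be precomputed once. A sketch (names illustrative):

```python
import numpy as np

def within_class_regularizer(Z, labels):
    """Data-space form of the S_W regularizer (Eq. 21), ready to plug into
    the generalized eigenproblem of Eq. (23). `labels` has length 2*Nt,
    repeating each subject label for the LR and HR copies of the data."""
    classes = np.unique(labels)
    D = (labels[:, None] == classes[None, :]).astype(float)  # 2Nt x C, Eq. (20)
    Lam_inv = np.diag(1.0 / D.sum(axis=0))   # Λ^{-1}; Λ_i = samples in class i
    J = np.eye(len(labels)) - D @ Lam_inv @ D.T  # per-class centering matrix
    return Z @ J @ J.T @ Z.T
```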

In this paper, the identification information is used as a regularization term. This is the main difference between our proposed algorithm and CMFA [18], where the identity matrix is used as the regularization term in the denominator; the identification term is a key factor in the performance improvement. Finally, the optimization problem with objective functions (13) and (14) reduces to

$$ \mathop {\hbox{max} }\limits_{{P_{L} ,P_{H} }} \frac{{Tr(P^{T} ZA_{b} Z^{T} P)}}{{Tr(P^{T} ZA_{w} Z^{T} P + \xi S_{W} )}} $$
(22)

where \( \xi \) is a balance factor between the verification and identification information. In the experiments below, this factor is set to 0.05.

The coupled projection matrices \( P_{L} \) and \( P_{H} \) that maximize the objective function (22) can be obtained by solving the generalized eigenvalue problem

$$ \left( {ZA_{b} Z^{T} } \right)P = \lambda \left( {ZA_{w} Z^{T} + \xi Z(I - D\Lambda ^{ - 1} D^{T} )(I - D\Lambda ^{ - 1} D^{T} )^{T} Z^{T} } \right)P $$
(23)
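Eq. (23) is a symmetric generalized eigenvalue problem that standard solvers handle directly. A sketch using scipy.linalg.eigh; the small ridge term eps is our addition for numerical stability, not part of the paper.

```python
import numpy as np
from scipy.linalg import eigh

def train_lmcm(Z, A_w, A_b, R_w, n_dims, xi=0.05, eps=1e-6):
    """Solve Eq. (23); the top eigenvectors maximize the trace ratio of
    Eq. (22). R_w is the data-space regularizer from Eq. (21)."""
    Sb = Z @ A_b @ Z.T
    Sw = Z @ A_w @ Z.T + xi * R_w
    Sw += eps * np.eye(Sw.shape[0])      # guard against singularity
    eigvals, eigvecs = eigh(Sb, Sw)      # generalized symmetric solver
    P = eigvecs[:, ::-1][:, :n_dims]     # largest eigenvalues first
    return P                             # split into P_L (top m rows), P_H (rest)
```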

After obtaining the projection matrices \( P_{L} \) and \( P_{H} \), we map both the LR and HR images into the common space and use the Euclidean distance to compare each probe-gallery pair, as described in (24).

$$ d_{ij} = \left\| {P_{L}^{T} l_{i} - P_{H}^{T} h_{j} } \right\|_{2}^{2} $$
(24)

Each probe image is assigned the identity of the gallery subject with the smallest distance. We use the True Positive Identification Rate (TPIR), also referred to as the Rank-1 Identification Rate in this setting, to measure the performance of our method, defined as follows

$$ TPIR = \frac{{\# \left( {correctly\;identified\;probe\;images} \right)}}{{\# \left( {probe\;images} \right)}} $$
(25)
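For reference, a sketch of the matching and evaluation steps of Eqs. (24)–(25); all names are illustrative.

```python
import numpy as np

def rank1_identify(P_L, P_H, probes_lr, gallery_hr, gallery_ids):
    """Project both sets into the common space (Eq. 24) and assign each
    probe the identity of its nearest gallery image."""
    Q = P_L.T @ probes_lr          # n x N_p projected LR probes
    G = P_H.T @ gallery_hr         # n x N_g projected HR gallery
    # Pairwise squared Euclidean distances between probes and gallery
    d = ((Q[:, :, None] - G[:, None, :]) ** 2).sum(axis=0)   # N_p x N_g
    return gallery_ids[np.argmin(d, axis=1)]

def tpir(predicted_ids, true_ids):
    """True Positive Identification Rate at rank 1 (Eq. 25)."""
    return np.mean(predicted_ids == true_ids)
```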

4 Experimental Results

To evaluate the effectiveness of the proposed method, we apply it to two public databases: FERET [6] and SCface [15]. Performance is measured by the rank-1 identification rate. Before projection, the gray-level pixel values of each image are normalized to zero mean, unit standard deviation, and unit norm.
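A sketch of this preprocessing under one reading of the description (standardize, then scale to unit norm; after the final scaling the standard deviation is of course no longer exactly 1):

```python
import numpy as np

def normalize_image(x):
    """Zero-mean, unit-variance standardization followed by unit-norm scaling."""
    x = (x - x.mean()) / x.std()
    return x / np.linalg.norm(x)
```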

4.1 Experimental Result on FERET Database

We follow the same test protocol as [17] in our experiments on a subset of the FERET database. The subset (ba, bd, be, bf, bg, bj, bk) contains 200 subjects with variations in illumination (bk), expression (bj), and pose (bd, be, bf, bg). We choose 50 subjects for training, and the remaining 150 subjects are used for testing. In the test phase, 4 images of each subject are selected as the gallery and the rest as probes. In the experiment, the HR face images and the corresponding LR ones are scaled to resolutions of 32 × 32 and 8 × 8, respectively. Figure 2 shows some of the HR (top row) and LR (bottom row) face images from the FERET database. To evaluate our proposed LMCM algorithm, we compare it with CLPM [12], SDA [19], CMFA [18], and the algorithm proposed in [17].

Fig. 2. HR (top row) and LR (bottom row) face images from the FERET database

Table 1 presents the experimental results of the LMCM algorithm on the FERET database. Our method with 53-D features achieves a recognition rate of 90.00 %, compared with 55.22 % for CLPM, 72.09 % for SDA, 75.98 % for CMFA, and 80.90 % for the coupled mapping method of [17]. The main reason is that our method makes better use of the supervised information of the training set. There are two main differences between CMFA and our proposed algorithm. First, we construct the weight matrices \( W_{w} \) and \( W_{b} \) in a different way, which captures more discriminant information than the method applied in CMFA. Second, we use the within-class scatter as the regularization term instead of the identity matrix, which exploits the identification information in the training data. Our proposed LMCM algorithm also shows a high capability to handle variations such as pose and expression in addition to low resolution. Table 2 reports the test time for each image pair.

Table 1. Rank 1 performance on FERET database. The values are rank-1 identification rate (%)
Table 2. Test time for each LR and HR image pair

4.2 Experimental Result on SCface Database

To assess recognition performance under surveillance conditions, we also evaluate LMCM on the SCface database, a database of static images of human faces [15] captured by surveillance cameras. Images were taken in an uncontrolled indoor environment using five video surveillance cameras at three different distances. The database contains 4,160 face images (in the visible and infrared spectrum) of 130 subjects, as shown in Fig. 3. Face images from different cameras and distances mimic real-world conditions. The subset used contains images from surveillance cameras cam1–cam5 at (I) a distance of 2.6 m (i.e., LR) and (II) a distance of 1.0 m (i.e., HR). The resolutions of the processed images are 48 × 48 and 16 × 16 for the HR and LR images, respectively.

Fig. 3. Examples of face images of one subject captured by one camera at three different distances

For this experiment, the protocol of [17] is implemented, and LMCM is compared with CLPM, SDA, CMFA, and the coupled mapping method of [17]. For the SCface database, 80 subjects are randomly selected to form the training set, and the remaining 50 subjects are used as the test set. This procedure is repeated 10 times, and the average results are presented in Table 3. Overall, the rank-1 recognition rates are much lower than on the FERET database due to the real-world challenges posed by the SCface database. The results show that our proposed LMCM algorithm improves LR face recognition significantly on SCface. The main reason is that LMCM learns the discriminant information between HR and LR face images, forcing a margin between the projections of identical and different subjects according to the recognition information. Compared to the other algorithms in Table 3, our proposed algorithm can clearly capture more such discriminant features for LR face recognition (Table 4).

Table 3. Experiment on SCface. The values are rank-1 identification rate (%)
Table 4. Test time for each LR and HR image pair

5 Conclusion

In this paper, we propose a novel algorithm to solve the low-resolution face recognition problem without an explicit SR procedure. Our method projects both the HR and LR face images into a new common feature subspace by maximizing the distance between features with different labels and minimizing the distance between features with identical labels. The objective function forces a margin between different subjects using both the identification and the verification information. Experimental results on the FERET and SCface databases show that our proposed method achieves promising performance. In future work, we will study applying nonlinear mappings via kernel methods and using more discriminative features instead of raw intensities.