Keywords

1 Introduction

Biometrics is a unique attribute possessed by every individual. It can be either physiological (e.g., face, iris, fingerprint, palmprint, etc.) or behavioral (e.g., gait, keystroke dynamics, mouse dynamics, etc.) attribute to identify the user. With rapid technological advancements, biometrics have replaced passwords or personal identification number (PIN) in several access control applications like financial, healthcare, immigration, surveillance, etc. Principal component analysis (PCA) [1] and linear discriminant analysis (LDA) [2, 3] are two popular approaches used in the biometric recognition process. PCA is an unsupervised learning approach which learns eigenfaces from covariance matrix generated by unlabeled training data. PCA preserves the distributions in training data but has no information about the classification. Hence, it fails miserably in pattern recognition (PR) applications. On the other hand, LDA is a supervised learning approach. It generates optimal projection by maximizing the between-class distance and minimizing the within-class distance, thereby LDA is more influential than PCA in PR applications.

During biometric recognition, it is obvious that the dimension of the modality like face, iris, etc., is higher than the number of sample images in the central database. This leads to “Small Sample Size” (3S) problem [4]. The 3S problem can be alleviated by increasing the number of sample images per person or by using a dimension reduction technique. The former one becomes infeasible due to the storage cost, effort, and time to be spent in collecting several sample images, hence selecting a dimension reduction technique is better than increasing the number of sample images per person. Among several linear dimension reduction techniques, random projection (RP) [5, 6] is a feasible approach as it maps a set of points in a high-dimensional space to a new set of points in lower-dimensional space while approximately preserving the pairwise distances between them.

Cancelable biometric system (CBS) [7] is a template securing mechanism which generates cancelable biometric templates out of original biometric attributes for a user-specific key. The cancelable biometric template is generated out of original biometric attributes using a secure and discriminability-preserving transformation function and user-specific key. In case of a database breach, the cancelable biometric templates are alone compromised. Hence, the privacy of the biometric attributes is preserved during attacks. The CBS also provides an added advantage of generating numerous and diverse cancelable biometric templates for various applications just by changing the transformation function and/or user-specific key, thereby preventing privacy threats. In this way, CBS provides high level of security, privacy, and revocability to biometrics that may help to increase public confidence for acceptance of biometric-based systems.

2 Literature Survey

CBS is a biometric template securing technique in which enrollment and authentication are performed in the transformed domain. The CBS transformation techniques have been broadly classified into biometric salting and non-invertible transforms as in [8]. Apart from these two techniques, several other categories of CBS have emerged in recent times.

Biometric salting can be defined as a transformation technique which generates cancelable biometric templates is by mixing in an artificial pattern. The mixing patterns can be a random/synthetic pattern or a pure random noise. Non-invertible transformation technique can be defined as a cancelable biometric template generation technique which uses a secret key as a constraint for the transformation function. Several other techniques have risen in recent times. For instance, hashing-based transformation [9,10,11] uses the index positions of the biometric feature template derived from several basic hashing techniques to generate cancelable templates. The Bloom filter-based transformations were introduced in [12]. The Bloom filters were initially introduced by Rathgeb et al. in [13,14,15] which map multiple code words to identical position to generate cancelable biometric templates.

The following research gaps have been identified from the literature survey.

  • The transformation is mostly applied at feature level which becomes time. There exists a research gap for generating cancelable templates by applying the transforms at the signal level

  • The 3S problem persists if the dataset contains less number of sample images

  • The security, privacy, revocability, and diversity of the biometric information of user must be preserved simultaneously while achieving good recognition rate

This work proposes a novel cancelable biometric recognition technique called Random Permutation-based Linear Discriminant Analysis (RPLDA) method that addresses the above issues. The approach aims to generate secure, revocable, non-invertible, privacy-preserving, and performance preserving templates even in the stolen token scenario.

The research contributions of the proposed system have been listed as follows:

  • The proposed cancelable biometric template generation system—RPLDA is capable of generating cancelable biometric templates which have been transformed at the signal level

  • The 3S problem has been alleviated by employing LDA

  • The recognition performance of RPLDA has been proved to be better in the transformed domain

  • The RPLDA satisfies the basic properties of CBS such as non-invertibility, diversity, unlinkability, and revocability which are evident from the research outcomes

The rest of the paper is organized as follows. Section 3 explains the proposed RPLDA technique, analysis of the relationship between LDA and RPLDA and its applicability as a cancelable biometric system. Section 4 mentions the experimental setup. Section 5 describes the results. Section 6 gives a brief conclusion.

3 Proposed Technique

Intermediate templates are generated by employing a random permutation matrix on the given training images of the biometric traits. The random permutation matrix is chosen to be the PIN. The random permutation matrix is selected to be a matrix whose entries are “0” and “1”, distributed randomly. Through the proposed technique, the cancelable features in the intermediate templates are recognized using LDA [2, 3]. The finally extracted cancelable features are the discriminant vectors which comprise the cancelable template.

3.1 Preliminaries of LDA

LDA is a popular feature extraction technique. The main objective of the LDA is to derive the direction along which the variance in the data is high.

Assume that \(x \in \Re^{d}\) is a column vector and it represents each image in “d” dimensional space. If there are “c” users, i.e., \(\{ 1,2,3, \ldots,c\}\), such that each user has \({{N}}_{i}\) images. Thus, the total number of training images is given by Eq. (1)

$$N = \sum\limits_{i = 1}^{N} {N_{i} }$$
(1)

Let \(\hat{x}\) represent the mean image vector of the training data as shown in Eq. (2)

$$\overset{\lower0.5em\hbox{$\smash{\scriptscriptstyle\frown}$}}{x} = \frac{1}{N}\sum\limits_{{i = 1}}^{N} {N_{i} \overset{\lower0.5em\hbox{$\smash{\scriptscriptstyle\frown}$}}{x} _{i} }$$
(2)

where the value of \(\overset{\lower0.5em\hbox{$\smash{\scriptscriptstyle\frown}$}}{x}_{i}\) is given by Eq. (3)

$$\overset{\lower0.5em\hbox{$\smash{\scriptscriptstyle\frown}$}}{x}_{i} = \frac{1}{{N_{i} }}\sum\limits_{i = 1}^{N} x$$
(3)

The total scatter matrix (M) for the training data is given by Eq. (4) as below:

$$M = M_{w} + M_{b}$$
(4)

where \(M_{{\text{w}}}\) and \(M_{{\text{b}}}\) are within-class scatter matrix and between-class scatter matrix, respectively as given by Eqs. (5) and (6).

$$M_{{\text{w}}} = \sum\limits_{i = 1}^{c} {(x - \overset{\lower0.5em\hbox{$\smash{\scriptscriptstyle\frown}$}}{x}_{i} )(x - \overset{\lower0.5em\hbox{$\smash{\scriptscriptstyle\frown}$}}{x}_{i} )^{t} }$$
(5)
$$M_{{\text{b}}} = \sum\limits_{i = 1}^{c} {N_{i} (\overset{\lower0.5em\hbox{$\smash{\scriptscriptstyle\frown}$}}{x}_{i} - \overset{\lower0.5em\hbox{$\smash{\scriptscriptstyle\frown}$}}{x} )(\overset{\lower0.5em\hbox{$\smash{\scriptscriptstyle\frown}$}}{x}_{i} - \overset{\lower0.5em\hbox{$\smash{\scriptscriptstyle\frown}$}}{x} )^{t} }$$
(6)

The criterion function (J) for a projection matrix \(\psi = \{ w_{1} \left| w \right._{2} \left| . \right....\left| w \right._{c - 1} \}\) is given by Eq. (7)

$$J(\psi ) = \frac{{\left| {\psi^{t} M_{b} \psi } \right|}}{{\left| {\psi^{t} M_{w} \psi } \right|}}$$
(7)

For an optimal projection matrix, say \(\psi^{*}\) the eigenvectors corresponding to the largest eigenvalues and can be modeled as

$$\psi^{*} = \{ w_{1}^{*} \left| w \right._{2}^{*} \left| . \right....\left| w \right._{c - 1}^{*} \} .$$

Thus, the transformed data points (T) are given by Eq. (8)

$$T = \psi^{t} x$$
(8)

3.2 Proposed RPLDA Scheme

A random permutation matrix is projected on the sample image of the biometric traits of the users thereby generating an intermediate template which is a column vector. The random permutation matrix is chosen to be an involutory matrix to convert the given sample image into a column vector. The cancelable templates are generated by extracting the LDA features out of the intermediate template. Thus, the biometric patterns can be renewed or revoked whenever required just by changing the random permutation matrix.

If x is an input sample, then a random permuted image (x′) or an intermediate template can be generated from the input sample x using a random permutation matrix R, as represented in Eq. (10).

$$x' = Rx$$
(10)

The LDA features of the intermediate template are determined to construct the cancelable template.

The various entities involved in the generation of the cancelable templates using the proposed RPLDA scheme have been illustrated in Fig. 1. The users are enrolled in an application during the enrollment phase. The users are verified during the verification phase. During the enrollment phase, the biometric image sensor captures the image samples of the biometric trait of the user. The random permutation matrix projection unit generates a random permutation matrix which is an involutory matrix. This random permutation matrix is projected on the image sample of the user to generate an intermediate template. The LDA feature extraction unit extracts the features from the intermediate template to generate a cancelable template which is stored in the database.

Fig. 1
figure 1

Enrollment phase and authentication phase in proposed system

During the authentication phase, the user again produces the biometric trait to the biometric imaging sensor. The random permutation matrix projection uses the same matrix which was generated in the enrollment phase from the user to generate a query intermediate template. The LDA feature extraction unit extracts LDA features from the query intermediate template and generates a query cancelable template. This query template is matched with the cancelable template stored in the database by the matching unit to grant access to the application, in case if the templates match.

A sample face image and its corresponding cancelable template are shown in Fig. 2. The major contribution of the research work is the relationship between eigenvalues and eigenvectors of original biometric images and cancelable templates. The eigenvalues possessed by the original biometric patterns and the cancelable templates are the same. But the eigenvectors of the cancelable template are randomly permuted form of the eigenvectors of the original biometric images. The randomly permuted intermediate templates correspond to the random permutation matrix.

Fig. 2
figure 2

a Sample face image from ORL dataset; b cancelable template generated from a sample face image in (a) using proposed RPLDA; c sample iris image from UBIRIS dataset; d cancelable template generated from sample iris image in (c) using proposed RPLDA; e sample ear image from IITD dataset; f cancelable template generated from sample ear image in (e) using proposed RPLDA

4 Experimental Setup

The cancelable template has been generated using proposed RPLDA technique using the iris, face and ear biometrics are chosen from publicly available datasets like UBIRIS [16], ORL [17], and IITD [18] databases, respectively. Table 1 displays the details of the databases.

Table 1 Summary of datasets used in experiments

The performance of the proposed techniques is evaluated in terms of the security provided by the proposed technique, average classification accuracy, and average training time of the algorithm. The training image from each identity is selected randomly and the remaining images are used as testing set. This process is repeated 40 times to achieve a stable classification accuracy and average training time. All the experiments are performed on Intel Xeon E3 CPU 2.4 GHz with Windows 7 and 8 GB memory.

5 Results and Discussions

In this section, the performance of the proposed technique has been analysed both qualitatively and quantitatively with other state-of-the-art methods, viz. Gray-Salt (GS) PCA, Block-Remapping (BR) PCA, and RPPCA [19]. The cancelable templates were generated independently using each technique. The classification accuracy for each technique has been determined using nearest neighborhood technique. Then the classification accuracies have been compared with the classification accuracy of the proposed RPLDA technique.

The classification accuracy measures the percentage of identities correctly classified by some technique or algorithm. It depends on factors such as the number of training used and the number of dimensions in the transformed representation. The clssification accuracy of the proposed RPLDA technique has been reported in Table 2. The classification accuracy of the proposed technique is higher when compared to the state-of-art techniques like GSPCA, BRPCA and RPPCA as shown in Table 2.

Table 2 Classification accuracy achieved by proposed RPLDA technique

The equal error ratio (EER) is the metric which is used to measure the matching performance of a CBS. The EER value must be as low as possible to indicate that CBS has a good matching performance. The EER value of the proposed system is calculated and compared with the EER values of the RPPCA, GSPCA, and BRPCA. The EER comparative results have been listed in Table 3. It can be inferred from Table 3 that the EER of the proposed system is lower than the other state-of-the-art techniques,

Table 3 EER (%) achieved by the proposed RPLDA technique

The average training time required by the proposed techniques is shown in Table 4. The RPLDA has been trained using only few training images of each user to alleviate 3S problem [20]. The average training time over 40 runs of the proposed algorithm was measured on all the datasets as shown in Table 4. It is observed that the training time of the proposed technique increases with an increase in the number of training images. It can be readily observed from Table 4 that the training time required by RPLDA is significantly less and feasible.

Table 4 Training time (in seconds) required by the proposed RPLDA technique

The cancelable templates generated using the proposed system are non-invertible. If a brute-force attack has been simulated against the cancelable templates generated using the proposed system, then the number of iterations required to reverse-engineer the cancelable templates is completely dependent on the input image size. It will take (112 × 92)!, (150 × 200)!, and (180 × 50)! for achieving a preimage of the original biometric image. It is very difficult to reverse-engineer the cancelable templates generated using the proposed system. Hence, the cancelable templates generated using the proposed system are non-invertible.

The cancelable templates generated using the proposed system are diverse. A user can register to different applications using different cancelable templates which correspond to a single biometric pattern. This is achieved just by changing the random permutation matrix. With the change in the pattern of the entries in the random permutation matrix, the cancelable templates also change. Assume if there are “n” entries in a random permutation matrix, then “n” different cancelable templates can be generated from the input biometric pattern.

6 Conclusion

In this research work, a simple yet powerful technique called as random permutation-based linear discriminant analysis (RPLDA) has been proposed. The proposed technique is applicable in cancelable biometric recognition. The users are required to provide their biometric trait and the PIN which is the random permutation matrix. The random permutation matrix is projected on the biometric image and an intermediate template is generated. The LDA extracts cancelable features from the intermediate template and generates the cancelable template. The effectiveness of the proposed technique has been illustrated by the results of the experiments conducted on different datasets. The classification accuracy of the proposed technique is found to be better than the state-of-the-art techniques. The training time is also found to be very less. The proposed technique is also proved to be effective against 3S problem. The future study can be directed toward application of diagonal LDA for cancelable biometric recognition.