1 Introduction

The human face has served as a marker of identity across civilizations for centuries, and it has been one of the most actively studied topics in computer vision and machine learning research for more than five decades. Compared with other common biometrics such as iris, retina, or fingerprint identification, face recognition can identify uncooperative subjects efficiently and at a distance. In recent years, many successful research outcomes have been reported for images captured in controlled environments, but in unconstrained environments with variable illumination and diverse poses, face recognition remains a complex and challenging problem [65, 73, 168, 206].

Face recognition has been an active research area for the past few decades. During the last five years especially, it has emerged as one of the most flourishing research topics across image processing, computer vision, machine learning, artificial intelligence, and visual surveillance, owing to its broad social, scientific, and commercial applications. The primary goal of a face recognition system is to recognize human identity from static images [196], video data [264], or data streams [183], including knowledge of the context in which these data are being used. Due to rapid progress in sensors and imaging technologies, face recognition systems are now broadly used in many real-world applications: human-computer interaction, human-robot interaction, person identification for security, face as a mobile biometric, law enforcement, voter identification, counter-terrorism, border control and immigration, day-care centers, social security, banking, e-commerce [93, 132, 138], and so on. With this wide range of applications, face recognition has received remarkable attention from both commercial stakeholders and research communities, and there is an essential demand for robust algorithms capable of dealing with complex real-world situations. As illustrated in Fig. 1, the number of peer-reviewed articles has risen continuously from 2000 to 2020; we collected these data with the keywords “face recognition” and “face identification” from two well-known bibliographic sources, dblp.org and Web of Science. Many literature reviews on face recognition and identification have appeared during the last three decades, and several recent ones [47, 93, 96, 200, 219, 229, 281, 284] have also been considered here. The main contributions of this paper are summarized as follows:

  • In recent years, many reviewers [1, 5, 17, 38, 44, 45, 52, 79, 149, 217, 229, 258, 285] have covered only a limited aspect of the face recognition literature, such as pose [52, 285], illumination [79], dimension [1, 38], feature types [229, 258], or occlusion [17, 44]. We attempt to cover all aspects of face recognition technology, from its past and present through to future directions, so that this survey can act as a benchmark for next-generation researchers entering the domain. To the best of our knowledge, this is a unique effort in recent years.

  • We conducted a comprehensive analysis and cross-comparison of more than 32 classical (Table 3), 24 deep learning (Table 4), 16 dictionary learning (Table 5), and 8 fuzzy logic (Table 6) based face recognition approaches. We also summarize the recognition rate of each approach by category (Tables 3-6, respectively) to give insight into the experimental results of existing models.

  • Since many literature reviews already cover classical face recognition techniques (work done before 2015), we summarize the classical work concisely and illustrate recent deep learning, dictionary learning, and fuzzy logic based face recognition techniques in more detail.

  • We summarize face recognition approaches in a novel categorical way, as presented in Fig. 6; for each category, we trace the developments in chronological order.

  • One of our contributions is to integrate a complete overview of face recognition research, state-of-the-art technologies, common algorithms, popular vendors (Table 2) and recent face datasets (Table 1) in practice today.

Fig. 1 Temporal frequency (2000 to 2020) of peer-reviewed articles available on (a) dblp.org and (b) Web of Science

Table 1 Key characteristics of publicly available 2D & 3D face datasets; the acronym # Sub denotes the number of subjects, VRI variable-resolution images, and MC multiple cameras with different hardware specifications
Table 2 Leading face recognition technology vendors
Table 3 Summary of selected classical face recognition approaches (the acronym SCD denotes a self-created dataset with the specified number of subject images)
Table 4 Summary of deep learning based face recognition approaches
Table 5 Summary of dictionary learning based face recognition methods; the acronym Ext. YB denotes Extended Yale-B and FR face recognition
Table 6 Summary of selected fuzzy logic based face recognition algorithms; the acronym Ext. YB denotes Extended Yale-B and FR face recognition

The remainder of this review is organized as follows. Before diving into the details of traditional algorithms, a general overview of a face recognition system is presented in Section 2. We then introduce popular, publicly available face recognition datasets in Section 3. Section 4 discusses leading face recognition technology vendors around the globe and key application areas of face recognition technologies, followed by major challenges in Section 5. Section 6 categorically presents four frameworks of face recognition methodologies. Finally, we close with future research directions in Section 7 and the conclusion in Section 8.

2 A general face recognition system

Generally, a face recognition system treats the input image as a classification problem. An overview of a face recognition system is presented in Fig. 2.

Fig. 2 An overview of a face recognition system

2.1 Acquisition and preprocessing

Image acquisition and preprocessing is the first step of any image processing system for feature extraction and image understanding. Target images are extracted from the input source, either a video stream or static images. Image preprocessing is an integral part of image processing systems and helps obtain accurate results and suppress noise [13]. At this stage, systems face challenges that can hinder the overall process, especially in unconstrained environments, including varying image background, pose, aging, illumination, and expression. During the last two decades, different datasets have been developed that offer comprehensive image sets for testing the reliability of newly proposed algorithms, and a substantial amount of work has addressed these challenges [13, 121, 228, 285]. Preprocessing also depends on the area of application: for a smartphone user, acquiring a clear face is far less challenging than acquiring the face of a suspicious person in a crowded place such as an airport. Preprocessing is always recommended as one of the initial stages, since the performance of feature extraction depends on it. In unconstrained environments, where learning depends on environmental settings, post-processing of the data can also be considered to remove noise from the given image.
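The sketch below illustrates this stage with OpenCV; it is a minimal example, and the input file name and target size are illustrative assumptions rather than fixed choices.

```python
import cv2

def preprocess_face(path, size=(112, 112)):
    """Minimal preprocessing: load, grayscale, denoise, equalize, resize."""
    img = cv2.imread(path)                        # acquisition from a static image
    gray = cv2.cvtColor(img, cv2.COLOR_BGR2GRAY)  # most classical pipelines use grayscale
    gray = cv2.GaussianBlur(gray, (3, 3), 0)      # mild smoothing to suppress sensor noise
    gray = cv2.equalizeHist(gray)                 # histogram equalization reduces illumination bias
    return cv2.resize(gray, size)                 # normalize spatial resolution for later stages

face = preprocess_face("subject_01.jpg")          # "subject_01.jpg" is a placeholder path
```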

2.2 Face detection

Face detection is the process of localizing and extracting the face region from the background. It is another essential step in the overall facial recognition process and has been well studied in the field of computer vision [167]. Early approaches such as the Viola-Jones face detector [252] were capable of detecting facial regions in real time from input images. Over time, face detection became an active research area in its own right and a major part of any visual face understanding framework, and numerous research efforts [35, 49, 167, 181, 197, 305] have addressed robust face detection through a variety of algorithms in recent years.
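As a concrete illustration, the classical Viola-Jones detector is available through OpenCV's pretrained Haar cascades; the sketch below is a minimal usage example, with the input image path as a placeholder.

```python
import cv2

# Pretrained Haar cascade shipped with OpenCV (the classical Viola-Jones detector).
cascade = cv2.CascadeClassifier(
    cv2.data.haarcascades + "haarcascade_frontalface_default.xml")

img = cv2.imread("group_photo.jpg")   # placeholder input image
gray = cv2.cvtColor(img, cv2.COLOR_BGR2GRAY)

# Each detection is an (x, y, w, h) bounding box around a candidate face region.
faces = cascade.detectMultiScale(gray, scaleFactor=1.1, minNeighbors=5, minSize=(30, 30))
for (x, y, w, h) in faces:
    face_crop = gray[y:y + h, x:x + w]  # separate the face region from the background
```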

2.3 Feature extraction

Feature extraction is the core activity of any face recognition system and has a significant effect on overall performance. Various feature extraction models have been developed, such as SIFT, SVM, STIP, and STISM [13, 71, 94]; we discuss these and many other recent feature extractors and descriptors later in this paper. Feature extraction methods can be classified along several axes, such as global vs. local methods [229], hand-crafted vs. learning-based methods, and 2D vs. 3D feature extraction methods [127].
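For instance, local SIFT descriptors, one of the hand-crafted extractors mentioned above, can be computed with OpenCV; this sketch assumes a recent OpenCV build in which SIFT is included, and the file name is a placeholder.

```python
import cv2

gray = cv2.imread("face.jpg", cv2.IMREAD_GRAYSCALE)  # placeholder aligned face image

sift = cv2.SIFT_create()  # local keypoint detector and descriptor
keypoints, descriptors = sift.detectAndCompute(gray, None)

# `descriptors` is an (N, 128) array: one 128-D local descriptor per detected keypoint.
print(len(keypoints), descriptors.shape)
```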

2.4 Feature selection

In most cases, a feature extractor or descriptor generates a large feature space for a single image, and the problem becomes even more complex when recognizing a person from a video stream. The large feature space is therefore further processed with methods such as PCA, SVD, MDS, LDA, and LDR [43, 296] to select the principal features and reduce dimensionality. This has a positive impact on system performance because it reduces overall cost and recognition time.
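A minimal sketch of this step with scikit-learn's PCA is shown below; the matrix sizes and the number of retained components are illustrative assumptions.

```python
import numpy as np
from sklearn.decomposition import PCA

# X holds one flattened feature vector per face image (random placeholder data).
rng = np.random.default_rng(0)
X = rng.standard_normal((400, 4096))      # 400 images, 4096 raw features each

pca = PCA(n_components=100, whiten=True)  # keep the 100 principal components
X_reduced = pca.fit_transform(X)          # (400, 100): compact feature space

# Fraction of the original variance retained by the selected components.
print(pca.explained_variance_ratio_.sum())
```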

2.5 Feature matching

Once the principal features have been obtained from the input face image, they are iteratively matched against an existing feature database for the target objective. As shown in Fig. 2, face identity recognition is an iterative process: the system retrieves the matching identity from the database. Many well-known classification techniques such as SVM, RBFNN, NC-K-mean, PNI, and CNN [13, 241] are used for feature matching.
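The simplest form of matching is a nearest-neighbor search over the gallery of enrolled feature vectors; the sketch below uses cosine similarity on random placeholder features.

```python
import numpy as np

def match_identity(probe, gallery, labels):
    """Return the enrolled label whose feature vector is closest to the probe."""
    g = gallery / np.linalg.norm(gallery, axis=1, keepdims=True)  # normalize gallery
    p = probe / np.linalg.norm(probe)                             # normalize probe
    scores = g @ p                                                # cosine similarities
    best = int(np.argmax(scores))
    return labels[best], float(scores[best])

gallery = np.random.rand(50, 100)                 # 50 enrolled identities (placeholders)
labels = [f"person_{i}" for i in range(50)]
identity, score = match_identity(np.random.rand(100), gallery, labels)
```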

3 Known benchmark databases

Performance evaluation is one of the key research challenges for a face recognition system. A large number of face recognition benchmarks have been established and made publicly available to facilitate the evaluation of newly proposed algorithms. Table 1 summarizes the most widely used 2D and 3D face databases. The significance of each dataset is characterized through its imaging conditions: the total number of images, number of subjects, color (gray or RGB), reported resolution of subject images, and year. Below, we give a brief introduction and note a few limitations of each dataset in light of the literature studied for this survey.

  • MS-Celeb-1M [82]: The Microsoft Celeb dataset contains around 8.2 million face images harvested from the internet to support the development of face recognition technologies. It is one of the largest publicly available face image collections, covering around 100,000 individuals, most of whom are American and British actors and popular internet celebrities. Since the source images were not purposefully recorded, the dataset offers very limited control over illumination and pose.

  • PICS [244]: The Psychological Image Collection at Stirling (PICS) is another large collection of face images, maintained especially for facial expression research. The images are organized into sets, and each set contains other objects in addition to facial images; the presence of these other object images can be confusing for specialized facial recognition techniques.

  • CAS-PEAL [64]: The Chinese Academy of Sciences Pose, Expression, Accessory, and Lighting (CAS-PEAL) face dataset was constructed as a key project of the National Hi-Tech Program by the Chinese Academy of Sciences. It was created as a central repository of face images from different Chinese ethnic groups. It provides strong coverage of pose, expression, and illumination variation but is limited with respect to challenges like aging and occlusion; moreover, because it contains only Chinese faces, algorithm evaluation on it is demographically bounded.

  • SoF [4]: The Specs on Faces (SoF) dataset contains facial images with variable pose, illumination, and occlusion effects; the collection specifically targets occlusion under variable illumination. Moreover, three image filters that may evade face detectors and facial recognition systems were applied to each image, and the generated images are categorized into three levels of difficulty: easy, medium, and hard.

  • CMU-PIE [225]: The Carnegie Mellon University Pose, Illumination, and Expression (CMU-PIE) dataset was collected between October and December 2000. Each subject has images under 13 different poses, 43 different illumination conditions, and 4 variable expressions. In the initial version, all images were recorded in a single session, i.e., the dataset offers no support for aging variation.

  • FERET [191]: The Facial Recognition Technology (FERET) dataset was initially collected through a collaborative effort between Dr. Harry Wechsler at George Mason University and Dr. Jonathan Phillips at the Army Research Laboratory in Adelphi, Maryland. Although later versions have been improved, it still has certain limitations with respect to state-of-the-art challenges.

  • LFW [103]: The Labeled Faces in the Wild (LFW) dataset is very common in the academic literature for general-purpose face recognition evaluation. Its downside is the limited number of images per subject, which hampers evaluation against state-of-the-art challenges such as illumination, pose, and occlusion variation.

  • SCface [77]: The Surveillance Cameras Face Database (SCface) was purposefully recorded in an uncontrolled indoor environment using five cameras of varying quality. The collection involves only 130 individuals, which is a major limitation for machine learning based algorithms.

  • AR [172]: The AR face database was created by Aleix Martinez and Robert Benavente at the Computer Vision Center (CVC) at the U.A.B. It contains over 4,000 color images of 126 individuals (70 men and 56 women). Images feature frontal-view faces with different facial expressions, illumination conditions, and occlusions (sunglasses and scarf). Each person was photographed twice, in two sessions separated by a 14-day gap.

  • FEI [242]: The FEI is a Brazilian face dataset containing images purposefully recorded between June 2005 and March 2006 at the Artificial Intelligence Laboratory of FEI in Sao Bernardo do Campo, Sao Paulo, Brazil. Most images are of students and staff of the FEI lab, aged between 19 and 40, with variable pose, appearance, and hairstyle. All images were taken against a white homogeneous background, which is not representative of diverse real-world conditions.

  • NIST-GBU [192]: The National Institute of Standards and Technology Good, Bad, and Ugly (NIST-GBU) face dataset was part of the Face and Ocular Challenge Series (FOCS) project. It consists of three frontal still face images per participant, with variable illumination effects in both indoor and outdoor environments.

  • Oulu Physics [171]: The University of Oulu Physics-Based face dataset was collected at the Machine Vision and Media Processing Unit, University of Oulu. It has two unique properties: first, 16 different cameras with variable illumination conditions were used to record the images; second, the camera channel responses and the spectral power distributions of the illuminants are also provided. The database may be of general interest to face recognition researchers and of specific interest to color researchers.

  • BANCA [18]: The Biometric Access Control for Networked and E-Commerce Applications (BANCA) face database was collected as part of the European BANCA project. Both high- and low-quality microphones and cameras were used. The subjects were recorded in three scenarios (controlled, degraded, and adverse) over 12 sessions spanning three months.

  • Yale B [70]: Yale and Yale B are very common in the academic literature for face recognition evaluation. The initial Yale face dataset has only 165 grayscale GIF images of 15 individuals. The extended version, Yale-B, contains more participants, with 64 illumination conditions and 9 pose angles per subject.

  • MIT-CBCL [220]: The MIT-CBCL face recognition database contains face images of 10 subjects. It provides two training sets: the first contains high-resolution images with frontal, full-face, and half-face views; the second is a synthetic part containing 324 images per subject rendered from 3D head models of the 10 subjects. The head models were generated by fitting a morphable model to the high-resolution training images.

  • AT&T/ORL [64]: The AT&T database of faces formerly referred to as the ORL face database contains 40 distinct subjects and 10 images for each subject. Images were recorded between April 1992 and April 1994 by the Speech, Vision and Robotics Group of the Cambridge University engineering department lab. The images were taken at different times, varying the lighting, facial expressions (open or closed eyes, smiling or not smiling), and facial details (glasses or no glasses). All the images were taken against a dark homogeneous background with the subjects in an upright, frontal position.

  • FRGC [190]: The Face Recognition Grand Challenge (FRGC) contains both 2D and 3D facial images. Each subject was recorded in multiple sessions, each comprising four controlled still images, two uncontrolled still images, and one three-dimensional image.

  • UoY-2 [245]: The University of York 3D face database was gathered to facilitate research into 3D face recognition. Currently, only a limited part of the dataset is publicly available, and it has certain shortcomings regarding illumination and aging variation.

  • BU-3DFE [93]: The Binghamton University 3D Facial Expression (BU-3DFE) dataset is a fine collection of face images of White, Black, Central Asian, Indian, Hispanic, and Latino people, but it is narrow in terms of pose and illumination variation. The dataset contains seven expressions for each subject at four different intensity levels.

  • FRAV3D [41]: The FRAV3D face dataset was collected by Universidad Rey Juan Carlos, Spain. All data were acquired with a low-resolution Minolta VIVID 700 scanner. It is not suitable for aging-invariant evaluation.

  • Texas 3DFRD [243]: The University of Texas 3D face dataset is a collection of 3D color face images of 105 subjects. These images were acquired using a stereo imaging system at high spatial resolution of 0.32 mm along the x, y, and z dimensions. During the acquisition process, the color and range images were captured simultaneously.

  • GavabDB [216]: This dataset was compiled by the GAVAB research group at Universidad Rey Juan Carlos, Spain, and has seen limited use in the face recognition literature. Each image consists of a three-dimensional mesh representing a face surface. There are systematic variations in the pose and facial expression of each person: in particular, 2 frontal and 4 rotated images without any facial expression, and 3 frontal images in which the subject presents different, accentuated facial expressions.

  • BJUT-3D [20]: BJUT-3D is a Chinese 3D face dataset from the Multimedia and Intelligent Software Technology laboratory in Beijing. The design and construction of the database mainly comprise the acquisition of prototypical 3D face data, preprocessing and standardization of the data, and the structure design. Currently, BJUT-3D is the largest Chinese 3D face database in the world.

4 Face recognition applications

In today's increasingly digital society, face recognition systems are being deployed by leading technology companies around the world. Due to potential commercial applications and growing market trends, there is healthy competition among these companies to deliver the best possible performance. An overview of vendors providing face recognition solutions is given in Table 2. Leading technology companies such as IBM, Google, Microsoft, and Apple are competing for a winning position in face recognition technology. A search of [86] with the keywords “face recognition” and “face identification” shows that the number of granted patents has increased drastically over the past two decades. As shown in Fig. 3, companies show a growing interest in owning face recognition frameworks through patent registration; statistics from WIPO (World Intellectual Property Organization) and the USPTO (United States Patent and Trademark Office) confirm the same trend. After two successful facial identification patents in August 2017 and January 2018, Google has filed a series of further facial recognition patents [86]. Leveraging these, Google will be able to identify faces in personal communications, social networks, collaborative apps, blogs, and much more.

Fig. 3 A summary of patents registered on face recognition technologies; data collected through multiple sources such as WIPO, USPTO, and Google Patents

In this section, we summarize the rich application landscape and commercial significance of face recognition. Popular vendors specializing in face recognition, such as IBM, Innovatrics, Advanced Biometrics, and IDEMIA, provide convenient, reliable, and flexible solutions with worldwide installations. We examine the key application areas where face recognition technologies are increasingly adopted and are revolutionizing automation capabilities.

4.1 Access control

With the increasing popularity of face recognition systems, they have been adopted by various automatic access control mechanisms for human-machine interaction, and they have replaced other authentication methods such as password protection, fingerprint, and iris verification. Furthermore, as smartphone and CCTV cameras have become widespread, face-based authentication has become feasible for many real-world applications. Hardware-based verification systems are rapidly being extended to face-based authorization for single sign-on to multiple networked services. Face-based access to automatic teller machines (ATMs), online funds transfer, and encrypted data is also becoming popular in a variety of social settings [200]. For the same reasons, face identity-based billing, cheque processing, access to sensitive laboratories, courier services for sensitive items, and face identity-based automatic access control in general are in demand everywhere.

4.2 Surveillance

Face recognition is one of the most important yet challenging tasks for fully automatic and smart surveillance systems. Surveillance is defined as close observation or monitoring, especially of a suspected spy or criminal, and it is one of the most important and widespread face recognition applications. Such systems are created to meet security objectives for both outdoor and indoor public crowds, for example monitoring public areas, airport halls, banks, and geographical borders. Due to the huge data volumes acquired by camera networks, brute-force detection algorithms are not sufficient to intelligently recognize suspects and terrorists in sensitive places. State-of-the-art face recognition research provides a platform for intelligent video surveillance systems involving both hardware and software aspects, such as automatic interfaces, pattern recognition, signal processing, and machine learning algorithms, to achieve highly promising results [292]. Additionally, the recognition capabilities of machine-based techniques have proved more efficient than those of humans [164] in real-world applications. To leverage the capabilities of both humans and machines, face recognition in surveillance systems can effectively support human operators in carrying out complex monitoring and recognition tasks. Although state-of-the-art surveillance methods achieve satisfactory results, many challenges remain, such as occlusion, blurred subject images, and limited training data [22, 77, 96].

4.3 Entertainment

Face recognition has also become increasingly popular in the entertainment sector. The most exciting areas are virtual reality, mobile gaming, human-robot interaction, human-computer interaction, training, and theme-park gaming zones [48]. Recently, T. Feltwell et al. [59] introduced an interesting game that asks players to capture the likeness of members of the public. It is motivated by free-to-play models and the phenomenal success of the famous game “Pokemon GO”, but proposes a different experience in which players hunt and capture members of the public in the real world.

4.4 Law enforcement

Face recognition systems have proved to be a highly effective tool for law enforcement bodies in identifying criminals and finding missing persons. Manually examining hours of video material to search for a specific identity is a tedious task for law enforcement officials. For example, it was recently reported [87] that law enforcement agencies in China took just seven minutes to locate BBC reporter John Sudworth using a powerful CCTV network of more than 170 million cameras together with face recognition technology. Face recognition research brings a new generation of intelligent and efficient investigative capabilities to law enforcement bodies [213]. Overstaying and illegal residency are further challenging problems for densely populated communities all over the world, and advances in face recognition technology are a step forward in identifying illegal immigrants and visa overstayers. Furthermore, face recognition technology is also in practice for banking [144], voter identification [170], crime investigation [23, 96], counter-terrorism [60], and immigration [126] purposes.

4.5 Other common applications

Recently, Kwon and Lee [132] introduced a comprehensive set of techniques for face recognition in software applications. Similarly, Salici and Ciampini [214] presented face recognition applications for a forensic investigation department; experiments on 130 real identification cases proved successful and were validated by forensic experts. Another recent application was introduced by Calo et al. [29] for privacy control when viewing, updating, and destroying digital information.

5 Key challenges for face recognition

Face recognition research in controlled environments has proved very successful and meets the requirements of many social applications; however, substantial challenges remain in uncontrolled environments, where subjects are dynamic and their variations are difficult to capture from a machine perspective. In this section, we describe the main challenges for face recognition technologies.

5.1 Pose variation

Pose variation mainly refers to the change or rotation of the subject's face out of the image plane, viewed from a 2D or 3D perspective [289]. Pose is the most difficult recognition challenge, especially when the subject is uncooperative, as when searching for a spy in a public crowd, detecting terrorists at an airport, or finding thieves in large stores. The most suitable solution might be to collect multiple gallery images of the subject, but this is impractical in most real-world applications. For example, if only a passport photo of every person is stored in the image database, how can we identify a terrorist in a football stadium crowd? Numerically, the difference between two images of the same face under different poses can be greater than the difference between images of two different persons. Local approaches such as Local Binary Patterns (LBP) and Elastic Bunch Graph Matching (EBGM) are therefore considered more effective than holistic approaches [1]. Beveridge et al. [27] introduced the Point-and-Shoot Face Recognition Challenge (PaSC), a competition that examined 5 different methodologies on the PaSC database [26], which contains unconstrained face photos and videos of outdoor and indoor scenes. The best-performing algorithm in this competition [141] claimed near-perfect handling of pose variation, but the same set of methods later proved unsuccessful on other face datasets.

5.2 Illumination variation

Variable illumination, or lighting effects, is another major challenge for face recognition systems. Due to skin reflectance properties, differing camera sensors, resolution effects, and environmental conditions, illumination is largely uncontrolled from a machine perspective, and traditional approaches have their own limitations under varying illumination. It has been proved both theoretically [299] and practically [3] that the effects of fluctuating illumination can be more significant than the difference between two different individuals. Classical methods like eigenfaces [250], fisherfaces [24], probabilistic and Bayesian matching [175], and SVM [181] are unable to deal with illumination variation. A rich volume of recent literature [7, 36, 89, 99, 100, 182, 253, 254, 293, 294, 297] attests to the importance of handling illumination fluctuations when tuning the performance of face recognition systems. Figure 4 presents a sample from the CMU-PIE dataset that shows the significant impact of changing illumination conditions.

Fig. 4 Different lighting conditions for the same face, as presented in the CMU-PIE dataset [224]

5.3 Occlusion

Face occlusion means the hiding of important facial features by other objects such as a hat, helmet, sunglasses, hand, scarf, or mask. Occlusion is another critical factor that seriously affects the performance of face recognition techniques. As illustrated in Fig. 5, it is common in real life for certain facial features to be occluded by a hand, hair, or scarf, especially when the subject is in an uncontrolled environment. Significant research attempts [66, 83, 117, 130, 134, 140, 196, 199, 236, 251, 265, 276, 277, 291] have been made to deal with occlusion in face images. One simple solution adopted in various models is to treat the occluded part as noise, subtract it from the given face image, and compare the remaining information with the stored image, but this does not generalize to all cases. Recently, Nojavanasghari et al. [180] introduced a model to deal with hand-over-face occlusions learned from a dataset of non-occluded faces. Similarly, Iliadis et al. [106] presented an iterative method to address the occlusion problem for face identification. Their approach uses a robust representation based on two properties to model occlusion efficiently: first, the occluded part is fitted as a distribution described by a tailored loss function; second, it is expressed by a specific structure. Finally, the Alternating Direction Method of Multipliers (ADMM) is used to make the approach computationally efficient and robust.

Fig. 5 Faces with occlusion effects, as presented in Hand2Face [180]

5.4 Aging

With the passage of time, the human face changes in a nonlinear and inconsistent way, which makes it critical for both human and machine intelligence to recognize faces under varying aging effects. The problem is so much harder than other face recognition challenges that only a few efforts in the literature address age variation. Aging continuously changes the texture, shape, and appearance of a human face. Most of the classical methods [61, 135, 142, 148] that deal with age variation comprise two steps: feature extraction and calculation of a distance metric. These steps sometimes ignore the interaction between the two components, and a fixed distance threshold degrades algorithm performance. Furthermore, in most real-world applications, such as passport or ID card image databases, it is not possible to update gallery images frequently. Chen et al. [33] propose a Cross-Age Reference Coding (CARC) model to deal with age variation. The model encodes low-level features of a face image with an age-variation reference space; it needs only a linear projection to extract features, which makes it highly scalable. The authors evaluate the model on a self-constructed dataset of more than 160,000 images of 2,000 celebrities (in the 16 to 62 age range) and obtain considerable performance. Another deep convolutional neural network based age-invariant face recognition model is proposed by Wen et al. [261]. The model contains two major components: a convolutional unit for feature learning and a latent-factor fully connected layer responsible for age-variation feature learning. The CNN architecture is carefully designed to capture micro-level variations and therefore proved successful.

6 Face recognition frameworks

Face recognition has been a key research area for the last three decades across many research communities, including machine learning, artificial intelligence, image processing, and computer vision. The methods proposed for face recognition come from vast and diverse scientific domains, which makes it difficult to draw a clear line categorizing these approaches in a standard way; the use of hybrid models further complicates assigning approaches to standard branches of feature representation or classification. Nevertheless, following the recent literature, we present face recognition approaches in a clear, high-level categorization. Figure 6 shows the categorical distribution of face recognition approaches.

Fig. 6 Categorical distribution of face recognition methodologies

6.1 Classical approaches

Research in face recognition has long historical roots, appearing in the psychology literature of the 1950s and the engineering literature of the 1960s [298]. The earliest concepts were derived from pattern recognition systems, as discussed in an MIT Ph.D. thesis [210] by Lawrence Gilman, who first observed that 2D features extracted from a photograph can be matched against a 3D representation. Subsequent research identified practical difficulties under variable environmental conditions that remain challenging even with today's modern supercomputers and GPUs. Although these early methods were driven by pattern recognition, they were based on the geometric relationships between facial points, so they depend heavily on detecting those facial points in challenging environments and on the consistency of the relationships across different variations; these issues are still critical challenges for the research community. Another early attempt to develop a face recognition system was by Kanade et al. [115], who used simple image processing techniques to extract a vector of 16 facial parameters and a simple Euclidean distance measure to match these feature vectors, achieving a 75% accuracy rate on a predefined database of 20 people with 2 images per person.

In 2003, Zhao et al. [298] presented a precise and brief overview of the techniques employed by the face detection and recognition community during the preceding 30 years, discussing many psychological and neuroscientific aspects of face recognition. In terms of methodology, they categorized face recognition techniques into three broad groups: holistic methods (PCA, LDA, SVM, ICA, FLD, and PDBNN), feature-based methods (pure geometry methods, dynamic link architecture, hidden Markov models, and convolutional neural networks), and hybrid methods such as modular eigenfaces, hybrid LFA, shape-normalized, and component-based methods.

6.1.1 Holistic methods

Holistic methods, also known as global feature based methods, attempt to identify a face using global characteristics, i.e., the entire face representation is considered rather than individual components such as the mouth, eyes, and nose. Sirovich et al. [227] were the first to take advantage of Principal Component Analysis (PCA) to represent global face features. Other early research attempts include Turk et al. [250] with eigenfaces, Kamran et al. [58] with Linear Discriminant Analysis (LDA), Zhao et al. [300] with Subspace Linear Discriminant Analysis (SLDA), and Bartlett et al. [21] with Independent Component Analysis (ICA) for face recognition. In the same era, Guo et al. [80] introduced the use of the then well-known Support Vector Machine (SVM) for recognition on the challenging Cambridge ORL face dataset. Similarly, Liu et al.'s [153] evolutionary algorithm for projecting face images based on generalization error and Kawulok et al.'s [119] model for assigning variable significance parameters to different facial regions are key milestones of the classical face recognition era. Holistic methods are further subdivided into two key branches, linear and non-linear holistic techniques.
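The eigenface pipeline can be reproduced in a few lines with scikit-learn; the sketch below is a minimal illustration (PCA projection plus a nearest-neighbor classifier), and the number of components and the LFW subset are illustrative choices.

```python
from sklearn.datasets import fetch_lfw_people
from sklearn.decomposition import PCA
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier

# LFW subset with at least 50 images per person (downloaded on first use).
lfw = fetch_lfw_people(min_faces_per_person=50)
Xtr, Xte, ytr, yte = train_test_split(lfw.data, lfw.target, random_state=0)

pca = PCA(n_components=80, whiten=True).fit(Xtr)   # the "eigenface" basis
clf = KNeighborsClassifier(n_neighbors=1).fit(pca.transform(Xtr), ytr)

print("hold-out accuracy:", clf.score(pca.transform(Xte), yte))
```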

Xue et al. [273] utilize the well-known approximation technique of Non-negative Matrix Factorization (NMF) to overcome limitations of PCA, especially when face images are captured in an uncontrolled environment with varying expression, illumination, and/or occlusion. Experimental results on the AR database demonstrate that the proposed model achieves higher accuracy than Bayesian and PCA methods. Yang [274] takes advantage of kernel methods and proposes kernel PCA and kernel LDA for face recognition: face images are mapped into a higher-dimensional space by a kernel function, and PCA/LDA is then used to build the feature space. Another global feature approach based on locality preserving projections, known as Laplacianfaces [92], maps the face images into a face subspace for analysis; this integration preserves local information while deducing a face subspace that captures the optimal global face structure.

Another milestone came in 2009, when Kasturi et al. [118] proposed a framework for face recognition and object detection from real-time video streams. The framework takes video data as input and detects and recognizes face images. It provides an optimal set of measures so that comprehensive comparisons can be made and failures analyzed by separating accuracy and precision scores. The approach is applicable both to direct measurement of tracking technologies and to iterative algorithm development.

6.1.2 Local features based methods

Local feature based approaches first evaluate the input image to segregate distinctive facial regions like the mouth, nose, and eyes, and then determine the geometric relationships among these facial points; a group of statistical pattern recognition techniques and graph matching methods are available to compare such features. Notable early efforts in local feature based face recognition include Kanade et al. [154] with a linear feature matching scheme, Yuille et al. [282] with an interest-point based feature extraction method using deformable templates, Brunelli et al. [28] with a geometric feature based template matching scheme, and Lades et al. [133] with Gabor filters in a dynamic link architecture.

Cox et al. [42] introduced a distance-based local feature matching model that achieved a recognition rate as high as 95% on a database of 685 people. Similarly, Wiskott et al. [263] utilized graph matching concepts to deal with the very challenging problem of face recognition with a single image per person. Their model, based on elastic bunch graph matching, extracts concise face descriptions in the form of image graphs and classifies them into different facial regions, each represented by a set of wavelet components; these sets of wavelet components form the foundation for elastic graph matching. Another novel framework, proposed by Edwards et al. [54], introduces the active appearance model, which contains a statistical, photo-realistic model of the shape and grey-level appearance of faces. The model exploits the information available in training data and indexes the different facial parts with system-generated IDs.

Ahonen et al. [6] introduced the widely used Local Binary Pattern (LBP) approach to face recognition. First, the face image is divided into small regions from which region-wise local binary pattern histograms are extracted, and a single spatially enhanced global feature histogram is constructed that efficiently represents the face image; classification is then performed with a nearest-neighbor classifier. It was a novel idea for its time and is still in practice today in many face recognition techniques. Tan et al. [239] present a generalized LBP texture descriptor that efficiently utilizes local texture features. The approach is comparatively simple, robust to illumination change, and preserves the essential appearance details needed for recognition. They also demonstrate that replacing local histograms with a local distance transform based similarity measure is a powerful way to boost the performance of LBP-based recognition.
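The Ahonen-style descriptor is straightforward to sketch with scikit-image; the grid size and LBP parameters below are illustrative defaults, and a random array stands in for a real aligned face crop.

```python
import numpy as np
from skimage.feature import local_binary_pattern

def lbp_face_descriptor(gray, grid=(8, 8), P=8, R=1):
    """Region-wise uniform-LBP histograms concatenated into one global histogram."""
    codes = local_binary_pattern(gray, P, R, method="uniform")
    n_bins = P + 2                                   # uniform patterns plus one "other" bin
    gh, gw = gray.shape[0] // grid[0], gray.shape[1] // grid[1]
    hists = []
    for i in range(grid[0]):
        for j in range(grid[1]):
            region = codes[i * gh:(i + 1) * gh, j * gw:(j + 1) * gw]
            h, _ = np.histogram(region, bins=n_bins, range=(0, n_bins), density=True)
            hists.append(h)
    return np.concatenate(hists)                     # spatially enhanced global histogram

desc = lbp_face_descriptor(np.random.randint(0, 256, (128, 128)).astype(np.uint8))
```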

Queirolo et al. [201] provide another generic framework for 3D face identification. The method uses a simulated annealing based approach to range image registration with the Surface Interpenetration Measure (SIM). The feature space is first extracted by segmenting the face image into four regions, and each region is compared with the corresponding region of an image already enrolled in the database; to compare entire images, a modified simulated annealing approach is used to eliminate facial expression effects. Kemelmacher-Shlizerman and Basri [121] introduced another 3D face recognition model that reconstructs a face from a single image using a single reference face shape. The method exploits the similarity of faces: given a single input face image and a single 3D reference model of a different person's face, the input face guides the reconstruction of a 3D image from the reference model. The authors also claim that the method can handle unconstrained lighting effects. In the same stream, Li et al. [139] with a nonparametric-weighted fisherfaces model, Han et al. [279] with a generic eigenvector weighting model for local face features, and Choi et al. [39] with a pixel selection model based on discriminant features have gained considerable research attention in recent years.

6.1.3 Hybrid methods

The third category of classical face recognition approaches comprises hybrid methods, which use local and holistic methods simultaneously. Hybridizing or mixing different approaches to obtain better results is a common research trend, and many researchers [85, 165, 183, 246, 253, 267] exploit hybrid models to take advantage of both local and global methods. The literature on hybrid face recognition is vast and scattered across multiple disciplines; the key methods we considered include Tolba et al. [246] with a combined classifier based on Radial Basis Functions (RBF) and Learning Vector Quantization (LVQ), Huang et al. [102] with an integration of holistic and feature analysis based approaches using a Markov random field model, and Wang et al. [256] with a biologically inspired feature model combined with Local Binary Patterns (LBP).

Lavanya and Inbarani [137] propose a hybrid approach based on Principal Component Analysis (PCA) and Tolerance Rough Similarity (TRS): PCA is first used to extract a feature matrix from the face image, and TRS is then applied to compute a similarity index. The results reach accuracy rates of up to 97% on different datasets (OUR, YALE, and ORL) and reflect consistent performance. Hashemi and Gharahbagh [91] use a comparatively different approach: their algorithm uses eigenvalues of the 2D wavelet transform, k-means, and correlation coefficients for preprocessing, and an RBFN network as the classifier. After feature extraction, training is carried out by the RBFN classifier, and the smallest Euclidean distance from the selected feature vectors is calculated for each person. For a new face image, the feature vector is first computed, and then its distance to all centers is compared to obtain a similarity index. In the best case, the method achieved 96% recognition accuracy.

Hu et al. [98] proposed a distinctly different direction for face identification based on a fog computing scheme, addressing face identification on the Internet of Things. A face recognition system first generates a matrix of identities for an individual; the proposed fog computing model then decides the individual's identity. Experimental results show that the model can efficiently save bandwidth while achieving accuracy of up to 96.77%, a remarkable contribution compared with previous work. Jain and Park [109] take yet another direction: soft biometrics. Soft biometrics refers to attributes of individuals such as age, gender, gesture, the size and shape of the head, approximate height, etc. Various researchers [12, 14, 16, 23, 46, 47, 74, 108, 116, 186, 212, 249] believe that soft biometrics research can play a vital role in dealing with challenging issues like variable illumination, pose, rotation, scale, and background complexity in face recognition and human re-identification. The following are some classifications in this regard:

  • Demographic attributes: Demographic attributes refer to age, gender, ethnicity, and race; these are the most widely used soft biometrics across applications.

  • Anthropometric attributes: Anthropometric attributes refer to the geometrical appearance and shape of the face, body, and skeleton.

  • Medical attributes: Medical attributes cover a broad range of more informative soft biometrics such as health, weight, skin texture, and DNA pattern information.

  • Material, behavioral, and other soft biometric attributes: Accessories associated with any person like a hat, jewelry, bag, glasses, etc. can also be very helpful for an identity recognition system.

Lin et al. [151] recently observed that most face identification techniques are appropriate for a test or small dataset but cannot reproduce such experimental results in real-world applications involving millions of human faces; a system should therefore be robust enough to deal efficiently with large datasets like MS-Celeb-1M. The authors proposed a three-step algorithm. First, a Max-Feature-Map (MFM) activation function is used to train an initial classifier that maps raw images into a feature matrix. Second, face identities from MS-Celeb-1M are clustered into three subsets: a pure set, a hard set, and a mess set. Third and finally, a locality-sensitive hashing (LSH) method is used to speed up the search for the closest centroid. Experimental results show that the model is suitable even for datasets as large as MS-Celeb-1M.

6.1.4 Summary of classical approaches

With more than 50 years of research effort in face recognition, the literature on classical approaches is vast and scattered across multiple disciplines. In the section above, we have summarized the key milestones of face recognition research along a timeline, confining the early literature to three broad categories: holistic methods, local feature based methods, and hybrid approaches. Table 3 presents the classical face recognition approaches by category along with their performance on the corresponding datasets. Local feature based methods and hybrid approaches perform better under different illumination conditions, facial expressions, and other key challenges; they are also comparatively less sensitive to noise and are translation invariant.

6.2 Modern approaches

In recent years, machine learning based algorithms and methodologies have brought immense improvements to many social and scientific domains. We divide the modern era of face recognition approaches into three subcategories: deep learning based methods, sparse or dictionary learning based methods, and fuzzy logic based techniques. An overview of the research contributions in these categories is presented in this section.

6.2.1 Deep learning based face recognition

Deep neural networks have massive computational power for object recognition and have revolutionized machine learning during the last few years. Researchers from all fields, from social sciences and engineering to life sciences, are adopting deep frameworks to hybridize their existing models and obtain radical improvements. Many researchers, especially in the face recognition community [15, 232], affirm that deep learning offers remarkable computational power with outstanding accuracy and results-oriented behavior. In this section, we present a brief overview of recent developments in deep learning for face recognition.

Why Deep Learning?

Current research in face recognition through deep learning is largely based on hybridizing existing models with deep networks, with a focus on improved techniques for better recognition. Deep neural networks and related techniques have proved highly suitable for achieving high performance in terms of accuracy and robustness; their ability to classify large numbers of unlabeled face images robustly and accurately gives them an upper hand over classical face recognition approaches. Ongoing research on deep models in unconstrained environments, with faces varying in pose, illumination, and cosmetics, has achieved outstanding results. Although some efforts have assessed deep learning techniques for face recognition, a large gap remains with respect to the different variations.

Early Deep Learning Models:

Early face recognition applications, as discussed in [230, 298], used ANNs for large-scale feature representation. This idea ultimately led toward increasing the number of hidden layers in a neural network. Each layer in a deep model is based on a kernel function with a defined optimization objective, and the multilayer architecture [15] makes it possible to extract the feature space in a generalized way. The ability to learn automatically from unlabeled data, without human involvement, makes deep models remarkable for identification purposes. The use of deep belief networks in 2006 [97], the hybrid approaches of Sun et al. [231, 233-235], and the classification of ImageNet with a CNN [129] are regarded as key milestones that opened a new generation of face recognition research over the last few years.

Grm et al. [78] presented a comprehensive review of the strengths and weaknesses of deep learning models for face recognition. They used four of the most common deep CNN models, AlexNet, VGG-Face, GoogLeNet, and SqueezeNet, to extract features from input images and analyzed how factors such as brightness, noise, blur, and missing data affect output performance on the well-known LFW benchmark. The accuracy verification metric of the LFW protocol was used under ten-fold cross-validation with a separately selected threshold t, and separate datasets were used for training and evaluation, i.e., the VGG dataset for training and LFW for performance evaluation. Experimental results are presented for four covariate categories: Gaussian blur, noise, brightness, and missing data. The results show that the outputs of the selected deep models are least affected by changes in input color, contrast, and similar parameters.

Hashing has been popular in access and retrieval algorithms for decades due to its quick retrieval and low storage cost. To exploit this capability, Tang et al. [240] came up with the innovative idea of deep hashing based on classification and quantization errors for face image retrieval. As presented in Fig. 7, the proposed model learns image representation features, hash codes, and the classifier simultaneously: the deep model predicts image labels and generates corresponding hash codes for quick image matching. The prediction and quantization errors are tightly linked and jointly assist the learning process of the deep network. The model is evaluated on the well-known YouTube, CIFAR-10, and FaceScrub datasets, and the results show that it is highly efficient.

Fig. 7 General architecture of deep hashing [240]

Preprocessing and Deep Models:

Face alignment is one of the key preprocessing challenges in the overall face recognition process: it aims to localize facial landmarks and predict the face position in the input image, and it has been an open research problem for decades. Shi et al. [223] address the problem with a deep regression model and obtain a significant improvement in output results. The model contains a global layer that formulates the initial face shape and multiple local layers that iteratively update the shape estimated by the global layer. The function of the global layer is given in (1):

$$ S_{0} = GR(I) = r_{0}(\phi_{0}; \theta_{0}), \qquad \phi_{0} = g(I) $$
(1)

The function of the global layer as proposed in [223].

The outcome of the global layer, $S_{0}$, is the initial shape estimate for the input image $I$, where $GR(\cdot)$ represents the global stage operating on the $d$-dimensional features $\phi_{0} = g(I)$, and $r_{0}$ is a regression function with feature input $\phi_{0}$ and parameters $\theta_{0}$. The local layers then iteratively take two inputs, the image $I$ and the predecessor shape $S_{t-1}$. The general structure of the local layer is given in (2):

$$ S_{t} = LR^{t}(I, S_{t-1}) = S_{t-1} + r_{t}(\phi_{t}; \theta_{t}) $$
(2)

The function of the local layer as proposed in [223]. This layer-wise local learning determines the parameters of the functions $LR^{t}$ sequentially, from $\theta_{0}$ to $\theta_{t}$, to approximately minimize the objective function; the parameters of every layer are optimized given the trained parameters of its predecessor. More recently, a modified approach to face alignment that constructs deep face features was presented by Jiang et al. [111], for which a large-scale independent training dataset was prepared.
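A schematic sketch of the global-plus-local cascade in (1)-(2) is shown below; the feature maps and regressors are linear placeholders standing in for the learned deep components of [223], and all sizes are illustrative.

```python
import numpy as np

def cascaded_regression(I, g, r0, local_feats, regressors):
    """Schematic form of Eqs. (1)-(2): one global estimate, then local refinements."""
    S = r0(g(I))                          # Eq. (1): S_0 = r_0(phi_0), phi_0 = g(I)
    for g_t, r_t in zip(local_feats, regressors):
        phi_t = g_t(I, S)                 # shape-indexed local features
        S = S + r_t(phi_t)                # Eq. (2): S_t = S_{t-1} + r_t(phi_t)
    return S

# Toy placeholders: 68 landmarks (136 coordinates), linear maps as "layers".
rng = np.random.default_rng(0)
I = rng.standard_normal(1024)
W0 = rng.standard_normal((64, 1024)) * 0.01   # global feature map g
A0 = rng.standard_normal((136, 64)) * 0.01    # global regressor r_0
shape = cascaded_regression(
    I,
    g=lambda img: W0 @ img,
    r0=lambda phi: A0 @ phi,
    local_feats=[lambda img, S: np.concatenate([S, img[:64]])] * 3,
    regressors=[lambda phi: 0.1 * phi[:136]] * 3,
)
print(shape.shape)  # (136,)
```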

Design Invariants for Deep Models:

An important question here is how to design a deep neural network for a particular face recognition system. Many researchers [90, 110, 114, 174, 257, 268, 292] address this problem in different ways, but a considerable effort was made by Chen et al. [35], who discuss design principles for deep convolutional neural network architectures for unconstrained facial images in both static and video streaming data. They illustrate that CNN architectures often have to handle millions of parameters, imposing massive computational and memory overhead, so a balanced compromise between cost and efficiency is required when designing a deep architecture. Helpfully, all critical aspects of a face recognition system, including face alignment, detection, association, and verification, are discussed. Another considerable effort was made by Hasanpour et al. [90], who introduce core design principles for deep convolutional networks. Many critical parameters, such as the number of hidden layers, architecture formulation, handling of homogeneous layers, local correlation preservation, maximum utilization of predecessor information at each layer, performance utilization, balanced weight allocation, isolated prototyping, maximum utilization of dropout, and adaptive feature pooling, are presented in detail along with the relevant calculation measures. In a similar direction, Nguyen et al. [178] recently presented a comprehensive review of measures for choosing the best deep model for a particular face recognition system; the authors evaluated three popular deep face models, VGG Face B, CenterLoss C, and VIPLFaceNet, and in light of the experimental results they identify the characteristics of these deep models that suit face recognition.

Recently, Ranjan et al. [208] provided a detailed overview of designing deep learning frameworks for face recognition. This comprehensive overview of design techniques illustrates the benefits of subject-independent multitask learning with deep neural networks for face recognition systems. Its key contribution is a list of open issues in designing a face recognition system: face detection in large crowds; variable illumination, pose, and expression constraints; identifying model dependencies on large training datasets; controlling the cost of deep models; handling complexity and mathematically formulating the functions of each hidden layer; handling degradation and data bias during training; establishing theoretical foundations for understanding the behavior of deep models; and incorporating domain knowledge when designing a deep network. Likewise, Peng et al. [189] introduced a high-dimensional re-ranking deep representation for face recognition. The method first builds a feature space by extracting and concatenating deep features from local facial patches via a convolutional neural network (CNN), and then uses a novel locally linear re-ranking framework to refine the initial ranking outcomes by mining valuable information from the initial ranking results. The method needs no human interaction or data annotation and can serve as an unsupervised post-processing model. Similarly, Wen et al. [262] propose the concept of center loss to enhance the learning power of deep models: the scheme pulls the deep features of each class toward their center, and the joint supervision not only stretches inter-class feature differences but also mitigates intra-class feature variation, greatly enhancing the discriminative power of the deeply learned features.
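A minimal PyTorch sketch of the center-loss idea is given below; the class count, feature dimension, and loss weight are illustrative, and this is a simplified rendering rather than the exact formulation of [262].

```python
import torch
import torch.nn as nn

class CenterLoss(nn.Module):
    """Pull each deep feature toward the (learned) center of its own class."""
    def __init__(self, num_classes, feat_dim):
        super().__init__()
        self.centers = nn.Parameter(torch.randn(num_classes, feat_dim))

    def forward(self, features, labels):
        # Mean squared distance between each feature and its class center.
        return ((features - self.centers[labels]) ** 2).sum(dim=1).mean()

# Joint supervision: softmax loss (inter-class separation) plus a weighted
# center loss (intra-class compactness).
feats = torch.randn(32, 128)                 # placeholder deep features
labels = torch.randint(0, 10, (32,))
logits = nn.Linear(128, 10)(feats)
loss = nn.CrossEntropyLoss()(logits, labels) + 0.01 * CenterLoss(10, 128)(feats, labels)
```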

Efficiency and Robustness in Deep Models:

Efficiency and robustness of a system is another important aspect, thoroughly analyzed by Mohammadi et al. [176] in a study of robustness in face recognition. This work highlights that it is not enough for a face recognition system to obtain highly accurate results; it should also be robust against presentation attacks, for example a person X presenting a photo of person Y in front of the image acquisition camera. Other forms of presentation attack include digital photographs or video, artificial masks, and make-up. Through experimental results on three popular CNN-based face recognition methods, VGG-Face, LightCNN, and FaceNet, the authors show that deep CNN models are highly accurate (up to 98%) and less vulnerable to presentation attacks as well.

Recent Developments:

Kim et al. [127] show that transfer learning from a convolutional neural network trained on 2D face images can perform well for 3D face identification. They use VGG-Face for the initial 2D training and then add 3D data to enlarge the dataset, which makes the CNN more robust for recognition. Additionally, the augmented 3D data is converted into a 2D depth map, and isolated random points are removed from this depth map to simulate hard occlusions. Finally, a fine-tuning phase is added to represent 3D face features from the previous layers. Lin et al. [151] present another novel idea, a clustering-based lightened deep model for large-scale face identification, addressing the Microsoft challenge of recognizing one million celebrities (MS-Celeb-1M). Since deep models have memory and computational overheads, most of the suggested techniques work well on ordinary datasets like LFW; real-life face recognition systems, on the other hand, sometimes have to deal with very large numbers of face identities. The authors propose a three-stage model to deal with this problem. In the first stage, a face feature representation model is trained with a function entitled Max-Feature-Map (MFM) on the publicly available cross-domain dataset CASIA-Web. In the second stage, face features are clustered into three independent sets called mess, hard and pure sets. Each set acts as a cluster of the feature space, and its cluster center is used as the corresponding MID for MS-Celeb-1M. This approach automatically reduces the number of comparisons and the effect of noise in an input image. In the third and final stage, the Locality Sensitive Hashing (LSH) algorithm is used to accelerate the search for the nearest centroid. Zhong et al. [304] address another important question: what kind of features are most effective for a deep face model, and how can we best represent them? The most important phenomenon highlighted is the correspondence of high-level features with complex face attributes that humans cannot express in a few words.
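The MFM function mentioned above has a simple form: it splits the channel dimension in half and keeps the element-wise maximum, acting as a compact, competitive alternative to ReLU in LightCNN-style models. A minimal sketch, assuming a standard PyTorch tensor layout:

```python
# Max-Feature-Map (MFM) activation sketch: competitive selection
# between two halves of the channel dimension.
import torch

def max_feature_map(x):
    # x: (N, 2C, H, W) -> two (N, C, H, W) halves, element-wise max
    a, b = torch.chunk(x, 2, dim=1)
    return torch.max(a, b)
```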

Bashbaghi et al. [22] suggested a triplet-loos function based deep learning architecture for face recognition in video surveillance. The proposed architecture is highly suitable for single sample per person which is a very challenging situation in current face recognition systems. The whole architecture is mainly divided into two broad categories: first triplet-loss function originally proposed by [218] based deep CNN model, second, deep autoencoders. The defined function is capable to learn from complex face representations like low illuminations, contrast or brightness and this makes sure robustness in inter/intraclass variations. The autoencoder as shown in Fig. 8 is used to normalize discrepancies in face image capturing conditions and reconstruct a fine image with the help of input image. This work well for both single reference training and domain adoption issues. The performance of the proposed model is evaluated on a dataset especially collected for video surveillance applications called Cox Face DB [161]. Finally, experimental results proved that CCM-CNN and CFR-CNN have provided significant improvement in recognition performance with lower computational cost.
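For reference, the triplet-loss objective from [218] used by this architecture can be sketched in a few lines; the function name and margin value below are illustrative, not the authors' implementation:

```python
# Triplet-loss sketch: the anchor should lie closer to the positive
# (same identity) than to the negative (different identity) by a margin.
import torch
import torch.nn.functional as F

def triplet_loss(anchor, positive, negative, margin=0.2):
    d_pos = F.pairwise_distance(anchor, positive)  # anchor-positive distance
    d_neg = F.pairwise_distance(anchor, negative)  # anchor-negative distance
    return F.relu(d_pos - d_neg + margin).mean()   # hinge on the margin
```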

Fig. 8 Block diagram of the autoencoder in the Canonical Face Representation CNN (CFR-CNN) [185]

Recently, Iranmanesh et al. [107] came up with an entirely different approach to face recognition. They utilize a deep coupled learning framework to match polarimetric thermal face images against a gallery of visible face images. The proposed model trains a deep network that makes full use of the polarimetric thermal information, locating global discriminative features through a nonlinear embedding space to match polarimetric thermal faces with visible faces. Experimental results show that the deep coupled framework is highly efficient compared to traditional approaches. Likewise, Liu et al. [157] proposed a deep hypersphere embedding model (SphereFace) under open-set protocol settings, and Deng et al. [50] suggest a novel additive angular margin loss function for deep face recognition. The latter model provides a much clearer geometric interpretation, finding discriminative face features by maximizing the decision boundary in angular space based on l2-normalized weights. The authors report significant improvements in recognition results on many popular face recognition benchmarks such as LFW, AgeDB, CFP and, most importantly, the MegaFace Challenge. Another recent development was made by Lin et al. [152]; the idea is quite intuitive and simple: they utilize local features at all salient facial points and produce a feature tensor to represent a 3D face, so that the similarity of two 3D faces can be calculated from their two feature tensors. To resolve the unavailability of large sample sets, a feature-tensor-based data augmentation approach is introduced to increase the number of feature tensors. Results on the Bosphorus and BU-3DFE face datasets demonstrate the effectiveness of the proposed approach. In the same era, Efremova et al. [55] proposed an easy-to-implement five-class classification model for face and emotion recognition with neural networks. The suggested framework is capable of large-scale emotion identification on different platforms such as desktop, mobile and VPU.
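The additive angular margin idea of Deng et al. [50] can be sketched compactly: compute cosine similarity on l2-normalized features and class weights, add the margin m to the target-class angle, and rescale. The fragment below is a simplified illustration that omits the numerical safeguards of published implementations; the function name and default values are ours:

```python
# Sketch of an additive angular margin on the classification logits.
import torch
import torch.nn.functional as F

def arc_margin_logits(features, weight, labels, margin=0.5, scale=64.0):
    # cos(theta) between normalized features and normalized class weights
    cos = F.linear(F.normalize(features), F.normalize(weight))
    theta = torch.acos(cos.clamp(-1 + 1e-7, 1 - 1e-7))
    target = torch.cos(theta + margin)                 # cos(theta + m)
    one_hot = F.one_hot(labels, weight.size(0)).to(cos.dtype)
    return scale * (one_hot * target + (1 - one_hot) * cos)

# These logits are then fed into a standard cross-entropy loss.
```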

Regardless of the many advantages of deep neural networks, there is a darker side of the picture as well. In the last few years, many researchers [2, 19, 184, 193, 195, 203] have illustrated loopholes in deep learning models for face recognition. Goswami et al. [76] point out that, due to the complex formulation of the functions learned in the hidden layers of a deep network, it is difficult to mathematically formulate and validate each of them. There are three key aspects of this publication. First, the authors show that deep face models are considerably affected by adversarial attacks. Second, a system should be robust enough to determine which sorts of images could cause such distortions and take the necessary countermeasures, by means of a model proposed for the hidden layers of the deep network. Finally, the identified suspicious images are rejected through the novel contribution of this research. In most practical face recognition systems, we have limited training data but a large volume of unseen images that the deep model needs to classify; this leads to biased classification in an ordinarily trained deep neural network. A research group at NEC Laboratories and Michigan State University under Yin et al. [278] proposed a transfer-learning-based framework to deal with long-tail data in a face recognition system. According to Mostafa et al. [57], long-tail data has an exponential production rate and is not even comparable with big data, owing to its data generation behavior and resource consumption. A Gaussian prior is assumed in all regular classes, and the variance from these classes is transferred to the long-tail classes. This makes the long-tail data distribution more similar to the regular distribution and ultimately balances the limited training data that broadly impacts biased decision making. The authors' experiments on the MS-Celeb-1M, LFW and IJB-A datasets, restricting the number of samples in long-tail classes, show state-of-the-art results for the proposed algorithm.

Galea and Farrugia [62] introduce deep neural networks for matching software-generated sketches to face photos by utilizing morphed faces and transfer learning. This is one of the most crucial and sensitive social security issues: eyewitness-based (human-drafted or software-generated) sketches need to be automatically matched against large criminal face galleries. Existing methods require human intervention and still perform poorly; furthermore, most of these algorithms were not designed to work with software-generated faces. The authors propose a three-step methodology. First, a deep CNN is trained by means of transfer learning and then utilized to determine the identity of a composite sketch among face photos. Second, a 3D morphable model is applied to align the software-generated face sketches with the face photos identified in step one. Finally, a specialized heterogeneous large-scale software-generated composite sketch database called UoM-SGFS, extended to twice the number of subjects, is used to boost performance.

Summary of Deep Learning Methods:

Deep learning based face recognition methods have gained significant research attention in recent years. In the above section, we presented recent developments in chronological order. Owing to their potential applications, the volume of literature is vast and diversified across multiple branches; we therefore focused on key milestones with respect to their time sequence, technical strength and popularity, covering early contributions, design invariants for deep networks, efficiency parameters of deep learning methods and recent developments in deep learning-based face recognition. Table 4 presents deep learning-based face recognition approaches and the performance evaluation of selected methods on the corresponding datasets. Additionally, it has been observed that the use of intermediate visual features in convolutional neural networks to describe visual attributes has rarely been discussed in the academic literature; this could be a promising future research line if combined with pre-trained autonomous units.

6.2.2 Sparse representation models

Sparse representation is a powerful pixel-wise classification technique that learns redundant dictionaries from input images and classifies them accordingly. It has the ability to discover semantic information, which is very useful in visual understanding domains, and has proven [24, 56, 169, 266] to be a powerful tool for feature representation in high-dimensional data structures. It has gained significant attention and obtained remarkable performance in various applications such as visual understanding, audio encoding, and medical image processing [37]. In particular, the discriminative feature representation power of dictionary learning is highly valuable for face recognition: face images are naturally sparse, highly complex and high-dimensional, and the availability of greedy and convex optimization techniques for sparse models makes them especially suitable for face recognition applications.

Dictionary Learning for Face Recognition:

Dictionary learning is a branch of machine learning that aims to find a matrix, called a dictionary, in which the training data admits a sparse representation. In our context, given a collection of face samples in a random distribution, we can extract discriminative features by learning the desired dictionary from the training data. The learned dictionary plays a vital role in the success of sparse representation [194]; we have to learn a task-specific dictionary from the given face images. Therefore, as an emerging research field, existing theories and approaches for feature representation need to be rebuilt for dictionary learning.
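To make the definition concrete, the following is a minimal sketch, assuming face images flattened to row vectors; scikit-learn's MiniBatchDictionaryLearning stands in for the task-specific dictionary learners discussed in this section, and the data and all parameter values are placeholders:

```python
# Minimal dictionary-learning sketch: learn atoms D such that each
# face vector is approximated by a sparse combination of atoms.
import numpy as np
from sklearn.decomposition import MiniBatchDictionaryLearning

X = np.random.rand(200, 32 * 32)          # placeholder for vectorized faces
learner = MiniBatchDictionaryLearning(n_components=64, alpha=1.0,
                                      transform_algorithm="omp",
                                      transform_n_nonzero_coefs=10)
codes = learner.fit(X).transform(X)       # sparse codes over the dictionary
D = learner.components_                   # learned dictionary atoms (64 x 1024)
```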

Early Contributions:

Some early milestones, such as Wright et al. [266] and Rubinstein et al. [211], provide a strong theoretical foundation for sparse representation in visual understanding and pattern recognition. Researchers such as [128, 290] presented comprehensive reviews highlighting how research trends moved toward sparse modeling over the previous two decades, and a considerable effort that has opened various new directions for the future of machine intelligence was made by Tosic and Frossard [248]. In our context, the robust sparse representation algorithm of Wright et al. [255] is one of the earliest and most efficient sparse models for face recognition. The proposed framework gives a different view of the two most critical face recognition problems: robustness to occlusion and efficient feature extraction. The theory of sparse models shows that one can predict in advance how much occlusion the model can handle. Experimental results show that if sparsity is properly exploited, feature extraction is not a critical issue in the face recognition process. Furthermore, the framework is capable of handling errors caused by occlusion and illumination, because these errors are sparse with respect to the pixel basis.
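The sparse-representation classification (SRC) scheme in the spirit of [255] can be sketched briefly: code a test face over the matrix of training faces, then assign the class whose samples best reconstruct it. The sketch below substitutes orthogonal matching pursuit for the l1 solver of the original paper, purely for brevity; function names are ours:

```python
# SRC-style classification sketch: sparse code + class-wise residuals.
import numpy as np
from sklearn.linear_model import orthogonal_mp

def src_classify(A, labels, y, n_nonzero=10):
    # A: (d, n) column-stacked training faces; labels: (n,); y: (d,) test face
    x = orthogonal_mp(A, y, n_nonzero_coefs=n_nonzero)   # sparse code of y
    residuals = {}
    for c in np.unique(labels):
        mask = (labels == c)
        # reconstruct y using only the coefficients of class c
        residuals[c] = np.linalg.norm(y - A[:, mask] @ x[mask])
    return min(residuals, key=residuals.get)             # smallest residual wins
```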

Zhang and Li [287] proposed an extended version of the K-SVD (K-means Singular Value Decomposition) algorithm that incorporates the classification error into the objective function, improving both the learning power of the dictionary and the performance of the linear classifier. The proposed discriminative K-SVD algorithm finds the dictionary and trains the classifier jointly, using the standard K-SVD procedure to create overcomplete dictionaries for sparse representation. This is quite different from most existing methodologies, which iteratively solve sub-problems in the hope of reaching a globally optimal solution. The proposed method shows outstanding results on the YaleB and AR datasets. More recently, Liu et al. [160] proposed image-set based face recognition using K-SVD dictionary learning. The approach learns variation dictionaries from gallery and probe face images separately, and then applies an improved joint sparse representation that effectively employs the information learned from both gallery and probe samples.
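In our notation, the joint objective of discriminative K-SVD can be summarized roughly as follows, where $Y$ holds the training faces, $D$ the dictionary, $X$ the sparse codes, $H$ the label matrix, $W$ the linear classifier, and $T$ the sparsity level (a paraphrase of [287], not the authors' exact formulation):

```latex
\min_{D,\,W,\,X}\;
\|Y - DX\|_F^2
\;+\; \gamma\,\|H - WX\|_F^2
\;+\; \beta\,\|W\|_F^2
\quad \text{s.t.} \quad \|x_i\|_0 \le T \;\; \forall i
```

The second term is the classification error folded into the dictionary learning objective, which is what distinguishes this approach from solving the reconstruction and classification sub-problems separately.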

Chen et al. [32] proposed a generalized dictionary learning-based face recognition framework for video streaming, shown to be robust to changes in illumination, pose, and other variations in video sequences. The proposed model consists of three general steps. First, the input video is partitioned into groups of frames with the same illumination and pose; this group-wise framing strategy helps eliminate temporal redundancy while accounting for changing pose and illumination. For each group of frames, a sub-dictionary is constructed and the representation error is minimized through sparse representation. Second, these learned dictionaries are combined to form a global sequence-specific dictionary. The third step is formulated purely for recognition: frames from the input video are projected onto the sequence-specific dictionary constructed in step two, and this projection leads directly to recognition in a cost-effective way. Performance evaluation on three of the most challenging datasets for video-based face recognition (the Face and Ocular Challenge Series (FOCS), the Multiple Biometric Grand Challenge (MBGC) and the Honda/UCSD datasets) shows a significant improvement over other methods.

Recent Developments for Dictionary Learning Based Face Recognition:

Meng et al. [173] introduced a feature fusion-based linear discriminative redundant dictionary algorithm that improves the face recognition capability of the sparse model. First, an initial layer extracts common local features and concatenates them to form feature vectors, or atoms. Second, Linear Discriminant Analysis (LDA) is used for dimensionality reduction and to rebuild the dictionary of atoms. Experimental results show that this has a positive impact on the structural design and discriminative ability of the dictionary. Similarly, Liu et al. [156] hybridized collaborative representation based classification (CRC) with sparse representation-based classification (SRC) and obtained outstanding results for face recognition. The proposed hybrid model assumes that the training samples in each category play an equal role in learning the dictionary, and on this assumption generates a dictionary containing the training samples of the corresponding class.

Xu et al. [272] take yet another direction with an l2-regularization based sparse representation algorithm that is computationally efficient and achieves noticeable performance for face image classification on various datasets. The underlying claim, well established in the pattern recognition community, is that sparse representation is highly effective for image classification and identification. The proposed method suggests that discriminative sparseness can be achieved by reducing the correlation among test samples taken from different classes. In a similar direction, Liao et al. [150] proposed an efficient subspace-learning based face recognition algorithm that uses a sparse constraint, low-rank technology and a label relaxation model to reduce the disparity between domains. Additionally, a high-performance dictionary learning algorithm constructs embedding terms and non-local self-similarity terms, ultimately lowering the time complexity. Experiments on a wide range of face recognition datasets such as FRGC, LFW, CVL, Yale B and AR prove the effectiveness of the proposed algorithm. Other researchers [158, 163, 288] have likewise utilized sparse representation for face recognition.
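What makes l2-regularized collaborative representation attractive, as in the hybrid of Liu et al. [156] and the scheme of Xu et al. [272], is that the code has a closed form and is therefore cheap to compute. A hedged CRC-style sketch (our function names; not either paper's exact algorithm):

```python
# CRC-style classification sketch: closed-form ridge code + class residuals.
import numpy as np

def crc_classify(A, labels, y, lam=1e-3):
    # A: (d, n) training faces as columns; closed-form l2-regularized code
    P = np.linalg.solve(A.T @ A + lam * np.eye(A.shape[1]), A.T)
    x = P @ y                                  # collaborative code of y
    residuals = {c: np.linalg.norm(y - A[:, labels == c] @ x[labels == c])
                 for c in np.unique(labels)}
    return min(residuals, key=residuals.get)
```

Note that P depends only on the training data, so it can be precomputed once and reused for every test face, which is the source of the efficiency claim above.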

To overcome a major limitation of deep learning models, namely that they may not work well when the number of training samples is too small, Shao et al. [222] proposed a two-step dynamic dictionary optimization model for face image classification. Initially, a dictionary is constructed from a set of artificial faces derived from pairs of face differences. Then, dictionary optimization methods are used to eliminate redundancy in this dictionary: original samples with small contributions are discarded, shortening the extended dictionary to a more compact structure. The optimized dictionary can then be used for face classification based on sparse representation. In the same direction, Lin et al. [151] recently proposed a virtual-dictionary-based kernel sparse representation to overcome the limited-sample problem in face recognition. The model automatically generates a number of new training samples, termed the virtual dictionary, from the original dataset, and then uses the constructed virtual dictionary and training set to build a kernel sparse representation for classification (KSRC) model. Coordinate descent algorithms are used to solve the KSRC model and enhance computational efficiency.

Jing et al. [113] utilized multispectral imaging to boost the performance of face recognition systems. Multi-view dictionary learning is an efficient feature learning model that learns dictionaries from different views of the same object and has obtained state-of-the-art classification results. The proposed model applies multi-view dictionary learning to multispectral face recognition through a different concept called multispectral low-rank structured dictionary learning (MLSDL). The method learns multiple dictionaries, comprising one spectrum-common dictionary and several spectrum-specific dictionaries, each containing a set of class-specific sub-dictionaries. A low-rank matrix recovery algorithm regularizes the multispectral dictionary learning process so that MLSDL can readily handle the issues arising in multispectral face recognition. Performance evaluation on the HK PolyU, CMU and UWA hyperspectral face databases proves the significance of the proposed framework.

More recently, Wang et al. [259] highlighted a common problem in most dictionary learning-based face recognition systems: using the training samples directly as the dictionary leads to fitting error. To overcome this issue, the authors introduce a Laplacian graph embedded class-specific dictionary learning model. It trains a weight matrix and augments a Laplacian graph to rebuild the dictionary; this weight allocation approach assigns different weights to different dictionaries, which ultimately improves classification accuracy. Another novel viewpoint, different from existing dictionary learning approaches, is presented by Peng et al. [188], who consider both the native spatial domain and the Fourier frequency domain for dictionary learning. In this two-tier approach, dictionaries are first constructed on the actual dataset and the Fourier-transformed dataset respectively, making the data complementary in both spatial and frequency domains; these dictionaries are then integrated for classification by collaborative representation. The proposed method promotes the discriminative ability of dictionary learning and achieves better classification performance on the ORL, FERET, AR and Extended Yale B datasets.

As noted above, the discriminative K-SVD approach of Zhang and Li [287] exemplifies how the classification error can be folded into dictionary learning to enhance system performance, and this class of face recognition technique has been quite popular among many researchers [32, 88, 104, 112, 113, 145] during the last few years.

Currently, Shang et al. [221] have introduced a single-gallery face identification method based on an extended joint sparse representation. The key issue addressed is that only a single image per person is enrolled in the source database, while at acquisition time multiple images may be received, differing in view, pose, scale and rotation. A customized dictionary-based face identification approach is proposed to handle this problem: it first builds a customized dictionary from the face images, then applies an extended joint sparse representation that utilizes information from both the customized dictionary and the gallery samples for classification. The performance of the proposed method is tested on many popular face datasets, including YALE, AR, GEORGIA, LFW, CMU-PIE and Multi-PIE, and compared with other face recognition algorithms. Another novel dictionary-based sparse representation model is proposed by Keinert et al. [120]; it includes a non-convex sparsity-inducing penalty and a robust non-convex loss function. The penalty encourages group sparsity through an approximation, and the loss function is chosen to make the algorithm more resilient to noise, occlusions, and disguises.

Summary of Dictionary Learning Methods:

In the above section, we have broadly summed up the latest developments in sparse models and dictionary learning-based face recognition algorithms. It has been observed that sparse coding and dictionary learning models have strongly influenced existing approaches to face recognition. Table 5 presents dictionary learning-based face recognition approaches and a comparative analysis of selected methods on the corresponding databases.

6.2.3 Fuzzy set theory

Fuzzy set theory was introduced in 1965 by Lotfi Aliasker Zadeh [283]. Since its inception, it has been applied in a variety of disciplines such as logic, decision theory, operations research, computer science, artificial intelligence, pattern recognition, and robotics. Especially during the last few years, its adoption has brought revolutionary change to various research domains.

Fuzzy Logic and Face Recognition:

As discussed earlier, handling variation in illumination is one of the most challenging issues in face recognition systems. A wide spectrum of methodologies has been proposed during the last few decades, but so far no approach has fully succeeded in dealing with illumination variation. Fuzzy set theory has proven highly successful in this regard and has made a major research contribution to feature representation in machine intelligence [197, 207]. Early research attempts [31, 131, 147, 275, 301] demonstrated outstanding results in applying fuzzy set theory to face recognition systems.

The fuzzy Fisherface classifier of Kwak and Pedrycz [131] is one of the earliest fuzzy-based face recognition models. The proposed model treats a sample's association with a class as a graded membership rather than a hard assignment, and this more detailed discrimination helps improve classification results. Furthermore, fuzzy K-nearest neighbor class assignment is used when operating on the feature vectors generated by PCA. Experimental results on the Yale, ORL and CNU (Chungbuk National University) face databases demonstrate significant improvements in classification results.
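The fuzzy K-nearest neighbor membership assignment mentioned above can be sketched compactly. The fragment below follows the widely used Keller-style initialization, where each sample keeps a 0.51 base membership in its own class and shares the remaining 0.49 according to its neighbors' labels; the function name and the use of scikit-learn are our assumptions, not the implementation of [131]:

```python
# Fuzzy k-NN membership sketch: graded class memberships per training sample.
import numpy as np
from sklearn.neighbors import NearestNeighbors

def fuzzy_memberships(X, y, n_classes, k=5):
    nn = NearestNeighbors(n_neighbors=k + 1).fit(X)
    _, idx = nn.kneighbors(X)                  # first neighbor is the point itself
    U = np.zeros((len(X), n_classes))
    for i, neighbors in enumerate(idx[:, 1:]):
        counts = np.bincount(y[neighbors], minlength=n_classes) / k
        U[i] = 0.49 * counts                   # share graded by neighbor labels
        U[i, y[i]] += 0.51                     # own label keeps the majority share
    return U                                   # row i: membership grades of sample i
```

These graded memberships then replace the hard class indicators when forming the scatter matrices, which is what makes the Fisherface projection "fuzzy".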

Recent Developments in Fuzzy based Face Recognition:

Li [143] proposed a fuzzy 2DPCA face recognition algorithm that utilizes the fuzzy K-nearest neighbor method to calculate the membership degree matrix of the training samples, which is then used to obtain the fuzzy mean of each class. These fuzzy means are incorporated into the definition of the general scatter matrix with the expectation of improving classification results. The proposed model is evaluated on the FERET, ORL and YALE face datasets; experimental results show that the strategy works well even in challenging environments with variation in illumination, expression and pose.

Since the early days of face recognition research, illumination variation has been one of the most challenging factors in obtaining accurate results. Oulefki et al. [182] address illumination variation with an image enhancement approach based on fuzzy reasoning. The model is an adaptive enhancement technique that compensates for non-uniform illumination and low contrast. As illustrated in Fig. 9, the proposed approach enhances both the brightness and the contrast of the original image and, by means of its fuzzy logic, brings out hidden details of the given image.

Fig. 9 A general structure of the illumination enhancement model [182]

The fuzzy reasoning model is assessed using four blind-reference image quality metrics computed from the input face image. A fuzzy logic system then merges the information received from these four metrics (A, B, C, and D) to reconstruct an enhanced image. The authors also provide a comparative analysis of the proposed model against six state-of-the-art methods, reporting considerable performance on the well-known Yale-B, MOBIO, FERET and CMU-PIE (pose, illumination, and expression) face datasets. In a similar direction, Huang et al. [105] introduced a fuzzy linear regression discriminant projection model, and Álvarez et al. [30] a fuzzy distance-based model for skull-face overlay in craniofacial superimposition, which efficiently handles pose, illumination and contrast variations with the help of fuzzy logic.

Recently, Du et al. [53] highlighted the importance of uncertainty in face recognition and utilized an interval type-2 fuzzy linear discriminant analysis model to handle the uncertainty in highly complex face recognition environments. First, a supervised interval type-2 fuzzy C-means (IT2FCM) algorithm is introduced to exploit class information; it is incorporated into a linear discriminant analysis model to reduce noise effects and obtain the correct local distribution. Second, the interval type-2 fuzzy linear discriminant analysis (IT2FLDA) algorithm uses the IT2FCM model to weight each face pattern with respect to each class and to compute each class mean, which are then applied to the fuzzy within-class and fuzzy between-class scatter matrices accordingly. The proposed IT2FLDA framework is highly capable of finding the optimal directions that maximize the ratio of fuzzy between-class to fuzzy within-class scatter, so the resulting feature space is more discriminative and robust for recognition. Another recent development, by Sing et al. [226], introduces a confidence-factor-weighted Gaussian function with parallel fuzzy rank-level fusion. The proposed framework generates fuzzy ranks induced by a Gaussian function based on the confidence of a classifier; in contrast to traditional ranking, this fuzzy ranking reflects associations among the outputs (confidence factors) of the classifier. These fuzzy ranks, produced by multiple representations of a face image, are fused, weighted by the corresponding confidence factors of the classifier, to produce the final ranks for face recognition. Similarly, Rejeesh [202] introduced interest-point based face recognition using an adaptive neuro-fuzzy inference system.
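As background, the classical (type-1) fuzzy C-means membership update that IT2FCM generalizes can be sketched as follows; interval type-2 variants typically evaluate such memberships for two fuzzifier values m to form an interval. This is a hedged illustration of the standard FCM formula, not the authors' implementation:

```python
# Type-1 fuzzy C-means membership update:
#   u_ij = 1 / sum_k (d_ij / d_ik)^(2/(m-1))
import numpy as np

def fcm_memberships(X, centers, m=2.0):
    # X: (n, d) samples; centers: (c, d); returns (n, c) membership matrix
    d = np.linalg.norm(X[:, None, :] - centers[None, :, :], axis=2) + 1e-12
    ratio = (d[:, :, None] / d[:, None, :]) ** (2.0 / (m - 1.0))
    return 1.0 / ratio.sum(axis=2)
```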

Summary of Fuzzy Based FR Methods:

In the above section, we have broadly summed up the power of fuzzy set theory for face recognition in light of recent developments. Fuzzy-based methods show high potential for dealing with complex face recognition issues, especially the complexities related to illumination and pose variation. Table 6 presents fuzzy-based face recognition approaches and a comparative analysis of recognition rates for selected methods on the corresponding databases.

7 Future research directions

The future of face recognition is not limited to the experimental lab; it has taken on prime importance in the modern digital society. A study [72] projects that by 2022 the global market for face recognition technologies will grow to $9.6 billion, with a compound annual growth rate (CAGR) of 21.3% over the period 2016-2022. Leading technology stakeholders around the world increasingly adopt face recognition technologies in their key products: Facebook with DeepFace, Google with FaceNet, Amazon with Rekognition and VK with FindFace have all embedded face recognition to enhance their products' capabilities. While satisfactory results have been achieved in controlled environments, significant research gaps remain for uncontrolled lighting, pose, occlusion and aging. Real-world scenarios are largely uncontrolled, so models must be strengthened to meet them. We identified several active future research directions from the selected literature, summarized as follows:

7.1 Soft biometrics:

Soft biometrics-based facial recognition is one of the most vibrant future research fields; it uses verbal descriptors for face identification. Ongoing research on soft biometrics has focused on recognition by categorical and comparative attributes, which is both simple and time-efficient. Although significant research efforts have been made in recent years [11, 47, 75, 84, 124, 179, 237, 247], many challenges remain. Soft biometrics can not only help track users but can also enhance a system's capability to recognize individuals even when face images are entirely unavailable. Moreover, these indicators can be combined with hard biometrics, and feeding such features into state-of-the-art CNNs can yield significant performance gains. We believe that incorporating temporal information and multi-source images into a soft-biometric-based recognition system can address many challenges in the person identification domain, with a deep impact on the current state of the art in face recognition research.

7.2 Age estimation:

Human facial images carry many biological details, such as age spots, wrinkles, hair color, face shape and skin texture, that provide a concrete foundation for estimating the age, gender, emotion or ethnicity of the subject. The computer vision and face recognition communities have put a great deal of effort into developing age estimation models based on facial images, driven by widespread potential applications such as intelligent advertising, human-computer interaction and effective filtering in criminal investigation [270]. A variety of methods have been applied: age estimation as a multiclass classification problem [33, 61, 68, 69, 135, 155, 269], regression-based methods [63, 81, 136, 146, 161, 162, 177] and deep learning-based methods [142, 148, 149, 159, 238, 280] are all common in the academic literature. The key barriers to reliable age estimation are the dependency on high-dimensional feature spaces, the limited availability of standard databases, massive amounts of unlabeled data, and inherent issues in digital face images such as moustaches, beards, glasses, occlusion, illumination, color and pose [198]. Despite significant research efforts, the performance of proposed methods to date is reasonable in controlled environments but still poor in dynamic real-world situations. In-depth research on the impact of race and gender on age progression is still needed, as are the classification of aging with respect to different facial regions and the training of more specific classifiers. A balanced data augmentation strategy with auto-encoders and the use of GAN networks could be a practical solution [8]; moreover, auto-encoders can be enhanced by fusing multiple biometrics with age to obtain more reliable and adaptive systems [198].

7.3 Gender classification:

Face recognition based gender classification is also an important research area with a wide range of social and commercial applications such as credit card verification, visual surveillance, image database investigation, and dynamic marketing systems [9]. Unlike other application areas of face recognition, such as variation in pose, illumination or occlusion, the information utilization model for gender classification needs to be more intelligent. In the recent literature, most gender classification systems use the same feature extractors and face databases as face recognition systems [4, 9, 34, 67, 123], but this does not hold universally. The challenges are multifold, including but not limited to the cultural/ethnic dependencies of facial appearance, soft biometrics, color and cosmetic effects, low resolution of input images, limited availability of standard benchmark datasets and partial occlusion of facial images.

7.4 Expression recognition:

Over the past two decades, facial expression recognition has been a major research area in the computer vision and face recognition community, with a wide range of applications such as social behavior analysis, robotic assistive nursing, psychology, political campaigns, social trend discovery, recommender systems and intelligent marketing [10, 125, 187, 205, 260, 295]. Conventional expression recognition algorithms work on constrained databases under the assumption that the databases contain rich information; in unconstrained environments, the performance of state-of-the-art approaches is limited by issues inherent in the recognition process. Moreover, facial expressions are spontaneous, instantaneous, highly heterogeneous, and their appearance varies from person to person. Over the past few years, a valuable and growing research trend toward micro-expression recognition has been noticeable in the academic literature [205, 303]. Micro-expression recognition is also challenging but has interesting and useful commercial applications.

7.5 Pose variation:

A major technical constraint in building a robust pose-invariant recognition system is the limited number of available training samples. The pose recognition research community is working to develop fully automatic and reliable algorithms able to work with few training samples, since technical and environmental issues often prevent simply collecting more training data. Although several promising methodologies [52, 285] have been introduced recently, each works well in one respect while leaving major performance gaps in others. Because existing approaches still have limited performance, individual improvements in pose-robust feature extraction, multi-view subspace learning, and face synthesis are in high demand. From this perspective, combining several deep learning techniques into hybrid solutions offers better capacity to accommodate the complex challenges of real-world dynamic situations.

7.6 Transfer learning:

Among machine learning frameworks, transfer learning is simply the ability to transfer knowledge (learning) to a new set of models or scenarios. It has radically changed the performance of deep learning applications in natural language processing [271, 306], activity recognition [215], and pattern and object recognition. In recent years, transfer learning-based hybrid models [40, 49, 101, 209, 278] for face recognition have gained significant popularity. In light of the issues discussed in Section 5, we argue that integrating transfer learning with the highlighted challenges in face recognition could yield promising results.
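The basic transfer-learning recipe referred to here can be sketched in a few lines: reuse a network pre-trained on a large source task, freeze its backbone, and fine-tune a new identity head on the smaller target face dataset. A minimal sketch, assuming torchvision (0.13 or later) and a hypothetical target gallery of 500 identities; the ImageNet backbone is a stand-in for whatever source model is transferred:

```python
# Transfer-learning sketch: frozen pre-trained backbone + new identity head.
import torch.nn as nn
from torchvision import models

model = models.resnet18(weights="IMAGENET1K_V1")   # stand-in pre-trained backbone
for p in model.parameters():
    p.requires_grad = False                        # freeze transferred layers
model.fc = nn.Linear(model.fc.in_features, 500)    # new head: 500 target identities
# Train only model.fc on the target faces; optionally unfreeze top blocks later.
```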

7.7 Robustness with limited training data:

A key challenge for learning-based approaches (deep learning, transfer learning or dictionary learning) is that real-world training samples are limited, while the robustness of these algorithms depends on a wide range of training samples. In recent years, approaches such as [22, 121, 221] have offered powerful directions for significant performance improvement, but a gap remains for practical scenarios. Future researchers need to strengthen approaches such as low-dimensional subspaces or simple mappings, which require comparatively fewer training samples than learning-based approaches.

7.8 Computational time:

Although most existing face recognition methods achieve high accuracy (in some cases 100%, as reported in Table 3), training the recognition model takes a great deal of time. Deep learning frameworks in particular must internally perform billions of comparisons, which makes the models impractical for real-world scenarios. For example, automated surveillance of a football stadium crowd requires recognizing subjects in milliseconds and reacting accordingly. Computation time is therefore critical, but unfortunately the most accurate deep and sparse models are not computationally efficient. Reducing computational time is one of the key research challenges for future researchers.

7.9 Multi-spectral face recognition:

Multispectral face recognition has received remarkable attention over the past few years because of its ability to acquire spatial and spectral information across the electromagnetic spectrum that traditional visible-light imaging systems cannot capture. It is a positive alternative to the traditional visual spectrum [25, 51, 204, 302] and has shown considerable improvements in dealing with illumination challenges. Additionally, with advances in deep learning, CNN-based approaches have high discriminative power for analyzing multi-dimensional features; adapting the multispectral feature space in CNNs can therefore be a promising research direction for the future face recognition community.

8 Conclusion

As of today, face recognition is one of the most vibrant research areas in the machine learning, artificial intelligence and pattern recognition communities. With its many social and scientific applications, it will have an enormous impact on the coming digital society. In the present survey, we have discussed a number of classical appearance-based approaches and numerous recent machine learning-based approaches, such as deep learning, transfer learning and dictionary learning, for face recognition. The accuracy achieved by different researchers has been discussed on various datasets such as MS-Celeb-1M, YALE, AT&T, AR, 3DMAD, GEORGIA, PubFig, and CMU-PIE. Key challenges have also been discussed, including computational cost reduction, illumination variation, viewpoint robustness, scale and rotation variation, and 3D data acquisition. Occlusion and background subtraction still leave many open questions for future research. Finally, by comparing various techniques, we conclude that hybridizing soft biometrics with other existing face recognition methods can achieve remarkable success in this promising research area.