Face Recognition Using VGG16 CNN Architecture for Enhanced Security Surveillance—A Survey

Olaitan, Alashiri; Adewale, Adeyinka; Misra, Sanjay; Agrawal, Akshat; Ahuja, Ravin; Oluranti, Jonathan

doi:10.1007/978-981-19-5037-7_80

Alashiri Olaitan⁴¹,
Adeyinka Adewale⁴¹,
Sanjay Misra⁴²,
Akshat Agrawal⁴³,
Ravin Ahuja⁴⁴ &
…
Jonathan Oluranti⁴¹

Part of the book series: Lecture Notes in Electrical Engineering ((LNEE,volume 936))

1039 Accesses
2 Citations

Abstract

A review of the web camera surveillance, face recognition, convolution neural network (CNN), digital images are presented in this work. Previous works on face recognition systems for enhanced surveillance-based security are presented together with relevant deep learning concepts and theories relating to convolutional neural networks. In-depth analysis is summarized and presented in concise way.

Access provided by Autonomous University of Puebla. Download conference paper PDF

Automation of surveillance systems using deep learning and facial recognition

Article 06 January 2023

Convolutional Neural Network Super Resolution for Face Recognition in Surveillance Monitoring

A Convolution Neural Networks and IoT-Based Approach to Surveillance System

Keywords

1 Introduction

Face recognition (FR) is among the most well-studied aspects of computer vision. Through the use of deep learning algorithms and bigger volume datasets, researchers have subsequently seen substantial development in FR, notably for limited social media web images, such as high-resolution photos of famous faces taken by professional photos [1]. However, the far more difficult FR in unrestrained and low-resolution surveillance imagery, on the other hand, remains unsolved and largely unexplored. In the extent of image analysis besides computer vision, face recognition is a challenging task. Face recognition is a biometric technology that uses a digital image to identify or authenticate a person. It is mostly utilized in security and surveillance. Deep neural networks have lately made great progress in general object recognition [2]. Automatic Face Recognition and Surveillance aid in the development of a secure technology for the upcoming era of computers [3]. The following is indeed the basis for a review of the literature: As stated in [4] variations in time, age, and circumstances have an impact on each person’s face, skeletal structure, muscle development, and body composition. Face recognition systems, equally images and videos, are becoming increasingly popular to use. This system is essentially focused on a variety of poses, expressions, and illuminations.

Face recognition has been shown to exhibit compression impacts since the imageries are immediately retained and delivered in a compressed state, while depictions have been tested with extensively, but mostly in uncompressed image files. It addresses challenges like monitoring and image classification and object appearances in a way utilized as a collection and compressed video segments per any blob examining even while working with still-to-videos.

Face recognition was demonstrated in real-time using a camera, an image, or a set of faces tracked in a video by the researchers in [2]. They evaluated the distance among both landmarks and particularly in comparison the test image to various established encoded image landmarks, derived HOG features, and then categorized them notwithstanding the lighting, expression, radiance, aging, transformations (translate, rotate, and scale the image), or pose during the recognition phase. Researchers were able to create an automatic face recognition system using a picture or video of a person’s face acquired via a mobile device or a webcam.

Face recognition was achieved by integrating two methods: the histogram of oriented gradient (HOG) and the Convolutional Neural Network (CNN). HOG excels at identifying image edges and corners. When contrasted with the local binary pattern (LBP) [5], which employs all eight dimensions for each pixel, HOG utilizes a specific direction for each pixel. However, the coarseness of the binning used by LBP causes it to lose information. Under complicated changes in light and time conditions, HOG features with fewer dimensions perform better than LBP features. With reduced computing time for feature extraction and a reduced number of feature vector magnitudes, HOG features outperform VGG16, VGG19, and others.

The very same image may reflect different pixel information due to variations in illumination and intensity of the acquired images, which is a significant impediment in identifying a person’s face. The acquired image was first processed to grayscale, and afterward, the gradient of each pixel was examined based on the lighter to a darker pixel value in the HOG approach. It spotted all of the faces in an image frame based on the gradient analysis. To set up the posing and projecting an image of the frontal face, a face landmark is being employed.

The CNN learning algorithm was used to detect faces depending on the encoded face of the current frame and already cached encoded faces. Face recognition has long been regarded as a watershed moment in image processing. Even while cameras are now found in almost every home, on the streets, and in businesses, detecting a person from the footage is a time-consuming operation, limiting the security’s effectiveness, this is one of the reasons face recognition have to be enhanced for effective use of the webcam for surveillance [6]. They overcame the limitations of using webcams for surveillance by improving the face recognition algorithm. The face recognition algorithm consistently contrasts a live video stream with an uploaded image from the database, so when the specific object that is the person is detected, an incredibly quick alert is sent. Surveillance cameras, particularly those installed at airports and other public locations, maybe an extremely effective tool for locating missing people as well as other wanted individuals. They overcame the detection of more than one face in their work.

Surveillance is essential nowadays as societies depend on them to improve safety and security, especially where crime is likely to occur such as car parks, supermarkets, office environments banks, construction sites, and motorways. Currently, video data is mainly being used for forensic purposes; this makes it lose the benefit of being a pro-active real-time alerting system since most of the crimes are usually discovered after the harm has been done. This leaves room for further research in surveillance that is continuous monitoring to send an alert in real-time. Smart cameras are now being incorporated into intelligent systems for surveillance to recognize looks in a crowd in real-time.

2 Biometrics

Biometrics is body measurements and calculations associated with human features. Physiological appearances are referred to the shape of the body. Examples to mention on which researches are going on few embraces like face recognition, iris recognition, palm veins, Deoxyribonucleic acid (DNA), fingerprint, face recognition, palm, vein, retina, and odor/scent [7,8,9,10].

2.1 Face Biometrics

Face recognition finds a useful application in several cases including eliminating duplicate entries in a country’s voter registration system preventing a person from registering twice. In access control such as computer logon or office access, security at airports for passengers and airline staff all around the world. It is driver’s licensing offices, for next of kin benefit recipients, police bookings, banking, electoral registration, employee IDs, identification of newborns, national identity cards, in surveillance operations, passport verification, criminals list verification at police sector, Visa processing, and Card Security control at ATMs.

While facial recognition can be done reliably, quickly, and continuously in controlled environments, the technology is currently too rigid and general to cope with real-world situations. Aging, transformation in facial hair, viewpoint distinctions, and cluttered contextual are an automatic facial recognition system that faces some significant problems. Face identification is difficult to automate because faces are a type of natural object that does not lend itself to simplistic geometric interpretations. Computer-assisted face recognition has the potential of being able to manage a huge quantity of faces, meanwhile, the human brain has restricted memory [11].

2.2 Face Detection Methods

Detecting Faces Techniques are classified as feature-based techniques, in which characteristics provided by [12] express an individual’s identity and image-based techniques. With all its statistics and structural classifier, the feature-based algorithm [13] outlines how local features are obtained and their positions. Similarly, image-based approaches [14] used algebraic processes to define color modifications. The qua-ternion is used to build generalized linear filtering methods and a new color edge detector.

Researchers in [4] classified facial recognition algorithms into two invariant techniques: discriminative and generative approaches. Discriminative approaches rely on basic data such as age, weight, skeletal structure, and body mass to be studied, however generative approaches outline the procedure for feeding the data into the model. The Who Is It database, which was created by developing an effective database, comprises age and weight information as well as facial imagery. The database solely includes public figures in an attempt to show changes in age and weight over time. The program tries to distinguish images that have changed in age and weight over time. The outcome of weight is evaluated besides subsequently, neural networks are taught. Training comes first, followed by testing in a learning-based system. In comparison to other methodologies, the researcher obtained a 28.53% Rank-I identification performance accuracy with a 3.4% minimal error rate. Over decades, the system has attempted to detect an individual’s facial appearance, position, aging, actual or artificial, disguise, and plastic surgery as described by certain researchers on covariates of imageries.

Face detection in color photos is challenging once the background is multifaceted and the luminance varies, making skin detection problematic and resulting in false positives. A parallel structure algorithm of skin color recognition was used to enhance detection reliability and to create a classifier using a Gaussian-mixture model and the Ada-boost training algorithm to eliminate false positives. Face Candidates algorithm is used to test the face detection algorithm for the skin color model, and then Ada-boost trained algorithm is used to test the classifier’s verification algorithm on several images as training examples. Face recognition has also employed color palettes with various applications such as images retrieval, color palette, and color transfer.

The expression and impression of color amalgamations is conveyed by the mixing of colors classified into abstract categories through a distinctive set of colors. The pattern matching method outlines the entire facial features to associate input and reference patterns for face detection. For a typical human being, the most difficult challenge is to apply the facial recognition retrieval model for a correct match in the shortest amount of time. Exclusively, when relating with non-static or dynamic environments such as live streaming, webcam recording, or viewing real-time video where facial features are not distinct enough to use as an input image. To create such a model, the research presented by [15] created a model for solving both steps, Facial Detection, and Facial Recognition. Pattern recognition in video files is used in the facial detection stage, which is executed using a single picture matching algorithm. The second phase was to deliberate the image input from the camera, which began with a GUI for chopped square frame design to transmit the important key extent for separating facial features from a complicated background. Second, the out-turn picture obtained through the data source is recognized, and the mean is calculated using Successive Mean Quantization Transform (SMQT) and Eigen techniques applied to the images. After that, it breaks up using the Sparse Network of Windows (SNOW) classifier for facial detection at a high-speed rate with no impact on the background context. The method has been tested on 150 input image snapshots collected from a webcam and has been verified to be 100% accurate.

Developed a new Surveillance Face Recognition Challenge, dubbed QMUL-SurvFace, to encourage the development of innovative FR algorithms that are successful and robust for low-resolution surveillance face pictures [16]. The low-resolution facial images were captured from real surveillance videos, not from fake downsampling of high-resolution footage. This baseline contains 463,507 facial images representing 15,573 distinct identities taken in uncooperative surveillance scenarios over a significant period. As a result, QMUL-SurvFace is a true-performance surveillance FR problem with low resolution, motion blur, uncontrolled poses, changing occlusion, poor illumination, and backdrop clutters. Evaluate the FR performances of five sample deep learning face recognition models (DeepID2, CentreFace, Vgg-Face, FaceNet, and SphereFace) against current standards on the QMUL-SurvFace task [16].

Appearance-based Face Detection. To identify the relevant features of the face and non-facial imagery, these methods use statistical analysis and machine learning techniques. The learnt qualities are expressed in the format of distribution models or discriminant functions, which are subsequently used to detect faces. Meanwhile, dimensional minimization is commonly used to increase computation and detection effectiveness.

Feature-based Face Detection. These methods, also referred to as constituent face recognition, rely on the relationship between the components of the face, it employs invariant features of faces for detection. The idea is that humans can detect faces and objects in a variety of positions and lighting environments, hence attributes or features (such as brows, nose, eyes, mouth, and skin color) must be invariant across these variations. A statistical model is initiated based on the retrieved features to depict their relationships and verify the presence of a face. The reliability of visual feature detection is crucial in this approach.

2.3 Face Recognition Methods

Face recognition methods can be grouped broadly into two: Learning-based Methods and Hand-crafted Methods.

Learning-based Methods. The learning-based methods usually employ convolutional neural networks (CNN) of varying configurations and depths (layers). A baseline CNN is made up of several layers, each of which uses a variational function to transfer one volume of activations to another. Its architecture consists of at least a Convolutional Layer, Pooling Layer, and Fully-Connected Layer. Due to their excellent learning ability especially for large-scale input data, they are constantly being employed by more and more researchers [17]. Deep learning’s accomplishment in face recognition has recently surpassed those such as handcrafted and machine learning methods [18]. CNN architectures strive to be deeper and much more complex to acquire improved recognition performance, which consumes resources, time, as well as space. Nevertheless, CNN is used to learn and extract useful features from an image, they also have the advantage that different configurations already trained for specific tasks exist and could be adapted. Certain layers of a trained model (typically the last output layer) can be removed, and the activations of the lower levels can then be used as fixed feature extractors. Several studies have achieved promising results using these deep characteristics [19, 20], and [21].

Hand-crafted Methods. The hand-crafted methods are further divided into four broad categories: a global approach, local approach, appearance or holistic approach, and other methods (which do not fall under the first three).

Global Approach. These are features based on the general texture or appearance of the image. There are a lot of global feature extraction approaches in literature but the most widely used are Gabor filters [22]; Histogram of oriented Gradients [23]; Local phase quantization (LPQ) [24]; Discrete Cosine Transform (DCT) [25]; Local Binary Patterns (LBP) [24, 26]; Weber local descriptor (WLD) [23, 27]; Local Oriented Statistics Information Booster (LOSIB) [23].

Local Approach. This approach focuses on the local facial features such as eyes, mouth, and nose, computes their locations, and applies statistical properties, geometry, or appearance as the determining factors for classification. These are traits that are focused on the image’s most crucial details and their spatial relationships with each other. The most commonly used textures are Scale Invariant Feature Transformation (SIFT) [28]; Speeded Up Robust Features (SURF) [29]; Symmetry Assessment by Feature Expansion (SAFE) [28]; Binary Robust Invariant Scalable Keypoints (BRISK) [30]; Oriented FAST and Rotated BRIEF (ORB) [31]; Phase Intensive Local Pattern (PILP) [29].

Holistic Approach. The entire facial region is regarded as data input for the facial capture system in this approach e.g., Eigenfaces, Principal Component Analysis (PCA), Linear Discriminant Analysis and independent component analysis, and so on. The holistic-based technique tries to distinguish a face by employing global representations, that is, the image as a whole. To acquire the feature vectors, methods like Principal Component Analysis (PCA), Independent Component Analysis (ICA), and Linear Discriminant Analysis (LDA) are utilized. Examples of these features include Eigenface (implemented using PCA) and Fisherface (implemented using LDA).

Biologically Inspired Features (BIF). are imitative primate’s feed-forward model of visual object recognition pipeline which is acknowledged to be intelligent to recognize visual patterns with high exactness. Gabor functions are employed to model basic cells in mammalian brains’ visual cortex. Gabor filters show frequencies and orientations that are similar to frequencies and orientations in the human visual system. As a result, Gabor filter image processing is regarded to be similar to human comprehension in the visual system. These features were applied by [13].

Elastic Bunch Graph Matching (EBGM). EBGM stands for feature-based face recognition. Certain facial traits are selected by manual interaction. These characteristics are used to create a bunch graph. The bunch graph’s numerous nodes represent various facial landmarks. We may establish the gap among a given test image trait and the closest accessible train image feature by scanning for the shortest measure and analyzing a single train image to all of the training images. A feature extraction method incorporates both a holistic and a local approach. 3D imagery is used in the majority of hybrid techniques. Also, because the image of a person’s face is acquired in 3D, the technology can identify the curves of the eye sockets, including the forms of the chin and forehead. Since the technique uses depth and an axis of assessment, a profile face may be sufficient as it has significant data to build a whole face.

Others. Some researchers [32], have exploited the use of facial marks such as moles and freckles to try to recognize faces though in combination with LBP and Fisher vectors. Other approaches include Active Shape Models (ASM) which uses the shape of an object (face) as its features by a collection of landmark points at clear corners of the face and facial landmark boundaries [33]. AAM builds a shape model and an intensity model from such a collection of training samples using principal component analysis (PCA) [34].

3 Generic Modes of Face Recognition System

A face recognition system comparing it to other biometric recognition systems operates in two modes [35]:

a.
The training mode: the face image of an individual is captured using an acquisition sensor like camera and scanner. The acquired face image is processed and stored in the database with a label (name or unique number) for easy identification or verification.
b.
The testing mode: the face image stored is once again acquired and processed to obtain the necessary features required to either verify or identify the individual.

3.1 Generic Modules of Face Recognition Systems

A face recognition system as shown in Fig. 1 is designed using the following basic modules. Modules 3 and 4 are carried out with CNN i.e., the CNN architecture is used for the features extraction and the classification stages.

A block diagram of a generic face recognition model with 4 modules presents how the features of an image are stored in the database via different stages. — **Fig. 1**

a.
Images acquisition: an acquisition sensor like a camera or sensor is used to capture faces from images or videos. The images must have a considerable amount of spatial information about the face before they can be useful.
b.
Images pre-processing: the pre-processing entails cropping out faces from the acquired images and performing some enhancement on them, to make subsequent processing easy and also to advance the overall performance of the system.
c.
Feature extraction: this involves extracting the low-level features like edges, lines, dots, medium-level features e.g., texture and color, and high-level features e.g., shape from the face images. The features are used for the recognition process.
d.
Matching/classification: this compares the features obtained during recognition against the stored images to produce a matching score.
e.
Ion against the stored images to engender a matching score.

3.2 Overview of Convolutional Neural Network

Convolutional Neural Network (CNN) helps to achieve excellent learning ability for classification of both large-scale and small-scale input data [17]. CNN because of its flexibility and adaptability makes it possible to take out different configurations from already trained models. Convolution simply means applying filters also known as kernels or windows to each image pixel. It tries every possible match. Convolution is performed at each convolutional layer. A layer means a stacking operation and in a convolution layer, the layer consists of the stack images that have been filtered. CNN has been existing since the 1990s but has gained popularity due to its ability to solve recognition problems hence improving computer vision. CNN has its uniqueness from another neural network because of its assumption that all inputs are images, this allows it models its architecture in a way that it recognizes basic image-defined features which help in pattern recognition, face recognition, digits recognition, and many more.

3.3 Overview of Deep Neural Network

Deep learning is a sort of machine learning which enables computers to learn by instance in the likely manner that humans do. Deep learning has progressed to the degree that it can currently beat humans in certain tasks, which include object classification in imagery.

The intrusion detection challenge has been compliant with machine learning methods due to the vast capacity of network telemetry besides other sorts of security data. Numerous modern commercial intrusion detection systems, or security platforms, employ machine learning-based algorithms as part of their detection technique. These methods are often classified as part of the intrusion detection approach’s oddity detection class.

There are two types of machine learning models: shallow learning or typical models and deep learning models from 40 machine learning models. Deep learning models are neural network models with a large degree of hidden layers that are currently in use. These models can learn extremely complex nonlinear functions, and hierarchical layering allows them to learn relevant feature representations from incoming data. Deep learning algorithms have recently achieved success in a variety of domains, including image 45 categorization. There are two key reasons deep learning has lately become useful:

a.
Deep learning necessitates a significant deal of computational power. A parallel architecture is suited for deep learning on high-performance GPUs. This helps developers to reduce deep learning network training time from weeks to hours each when used during conjunction either clusters or cloud computing.
b.
Deep learning requires substantial labeled data. For example, driverless car development requires millions of images and thousands of hours of video. Apart from scalability, another benefit mention often about deep learning models is their ability to perform automatic feature extraction from raw data, also called feature learning.

Deep learning architectures for example deep neural networks, deep belief networks, convolutional neural networks, and recurrent neural networks have been put into fields including natural language processing, vision speech, computer vision, speech recognition, audio recognition, medical image analysis, machine translation, material inspection, and bioinformatics to mention few where the findings are on par with, if not better than, the efficiency of a human expert. Generally, these architectures can be put into 3 specific categories:

Feed-Forward Neural Networks. This is the least used model of neural networks in practical applications. The first layer is the inputs, while the last layer is the outputs. Neural networks with far more than a hidden layer are referred to as “deep” neural networks. They do a series of calculations that change how related the instances are. Each layer’s neurons’ activity is a nonlinear function of the previous layer’s neurons’ activities.

Recurrent Networks. In their connection graph, these have directed cycles. As a result, following the arrows can sometimes lead you back to where you started. These may exhibit complicated dynamics, making training them difficult. They have a physically more realistic aspect to them. There is a great deal of interest right now in figuring out how to train recurrent networks efficiently. Modeling sequential data using recurrent neural networks is a quite natural development. They’re similar to very deep nets with one hidden layer each time slice, with the exception that they use the same weights and receive input at each time slice. They possess the ability to recall information for a long time in their concealed condition, but it is extremely difficult to instruct them to use this skill.

Symmetrically Connected Networks. These are similar to recurrent networks, but the unit connections are symmetrical (they have the same weight in both directions). Recurrent networks are substantially more difficult to examine than symmetric networks. As they follow an energy function, they are likewise limited in what they can perform. “Hopfield Nets” are symmetrically linked nets with no hidden units. “Boltzmann machines” are hidden units in an asymmetrically linked network.

4 Related DNN-Based Face Recognition Work

This section reviews existing works related to the development of a face recognition system for enhanced security surveillance. Several research works have been carried out in the field of face recognition from images captured by webcam with impressive results; however, there is still a lot of room for contribution. A summary table is presented in Table 1.

Table 1 Previous work on the development of a face recognition system for enhanced security surveillance

Full size table

5 Summary

After reviewing existing works of literature, Deep Convolutional Neural Network (DCNN) has proved to attain state-of-the-art results for face recognition as a security means to prevent intrusion, 1w especially for large datasets. DCNN also has the advantage over other neural networks for image classification because DCNN automatically detects the important features without any human supervision. This chapter also explains the importance of surveillance concerning face recognition.

References

Lumaban MBP, Battung GT (2020) WEBCAM-based surveillance system with face recognition feature. Int J Eng Adv Technol 9
Google Scholar
Ahamed H, Alam I, Islam MM (2018) HOG-CNN-based real-time face recognition. International conference on advancement in electrical and electronic engineering, pp 1–4
Google Scholar
Chawla D, Trivedi MC (2018) A comparative study on face detection techniques for security surveillance. A comparative study on face detection techniques for security surveillance, pp 531–541
Google Scholar
Singh M, Nagpal S, Singh R, Vatsa M (2014) On recognizing face images with weight and age variations. IEEE Access 2:822–830
Article Google Scholar
Ghorbani M, Targhi AT, Dehshibi MM (2015) HOG and LBP: towards a robust face recognition system. International conference on digital information management, pp 138–141
Google Scholar
Kumar PR, Surendar M, Kumar TUMDM (2019) Smart surveillance cam using face recognition algorithm. J Netw Comput Appl
Google Scholar
Aniche C, Yinka-Banjo C, Ohalete P, Misra S (2021) Biometric e-voting system for cybersecurity. In: Artificial intelligence for cyber security: methods, issues and possible horizons or opportunities. Springer, Cham, pp 105–137
Google Scholar
Ugot OA, Yinka-Banjo C, Misra S (2021) Biometric fingerprint generation using generative adversarial networks. In: Artificial intelligence for cyber security: methods, issues and possible horizons or opportunities. Springer, Cham, pp 51–83
Google Scholar
Olanrewaju L, Oyebiyi O, Misra S, Maskeliunas R, Damasevicius R (2020) Secure ear biometrics using circular kernel principal component analysis, Chebyshev transform hashing and Bose–Chaudhuri–Hocquenghem error-correcting codes. SIViP 14(5):847–855
Article Google Scholar
Assibong PA, Wogu IAP, Misra S, Makplang D (2020) The utilization of the biometric technology in the 2013 Manyu division legislative and municipal elections in cameroon: an appraisal. In: Advances in electrical and computer technologies. Springer, Singapore, pp 347–360
Google Scholar
Mohammed AA, Minhas R, Wu QMJ, Sid-Ahmed MA (2011) Human face recognition is based on multidimensional PCA and extreme learning machines. Pattern Recogn 44(10–11):2588–2597. https://doi.org/10.1016/j.patcog.2011.03.013
Article MATH Google Scholar
Antón-Rodríguez M, González-Ortega D, Díaz-Pernas F, Martínez-Zarzuela M, Díez-Higuera J (2012) Color-texture image segmentation and recognition through a biologically-inspired architecture. Pattern Recogn Image Anal 22:54–68
Google Scholar
Choi SE, Lee YJ, Lee SJ, Park KR, Kim J (2011) Age estimation using a hierarchical classifier based on global and local facial features. Pattern Recogn 44(6):1262–1281. https://doi.org/10.1016/j.patcog.2010.12.005
Article MATH Google Scholar
Carré P, Denis P, Fernandez-Maloigne C (2014) Spatial color image processing using clifford algebras: application to color active contour. SIViP 8:1357–1372
Article Google Scholar
Pattanasethanon P, Savithi C (2012) Human face detection and recognition using web-cam. J Comput Sci 8:1585
Article Google Scholar
Mustafah YM, Azman AW, Bigdeli A, Lovell BC (2007) An automated face recognition system for intelligence surveillance: smart camera recognizing faces in the crowd. 2007 1st ACM/IEEE International conference on distributed smart cameras, ICDSC, pp 147–152. https://doi.org/10.1109/ICDSC.2007.4357518
Wu X, He R, Sun Z, Tan T (2018) A light CNN for deep face representation with noisy labels. IEEE Trans Inf Forensics Secur 13(11):2884–2896. https://doi.org/10.1109/TIFS.2018.2833032
Article Google Scholar
Zheng HH, Zu YX (2018) A normalized light CNN for face recognition. J Phys: conference series 1087(6). https://doi.org/10.1088/1742-6596/1087/6/062015
Shang C, Ai H (2018) Cluster convolutional neural networks for facial age estimation. Proceedings—international conference on image processing, ICIP, 2017–Sept, pp 1817–1821. https://doi.org/10.1109/ICIP.2017.8296595
Rattani A, Reddy N, Derakhshani R (2018) Convolutional neural network for age classification from smart-phone based ocular images. IEEE international joint conference on biometrics, IJCB 2017, 2018–Jan, pp 756–761. https://doi.org/10.1109/BTAS.2017.8272766
Yoo B, Kwak Y, Kim Y, Choi C, Kim J (2018) Multitask learning with weak label expansion. IEEE Signal Proc Lett 25(6):808–812. Retrieved from https://doi.org/10.1109/LSP.2018.2822241
Bharadwaj S, Bhatt HS, Vatsa M, Singh R (2010) Periocular biometrics: when iris recognition fails. BTAS, pp 1–6
Google Scholar
Castrillón-Santana M, Lorenzo-Navarro J, Ramón-Balmaseda E (2016) On using periocular biometric for gender classification in the wild. Pattern Recogn Lett 82:181–189. https://doi.org/10.1016/j.patrec.2015.09.014
Article Google Scholar
Xu J, Cha M, Heyman JL, Venugopalan S, Abiantun R, Savvides M (2010) Robust local binary pattern feature sets for periocular biometric identification. IEEE 4th International conference on biometrics: theory, applications and systems, BTAS 2010, pp 3–10. https://doi.org/10.1109/BTAS.2010.5634504
Lyle JR, Miller PE, Pundlik SJ, Woodard DL (2012) Soft biometric classification using local appearance periocular region features. Pattern Recogn 45(11):3877–3885. https://doi.org/10.1016/j.patcog.2012.04.027
Article Google Scholar
Uzair M, Mahmood A, Mian A, McDonald C (2015) Periocular region-based person identification in visible, infrared, and hyperspectral imagery. Neurocomputing 149:854–867
Article Google Scholar
Aginako N, Castrillón-Santana M, Lorenzo-Navarro J, Martínez-Otzeta JM, Sierra B (2017) Periocular and iris local descriptors for identity verification in mobile applications. Pattern Recogn Lett
Google Scholar
Sequeira AF, Chen L, Ferryman J, Wild P, Alonso-Fernandez F, Bigun J (2017) Cross-spectral iris/periocular recognition competition, in Biometrics. 2017 IEEE international joint conference on, pp 725–732
Google Scholar
Bakshi S, Sa PK, Majhi B (2015) A novel phase-intensive local pattern for periocular recognition under the visible spectrum. Biocybernetics Biomed Eng 35(1):30–44. https://doi.org/10.1016/j.bbe.2014.05.003
Article Google Scholar
Karahan Ş, Karaöz A, Özdemir ÖF, Gü AG, Uludag U (2014) On identification from periocular region utilizing sift and surf. Proceedings-22nd Europeans
Google Scholar
Alonso-Fernandez F, Bigun J (2016) A survey on periocular biometrics research. Pattern Recogn Lett pp 96–105
Google Scholar
Uzair B, Menaa F, Khan BA, Mohammad FV, Ahmad VU, Djeribi R, Menaa B (2018) Isolation, purification, structural elucidation, and antimicrobial activities of kocumarin, a novel antibiotic isolated from actinobacterium Kocuria marina CMG S2 associated with the brown seaweed Pelvetiacanaliculata. Microbiol Res 206:186–197. https://doi.org/10.1016/j.micres.2017.10.007
Article Google Scholar
Zou F, Li J, Min W (2019) Distributed face recognition based on load balancing and dynamic prediction. Appl Sci (Switzerland) 9(4). https://doi.org/10.3390/app9040794
Makhija Y, Sharma RS (2019) Face recognition: novel comparison of various feature extraction techniques, in Harmony search and nature inspired optimization algorithms. Springer, pp 1189–1198
Google Scholar
Sawhney S, Kacker K, Jain S, Singh N (n.d.) No title. Real-time smart attendance system using face recognition techniques
Google Scholar
Besnassi M, Neggaz N, Benyettou A (2020) Face detection based on evolutionary Haar filter. Pattern Anal Appl 23(1):309–330
Article Google Scholar
Yun W-H et al (2018) Automatic recognition of children engagement from facial video using convolutional neural networks. IEEE Trans Affect Comput 11(4):696–707
Article MathSciNet Google Scholar
Tabatabaie ZS et al (2009) A hybrid face detection system using a combination of appearance-based and feature-based methods. Int J Comput Sci Netw Sec 9(5):181–185
MathSciNet Google Scholar
Wu, Yulin, and Mingyan Jiang (2018) Multi-layer CNN features fusion and classifier optimization for face recognition. Proceedings of the 2018 2nd international conference on computer science and artificial intelligence
Google Scholar
Aitkenhead MJ, McDonald AJS (2003) A neural network faces a recognition system. Eng Appl Artif Intell 16(3):167–176. https://doi.org/10.1016/S0952-1976(03)00042-3
Article Google Scholar
Yang B et al (2017) Facial expression recognition using weighted mixture deep neural network based on double-channel facial images. IEEE Access 6:4630–4640
Article Google Scholar
Bhowmik MK et al (2019) Enhancement of robustness of face recognition system through reduced gaussianity in Log-ICA. Expert Syst Appl 116:96–107
Article Google Scholar
Sajjad M, Nasir M, Muhammad K, Khan S, Jan Z, Sangaiah AK, Elhoseny M, Baik SW (2020) Raspberry Pi assisted face recognition framework for enhanced law-enforcement services in smart cities. Future Gener Comput Syst 108:995–1007. https://doi.org/10.1016/j.future.2017.11.013
Article Google Scholar
Chowdhry DA, Hussain A, Ur Rehman MZ, Ahmad F, Ahmad A, Pervaiz M (2013) Smart security system for the sensitive area using face recognition. Proceedings—2013 IEEE conference on sustainable utilization and development in engineering and technology, IEEE CSUDET 2013, pp 11–14. https://doi.org/10.1109/CSUDET.2013.6670976
Chetty G, Sharma D (2006) Distributed face recognition: a multiagent approach. Lecture notes in computer science (Including subseries lecture notes in artificial intelligence and lecture notes in bioinformatics), 4253 LNAI, pp 1168–1175. https://doi.org/10.1007/11893011_148
Agarwal V, Bhanot S (2018) Radial basis function neural network-based face recognition using firefly algorithm. Neural Comput Appl 30(8):2643–2660
Article Google Scholar
Owandkar M, Kolte A, Peshave D, Jadhav S (2017) Attendance monitoring system using face recognition. Int Res J Eng Technol (IRJET) 4(5):1163–1168. Retrieved from https://www.irjet.net/archives/V4/i5/IRJET-V4I5228.pdf
Zhang Y, Hu C, Lu X (2018) Face recognition under varying illumination based on singular value decomposition and retina modeling. Multimedia Tools Appl 77(21):28355–28374
Article Google Scholar
Deniz S, Lee D, Kurian G, Altamirano L, Yee D, Ferra M, Hament B, Zhan J, Gewali L, Oh P (2018) Computer vision for attendance and emotion analysis in school settings
Google Scholar
Olivares-Mercado J et al (2018) Face recognition system based on MOTIF features. J Mod Opt 65(18):2124–2132
Article MathSciNet Google Scholar
Trokielewicz M, Szadkowski M (2017) Iris and periocular recognition in Arabian racehorses using deep convolutional neural networks. In: 2017 IEEE international joint conference on biometrics (IJCB). IEEE
Google Scholar
Gupta SK, Ashwin TS, Reddy Guddeti RM (2018) CVUCAMS: computer vision-based unobtrusive classroom attendance management system. Proceedings—IEEE 18th international conference on advanced learning technologies, ICALT 2018, pp 101–102. https://doi.org/10.1109/ICALT.2018.00131

Download references

Acknowledgements

The authors appreciate the sponsorship from Covenant University through its Center for Research, Innovation and Discovery, Covenant University, Ota Nigeria.

Author information

Authors and Affiliations

Center of ICT/ICE Research, Covenant University, Ota, Ogun, Nigeria
Alashiri Olaitan, Adeyinka Adewale & Jonathan Oluranti
Department of Computer Science and Communication, Østfold University College, Halden, Norway
Sanjay Misra
Amity University, Haryana, India
Akshat Agrawal
Shri Viswakarma Skill University, Gurgaon, Hariyana, India
Ravin Ahuja

Authors

Alashiri Olaitan
View author publications
You can also search for this author in PubMed Google Scholar
Adeyinka Adewale
View author publications
You can also search for this author in PubMed Google Scholar
Sanjay Misra
View author publications
You can also search for this author in PubMed Google Scholar
Akshat Agrawal
View author publications
You can also search for this author in PubMed Google Scholar
Ravin Ahuja
View author publications
You can also search for this author in PubMed Google Scholar
Jonathan Oluranti
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Akshat Agrawal .

Editor information

Editors and Affiliations

KIET Group of Institutions, Ghaziabad, India
Pradeep Kumar Singh
Institute of Computer Science, Polish Academy of Sciences, Warsaw, Poland
Sławomir T. Wierzchoń
Department of Computer Engineering, NIT Kurukshetra, Haryana, India
Jitender Kumar Chhabra
Department of Computer Science and Engineering, Institute of Technology, Nirma University, Ahmedabad, Gujarat, India
Sudeep Tanwar

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Olaitan, A., Adewale, A., Misra, S., Agrawal, A., Ahuja, R., Oluranti, J. (2022). Face Recognition Using VGG16 CNN Architecture for Enhanced Security Surveillance—A Survey. In: Singh, P.K., Wierzchoń, S.T., Chhabra, J.K., Tanwar, S. (eds) Futuristic Trends in Networks and Computing Technologies . Lecture Notes in Electrical Engineering, vol 936. Springer, Singapore. https://doi.org/10.1007/978-981-19-5037-7_80

Download citation

DOI: https://doi.org/10.1007/978-981-19-5037-7_80
Published: 16 November 2022
Publisher Name: Springer, Singapore
Print ISBN: 978-981-19-5036-0
Online ISBN: 978-981-19-5037-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Face Recognition Using VGG16 CNN Architecture for Enhanced Security Surveillance—A Survey

Abstract

Similar content being viewed by others

Automation of surveillance systems using deep learning and facial recognition

Convolutional Neural Network Super Resolution for Face Recognition in Surveillance Monitoring

A Convolution Neural Networks and IoT-Based Approach to Surveillance System

Keywords

1 Introduction

2 Biometrics

2.1 Face Biometrics

2.2 Face Detection Methods

2.3 Face Recognition Methods

3 Generic Modes of Face Recognition System

3.1 Generic Modules of Face Recognition Systems

3.2 Overview of Convolutional Neural Network

3.3 Overview of Deep Neural Network

4 Related DNN-Based Face Recognition Work

5 Summary

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Face Recognition Using VGG16 CNN Architecture for Enhanced Security Surveillance—A Survey

Abstract

Similar content being viewed by others

Automation of surveillance systems using deep learning and facial recognition

Convolutional Neural Network Super Resolution for Face Recognition in Surveillance Monitoring

A Convolution Neural Networks and IoT-Based Approach to Surveillance System

Keywords

1 Introduction

2 Biometrics

2.1 Face Biometrics

2.2 Face Detection Methods

2.3 Face Recognition Methods

3 Generic Modes of Face Recognition System

3.1 Generic Modules of Face Recognition Systems

3.2 Overview of Convolutional Neural Network

3.3 Overview of Deep Neural Network

4 Related DNN-Based Face Recognition Work

5 Summary

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation