Keywords

1 Introduction

Face plays a key role in human being’s interactions. By identifying the facial expressions from the images of face, a number of applications in the human–computer interaction area will get simplified. In the past, many research works had been carried out on the problems of the different facial expressions of the modern human being in FR analysis [1, 2]. The pioneer studies of Ekman [3] in late 70s have given a proof to the classification of the basic facial expression. Face detection (FD) is stated as the process of obtaining faces from scenes. So, in a positive way the system captures a certain region in the image as a face. This procedure has many implementations on face detection and poses estimation. Face location is an interpreted approach of face detection (FD). Its aim is to establish the position of a face where there is only one face in the image. Current works on facial expression and recognition vary mainly in the selected facial features and the classifiers that are used to discriminate amongst the various facial expressions. There are many techniques which are planned for modern human face appearance identification from still pictures (database of images) to image scenes (video recorder).

In the literature survey, most of the researchers attempt to classify basic facial expressions by using different algorithms such as Haar cascade [4], multi-task convolutional neural network (MTCNN) [5], Naive Bayes [6], SVM [7], HMM [8]etc.

The target of our paper is

  • To identify and track face from a static image of ancient skulls.

  • To recognize an early man’s face.

  • A comparative analysis of different machine learning-based classification approach for face detection in the area of archaeological field.

2 Details of Facial Detection Techniques

2.1 Haar Cascade

Haar cascade is an algorithm used in detecting objects in the artificial intelligence (AI) field to detect matter in an representation or videocassette which is based on the domain projected by Viola–Jones in 2001 [9]. A Haar cascade is basically a classifier detecting objects as it has been trained for. In this machine learning (ML) technique, a cascade function is skilled here by overlaying the positive images over a collection of negative images. Now the Haar-like features are different image features, and they owe their name to their inherent Haar wavelets. Cascading is a specific case of the whole learning based on a series of interconnected several classifiers using all information that gets collected from the output, from a given classifier which becomes an additional information for the next classifiers in the cascade.

The four levels of Haar Cascade algorithm have been shown in Fig. 1.

Fig. 1
figure 1

Haar cascade algorithm

Input Haar feature creating integral AdaBoost training cascading classifiers output.

Selection images.

It is popular about Haar Cascade algorithm that it can detect faces in an image, but can be trained to identify almost any type of object.

2.2 Multitask Cascaded Convolutional Neural Network (MTCNN)

One of the popular approaches is MTCNN, described by Kaipeng Zhang et al. [10]. The MTCNN is well-known as it is achieved then state-of-the-art resulting on a range of benchmark data sets and also popularly called landmark detection because of its capability to recognize other facial feature types like eyes and mouth. The system uses a cascade structure consisting three networks.

2.3 Analysing Face Detection Depending on Facial Features

Various performances applied on the facial features detecting modern human being by using different classification techniques to recognize a face, emotions, age-gender, etc. The face detection process has been shown in Fig. 2.

Fig. 2
figure 2

Face detection process

Captured Extracting Matching Comparing Graphically Representing

2.3.1 Captured

Algorithms of face detection spotlight on the detection of front human faces. The first step is face exposure, i.e. capturing. Capturing means record accurately in words or pictures. Face capturing means obtaining or trapping only the facial part from the taken data. In this paper, we have captured face of early man, skull, and mummies using Haar cascade classifier and MTCNN classifier.

2.3.2 Extracting

Face features extraction is the action of extracting face fundamental character like the left side of eye and right side of eye, nose, left side of mouth and right side of mouth etc.from face images. It is very significant for the initialization procedure methods such as face identifying, face appearance detection, or face identification. In this paper, while using Haar cascade, we extracted eyes from face images and while using MTCNN, we extracted eyes, nose, and mouth.

2.3.3 Matching

A face matching algorithm is a set of systems that a computer uses to identify a face in an image and then to match up to that face to another face (or features) to establish whether there is a match. In this paper, we have tried matching the face of an early man to the human face (i.e., reconstructed or a 3D model made), skull to an early man or human (reconstructed).

2.3.4 Comparing

When we make a comparison, we consider two or more things and discover the differences. In this paper, once collected results by using Haar cascade classifier and MTCNN classifier, we compared them and tried getting better results which would give us a good collection of information regarding early men, skulls and mummies.

2.3.5 Graphically Representing

Representing data graphically is one of the most commonly used approaches of presentation. The idea of graphical communication is to deliver message or information to the receiver in a valid way.

In this paper, the parameters of both eyes, nose, both sides of mouth based on the feature extraction of skull, group of early men by MTCNN classifier has been represented graphically (Fig. 3).

Fig. 3
figure 3

Commands using two classifiers to detect face

2.3.6 Face Detection PseudoCodes

3 Discussion of Proposed Work

We collected facial images of different categories like early man, ancient skull, mummy’s (ancient) face and ancient sculptures, to detect facial features. We used OpenCV software version-4.1.2 and two classifiers, namely Haar cascade classifiers and MTCNN classifier, and compared the results.

3.1 Haar Cascade Classifier

A Viola–Jones detection algorithm-based classifier, i.e. Haar cascade classifier, flowchart has been shown in Fig. 4.

Fig. 4
figure 4

A flowchart showing Haar cascade classifier mechanism

A very basic line up used to exhibit an image using Haar-like features and OpenCV can be written as follows:

figure c

3.2 MTCNN Classifier

The multitask cascaded convolutional neural network (MTCNN) algorithm works in three steps where one neural network is used for each.

Steps are:

  1. I.

    Firstly, it will figure the possible face location and their rectangular region like an attention network in faster regional convolutional neural networks (RCNN). The outcome of this stage is tones of face detection and large number of fake detection.

  2. II.

    Here, the images and outcomes that were predicted at first are used. It creates a clarification of the obtained consequence to erase mainly fake detection and aggregate rectangular region.

  3. III.

    This is the last stage, which purifies even more the forecast and adds facial marker predictions.

Now, a very basic line up used to demonstrate an image and its plotting using OpenCV, matplotlib can be written as follows:

figure d

3.3 Details of the Images

See Table 1.

Table 1 Description of various early man, ancient skull, sculpture and mummy’s images

3.4 Comparison Analysis I

Table 2 summarizes the difference between the observations obtained from the ancient skulls, fossils, early man, 3D model (face reconstruction), mummy (Egypt), sculptures face images using the two classifiers, viz. cascade classifier and MTCNN.

Table 2 Comparison analysis I of various face images using cascade classifier and MTCNN

3.5 Comparison Analysis II

Very few facial images are detected by cascade classifier only but they are not detected by MTCNN. Again most of the facial images are detected by MTCNN only. This comparison analysis has been shown in Table 3.

Table 3 Comparison analysis II of various face images using cascade classifier and MTCNN

3.6 Graphical Representation Analysis

In this paper, it has been shown that only MTCNN provides different parameters of the facial images obtained from various ancient skulls, early men, mummy and sculpture. The parameters (left side of eye (LE), right side of eye (RE), nose (N), left side of mouth (ML), right side of mouth (MR)) of two images are described graphically in Table 4.

Table 4 Graphical Representation analysis

In the first picture, only one skull’s face image has been taken which is detected using MTCNN classifier, whereas in the second picture, a group of eight early men face images has been taken, but only seven face images are detected. So graphically, the parameters of seven face images are plotted together. Also graphically the parameters of one face image of first picture are plotted. This graphical representation analysis is described in Table 4.

4 Conclusion and Future Scope

Face detection is the challenging aspects in the research area of artificial intelligence, computer vision (CV), where it analyses the present mankind’s face images.

In this paper, we tried a new idea and implemented face detection methods on ancient skull images, early man, mummies (Egypt) face images, sculptures and 3D face models (i.e. reconstructed ancient faces).

The focus in the direction of the face detection has been increased in the previous years due to its vast applications in the real world. The research conducted in this field leads to encouraging results but still we are incapable to find the face detection technique which is able to execute resourcefully in the various situations of daily lives.

We hope that the study of this paper will help and opens a new gate for not only archaeologists with their research works [11], but also in computer vision field.