Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

1 Introduction

A mechanism which takes as input a number n of data and combines them to result in a representative value of the information is called integration method. In the literature there exist methods which combine information from different sources. In a MNN it is common to use different methods for integrate the information, such as Type-1 and Type-2 fuzzy logic for Multimodal Biometrics [3], for Human Recognition Based on Iris, Ear and Voice Biometrics [16], for Pattern Recognition [17], the fuzzy Sugeno integral for Pattern Recognition of human face and fingerprint [10], Interval Type-2 Fuzzy Logic Sugeno Integral for Face Recognition [11], a probabilistic sum integrator in Classification using Redundant Mapping [9], a novel Bayesian learning method for information aggregation in modular neural networks [23], Self-Organizing Maps for Japanese Historical Character Recognition [4], Choquet Integral in a modular neural network [8], among others.

The Choquet integral was created by the French mathematician Gustave Choquet in 1953 [2]. The Choquet integral with respect to a fuzzy measure is a very popular data aggregation approach. The generalized Choquet integral with respect to a signed fuzzy measure can act as an aggregation tool, which is especially useful in many applications [6, 21, 24].

This paper is organized as follows: Sect. 2 shows the technique that is applied for the combination of the several information sources: the concepts of Fuzzy Measures and Choquet integral. Section 3 presents Edge detection based on Sobel with type-1 and type-2 fuzzy systems. Section 4 describes the architecture of the modular neural network proposed and in the Sect. 5 the simulation results are shown. And finally in Sect. 6 the Conclusions are presented.

2 Fuzzy Measures and Choquet Integral

In 1974 the concepts of “fuzzy measure and fuzzy integral” were defined by Sugeno [20] in order to define sets that do not have well-defined boundaries. A fuzzy measure is a nonnegative function monotone of values defined in “classical sets”. Currently, when referring to this topic, the term “fuzzy measures” has been replaced by the term “monotonic measures”, “non-additive measures” or “generalized measures” [14, 18, 22]. When fuzzy measures are defined on fuzzy sets, we speak of fuzzified monotonic measures [22].

2.1 Fuzzy Measures

A fuzzy measure can be defined as a fuzzy measure μ with respect to the dataset X, and must satisfy the following conditions:

  1. 1.

    \( \mu \left( X \right) = 1;\;\mu ({\O} ) = 0 \; {\text{Boundary conditions}} \)

  2. 2.

    \( Si \, A \subset B, \, then\;\mu \left( A \right) \le \left( B \right) \; {\text{Monotonicity}} \)

In condition two, A and B are subsets of X.

A fuzzy measure is a Sugeno measure or λ-fuzzy, if it satisfies the condition (1) of addition for some λ > −1.

$$ \mu \left( {A\, \cup \,B} \right) = \mu \left( A \right) + \mu \left( B \right) + \lambda \mu \left( A \right)\mu \left( B \right) $$
(1)

where λ can be calculated with (2):

$$ f\left( \lambda \right) = \left\{ {\mathop \prod \limits_{i = 1}^{n} (1 + M_{i} (x_{i} )\lambda )} \right\} - (1 + \lambda ) $$
(2)

The value of the parameter λ is determined by the conditions of Theorem 1.

Theorem 1

Let \( \mu \left( {\left\{ x \right\}} \right) < 1 \) for each x ∈ X and let \( \mu \left( {\left\{ x \right\}} \right) > 0 \) for at least two elements of X, then (2) determines a unique parameter in the following way:

$$ \begin{aligned} & If\; \mathop \sum \limits_{x \in X} \mu \left( {\{ x\} } \right) < 1, \quad then\;\lambda \;is\;in\;the\;interval\;(0,\infty ). \\ & If \;\mathop \sum \limits_{x \in X} \mu \left( {\{ x\} } \right) = 0, \quad then\;\lambda = 0;\;That \, is \, the \, unique \, root \, of \, the \, equation. \\ & If \;\mathop \sum \limits_{x \in X} \mu \left( {\{ x\} } \right) > 1, \quad then\;\lambda \;se\; is\;in\;the\;interval\;( - 1,0). \\ \end{aligned} $$

The fuzzy measure represents the importance or relevance of the information sources when computing the aggregation. The method to calculate Sugeno measures, carries out the calculation in a recursive way, using (3) and (4).

$$ \mu (A_{1} ) = \mu (M_{i} ) $$
(3)
$$ \mu (A_{i} ) = \mu (A_{(i - 1)} ) + \mu (M_{i} ) + \left( {\lambda \mu (M_{i} )*\mu (A_{(i - 1)} } \right) $$
(4)

where \( A_{i} \) represents the fuzzy measure and \( M_{i} \) represents the fuzzy density determined by an expert, where \( 1 < M_{i} \le \cdots \le {\text{n}} \), should be permuted with respect to the descending order of their respective μ(A i ).

In the literature there are 2 types of Integral, the integral of Sugeno and the Choquet Integral.

2.2 Choquet Integral

The Choquet integral can be calculated using Eq. (5) or an equivalent expression (6)

$$ \begin{aligned} Choquet & = \mathop \sum \limits_{i = 1}^{n} \left\{ {\left[ {A_{i} - A_{(i - 1)} } \right]*D_{i} } \right\} \\ {\text{With A}}_{0} & = 0 \\ \end{aligned} $$
(5)

Or also

$$ \begin{aligned} & Choquet = \mathop \sum \limits_{i = 1}^{n} A_{i} *\left\{ {\left[ {D_{i} - D_{(i + 1)} } \right]} \right\} \\ & With \, D_{(n + 1)} = 0 \\ \end{aligned} $$
(6)

In this case A i represents the fuzzy measurement associated with a data D i .

3 Edge Detection

Edge detection can be defined as a method consisting of identifying changes that exist in the light intensity, which can be used to determine certain properties or characteristics of the objects in the image. A number of edge detectors have been developed by various researchers. Amongst them, the most important operators are the Sobel Operator [19], Prewitt operator [15], Robert operator [15], Canny operator [1] and Kirsch operator [5]. The resultant images of the edge detectors preserve more details of the original images, which is a desirable feature for a pattern recognition system.

We used the Cropped Yale database to perform the training of the modular neural network, which has images of 38 people with 60 samples of each individual. To each of the images we applied a pre-processing by making use of Sobel edge detector with type-1 and type-2 fuzzy logic systems [7] in order to highlight features, some of the images can be displayed in Fig. 4a. Each image of the Cropped Yale database has a size of 168 × 192.

3.1 Sobel

The Sobel operator is applied to a digital image in gray scale, is a pair of 3 × 3 convolution masks, one estimating the gradient in the x-direction (columns) (7) and the other estimating the gradient in the y-direction (rows) (8) [12].

$$ sobel_{x} = \left[ {\begin{array}{*{20}c} { - 1} & 0 & 1 \\ { - 2} & 0 & 2 \\ { - 1} & 0 & 1 \\ \end{array} } \right] $$
(7)
$$ sobel_{y} = \left[ {\begin{array}{*{20}c} 1 & { 2} & { 1} \\ 0 & { 0 } & { 0} \\ { - 1} & { - 2} & { - 1} \\ \end{array} } \right] $$
(8)

If we have Im,n as a matrix of m rows and r columns, where the original image is stored, then gx and gy are matrices having the same dimensions as I, which at each element contains the horizontal and vertical derivative approximations and are calculated by (9) and (10) [12].

$$ g_{x} = \mathop \sum \limits_{i = 1}^{i = 3} \mathop \sum \limits_{j = 1}^{j = 4} Sobel_{x,ij} *I_{r + i - 2,c + j - 2} \quad\begin{array}{*{20}c}{for = 1,2, \ldots ,m} \\ {for = 1,2, \ldots ,n} \\ \end{array} $$
(9)
$$ g_{y} = \mathop \sum \limits_{i = 1}^{i = 3} \mathop \sum \limits_{j = 1}^{j = 4} Sobel_{y,i,j} *I_{r + i - 2,\;c + j - 2} \quad\begin{array}{*{20}c} {for = 1,2, \ldots ,m} \\ {for = 1,2, \ldots ,n} \\ \end{array} $$
(10)

In the Sobel method the gradient magnitude g is calculated by (11).

$$ g = \sqrt {g_{x}^{2} + g_{y}^{2} } $$
(11)

For the type-1 and type-2 fuzzy inference systems, 3 inputs can be used as can seen in Figs. 1 and 2, two of them are the gradients with respect to the x-axis and y-axis, calculated with (9) and (10), which we call DH and DV, respectively. The third variable M is the image after the application of a low-pass filter hMF in (12); this filter allows to detect image pixels belonging to regions of the input where the mean gray level is lower. These regions are proportionally affected more by noise, which is supposed to be uniformly distributed over the whole image [12].

$$ hMF = \frac{1}{25}*\left[ {\begin{array}{*{20}c} {\begin{array}{*{20}c} 1 & 1 & 1 \\ 1 & 1 & 1 \\ \end{array} } & {\begin{array}{*{20}c} 1 & 1 \\ 1 & 1 \\ \end{array} } \\ {\begin{array}{*{20}c} 1 & 1 & 1 \\ 1 & 1 & 1 \\ 1 & 1 & 1 \\ \end{array} } & {\begin{array}{*{20}c} 1 & 1 \\ 1 & 1 \\ 1 & 1 \\ \end{array} } \\ \end{array} } \right] $$
(12)
Fig. 1
figure 1

Variables for the edge detector with the type-1 fuzzy Sobel

Fig. 2
figure 2

Variables for the edge detector with the type-2 fuzzy Sobel

After applying the edge detector of Type-1 with Sobel, the resulting image can be viewed in Fig. 3b.

Fig. 3
figure 3

a Original image database Cropped Yale, b image with type-1 Sobel edge detector, c image with interval type-2 Sobel edge detector

The rules used for the tests in the Type-1 and Type-2 Sobel edge detectors are [12]:

  1. 1.

    If (dh is LOW) and (dv is LOW) then (y1 is HIGH).

  2. 2.

    If (dh is MIDDLE) and (dv is MIDDLE) then (y1 is LOW).

  3. 3.

    If (dh is HIGH) and (dv is HIGH) then (y1 is LOW).

  4. 4.

    If (dh is MIDDLE) and (hp is LOW) then (y1 is LOW).

  5. 5.

    If (dv is MIDDLE) and (hp is LOW) then (y1 is LOW).

  6. 6.

    If (m is LOW) and (dv is MIDDLE) then (y1 is HIGH).

  7. 7.

    If (m is LOW) and (dh is MIDDLE) then (y1 is HIGH).

The images generated by the Type-2 Sobel edge detector are shown in Fig. 3c.

4 Modular Neural Networks

To each image of the Cropped Yale database we applied a preprocessing with the Type-1 and Type-2 Sobel edge detector, after this, each image is divided into 3 sections horizontal. Each section was used as training data in each one of the three modules of the modular neural network, as shown in Fig. 4. The integration of the simulation of each module is made with the Choquet integral.

Fig. 4
figure 4

Architecture proposal of the modular neural network using type-1 Sobel edge detector

The Training Parameters are:

Training method: gradient descendent with momentum and adaptive learning rate back-propagation (Traingdx).

Each module has two hidden layers [200 200].

Error goal: 0.0001

Epochs: 500

In Table 1 the distribution of the training data is shown.

Table 1 Distribution of the training data

4.1 The Experiment with a Modular Neural Network Recognition System and the Choquet Integral for the Modules Fusion

The experiment consist on applying each evaluated edge detector to obtain a data set of same well know benchmark data bases of images, like Cropped Yale database of faces and then train a neural network to compare the recognition rate using the k-fold cross-validation method [13].

5 Simulation Results

In the experiments we performed 27 tests in simulation of the trainings with each edge detectors making variations in the fuzzy densities with the values of 0.1, 0.5 and 0.9 and performing the calculation of the parameter λ with the bisection method, and these parameters can be find in Table 2.

Table 2 Fuzzy densities and λ parameter

In Table 3, the results obtained using preprocessing of the image, with the type-1 Sobel edge detector, are shown. The first column represents the number of tests performed for each simulation, of the column M1 to M5 the average obtained in each module for each training is displayed and finally the last three columns present the mean of the five trainings, the standard deviation and the maximum of each simulation.

Table 3 Results of one simulation using type-1 Sobel edge detector, in test 1

In Table 4 we show the percentages of recognition of the Modular neural network using the Choquet integral as integration method with a preprocessing in the images with the Type 1 Sobel edge detector. Each result of the tests is the average of 27 simulations with different fuzzy densities, as seen in Table 3.

Table 4 Results of training with type-1 Sobel edge detector

In Table 5 we can find the results of the simulations using the Type-2 Sobel edge detector as preprocessing method.

Table 5 Results of training with type-2 Sobel edge detector

It can be noted that when using the Type-2 Sobel edge detector it was obtained 98.49 % of recognition whereas with type-2 Sobel edge detector a 98.68 % was obtained.

6 Conclusions

The use of the Choquet integral as an integration method of responses of a modular neural network applied to face recognition has yielded favorable results when performing the aggregation process of the preprocessed images with the detectors of Sobel edges. The resultant images of the edge detectors preserve more details of the original images, which is a desirable feature for a pattern recognition system. However it is still necessary to use a method that optimizes the values of the Sugeno measure assigned to each source of information because these were designated arbitrarily.