Keywords

1 Introduction

In computer vision and pattern recognition, image local descriptors play an important role for many recognition tasks. A lot of local descriptors have been proposed in the past decade such as Scale-Invariant Feature Transform (SIFT), Local Binary Pattern (LBP), Histogram of Oriented Gradient (HOG), and Weber Local Descriptor (WLD), etc. Among them, WLD is a new one proposed by Chen et al. [1] in 2010. In WLD, differential excitation and orientation of the input image are firstly calculated, and then the histogram is constructed according to these two components. Generally, WLD descriptor has the following advantages: (1) It utilizes Weber function to detect local salient pattern which make it more robust to noises and illumination changes. (2) It constructs histogram according to salient pattern’s gradient orientation, combining two features together make it more discriminatory. After WLD descriptor was proposed, a lot of its variants have been proposed and have achieved very promising performance for different tasks. In this paper, we make a review for WLD and its variants. We firstly present the basic algorithm of WLD. Secondly, we divide the algorithms of WLD’s variants into three categories such as differential excitation-based, orientation-based and multiple features based. Finally, we summarize their applications for biometrics.

2 Basic Algorithm of Weber Local Descriptor

2.1 Weber’s Law and Differential Excitation Descriptor

In the fields of psychology, Weber’s law holds that the ratio of stimulation increment \( \Delta I \) to its original intensity \( I \) is a constant. It means we have reason to think that the region is background or area containing noises if the region changes after stimulation. Inspired by Weber’s Law, Chen et al. [1] proposed differential excitation descriptor to calculate such a change:

$$ \upxi\left( {x_{c} } \right) = arctan\left( {\frac{{v_{s}^{00} }}{{v_{s}^{01} }}} \right) = arctan\left[ {\mathop \sum \limits_{i = 0}^{p - 1} \left( {\frac{{x_{i} - x_{c} }}{{x_{c} }}} \right)} \right] $$
(1)

where \( p \) is the number of neighboring pixels, \( x_{c} \) is the central pixel and \( x_{i} \) indicates the ith neighboring pixel, \( v_{s}^{00} \) and \( v_{s}^{01} \) are calculated through two differential excitation masks \( f_{00} \) and \( f_{01} \) as shown in Fig. 1.

Fig. 1.
figure 1

The construction process of Weber local descriptor.

2.2 Orientation and Histogram

The orientation component of WLD is gradient orientation of current pixel:

$$ \uptheta\left( {x_{c} } \right) = arctan\left( {\frac{{x_{5} - x_{1} }}{{x_{7} - x_{3} }}} \right) $$
(2)

where \( x_{5} - x_{1} \) and \( x_{7} - x_{3} \) are the intensity differences in vertical and horizontal direction. After calculating these two components, a 2-dimensional histogram \( \left\{ {WLD\left( {\xi_{j} ,\phi_{t} } \right)} \right\} \) is constructed for feature representation. For simplicity, this 2D histogram is further encoded into 1D histogram according to each column. Such a WLD feature has been successfully used in texture classification and face recognition [1].

3 The Variants of WLD Descriptor

A lot of variants of WLD have been proposed, which can be classified into three types i.e., differential excitation-based, orientation-based and multiple features-based. Table 1 lists 13 variants of WLD, and these variants will be introduced in our paper one by one.

Table 1. The variants of WLD

3.1 Differential Excitation-Based Variants of WLD

The differential excitation component of WLD aims to extract local salient patterns, so it is necessary for this component to be expressive and robust. According to this principle, several variants of WLD have been proposed.

(1) Modified WLD (MWLD)

Jiao and Baojun [2] proposed MWLD for space image registration. The modules of differential excitation and orientation are modified to fit the circle stars’ smooth edges and to extract more sharp edge information. Those modules are showed in Fig. 2. Experiments show it is expressive for space images.

Fig. 2.
figure 2

Modules of differential excitation and orientation in MWLD

(2) Simplified WLD (SWLD)

Zuo et al. [3] proposed a methodology for face images called SWLD, which is applied on the magnitude of Gabor filtered image and only computes differential excitation around current pixel’s 12 neighbors:

$$ SWLD_{l,r,12} \left( x \right) = \vartheta_{l} \left\{ {tan^{ - 1} \left[ {\sum\nolimits_{i = 0}^{11} {\frac{{x_{i} - x}}{x}} } \right]} \right\} $$
(3)

where \( x_{i} \) are 12 neighbors of \( x \) at radius \( {\text{r}} \) and \( \vartheta_{l} \) is a uniform quantizer with \( {\text{l}} \) quantization levels. After calculating SWLD and LBP on magnitude of filtered image and GLBP on phase of filtered image, concatenated histogram is built and successfully used for face recognition.

(3) Gabor WLD (GWLD)

Motivated by WLD and the advantages of Gabor, Sun et al. [4] proposed GWLD for bovine iris recognition. The original images are preprocessed with a set of Gabor filters to obtain 40 magnitude maps. Then a modified WLD [16] is applied over each map. It calculates differential excitation with the largest intensity difference in the neighborhood:

$$ \upxi\left( {x_{c} } \right) = arctan\left[ {\frac{{f_{00} }}{{x_{c} }}} \right], f_{00} = p *\left( {x_{j} - x_{c} } \right) $$
(4)

where \( x_{j} \) is the corresponding neighbor pixel of Chebyshev distance:

$$ x_{j} = argmax_{{x_{i} }} \left| {x_{i} - x_{c} } \right|, i = 0,1, \ldots ,p - 1 $$
(5)

Orientation is the same as original WLD’s gradient orientation. Finally, GWLD histogram is obtained and experiments show it has encouraging performance, but with high computation cost.

(4) Improved WLD (IWLD)

Since WLD is not good at handling information of flat area, Zhang et al. [5] proposed IWLD, which computes the orientation of eight intensity differences separately:

$$ \xi_{mx} \left( {x_{c} } \right) = arctan\left( {\alpha \left\langle{X,J_{x}}\right\rangle} \right), \xi_{my} \left( {x_{c} } \right) = arctan\left( {\alpha \left\langle{X,J_{y}}\right\rangle } \right) $$
(6)

where \( {\text{X}} = \left( {\frac{{x_{0} - x_{c} }}{{x_{c} }},\frac{{x_{1} - x_{c} }}{{x_{c} }}, \ldots ,\frac{{x_{p - 1} - x_{c} }}{{x_{c} }}} \right)^{T} \) and \( \left\langle , \right\rangle \) is the inner product operator. \( J_{x} = \left( {cos\theta_{0} ,cos\theta_{1} , \ldots ,cos\theta_{p - 1} } \right)^{T} \) and \( J_{y} = \left( {sin\theta_{0} ,sin\theta_{1} , \ldots ,sin\theta_{p - 1} } \right)^{T} \) denote the angle between \( {\text{X}} \) direction and \( x_{i} - x_{c} \).

Then, IWLD magnitude and orientation are defined according to those two values:

$$ \xi_{m} \left( {x_{c} } \right) = \sqrt {\left( {\xi_{mx} \left( {x_{c} } \right)} \right)^{2} + \left( {\xi_{my} \left( {x_{c} } \right)} \right)^{2} } , \xi_{o} \left( {x_{c} } \right) = arctan\frac{{\xi_{my} \left( {x_{c} } \right)}}{{\xi_{mx} \left( {x_{c} } \right)}} $$
(7)

Angle information of each neighboring pixel’s intensity difference makes contributions to image representation. Experiments on face databases demonstrate the proposed approach’s excellent performance.

3.2 Orientation-Based Variants of WLD

Since direction information makes great contributions for some recognition tasks, several orientation-based variants of WLD have also been proposed.

(5) Sobel WLD

Original WLD’s gradient orientation considers only vertical and horizontal neighboring pixels, which is susceptible to noises. It has been replaced with Sobel operator [6, 17] and Isotropic Sobel operator [18], which calculates six neighboring pixels instead of four neighboring pixels. Experiments show it can provide more stable feature histograms and enhance the classification rates for face recognition.

(6) Modified WLD (MWLD)

Lian et al. [7] proposed MWLD as the pedestrian detector. The novelty is that the histogram is constructed in a new way by voting the gradient magnitude into each bin:

$$ MWLD_{2D} \left( {r,t} \right) = \mathop \sum \limits_{i = 0}^{M - 1} \mathop \sum \limits_{j = 0}^{N - 1} \omega \left( {x_{i,j} } \right)\varphi \left( {\xi \left( {x_{i,j} } \right),r} \right)\varphi \left( {\theta \left( {x_{i,j} } \right),t} \right) $$
(8)

and

$$ \upomega\left( {x_{i,j} } \right) = \sqrt {\left( {x_{5} - x_{1} } \right)^{2} + \left( {x_{7} - x_{3} } \right)^{2} } , \varphi \left( {x,y} \right) = \left\{ {\begin{array}{*{20}l} {1,} \hfill & {x = y} \hfill \\ {0,} \hfill & {otherwise} \hfill \\ \end{array} } \right. $$
(9)

where \( {\text{M*N}} \) represents the image size, \( r \) and \( t \) are the numbers of the differential excitation bins and dominant orientation. Experiments on fast pedestrian detection show MWLD performs better than other descriptors with higher speed.

(7) Computationally-Efficient Histogram-Based Image Local Descriptor (CHILD)

In order to solve the problem of sensitiveness to noise and image rotation, Anamandra and Chandrasekaran [8] proposed a descriptor called CHILD, which uses Laplacian of Gaussian for differential excitation, and the orientation is obtained by convolving with Tiansi fractional derivative filter. CHILD histogram is constructed at last, which seems more robust for texture classification. It has also been used on color texture images in [19].

(8) Weber Orientation Descriptor (WOD)

Weibin et al. [9] designed WOD for vanishing point detection. The differential excitation considers positive value of average differences:

$$ \xi_{WOD} \left( {p_{c} } \right) = \left\{ {\begin{array}{*{20}l} {\sqrt {G\left( {p_{c} } \right)} ,} \hfill & {if\;G\left( {p_{c} } \right) \ge 0} \hfill \\ {0,} \hfill & {if\;G\left( {p_{c} } \right) < 0} \hfill \\ \end{array} ,\,\,} \right.G\left( {p_{c} } \right) = arctan\left( {\frac{{p_{c} - \bar{p}_{i} }}{{p_{\text{c}} }}} \right) $$
(10)

The orientation is computed as the index of maximum Gabor energy:

$$ \theta_{WOD} \left( {p_{c} } \right) = Argmax_{{\varphi_{n} }} E_{{\varphi_{n} }} \left( {p_{c} } \right), E_{{\varphi_{n} }} \left( {p_{c} } \right) = Re\left( {\hat{I}_{{\varphi_{n} }} \left( {p_{c} } \right)} \right)^{2} + Im\left( {\hat{I}_{{\varphi_{n} }} \left( {p_{c} } \right)} \right)^{2} $$
(11)

where \( \hat{I}_{{\varphi_{n} }} \left( {p_{c} } \right) \) is the convolution of image and Gabor filter \( g_{{\varphi_{n} }} \). Experiments show WOD can achieve favorable performance results.

(9) Improved WLD (IWLD)

After WLD, Chen et al. [10] proposed a new WLD for person re-identification. The orientation component is improved by coding LBP [20] after quantifying gradient orientation into several intervals:

$$ LBP_{m,n} \left( {x_{c} } \right) = \left\lfloor{B_{m,n}^{p} ,B_{m,n}^{p - 1} , \ldots ,B_{m,n}^{1}}\right\rfloor, B_{m,n}^{i} = {\text{q}}\left( {\phi_{m,n} \left( {x_{c} } \right)} \right) \otimes q\left( {\phi_{m,n} \left( {x_{i} } \right)} \right) $$
(12)

where \( \phi_{m,n} \left( {x_{c} } \right) \) is the value of orientation and \( {\text{q}}\left( \cdot \right) \) is the floor function. Then the histogram is constructed according to the bins of differential excitation and orientation. Experimental results proved that the method is robust for image rotation and illumination, it performs well for person re-identification.

3.3 Multiple Features-Combined Variants of WLD

In order to extract richer information for recognition, researchers proposed several variants that combining other descriptors with WLD. In this section, we will briefly introduce these algorithms.

(10) LBP/WLD

Since LBP can capture the orientation of local textures, Yuqing and Yan [11] combined LBP and WLD into dynamic texture representation to achieve better performance. WLD is computed with the sigmoid function of differential excitation, then the block match information is measured according to their LBP and WLD patterns:

$$ \left( {LBP/WLD} \right)_{m} = \left\{ {i|min\left( {Diff\left( {p_{0} - p_{i} } \right)} \right)} \right\} $$
(13)

where \( {\text{Diff}}\left( \cdot \right) \) and WLD are defined as:

$$ {\text{Diff}}\left( {p_{0} - p_{i} } \right) = \left| {LBP\left( {p_{0} } \right) - LBP\left( {p_{i} } \right)} \right| + \left| {WLD\left( {p_{0} } \right) - WLD\left( {p_{i} } \right)} \right| $$
(14)
$$ {\text{WLD}}\left( {I_{c} } \right) = sigmoid\left( {\sum\nolimits_{i = 0}^{p - 1} {\left| {\frac{{I_{i} - I_{c} }}{{I_{c} }}} \right|} } \right) $$
(15)

At last, LBP/WLD histogram is concatenated with dynamic texture histogram. Experiments show it performances better than LBMP for dynamic texture recognition.

(11) WLD Variance (WLDV)

In order to improve texture classification accuracy, Dawood et al. [12] proposed a hybrid approach that combines WLD with contrast information, which is actually image variance calculated with the Probability Weighted Moments:

$$ S_{p} = \frac{\sqrt \pi }{n}\sum\nolimits_{i = 0}^{n} {\left[ {X_{i} - 2\left( {1 - \frac{i - 0.5}{n}} \right)X_{i} } \right]} $$
(16)

where \( X_{i} \) indicates ordered observations in a sample of size n. Finally, the joint histogram is represented for texture classification.

(12) Patterns of Weber Magnitude and Orientation (PWMO)

Wang et al. [13, 14] proposed a computationally efficient, discriminative and robust feature descriptor called PWMO for face recognition system. PWMO consist of two parts: patterns of Weber magnitude, which is encoded with patch-based LBP, and patterns of Weber orientation, which is encoded with patch-based local XOR:

$$ {\text{p}\_\text{LBP}}\left( {{\text{x}}_{\text{c}} , {\text{y}}_{\text{c}} } \right) = \left[ {\sum\nolimits_{\text{n = 0}}^{7} {2^{\text{n}} {\text{D}}\left( {{\text{S}}_{ 1} \left( {{\text{P}}_{{{\text{c}}_{\text{n}}^{\text{R}} }}^{\text{r}} } \right) , {\text{S}}_{ 1} \left( {{\text{P}}_{\text{c}}^{\text{r}} } \right)} \right) ,\ldots ,\sum\nolimits_{\text{n = 0}}^{ 7} { 2^{\text{n}} {\text{D}}\left( {{\text{S}}_{\text{q}} \left( {{\text{P}}_{{{\text{c}}_{\text{n}}^{\text{R}} }}^{\text{r}} } \right) , {\text{S}}_{\text{q}} \left( {{\text{P}}_{\text{c}}^{\text{r}} } \right)} \right)} } } \right] $$
(17)

where \( P_{c}^{r} \) is the image patch centered at current pixel with radius r, and \( c_{n}^{R} \) are the neighboring pixels with radius R. \( {\text{S}}\left( {P_{c}^{r} } \right) \) is a pre-defined statistic such as the bins of magnitude histogram and the orientation with largest accumulative magnitude.

Finally, the concatenated histogram is constructed and block-based whitened PCA is introduced to select the most discriminative features. Experimental results demonstrate the proposed method is suitable for face recognition.

(13) Weber Local Binary Pattern (WLBP)

Liu et al. [15] proposed a local descriptor called WLBP, which uses Laplacian of Gaussian for differential excitation and uniform pattern of LBP [21] for orientation component. At last, a more discriminative and robust histogram is constructed. Experiments show it is superior to other state-of-the-art methods not only for face recognition, but also for texture classification.

4 Applications for Biometrics

After WLD has been proposed, it has been widely used on many applications. As for biometrics, WLD and its variants have been successfully applied on face detection [1], face recognition [14, 15], fingerprint liveness detection [22], iris recognition [4] and palmprint recognition [23]. WLD feature is robust to variations in facial expression and illumination, which make it performance well for face images. Since WLD is suitable for high-contrast patterns such as the ridges and valleys of fingerprint and palmprint, it achieves greater performance while using for the liveness detection on those two tasks. Besides, WLD also has been extended to bovine iris recognition, which is robust to noise and illumination in the original images, and it indeed has enhanced the representation power.

5 Conclusions

WLD has become one of important local image descriptors. In this paper, it is the first time to make a review of WLD and its variants. We also survey their applications for biometrics. After an analysis, we divide its variants into three categories, i.e., differential excitation-based, orientation-based, and multiple features-based. For differential excitation-based methods, different differential excitation information is added to make the descriptor more robust to noise. For orientation-based methods, direction extraction algorithms are modified to obtain more effective feature representation. For multiple features based methods, WLD is combined with other descriptors to form a new descriptor and can capture richer information. In our future work, we plan to make a performance evaluation of WLD and its variants for different biometrics tasks such as face, iris, palmprint, and vein recognition.