A neighbourhood feature-based local binary pattern for texture classification

Lan, Shaokun; Li, Jie; Hu, Shiqi; Fan, Hongcheng; Pan, Zhibin

doi:10.1007/s00371-023-03041-3

Original article
Published: 18 August 2023

Volume 40, pages 3385–3409, (2024)

The Visual Computer


Shaokun Lan¹,
Jie Li¹,
Shiqi Hu²,
Hongcheng Fan³ &
…
Zhibin Pan ORCID: orcid.org/0000-0002-4695-311X^1,4


Abstract

The CNN framework has gained widespread attention in texture feature analysis; however, handcrafted features remain advantageous when computational cost must take precedence and when textures are easily extracted with little intra-class variation. Among handcrafted features, the local binary pattern (LBP) is extensively applied to texture analysis owing to its robustness and low computational complexity. However, LBP uses only the sign component of the local difference vector, which limits its classification capability. To improve classification performance, most LBP variants employ multi-feature fusion. Nevertheless, this can lead to redundant, low-discriminative sub-features and high computational complexity. To address these issues, we propose the neighbourhood feature-based local binary pattern (NF-LBP). Inspired by the definition of the gradient, we extract the neighbourhood feature in a local region by simply using the first-order difference and the 2-norm. Next, we introduce the neighbourhood feature (NF) pattern to describe intensity changes in the neighbourhood. Finally, we combine the NF pattern with the local sign component and the centre pixel component to create the NF-LBP descriptor. This approach provides texture information that better complements the traditional local sign pattern and is less sensitive to noise. Additionally, we use an adaptive local threshold in the encoding scheme. Experimental results on classification accuracy and F1 score across five texture databases demonstrate that the proposed NF-LBP method attains outstanding texture classification performance, outperforming existing state-of-the-art approaches. Furthermore, extensive experiments reveal that NF-LBP is strongly robust to Gaussian noise and salt-and-pepper noise.


1 Introduction

Texture classification has become a major research topic in various fields, including biomedical image analysis [1], remote sensing [2], image retrieval [3], object recognition [4], and face recognition [5]. A primary challenge in texture classification is dealing with intra-class variations, which are usually caused by changes in rotation, illumination, viewpoint, and scale. Feature extraction methods based on local descriptors have garnered considerable interest owing to their stronger robustness to noise compared with global descriptors, which are easily affected by external conditions. However, most local descriptors struggle with high-dimensional and low-discriminative sub-features, which lead to redundant and inefficient feature information. To address these challenges, local descriptors such as the scale-invariant feature transform (SIFT) [6] and speeded-up robust features (SURF) [7] describe patches surrounding carefully selected keypoints within a texture image. Other descriptors, such as the Histogram of Oriented Gradients (HOG) [8], are used to extract global image features. Among all local descriptors, the local binary pattern (LBP) [9] has become one of the primary methods for texture feature extraction, owing to its theoretical simplicity and computational efficiency. LBP is a nonparametric texture feature descriptor that extracts features from different texture regions and combines them into an overall texture feature. In the LBP method, the centre pixel $g_{c}$ is compared with each neighbourhood pixel in the sampling template to obtain one binary digit (0 or 1) per neighbour; the resulting binary string is then converted into the corresponding decimal number, which represents the local feature. The conventional LBP method is limited in that it takes into account only the sign component of the local difference vector and fails to incorporate other important local structural texture information, leading to an incomplete representation of the texture features. As a result, its practical applications are significantly hindered.

Since Ojala’s original work [9], many LBP variants have been proposed to overcome the limitations of the original LBP. The local ternary pattern (LTP) [10] applies three quantization intervals to threshold the centre pixel, making it effective in dealing with minor noise. ResExLBP [11] utilizes image resizing and greyscale transformation to improve classification accuracy in detecting COVID-19. In recent years, it has been suggested that capturing and combining more discriminative and complementary local features is an effective way to achieve stronger robustness and a more comprehensive texture representation. The completed local binary pattern (CLBP) [12] achieves a comprehensive representation of local features by combining three highly complementary local features: sign, magnitude, and centre pixel. Inspired by the CLBP framework, many LBP variants have been proposed, including CLBC [13] and multi-scale CLBP [14]. Several approaches have been proposed to enhance noise robustness. One is SFB-OCPS [15], which replaces centre pixels on edges with an optimal pixel based on statistical features and thereby improves classification performance. Another is the ACPS strategy [16], which chooses an adaptable centre pixel to transform non-uniform patterns into uniform patterns while preserving crucial microstructural texture features. Additionally, the BRINT descriptor [17] replaces the sampled neighbourhood pixels with the average greyscales of the neighbourhood pixels, while the RCLBP framework [18] enhances the robustness of feature extraction in the presence of noise by integrating the non-local means filter, wavelet thresholding, and the completed local binary pattern framework. SALBP [19] selects an adaptive sampling scale for every neighbourhood pixel. To enhance the robustness of the centre pixel, the image segmentation-based central multi-scale local binary pattern (ISCM-LBP) [20] connects the centre pixel with its surrounding neighbourhood pixels.

To fully exploit texture features and enhance the classification capabilities of the LBP algorithm, several LBP variants have been presented. LTrPs [21] extract the relationship between the centre pixel and its neighbourhood pixels in the vertical and horizontal directions. LETRIST [22] constructs features from extremum responses of Gaussian derivative filters and quantizes them into texture codes. The Single Direction Gradient (LBP-SDG) [23] extracts discriminative movement features. To depict local feature information, the FbLBP algorithm [24] leverages both the sign information and the mean and variance of the magnitude of the difference vector $d_{P}$. The Local Binary Circumferential and Radial Derivative Pattern (CRDP) [25] fuses circumferential and radial derivative features of different orders. By leveraging a pixel-to-patch sampling structure to emulate the sampling pattern, the Local Neighbouring Intensity Relationship Pattern (LNIRP) [26] captures local features by investigating neighbourhood greyscale properties. The Sorted Local Gradient Pattern [27] utilizes two local gradient patterns to encode gradient information in the local neighbourhood and extracts local features by classifying pixels into two classes based on their intensities. The Local Neighbourhood Difference Pattern (LNDP) [28] captures the interdependencies among neighbourhood pixels $g_{p}$ at a particular scale R and demonstrates exceptional classification performance on natural scenes. The CMPE [29] combines the maxima and minima of the neighbourhood pixels with the conventional sign information, improving the classification capability. The CJLBP [30] joins neighbourhoods across different scales to acquire large-scale texture features. The OD-LBP [31] uses orthogonal values from the surrounding texture region to portray local texture characteristics. The neighbourhood and centre difference-based LBP (NCDB-LBP) [32] takes the differences between different neighbourhoods as the local features. The LBP-RGB feature [33] is designed specifically to extract image features in RGB mode. To enhance the quality of extracted features, an approach based on genetic programming (GP) [34] was introduced, which utilizes a three-layer tree-based binary program, learned through the GP optimization process, to integrate patch detection, feature fusion, and classification. In addition, the local grouped order pattern and non-local binary pattern (LGONBP) [35] combines two texture descriptors: LGOP, which groups nearby points based on a dominant direction and encodes their intra-group intensity order, and NLBP, which computes anchors using global image statistics and progressively encodes non-local intensity differences between neighbouring points and anchors. The CLGC descriptor [36] combines local and global information to extract colour-texture features, incorporating the wavelet transform and a modified version of the local ternary pattern for global feature fusion, and utilizing the speeded-up robust feature descriptor and a bag-of-words model for local features. A hybrid texture feature extraction method called Hess-ACS-LBP [37] combines the Hessian matrix and ACS-LBP to effectively reveal the macro- and microstructural changes in textures.

Besides the handcrafted LBP and its variants, the CNN architecture [38] showcased exceptional classification capabilities in 2012 and has attracted considerable interest ever since. It has found numerous applications in texture analysis, including medical image analysis [39], remote sensing [40], and face recognition [41]. In recent years, many algorithms for texture feature description and extraction have made significant progress, such as the $\mathrm{LPMF}^{2}\mathrm{Net}$ model [42], which explores complementary features at diverse hierarchical levels. Additionally, the dynamic reparameterization network (DRPN) [43] is effective in dealing with scale variations in the classification of infrared images. Other CNN-based models include the attention mechanism-based CNN of [44], which combines LBP features and attention mechanisms to improve classification results, and the approach of [45], which blends local LBP features with deep features to enhance the recognition of handwritten digits. However, deep features have inherent limitations, such as high computational complexity and challenging parameter tuning, and their use for texture representation remains a topic of debate. In [46], the authors conducted a comparative analysis of handcrafted features versus CNN-based features. Notably, the study revealed that handcrafted features outperformed CNN-based features in cases where textures were easily extracted with little intra-class variation. As a result, this paper focuses on improving traditional handcrafted descriptors.

Existing LBP variants nevertheless have several limitations. Most of them completely discard the local features in the neighbourhood, resulting in misclassification of different patterns. Furthermore, LBP-based algorithms extract different sub-features to represent local texture features. The sub-features employed in these LBP variants often lack rotation invariance and tend to have low discriminative capability while being high-dimensional, which makes the classification process complex and redundant. Additionally, there is still untapped potential in leveraging the complementarity between the extracted features of LBP variants. The extraction of local features requires a more comprehensive combination of sub-features to adequately describe the local region, as low complementarity leads to suboptimal classification capabilities.

To tackle these challenges, we introduce a neighbourhood feature-based local binary pattern (NF-LBP) descriptor for texture classification, which highlights the local neighbourhood features by using the proposed neighbourhood feature (NF) pattern and combines it with local sign component and centre pixel component. The main contributions of this paper are as follows:

1. We propose a novel method to represent the neighbourhood features by simply using the first-order difference and the 2-norm.
2. We apply a new and effective coding method to encode the neighbourhood feature into a local neighbourhood feature pattern with a local adaptive thresholding quantization method. The resulting local neighbourhood feature pattern is denoted as the NF pattern. Our experiments show that it has a strong feature classification capability and is strongly complementary to the sign information.
3. We propose a texture descriptor called the neighbourhood feature-based local binary pattern (NF-LBP), which integrates the local sign component and the centre pixel component with the neighbourhood feature pattern. This integration results in a highly comprehensive local texture representation.
4. We perform comprehensive experiments on five texture databases, which include texture databases with complex conditions (Outex, CUReT, XU_HR, and ALOT) and a real-world database (UIUC). Our comparison demonstrates that NF-LBP achieves outstanding classification performance. Furthermore, extensive experiments with noise demonstrate the excellent noise robustness of the proposed NF-LBP.

The remainder of this paper is organized as follows: Section 2 provides a brief review of LBP and related works. Section 3 presents a detailed description of the proposed NF-LBP. Section 4 presents experimental results and analyses. Section 5 concludes the paper.

2 A concise overview of relevant works

Ojala et al. [47] proposed the local binary pattern (LBP) and improved it in subsequent work [9]. The conventional LBP extracts the relations between the centre pixel $g_{c}$ and the corresponding neighbourhood pixels in a circular local neighbourhood template, which is defined by the chosen radius R and the P neighbourhood pixels $g_{p}$ $(p = 0, 1,\ldots , P-1)$, as shown in Fig. 1. The original coding method of LBP is described as follows:

$$\begin{aligned} \text{LBP}_{P,R}(g_{c}) &= \sum_{p=0}^{P-1} 2^{p}\, s(g_{p}-g_{c}), \\ s(x) &= \begin{cases} 1, & x \ge 0 \\ 0, & \text{otherwise} \end{cases} \end{aligned}$$

(1)

Bilinear interpolation is used to estimate neighbourhood pixels $g_{p}$ located at non-integer positions within the texture image. The LBP operator is applied to every centre pixel $g_{c}$ and thus extracts the texture features of the whole image.
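As an illustration of Eq. (1), the following minimal Python sketch computes the LBP code of a single interior pixel, using bilinear interpolation for off-grid samples. The function names, the list-of-lists image layout, and the angular convention (sample $p = 0$ to the right of the centre, increasing counterclockwise) are assumptions made for this example; the text above does not fix them.

```python
import math

def bilinear(img, y, x):
    """Bilinearly interpolate a 2-D list-of-lists image at real-valued (y, x)."""
    y0, x0 = int(math.floor(y)), int(math.floor(x))
    # Clamp the second sample so points on the last row/column stay in bounds
    y1, x1 = min(y0 + 1, len(img) - 1), min(x0 + 1, len(img[0]) - 1)
    dy, dx = y - y0, x - x0
    top = (1 - dx) * img[y0][x0] + dx * img[y0][x1]
    bottom = (1 - dx) * img[y1][x0] + dx * img[y1][x1]
    return (1 - dy) * top + dy * bottom

def sample_neighbours(img, yc, xc, P=8, R=1.0):
    """Return g_0 .. g_{P-1} sampled on a circle of radius R around (yc, xc)."""
    samples = []
    for p in range(P):
        angle = 2.0 * math.pi * p / P
        # Assumed convention: p = 0 lies to the right of the centre.
        # Rounding removes tiny trigonometric errors at on-grid positions.
        y = round(yc - R * math.sin(angle), 9)
        x = round(xc + R * math.cos(angle), 9)
        samples.append(bilinear(img, y, x))
    return samples

def lbp_code(img, yc, xc, P=8, R=1.0):
    """Eq. (1): weight the sign bit of each difference g_p - g_c by 2^p."""
    gc = img[yc][xc]
    g = sample_neighbours(img, yc, xc, P, R)
    return sum(1 << p for p in range(P) if g[p] - gc >= 0)

if __name__ == "__main__":
    img = [[9, 9, 9],
           [1, 5, 1],
           [1, 1, 1]]
    print(lbp_code(img, 1, 1))
```

The sketch assumes the pixel lies at least R pixels away from the image border; border handling is left out.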

Additionally, it was observed in [48] that some LBP patterns occur far more frequently than others. To classify LBP patterns, the uniform measure U was proposed to represent spatial transitions: it counts the number of bitwise “0/1” and “1/0” transitions in an LBP pattern. The uniform measure U of an LBP pattern $\text {LBP}_{P,\ R}(g_{c})$ is defined as follows:

$$\begin{aligned} U(\text{LBP}_{P,R}(g_{c})) &= \sum_{p=1}^{P-1} \left| s(g_{p}-g_{c}) - s(g_{p-1}-g_{c}) \right| \\ &\quad + \left| s(g_{P-1}-g_{c}) - s(g_{0}-g_{c}) \right| \end{aligned}$$

(2)

The value U quantifies the frequency of local pattern changes, where lower values represent low-frequency image signal and higher values indicate high-frequency image signal. Given that natural images are predominantly composed of low-frequency signal, LBP patterns which satisfy $U\le 2$ are considered uniform patterns. Meanwhile, non-uniform patterns refer to the remaining LBP patterns.
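To make Eq. (2) concrete, the short sketch below counts the circular 0/1 and 1/0 transitions of a sign-bit string. The helper names `sign_bits` and `uniform_measure` are illustrative choices, not taken from the paper.

```python
def sign_bits(neighbours, gc):
    """s(g_p - g_c) for every neighbour, following Eq. (1)."""
    return [1 if g - gc >= 0 else 0 for g in neighbours]

def uniform_measure(bits):
    """Eq. (2): number of bitwise transitions in the circular bit string.

    For p = 0 the index p - 1 is -1, which Python resolves to the last
    element, so the wrap-around term |s_{P-1} - s_0| is included.
    """
    return sum(abs(bits[p] - bits[p - 1]) for p in range(len(bits)))

if __name__ == "__main__":
    print(uniform_measure([0, 0, 0, 0, 1, 1, 1, 1]))  # uniform pattern
    print(uniform_measure([0, 1, 0, 1, 0, 1, 0, 1]))  # non-uniform pattern
```

Because the string is circular, every change from 0 to 1 must be matched by a change back from 1 to 0, so U is always even.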

To improve classification performance and achieve rotation invariance, the rotation-invariant uniform LBP operator, usually abbreviated as $\text {LBP}_{P,R}^{\text {riu}2}(g_{c})$, is defined on the basis of the uniform measure U as follows:

$$\begin{aligned} \text{LBP}_{P,R}^{\text{riu}2}(g_{c}) = \begin{cases} \sum_{p=0}^{P-1} s(g_{p}-g_{c}), & U(\text{LBP}_{P,R}(g_{c})) \le 2 \\ P+1, & \text{otherwise} \end{cases} \end{aligned}$$

(3)

The LBP referred to in the following context is the $\text {LBP}_{P,R}^{\text {riu}2}(g_{c})$, which is particularly useful for rotation-invariant texture classification.

Figure 2 shows an example of $\text {LBP}_{P,R}^{\text {riu}2}(g_{c})$.
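The mapping in Eq. (3) can be sketched in a few lines of Python. The function name `lbp_riu2` and the bit-list input are assumptions made for this example; the sketch also checks two properties that follow directly from the definition, namely that a cyclic shift of the bits leaves the code unchanged and that only $P+2$ distinct codes exist.

```python
from itertools import product

def lbp_riu2(bits):
    """Eq. (3): number of 1-bits for uniform patterns (U <= 2), else P + 1."""
    P = len(bits)
    # Circular transition count, as in Eq. (2)
    u = sum(abs(bits[p] - bits[p - 1]) for p in range(P))
    return sum(bits) if u <= 2 else P + 1

if __name__ == "__main__":
    pattern = [0, 0, 0, 1, 1, 1, 0, 0]
    shifted = pattern[3:] + pattern[:3]  # same pattern seen after a rotation
    print(lbp_riu2(pattern), lbp_riu2(shifted))
    # Enumerate all 2^8 patterns and collect the distinct riu2 codes
    codes = {lbp_riu2(list(b)) for b in product((0, 1), repeat=8)}
    print(sorted(codes))
```

For $P = 8$ the feature histogram therefore has 10 bins instead of the 256 bins of the original LBP code.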

3 The proposed method

3.1 Neighbourhood feature in local region

The conventional LBP method describes the texture feature of a local region by extracting the difference vector $d_{P}=\left[ g_{0}-g_{c}, g_{1}-g_{c}, \ldots , g_{P-1}-g_{c}\right] $ $(p=0, 1,\ldots , P-1)$. To depict the local features, the corresponding LBP pattern in the sampled image region uses the sign component $s_{P}$ of the difference vector $d_{P}$. However, the texture feature may not be comprehensively represented by the sign component $s_{P}$ alone. To complement the sign component $s_{P}$ and enhance the classification capability, CLBP [12] introduced the magnitude component CLBP_M and the centre pixel component CLBP_C. The magnitude component is generally denoted as $m_{P}$. As shown in Fig. 3, $s_{P}$ and $m_{P}$ can be directly obtained from $d_{P}$. Based on the framework of CLBP, BRINT [17] achieved excellent classification capability by using an arc-based averaging method. The BRINT descriptor can be decomposed into three components: the sign BRINT_S, the magnitude BRINT_M, and the centre pixel BRINT_C. BRINT uses the magnitude information of $d_{P}$ in the same way as CLBP. Although $m_{P}$ can supply complementary texture feature information, its use raises two problems. First, when $m_{P}$ is not used properly, it carries little information and has low discriminative power, which leads to poor classification performance. Second, $m_{P}$ is high-dimensional yet may provide redundant texture feature information. Therefore, how to enhance the classification performance of LBP algorithms and how to sufficiently extract distinctive texture features in the local region have become main research directions in the field of texture classification.

Most existing LBP variants are dedicated to extracting the relations between the neighbourhood pixels $g_{p}$ and the corresponding centre pixel $g_{c}$. Under these circumstances, the texture feature information of the neighbourhood itself is completely discarded, which can lead to inefficient classification. As shown in Fig. 4, in conventional LBP, the same LBP code can be obtained for local texture structures whose centre pixels $g_{c}$ are the same but whose sampled neighbourhood pixels $g_{p}$ differ.

In fact, the texture feature information of the neighbourhood has strong texture classification capability. To make better use of the local features, the texture features of the sampled neighbourhood pixels can be exploited to contribute auxiliary and useful neighbourhood feature information. Based on this consideration, the neighbourhood feature-based local binary pattern (NF-LBP) is proposed. The feature contained in the sampled neighbourhood pixels reflects the local intensity and provides a crucial complementary texture feature that is informative and discriminative.

Gradient information has been considered as a significant texture feature of an image since it can indicate the contrast of the local image texture. For the high-contrast areas or edges of an image, the gradient is large; conversely, the gradient is small in the smooth texture regions of an image. For the purpose of discriminative and additional feature extraction, unlike most variants of LBP, we forgo the use of the magnitude part $m_{P}$ extracted from $d_{P}$. We propose to extract the texture feature contained in sampled neighbourhood pixels in the local image region. Gradient is selected as the feature of the sampled neighbourhood in the local pattern.

Strictly speaking, gradient calculation requires a derivative, but to simplify the calculation process and reduce computational complexity, we approximate the derivative by directly calculating the first-order difference of the neighbourhood pixels along the arc of the circular neighbourhood. As shown in Fig. 5, we define the two gradients of a neighbourhood pixel in terms of the first-order differences in the clockwise and counterclockwise directions, denoted, respectively, as $G_{+,\ g_{p}}$ and $G_{-,\ g_{p}}$ $(p=0, 1,\ldots , P-1)$. The formulas are as follows:

$$\begin{aligned} G_{+,\ g_{p}} = \begin{cases} g_{P-1}-g_{0}, & p = 0 \\ g_{p-1}-g_{p}, & \text{otherwise} \end{cases} \end{aligned}$$

(4)

$$\begin{aligned} G_{-,\ g_{p}} = \begin{cases} g_{0}-g_{P-1}, & p = P-1 \\ g_{p+1}-g_{p}, & \text{otherwise} \end{cases} \end{aligned}$$

(5)

where P is the number of neighbourhood pixels in the sampling template.

The defined gradient of neighbourhood pixel $G_{g_{p}}$ $(p=0, 1,\ldots , P-1)$ merges its respective gradients $G_{+,\ g_{p}}$ and $G_{-,\ g_{p}}$ in the form of 2-norm, which is defined as follows:

$$\begin{aligned} G_{g_{p}}=\sqrt{G_{+,\ g_{p}}^{2}+G_{-,\ g_{p}}^{2}} \end{aligned}$$

(6)

After the calculations above, we obtain a new feature vector in the local region of a texture image. Taking Fig. 5 as an example, we obtain a feature vector of the neighbourhood pixel gradients $G_{g_{p}}$ in the local sampling pattern, abbreviated as $G_{P}$. The proposed gradient vector $G_{P}$ succinctly describes neighbourhood features and has several advantages. First, the gradient information straightforwardly reflects the local contrast of a texture image. Second, the differences between adjacent neighbourhood pixels remain the same regardless of how the image is rotated. Thus, the gradient vector $G_{P}$ is rotation invariant. Third, the centre pixel $g_{c}$ may be easily corrupted by noise. As shown in Fig. 6, in this case, the difference vector $d_{P}=\left[ g_{0}-g_{c}, g_{1}-g_{c}, \ldots , g_{P-1}-g_{c}\right] $ is completely changed and loses its discriminative capability. In contrast, the gradient vector $G_{P}$ does not depend on $g_{c}$, so it is not altered by this noise and still contains discriminative feature information of the local texture. Hence, the gradient vector $G_{P}$ is more robust to noise than $d_{P}$.
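Equations (4) to (6) can be sketched as follows, assuming the $P$ neighbourhood samples have already been obtained as a Python list `g` (for instance by bilinear interpolation). The function names are illustrative. The usage example also illustrates the noise argument above: corrupting the centre pixel changes the sign bits of $d_{P}$ but cannot change $G_{P}$, which is computed from the neighbours alone.

```python
import math

def neighbourhood_gradients(g):
    """Return G_P = [G_{g_0}, ..., G_{g_{P-1}}] following Eqs. (4)-(6)."""
    P = len(g)
    G = []
    for p in range(P):
        g_plus = g[p - 1] - g[p]         # Eq. (4); g[-1] is g_{P-1} when p == 0
        g_minus = g[(p + 1) % P] - g[p]  # Eq. (5); wraps to g_0 when p == P-1
        G.append(math.hypot(g_plus, g_minus))  # Eq. (6): 2-norm of the pair
    return G

def sign_bits(g, gc):
    """Sign component of the difference vector d_P."""
    return [1 if v - gc >= 0 else 0 for v in g]

if __name__ == "__main__":
    g = [10, 10, 10, 50, 50, 50, 10, 10]
    print(neighbourhood_gradients(g))
    print(sign_bits(g, 30))   # clean centre pixel
    print(sign_bits(g, 255))  # centre pixel hit by impulse noise
```

In this sketch, a rotation of the sampling circle by a multiple of $2\pi/P$ appears as a cyclic shift of the entries of $G_{P}$: the values are preserved, but their positions move, so any code built on top of $G_{P}$ would need to be shift-invariant (as the riu2 mapping is) for the final feature to be rotation invariant.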

3.2 Neighbourhood feature (NF) pattern

The gradient vector $G_{P}$ consists of the values $G_{g_{p}}$, which are continuous. Like the difference vector $d_{P}$, $G_{P}$ cannot be used directly for texture classification but needs to be converted to binary strings. Therefore, we propose a neighbourhood feature (NF) pattern by applying a novel coding method to the gradient vector $G_{P}$.

Firstly, as shown in Fig. 7, we segment the whole texture image into $N \times N$ discrete sub-images. Then, the local adaptive threshold of each sub-image is used for binary quantization of the gradient vector $G_{P}$, because the same feature is strongly correlated within a local region of a texture image. We define the local adaptive threshold of $G_{P}$ as the mean value of $G_{g_{p}}$ over each sub-image. The local adaptive threshold is more reflective of texture changes in local regions than the global threshold used in LBP and most LBP variants. To achieve a trade-off between computational complexity and discrimination capability, we set N = 4 in our experiments, thus acquiring 4 $\times $ 4 = 16 sub-images. The gradient vector $G_{P}$ in the local region can therefore be encoded into a local NF binary pattern that participates in the subsequent texture classification task.
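The thresholding step can be sketched as below. This is a sketch under stated assumptions rather than the authors' implementation: the input `grad[y][x]` is assumed to hold the list of $P$ gradient values $G_{g_{p}}$ of the pixel at $(y, x)$, sub-images are formed by integer division of the coordinates, and a value is encoded as 1 when it is greater than or equal to the threshold (the exact comparison is not stated in this passage).

```python
def local_thresholds(grad, N=4):
    """Mean of all G_{g_p} values inside each of the N x N sub-images."""
    H, W = len(grad), len(grad[0])  # assumes H >= N and W >= N
    sums = [[0.0] * N for _ in range(N)]
    counts = [[0] * N for _ in range(N)]
    for y in range(H):
        for x in range(W):
            by, bx = y * N // H, x * N // W  # sub-image containing (y, x)
            sums[by][bx] += sum(grad[y][x])
            counts[by][bx] += len(grad[y][x])
    return [[sums[b][a] / counts[b][a] for a in range(N)] for b in range(N)]

def binarize(grad, N=4):
    """Quantize every gradient value against its sub-image threshold."""
    H, W = len(grad), len(grad[0])
    T = local_thresholds(grad, N)
    return [[[1 if v >= T[y * N // H][x * N // W] else 0 for v in grad[y][x]]
             for x in range(W)] for y in range(H)]

if __name__ == "__main__":
    # Toy 4 x 4 map with P = 2: low-contrast left half, high-contrast right half
    grad = [[[1.0, 3.0] if x < 2 else [100.0, 300.0] for x in range(4)]
            for y in range(4)]
    print(local_thresholds(grad, N=2))
    print(binarize(grad, N=2)[0])
```

In the toy example, a single global mean (101.0) would map both values of every left-half pixel to 0, whereas the per-sub-image thresholds keep the distinction between the smaller and larger gradient in both halves.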

Secondly, as proposed in Ojala’s work [9], some local NF patterns occur frequently while others occur rarely, and their contributions to texture classification are not equally important. Accordingly, we need to encode them differently. The uniform metric parameter, denoted as $\tilde{U}$ in the following, bounds the number of bitwise “0/1” and “1/0” transitions in the NF binary string and is taken as the threshold to distinguish uniform NF patterns, which occur with high frequency, from non-uniform NF patterns, which occur with low frequency. When $U(\text {NF}) \le \tilde{U}$, the NF binary pattern is defined as a uniform NF pattern; otherwise, it is a non-uniform NF pattern.

Subsequently, to fix a reasonable threshold $\tilde{U}$, we select different values of $\tilde{U}$ and conduct a series of experiments on the UIUC texture database, which contains 25 texture classes, each consisting of 40 texture images. The texture image size in the UIUC database is 640 $\times $ 480. Table 1 shows the average percentage of non-uniform NF patterns on the UIUC database for different choices of $\tilde{U}$. Observations reveal that if $\tilde{U}$ is chosen to be less than 4, the non-uniform NF patterns occur with higher frequency than the uniform NF patterns, which is inconsistent with the previous definition. If $\tilde{U} = 6$, at sampling radius R = 1 or R = 2, the non-uniform NF patterns hardly appear, which is not a reasonable situation. When $\tilde{U}$ is fixed at 4, the occurrence frequency of non-uniform NF patterns is consistent with Ojala’s research. We therefore set $\tilde{U}$ to 4 in the following encoding process of the NF pattern.
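The statistic reported in Table 1 can be computed with a sketch like the following, given any collection of NF binary strings. The function names are illustrative. Since the UIUC patterns are not available here, the usage example merely enumerates all $2^{8}$ bit strings with equal weight; the resulting percentages are a combinatorial baseline and are not the values of Table 1.

```python
from itertools import product

def transitions(bits):
    """Circular count of bitwise 0/1 and 1/0 transitions, U(NF)."""
    return sum(abs(bits[p] - bits[p - 1]) for p in range(len(bits)))

def non_uniform_percentage(patterns, u_threshold):
    """Percentage of patterns whose transition count exceeds the threshold."""
    patterns = list(patterns)
    non_uniform = sum(1 for b in patterns if transitions(b) > u_threshold)
    return 100.0 * non_uniform / len(patterns)

if __name__ == "__main__":
    # Equal-weight enumeration of all P = 8 patterns (not real image data)
    all_patterns = list(product((0, 1), repeat=8))
    for u in (2, 4, 6):
        print(u, round(non_uniform_percentage(all_patterns, u), 2))
```

Because the transition count of a circular binary string is always even, the only meaningful candidate thresholds for $P = 8$ are 2, 4, and 6.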

Table 1 Average percentage of non-uniform patterns (%) on UIUC database using different $\tilde{U}$
